These exercises cover the Geoms and Aesthetics of ggplot2 for Plotting
in R.
Exercise 1
These first few exercises will run through some of the simple
principles of creating a ggplot2 object, assigning aesthetics mappings
and geoms.
- Read in the cleaned patients dataset as we saw in ggplot2 course
earlier (“patients_clean_ggplot2.txt”)
## [1] "/__w/Plotting_In_R/Plotting_In_R/extdata"
Scatterplots
- Using the patient dataset generate a scatter plot of BMI versus
Weight.
- Extending the plot from exercise 2, add a colour scale to the
scatterplot based on the Height variable.
- Following from exercise 3, split the BMI vs Weight plot into a grid
of plots separated by Smoking status and Sex .
- Using an additional geom, add an extra layer of a fit line to the
solution from exercise 3.
## `geom_smooth()` using method = 'loess' and formula = 'y ~ x'
## Warning: The following aesthetics were dropped during statistical transformation:
## colour.
## i This can happen when ggplot fails to infer the correct grouping structure in
## the data.
## i Did you forget to specify a `group` aesthetic or to convert a numerical
## variable into a factor?
- Does the fit in question 5 look good? Look at the description for
?geom_smooth() and adjust the method for a better fit.
## `geom_smooth()` using formula = 'y ~ x'
## Warning: The following aesthetics were dropped during statistical transformation:
## colour.
## i This can happen when ggplot fails to infer the correct grouping structure in
## the data.
## i Did you forget to specify a `group` aesthetic or to convert a numerical
## variable into a factor?
###Boxplots and Violin plots
- Generate a boxplot of BMIs comparing smokers and non-smokers.
- Following from the boxplot comparing smokers and non-smokers in
exercise 7, colour boxplot edges by Sex.
- Now reproduce the boxplots in exercise 8 (grouped by smoker,
coloured by sex) but now include a separate facet for people of
different age (using Age column).
- Produce a similar boxplot of BMIs but this time group data by Sex,
colour by Age and facet by Smoking status.
HINT - Discrete values such as in factors are used for categorical
data.
- Regenerate the solution to exercise 10 but this time using a violin
plot.
###Histogram and Density plots
- Generate a histogram of BMIs with each bar coloured blue.
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
- Generate density plots of BMIs coloured by Sex.
HINT: alpha can be used to control transparancy.
- Generate a separate density plot of BMI coloured by sex for each
Grade,