These exercises cover the Geoms and Aesthetics of ggplot2 for Plotting in R.

Exercise 1

These first few exercises will run through some of the simple principles of creating a ggplot2 object, assigning aesthetics mappings and geoms.

  1. Read in the cleaned patients dataset as we saw in ggplot2 course earlier (“patients_clean_ggplot2.txt”)
## [1] "/__w/Plotting_In_R/Plotting_In_R/extdata"

Scatterplots

  1. Using the patient dataset generate a scatter plot of BMI versus Weight.

  1. Extending the plot from exercise 2, add a colour scale to the scatterplot based on the Height variable.

  1. Following from exercise 3, split the BMI vs Weight plot into a grid of plots separated by Smoking status and Sex .

  1. Using an additional geom, add an extra layer of a fit line to the solution from exercise 3.
## `geom_smooth()` using method = 'loess' and formula = 'y ~ x'
## Warning: The following aesthetics were dropped during statistical transformation:
## colour.
## i This can happen when ggplot fails to infer the correct grouping structure in
##   the data.
## i Did you forget to specify a `group` aesthetic or to convert a numerical
##   variable into a factor?

  1. Does the fit in question 5 look good? Look at the description for ?geom_smooth() and adjust the method for a better fit.
## `geom_smooth()` using formula = 'y ~ x'
## Warning: The following aesthetics were dropped during statistical transformation:
## colour.
## i This can happen when ggplot fails to infer the correct grouping structure in
##   the data.
## i Did you forget to specify a `group` aesthetic or to convert a numerical
##   variable into a factor?

###Boxplots and Violin plots

  1. Generate a boxplot of BMIs comparing smokers and non-smokers.

  1. Following from the boxplot comparing smokers and non-smokers in exercise 7, colour boxplot edges by Sex.

  1. Now reproduce the boxplots in exercise 8 (grouped by smoker, coloured by sex) but now include a separate facet for people of different age (using Age column).

  1. Produce a similar boxplot of BMIs but this time group data by Sex, colour by Age and facet by Smoking status.

HINT - Discrete values such as in factors are used for categorical data.

  1. Regenerate the solution to exercise 10 but this time using a violin plot.

###Histogram and Density plots

  1. Generate a histogram of BMIs with each bar coloured blue.
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.

  1. Generate density plots of BMIs coloured by Sex.

HINT: alpha can be used to control transparancy.

  1. Generate a separate density plot of BMI coloured by sex for each Grade,