06 | PCA, applied

Objectives

  1. Download the .csv file found in \assignments\06.inClass on the shared Google Drive.
  2. Load the file into a dataframe. Explore the data.
  3. Create a .md file that will serve as documentation of your work.
  4. Select the features that you would like to apply PCA to. In your documentation, explain why you have chosen these features.
  5. Create one or more visualizations of your selected variables.
  6. Apply PCA. Produce a scree plot. How many components would you need to explain the majority of your data?
  7. Plot the reprojected data and examine the loadings. In your documentation, explain the physical meanings of (at least) the first two principal components.
  8. Upload both your code and your .md file onto the class repository (under /inClass/06.PCA/yourGroupName/).

Deadline

You should complete this assignment by the end of class.