Skip to the content.

Last week we discussed general guidelines for first interacting with a new data set. This week we want to build on those activities by learning how to clean and tidy our data, and then beginning our journey to creating insights with data through data manipulation.

Specifically, this week you are going to learn:

  1. How to make your data “tidy”.
  2. How to transofrm your data to create insights from data.

Common transformation procedures include filtering observations by their values, reordering the rows, selecting variables, creating new variables with functions of existing variables, and collapsing values down to single summary summary statistics (i.e. mean, max, variance).

Consequently, this week will give you a strong foundation for managing and cleaning your data. This will prepare you for your second challenge in completing your course project - that of cleaning, tidying, and preparing your data for exploratory data analysis!

Below outlines the tutorials that you need to review, and the assignments you need to complete, prior to Monday’s class. The skills and functions introduced in these tutorials will be necessary for Monday’s in-class activities.


Assignments

Readings


Class

Please download the class material from canvas.

See you in class!

In addition, be sure to have identified which data you are going to use for your final project. Be sure to have access to this data because you will work on it during class. Furthermore, identify at least 10 specific questions you want to ask of your project data. Using what you learned this week, what type of data transformations do you need to make to help answer these questions? Be ready to use dplyr to answer these questions in class.