Skip to the content.

“What we have is a data glut.” - Vernon Vinge

Data are being generated by everything around us at all times. Every digital process and social media exchange produces it. Systems, sensors and mobile devices transmit it. Countless databases collect it. Data are arriving from multiple sources at an alarming rate and analysts and organizations are seeking ways to leverage these new sources of information. Consequently, analysts need to understand how to get data from these data sources.

Welcome to week 2! This week we will focus on:

  1. Creating reproducible documents with R Markdown.
  2. Understanding the basics of project workflow with R Projects, R Markdown, and R Notebooks.
  3. How to import (quickly!) flat files, and understanding the difference between Base R functions and Tidyverse functions to import data.
  4. Advanced importing capabilities such as importing data straight from relational databases (i.e. SQL), web scraping, and importing other statistical software data files (i.e. SPSS, SAS, STATA)

Consequently, this week will give you a strong foundation for the different ways to get your data into R and understanding the basics of your data set. This will prepare you for your first challenge in completing your course project - that of acquiring your data!

Below outlines the readings that you need to review and the assignments you need to complete after Monday’s class. The skills and functions introduced in these tutorials will be necessary for Monday’s in-class activities.


Assignments

Readings


Class

Please download the class material from canvas.

See you in class!