Data Wrangling in Stata, Part Two

This workshop has sessions on multiple days. You should plan to attend all the sessions.

"Data Wrangling" is the process of preparing data for analysis, which includes importing, cleaning, recoding, restructuring, combining, and anything else data needs before it can be analyzed. Data wrangling is a critical skill for research.

In this class you'll learn how to wrangle data using Stata. We'll cover some of the key concepts and workflows of data science as well as the structure and logic of Stata. We'll emphasize real-world issues like handling missing data and checking for errors, as well as best practices for research computing and reproducibility. Our goal is give you a strong foundation you can build on to become an expert data wrangler.

In part two, we'll learn more about working with data, especially grouped or hierarchical data (which is very common). We'll also discuss restructuring data and combining data sets. We'll cover sections four through nine of the online curriculum.

Students should take SSCC's Introduction to Stata or have equivalent experience before taking this class. You should be comfortable writing do files and understand all the component parts of a command like 'sum x if y>5, detail'. Graduate students may choose to take this class at the start of their graduate student career or wait until they are ready to start doing research.

Instructor: Dimond
Room: 3218 Sewell Social Sciences Building
Dates: 2/14, 2/16, 2/21, 2/23
Time: 2:00 - 4:00
Semester: spring22