Introduction to Web Scraping with R

This workshop has sessions on multiple days. You should plan to attend all the sessions.

Web scraping is the process of extracting data from websites, which can then be used for research.

In this workshop, you will be learn best practices for ethical web scraping, the basics of HTML syntax and CSS selectors, and how to extract information from a web page with the rvest package in R.

To benefit from this workshop, you will need an understanding of the fundamentals of working with data in R, such as you can get from our Data Wrangling in R workshop or online curriculum (https://sscc.wisc.edu/sscc/pubs/dwr/).

Familiarity and experience with HTML, CSS selectors, the purrr R package, and running jobs on Linstat (https://kb.wisc.edu/sscc/page.php?id=102669) will allow you to complete more advanced and larger web scraping tasks, but they are not required.

Instructor: Struck
Room: 3218 Sewell Social Sciences Building
Dates: 10/6, 10/13, 10/20
Time: 10:45 - 11:45
Semester: fall21