5 Download
As we just saw, the next step after using bow()
is to scrape()
. Scraping a single page requires only these two functions:
dat <-
bow("https://scrapethissite.com/pages/simple") |>
scrape()
Now we have this data downloaded as a data object, dat
. It is good practice to store this website as an object in our R environment, rather than piping the results into a data extraction function. The offline copy in our environment allows us to practice with CSS selectors without sending repeated requests to the website.