TRIADS Fall 2024 Training Series: Webscraping in Python

In this concise 1-session workshop, we will learn about web scraping using the Requests and BeautifulSoup library in Python. Often, the data essential for research is not neatly presented as a CSV or JSON file and we need to go out and search for it ourselves. Web scraping is one way of using an automated process to collect data from websites (like, Wikipedia). This workshop will introduce you to the basic components of html, which serves as the backbone for structuring content on web pages, and will guide you in constructing your very own web scraper.

This course is intended for graduate students, faculty and staff from any field at WashU interested in collecting data from websites. Participants are expected to have a basic proficiency in Python and some experience working with the Pandas library. 

This class will be fully in-person, and participants will use their own laptops.

TRIADS training workshops are co-sponsored by University Library Data Services, as part of the DataLab Workshops series.

Time: 2:00 – 3:30 p.m.

Location: Instruction Room 3, Olin Library, A Level

Instructor: Ishita Gopal

Max enrollment: Enrollment is limited to 30. If you enroll and elect not to attend, please let us know ASAP so we can offer the space to another participant. 

RSVP