HackAI: Web scraping for text data collection
An interactive workshop to learn how to scrape text data from the Internet.
This hands-on workshop will introduce participants to the principles of webscraping, including targeting key parts of web pages and minimising your impact on the website host. This workshop will assume some familiarity with Python, to the level addressed in our Introduction to Python workshop.
Attendees will be sent a set of learning materials covering the following topics:
- Browsing the web through Python
- Extracting content from webpages
- Structured formats for web data
- Ethical principles for collecting web data
For questions about this event, please contact idsai-ecrn-committee@exeter.ac.uk.
This activity is part of an ECRN Enhancement Award that has been funded by the University of Exeter Researcher Development and Research Culture team.
An interactive workshop to learn how to scrape text data from the Internet.
This hands-on workshop will introduce participants to the principles of webscraping, including targeting key parts of web pages and minimising your impact on the website host. This workshop will assume some familiarity with Python, to the level addressed in our Introduction to Python workshop.
Attendees will be sent a set of learning materials covering the following topics:
- Browsing the web through Python
- Extracting content from webpages
- Structured formats for web data
- Ethical principles for collecting web data
For questions about this event, please contact idsai-ecrn-committee@exeter.ac.uk.
This activity is part of an ECRN Enhancement Award that has been funded by the University of Exeter Researcher Development and Research Culture team.
Good to know
Highlights
- 4 hours
- In person
Location
University of Exeter
Stocker Road
Exeter EX4 4PY
How do you want to get there?
