Web Scraping

How to get structured Wikipedia data via DBPedia

Wikipedia contains a wealth of knowledge. While some of that knowledge consists of natural language descriptions, a large share of the information on Wikipedia is encoded in machine-readable formats, such as “infoboxes” and other specially formatted parts. An infobox is rendered as a table that you typically see on the right-hand side of an article. While …
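As a rough illustration of what reading infobox-style data through DBpedia can look like, here is a minimal sketch that queries a SPARQL endpoint using only the Python standard library. The endpoint URL, the example resource (Berlin), the property (populationTotal), and the helper names build_query, build_url and fetch_values are all assumptions chosen for illustration; the excerpt above does not specify any of them.

```python
import json
import urllib.parse
import urllib.request

# Public DBpedia SPARQL endpoint (an assumption; the excerpt does not name one).
ENDPOINT = "https://dbpedia.org/sparql"


def build_query(resource: str, prop: str) -> str:
    """Return a SPARQL query asking for one ontology property of one resource."""
    return (
        "PREFIX dbo: <http://dbpedia.org/ontology/>\n"
        "PREFIX dbr: <http://dbpedia.org/resource/>\n"
        f"SELECT ?value WHERE {{ dbr:{resource} dbo:{prop} ?value }} LIMIT 10"
    )


def build_url(query: str) -> str:
    """Encode the query as a GET request URL that asks for JSON results."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return ENDPOINT + "?" + urllib.parse.urlencode(params)


def fetch_values(resource: str, prop: str) -> list:
    """Run the query over the network and return the bound values as strings."""
    url = build_url(build_query(resource, prop))
    with urllib.request.urlopen(url, timeout=30) as resp:
        data = json.load(resp)
    return [b["value"]["value"] for b in data["results"]["bindings"]]


if __name__ == "__main__":
    # Hypothetical example: the population listed for Berlin.
    print(fetch_values("Berlin", "populationTotal"))
```

The query building is kept separate from the network call so that the generated SPARQL and URL can be inspected or tested without contacting the endpoint.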


How to Become a Web Scraping Pro with Python pt. 1

Scrapy is an excellent Python library for web scraping. For example, you could create an API with data that is populated via web scraping. This article covers some basic Scrapy features, such as the shell and selectors. Install Scrapy in a virtual environment on your machine:

$ virtualenv venv
$ source venv/bin/activate
$ pip install scrapy
$ …
