Getting ready

As an example, we will build a vary simple ETL process that will read the main planets page and store the planets data in MySQL. It will also pass a single message for each more info link in the page to a queue, where 0 or more processes can receive those requests and perform further processing on those links.

To access SQS from Python, we will revisit using the boto3 library.

Get Python Web Scraping Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.