Further Exploration

You can point this webbot at any web page, and it will generate a copy of each image that page uses, arranged in a directory structure that resembles the original. You can also develop other useful webbots based on this design. If you want to test your skills, consider the following challenges.

  • Write a similar webbot that detects hijacked images.

  • Improve the efficiency of the script by reworking it so that it doesn’t download an image it has downloaded previously.

  • Modify this webbot to create local backup copies of web pages.

  • Adjust the webbot to cache movies or audio files instead of images.

  • Modify the bot to monitor when images change on a web page.

Get Webbots, Spiders, and Screen Scrapers, 2nd Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.