Matt Marks thinks this is interesting: links = re.findall('<loc>(.*?)</loc>', sitemap) From Crawling your first website from Web Scraping with Python by Richard Lawson Publisher: Packt Publishing Released: October 2015 Note requires - import re Share this highlight http://learning.oreilly.com/a/web-scraping-with/3779530/ Twitter Facebook Google Plus Email Get Instant Access Now Start a Free Trial Have an account? Sign in. Minimise Unlock the rest of Web Scraping with Python and 30,000 other books and videos By clicking this box, you confirm that you have read and agree to the terms and conditions of our Membership Agreement, and you understand that when your trial period ends, you will be required to provide billing information if you wish to continue using the service. Unlock the rest of this book Start a Free 10-Day Trial loading Learn about Safari for Business Have an account? Sign in.