How to do it

This example is not meant to be exhaustive or optimally performant. It is meant to be simple to understand and easy to apply and extend to our example of parsing job listings:

  1. We will examine this process using the following sentences from a StackOverflow job listing for SpaceX:

We are seeking developers with demonstrable experience in: ASP.NET, C#, SQL Server, and AngularJS. We are a fast-paced, highly iterative team that has to adapt quickly as our factory grows.

  2. There are a number of high-value 2-grams in these two sentences (and I think job listings are a great place to look for 2-grams). Just by reading them, I can pick out the following as being important:
    • ASP.NET
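To make the idea of picking out 2-grams concrete, here is a minimal sketch that enumerates every adjacent word pair in the two sentences quoted above. It assumes plain whitespace tokenization with punctuation stripped only from the edges of each token, and the helper names `tokenize` and `bigrams` are hypothetical, chosen for illustration. It is not necessarily the approach the recipe itself goes on to use. With this tokenizer, terms such as ASP.NET and C# survive as single tokens, whereas a tokenizer that splits on punctuation would break them into pieces that then have to be joined back together.

```python
# Sentences quoted from the job listing above.
SENTENCES = [
    "We are seeking developers with demonstrable experience in: "
    "ASP.NET, C#, SQL Server, and AngularJS.",
    "We are a fast-paced, highly iterative team that has to adapt "
    "quickly as our factory grows.",
]


def tokenize(sentence):
    """Split on whitespace and strip punctuation from token edges only.

    Interior characters are kept, so 'ASP.NET,' becomes 'ASP.NET' and
    'C#,' becomes 'C#'. (Hypothetical helper, not from the recipe.)
    """
    strip_chars = ".,:;!?()\"'"
    stripped = (word.strip(strip_chars) for word in sentence.split())
    return [token for token in stripped if token]


def bigrams(tokens):
    """Return every pair of adjacent tokens as a (first, second) tuple."""
    return list(zip(tokens, tokens[1:]))


# Build candidates one sentence at a time so that no pair
# straddles the boundary between the two sentences.
candidates = []
for sentence in SENTENCES:
    candidates.extend(bigrams(tokenize(sentence)))

for first, second in candidates:
    print(first, second)
```

Most of the pairs this prints are noise (for example, "We are" or "that has"), which is why a later filtering or matching step is needed to keep only the pairs that matter, such as "SQL Server" or "adapt quickly".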
