Part 2 Transform: Modifying and Manipulating Text

Now that we’ve seen how to read text from various sources, we need to look at what we can do with that text once it’s in a script. Generally, this means either extracting some data from it or transforming it into a different format.

We’ll look at a variety of techniques, each useful in different scenarios. First, we’ll look at regular expressions, which prove useful almost everywhere we process text. Then we’ll learn how to write parsers for handling more formally structured text. Finally, we’ll take a whistle-stop tour of natural language processing, a broad but exciting topic.
