The hack – splitting by word extraction

Word extraction can be done in several ways, and we can in turn use it to split text into tokens. Here we will look at regex, or regular expressions, for extracting words. Regex is a pattern-driven string search mechanism: the user writes a pattern in the regex grammar, and the engine finds the substrings that match it.
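As a minimal sketch of the idea, the snippet below uses Python's built-in `re` module to pull words out of a string. The sample sentence, the `WORD_PATTERN` name, and the `extract_words` helper are illustrative assumptions rather than the book's own code, and `\w+` is only one of many possible word patterns.

```python
import re

# Assumption: a "word" is any run of letters, digits, or underscores (\w+).
WORD_PATTERN = re.compile(r"\w+")


def extract_words(text):
    """Return every substring of text that matches WORD_PATTERN, in order."""
    return WORD_PATTERN.findall(text)


# Hypothetical sample sentence, chosen only for illustration.
sample = "The quick brown fox doesn't jump over 2 lazy dogs."
tokens = extract_words(sample)
print(tokens)
# ['The', 'quick', 'brown', 'fox', 'doesn', 't', 'jump', 'over', '2', 'lazy', 'dogs']
```

Note that punctuation is dropped and the apostrophe splits "doesn't" into two tokens. Whether that is acceptable depends on the task; a different pattern would be needed to keep contractions together.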
