Chapter 5. Parsing – Analyzing Training Data

Parsing, also referred to as syntactic analysis, is one of the tasks in NLP. It is defined as the process of finding whether a character sequence, written in natural language, is in accordance with the rules defined in formal grammar. It is the process of breaking the sentences into words or phrase sequences and providing them a particular component category (noun, verb, preposition, and so on).

This chapter will include the following topics:

  • Treebank construction
  • Extracting Context-free Grammar (CFG) rules from Treebank
  • Creating a probabilistic Context-free Grammar from CFG
  • CYK chart parsing algorithm
  • Earley chart parsing algorithm

Introducing parsing

Parsing is one of the steps involved in NLP. It is defined ...

Get Natural Language Processing: Python and NLTK now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.