Chapter 13. A Digital Content Remastering Application

Document Understanding

The design of many software systems involves manipulating and processing digital media or digital content.[1] For instance, delivering electronic services through internet-based channels requires that printed material such as books, journals, newspapers, and magazines be converted into forms suitable for electronic distribution. This type of content manipulation often includes preprocessing, transformation from one format to another, extraction of metadata, and, in many cases, verification and validation of the resulting content.
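To make the four stages named above concrete, the following is a minimal sketch, not the chapter's own design. It assumes a toy input (plain text whose first non-empty line is a title) and a toy output format (a flat HTML string). The names `Content`, `preprocess`, `transform`, `extract_metadata`, `validate`, and `remaster` are hypothetical and chosen only for illustration.

```python
import html
import re
from dataclasses import dataclass, field


@dataclass
class Content:
    """Hypothetical container for one piece of digital content."""
    text: str
    fmt: str = "plain"
    metadata: dict = field(default_factory=dict)


def preprocess(content: Content) -> Content:
    # Collapse runs of whitespace inside each line and trim the whole text.
    lines = [" ".join(line.split()) for line in content.text.splitlines()]
    return Content("\n".join(lines).strip(), content.fmt, dict(content.metadata))


def transform(content: Content) -> Content:
    # Assumed convention: first non-empty line is the title, the rest are paragraphs.
    lines = [line for line in content.text.split("\n") if line]
    if not lines:
        return Content("", "html", dict(content.metadata))
    title, body = lines[0], lines[1:]
    markup = f"<h1>{html.escape(title)}</h1>"
    markup += "".join(f"<p>{html.escape(p)}</p>" for p in body)
    return Content(markup, "html", dict(content.metadata))


def extract_metadata(content: Content) -> Content:
    # Derive simple metadata from the transformed markup.
    match = re.search(r"<h1>(.*?)</h1>", content.text)
    metadata = dict(content.metadata)
    metadata["title"] = html.unescape(match.group(1)) if match else ""
    metadata["paragraphs"] = content.text.count("<p>")
    return Content(content.text, content.fmt, metadata)


def validate(content: Content) -> list:
    # Return a list of problems; an empty list means the content passed.
    problems = []
    if content.fmt != "html":
        problems.append("unexpected format")
    if not content.metadata.get("title"):
        problems.append("missing title")
    return problems


def remaster(raw_text: str) -> Content:
    # Run the stages in the order the text lists them.
    content = extract_metadata(transform(preprocess(Content(raw_text))))
    problems = validate(content)
    if problems:
        raise ValueError("; ".join(problems))
    return content
```

Calling `remaster` on a short text with a title line and two paragraph lines returns a `Content` whose `fmt` is `"html"` and whose `metadata` holds the title and a paragraph count of 2. An empty input fails validation and raises `ValueError`. Keeping each stage as a separate function that returns a new `Content` value is one possible choice; it lets stages be tested, reordered, or replaced independently.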

Document understanding is one form of content understanding in which a system analyzes documents, including books, ...

Excerpt from Pattern-Oriented Analysis and Design: Composing Patterns to Design Software Systems.
