Handbook of Statistics by Venu Govindaraju, C.R. Rao

Chapter 16

Learning Algorithms for Document Layout Analysis

Simone Marinai, Dipartimento di Sistemi e Informatica, Università degli Studi di Firenze, Italy

Abstract

In this chapter we describe several approaches that have been proposed for using learning algorithms to analyze the layout of digitized documents. Layout analysis encompasses all the techniques used to infer the organization of the page layout of document images. From a physical point of view, the layout can be described as composed of blocks, in most cases rectangular, that are arranged on the page and contain homogeneous content, such as text, vector graphics, or illustrations. From a logical point of view, text blocks can have different meanings on the basis of their content ...
