SKEW CORRECTION IN DOCUMENTS WITH SEVERAL DIFFERENTLY SKEWED TEXT AREAS

P. Saragiotis, N. Papamarkos

2007

Abstract

In this paper we propose a technique for detecting and correcting the skew of text areas in a document, where a single document may contain several text areas with different skew angles. In the first stage, a text localization procedure based on connected components analysis is applied. Specifically, the connected components of the document are extracted and filtered according to their size and geometric characteristics. Next, the candidate characters are grouped using a nearest neighbour approach, first into words and then into text lines of any skew. Using linear regression, two lines are estimated for each text line, representing its top and bottom boundaries. Nearby text lines with similar skew angles are then grown into text areas. Each text area is rotated independently to a horizontal or vertical orientation. The technique has been tested on a wide variety of documents, including spreadsheets, book and magazine covers, and advertisements, and has proved efficient and robust.
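As an illustration of the boundary-estimation step, the following is a minimal sketch, not the authors' implementation. It assumes each candidate character of a text line is summarised by a hypothetical triple (x_center, y_top, y_bottom), fits one ordinary least-squares line through the top points and one through the bottom points, and takes the mean of the two line angles as the skew of the text line. The function names, the input layout, and the averaging of the two angles are assumptions made for the example.

```python
import math


def fit_line(points):
    """Ordinary least-squares fit of y = a*x + b through (x, y) points."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points to fit a line")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        # All x values coincide: the line is vertical and y = a*x + b
        # cannot represent it (a caller could swap the axes instead).
        raise ValueError("cannot fit y = a*x + b to a vertical line")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def text_line_skew(boxes):
    """Estimate the skew of one text line in degrees.

    boxes: list of (x_center, y_top, y_bottom), one per candidate character
    (an assumed layout for this sketch).
    """
    top_slope, _ = fit_line([(x, top) for x, top, _ in boxes])
    bottom_slope, _ = fit_line([(x, bottom) for x, _, bottom in boxes])
    # Average the angles of the top and bottom boundary lines.
    mean_angle = (math.atan(top_slope) + math.atan(bottom_slope)) / 2
    return math.degrees(mean_angle)
```

For example, characters of height 12 laid out along a baseline of slope tan(10°) yield an estimated skew of approximately 10 degrees. Text lines that are close to vertical would need the axes swapped before fitting, since the sketch regresses y on x.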

Paper Citation


in Harvard Style

Saragiotis P. and Papamarkos N. (2007). SKEW CORRECTION IN DOCUMENTS WITH SEVERAL DIFFERENTLY SKEWED TEXT AREAS. In Proceedings of the Second International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 978-972-8865-73-3, pages 85-92. DOI: 10.5220/0002041800850092

in Bibtex Style

@conference{visapp07,
author={P. Saragiotis and N. Papamarkos},
title={SKEW CORRECTION IN DOCUMENTS WITH SEVERAL DIFFERENTLY SKEWED TEXT AREAS},
booktitle={Proceedings of the Second International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP},
year={2007},
pages={85-92},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002041800850092},
isbn={978-972-8865-73-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP
TI - SKEW CORRECTION IN DOCUMENTS WITH SEVERAL DIFFERENTLY SKEWED TEXT AREAS
SN - 978-972-8865-73-3
AU - Saragiotis P.
AU - Papamarkos N.
PY - 2007
SP - 85
EP - 92
DO - 10.5220/0002041800850092