COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES

N. Nikolaou, N. Papamarkos

2006

Abstract

In this paper we present a new method for color segmentation of complex document images, which can be used as a preprocessing step in a text information extraction application. From the edge map of an image, we choose a representative set of samples of the input color image and build the 3D histogram of the RGB color space. These samples are used to locate a relatively large number of suitable points in the 3D color space, which serve to initially reduce the colors. This step produces an oversegmented image that usually has no more than 100 colors. To extract the final result, a mean shift procedure starts from the calculated points and locates the final color clusters of the RGB color distribution. In addition, to overcome noise problems, a proposed edge-preserving smoothing filter is used to enhance the quality of the image. Experimental results showed that the method correctly segments complex color documents while removing background noise or low-contrast objects, which is very desirable in text information extraction applications. Additionally, our method is able to cluster arbitrarily shaped distributions.
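The pipeline outlined in the abstract (coarse 3D RGB histogram, initial points, mean shift, final color assignment) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names, the 8-bin histogram, the flat kernel, and the bandwidth of 40 are all assumptions made for illustration. The edge-map sampling and the edge-preserving smoothing filter are not reproduced here, so every pixel is used as a sample.

```python
import numpy as np

def initial_points(samples, bins=8, top=100):
    """Centers of the most populated cells of a coarse 3D RGB histogram.
    (Assumed stand-in for the paper's initial color-reduction points.)"""
    hist, _ = np.histogramdd(samples, bins=(bins,) * 3, range=((0, 256),) * 3)
    idx = np.argwhere(hist > 0)          # occupied cells, C order
    counts = hist[hist > 0]              # their counts, same order
    order = np.argsort(counts)[::-1][:top]
    width = 256.0 / bins
    return (idx[order] + 0.5) * width    # cell centers in RGB units

def mean_shift_modes(samples, starts, bandwidth=40.0, max_iter=50, tol=1e-3):
    """Move each start point to a local density mode (flat kernel)."""
    modes = np.asarray(starts, dtype=np.float64).copy()
    for i in range(len(modes)):
        p = modes[i]
        for _ in range(max_iter):
            dist = np.linalg.norm(samples - p, axis=1)
            near = samples[dist <= bandwidth]
            if len(near) == 0:           # empty window: keep current point
                break
            new_p = near.mean(axis=0)
            shift = np.linalg.norm(new_p - p)
            p = new_p
            if shift < tol:
                break
        modes[i] = p
    return modes

def merge_modes(modes, bandwidth=40.0):
    """Keep one representative for modes closer than the bandwidth."""
    merged = []
    for m in modes:
        if all(np.linalg.norm(m - c) > bandwidth for c in merged):
            merged.append(m)
    return np.array(merged)

def quantize(img, centers):
    """Label each pixel with the index of its nearest color center."""
    flat = img.reshape(-1, 3).astype(np.float64)
    d2 = ((flat[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1).reshape(img.shape[:2])

if __name__ == "__main__":
    # Synthetic two-color "document": noisy red half, noisy blue half.
    rng = np.random.default_rng(0)
    img = np.zeros((20, 20, 3))
    img[:, :10] = (200, 30, 30)
    img[:, 10:] = (20, 20, 220)
    img += rng.integers(-5, 6, size=img.shape)
    samples = img.reshape(-1, 3)
    centers = merge_modes(mean_shift_modes(samples, initial_points(samples)))
    labels = quantize(img, centers)
    print(len(centers), "color clusters found")
```

The distance matrix in `quantize` grows with the number of pixels times the number of centers, so a real document scan would need chunked processing. The sketch keeps the simple form for readability.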


Paper Citation


in Harvard Style

Nikolaou N. and Papamarkos N. (2006). COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES. In Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, ISBN 972-8865-40-6, pages 220-227. DOI: 10.5220/0001366202200227

in Bibtex Style

@conference{visapp06,
author={N. Nikolaou and N. Papamarkos},
title={COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES},
booktitle={Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP},
year={2006},
pages={220-227},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001366202200227},
isbn={972-8865-40-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP
TI - COLOR SEGMENTATION OF COMPLEX DOCUMENT IMAGES
SN - 972-8865-40-6
AU - Nikolaou N.
AU - Papamarkos N.
PY - 2006
SP - 220
EP - 227
DO - 10.5220/0001366202200227