THRESHOLD CORRECTION OF DOCUMENT IMAGE BINARIZATION FOR TEXT EXTRACTION

Hiroshi Tanaka, Yusaku Fujii, Yoshinobu Hotta

2011

Abstract

In this paper, a simple threshold correction method for document image binarization for text extraction is presented. This method enhances the binary image of characters, which is often adversely influenced by neighboring strong pixels or background noise. The threshold correction method is based on a similar method applied to ruled-line extraction presented by the author, and is claimed to be effective to text extraction. The author also reveals the relationship between effectiveness of the method and the image resolution.

Download


Paper Citation


in Harvard Style

Tanaka H., Fujii Y. and Hotta Y. (2011). THRESHOLD CORRECTION OF DOCUMENT IMAGE BINARIZATION FOR TEXT EXTRACTION . In Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011) ISBN 978-989-8425-47-8, pages 387-391. DOI: 10.5220/0003396503870391

in Bibtex Style

@conference{visapp11,
author={Hiroshi Tanaka and Yusaku Fujii and Yoshinobu Hotta},
title={THRESHOLD CORRECTION OF DOCUMENT IMAGE BINARIZATION FOR TEXT EXTRACTION},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)},
year={2011},
pages={387-391},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003396503870391},
isbn={978-989-8425-47-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 1: VISAPP, (VISIGRAPP 2011)
TI - THRESHOLD CORRECTION OF DOCUMENT IMAGE BINARIZATION FOR TEXT EXTRACTION
SN - 978-989-8425-47-8
AU - Tanaka H.
AU - Fujii Y.
AU - Hotta Y.
PY - 2011
SP - 387
EP - 391
DO - 10.5220/0003396503870391