Fast Arabic Glyph Recognizer based on Haar Cascade Classifiers

Ashraf AbdelRaouf, Colin A. Higgins, Tony Pridmore, Mahmoud I. Khalil

2014

Abstract

Optical Character Recognition (OCR) is an important technology, yet Arabic lags behind Roman scripts in both the variety of available OCR systems and the depth of research. Viola and Jones (Viola and Jones 2001) introduced a machine learning approach, the Haar-Cascade classifier (HCC), to achieve rapid object detection based on a boosted cascade of Haar-like features. Here, that approach is modified for the first time to suit Arabic glyph recognition. The HCC approach eliminates problematic steps in the pre-processing and recognition phases and, most importantly, the character segmentation stage. A recognizer was produced for each of the 61 Arabic glyphs that exist after the removal of diacritical marks. These recognizers were trained and tested on some 2,000 images each. The system was tested with real text images and achieved a recognition rate of 87% for Arabic glyphs. The proposed method is fast, with an average document recognition time of 14.7 seconds compared with 15.8 seconds for commercial software.
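As a rough illustration of the Haar-like features that underlie the cascade, the sketch below computes an integral image (summed-area table) and evaluates one horizontal two-rectangle feature in constant time per rectangle. It is a minimal pure-Python sketch of the general Viola and Jones idea, not the authors' implementation; the function names and the toy 2x4 image are assumptions made for illustration only.

```python
def integral_image(img):
    """Return a summed-area table padded with a leading zero row and column."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            # Sum of everything above plus the running sum of this row.
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of pixels in the rectangle with top-left (x, y), width w, height h."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Horizontal two-rectangle Haar-like feature: left half minus right half."""
    half = w // 2
    left = rect_sum(ii, x, y, half, h)
    right = rect_sum(ii, x + half, y, half, h)
    return left - right


# Toy image (hypothetical): bright left half, dark right half.
toy = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
]
toy_ii = integral_image(toy)
feature_value = two_rect_feature(toy_ii, 0, 0, 4, 2)  # left sum 4, right sum 0
```

A large absolute feature value signals a strong vertical edge inside the window. In a boosted cascade, many such features are thresholded and combined into stages so that most non-matching windows are rejected early.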


Paper Citation


in Harvard Style

AbdelRaouf A., Higgins C. A., Pridmore T. and Khalil M. I. (2014). Fast Arabic Glyph Recognizer based on Haar Cascade Classifiers. In Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-018-5, pages 350-357. DOI: 10.5220/0004925803500357

in Bibtex Style

@conference{icpram14,
author={Ashraf AbdelRaouf and Colin A. Higgins and Tony Pridmore and Mahmoud I. Khalil},
title={Fast Arabic Glyph Recognizer based on Haar Cascade Classifiers},
booktitle={Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2014},
pages={350-357},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004925803500357},
isbn={978-989-758-018-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Fast Arabic Glyph Recognizer based on Haar Cascade Classifiers
SN - 978-989-758-018-5
AU - AbdelRaouf A.
AU - Higgins C. A.
AU - Pridmore T.
AU - Khalil M. I.
PY - 2014
SP - 350
EP - 357
DO - 10.5220/0004925803500357