ENSURING HIGH PERFORMANCE IN VALIDATING XML PARSER

Donglei Cao, Shuang Yu, Beijie Dai, Beihong Jin

2007

Abstract

An XML parser is the fundamental software for analyzing and processing XML documents. This paper presents the optimized validation algorithms in OnceXMLParser, a full-validating XML Parser. OnceXMLParser adopts a lightweight architecture and implements several efficient algorithms for validating. Since the element validating is a great challenge to the performance of a validating XML parser, this paper focused on two key algorithms to resolve it. The first one involves in an optimized automaton used to build these element validating rules efficiently. The second one is a statistical predictive algorithm to reduce the name string recognizing process. For a valid document, this algorithm could make precise prediction when the child elements are sequentially defined, and could fulfil the least cost prediction according to the cost function when the child elements are optionally defined. Performance testing shows OnceXMLParser after performance tuning has outstanding parsing efficiency.

Download


Paper Citation


in Harvard Style

Cao D., Yu S., Dai B. and Jin B. (2007). ENSURING HIGH PERFORMANCE IN VALIDATING XML PARSER . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-972-8865-77-1, pages 210-215. DOI: 10.5220/0001276102100215

in Bibtex Style

@conference{webist07,
author={Donglei Cao and Shuang Yu and Beijie Dai and Beihong Jin},
title={ENSURING HIGH PERFORMANCE IN VALIDATING XML PARSER},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2007},
pages={210-215},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001276102100215},
isbn={978-972-8865-77-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - ENSURING HIGH PERFORMANCE IN VALIDATING XML PARSER
SN - 978-972-8865-77-1
AU - Cao D.
AU - Yu S.
AU - Dai B.
AU - Jin B.
PY - 2007
SP - 210
EP - 215
DO - 10.5220/0001276102100215