A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS

Ermelinda Oro, Francesco Riccetti, Massimo Ruffolo

2011

Abstract

In recent years, the importance of accessing and acquiring information made available by Web (HTML) pages and business (PDF) documents has grown considerably. In this paper we present a textual query language, named ViQueL, whose main feature is the ability to identify and extract relevant information from HTML and PDF documents on the basis of their visual appearance, using easy-to-write queries. The proposed language is founded on spatial grammars, i.e. context-free grammars extended with spatial constructs. Despite its considerable expressive power, the combined complexity of ViQueL is in PTIME. Moreover, experiments show that ViQueL is reasonably efficient for real-life extraction tasks.

Paper Citation


in Harvard Style

Oro E., Riccetti F. and Ruffolo M. (2011). A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS. In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-40-9, pages 306-312. DOI: 10.5220/0003177603060312

in Bibtex Style

@conference{icaart11,
author={Ermelinda Oro and Francesco Riccetti and Massimo Ruffolo},
title={A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2011},
pages={306-312},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003177603060312},
isbn={978-989-8425-40-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - A SPATIAL QUERY LANGUAGE FOR PRESENTATION-ORIENTED DOCUMENTS
SN - 978-989-8425-40-9
AU - Oro E.
AU - Riccetti F.
AU - Ruffolo M.
PY - 2011
SP - 306
EP - 312
DO - 10.5220/0003177603060312