The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies

María G. Buey, Angel Luis Garrido, Carlos Bobed, Sergio Ilarri

2016

Abstract

In the legal field, management companies process a large number of documents every day to extract the data they consider most relevant and store it in their own databases. Despite technological advances, in many organizations the task of examining these usually extensive documents to extract just a few essential data items is still performed manually, which is expensive, time-consuming, and prone to human error. Moreover, legal documents usually follow several conventions in both structure and use of language which, while not completely formal, can be exploited to boost information extraction. In this work, we present an approach to extracting relevant information from these legal documents, based on the use of ontologies to capture and take advantage of such structure and language conventions. We have implemented our approach in a framework that can be adapted to different types of documents with minimal effort. Within this framework, we have also addressed a frequent problem in this kind of documentation: the presence of overlapping elements, such as stamps or signatures, which greatly hinders extraction from scanned documents. Experimental results are promising and show the feasibility of our approach.


Paper Citation


in Harvard Style

Buey M., Garrido A., Bobed C. and Ilarri S. (2016). The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies. In Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-172-4, pages 438-445. DOI: 10.5220/0005757204380445

in Bibtex Style

@conference{icaart16,
author={María G. Buey and Angel Luis Garrido and Carlos Bobed and Sergio Ilarri},
title={The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies},
booktitle={Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2016},
pages={438-445},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005757204380445},
isbn={978-989-758-172-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies
SN - 978-989-758-172-4
AU - Buey M.
AU - Garrido A.
AU - Bobed C.
AU - Ilarri S.
PY - 2016
SP - 438
EP - 445
DO - 10.5220/0005757204380445