ONTOLOGY LEARNING BY ANALYZING XML DOCUMENT STRUCTURE AND CONTENT

Nathalie Aussenac-Gilles, Mouna Kamel

2009

Abstract

Most existing methods for ontology learning from textual documents rely on natural language analysis. We extend these approaches by taking into account the document structure which bears additional knowledge. The documents that we deal with are XML specifications of databases. In addition to classical linguistic clues, the structural organization of such documents also contributes to convey meaning. In a first stage, we characterize the semantics of XML mark-up and of their relations. Then parsing rules are defined to exploit the XML structure of documents and to create ontology concepts and semantic relations. These rules make it possible to automatically learn a kernel of ontology from documents. In a second stage; this ontology is enriched with the results of text analysis by lexico-syntactic patterns. Both ontology learning rules and patterns are implemented in the Gate platform.

Download


Paper Citation


in Harvard Style

Aussenac-Gilles N. and Kamel M. (2009). ONTOLOGY LEARNING BY ANALYZING XML DOCUMENT STRUCTURE AND CONTENT . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009) ISBN 978-989-674-012-2, pages 159-165. DOI: 10.5220/0002293301590165

in Bibtex Style

@conference{keod09,
author={Nathalie Aussenac-Gilles and Mouna Kamel},
title={ONTOLOGY LEARNING BY ANALYZING XML DOCUMENT STRUCTURE AND CONTENT},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009)},
year={2009},
pages={159-165},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002293301590165},
isbn={978-989-674-012-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2009)
TI - ONTOLOGY LEARNING BY ANALYZING XML DOCUMENT STRUCTURE AND CONTENT
SN - 978-989-674-012-2
AU - Aussenac-Gilles N.
AU - Kamel M.
PY - 2009
SP - 159
EP - 165
DO - 10.5220/0002293301590165