Title-based Approach to Relation Discovery from Wikipedia

Rim Zarrad, Narjes Doggaz, Ezzeddine Zagrouba

2013

Abstract

With the advent of the Web and the explosion of available textual data, the field of domain ontology engineering has gained more and more importance. The last decade, several successful tools for automatically harvesting knowledge from web data have been developed, but the extraction of taxonomic and non taxonomic ontological relationships is still far from being fully solved. This paper describes a new approach which extracts ontological relations from Wikipedia. The non-taxonomic relations extraction process is performed by analyzing the titles which appear in each document of the studied corpus. This method is based on regular expressions which appear in titles and from which we can extract not only the two arguments of the relationships but also the labels which describe the relations. The resulting set of labels is used in order to retrieve new relations by analyzing the title hierarchy in each document. Other relations can be extracted from titles and subtitles containing only one term. An enrichment step is also applied by considering each term which appears as a relation argument of the extracted links in order to discover new concepts and new relations. The experiments have been performed on French Wikipedia articles related to the medical field. The precision and recall values are encouraging and seem to validate our approach.

Download


Paper Citation


in Harvard Style

Zarrad R., Doggaz N. and Zagrouba E. (2013). Title-based Approach to Relation Discovery from Wikipedia . In Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2013) ISBN 978-989-8565-81-5, pages 70-80. DOI: 10.5220/0004547400700080

in Bibtex Style

@conference{keod13,
author={Rim Zarrad and Narjes Doggaz and Ezzeddine Zagrouba},
title={Title-based Approach to Relation Discovery from Wikipedia},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2013)},
year={2013},
pages={70-80},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004547400700080},
isbn={978-989-8565-81-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development - Volume 1: KEOD, (IC3K 2013)
TI - Title-based Approach to Relation Discovery from Wikipedia
SN - 978-989-8565-81-5
AU - Zarrad R.
AU - Doggaz N.
AU - Zagrouba E.
PY - 2013
SP - 70
EP - 80
DO - 10.5220/0004547400700080