ArabRelat: Arabic Relation Extraction using Distant Supervision

Reham Mohamed, Nagwa M. El-Makky, Khaled Nagi

2015

Abstract

Relation extraction is an important preprocessing task for a number of text mining applications, including information retrieval, question answering, and ontology building. In this paper, we propose a novel Arabic relation extraction method that leverages linguistic features of the Arabic language in Web data to infer relations between entities. Due to the lack of labeled Arabic corpora, we adopt the idea of distant supervision, where DBpedia, a large database of semantic relations extracted from Wikipedia, is used along with a large unlabeled text corpus to build the training data. We extract sentences from the unlabeled text corpus and tag them with the corresponding DBpedia relations. Finally, we use this data to build a relation classifier that predicts the relation type of new instances. Our experimental results show that the system achieves an F-measure of 70% in detecting relations.
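
The labeling step of distant supervision described above can be illustrated with a minimal sketch. It assumes a toy in-memory knowledge base in place of DBpedia and plain substring matching in place of real Arabic entity recognition; the names KB, CORPUS, and label_sentences, as well as the example sentences and relation labels, are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical toy knowledge base standing in for DBpedia triples:
# (subject entity, object entity) -> relation label.
KB: Dict[Tuple[str, str], str] = {
    ("نجيب محفوظ", "القاهرة"): "birthPlace",
    ("القاهرة", "مصر"): "country",
}

# Hypothetical unlabeled sentences standing in for the text corpus.
# 1) "Naguib Mahfouz was born in Cairo"   (mentions the first pair)
# 2) "Cairo is the capital of Egypt"      (mentions the second pair)
# 3) "Naguib Mahfouz is a famous writer"  (mentions only one entity)
CORPUS: List[str] = [
    "ولد نجيب محفوظ في القاهرة",
    "القاهرة هي عاصمة مصر",
    "نجيب محفوظ كاتب مشهور",
]


def label_sentences(
    corpus: List[str], kb: Dict[Tuple[str, str], str]
) -> List[Tuple[str, str, str, str]]:
    """Tag each sentence that mentions both entities of a known pair.

    Assumption: an entity is "mentioned" if its surface string occurs in
    the sentence. A real system would need tokenization, morphological
    analysis, and entity linking.
    """
    labeled = []
    for sentence in corpus:
        for (subj, obj), relation in kb.items():
            if subj in sentence and obj in sentence:
                labeled.append((sentence, subj, obj, relation))
    return labeled


# Noisy training examples: (sentence, subject, object, relation).
TRAINING_DATA = label_sentences(CORPUS, KB)
```

The resulting tuples would then be turned into feature vectors and passed to a relation classifier. The sketch stops before that step because the abstract does not specify the features or the learning algorithm.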


Paper Citation


in Harvard Style

Mohamed R., El-Makky N. M. and Nagi K. (2015). ArabRelat: Arabic Relation Extraction using Distant Supervision. In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD, (IC3K 2015) ISBN 978-989-758-158-8, pages 410-417. DOI: 10.5220/0005636604100417

in Bibtex Style

@conference{keod15,
author={Reham Mohamed and Nagwa M. El-Makky and Khaled Nagi},
title={ArabRelat: Arabic Relation Extraction using Distant Supervision},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD, (IC3K 2015)},
year={2015},
pages={410-417},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005636604100417},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 2: KEOD, (IC3K 2015)
TI - ArabRelat: Arabic Relation Extraction using Distant Supervision
SN - 978-989-758-158-8
AU - Mohamed R.
AU - El-Makky N. M.
AU - Nagi K.
PY - 2015
SP - 410
EP - 417
DO - 10.5220/0005636604100417