MINING NON-TAXONOMIC CONCEPT PAIRS FROM UNSTRUCTURED TEXT - A Concept Correlation Search Framework

Mei Kuan Wong, Syed Sibte Raza Abidi, Ian D. Jonsen

2011

Abstract

An ontology consists of concepts, taxonomic relations, and non-taxonomic relations. Most ontology learning tools focus on discovering concepts and taxonomic relations; very little effort has gone into discovering non-taxonomic relations. In this paper, we present a concept correlation search framework to discover non-taxonomic concept pairs from unstructured text. Our framework features (a) extraction of correlated concepts beyond the ordinary search window size of a single sentence; (b) use of lift as the interestingness measure for association rule mining; (c) harnessing of 2-itemset association rules from n-itemset association rules where n > 2; and (d) identification of non-taxonomic concept pairs based on an existing domain ontology. The proposed framework has been tested on the Fisheries Oceanography journals, and the results demonstrate significant improvements over the traditional association rule approach in the search for non-taxonomic concept pairs.
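For readers unfamiliar with lift, the following is a minimal sketch of how it can be computed for concept pairs, using the standard definition lift(A, B) = P(A and B) / (P(A) P(B)). It is not the authors' implementation: the function name `pairwise_lift`, the treatment of each search window as one transaction, and the toy concept sets are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def pairwise_lift(transactions):
    """Return {(a, b): lift} for every concept pair that co-occurs at least once.

    Each transaction is the set of concepts found in one search window
    (hypothetical input format, assumed for this sketch).
    """
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for transaction in transactions:
        concepts = sorted(set(transaction))  # sort so each pair has one canonical order
        item_counts.update(concepts)
        pair_counts.update(combinations(concepts, 2))
    return {
        (a, b): (count / n) / ((item_counts[a] / n) * (item_counts[b] / n))
        for (a, b), count in pair_counts.items()
    }


# Toy data, invented for illustration only.
windows = [
    {"salmon", "temperature", "migration"},
    {"salmon", "migration"},
    {"temperature", "plankton"},
    {"plankton", "salmon"},
]
lifts = pairwise_lift(windows)
for pair, value in sorted(lifts.items()):
    print(pair, round(value, 3))
```

A lift above 1 suggests the two concepts co-occur more often than independence would predict, a lift near 1 suggests no correlation, and a lift below 1 suggests they tend to avoid each other. Because every pair is counted inside each transaction, pairs are also recovered from windows that hold more than two concepts.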


Paper Citation


in Harvard Style

Wong, M. K., Abidi, S. S. R. and Jonsen, I. D. (2011). MINING NON-TAXONOMIC CONCEPT PAIRS FROM UNSTRUCTURED TEXT - A Concept Correlation Search Framework. In Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WTM, (WEBIST 2011) ISBN 978-989-8425-51-5, pages 707-716. DOI: 10.5220/0003482707070716

in Bibtex Style

@conference{wtm11,
author={Mei Kuan Wong and Syed Sibte Raza Abidi and Ian D. Jonsen},
title={MINING NON-TAXONOMIC CONCEPT PAIRS FROM UNSTRUCTURED TEXT - A Concept Correlation Search Framework},
booktitle={Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WTM, (WEBIST 2011)},
year={2011},
pages={707-716},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003482707070716},
isbn={978-989-8425-51-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Conference on Web Information Systems and Technologies - Volume 1: WTM, (WEBIST 2011)
TI - MINING NON-TAXONOMIC CONCEPT PAIRS FROM UNSTRUCTURED TEXT - A Concept Correlation Search Framework
SN - 978-989-8425-51-5
AU - Wong, Mei Kuan
AU - Abidi, Syed Sibte Raza
AU - Jonsen, Ian D.
PY - 2011
SP - 707
EP - 716
DO - 10.5220/0003482707070716