MAPPING KNOWLEDGE DOMAINS - Combining Symbolic Relations with Graph Theory

Eric SanJuan

2011

Abstract

We present a symbolic and graph-based approach for mapping knowledge domains. The symbolic component relies on shallow linguistic processing of texts to extract multi-word terms and cluster them based on lexico-syntactic relations. The clusters are subjected to graph decomposition basing on inherent graph theoretic properties of association graphs of items (authors-terms, documents-authors, etc). These include the search for complete minimal separators that can decompose the graphs into central (core topics) and peripheral atoms. The methodology is implemented in the TermWatch system and can be used for several text mining tasks. We also mined for frequent itemsets as a means of revealing dependencies between formal concepts in the corpus. A comparison of the frequent itemsets extracted on each dataset and the structure of the central atom shows an interesting overlap. The interesting features of our approach lie in the combination of state-of-the-art techniques from Natural Language Processing (NLP), Clustering and Graph Theory to develop a system and methodology adapted to uncovering hidden sub-structures from texts.

Download


Paper Citation


in Harvard Style

SanJuan E. (2011). MAPPING KNOWLEDGE DOMAINS - Combining Symbolic Relations with Graph Theory . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2011) ISBN 978-989-8425-79-9, pages 519-528. DOI: 10.5220/0003721105270536

in Bibtex Style

@conference{sstm11,
author={Eric SanJuan},
title={MAPPING KNOWLEDGE DOMAINS - Combining Symbolic Relations with Graph Theory},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2011)},
year={2011},
pages={519-528},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003721105270536},
isbn={978-989-8425-79-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: SSTM, (IC3K 2011)
TI - MAPPING KNOWLEDGE DOMAINS - Combining Symbolic Relations with Graph Theory
SN - 978-989-8425-79-9
AU - SanJuan E.
PY - 2011
SP - 519
EP - 528
DO - 10.5220/0003721105270536