DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL

Shuguang Wang, Shyam Visweswaran, Milos Hauskrecht

2009

Abstract

We are interested in enhancing information retrieval methods by incorporating domain knowledge. In this paper, we present a new document retrieval framework that learns a probabilistic knowledge model and exploits this model to improve document retrieval. The knowledge model is represented by a network of associations among concepts that define key domain entities, and it is extracted from a corpus of documents or from a curated domain knowledge base. The model is then used to perform concept-related probabilistic inferences using link analysis methods, and these inferences are applied to the task of document retrieval. We evaluate the framework on two biomedical datasets and show that this knowledge-based approach outperforms the state-of-the-art Lemur/Indri document retrieval method.
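As a rough illustration of the idea described in the abstract, the sketch below builds a small concept association network from documents and propagates relevance from query concepts over it. It is not the authors' method: the abstract does not say which link analysis method is used or how associations are weighted, so the co-occurrence weighting, the use of personalized PageRank, and every name here (`build_concept_graph`, `personalized_pagerank`, `rank_documents`, the toy concepts) are assumptions made for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def build_concept_graph(docs):
    """Build a weighted, undirected concept graph.

    Assumption: the association strength between two concepts is the number
    of documents in which they co-occur.
    """
    graph = defaultdict(lambda: defaultdict(float))
    for concepts in docs:
        for a, b in combinations(sorted(concepts), 2):
            graph[a][b] += 1.0
            graph[b][a] += 1.0
    return graph


def personalized_pagerank(graph, seeds, damping=0.85, iterations=50):
    """Spread relevance from the seed (query) concepts over the graph.

    Personalized PageRank is used here as a stand-in link analysis method.
    """
    seeds = set(seeds)
    if not seeds:
        raise ValueError("at least one seed concept is required")
    nodes = set(graph) | seeds
    restart = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    scores = dict(restart)
    for _ in range(iterations):
        nxt = {n: (1.0 - damping) * restart[n] for n in nodes}
        for n in nodes:
            total = sum(graph[n].values()) if n in graph else 0.0
            if total == 0.0:
                # Isolated concept: send its mass back to the seeds.
                for s in seeds:
                    nxt[s] += damping * scores[n] / len(seeds)
                continue
            for m, w in graph[n].items():
                nxt[m] += damping * scores[n] * w / total
        scores = nxt
    return scores


def rank_documents(docs, scores):
    """Order document indices by the summed relevance of their concepts."""
    def doc_score(i):
        return sum(scores.get(c, 0.0) for c in docs[i])
    return sorted(range(len(docs)), key=doc_score, reverse=True)


# Toy corpus: each document is reduced to the set of concepts it mentions.
docs = [
    {"aspirin", "headache"},
    {"aspirin", "stroke"},
    {"insulin", "diabetes"},
]
graph = build_concept_graph(docs)
scores = personalized_pagerank(graph, seeds={"headache"})
ranking = rank_documents(docs, scores)
```

In this toy run, a query on "headache" also gives weight to "stroke" through the shared concept "aspirin", so the second document can be ranked above the unrelated third one even though it never mentions the query concept.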


Paper Citation


in Harvard Style

Wang S., Visweswaran S. and Hauskrecht M. (2009). DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 26-33. DOI: 10.5220/0002293400260033

in Bibtex Style

@conference{kdir09,
author={Shuguang Wang and Shyam Visweswaran and Milos Hauskrecht},
title={DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={26-33},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002293400260033},
isbn={978-989-674-011-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - DOCUMENT RETRIEVAL USING A PROBABILISTIC KNOWLEDGE MODEL
SN - 978-989-674-011-5
AU - Wang S.
AU - Visweswaran S.
AU - Hauskrecht M.
PY - 2009
SP - 26
EP - 33
DO - 10.5220/0002293400260033