Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion
Masnizah Mohd, Jaffar Atwan, Kiyoaki Shirai
2015
Abstract
The adaptation of a Query Expansion (QE) approach for Arabic documents may produce the worst rankings or irrelevant results. Therefore, we have introduced a technique, which is to utilise the Arabic WordNet in the corpus and query expansion level. A Point-wise Mutual Information (PMI) corpus-based measure is used to semantically select synonyms from the WordNet. In addition, Automatic Query Expansion (AQE) and Pseudo Relevance Feedback (PRF) methods were also explored to improve the performance of the Arabic information retrieval (AIR) system. The experimental results of our proposed techniques for AIR shows that the use of Arabic WordNet in the corpus and query level together with AQE, and the adaptation of PMI in the expansion process have successfully reduced the level of ambiguity as these techniques select the most appropriate synonym. It enhanced knowledge discovery by taking care of the relevancy aspect. The techniques also demonstrated an improvement in Mean Average Precision by 49%, with an increase of 7.3% in recall in comparison to the baseline.
DownloadPaper Citation
in Harvard Style
Mohd M., Atwan J. and Shirai K. (2015). Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015) ISBN 978-989-758-158-8, pages 445-450. DOI: 10.5220/0005626904450450
in Bibtex Style
@conference{kdir15,
author={Masnizah Mohd and Jaffar Atwan and Kiyoaki Shirai},
title={Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)},
year={2015},
pages={445-450},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005626904450450},
isbn={978-989-758-158-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2015)
TI - Pseudo Relevance Feedback Technique and Semantic Similarity for Corpus-based Expansion
SN - 978-989-758-158-8
AU - Mohd M.
AU - Atwan J.
AU - Shirai K.
PY - 2015
SP - 445
EP - 450
DO - 10.5220/0005626904450450