A SEARCH ENGINE FOR WEB IMAGES USING DOCUMENT TEXT STEMMING

Ryan Hardt, Ethan V. Munson, Hien Nguyen

2008

Abstract

A Web image search application was built using a previously-developed image relevance model for retrieval of images via text-based image retrieval. The application includes a text stemmer that converts a word to a canonical form, making it possible to match text in the face of changes in tense or plurality that have little effect on semantics. The usefulness of stemming in Web image retrieval was evaluated via a test on ten queries that were submitted both with and without stemming. Relevance of retrieved images was determined via ratings by three trained individuals. With stemming, the average unique relevance recall (a measure of the proportion of relevant images returned by one algorithm and not another) was 27.7%, while without stemming, it was only 0.5%. These results may more accurately apply to queries containing at least one plural noun, present tense verb, present participle verb, or past tense verb.

Download


Paper Citation


in Harvard Style

Hardt R., V. Munson E. and Nguyen H. (2008). A SEARCH ENGINE FOR WEB IMAGES USING DOCUMENT TEXT STEMMING . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 223-230. DOI: 10.5220/0001526502230230

in Bibtex Style

@conference{webist08,
author={Ryan Hardt and Ethan V. Munson and Hien Nguyen},
title={A SEARCH ENGINE FOR WEB IMAGES USING DOCUMENT TEXT STEMMING},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={223-230},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001526502230230},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - A SEARCH ENGINE FOR WEB IMAGES USING DOCUMENT TEXT STEMMING
SN - 978-989-8111-27-2
AU - Hardt R.
AU - V. Munson E.
AU - Nguyen H.
PY - 2008
SP - 223
EP - 230
DO - 10.5220/0001526502230230