DOES CAPITALIZATION MATTER IN WEB SEARCH?

Silviu Cucerzan

2010

Abstract

We investigate the capitalization features of queries submitted to Web search engines and the relation between capitalization information, whether received from users or hypothesized from Web statistics, and search relevance. We observe that users lowercase words in their queries significantly more often than Web data would predict. More importantly, we determine that document relevance is strongly correlated with the agreement in capitalization between the instances of query tokens in the target document and the tokens of the truecased form of the query, as obtained from Web n-gram data.
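
The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's method: it assumes a tiny, made-up table of surface-form counts (TOY_COUNTS) in place of real Web n-gram data, truecases each token independently (unigram only), and tokenizes on whitespace. The function names truecase and capitalization_match are hypothetical, chosen for illustration.

```python
from collections import Counter

# Hypothetical surface-form counts standing in for Web n-gram statistics.
# The numbers are invented for illustration only.
TOY_COUNTS = {
    "new": Counter({"new": 60, "New": 40}),
    "york": Counter({"York": 95, "york": 5}),
    "times": Counter({"times": 70, "Times": 30}),
}


def truecase(query, counts=TOY_COUNTS):
    """Return the most frequent surface form of each query token.

    Simplification: each token is treated independently (unigram),
    and unknown tokens are left as typed.
    """
    result = []
    for token in query.split():
        forms = counts.get(token.lower())
        result.append(forms.most_common(1)[0][0] if forms else token)
    return result


def capitalization_match(truecased_tokens, document):
    """Fraction of query-token occurrences in the document whose
    capitalization agrees with the truecased query form."""
    doc_tokens = document.split()
    matched = 0
    total = 0
    for target in truecased_tokens:
        for tok in doc_tokens:
            if tok.lower() == target.lower():
                total += 1
                if tok == target:
                    matched += 1
    return matched / total if total else 0.0
```

For example, truecase("new york") picks "new" and "York" under the toy counts, and capitalization_match can then score a document by how often its occurrences of those tokens carry the same capitalization. A score of this kind could serve as one feature when studying relevance, under the assumptions stated above.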



Paper Citation


in Harvard Style

Cucerzan S. (2010). DOES CAPITALIZATION MATTER IN WEB SEARCH? In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 302-306. DOI: 10.5220/0003102503020306

in Bibtex Style

@conference{kdir10,
author={Silviu Cucerzan},
title={DOES CAPITALIZATION MATTER IN WEB SEARCH?},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={302-306},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003102503020306},
isbn={978-989-8425-28-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - DOES CAPITALIZATION MATTER IN WEB SEARCH?
SN - 978-989-8425-28-7
AU - Cucerzan S.
PY - 2010
SP - 302
EP - 306
DO - 10.5220/0003102503020306