AN EFFECTIVE CLUSTERING APPROACH TO WEB QUERY LOG ANONYMIZATION

Amin Milani Fard, Ke Wang

2010

Abstract

Web query log data contain information useful to research; however, release of such data can re-identify the search engine users issuing the queries. These privacy concerns go far beyond removing explicitly identifying information such as name and address, since non-identifying personal data can be combined with publicly available information to pinpoint to an individual. In this work we model web query logs as unstructured transaction data and present a novel transaction anonymization technique based on clustering and generalization techniques to achieve the k-anonymity privacy. We conduct extensive experiments on the AOL query log data. Our results show that this method results in a higher data utility compared to the state-of-the-art transaction anonymization methods.

Download


Paper Citation


in Harvard Style

Milani Fard A. and Wang K. (2010). AN EFFECTIVE CLUSTERING APPROACH TO WEB QUERY LOG ANONYMIZATION . In Proceedings of the International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2010) ISBN 978-989-8425-18-8, pages 109-119. DOI: 10.5220/0002924901090119

in Bibtex Style

@conference{secrypt10,
author={Amin Milani Fard and Ke Wang},
title={AN EFFECTIVE CLUSTERING APPROACH TO WEB QUERY LOG ANONYMIZATION},
booktitle={Proceedings of the International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2010)},
year={2010},
pages={109-119},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002924901090119},
isbn={978-989-8425-18-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Security and Cryptography - Volume 1: SECRYPT, (ICETE 2010)
TI - AN EFFECTIVE CLUSTERING APPROACH TO WEB QUERY LOG ANONYMIZATION
SN - 978-989-8425-18-8
AU - Milani Fard A.
AU - Wang K.
PY - 2010
SP - 109
EP - 119
DO - 10.5220/0002924901090119