Latent Ambiguity in Latent Semantic Analysis?

Martin Emms, Alfredo Maldonado-Guerra

2013

Abstract

Latent Semantic Analysis (LSA) uses SVD-based dimensionality reduction to reduce the high dimensionality of vector representations of documents, where the dimensions of the vectors correspond simply to word counts in the documents. We show that there are two contending, inequivalent formulations of LSA. The distinction between the two is not generally noted: some work adheres to one formulation, while other work adheres to the other. We show, on both a tiny contrived data set and a more substantial word-sense discovery data set, that the empirical outcomes achieved with LSA vary according to which formulation is chosen.
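
To make the kind of divergence described above concrete, here is a minimal sketch in Python with NumPy. The abstract does not say what the two formulations are, so the two variants below are an assumption chosen for illustration: one represents each document by its projection onto the top-k left singular vectors, and the other additionally rescales that projection by the inverse singular values. The toy count matrix, the choice k = 2, and all variable names are hypothetical and are not taken from the paper.

```python
import numpy as np

# Toy term-by-document count matrix (rows: words, columns: documents).
# The counts are invented purely for illustration.
A = np.array([
    [2, 1, 0, 0],
    [1, 2, 0, 1],
    [0, 0, 3, 1],
    [0, 1, 1, 2],
    [1, 0, 0, 1],
], dtype=float)

k = 2  # number of latent dimensions kept (arbitrary choice)

# Thin SVD: A = U @ diag(s) @ Vt, singular values in descending order.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
U_k, s_k = U[:, :k], s[:k]

# Assumed variant 1: project each document onto the top-k left singular
# vectors. Row i is the reduced vector for document i (equals V_k * s_k).
docs_projected = (U_k.T @ A).T

# Assumed variant 2: the same projection, rescaled by the inverse
# singular values (equals V_k).
docs_rescaled = docs_projected / s_k


def cosine_matrix(X):
    """Pairwise cosine similarities between the rows of X."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    return Xn @ Xn.T


cos_projected = cosine_matrix(docs_projected)
cos_rescaled = cosine_matrix(docs_rescaled)

# Largest absolute difference between the two similarity matrices.
max_gap = np.abs(cos_projected - cos_rescaled).max()
print(f"largest cosine difference between variants: {max_gap:.3f}")
```

Under these assumptions the two variants differ only by a per-dimension rescaling, yet that is enough to change document-to-document cosine similarities, and hence anything downstream, such as clustering, that depends on them.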

Paper Citation


in Harvard Style

Emms M. and Maldonado-Guerra A. (2013). Latent Ambiguity in Latent Semantic Analysis? In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-8565-41-9, pages 115-120. DOI: 10.5220/0004178301150120

in Bibtex Style

@conference{icpram13,
author={Martin Emms and Alfredo Maldonado-Guerra},
title={Latent Ambiguity in Latent Semantic Analysis?},
booktitle={Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2013},
pages={115-120},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004178301150120},
isbn={978-989-8565-41-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Latent Ambiguity in Latent Semantic Analysis?
SN - 978-989-8565-41-9
AU - Emms M.
AU - Maldonado-Guerra A.
PY - 2013
SP - 115
EP - 120
DO - 10.5220/0004178301150120