SPEEDING UP LATENT SEMANTIC ANALYSIS - A Streamed Distributed Algorithm for SVD Updates

Radim Řehůřek

2011

Abstract

Since its inception 20 years ago, Latent Semantic Analysis (LSA) has become a standard tool for robust, unsupervised inference of semantic structure from text corpora. At the core of LSA is the Singular Value Decomposition algorithm (SVD), a linear algebra routine for matrix factorization. This paper introduces a streamed distributed algorithm for incremental updates, which allows the factorization to be computed rapidly in a single pass over the input matrix on a cluster of autonomous computers.

Download


Paper Citation


in Harvard Style

Řehůřek R. (2011). SPEEDING UP LATENT SEMANTIC ANALYSIS - A Streamed Distributed Algorithm for SVD Updates . In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-40-9, pages 446-451. DOI: 10.5220/0003191304460451

in Bibtex Style

@conference{icaart11,
author={Radim Řehůřek},
title={SPEEDING UP LATENT SEMANTIC ANALYSIS - A Streamed Distributed Algorithm for SVD Updates},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2011},
pages={446-451},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003191304460451},
isbn={978-989-8425-40-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - SPEEDING UP LATENT SEMANTIC ANALYSIS - A Streamed Distributed Algorithm for SVD Updates
SN - 978-989-8425-40-9
AU - Řehůřek R.
PY - 2011
SP - 446
EP - 451
DO - 10.5220/0003191304460451