Twitter Topic Modeling for Breaking News Detection

Henning M. Wold, Linn Vikre, Jon Atle Gulla, Özlem Özgöbek, Xiaomeng Su

2016

Abstract

Social media platforms like Twitter have become increasingly popular for the dissemination and discussion of current events. Twitter makes it possible for people to share stories that they find interesting with their followers, and write updates on what is happening around them. In this paper we attempt to use topic models of tweets in real time to identify breaking news. Two different methods, Latent Dirichlet Allocation (LDA) and Hierarchical Dirichlet Process (HDP) are tested with each tweet in the training corpus as a document by itself, as well as with all the tweets of a unique user regarded as one document. This second approach emulates Author-Topic modeling (AT-modeling). The evaluation of methods relies on manual scoring of the accuracy of the modeling by volunteered participants. The experiments indicate topic modeling on tweets in real-time is not suitable for detecting breaking news by itself, but may be useful in analyzing and describing news tweets.

Download


Paper Citation


in Harvard Style

Wold H., Vikre L., Gulla J., Özgöbek Ö. and Su X. (2016). Twitter Topic Modeling for Breaking News Detection . In Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-758-186-1, pages 211-218. DOI: 10.5220/0005801902110218

in Bibtex Style

@conference{webist16,
author={Henning M. Wold and Linn Vikre and Jon Atle Gulla and Özlem Özgöbek and Xiaomeng Su},
title={Twitter Topic Modeling for Breaking News Detection},
booktitle={Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2016},
pages={211-218},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005801902110218},
isbn={978-989-758-186-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - Twitter Topic Modeling for Breaking News Detection
SN - 978-989-758-186-1
AU - Wold H.
AU - Vikre L.
AU - Gulla J.
AU - Özgöbek Ö.
AU - Su X.
PY - 2016
SP - 211
EP - 218
DO - 10.5220/0005801902110218