Real-Time Data Harvesting Method for Czech Twitter

Pavel Král, Václav Rajtmajer

2017

Abstract

This paper deals with the automatic analysis of Czech social media. The main goal is to propose an approach for harvesting interesting Czech-language messages from Twitter at a high download speed. The method uses user lists to discover potentially interesting tweets to download. It is motivated by two observations: only about 20% of Twitter users post informative messages, whereas the remaining 80% do not, and the "important" users can be identified through user lists. The experimental results show that the proposed method is very efficient, harvesting about 6 times more data than the other approaches. The approach is intended to be integrated into an experimental system for the Czech News Agency that monitors the current data flow on Twitter, downloads messages in real time, analyzes them, and extracts relevant events.

Paper Citation


in Harvard Style

Král P. and Rajtmajer V. (2017). Real-Time Data Harvesting Method for Czech Twitter. In Proceedings of the 9th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-220-2, pages 259-265. DOI: 10.5220/0006212402590265

in Bibtex Style

@conference{icaart17,
author={Pavel Král and Václav Rajtmajer},
title={Real-Time Data Harvesting Method for Czech Twitter},
booktitle={Proceedings of the 9th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART},
year={2017},
pages={259-265},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006212402590265},
isbn={978-989-758-220-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART
TI - Real-Time Data Harvesting Method for Czech Twitter
SN - 978-989-758-220-2
AU - Král P.
AU - Rajtmajer V.
PY - 2017
SP - 259
EP - 265
DO - 10.5220/0006212402590265