INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES

Nikos Kirtsis, Sofia Stamou, Paraskevi Tzekou, Nikos Zotos

2010

Abstract

Wikipedia is one of the most successful worldwide collaborative efforts to put together user-generated content in a meaningfully organized and intuitive manner. Currently, Wikipedia hosts millions of articles on a variety of topics, supplied by thousands of contributors. A critical factor in Wikipedia’s success is its open nature, which enables everyone to edit, revise and/or question (via talk pages) the article contents. Considering the phenomenal growth of Wikipedia and the lack of a peer review process for its contents, it becomes evident that both editors and administrators have difficulty validating its quality on a systematic and coordinated basis. This difficulty has motivated several research works on how to assess the quality of Wikipedia articles. In this paper, we propose a novel indicator of the quality of Wikipedia articles, namely information uniqueness. To this end, we describe a method that captures the duplication of information across article contents in an attempt to infer the amount of distinct information every article communicates. Our approach relies on the intuition that an article offering unique information about its subject is of better quality than an article that discusses issues already addressed in several other Wikipedia articles.

Paper Citation


in Harvard Style

Kirtsis N., Stamou S., Tzekou P. and Zotos N. (2010). INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST, ISBN 978-989-674-025-2, pages 137-143. DOI: 10.5220/0002841401370143

in Bibtex Style

@conference{webist10,
author={Nikos Kirtsis and Sofia Stamou and Paraskevi Tzekou and Nikos Zotos},
title={INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST},
year={2010},
pages={137-143},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002841401370143},
isbn={978-989-674-025-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 2: WEBIST
TI - INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES
SN - 978-989-674-025-2
AU - Kirtsis N.
AU - Stamou S.
AU - Tzekou P.
AU - Zotos N.
PY - 2010
SP - 137
EP - 143
DO - 10.5220/0002841401370143