Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings

Debora Nozza, Elisabetta Fersini, Enza Messina

2016

Abstract

The automatic detection of figurative language, such as irony and sarcasm, is one of the most challenging tasks in Natural Language Processing (NLP). Machine learning methods are easily misled by words that carry a strong polarity but are used ironically, so that the opposite polarity is intended. In this paper, we propose an unsupervised framework for domain-independent irony detection. In particular, to derive an unsupervised Topic-Irony Model (TIM), we built upon an existing probabilistic topic model initially introduced for sentiment analysis. Moreover, to improve its generalization abilities, we took advantage of Word Embeddings to obtain a domain-aware ironic orientation of words. This is the first work that addresses this task in an unsupervised setting and the first study on the topic-irony distribution. Experimental results show that TIM is comparable to, and sometimes better than, supervised state-of-the-art approaches for irony detection. Moreover, integrating the probabilistic model with word embeddings (TIM+WE) yields promising results in a more complex, real-world scenario.


Paper Citation


in Harvard Style

Nozza D., Fersini E. and Messina E. (2016). Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings. In Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016) ISBN 978-989-758-203-5, pages 68-76. DOI: 10.5220/0006052000680076

in Bibtex Style

@conference{kdir16,
author={Debora Nozza and Elisabetta Fersini and Enza Messina},
title={Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings},
booktitle={Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)},
year={2016},
pages={68-76},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006052000680076},
isbn={978-989-758-203-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, (IC3K 2016)
TI - Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings
SN - 978-989-758-203-5
AU - Nozza D.
AU - Fersini E.
AU - Messina E.
PY - 2016
SP - 68
EP - 76
DO - 10.5220/0006052000680076