A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language

Franco Chiavetta, Giosuè Lo Bosco, Giovanni Pilato

2016

Abstract

We present a system for automatically classifying the sentiment orientation expressed in book reviews written in Italian. The system is based on a lexicon-based approach and uses NLP techniques to take into account the linguistic relations between terms in the analyzed texts. A review is classified according to the average sentiment strength of its sentences, while each sentence is classified through a parsing process that inspects, for each term, a window of preceding items to detect particular combinations of elements that invert or vary polarity. The score of a single word depends on all of its associated meanings, also considering semantically related concepts such as synonyms and hypernyms. The concepts associated with words are extracted from a suitable stratification of linguistic resources, which we adopt to address the lack of an opinion lexicon specifically tailored to the Italian language. The system has been prototyped in Python and tested on a dataset of reviews crawled from Amazon.it, the Italian Amazon website. Experiments show that the proposed system is able to automatically classify both positive and negative reviews, with an average accuracy above 82%.
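The scoring scheme outlined in the abstract (word scores adjusted by a window of preceding terms, averaged per sentence to label the review) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy lexicon, the negation and intensifier lists, the window size of 2, and the function names `sentence_score` and `classify_review` are all assumptions made for illustration, and the synonym and hypernym expansion described in the abstract is left out.

```python
import re

# Hypothetical toy lexicon (Italian word -> polarity); not the paper's resource.
LEXICON = {
    "bello": 1.0, "ottimo": 1.0, "interessante": 0.8,
    "noioso": -0.8, "brutto": -1.0, "pessimo": -1.0,
}
# Assumed polarity shifters: negations invert, intensifiers rescale.
NEGATIONS = {"non", "mai"}
INTENSIFIERS = {"molto": 1.5, "davvero": 1.3, "poco": 0.5}
WINDOW = 2  # assumed number of preceding tokens to inspect


def sentence_score(sentence):
    """Sum word polarities, adjusting each by shifters in the preceding window."""
    tokens = re.findall(r"\w+", sentence.lower())
    total = 0.0
    for i, tok in enumerate(tokens):
        if tok not in LEXICON:
            continue
        score = LEXICON[tok]
        for prev in tokens[max(0, i - WINDOW):i]:
            if prev in NEGATIONS:
                score = -score                 # polarity inversion
            elif prev in INTENSIFIERS:
                score *= INTENSIFIERS[prev]    # polarity variation
        total += score
    return total


def classify_review(text):
    """Label a review by the average score of its sentences."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not sentences:
        return "neutral"
    avg = sum(sentence_score(s) for s in sentences) / len(sentences)
    if avg > 0:
        return "positive"
    if avg < 0:
        return "negative"
    return "neutral"


print(classify_review("Un libro molto bello. Il finale non è noioso."))
print(classify_review("Un libro pessimo. Non è interessante."))
```

A fixed-size window keeps the sketch simple; a fuller system would presumably depend on parsing to decide which preceding terms actually modify a given word.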


Paper Citation


in Harvard Style

Chiavetta F., Lo Bosco G. and Pilato G. (2016). A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language. In Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-758-186-1, pages 159-170. DOI: 10.5220/0005915301590170

in Bibtex Style

@conference{webist16,
author={Franco Chiavetta and Giosuè Lo Bosco and Giovanni Pilato},
title={A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language},
booktitle={Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST},
year={2016},
pages={159-170},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005915301590170},
isbn={978-989-758-186-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Web Information Systems and Technologies - Volume 2: WEBIST
TI - A Lexicon-based Approach for Sentiment Classification of Amazon Books Reviews in Italian Language
SN - 978-989-758-186-1
AU - Chiavetta F.
AU - Lo Bosco G.
AU - Pilato G.
PY - 2016
SP - 159
EP - 170
DO - 10.5220/0005915301590170