Semantic Scholar Open Access 2022 117 sitasi

Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian

Rosario Catelli Serena Pelosi Massimo Esposito

Abstrak

Recent evolutions in the e-commerce market have led to an increasing importance attributed by consumers to product reviews made by third parties before proceeding to purchase. The industry, in order to improve the offer intercepting the discontent of consumers, has placed increasing attention towards systems able to identify the sentiment expressed by buyers, whether positive or negative. From a technological point of view, the literature in recent years has seen the development of two types of methodologies: those based on lexicons and those based on machine and deep learning techniques. This study proposes a comparison between these technologies in the Italian market, one of the largest in the world, exploiting an ad hoc dataset: scientific evidence generally shows the superiority of language models such as BERT built on deep neural networks, but it opens several considerations on the effectiveness and improvement of these solutions when compared to those based on lexicons in the presence of datasets of reduced size such as the one under study, a common condition for languages other than English or Chinese.

Penulis (3)

R

Rosario Catelli

S

Serena Pelosi

M

Massimo Esposito

Format Sitasi

Catelli, R., Pelosi, S., Esposito, M. (2022). Lexicon-Based vs. Bert-Based Sentiment Analysis: A Comparative Study in Italian. https://doi.org/10.3390/electronics11030374

Akses Cepat

Lihat di Sumber doi.org/10.3390/electronics11030374
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Total Sitasi
117×
Sumber Database
Semantic Scholar
DOI
10.3390/electronics11030374
Akses
Open Access ✓