DOAJ Open Access 2020

Creating an Annotated Corpus for Aspect-Based Sentiment Analysis in Swedish

Jacobo Rouces Lars Borin Nina Tahmasebi

Abstrak

Aspect-Based Sentiment Analysis constitutes a more fine-grained alternative to traditional sentiment analysis at sentence level. In addition to a sentiment value denoting how positive or negative a particular opinion or sentiment expression is, it identifies additional aspects or ‘slots’ that characterize the opinion. Some typical aspects are target and source, i.e. who holds the opinion and about which entity or aspect is the opinion. We present a large Swedish corpus annotated for Aspect-Based Sentiment Analysis. Each sentiment expression is annotated as a tuple that contains the following fields: one among 5 possible sentiment values, the target, the source, and whether the sentiment expressed is ironic. In addition, the linguistic element that conveys the sentiment is identified too. Sentiment for a particular topic is also annotated at title, paragraph and document level. The documents are articles obtained from two Swedish media (Svenska Dagbladet and Aftonbladet) and one online forum (Flashback), totalling around 4000 documents. The corpus is freely available and we plan to use it for training and testing an Aspect-Based Sentiment Analysis system.

Penulis (3)

J

Jacobo Rouces

L

Lars Borin

N

Nina Tahmasebi

Format Sitasi

Rouces, J., Borin, L., Tahmasebi, N. (2020). Creating an Annotated Corpus for Aspect-Based Sentiment Analysis in Swedish. https://doi.org/10.5617/dhnbpub.11203

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.5617/dhnbpub.11203
Informasi Jurnal
Tahun Terbit
2020
Sumber Database
DOAJ
DOI
10.5617/dhnbpub.11203
Akses
Open Access ✓