arXiv Open Access 2021

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

Dominik Schlechtweg Nina Tahmasebi Simon Hengchen Haim Dubossarsky Barbara McGillivray
Lihat Sumber

Abstrak

Word meaning is notoriously difficult to capture, both synchronically and diachronically. In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four different languages, based on 100,000 human semantic proximity judgments. We thoroughly describe the multi-round incremental annotation process, the choice for a clustering algorithm to group usages into senses, and possible - diachronic and synchronic - uses for this dataset.

Topik & Kata Kunci

Penulis (5)

D

Dominik Schlechtweg

N

Nina Tahmasebi

S

Simon Hengchen

H

Haim Dubossarsky

B

Barbara McGillivray

Format Sitasi

Schlechtweg, D., Tahmasebi, N., Hengchen, S., Dubossarsky, H., McGillivray, B. (2021). DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages. https://arxiv.org/abs/2104.08540

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2021
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓