arXiv Open Access 2024

Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters

Yifan Liu, Gelila Tilahun, Xinxiang Gao, Qianfeng Wen, Michael Gervers

Abstract

The Norman Conquest of 1066 C.E. brought profound transformations to England's administrative, societal, and linguistic practices. The DEEDS (Documents of Early England Data Set) database offers a unique opportunity to explore these changes by examining shifts in word meanings within a vast collection of Medieval Latin charters. While computational linguistics typically relies on vector representations of words, such as static and contextual embeddings, to analyze semantic change, existing embeddings for Medieval Latin, a scarce historical language, are limited and may not be well suited to this task. This paper presents the first computational analysis of semantic change before and after the Norman Conquest and the first systematic comparison of static and contextual embeddings in a scarce historical data set. Our findings confirm that, consistent with existing studies, contextual embeddings outperform static word embeddings in capturing semantic change within a scarce historical corpus.

Citation Format

Liu, Y., Tilahun, G., Gao, X., Wen, Q., Gervers, M. (2024). Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters. https://arxiv.org/abs/2410.09283

Journal Information
Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access ✓