Semantic Scholar Open Access 2023 4 sitasi

Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages

Gabriela Pałka Artur Nowakowski

Abstrak

This paper describes Adam Mickiewicz University’s (AMU) solution for the 4th Shared Task on SlavNER. The task involves the identification, categorization, and lemmatization of named entities in Slavic languages. Our approach involved exploring the use of foundation models for these tasks. In particular, we used models based on the popular BERT and T5 model architectures. Additionally, we used external datasets to further improve the quality of our models. Our solution obtained promising results, achieving high metrics scores in both tasks. We describe our approach and the results of our experiments in detail, showing that the method is effective for NER and lemmatization in Slavic languages. Additionally, our models for lemmatization will be available at: https://huggingface.co/amu-cai.

Topik & Kata Kunci

Penulis (2)

G

Gabriela Pałka

A

Artur Nowakowski

Format Sitasi

Pałka, G., Nowakowski, A. (2023). Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages. https://doi.org/10.48550/arXiv.2304.05336

Akses Cepat

Lihat di Sumber doi.org/10.48550/arXiv.2304.05336
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
Sumber Database
Semantic Scholar
DOI
10.48550/arXiv.2304.05336
Akses
Open Access ✓