arXiv Open Access 2024

Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models

Eren Dogan M. Egemen Uzun Atahan Uz H. Emre Seyrek Ahmed Zeer +4 lainnya
Lihat Sumber

Abstrak

The developments that language models have provided in fulfilling almost all kinds of tasks have attracted the attention of not only researchers but also the society and have enabled them to become products. There are commercially successful language models available. However, users may prefer open-source language models due to cost, data privacy, or regulations. Yet, despite the increasing number of these models, there is no comprehensive comparison of their performance for Turkish. This study aims to fill this gap in the literature. A comparison is made among seven selected language models based on their contextual learning and question-answering abilities. Turkish datasets for contextual learning and question-answering were prepared, and both automatic and human evaluations were conducted. The results show that for question-answering, continuing pretraining before fine-tuning with instructional datasets is more successful in adapting multilingual models to Turkish and that in-context learning performances do not much related to question-answering performances.

Topik & Kata Kunci

Penulis (9)

E

Eren Dogan

M

M. Egemen Uzun

A

Atahan Uz

H

H. Emre Seyrek

A

Ahmed Zeer

E

Ezgi Sevi

H

H. Toprak Kesgin

M

M. Kaan Yuce

M

M. Fatih Amasyali

Format Sitasi

Dogan, E., Uzun, M.E., Uz, A., Seyrek, H.E., Zeer, A., Sevi, E. et al. (2024). Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models. https://arxiv.org/abs/2404.17010

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓