arXiv Open Access 2024

Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models

Eren Dogan, M. Egemen Uzun, Atahan Uz, H. Emre Seyrek, Ahmed Zeer, +4 others

Abstract

The progress that language models have made across almost all kinds of tasks has attracted the attention of not only researchers but also society at large, and has enabled these models to become products. Commercially successful language models are available. However, users may prefer open-source language models because of cost, data privacy, or regulations. Yet, despite the growing number of such models, there is no comprehensive comparison of their performance for Turkish. This study aims to fill this gap in the literature. Seven selected language models are compared on their in-context learning and question-answering abilities. Turkish datasets for in-context learning and question-answering were prepared, and both automatic and human evaluations were conducted. The results show that, for question-answering, continuing pretraining before fine-tuning with instruction datasets is more successful in adapting multilingual models to Turkish, and that in-context learning performance is not strongly related to question-answering performance.

Topics & Keywords

cs.CL cs.AI

Authors (9)

Eren Dogan

M. Egemen Uzun

Atahan Uz

H. Emre Seyrek

Ahmed Zeer

Ezgi Sevi

H. Toprak Kesgin

M. Kaan Yuce

M. Fatih Amasyali

Citation Format

Dogan, E., Uzun, M.E., Uz, A., Seyrek, H.E., Zeer, A., Sevi, E. et al. (2024). Türkçe Dil Modellerinin Performans Karşılaştırması Performance Comparison of Turkish Language Models. https://arxiv.org/abs/2404.17010

Journal Information

Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access