arXiv Open Access 2024

Instruction-tuned Large Language Models for Machine Translation in the Medical Domain

Miguel Rios
Lihat Sumber

Abstrak

Large Language Models (LLMs) have shown promising results on machine translation for high resource language pairs and domains. However, in specialised domains (e.g. medical) LLMs have shown lower performance compared to standard neural machine translation models. The consistency in the machine translation of terminology is crucial for users, researchers, and translators in specialised domains. In this study, we compare the performance between baseline LLMs and instruction-tuned LLMs in the medical domain. In addition, we introduce terminology from specialised medical dictionaries into the instruction formatted datasets for fine-tuning LLMs. The instruction-tuned LLMs significantly outperform the baseline models with automatic metrics.

Topik & Kata Kunci

Penulis (1)

M

Miguel Rios

Format Sitasi

Rios, M. (2024). Instruction-tuned Large Language Models for Machine Translation in the Medical Domain. https://arxiv.org/abs/2408.16440

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓