arXiv Open Access 2024

Instruction-tuned Large Language Models for Machine Translation in the Medical Domain

Miguel Rios

Lihat Sumber

Abstrak

Large Language Models (LLMs) have shown promising results on machine translation for high resource language pairs and domains. However, in specialised domains (e.g. medical) LLMs have shown lower performance compared to standard neural machine translation models. The consistency in the machine translation of terminology is crucial for users, researchers, and translators in specialised domains. In this study, we compare the performance between baseline LLMs and instruction-tuned LLMs in the medical domain. In addition, we introduce terminology from specialised medical dictionaries into the instruction formatted datasets for fine-tuning LLMs. The instruction-tuned LLMs significantly outperform the baseline models with automatic metrics.

Topik & Kata Kunci

cs.CL

Penulis (1)

Miguel Rios

Format Sitasi

APA MLA BibTeX

Rios, M. (2024). Instruction-tuned Large Language Models for Machine Translation in the Medical Domain. https://arxiv.org/abs/2408.16440

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓