arXiv Open Access 2024

Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation

Jiyoon Myung Jihyeon Park Jungki Son Kyungro Lee Joohyung Han
Lihat Sumber

Abstrak

This paper addresses the challenge of accurately translating technical terms, which are crucial for clear communication in specialized fields. We introduce the Parenthetical Terminology Translation (PTT) task, designed to mitigate potential inaccuracies by displaying the original term in parentheses alongside its translation. To implement this approach, we generated a representative PTT dataset using a collaborative approach with large language models and applied knowledge distillation to fine-tune traditional Neural Machine Translation (NMT) models and small-sized Large Language Models (sLMs). Additionally, we developed a novel evaluation metric to assess both overall translation accuracy and the correct parenthetical presentation of terms. Our findings indicate that sLMs did not consistently outperform NMT models, with fine-tuning proving more effective than few-shot prompting, particularly in models with continued pre-training in the target language. These insights contribute to the advancement of more reliable terminology translation methodologies.

Topik & Kata Kunci

Penulis (5)

J

Jiyoon Myung

J

Jihyeon Park

J

Jungki Son

K

Kyungro Lee

J

Joohyung Han

Format Sitasi

Myung, J., Park, J., Son, J., Lee, K., Han, J. (2024). Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation. https://arxiv.org/abs/2410.00683

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓