arXiv Open Access 2025

Discourse Graph Guided Document Translation with Large Language Models

Viet-Thanh Pham Minghan Wang Hao-Han Liao Thuy-Trang Vu
Lihat Sumber

Abstrak

Adapting large language models to full document translation remains challenging due to the difficulty of capturing long-range dependencies and preserving discourse coherence throughout extended texts. While recent agentic machine translation systems mitigate context window constraints through multi-agent orchestration and persistent memory, they require substantial computational resources and are sensitive to memory retrieval strategies. We introduce TransGraph, a discourse-guided framework that explicitly models inter-chunk relationships through structured discourse graphs and selectively conditions each translation segment on relevant graph neighbourhoods rather than relying on sequential or exhaustive context. Across three document-level MT benchmarks spanning six languages and diverse domains, TransGraph consistently surpasses strong baselines in translation quality and terminology consistency while incurring significantly lower token overhead.

Topik & Kata Kunci

Penulis (4)

V

Viet-Thanh Pham

M

Minghan Wang

H

Hao-Han Liao

T

Thuy-Trang Vu

Format Sitasi

Pham, V., Wang, M., Liao, H., Vu, T. (2025). Discourse Graph Guided Document Translation with Large Language Models. https://arxiv.org/abs/2511.07230

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓