Semantic Scholar Open Access 2020 320 sitasi

Nearest Neighbor Machine Translation

Urvashi Khandelwal Angela Fan Dan Jurafsky Luke Zettlemoyer M. Lewis

Abstrak

We introduce $k$-nearest-neighbor machine translation ($k$NN-MT), which predicts tokens with a nearest neighbor classifier over a large datastore of cached examples, using representations from a neural translation model for similarity search. This approach requires no additional training and scales to give the decoder direct access to billions of examples at test time, resulting in a highly expressive model that consistently improves performance across many settings. Simply adding nearest neighbor search improves a state-of-the-art German-English translation model by 1.5 BLEU. $k$NN-MT allows a single model to be adapted to diverse domains by using a domain-specific datastore, improving results by an average of 9.2 BLEU over zero-shot transfer, and achieving new state-of-the-art results---without training on these domains. A massively multilingual model can also be specialized for particular language pairs, with improvements of 3 BLEU for translating from English into German and Chinese. Qualitatively, $k$NN-MT is easily interpretable; it combines source and target context to retrieve highly relevant examples.

Topik & Kata Kunci

Penulis (5)

U

Urvashi Khandelwal

A

Angela Fan

D

Dan Jurafsky

L

Luke Zettlemoyer

M

M. Lewis

Format Sitasi

Khandelwal, U., Fan, A., Jurafsky, D., Zettlemoyer, L., Lewis, M. (2020). Nearest Neighbor Machine Translation. https://www.semanticscholar.org/paper/20d51f8e449b59c7e140f7a7eec9ab4d4d6f80ea

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2020
Bahasa
en
Total Sitasi
320×
Sumber Database
Semantic Scholar
Akses
Open Access ✓