Semantic Scholar Open Access 2013 1638 sitasi

Exploiting Similarities among Languages for Machine Translation

Tomas Mikolov Quoc V. Le I. Sutskever

Abstrak

Dictionaries and phrase tables are the basis of modern statistical machine translation systems. This paper develops a method that can automate the process of generating and extending dictionaries and phrase tables. Our method can translate missing word and phrase entries by learning language structures based on large monolingual data and mapping between languages from small bilingual data. It uses distributed representation of words and learns a linear mapping between vector spaces of languages. Despite its simplicity, our method is surprisingly effective: we can achieve almost 90% precision@5 for translation of words between English and Spanish. This method makes little assumption about the languages, so it can be used to extend and refine dictionaries and translation tables for any language pairs.

Topik & Kata Kunci

Penulis (3)

T

Tomas Mikolov

Q

Quoc V. Le

I

I. Sutskever

Format Sitasi

Mikolov, T., Le, Q.V., Sutskever, I. (2013). Exploiting Similarities among Languages for Machine Translation. https://www.semanticscholar.org/paper/0157dcd6122c20b5afc359a799b2043453471f7f

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2013
Bahasa
en
Total Sitasi
1638×
Sumber Database
Semantic Scholar
Akses
Open Access ✓