DOAJ Open Access 2019

Toward multi-lingual information retrieval system based on internet linguistic diversity measurement

Ebtsam Mohamed Samir Elmougy Mostafa Aref

Abstrak

We introduce a method for measuring the quantity of online content of a set of languages at domain level. This measurement is used for building a Multi-Lingual Information Retrieval (MLIR) system that identifies which languages are strongly represented on the internet about a specific query topic. The system architecture includes two modules; the off-line module builds a linguistic diversity index for languages at topic level and the on-line module, where the suitable language for search is identified based the index for retrieving the relevant documents to the user query in that language. The conducted experiments explore the usefulness of building such an index and its usage effect on both of monolingual and traditional MLIR system. From the obtained results, it has been proven that the more internet resources, the better the accuracy of the retrieved results, and therefore the better the system performance. Keywords: Multi-Lingual Information Retrieval (MLIR), Online content availability, Search language, Linguistic diversity index

Penulis (3)

E

Ebtsam Mohamed

S

Samir Elmougy

M

Mostafa Aref

Format Sitasi

Mohamed, E., Elmougy, S., Aref, M. (2019). Toward multi-lingual information retrieval system based on internet linguistic diversity measurement. https://doi.org/10.1016/j.asej.2018.11.009

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1016/j.asej.2018.11.009
Informasi Jurnal
Tahun Terbit
2019
Sumber Database
DOAJ
DOI
10.1016/j.asej.2018.11.009
Akses
Open Access ✓