arXiv Open Access 2022

Language Resources and Technologies for Non-Scheduled and Endangered Indian Languages

Ritesh Kumar Bornini Lahiri
Lihat Sumber

Abstrak

In the present paper, we will present a survey of the language resources and technologies available for the non-scheduled and endangered languages of India. While there have been different estimates from different sources about the number of languages in India, it could be assumed that there are more than 1,000 languages currently being spoken in India. However barring some of the 22 languages included in the 8th Schedule of the Indian Constitution (called the scheduled languages), there is hardly any substantial resource or technology available for the rest of the languages. Nonetheless there have been some individual attempts at developing resources and technologies for the different languages across the country. Of late, some financial support has also become available for the endangered languages. In this paper, we give a summary of the resources and technologies for those Indian languages which are not included in the 8th schedule of the Indian Constitution and/or which are endangered.

Topik & Kata Kunci

Penulis (2)

R

Ritesh Kumar

B

Bornini Lahiri

Format Sitasi

Kumar, R., Lahiri, B. (2022). Language Resources and Technologies for Non-Scheduled and Endangered Indian Languages. https://arxiv.org/abs/2204.02822

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓