arXiv Open Access 2024

The Russian Legislative Corpus

Denis Saveliev Ruslan Kuchakov
Lihat Sumber

Abstrak

We present the comprehensive Russian primary and secondary legislation corpus covering 1991 to 2023. The corpus collects all 281,413 texts (176,523,268 tokens) of non-secret federal regulations and acts, along with their metadata. The corpus has two versions the original text with minimal preprocessing and a version prepared for linguistic analysis with morphosyntactic markup.

Topik & Kata Kunci

Penulis (2)

D

Denis Saveliev

R

Ruslan Kuchakov

Format Sitasi

Saveliev, D., Kuchakov, R. (2024). The Russian Legislative Corpus. https://arxiv.org/abs/2406.04855

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓