arXiv Open Access 2024

Language Resources in Spanish for Automatic Text Simplification across Domains

Antonio Moreno-Sandoval Leonardo Campillos-Llanos Ana García-Serrano
Lihat Sumber

Abstrak

This work describes the language resources and models developed for automatic simplification of Spanish texts in three domains: Finance, Medicine and History studies. We created several corpora in each domain, annotation and simplification guidelines, a lexicon of technical and simplified medical terms, datasets used in shared tasks for the financial domain, and two simplification tools. The methodology, resources and companion publications are shared publicly on the web-site: https://clara-nlp.uned.es/.

Topik & Kata Kunci

Penulis (3)

A

Antonio Moreno-Sandoval

L

Leonardo Campillos-Llanos

A

Ana García-Serrano

Format Sitasi

Moreno-Sandoval, A., Campillos-Llanos, L., García-Serrano, A. (2024). Language Resources in Spanish for Automatic Text Simplification across Domains. https://arxiv.org/abs/2409.20466

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓