arXiv Open Access 2025

From Formal Language Theory to Statistical Learning: Finite Observability of Subregular Languages

Katsuhiko Hayashi, Hidetaka Kamigaito

Abstract

We prove that all standard subregular language classes are linearly separable when represented by their deciding predicates. This establishes finite observability and guarantees learnability with simple linear models. Synthetic experiments confirm perfect separability under noise-free conditions, while real-data experiments on English morphology show that learned features align with well-known linguistic constraints. These results demonstrate that the subregular hierarchy provides a rigorous and interpretable foundation for modeling natural language structure. Our code used in real-data experiments is available at https://github.com/UTokyo-HayashiLab/subregular.
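
To make the separability claim concrete, here is a minimal sketch for one simple case, a strictly 2-local language. It is not taken from the authors' repository, and the paper's own construction is not shown in this record. The grammar (`FORBIDDEN`), the alphabet, the boundary symbols, and all function names are assumptions chosen for illustration. Each word is mapped to binary features, one per bigram, recording whether that bigram occurs; a fixed linear threshold over those features then decides membership.

```python
from itertools import product

# Hypothetical setup: alphabet {a, b}, word boundaries marked with < and >.
ALPHABET = "ab"
LEFT, RIGHT = "<", ">"

# Hypothetical strictly 2-local grammar: words with no two adjacent b's.
FORBIDDEN = {"bb"}

# Fixed feature order: every bigram over the alphabet plus boundary symbols.
BIGRAMS = ["".join(p) for p in product(LEFT + ALPHABET + RIGHT, repeat=2)]


def bigram_features(word):
    """Return a 0/1 vector: entry i is 1 iff BIGRAMS[i] occurs in the padded word."""
    padded = LEFT + word + RIGHT
    present = {padded[i:i + 2] for i in range(len(padded) - 1)}
    return [1 if b in present else 0 for b in BIGRAMS]


# Hand-set linear model: weight -1 on each forbidden bigram, 0 elsewhere.
WEIGHTS = [-1.0 if b in FORBIDDEN else 0.0 for b in BIGRAMS]
BIAS = 0.5


def linear_accepts(word):
    """Accept iff the linear score w . x + bias is positive."""
    x = bigram_features(word)
    score = sum(w * xi for w, xi in zip(WEIGHTS, x)) + BIAS
    return score > 0


def in_language(word):
    """Reference membership test for the hypothetical grammar."""
    return "bb" not in word


if __name__ == "__main__":
    for n in range(5):
        for letters in product(ALPHABET, repeat=n):
            w = "".join(letters)
            assert linear_accepts(w) == in_language(w)
    print("linear threshold agrees with the grammar on all words up to length 4")
```

Because the feature vector has a fixed, finite length regardless of word length, a single weight vector separates members from non-members in this toy case. A learned linear model, such as a perceptron, could in principle recover equivalent weights from labeled examples.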

Topics & Keywords

cs.CL cs.FL cs.LG

Authors (2)

Katsuhiko Hayashi

Hidetaka Kamigaito

Citation

Hayashi, K., & Kamigaito, H. (2025). From Formal Language Theory to Statistical Learning: Finite Observability of Subregular Languages. https://arxiv.org/abs/2509.22598

Journal Information

Publication Year: 2025
Language: en
Database Source: arXiv
Access: Open Access ✓