arXiv Open Access 2026

Coupling Local Context and Global Semantic Prototypes via a Hierarchical Architecture for Rhetorical Roles Labeling

Anas Belfathi Nicolas Hernandez Laura Monceaux Warren Bonnard Mary Catherine Lavissiere +2 lainnya
Lihat Sumber

Abstrak

Rhetorical Role Labeling (RRL) identifies the functional role of each sentence in a document, a key task for discourse understanding in domains such as law and medicine. While hierarchical models capture local dependencies effectively, they are limited in modeling global, corpus-level features. To address this limitation, we propose two prototype-based methods that integrate local context with global representations. Prototype-Based Regularization (PBR) learns soft prototypes through a distance-based auxiliary loss to structure the latent space, while Prototype-Conditioned Modulation (PCM) constructs corpus-level prototypes and injects them during training and inference. Given the scarcity of RRL resources, we introduce SCOTUS-Law, the first dataset of U.S. Supreme Court opinions annotated with rhetorical roles at three levels of granularity: category, rhetorical function, and step. Experiments on legal, medical, and scientific benchmarks show consistent improvements over strong baselines, with 4 Macro-F1 gains on low-frequency roles. We further analyze the implications in the era of Large Language Models and complement our findings with expert evaluation.

Topik & Kata Kunci

Penulis (7)

A

Anas Belfathi

N

Nicolas Hernandez

L

Laura Monceaux

W

Warren Bonnard

M

Mary Catherine Lavissiere

C

Christine Jacquin

R

Richard Dufour

Format Sitasi

Belfathi, A., Hernandez, N., Monceaux, L., Bonnard, W., Lavissiere, M.C., Jacquin, C. et al. (2026). Coupling Local Context and Global Semantic Prototypes via a Hierarchical Architecture for Rhetorical Roles Labeling. https://arxiv.org/abs/2603.03856

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓