arXiv Open Access 2026

SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia

Panuthep Tasawong Jian Gang Ngui Alham Fikri Aji Trevor Cohn Peerat Limkonchotiwat

Lihat Sumber

Abstrak

Culturally aware safeguards are crucial for AI alignment in real-world settings, where safety extends beyond common sense and encompasses diverse local values, norms, and region-specific regulations. However, building large-scale, culturally grounded datasets is challenging due to limited resources and a scarcity of native annotators. Consequently, many safeguard models rely on machine translation of English datasets, often missing regional and cultural nuances. We present a novel agentic data-generation framework to scalably create authentic, region-specific safety datasets for Southeast Asia (SEA). On this foundation, we introduce the SEA-Guard family, the first multilingual safeguard models grounded in SEA cultural contexts. Evaluated across multiple benchmarks and cultural variants, SEA-Guard consistently outperforms existing safeguards at detecting regionally sensitive or harmful content while maintaining strong general safety performance.

Topik & Kata Kunci

cs.CL

Penulis (5)

Panuthep Tasawong

Jian Gang Ngui

Alham Fikri Aji

Trevor Cohn

Peerat Limkonchotiwat

Format Sitasi

APA MLA BibTeX

Tasawong, P., Ngui, J.G., Aji, A.F., Cohn, T., Limkonchotiwat, P. (2026). SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia. https://arxiv.org/abs/2602.01618

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2026
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓