SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia

Panuthep Tasawong; Jian Gang Ngui; Alham Fikri Aji; Trevor Cohn; Peerat Limkonchotiwat

SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia

Panuthep Tasawong, Jian Gang Ngui, Alham Fikri Aji, Trevor Cohn, Peerat Limkonchotiwat

Abstract

Culturally aware safeguards are crucial for AI alignment in real-world settings, where safety extends beyond common sense and encompasses diverse local values, norms, and region-specific regulations. However, building large-scale, culturally grounded datasets is challenging due to limited resources and a scarcity of native annotators. Consequently, many safeguard models rely on machine translation of English datasets, often missing regional and cultural nuances. We present a novel agentic data-generation framework to scalably create authentic, region-specific safety datasets for Southeast Asia (SEA). On this foundation, we introduce the SEA-Guard family, the first multilingual safeguard models grounded in SEA cultural contexts. Evaluated across multiple benchmarks and cultural variants, SEA-Guard consistently outperforms existing safeguards at detecting regionally sensitive or harmful content while maintaining strong general safety performance.

Anthology ID:: 2026.findings-acl.141
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2917–2941
Language:
URL:: https://aclanthology.org/2026.findings-acl.141/
DOI:
Bibkey:
Cite (ACL):: Panuthep Tasawong, Jian Gang Ngui, Alham Fikri Aji, Trevor Cohn, and Peerat Limkonchotiwat. 2026. SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia. In Findings of the Association for Computational Linguistics: ACL 2026, pages 2917–2941, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: SEA-Guard: Culturally Grounded Multilingual Safeguard for Southeast Asia (Tasawong et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.141.pdf
Checklist:: 2026.findings-acl.141.checklist.pdf

PDF Cite Search Checklist Fix data