PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media

Zoher Kachwala; Bao Tran Truong; Rasika Muralidharan; Haewoon Kwak; Jisun An; Filippo Menczer

Abstract

Social media are shifting towards pluralism: community-governed platforms where groups define their own norms. What violates rules in one community may be perfectly acceptable in another. Can AI models help moderate such pluralistic communities? We formalize the task as a multiple-choice problem, mirroring how human moderators operate in the real world: given a comment and its surrounding context, identify which specific rule, if any, is violated. We introduce PluRule, a multimodal, multilingual benchmark for detecting 13,371 rule violations across 1,989 Reddit communities spanning 2,885 rules in 9 languages. Using this benchmark, we show that state-of-the-art vision-language models struggle significantly: even GPT-5.2 with high reasoning performs only slightly better than a trivial baseline. We also find that bigger models and increased context provide only marginal gains, and that universal rules like civility and self-promotion are easier to detect. Our results show that moderation of pluralistic communities on social media is a fundamental challenge for language models. Our code and benchmark are publicly available.
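To make the multiple-choice framing concrete, the following is a minimal sketch of how one benchmark item and its scoring could be represented. It is an illustration only, not the released PluRule code or schema: the names `ModerationItem`, `NO_VIOLATION`, `accuracy`, and `always_no_violation` are hypothetical, and the constant predictor shown is one possible naive baseline, not necessarily the trivial baseline referred to in the abstract.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical label for the "no rule is violated" choice (an assumption).
NO_VIOLATION = "No rule is violated"


@dataclass(frozen=True)
class ModerationItem:
    """One multiple-choice item: a comment, its context, and a community's rules."""
    comment: str
    context: str
    rules: Tuple[str, ...]          # rules of the community the comment was posted in
    violated_rule: Optional[str]    # None when the comment breaks no rule

    def options(self) -> list[str]:
        # The answer choices are the community's rules plus a "none" option.
        return [*self.rules, NO_VIOLATION]

    def answer_index(self) -> int:
        # Index of the correct choice within options().
        if self.violated_rule is None:
            return len(self.rules)
        return self.rules.index(self.violated_rule)


def accuracy(items: Sequence[ModerationItem],
             predict: Callable[[ModerationItem], int]) -> float:
    """Fraction of items for which predict() returns the correct option index."""
    if not items:
        return 0.0
    correct = sum(predict(item) == item.answer_index() for item in items)
    return correct / len(items)


def always_no_violation(item: ModerationItem) -> int:
    """Naive predictor (an assumption): always choose the 'no violation' option."""
    return len(item.rules)
```

A model under evaluation would replace `always_no_violation` with a function that reads `item.comment`, `item.context`, and `item.options()` and returns the index of its chosen option. Because each community supplies its own `rules`, the same comment can have different correct answers in different communities.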

Anthology ID: 2026.acl-long.1590
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 34452–34471
URL: https://aclanthology.org/2026.acl-long.1590/
Cite (ACL): Zoher Kachwala, Bao Tran Truong, Rasika Muralidharan, Haewoon Kwak, Jisun An, and Filippo Menczer. 2026. PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 34452–34471, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): PluRule: A Benchmark for Moderating Pluralistic Communities on Social Media (Kachwala et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-long.1590.pdf
Checklist: 2026.acl-long.1590.checklist.pdf