ℛ³: Advertisement Compliance ℛectification via Group-ℛelative Experience Extractor and Curriculum ℛeinforcement

Yuan Chen; Zhenyu Hu; Mengge Xue; Cao Te; Liqun Liu; Peng Shu; Huan Yu; Jie Jiang

Abstract

Rigorous content moderation is crucial for online advertising but leads to millions of daily rejections. This scale renders manual rectification infeasible, particularly for video advertisements. However, existing safety-driven methods often suffer from aggressive over-editing, which compromises the advertiser’s original semantic intent merely to satisfy compliance. In this work, we target the rectification of textual violations in video ads, covering both speech transcripts and on-screen text. We propose ℛ³, a novel framework designed to harmonize compliance with preservation of the original semantic intent. Our approach integrates three key innovations: (1) an experience-driven data synthesis framework that bootstraps high-quality supervision via a group-**R**elative compliance experience extractor; (2) a curriculum **R**einforcement learning strategy with hierarchical rewards designed to enforce compliance while maximizing semantic consistency; and (3) a comprehensive video **R**ectification framework that seamlessly integrates text recognition, rewriting, and re-rendering for industrial deployment. Extensive experiments on industrial datasets and online A/B testing demonstrate that ℛ³ significantly outperforms state-of-the-art baselines, achieving an optimal trade-off between violation rectification and intent preservation.

Anthology ID: 2026.acl-industry.24
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month: July
Year: 2026
Address: San Diego, California, USA
Editors: Yunyao Li, Georg Rehm, Mei Tu
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 358–370
URL: https://aclanthology.org/2026.acl-industry.24/
Cite (ACL): Yuan Chen, Zhenyu Hu, Mengge Xue, Cao Te, Liqun Liu, Peng Shu, Huan Yu, and Jie Jiang. 2026. ℛ³: Advertisement Compliance ℛectification via Group-ℛelative Experience Extractor and Curriculum ℛeinforcement. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 358–370, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal): ℛ³: Advertisement Compliance ℛectification via Group-ℛelative Experience Extractor and Curriculum ℛeinforcement (Chen et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-industry.24.pdf
