RuleR: Improving LLM Controllability by Rule-based Data Recycling

Ming Li; Han Chen; Chenguang Wang; Dang Nguyen; Dianqi Li; Tianyi Zhou

doi:10.18653/v1/2025.naacl-short.78

RuleR: Improving LLM Controllability by Rule-based Data Recycling

Ming Li, Han Chen, Chenguang Wang, Dang Nguyen, Dianqi Li, Tianyi Zhou

Abstract

Large language models (LLMs) still lack delicate controllability over their responses, which is critical to enhancing their performance and the user experience. However, curating supervised fine-tuning (SFT) datasets to improve LLM controllability usually relies on human experts or proprietary LLMs, which requires additional costs. To bridge this gap, we propose Rule-based Data Recycling (RuleR), a data augmentation method incorporating multiple constraints into the original data samples according to predefined rules, which creates new training tasks to consolidate the controllability of LLMs. Instead of creating new data from scratch, RuleR “recycles” existing data by simply applying rule-based edits to their responses and appending the rule-instructions in their original instructions. Experimental results demonstrate RuleR’s effectiveness in improving LLM controllability while maintaining general instruction-following capabilities.

Anthology ID:: 2025.naacl-short.78
Volume:: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 926–943
Language:
URL:: https://aclanthology.org/2025.naacl-short.78/
DOI:: 10.18653/v1/2025.naacl-short.78
Bibkey:
Cite (ACL):: Ming Li, Han Chen, Chenguang Wang, Dang Nguyen, Dianqi Li, and Tianyi Zhou. 2025. RuleR: Improving LLM Controllability by Rule-based Data Recycling. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 926–943, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: RuleR: Improving LLM Controllability by Rule-based Data Recycling (Li et al., NAACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.naacl-short.78.pdf

PDF Cite Search Fix data