Reference-Free Schema Generation for Literature Review Tables via Multi-Faceted Rewards

Sinjoy Saha; Suman Saha; Mahfuza Farooque; Wenpeng Yin

Abstract

To accelerate scientific knowledge acquisition, LLMs are increasingly used to synthesize multiple papers into structured tables by inferring schemas and values. While value generation within a fixed schema can often be reduced to extractive question answering, the schema generation problem, that is, determining the dimensions along which to compare a set of documents, lacks a formal mapping to standard NLP tasks. In this work, we formulate schema generation as a reinforcement learning problem and investigate whether these dimensions can be induced without access to gold-standard schemas. We design a multi-faceted reward framework capturing schema coverage, non-redundancy, relevance, and format, and train a small language model on a literature review dataset. Our approach yields consistent improvements over the untuned base model across intrinsic, reference-based, and LLM-judge metrics, and remains competitive with supervised fine-tuned models at 5× the parameter count on structural and diversity dimensions. All code, results, and prompts are available in the GitHub repository: https://github.com/sinjoysaha/rl-schema-generation
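The abstract names four reward facets (coverage, non-redundancy, relevance, and format) but does not say how each is computed or combined. The sketch below is one possible reference-free instantiation, not the authors' implementation: the function name `schema_reward`, the JSON-list output format, the token-overlap proxies, and the equal weights are all assumptions made for illustration.

```python
import json
import re


def _tokens(text):
    """Lowercased alphanumeric tokens of a string, as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def _jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def schema_reward(raw_output, papers, weights=(0.25, 0.25, 0.25, 0.25)):
    """Hypothetical multi-faceted reward for a generated schema.

    raw_output: model output, assumed to be a JSON list of column names.
    papers:     list of paper texts (e.g. abstracts) the table compares.
    weights:    (coverage, non_redundancy, relevance, format) weights.
    No gold schema is used anywhere in this function.
    """
    # Format: output must parse as a non-empty JSON list of non-empty strings.
    try:
        columns = json.loads(raw_output)
    except json.JSONDecodeError:
        return 0.0
    if not (isinstance(columns, list) and columns
            and all(isinstance(c, str) and c.strip() for c in columns)):
        return 0.0
    fmt = 1.0

    col_toks = [_tokens(c) for c in columns]
    paper_toks = [_tokens(p) for p in papers]

    # Relevance proxy: share of columns that share a token with some paper.
    relevance = sum(any(ct & pt for pt in paper_toks) for ct in col_toks) / len(columns)

    # Coverage proxy: share of papers touched by at least one column.
    coverage = (
        sum(any(ct & pt for ct in col_toks) for pt in paper_toks) / len(papers)
        if papers else 0.0
    )

    # Non-redundancy proxy: one minus the highest pairwise column similarity.
    max_sim = max(
        (_jaccard(col_toks[i], col_toks[j])
         for i in range(len(columns)) for j in range(i + 1, len(columns))),
        default=0.0,
    )
    non_redundancy = 1.0 - max_sim

    w_cov, w_red, w_rel, w_fmt = weights
    return w_cov * coverage + w_red * non_redundancy + w_rel * relevance + w_fmt * fmt
```

A scalar of this kind could serve as the per-sample reward in a policy-gradient training loop. In practice the token-overlap proxies would likely be replaced by embedding-based or model-based scores, which the abstract leaves unspecified.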

Anthology ID: 2026.acl-srw.111
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 1253–1261
URL: https://aclanthology.org/2026.acl-srw.111/
Cite (ACL): Sinjoy Saha, Suman Saha, Mahfuza Farooque, and Wenpeng Yin. 2026. Reference-Free Schema Generation for Literature Review Tables via Multi-Faceted Rewards. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 1253–1261, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Reference-Free Schema Generation for Literature Review Tables via Multi-Faceted Rewards (Saha et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-srw.111.pdf
