ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction

Wenda Liu; Song Zhigang; Shuai Nie; Guangyao Liu; Lisung Chen; Binyu Yang; Yaran Chen; Peng Zhou; Hongzhen Wang; Yuchen Liu (刘雨辰); Wenyue Hu; Jiaming Xu; Runyu Shi; Ying Huang

ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction

Wenda Liu, Song Zhigang, Shuai Nie, Guangyao Liu, Lisung Chen, Binyu Yang, Yaran Chen, Peng Zhou, Hongzhen Wang, Yuchen Liu, Wenyue Hu, Jiaming Xu, Runyu Shi, Ying Huang

Abstract

LLM-based universal information extraction (UIE) methods often rely on additional information beyond the original training data, which increases training complexity yet often yields limited gains. To address this, we propose ProUIE, a Macro-to-Micro progressive learning approach that improves UIE without introducing any external information. ProUIE consists of three stages: (i) macro-level Complete Modeling (CM), which learns NER, RE, and EE along their intrinsic difficulty order on the full training data to build a unified extraction foundation, (ii) meso-level Streamlined Alignment (SA), which operates on sampled data with simplified target formats, streamlining and regularizing structured outputs to make them more concise and controllable, and (iii) micro-level Deep Exploration (DE), which applies GRPO with stepwise fine-grained rewards (SFR) over structural units to guide exploration and improve performance. Experiments on 36 public datasets show that ProUIE consistently improves unified extraction, outperforming strong instruction-tuned baselines on average for NER and RE while using a smaller backbone, and it further demonstrates clear gains in production-oriented information extraction.

Anthology ID:: 2026.findings-acl.1093
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 21737–21750
Language:
URL:: https://aclanthology.org/2026.findings-acl.1093/
DOI:
Bibkey:
Cite (ACL):: Wenda Liu, Song Zhigang, Shuai Nie, Guangyao Liu, Lisung Chen, Binyu Yang, Yaran Chen, Peng Zhou, Hongzhen Wang, Yuchen Liu, Wenyue Hu, Jiaming Xu, Runyu Shi, and Ying Huang. 2026. ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction. In Findings of the Association for Computational Linguistics: ACL 2026, pages 21737–21750, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction (Liu et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1093.pdf
Checklist:: 2026.findings-acl.1093.checklist.pdf

PDF Cite Search Checklist Fix data