AIGT: AI Generative Table Based on Prompt

Mingming Zhang (张明明); Zhiqing Xiao; Guoshan Lu; Sai Wu; Weiqiang Wang; Xing Fu; Can Yi; Junbo Zhao

AIGT: AI Generative Table Based on Prompt

Mingming Zhang, Zhiqing Xiao, Guoshan Lu, Sai Wu, Weiqiang Wang, Xing Fu, Can Yi, Junbo Zhao

Abstract

Tabular data, which accounts for over 80% of enterprise data assets, is vital in various fields. With growing concerns about privacy protection and data-sharing restrictions, generating high-quality synthetic tabular data has become essential. Recent advancements show that large language models (LLMs) can effectively generate realistic tabular data by leveraging semantic information and overcoming the challenges of high-dimensional data that arise from one-hot encoding. However, current methods do not fully utilize the rich information available in tables. To address this, we introduce AI Generative Table based on prompt enhancement, a novel approach that utilizes metadata information, such as table descriptions and schemas, as prompts to generate ultra-high-quality synthetic data. To overcome the token limit constraints of LLMs, we propose long-token partitioning algorithms that enable AIGT to model tables of any scale. AIGT achieves state-of-the-art performance on 14 out of 20 public datasets and two real industry datasets within the Alipay risk control system.

Anthology ID:: 2025.coling-main.664
Volume:: Proceedings of the 31st International Conference on Computational Linguistics
Month:: January
Year:: 2025
Address:: Abu Dhabi, UAE
Editors:: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Venue:: COLING
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 9926–9938
Language:
URL:: https://aclanthology.org/2025.coling-main.664/
DOI:
Bibkey:
Cite (ACL):: Mingming Zhang, Zhiqing Xiao, Guoshan Lu, Sai Wu, Weiqiang Wang, Xing Fu, Can Yi, and Junbo Zhao. 2025. AIGT: AI Generative Table Based on Prompt. In Proceedings of the 31st International Conference on Computational Linguistics, pages 9926–9938, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):: AIGT: AI Generative Table Based on Prompt (Zhang et al., COLING 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.coling-main.664.pdf

PDF Cite Search Fix data