@inproceedings{wang-etal-2025-omni,
title = "Omni-Chart-600{K}: A Comprehensive Dataset of Chart Types for Chart Understanding",
author = "Wang, Shulei and
Yang, Shuai and
Lin, Wang and
Guo, Zirun and
Cai, Sihang and
Huang, Hai and
Wang, Ye and
Chen, Jingyuan and
Jin, Tao",
editor = "Chiruzzo, Luis and
Ritter, Alan and
Wang, Lu",
booktitle = "Findings of the Association for Computational Linguistics: NAACL 2025",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-naacl.226/",
doi = "10.18653/v1/2025.findings-naacl.226",
pages = "4051--4069",
ISBN = "979-8-89176-195-7",
abstract = "To address the deficiencies in chart types and the limited scope of chart tasks in existing datasets, we conducted a comprehensive review of current data collection methodologies. By integrating manual annotation with data generation leveraging GPT-4, we developed a dataset that includes 21 diverse chart types and a broad spectrum of tasks, such as data retrieval and mathematical reasoning. Our analysis of existing models revealed that capabilities in information extraction, mathematical reasoning, and understanding of multiple chart types are essential for performing a variety of chart tasks. To overcome the limitations in these areas, we devised a two-stage training strategy and a method for jointly training the vision encoder tailored for multi-type charts. In the first stage, we designed several tasks to enhance the model{'}s general understanding of charts, aligning multimodal large models pre-trained on natural images to chart tasks. To further improve the model{'}s capability to understand various chart tasks and enhance its reasoning abilities, we employed Chain-of-Thought data for training in the second stage. Through two-stage training on our proposed dataset, the pre-trained multimodal large language model achieved state-of-the-art performance across multiple chart understanding tasks, demonstrating the superiority of our data and methods."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="wang-etal-2025-omni">
    <titleInfo>
      <title>Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Shulei</namePart>
      <namePart type="family">Wang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Shuai</namePart>
      <namePart type="family">Yang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Wang</namePart>
      <namePart type="family">Lin</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Zirun</namePart>
      <namePart type="family">Guo</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Sihang</namePart>
      <namePart type="family">Cai</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Hai</namePart>
      <namePart type="family">Huang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Ye</namePart>
      <namePart type="family">Wang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Jingyuan</namePart>
      <namePart type="family">Chen</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Tao</namePart>
      <namePart type="family">Jin</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-04</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Findings of the Association for Computational Linguistics: NAACL 2025</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Luis</namePart>
        <namePart type="family">Chiruzzo</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Alan</namePart>
        <namePart type="family">Ritter</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Lu</namePart>
        <namePart type="family">Wang</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Albuquerque, New Mexico</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
      <identifier type="isbn">979-8-89176-195-7</identifier>
    </relatedItem>
    <abstract>To address the deficiencies in chart types and the limited scope of chart tasks in existing datasets, we conducted a comprehensive review of current data collection methodologies. By integrating manual annotation with data generation leveraging GPT-4, we developed a dataset that includes 21 diverse chart types and a broad spectrum of tasks, such as data retrieval and mathematical reasoning. Our analysis of existing models revealed that capabilities in information extraction, mathematical reasoning, and understanding of multiple chart types are essential for performing a variety of chart tasks. To overcome the limitations in these areas, we devised a two-stage training strategy and a method for jointly training the vision encoder tailored for multi-type charts. In the first stage, we designed several tasks to enhance the model’s general understanding of charts, aligning multimodal large models pre-trained on natural images to chart tasks. To further improve the model’s capability to understand various chart tasks and enhance its reasoning abilities, we employed Chain-of-Thought data for training in the second stage. Through two-stage training on our proposed dataset, the pre-trained multimodal large language model achieved state-of-the-art performance across multiple chart understanding tasks, demonstrating the superiority of our data and methods.</abstract>
    <identifier type="citekey">wang-etal-2025-omni</identifier>
    <identifier type="doi">10.18653/v1/2025.findings-naacl.226</identifier>
    <location>
      <url>https://aclanthology.org/2025.findings-naacl.226/</url>
    </location>
    <part>
      <date>2025-04</date>
      <extent unit="page">
        <start>4051</start>
        <end>4069</end>
      </extent>
    </part>
  </mods>
</modsCollection>

%0 Conference Proceedings
%T Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding
%A Wang, Shulei
%A Yang, Shuai
%A Lin, Wang
%A Guo, Zirun
%A Cai, Sihang
%A Huang, Hai
%A Wang, Ye
%A Chen, Jingyuan
%A Jin, Tao
%Y Chiruzzo, Luis
%Y Ritter, Alan
%Y Wang, Lu
%S Findings of the Association for Computational Linguistics: NAACL 2025
%D 2025
%8 April
%I Association for Computational Linguistics
%C Albuquerque, New Mexico
%@ 979-8-89176-195-7
%F wang-etal-2025-omni
%X To address the deficiencies in chart types and the limited scope of chart tasks in existing datasets, we conducted a comprehensive review of current data collection methodologies. By integrating manual annotation with data generation leveraging GPT-4, we developed a dataset that includes 21 diverse chart types and a broad spectrum of tasks, such as data retrieval and mathematical reasoning. Our analysis of existing models revealed that capabilities in information extraction, mathematical reasoning, and understanding of multiple chart types are essential for performing a variety of chart tasks. To overcome the limitations in these areas, we devised a two-stage training strategy and a method for jointly training the vision encoder tailored for multi-type charts. In the first stage, we designed several tasks to enhance the model’s general understanding of charts, aligning multimodal large models pre-trained on natural images to chart tasks. To further improve the model’s capability to understand various chart tasks and enhance its reasoning abilities, we employed Chain-of-Thought data for training in the second stage. Through two-stage training on our proposed dataset, the pre-trained multimodal large language model achieved state-of-the-art performance across multiple chart understanding tasks, demonstrating the superiority of our data and methods.
%R 10.18653/v1/2025.findings-naacl.226
%U https://aclanthology.org/2025.findings-naacl.226/
%U https://doi.org/10.18653/v1/2025.findings-naacl.226
%P 4051-4069
Markdown (Informal)

[Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding](https://aclanthology.org/2025.findings-naacl.226/) (Wang et al., Findings 2025)

ACL

- Shulei Wang, Shuai Yang, Wang Lin, Zirun Guo, Sihang Cai, Hai Huang, Ye Wang, Jingyuan Chen, and Tao Jin. 2025. Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 4051–4069, Albuquerque, New Mexico. Association for Computational Linguistics.