Dashboard2Code: Evaluating Multimodal Models on Reconstructing Interactive Dashboards

Tianhao Niu; Ziyu Han; Qiguang Chen (陈麒光); Shiqi Zhou (周士祺); Baocai Shan; Hengjie Fang; Qingfu Zhu; Wanxiang Che (车万翔)

Dashboard2Code: Evaluating Multimodal Models on Reconstructing Interactive Dashboards

Tianhao Niu, Ziyu Han, Qiguang Chen, Shiqi Zhou, Baocai Shan, Hengjie Fang, Qingfu Zhu, Wanxiang Che

Abstract

Automatic data visualization generation have advanced rapidly with multi-modal large language models, yet existing efforts largely focus on static charts and overlook the interactive dashboards commonly used for real-world data exploration. We introduce Dashboard2Code, a novel task that requires a model to proactively explore an interactive dashboard, acquire and integrate feedback from its own interactions (e.g., clicking and filtering), and generate code that reproduces the target dashboard. To support comprehensive evaluation, we present DashboardMimic, the first Plotly+Dash benchmark for Dashboard2Code, comprising 180 carefully designed and manually verified dashboard–code pairs spanning three difficulty levels and covering eight common real-world interaction patterns. We further propose an automated evaluation framework tailored to dashboards that combines code semantic analysis with dynamic interaction-based testing to assess visual and interaction consistency, showing strong agreement with human judgments. Experiments across a range of open- and closed-source multi-modal models reveal that even the strongest systems struggle on high-complexity dashboards and that a substantial performance gap remains between open-source and closed-source models on the Dashboard2Code task.

Anthology ID:: 2026.acl-long.1750
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 37696–37732
Language:
URL:: https://aclanthology.org/2026.acl-long.1750/
DOI:
Bibkey:
Cite (ACL):: Tianhao Niu, Ziyu Han, Qiguang Chen, Shiqi Zhou, Baocai Shan, Hengjie Fang, Qingfu Zhu, and Wanxiang Che. 2026. Dashboard2Code: Evaluating Multimodal Models on Reconstructing Interactive Dashboards. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 37696–37732, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Dashboard2Code: Evaluating Multimodal Models on Reconstructing Interactive Dashboards (Niu et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1750.pdf
Checklist:: 2026.acl-long.1750.checklist.pdf

PDF Cite Search Checklist Fix data