KidsArtBench: Multi-Dimensional Children’s Art Evaluation with Attribute-Aware MLLMs

Mingrui Ye; Chanjin Zheng; Zengyi Yu; Chenyu Xiang; Zhixue Zhao; Zheng Yuan; Helen Yannakoudakis

KidsArtBench: Multi-Dimensional Children’s Art Evaluation with Attribute-Aware MLLMs

Mingrui Ye, Chanjin Zheng, Zengyi Yu, Chenyu Xiang, Zhixue Zhao, Zheng Yuan, Helen Yannakoudakis

Abstract

Multimodal Large Language Models (MLLMs) show progress across many visual–language tasks; however, their capacity to evaluate artistic expression remains limited: aesthetic concepts are inherently abstract and open-ended, and multimodal artwork annotations are scarce. We introduce KidsArtBench, a new benchmark of over 1k children’s artworks (ages 5-15) annotated by 12 expert educators across 9 rubric-aligned dimensions, together with expert comments for feedback. Unlike prior aesthetic datasets that provide single scalar scores on adult imagery, KidsArtBench targets children’s artwork and pairs multi-dimensional annotations with comment supervision to enable both ordinal assessment and formative feedback. Building on this resource, we propose an attribute-specific multi-LoRA approach – where each attribute corresponds to a distinct evaluation dimension (e.g., Realism, Imagination) in the scoring rubric – with Regression-Aware Fine-Tuning (RAFT) to align predictions with ordinal scales. On Qwen2.5-VL-7B, our method increases correlation from 0.468 to 0.653, with the largest gains on perceptual dimensions and narrowed gaps on higher-order attributes. Our results show that educator-aligned supervision and attribute-aware training yield pedagogically meaningful evaluations and establish a rigorous testbed for sustained progress in educational AI. We release data and code with ethics documentation.

Anthology ID:: 2026.eacl-long.267
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5702–5722
Language:
URL:: https://aclanthology.org/2026.eacl-long.267/
DOI:
Bibkey:
Cite (ACL):: Mingrui Ye, Chanjin Zheng, Zengyi Yu, Chenyu Xiang, Zhixue Zhao, Zheng Yuan, and Helen Yannakoudakis. 2026. KidsArtBench: Multi-Dimensional Children’s Art Evaluation with Attribute-Aware MLLMs. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5702–5722, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: KidsArtBench: Multi-Dimensional Children’s Art Evaluation with Attribute-Aware MLLMs (Ye et al., EACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.eacl-long.267.pdf
Checklist:: 2026.eacl-long.267.checklist.pdf

PDF Cite Search Checklist Fix data