Learning from Near-Misses: Error-Aware Contrastive Few-Shot Learning for NL2Formula

Zhihao Shuai; Yiyun Chen; Maolin Ma; Yutong Chen; Hanjia Qiu; Jing Xu; Ziye Chen; Weikai Yang

Learning from Near-Misses: Error-Aware Contrastive Few-Shot Learning for NL2Formula

Zhihao Shuai, Yiyun Chen, Maolin Ma, Yutong Chen, Hanjia Qiu, Jing Xu, Ziye Chen, Weikai Yang

Abstract

Natural Language to Excel Formula (NL2Formula) translates user intent into executable spreadsheet formulas. However, current models often produce near-miss outputs—formulas that parse correctly yet fail at execution due to an incorrect function, operator, or reference. Through a systematic error analysis, we find that these errors repeatedly arise from a small set of structural decision points, motivating the need for typed error supervision rather than general error signals. To this end, we introduce an abstract syntax tree (AST)-based error taxonomy that organizes common error modes by the kind of decision that goes wrong in the parse tree. Building on this taxonomy, we propose Error-Aware Contrastive Few-Shot Learning (ECFL), an error-aware framework that unifies training and inference around typed error supervision. During offline training, ECFL mines near-miss errors, assigns error types under the taxonomy, and builds error-aware contrastive demonstrations for fine-tuning. During online inference, a lightweight predictor estimates likely error types and triggers targeted retrieval of contrastive demonstrations to guide single-pass decoding. Experiments show ECFL improves Exact Match (EM) by 6.4 points over supervised fine-tuning (SFT) and matches self-consistency (SC@5) accuracy at substantially lower inference cost.

Anthology ID:: 2026.acl-long.1368
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 29662–29676
Language:
URL:: https://aclanthology.org/2026.acl-long.1368/
DOI:
Bibkey:
Cite (ACL):: Zhihao Shuai, Yiyun Chen, Maolin Ma, Yutong Chen, Hanjia Qiu, Jing Xu, Ziye Chen, and Weikai Yang. 2026. Learning from Near-Misses: Error-Aware Contrastive Few-Shot Learning for NL2Formula. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 29662–29676, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Learning from Near-Misses: Error-Aware Contrastive Few-Shot Learning for NL2Formula (Shuai et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1368.pdf
Checklist:: 2026.acl-long.1368.checklist.pdf

PDF Cite Search Checklist Fix data