Qi Sun
Unverified author pages with similar names: Qi Sun
2025
Advancing Sequential Numerical Prediction in Autoregressive Models
Xiang Fei | Jinghui Lu | Qi Sun | Hao Feng | Yanjie Wang | Wei Shi | An-Lan Wang | Jingqun Tang | Can Huang
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Xiang Fei | Jinghui Lu | Qi Sun | Hao Feng | Yanjie Wang | Wei Shi | An-Lan Wang | Jingqun Tang | Can Huang
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Autoregressive models have become the de facto choice for sequence generation tasks, but standard approaches treat digits as independent tokens and apply cross-entropy loss, overlooking the coherent structure of numerical sequences. This paper introduces Numerical Token Integrity Loss(NTIL) to address this gap. NTIL operates at two levels: (1) token-level, where it extends the Earth Mover’s Distance (EMD) to preserve ordinal relationships between numerical values, and (2) sequence-level, where it penalizes the overall discrepancy between the predicted and actual sequences. This dual approach improves numerical prediction and integrates effectively with LLMs/MLLMs. Extensive experiments show significant performance improvements with NTIL.