Bo Zheng
Other people with similar names: Bo Zheng
Unverified author pages with similar names: Bo Zheng
2026
MaskTab: Scalable Masked Tabular Pretraining with Scaling Laws and Distillation for Industrial Classification
Bo Zheng | Yudong Chen | Zihua Xiong | Shuai Fang | Peidong He | Yang Yang | Sheng Guo
Findings of the Association for Computational Linguistics: ACL 2026
Bo Zheng | Yudong Chen | Zihua Xiong | Shuai Fang | Peidong He | Yang Yang | Sheng Guo
Findings of the Association for Computational Linguistics: ACL 2026
Tabular data forms the backbone of high-stakes decision systems in finance, healthcare, and beyond. Yet industrial tabular datasets are inherently difficult: high-dimensional, riddled with missing entries, and rarely labeled at scale. While foundation models have revolutionized vision and language, tabular learning still leans on handcrafted features and lacks a general self-supervised framework. We present MaskTab, a unified pre-training framework designed specifically for industrial-scale tabular data. MaskTab encodes missing values via dedicated learnable tokens, enabling the model to distinguish structural absence from random dropout. It jointly optimizes a hybrid supervised pre-training scheme—utilizing a twin-path architecture to reconcile masked reconstruction with task-specific supervision—and an MoE-augmented loss that adaptively routes features through specialized subnetworks. On industrial-scale benchmarks, it achieves +5.04% AUC and +8.28% KS over prior art under rigorous scaling. Moreover, its representations distill effectively into lightweight models, yielding +2.55% AUC and +4.85% KS under strict latency and interpretability constraints, while improving robustness to distribution shifts. Our work demonstrates that tabular data admits a foundation-model treatment—when its structural idiosyncrasies are respected.
FinMRAGBench: A Realistic and Complex Benchmark for Multi-Modal RAG in Financial Document Analysis
Shouqing Yang | Qi Zhang | Yuhang Yang | Ruikang Xu | Yuwei Hou | Zhulin Jia | Lirong Gao | Haobo Wang | Jinglei Chen | Jiexiang Wang | Sheng Guo | Bo Zheng | Gang Chen
Findings of the Association for Computational Linguistics: ACL 2026
Shouqing Yang | Qi Zhang | Yuhang Yang | Ruikang Xu | Yuwei Hou | Zhulin Jia | Lirong Gao | Haobo Wang | Jinglei Chen | Jiexiang Wang | Sheng Guo | Bo Zheng | Gang Chen
Findings of the Association for Computational Linguistics: ACL 2026
Retrieval-augmented generation (RAG) has become a widely adopted paradigm for realistic financial analysis over financial documents. However, existing benchmarks fail to capture realistic financial analysis settings that involve cross-document retrieval, multi-page evidence integration, and diverse analytical tasks. To address this gap, we introduce FinMRAGBench, a comprehensive multi-modal financial RAG benchmark in which most questions require retrieving evidence scattered across multiple pages and documents, constructed from large-scale real-world annual reports and comprising 887 expert-verified QA pairs spanning five representative financial analysis tasks. Moreover, we introduce FinMRAGAgent, an agent trained on high-quality agentic trajectories following the reasoning-and-acting (ReAct) paradigm, capable of dynamic tool invocation and multi-step financial analysis. Our extensive experiments show that current multi-modal RAG systems still struggle with incomplete retrieval and complex financial reasoning. In contrast, FinMRAGAgent achieves the strongest overall performance across all models, demonstrating that our structured reasoning approach significantly enhances multi-modal RAG in realistic financial scenarios. The code and data are available at https://github.com/sqyangit/FinMRAGBench.
Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning
Qiao Liang | Yuke Zhu | Chao Ge | Lei Yang | Ying Shen | Bo Zheng | Sheng Guo
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Qiao Liang | Yuke Zhu | Chao Ge | Lei Yang | Ying Shen | Bo Zheng | Sheng Guo
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Tool-integrated reasoning (TIR) enables LLM agents to solve tasks through planning, tool use, and iterative revision, but outcome-only reinforcement learning in this setting suffers from sparse, delayed rewards and weak step-level credit assignment. In long-horizon TIR trajectories, an early irrecoverable mistake can determine success or failure, making it crucial to localize the first irrecoverable step and leverage it for fine-grained credit assignment. We propose Error-Localized Policy Optimization (ELPO), which localizes the first irrecoverable step via binary-search rollout trees under a fixed rollout budget, converts the resulting tree into stable learning signals through hierarchical advantage attribution, and applies error-localized adaptive clipping to strengthen corrective updates on the critical step and its suffix. Across TIR benchmarks in math, science QA, and code execution, ELPO consistently outperforms strong Agentic RL baselines under comparable sampling budgets, with additional gains in Pass@K and Major@K scaling, rollout ranking quality, and tool-call efficiency. Our code is publicly released for reproducibility at https://anonymous.4open.science/r/ELPO-7C19.
All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG
Dan Wang | Guozhao Mo | Yafei Shi | Cheng Zhang | Bo Zheng | Boxi Cao | Xuanang Chen | Yaojie Lu | Hongyu Lin | Ben He | Xianpei Han | Le Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Dan Wang | Guozhao Mo | Yafei Shi | Cheng Zhang | Bo Zheng | Boxi Cao | Xuanang Chen | Yaojie Lu | Hongyu Lin | Ben He | Xianpei Han | Le Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Multilingual Retrieval-Augmented Generation (mRAG) leverages cross-lingual evidence to ground Large Language Models (LLMs) in global knowledge. However, we show that current mRAG systems suffer from a language bias during reranking, systematically favoring English and the query’s native language. By introducing an estimated oracle evidence analysis, we quantify a substantial performance gap between existing rerankers and the achievable upper bound. Further analysis reveals a critical distributional mismatch: while optimal predictions require evidence scattered across multiple languages, current systems systematically suppress such “answer-critical” documents, thereby limiting downstream generation performance. To bridge this gap, we propose Language-Agnostic Utility-driven Reranker Alignment (LAURA), Experiments across diverse languages and generation models show that LAURA effectively mitigates language bias and consistently improves mRAG performance.
Towards Interpretable Tabular Reasoning: Enhancing LLM Reasoning on Tabular Data with Pre-Constructed Logic Graph
Lirong Gao | Zewei Yu | Zhongrui Yin | Qi Zhang | Yuke Zhu | Bo Zheng | Haobo Wang | Junbo Zhao | Gang Chen | Sheng Guo
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Lirong Gao | Zewei Yu | Zhongrui Yin | Qi Zhang | Yuke Zhu | Bo Zheng | Haobo Wang | Junbo Zhao | Gang Chen | Sheng Guo
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Tabular data is widely used in fields such as finance and healthcare. Traditional tree-based models are prevalent for tabular prediction tasks due to their ability to handle heterogeneous features. However, their heavy reliance on feature engineering limits both their generalizability and their human-readable interpretability. On the other hand, Large Language Models (LLMs) naturally provide intermediate reasoning steps, thus offering greater transparency in decision-making. Nevertheless, LLMs often fail to match the predictive performance of tree-based models on tabular data. To address these challenges, we propose a novel Logic-Graph-Enhanced LLM Reasoning (LogGER) framework that integrates the strengths of tree-based models and LLMs. Specifically, we reformulate the traditional decision tree as a human-readable logic graph, which explicitly models the causal relationships between features and targets. This logic graph is automatically constructed using LLMs based on data priors and serves as the foundation for LogGER. To fully leverage the logic graph, we further introduce a logic-graph-guided process supervision approach, which evaluates and enhances the quality of LLM’s intermediate reasoning steps using logic-graph-aided process reward. Extensive experiments demonstrate that LogGER consistently outperforms both tree-based models and state-of-the-art LLM methods on a variety of tabular prediction tasks, achieving superior accuracy and interpretability.
Search
Fix author
Co-authors
- Sheng Guo 4
- Gang Chen 2
- Lirong Gao 2
- Haobo Wang 2
- Qi Zhang 2
- Yuke Zhu 2
- Boxi Cao 1
- Jinglei Chen 1
- Xuanang Chen 1
- Yudong Chen 1
- Shuai Fang 1
- Chao Ge 1
- Xianpei Han 1
- Ben He 1
- Peidong He 1
- Yuwei Hou 1
- Zhulin Jia 1
- Qiao Liang 1
- Hongyu Lin 1
- Yaojie Lu 1
- Guozhao Mo 1
- Ying Shen 1
- Yafei Shi 1
- Le Sun 1
- Dan Wang 1
- Jiexiang Wang 1
- Zihua Xiong 1
- Ruikang Xu 1
- Lei Yang 1
- Shouqing Yang 1
- Yang Yang 1
- Yuhang Yang 1
- Zhongrui Yin 1
- Zewei Yu 1
- Cheng Zhang 1
- Junbo Zhao 1