C-SHAP: Collocation-Aware Explanations for Financial NLP

Martina Menzio; Elisabetta Fersini; Davide Paris

C-SHAP: Collocation-Aware Explanations for Financial NLP

Martina Menzio, Elisabetta Fersini, Davide Paris

Abstract

Understanding the internal decision-making process of NLP models in high-stakes domains such as the financial sector is particularly challenging due to the complexity of domain-specific terminology and the need for transparency and accountability. Although SHAP is a widely used model-agnostic method for attributing model predictions to input features, its standard formulation treats input tokens as independent units, failing to capture the influence of collocations that often carry non-compositional meaning, instead modeled by the current language models. We introduce C-SHAP, an extension of SHAP that incorporates collocational dependencies into the explanation process to account for word combinations in the financial sector. C-SHAP dynamically groups tokens into significant collocations using a financial glossary and computes Shapley values over these structured units. The proposed approach has been evaluated to explain sentiment classification of Federal Reserve Minutes, demonstrating improved alignment with human rationales and better association to model behaviour compared to the standard token-level approach.

Anthology ID:: 2025.ranlp-1.82
Volume:: Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era
Month:: September
Year:: 2025
Address:: Varna, Bulgaria
Editors:: Galia Angelova, Maria Kunilovskaya, Marie Escribe, Ruslan Mitkov
Venue:: RANLP
SIG:
Publisher:: INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:: 711–717
Language:
URL:: https://aclanthology.org/2025.ranlp-1.82/
DOI:
Bibkey:
Cite (ACL):: Martina Menzio, Elisabetta Fersini, and Davide Paris. 2025. C-SHAP: Collocation-Aware Explanations for Financial NLP. In Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era, pages 711–717, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):: C-SHAP: Collocation-Aware Explanations for Financial NLP (Menzio et al., RANLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.ranlp-1.82.pdf

PDF Cite Search Fix data