Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing

Yue Guo, Zian Xu, Yi Yang


Abstract
The emergence of Large Language Models (LLMs), such as ChatGPT, has revolutionized general natural language preprocessing (NLP) tasks. However, their expertise in the financial domain lacks a comprehensive evaluation. To assess the ability of LLMs to solve financial NLP tasks, we present FinLMEval, a framework for Financial Language Model Evaluation, comprising nine datasets designed to evaluate the performance of language models. This study compares the performance of fine-tuned auto-encoding language models (BERT, RoBERTa, FinBERT) and the LLM ChatGPT. Our findings reveal that while ChatGPT demonstrates notable performance across most financial tasks, it generally lags behind the fine-tuned expert models, especially when dealing with proprietary datasets. We hope this study builds foundation evaluation benchmarks for continuing efforts to build more advanced LLMs in the financial domain.
Anthology ID:
2023.findings-emnlp.58
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
815–821
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.58
DOI:
10.18653/v1/2023.findings-emnlp.58
Bibkey:
Cite (ACL):
Yue Guo, Zian Xu, and Yi Yang. 2023. Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 815–821, Singapore. Association for Computational Linguistics.
Cite (Informal):
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing (Guo et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.58.pdf