Financial Forecasting from Textual and Tabular Time Series

Ross Koval, Nicholas Andrews, Xifeng Yan


Abstract
There is a variety of multimodal data pertinent to public companies, spanning from accounting statements, macroeconomic statistics, earnings conference calls, and financial reports. These diverse modalities capture the state of firms from a variety of different perspectives but requires complex interactions to reconcile in the formation of accurate financial predictions. The commonality between these different modalities is that they all represent a time series, typically observed for a particular firm at a quarterly horizon, providing the ability to model trends and variations of company data over time. However, the time series of these diverse modalities contains varying temporal and cross-channel patterns that are challenging to model without the appropriate inductive biases. In this work, we design a novel multimodal time series prediction task that includes numerical financial results, macroeconomic states, and long financial documents to predict next quarter’s company earnings relative to analyst expectations. We explore a variety of approaches for this novel setting, establish strong unimodal baselines, and propose a multimodal model that exhibits state-of-the-art performance on this unique task. We demonstrate that each modality contains unique information and that the best performing model requires careful fusion of the different modalities in a multi-stage training approach. To better understand model behavior, we conduct a variety of probing experiments, reveal insights into the value of different modalities, and demonstrate the practical utility of our proposed method in a simulated trading setting.
Anthology ID:
2024.findings-emnlp.486
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8289–8300
Language:
URL:
https://aclanthology.org/2024.findings-emnlp.486
DOI:
Bibkey:
Cite (ACL):
Ross Koval, Nicholas Andrews, and Xifeng Yan. 2024. Financial Forecasting from Textual and Tabular Time Series. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 8289–8300, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Financial Forecasting from Textual and Tabular Time Series (Koval et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-emnlp.486.pdf