A Graph-Based Method for Unsupervised Knowledge Discovery from Financial Texts

Joel Oksanen, Abhilash Majumder, Kumar Saunack, Francesca Toni, Arun Dhondiyal


Abstract
The need for manual review of various financial texts, such as company filings and news, presents a major bottleneck in financial analysts’ work. Thus, there is great potential for the application of NLP methods, tools and resources to fulfil a genuine industrial need in finance. In this paper, we show how this potential can be fulfilled by presenting an end-to-end, fully unsupervised method for knowledge discovery from financial texts. Our method creatively integrates existing resources to construct automatically a knowledge graph of companies and related entities as well as to carry out unsupervised analysis of the resulting graph to provide quantifiable and explainable insights from the produced knowledge. The graph construction integrates entity processing and semantic expansion, before carrying out open relation extraction. We illustrate our method by calculating automatically the environmental rating for companies in the S&P 500, based on company filings with the SEC (Securities and Exchange Commission). We then show the usefulness of our method in this setting by providing an assessment of our method’s outputs with an independent MSCI source.
Anthology ID:
2022.lrec-1.579
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5412–5417
Language:
URL:
https://aclanthology.org/2022.lrec-1.579
DOI:
Bibkey:
Cite (ACL):
Joel Oksanen, Abhilash Majumder, Kumar Saunack, Francesca Toni, and Arun Dhondiyal. 2022. A Graph-Based Method for Unsupervised Knowledge Discovery from Financial Texts. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5412–5417, Marseille, France. European Language Resources Association.
Cite (Informal):
A Graph-Based Method for Unsupervised Knowledge Discovery from Financial Texts (Oksanen et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.579.pdf