Semantic Similarity Covariance Matrix Shrinkage

Guillaume Becquin, Saher Esmeir


Abstract
An accurate estimation of the covariance matrix is a critical component of many applications in finance, including portfolio optimization. The sample covariance suffers from the curse of dimensionality when the number of observations is in the same order or lower than the number of variables. This tends to be the case in portfolio optimization, where a portfolio manager can choose between thousands of stocks using historical daily returns to guide their investment decisions. To address this issue, past works proposed linear covariance shrinkage to regularize the estimated matrix. While effective, the proposed methods relied solely on historical price data and thus ignored company fundamental data. In this work, we propose to utilise semantic similarity derived from textual descriptions or knowledge graphs to improve the covariance estimation. Rather than using the semantic similarity directly as a biased estimator to the covariance, we employ it as a shrinkage target. The resulting covariance estimators leverage both semantic similarity and recent price history, and can be readily adapted to a broad range of financial securities. The effectiveness of the approach is demonstrated for a period including diverse market conditions and compared with the covariance shrinkage prior art.
Anthology ID:
2023.findings-emnlp.668
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2023
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9977–9992
Language:
URL:
https://aclanthology.org/2023.findings-emnlp.668
DOI:
10.18653/v1/2023.findings-emnlp.668
Bibkey:
Cite (ACL):
Guillaume Becquin and Saher Esmeir. 2023. Semantic Similarity Covariance Matrix Shrinkage. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 9977–9992, Singapore. Association for Computational Linguistics.
Cite (Informal):
Semantic Similarity Covariance Matrix Shrinkage (Becquin & Esmeir, Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-emnlp.668.pdf