Apples to Apples: A Systematic Evaluation of Topic Models

Ismail Harrando, Pasquale Lisena, Raphael Troncy


Abstract
From statistical to neural models, a wide variety of topic modelling algorithms have been proposed in the literature. However, because of the diversity of datasets and metrics, there have not been many efforts to systematically compare their performance on the same benchmarks and under the same conditions. In this paper, we present a selection of 9 topic modelling techniques from the state of the art reflecting a diversity of approaches to the task, an overview of the different metrics used to compare their performance, and the challenges of conducting such a comparison. We empirically evaluate the performance of these models on different settings reflecting a variety of real-life conditions in terms of dataset size, number of topics, and distribution of topics, following identical preprocessing and evaluation processes. Using both metrics that rely on the intrinsic characteristics of the dataset (different coherence metrics), as well as external knowledge (word embeddings and ground-truth topic labels), our experiments reveal several shortcomings regarding the common practices in topic models evaluation.
Anthology ID:
2021.ranlp-1.55
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021)
Month:
September
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
483–493
Language:
URL:
https://aclanthology.org/2021.ranlp-1.55
DOI:
Bibkey:
Cite (ACL):
Ismail Harrando, Pasquale Lisena, and Raphael Troncy. 2021. Apples to Apples: A Systematic Evaluation of Topic Models. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 483–493, Held Online. INCOMA Ltd..
Cite (Informal):
Apples to Apples: A Systematic Evaluation of Topic Models (Harrando et al., RANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.ranlp-1.55.pdf
Code
 d2klab/tomodapi
Data
Yahoo! Answers