HiCOT: Improving Neural Topic Models via Optimal Transport and Contrastive Learning

Hoang Tran Vuong; Tue Le; Tu Vu; Tung Nguyen; Linh Ngo Van; Dinh Viet Sang; Thien Huu Nguyen

doi:10.18653/v1/2025.findings-acl.715

HiCOT: Improving Neural Topic Models via Optimal Transport and Contrastive Learning

Hoang Tran Vuong, Tue Le, Tu Vu, Tung Nguyen, Linh Ngo Van, Sang Dinh, Thien Huu Nguyen

Abstract

Recent advances in neural topic models (NTMs) have improved topic quality but still face challenges: weak document-topic alignment, high inference costs due to large pretrained language models (PLMs), and limited modeling of hierarchical topic structures. To address these issues, we introduce HiCOT (Hierarchical Clustering and Contrastive Learning with Optimal Transport for Neural Topic Modeling), a novel framework that enhances topic coherence and efficiency. HiCOT integrates Optimal Transport to refine document-topic relationships using compact PLM-based embeddings, captures semantic structure of the documents. Additionally, it employs hierarchical clustering combine with contrastive learning to disentangle topic-word and topic-topic relationships, ensuring clearer structure and better coherence. Experimental results on multiple benchmark datasets demonstrate HiCOT’s superior effectiveness over existing NTMs in topic coherence, topic performance, representation quality, and computational efficiency.

Anthology ID:: 2025.findings-acl.715
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13894–13920
Language:
URL:: https://aclanthology.org/2025.findings-acl.715/
DOI:: 10.18653/v1/2025.findings-acl.715
Bibkey:
Cite (ACL):: Hoang Tran Vuong, Tue Le, Tu Vu, Tung Nguyen, Linh Ngo Van, Sang Dinh, and Thien Huu Nguyen. 2025. HiCOT: Improving Neural Topic Models via Optimal Transport and Contrastive Learning. In Findings of the Association for Computational Linguistics: ACL 2025, pages 13894–13920, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: HiCOT: Improving Neural Topic Models via Optimal Transport and Contrastive Learning (Vuong et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.715.pdf

PDF Cite Search Fix data