LogiCoL: Logically-Informed Contrastive Learning for Set-based Dense Retrieval

Yanzhen Shen; Sihao Chen; Xueqiang Xu; Yunyi Zhang; Chaitanya Malaviya; Dan Roth

doi:10.18653/v1/2025.emnlp-main.608

Abstract

While significant progress has been made with dual- and bi-encoder dense retrievers, they often struggle on queries with logical connectives, a use case that is often overlooked yet important in downstream applications: the retrieved results do not respect the logical constraints implied by the query. To address this challenge, we introduce LogiCoL, a logically-informed contrastive learning objective for dense retrievers. LogiCoL builds upon in-batch supervised contrastive learning and trains dense retrievers to respect the subset and mutually-exclusive set relations between query results, using two sets of soft constraints expressed via t-norms in the learning objective. We evaluate LogiCoL on the task of entity retrieval, where the model is expected to retrieve a set of Wikipedia entities that satisfy the implicit logical constraints in the query. We show that models trained with LogiCoL improve both retrieval performance and the logical consistency of the results. We provide detailed analysis and insights into why queries with logical connectives are challenging for dense retrievers and why LogiCoL is most effective.

Anthology ID: 2025.emnlp-main.608
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2025
Address: Suzhou, China
Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 12114–12125
URL: https://aclanthology.org/2025.emnlp-main.608/
DOI: 10.18653/v1/2025.emnlp-main.608
Cite (ACL): Yanzhen Shen, Sihao Chen, Xueqiang Xu, Yunyi Zhang, Chaitanya Malaviya, and Dan Roth. 2025. LogiCoL: Logically-Informed Contrastive Learning for Set-based Dense Retrieval. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 12114–12125, Suzhou, China. Association for Computational Linguistics.
Cite (Informal): LogiCoL: Logically-Informed Contrastive Learning for Set-based Dense Retrieval (Shen et al., EMNLP 2025)
PDF: https://aclanthology.org/2025.emnlp-main.608.pdf
Checklist: 2025.emnlp-main.608.checklist.pdf
