BibTeX
@inproceedings{zhang-etal-2025-pebr,
title = "p{EBR}: A Probabilistic Approach to Embedding Based Retrieval",
author = "Zhang, Han and
Jiang, Yunjiang and
Li, Mingming and
Yuan, Haowei and
Qiu, Yiming and
Yang, Wen-Yun",
editor = "Potdar, Saloni and
Rojas-Barahona, Lina and
Montella, Sebastien",
booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track",
month = nov,
year = "2025",
address = "Suzhou (China)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.emnlp-industry.161/",
pages = "2332--2342",
ISBN = "979-8-89176-333-3",
abstract = "Embedding-based retrieval aims to learn a shared semantic representation space for both queries and items, enabling efficient and effective item retrieval through approximate nearest neighbor (ANN) algorithms. In current industrial practice, retrieval systems typically retrieve a fixed number of items for each query. However, this fixed-size retrieval often results in insufficient recall for head queries and low precision for tail queries. This limitation largely stems from the dominance of frequentist approaches in loss function design, which fail to address this challenge in industry. In this paper, we propose a novel probabilistic Embedding-Based Retrieval (pEBR) framework. Our method models the item distribution conditioned on each query, enabling the use of a dynamic cosine similarity threshold derived from the cumulative distribution function (CDF) of the probabilistic model. Experimental results demonstrate that pEBR significantly improves both retrieval precision and recall. Furthermore, ablation studies reveal that the probabilistic formulation effectively captures the inherent differences between head-to-tail queries."
}

MODS XML
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="zhang-etal-2025-pebr">
<titleInfo>
<title>pEBR: A Probabilistic Approach to Embedding Based Retrieval</title>
</titleInfo>
<name type="personal">
<namePart type="given">Han</namePart>
<namePart type="family">Zhang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yunjiang</namePart>
<namePart type="family">Jiang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mingming</namePart>
<namePart type="family">Li</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Haowei</namePart>
<namePart type="family">Yuan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yiming</namePart>
<namePart type="family">Qiu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Wen-Yun</namePart>
<namePart type="family">Yang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track</title>
</titleInfo>
<name type="personal">
<namePart type="given">Saloni</namePart>
<namePart type="family">Potdar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lina</namePart>
<namePart type="family">Rojas-Barahona</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sebastien</namePart>
<namePart type="family">Montella</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Suzhou (China)</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-333-3</identifier>
</relatedItem>
<abstract>Embedding-based retrieval aims to learn a shared semantic representation space for both queries and items, enabling efficient and effective item retrieval through approximate nearest neighbor (ANN) algorithms. In current industrial practice, retrieval systems typically retrieve a fixed number of items for each query. However, this fixed-size retrieval often results in insufficient recall for head queries and low precision for tail queries. This limitation largely stems from the dominance of frequentist approaches in loss function design, which fail to address this challenge in industry. In this paper, we propose a novel probabilistic Embedding-Based Retrieval (pEBR) framework. Our method models the item distribution conditioned on each query, enabling the use of a dynamic cosine similarity threshold derived from the cumulative distribution function (CDF) of the probabilistic model. Experimental results demonstrate that pEBR significantly improves both retrieval precision and recall. Furthermore, ablation studies reveal that the probabilistic formulation effectively captures the inherent differences between head-to-tail queries.</abstract>
<identifier type="citekey">zhang-etal-2025-pebr</identifier>
<location>
<url>https://aclanthology.org/2025.emnlp-industry.161/</url>
</location>
<part>
<date>2025-11</date>
<extent unit="page">
<start>2332</start>
<end>2342</end>
</extent>
</part>
</mods>
</modsCollection>

Endnote
%0 Conference Proceedings
%T pEBR: A Probabilistic Approach to Embedding Based Retrieval
%A Zhang, Han
%A Jiang, Yunjiang
%A Li, Mingming
%A Yuan, Haowei
%A Qiu, Yiming
%A Yang, Wen-Yun
%Y Potdar, Saloni
%Y Rojas-Barahona, Lina
%Y Montella, Sebastien
%S Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou (China)
%@ 979-8-89176-333-3
%F zhang-etal-2025-pebr
%X Embedding-based retrieval aims to learn a shared semantic representation space for both queries and items, enabling efficient and effective item retrieval through approximate nearest neighbor (ANN) algorithms. In current industrial practice, retrieval systems typically retrieve a fixed number of items for each query. However, this fixed-size retrieval often results in insufficient recall for head queries and low precision for tail queries. This limitation largely stems from the dominance of frequentist approaches in loss function design, which fail to address this challenge in industry. In this paper, we propose a novel probabilistic Embedding-Based Retrieval (pEBR) framework. Our method models the item distribution conditioned on each query, enabling the use of a dynamic cosine similarity threshold derived from the cumulative distribution function (CDF) of the probabilistic model. Experimental results demonstrate that pEBR significantly improves both retrieval precision and recall. Furthermore, ablation studies reveal that the probabilistic formulation effectively captures the inherent differences between head-to-tail queries.
%U https://aclanthology.org/2025.emnlp-industry.161/
%P 2332-2342
Markdown (Informal)
[pEBR: A Probabilistic Approach to Embedding Based Retrieval](https://aclanthology.org/2025.emnlp-industry.161/) (Zhang et al., EMNLP 2025)
ACL
Han Zhang, Yunjiang Jiang, Mingming Li, Haowei Yuan, Yiming Qiu, and Wen-Yun Yang. 2025. pEBR: A Probabilistic Approach to Embedding Based Retrieval. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2332–2342, Suzhou (China). Association for Computational Linguistics.
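Informally, the abstract's central idea is that each query receives its own cosine-similarity cutoff, taken from the CDF of a probabilistic model of that query's relevant items, rather than a fixed top-k cut shared by all queries. The sketch below illustrates only that thresholding step; the Gaussian model per query, the 0.95 retained-mass target, and the function names are illustrative assumptions, not details from the paper.

```python
import numpy as np
from scipy.stats import norm

# Illustrative sketch only, not the authors' implementation.
# Assumption: per-query relevant-item cosine similarities follow a Gaussian
# with parameters (mu, sigma); the paper's abstract states only that a
# probabilistic model's CDF supplies a query-dependent threshold.

def dynamic_threshold(mu: float, sigma: float, mass: float = 0.95) -> float:
    """Cosine-similarity cutoff keeping the top `mass` of the modeled
    relevant-item distribution for one query: tau = CDF^{-1}(1 - mass)."""
    return norm.ppf(1.0 - mass, loc=mu, scale=sigma)

def retrieve(query_emb, item_embs, mu, sigma, mass=0.95):
    """Return indices of items whose cosine similarity to the query exceeds
    the query-specific threshold, instead of taking a fixed number of items."""
    q = query_emb / np.linalg.norm(query_emb)
    items = item_embs / np.linalg.norm(item_embs, axis=1, keepdims=True)
    sims = items @ q                      # cosine similarities to the query
    tau = dynamic_threshold(mu, sigma, mass)
    return np.flatnonzero(sims >= tau)
```

Under this reading, a head query whose model places high probability mass at large similarities ends up with a higher cutoff (fewer, more precise items), while a tail query's flatter distribution yields a lower cutoff and a larger candidate set, which is the head/tail behavior the abstract describes.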