Adaptive Hyper-parameter Learning for Deep Semantic Retrieval

Mingming Li; Chunyuan Yuan; Huimu Wang; Peng Wang; Jingwei Zhuo; Binbin Wang; Lin Liu; Sulong Xu

doi:10.18653/v1/2023.emnlp-industry.72

Adaptive Hyper-parameter Learning for Deep Semantic Retrieval

Mingming Li, Chunyuan Yuan, Huimu Wang, Peng Wang, Jingwei Zhuo, Binbin Wang, Lin Liu, Sulong Xu

Abstract

Deep semantic retrieval has achieved remarkable success in online E-commerce applications. The majority of methods aim to distinguish positive items and negative items for each query by utilizing margin loss or softmax loss. Despite their decent performance, these methods are highly sensitive to hyper-parameters, i.e., margin and temperature 𝜏, which measure the similarity of negative pairs and affect the distribution of items in metric space. How to design and choose adaptively parameters for different pairs is still an open challenge. Recently several methods have attempted to alleviate the above problem by learning each parameter through trainable/statistical methods in the recommendation. We argue that those are not suitable for retrieval scenarios, due to the agnosticism and diversity of the queries. To fully overcome this limitation, we propose a novel adaptive metric learning method that designs a simple and universal hyper-parameter-free learning method to improve the performance of retrieval. Specifically, we first propose a method that adaptive obtains the hyper-parameters by relying on the batch similarity without fixed or extra-trainable hyper-parameters. Subsequently, we adopt a symmetric metric learning method to mitigate model collapse issues. Furthermore, the proposed method is general and sheds a highlight on other fields. Extensive experiments demonstrate our method significantly outperforms previous methods on a real-world dataset, highlighting the superiority and effectiveness of our method. This method has been successfully deployed on an online E-commerce search platform and brought substantial economic benefits.

Anthology ID:: 2023.emnlp-industry.72
Volume:: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Mingxuan Wang, Imed Zitouni
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 775–782
Language:
URL:: https://aclanthology.org/2023.emnlp-industry.72/
DOI:: 10.18653/v1/2023.emnlp-industry.72
Bibkey:
Cite (ACL):: Mingming Li, Chunyuan Yuan, Huimu Wang, Peng Wang, Jingwei Zhuo, Binbin Wang, Lin Liu, and Sulong Xu. 2023. Adaptive Hyper-parameter Learning for Deep Semantic Retrieval. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 775–782, Singapore. Association for Computational Linguistics.
Cite (Informal):: Adaptive Hyper-parameter Learning for Deep Semantic Retrieval (Li et al., EMNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.emnlp-industry.72.pdf

PDF Cite Search Fix data