Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance

Hai Zhu, Yuankai Guo, Ronggang Dou, Kai Liu


Abstract
Relevance module plays a fundamental role in e-commerce search as they are responsible for selecting relevant products from thousands of items based on user queries, thereby enhancing users experience and efficiency. The traditional method calculates the relevance score based on product titles and user queries, but the information in title alone maybe insufficient to describe the product completely. A more general method is to further leverage product image information. In recent years, vision-language pre-training model has achieved impressive results in many scenarios, which leverage contrastive learning to map both textual and visual features into a joint embedding space. In e-commerce, a common practice is to further fine-tune the model using e-commerce data on the basis of pre-trained model. However, the performance is sub-optimal because the vision-language pre-training models lack of alignment specifically designed for queries. In this paper, we propose Query-aware Language Image Fusion Embedding to address these challenges. Query-LIFE utilizes a query-based multimodal fusion to effectively incorporate the image and title based on the product types. Additionally, it employs query-aware modal alignment to enhance the accuracy of the comprehensive representation of products. Furthermore, we design GenFilt, which utilizes the generation capability of large models to filter out false negative samples and further improve the overall performance of the contrastive learning task in the model. Experiments have demonstrated that Query-LIFE outperforms existing baselines. We have conducted ablation studies and human evaluations to validate the effectiveness of each module within Query-LIFE. Moreover, Query-LIFE has been deployed on Miravia Search. resulting in improved both relevance and conversion efficiency.
Anthology ID:
2025.coling-industry.2
Volume:
Proceedings of the 31st International Conference on Computational Linguistics: Industry Track
Month:
January
Year:
2025
Address:
Abu Dhabi, UAE
Editors:
Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, Kareem Darwish, Apoorv Agarwal
Venue:
COLING
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
21–28
Language:
URL:
https://aclanthology.org/2025.coling-industry.2/
DOI:
Bibkey:
Cite (ACL):
Hai Zhu, Yuankai Guo, Ronggang Dou, and Kai Liu. 2025. Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance. In Proceedings of the 31st International Conference on Computational Linguistics: Industry Track, pages 21–28, Abu Dhabi, UAE. Association for Computational Linguistics.
Cite (Informal):
Query-LIFE: Query-aware Language Image Fusion Embedding for E-Commerce Relevance (Zhu et al., COLING 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.coling-industry.2.pdf