ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Xiao Yu; Ruize Xu; Chengyuan Xue; Jinzhong Zhang; Xu Ma; Zhou Yu

doi:10.18653/v1/2025.findings-acl.661

ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining

Xiao Yu, Ruize Xu, Chengyuan Xue, Jinzhong Zhang, Xu Ma, Zhou Yu

Abstract

A reliable resume-job matching system helps a company recommend suitable candidates from a pool of resumes and helps a job seeker find relevant jobs from a list of job posts. However, since job seekers apply only to a few jobs, interaction labels in resume-job datasets are sparse. We introduce ConFit v2, an improvement over ConFit to tackle this sparsity problem. We propose two techniques to enhance the encoder’s contrastive training process: augmenting job data with hypothetical reference resume generated by a large language model; and creating high-quality hard negatives from unlabeled resume/job pairs using a novel hard-negative mining strategy. This method also simplifies the representation space of the encoder. We evaluate ConFit v2 on two real-world datasets and demonstrate that it outperforms ConFit and prior methods (including BM25 and OpenAI text-embedding-003), achieving an average absolute improvement of 13.8% in recall and 17.5% in nDCG across job-ranking and resume-ranking tasks.

Anthology ID:: 2025.findings-acl.661
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12775–12790
Language:
URL:: https://aclanthology.org/2025.findings-acl.661/
DOI:: 10.18653/v1/2025.findings-acl.661
Bibkey:
Cite (ACL):: Xiao Yu, Ruize Xu, Chengyuan Xue, Jinzhong Zhang, Xu Ma, and Zhou Yu. 2025. ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining. In Findings of the Association for Computational Linguistics: ACL 2025, pages 12775–12790, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: ConFit v2: Improving Resume-Job Matching using Hypothetical Resume Embedding and Runner-Up Hard-Negative Mining (Yu et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.661.pdf

PDF Cite Search Fix data