Enhancing Job Posting Classification with Multilingual Embeddings and Large Language Models

Hamit Kavas, Marc Serra-Vidal, Leo Wanner


Abstract
In the modern labour market, taxonomies such the European Skills, Competences, Qualifications and Occupations (ESCO) classification are used as an interlingua to match job postings with job seeker profiles. Both are classified with respect to ESCO occupations, and match if they align with the same occupation and the same skills assigned to the occupation. However, matching models usually struggle with the classification because of overlapping skills and similar definitions of occupations defined in the ESCO taxonomy. This often leads to imprecise classification outcomes. In this paper, we focus on the challenge of the classification of job postings written in Italian or Spanish against ESCO occupations written in English. We experiment with multilingual embeddings, zero-shot classification, and use of a large language model (LLM) and show that the use of an LLM leads to best results.
Anthology ID:
2024.clicit-1.53
Volume:
Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024)
Month:
December
Year:
2024
Address:
Pisa, Italy
Editors:
Felice Dell'Orletta, Alessandro Lenci, Simonetta Montemagni, Rachele Sprugnoli
Venue:
CLiC-it
SIG:
Publisher:
CEUR Workshop Proceedings
Note:
Pages:
440–450
Language:
URL:
https://aclanthology.org/2024.clicit-1.53/
DOI:
Bibkey:
Cite (ACL):
Hamit Kavas, Marc Serra-Vidal, and Leo Wanner. 2024. Enhancing Job Posting Classification with Multilingual Embeddings and Large Language Models. In Proceedings of the 10th Italian Conference on Computational Linguistics (CLiC-it 2024), pages 440–450, Pisa, Italy. CEUR Workshop Proceedings.
Cite (Informal):
Enhancing Job Posting Classification with Multilingual Embeddings and Large Language Models (Kavas et al., CLiC-it 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.clicit-1.53.pdf