LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Yang Liu; Jiaye Yang; Weigang Li; Jiahui Liang; Yang Li; Lingyong Yan

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Yang Liu, Jiaye Yang, Weikang Li, Jiahui Liang, Yang Li, Lingyong Yan

Abstract

We introduce LM-Lexicon, an innovative definition modeling approach that incorporates data clustering, semantic expert learning, and model merging using a sparse mixture-of-experts architecture. By decomposing the definition modeling task into specialized semantic domains, where small language models are trained as domain experts, LM-Lexicon achieves substantial improvements (+7% BLEU score compared with the prior state-of-the-art model) over existing methods on five widely used benchmarks. Empirically, we demonstrate that 1) the clustering strategy enables fine-grained expert specialization with nearly 10% improvement in definition quality; 2) the semantic-aware domain-level routing mechanism achieves higher expert efficacy (+1%) than conventional token-level routing; and 3) further performance gains can be obtained through test-time compute and semantic expert scaling. Our work advances definition modeling while providing insights into the development of efficient language models for semantic-intensive applications.

Anthology ID:: 2026.eacl-long.1
Volume:: Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1–22
Language:
URL:: https://aclanthology.org/2026.eacl-long.1/
DOI:
Bibkey:
Cite (ACL):: Yang Liu, Jiaye Yang, Weikang Li, Jiahui Liang, Yang Li, and Lingyong Yan. 2026. LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1–22, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts (Liu et al., EACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.eacl-long.1.pdf
Checklist:: 2026.eacl-long.1.checklist.pdf

PDF Cite Search Checklist Fix data