GLA: Grounding Large Language Models in Molecular Hierarchy for Chemical Understanding

Yingxu Li; Jingjie Zeng; Zekun Wang; Hongfei Lin (林鸿飞); Liang Yang (杨亮)

GLA: Grounding Large Language Models in Molecular Hierarchy for Chemical Understanding

Yingxu Li, Jingjie Zeng, Zekun Wang, Hongfei Lin, Liang Yang

Abstract

Conventional Euclidean geometries lead to structural distortion and entangle core pharmacophoric identities with peripheral groups. Existing molecule-language models, relying on linear or uniform encodings, often obscure the hierarchical organization of chemical semantics. To address this, we propose Geometric-Language Alignment (GLA), a framework integrating intrinsic molecular topology into large language models. GLA employs a mixed-curvature encoder that adaptively learns geometric representations through a gating mechanism. These representations are aligned with text via a dual-view contrastive objective and injected into a frozen language model. Experiments on cross-modal retrieval, captioning, and property prediction benchmarks show GLA consistently improves performance over baselines, suggesting that modeling geometric heterogeneity enhances the grounding between molecular structure and chemical language.

Anthology ID:: 2026.findings-acl.1515
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 30311–30323
Language:
URL:: https://aclanthology.org/2026.findings-acl.1515/
DOI:
Bibkey:
Cite (ACL):: Yingxu Li, Jingjie Zeng, Zekun Wang, Hongfei Lin, and Liang Yang. 2026. GLA: Grounding Large Language Models in Molecular Hierarchy for Chemical Understanding. In Findings of the Association for Computational Linguistics: ACL 2026, pages 30311–30323, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: GLA: Grounding Large Language Models in Molecular Hierarchy for Chemical Understanding (Li et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1515.pdf
Checklist:: 2026.findings-acl.1515.checklist.pdf

PDF Cite Search Checklist Fix data