ChatMol Copilot: An Agent for Molecular Modeling and Computation Powered by LLMs

Jinyuan Sun, Auston Li, Yifan Deng, Jiabo Li


Abstract
Large Language Models (LLMs) like ChatGPT excel at diverse tasks when given explicit instructions, yet they often struggle with specialized domains such as molecular science, lacking in-depth reasoning and sophisticated planning capabilities. To address these limitations, we introduce ChatMol Copilot, a chatbot-like agent specifically engineered for protein design and small molecule computations. ChatMol Copilot employs a multi-level abstraction framework to expand the LLM‘s capability. At the basic level, it integrates external computational tools through function calls, thus offloading complex tasks and enabling a focus on strategic decision-making. The second level is data abstraction. Large data sets (such as a large number of molecules created by a generative model) are stored in Redis cache, and the redis keys are referenced by LLMs for data sources involved in computation. The third level of abstraction allows the LLM to orchestrate these tools, either directly or via dynamically generated Python executables. Our evaluations demonstrate that ChatMol Copilot can adeptly manage molecular modeling tasks, effectively utilizing a variety of tools as directed. By simplifying access to sophisticated molecular modeling resources, ChatMol Copilot stands to significantly accelerate drug discovery and biotechnological innovation, empowering biochemists with advanced, user-friendly AI capabilities. The open-sourced code is available at https://github.com/ChatMol/ChatMol
Anthology ID:
2024.langmol-1.7
Volume:
Proceedings of the 1st Workshop on Language + Molecules (L+M 2024)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Carl Edwards, Qingyun Wang, Manling Li, Lawrence Zhao, Tom Hope, Heng Ji
Venues:
LangMol | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
55–65
Language:
URL:
https://aclanthology.org/2024.langmol-1.7
DOI:
10.18653/v1/2024.langmol-1.7
Bibkey:
Cite (ACL):
Jinyuan Sun, Auston Li, Yifan Deng, and Jiabo Li. 2024. ChatMol Copilot: An Agent for Molecular Modeling and Computation Powered by LLMs. In Proceedings of the 1st Workshop on Language + Molecules (L+M 2024), pages 55–65, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
ChatMol Copilot: An Agent for Molecular Modeling and Computation Powered by LLMs (Sun et al., LangMol-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.langmol-1.7.pdf