Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > q-bio > arXiv:2503.03503

Help | Advanced Search

Quantitative Biology > Biomolecules

arXiv:2503.03503 (q-bio)
[Submitted on 5 Mar 2025 ]

Title: Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization

Title: 协作专家大语言模型引导的多目标分子优化

Authors:Jiajun Yu, Yizhen Zheng, Huan Yee Koh, Shirui Pan, Tianyue Wang, Haishuai Wang
Abstract: Molecular optimization is a crucial yet complex and time-intensive process that often acts as a bottleneck for drug development. Traditional methods rely heavily on trial and error, making multi-objective optimization both time-consuming and resource-intensive. Current AI-based methods have shown limited success in handling multi-objective optimization tasks, hampering their practical utilization. To address this challenge, we present MultiMol, a collaborative large language model (LLM) system designed to guide multi-objective molecular optimization. MultiMol comprises two agents, including a data-driven worker agent and a literature-guided research agent. The data-driven worker agent is a large language model being fine-tuned to learn how to generate optimized molecules considering multiple objectives, while the literature-guided research agent is responsible for searching task-related literature to find useful prior knowledge that facilitates identifying the most promising optimized candidates. In evaluations across six multi-objective optimization tasks, MultiMol significantly outperforms existing methods, achieving a 82.30% success rate, in sharp contrast to the 27.50% success rate of current strongest methods. To further validate its practical impact, we tested MultiMol on two real-world challenges. First, we enhanced the selectivity of Xanthine Amine Congener (XAC), a promiscuous ligand that binds both A1R and A2AR, successfully biasing it towards A1R. Second, we improved the bioavailability of Saquinavir, an HIV-1 protease inhibitor with known bioavailability limitations. Overall, these results indicate that MultiMol represents a highly promising approach for multi-objective molecular optimization, holding great potential to accelerate the drug development process and contribute to the advancement of pharmaceutical research.
Abstract: 分子优化是一个关键但复杂且耗时的过程,通常成为药物开发的瓶颈。 传统方法严重依赖试错法,使得多目标优化既耗时又耗费资源。 目前基于人工智能的方法在处理多目标优化任务方面表现出有限的成功,阻碍了其实际应用。 为了解决这一挑战,我们提出了MultiMol,一个协作式大型语言模型(LLM)系统,旨在指导多目标分子优化。 MultiMol包括两个代理,包括一个数据驱动的工作者代理和一个文献引导的研究代理。 数据驱动的工作者代理是一个经过微调的大型语言模型,旨在学习如何生成考虑多个目标的优化分子,而文献引导的研究代理负责搜索与任务相关的文献,以找到有助于识别最有希望的优化候选物的有用先验知识。 在六个多目标优化任务的评估中,MultiMol显著优于现有方法,成功率达到82.30%,这与当前最强方法的27.50%成功率形成鲜明对比。 为进一步验证其实际影响,我们在两个现实世界挑战中测试了MultiMol。 首先,我们提高了黄嘌呤胺共轭物(XAC)的选择性,这是一种能同时结合A1R和A2AR的多功能配体,成功地将其偏向于A1R。 其次,我们提高了沙奎那韦的生物利用度,这是一种已知生物利用度有限的HIV-1蛋白酶抑制剂。 总体而言,这些结果表明,MultiMol代表了一种高度有前景的多目标分子优化方法,具有加速药物开发过程和推动药物研究发展的巨大潜力。
Subjects: Biomolecules (q-bio.BM) ; Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2503.03503 [q-bio.BM]
  (or arXiv:2503.03503v1 [q-bio.BM] for this version)
  https://doi.org/10.48550/arXiv.2503.03503
arXiv-issued DOI via DataCite

Submission history

From: Jiajun Yu [view email]
[v1] Wed, 5 Mar 2025 13:47:55 UTC (13,967 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
license icon view license
Current browse context:
q-bio.BM
< prev   |   next >
new | recent | 2025-03
Change to browse by:
cs
cs.AI
cs.LG
q-bio

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号