Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > eess > arXiv:2201.03116

Help | Advanced Search

Electrical Engineering and Systems Science > Systems and Control

arXiv:2201.03116 (eess)
[Submitted on 10 Jan 2022 (v1) , last revised 26 Jan 2022 (this version, v2)]

Title: Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control

Title: 混合模型基础强化学习在细胞治疗制造过程控制中的机会

Authors:Hua Zheng, Wei Xie, Keqi Wang, Zheng Li
Abstract: Driven by the key challenges of cell therapy manufacturing, including high complexity, high uncertainty, and very limited process observations, we propose a hybrid model-based reinforcement learning (RL) to efficiently guide process control. We first create a probabilistic knowledge graph (KG) hybrid model characterizing the risk- and science-based understanding of biomanufacturing process mechanisms and quantifying inherent stochasticity, e.g., batch-to-batch variation. It can capture the key features, including nonlinear reactions, nonstationary dynamics, and partially observed state. This hybrid model can leverage existing mechanistic models and facilitate learning from heterogeneous process data. A computational sampling approach is used to generate posterior samples quantifying model uncertainty. Then, we introduce hybrid model-based Bayesian RL, accounting for both inherent stochasticity and model uncertainty, to guide optimal, robust, and interpretable dynamic decision making. Cell therapy manufacturing examples are used to empirically demonstrate that the proposed framework can outperform the classical deterministic mechanistic model assisted process optimization.
Abstract: 由细胞治疗制造中的关键挑战驱动,包括高复杂性、高不确定性以及非常有限的过程观察,我们提出了一种基于混合模型的强化学习(RL),以高效地指导过程控制。 我们首先创建了一个概率知识图(KG)混合模型,该模型描述了生物制造过程机制的风险和科学基础的理解,并量化了固有的随机性,例如批次间的变异。 它可以捕捉关键特征,包括非线性反应、非平稳动态和部分可观测的状态。 这种混合模型可以利用现有的机理模型,并促进从异构过程数据中进行学习。 采用计算采样方法生成量化模型不确定性的后验样本。 然后,我们引入基于混合模型的贝叶斯强化学习,同时考虑固有的随机性和模型不确定性,以指导最优、稳健和可解释的动态决策。 使用细胞治疗制造示例来实证证明,所提出的框架可以优于经典的确定性机理模型辅助过程优化。
Comments: 14 pages, 2 figures
Subjects: Systems and Control (eess.SY) ; Machine Learning (cs.LG)
Cite as: arXiv:2201.03116 [eess.SY]
  (or arXiv:2201.03116v2 [eess.SY] for this version)
  https://doi.org/10.48550/arXiv.2201.03116
arXiv-issued DOI via DataCite

Submission history

From: Wei Xie [view email]
[v1] Mon, 10 Jan 2022 00:01:19 UTC (2,771 KB)
[v2] Wed, 26 Jan 2022 04:36:23 UTC (2,606 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • TeX Source
  • Other Formats
license icon view license
Current browse context:
eess.SY
< prev   |   next >
new | recent | 2022-01
Change to browse by:
cs
cs.LG
cs.SY
eess

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号