MPC-guided Imitation Learning of Neural Network Policies for the Artificial Pancreas

Chen, Hongkai; Paoletti, Nicola; Smolka, Scott A.; Lin, Shan

计算机科学 > 机器学习

arXiv:2003.01283v1 (cs)

[提交于 2020年3月3日 ]

标题：基于MPC的神经网络策略模仿学习用于人工胰腺

标题： MPC-guided Imitation Learning of Neural Network Policies for the Artificial Pancreas

Authors:Hongkai Chen, Nicola Paoletti, Scott A. Smolka, Shan Lin

摘要：尽管模型预测控制（MPC）目前是人工胰腺（AP）中胰岛素控制的主要算法，但它通常需要复杂的在线优化，这对于资源受限的医疗设备来说是不可行的。 MPC通常依赖于状态估计，这是一个容易出错的过程。在本文中，我们介绍了一种新的AP控制方法，该方法使用模仿学习从MPC计算的示范中合成神经网络胰岛素策略。这种策略计算效率高，并且通过在训练时用完整的状态信息对MPC进行操作，它们可以直接将测量值映射到最优治疗决策，从而绕过状态估计。我们通过蒙特卡洛Dropout进行贝叶斯推断来学习策略，这使我们能够量化预测不确定性，并由此得出更安全的治疗决策。我们证明了在特定患者模型下训练的控制策略可以很好地推广（在模型参数和扰动分布方面）到患者群体，始终优于具有状态估计的传统MPC。

摘要： Even though model predictive control (MPC) is currently the main algorithm for insulin control in the artificial pancreas (AP), it usually requires complex online optimizations, which are infeasible for resource-constrained medical devices. MPC also typically relies on state estimation, an error-prone process. In this paper, we introduce a novel approach to AP control that uses Imitation Learning to synthesize neural-network insulin policies from MPC-computed demonstrations. Such policies are computationally efficient and, by instrumenting MPC at training time with full state information, they can directly map measurements into optimal therapy decisions, thus bypassing state estimation. We apply Bayesian inference via Monte Carlo Dropout to learn policies, which allows us to quantify prediction uncertainty and thereby derive safer therapy decisions. We show that our control policies trained under a specific patient model readily generalize (in terms of model parameters and disturbance distributions) to patient cohorts, consistently outperforming traditional MPC with state estimation.

主题：	机器学习 (cs.LG) ; 系统与控制 (eess.SY); 机器学习 (stat.ML)
引用方式：	arXiv:2003.01283 [cs.LG]
	(或者 arXiv:2003.01283v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2003.01283

提交历史

来自： Hongkai Chen [查看电子邮件]
[v1] 星期二， 2020 年 3 月 3 日 01:25:45 UTC (766 KB)

计算机科学 > 机器学习

标题：基于MPC的神经网络策略模仿学习用于人工胰腺

标题： MPC-guided Imitation Learning of Neural Network Policies for the Artificial Pancreas

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 基于MPC的神经网络策略模仿学习用于人工胰腺 显示英文标题

标题： MPC-guided Imitation Learning of Neural Network Policies for the Artificial Pancreas

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：基于MPC的神经网络策略模仿学习用于人工胰腺