Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > eess > arXiv:2212.01371

Help | Advanced Search

Electrical Engineering and Systems Science > Systems and Control

arXiv:2212.01371 (eess)
[Submitted on 2 Dec 2022 ]

Title: Adaptive Robust Model Predictive Control via Uncertainty Cancellation

Title: 基于不确定性消除的自适应鲁棒模型预测控制

Authors:Rohan Sinha, James Harrison, Spencer M. Richards, Marco Pavone
Abstract: We propose a learning-based robust predictive control algorithm that compensates for significant uncertainty in the dynamics for a class of discrete-time systems that are nominally linear with an additive nonlinear component. Such systems commonly model the nonlinear effects of an unknown environment on a nominal system. We optimize over a class of nonlinear feedback policies inspired by certainty equivalent "estimate-and-cancel" control laws pioneered in classical adaptive control to achieve significant performance improvements in the presence of uncertainties of large magnitude, a setting in which existing learning-based predictive control algorithms often struggle to guarantee safety. In contrast to previous work in robust adaptive MPC, our approach allows us to take advantage of structure (i.e., the numerical predictions) in the a priori unknown dynamics learned online through function approximation. Our approach also extends typical nonlinear adaptive control methods to systems with state and input constraints even when we cannot directly cancel the additive uncertain function from the dynamics. We apply contemporary statistical estimation techniques to certify the system's safety through persistent constraint satisfaction with high probability. Moreover, we propose using Bayesian meta-learning algorithms that learn calibrated model priors to help satisfy the assumptions of the control design in challenging settings. Finally, we show in simulation that our method can accommodate more significant unknown dynamics terms than existing methods and that the use of Bayesian meta-learning allows us to adapt to the test environments more rapidly.
Abstract: 我们提出了一种基于学习的鲁棒预测控制算法,该算法能够补偿一类离散时间系统中动力学的显著不确定性,这些系统本质上是线性的,并具有一个加性非线性分量。 这类系统通常用于建模未知环境对名义系统产生的非线性效应。 我们优化了一类由确定性等价“估计并消除”控制律启发的非线性反馈策略,以在存在大范围不确定性的情况下实现显著的性能提升,在这种情况下,现有的基于学习的预测控制算法往往难以保证安全性。 与鲁棒自适应MPC之前的研究所不同,我们的方法使我们能够利用通过在线函数逼近学习到的先验未知动力学中的结构(即数值预测)。 我们的方法还将典型的非线性自适应控制方法扩展到具有状态和输入约束的系统,即使我们不能直接从动力学中消除加性不确定函数。 我们应用当代统计估计技术,通过高概率下的持续约束满足来认证系统的安全性。 此外,我们提出了使用贝叶斯元学习算法来学习校准的模型先验,以帮助在具有挑战性的设置中满足控制设计的假设。 最后,我们在仿真中展示了我们的方法可以容纳比现有方法更显著的未知动力学项,并且使用贝叶斯元学习可以使我们更快地适应测试环境。
Comments: Under review for the IEEE Transaction on Automatic Control, special issue on learning and control. arXiv admin note: text overlap with arXiv:2104.08261
Subjects: Systems and Control (eess.SY) ; Machine Learning (cs.LG); Robotics (cs.RO)
Cite as: arXiv:2212.01371 [eess.SY]
  (or arXiv:2212.01371v1 [eess.SY] for this version)
  https://doi.org/10.48550/arXiv.2212.01371
arXiv-issued DOI via DataCite

Submission history

From: Rohan Sinha [view email]
[v1] Fri, 2 Dec 2022 18:54:23 UTC (4,974 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • TeX Source
view license
Current browse context:
eess.SY
< prev   |   next >
new | recent | 2022-12
Change to browse by:
cs
cs.LG
cs.RO
cs.SY
eess

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号