Robust synchronization and policy adaptation for networked heterogeneous agents

Arevalo-Castiblanco, Miguel F.; and, Eduardo Mojica-Nava; Uribe, César A.

电气工程与系统科学 > 系统与控制

arXiv:2409.03273 (eess)

[提交于 2024年9月5日 ]

标题：具有鲁棒同步和策略适应性的网络异构智能体

标题： Robust synchronization and policy adaptation for networked heterogeneous agents

Authors:Miguel F. Arevalo-Castiblanco, Eduardo Mojica-Nava and, César A. Uribe

摘要：我们提出了一种针对具有系统不确定性及输入幅度饱和的非线性异构代理的领导者-跟随者网络的鲁棒自适应在线同步方法。使用分布式输入幅度饱和自适应控制与强化学习（DMSAC-RL）实现同步，该方法通过使用强化学习（RL）策略改进了在现成模型上训练的策略的实证性能。领导者观察参考模型的性能，而跟随者观察它们所连接的代理的状态和动作，但不观察参考模型。领导者和跟随者可能与RL控制策略被训练的参考模型不同。 DMSAC-RL 使用一个内部循环，以增强输入的形式调整代理的学习策略，以解决分布式控制问题，包括输入匹配的不确定性参数。我们证明了异构网络的同步误差是统一最终有界的（UUB）。多输入多输出（MIMO）系统的网络数值分析支持我们的理论结果。

摘要： We propose a robust adaptive online synchronization method for leader-follower networks of nonlinear heterogeneous agents with system uncertainties and input magnitude saturation. Synchronization is achieved using a Distributed input Magnitude Saturation Adaptive Control with Reinforcement Learning (DMSAC-RL), which improves the empirical performance of policies trained on off-the-shelf models using Reinforcement Learning (RL) strategies. The leader observes the performance of a reference model, and followers observe the states and actions of the agents they are connected to, but not the reference model. The leader and followers may differ from the reference model in which the RL control policy was trained. DMSAC-RL uses an internal loop that adjusts the learned policy for the agents in the form of augmented input to solve the distributed control problem, including input-matched uncertainty parameters. We show that the synchronization error of the heterogeneous network is Uniformly Ultimately Bounded (UUB). Numerical analysis of a network of Multiple Input Multiple Output (MIMO) systems supports our theoretical findings.

评论：	30页，12幅图，会议论文
主题：	系统与控制 (eess.SY)
引用方式：	arXiv:2409.03273 [eess.SY]
	(或者 arXiv:2409.03273v1 [eess.SY] 对于此版本)
	https://doi.org/10.48550/arXiv.2409.03273

提交历史

来自： Miguel F Arevalo-Castiblanco Mr. [查看电子邮件]
[v1] 星期四， 2024 年 9 月 5 日 06:31:19 UTC (2,696 KB)

电气工程与系统科学 > 系统与控制

标题：具有鲁棒同步和策略适应性的网络异构智能体

标题： Robust synchronization and policy adaptation for networked heterogeneous agents

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

电气工程与系统科学 > 系统与控制

标题： 具有鲁棒同步和策略适应性的网络异构智能体 显示英文标题

标题： Robust synchronization and policy adaptation for networked heterogeneous agents

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：具有鲁棒同步和策略适应性的网络异构智能体