rQdia: Regularizing Q-Value Distributions With Image Augmentation

Lerman, Sam; Bi, Jing

计算机科学 > 机器学习

arXiv:2506.21367 (cs)

[提交于 2025年6月26日 ]

标题： rQdia：使用图像增强正则化Q值分布

标题： rQdia: Regularizing Q-Value Distributions With Image Augmentation

Authors:Sam Lerman, Jing Bi

摘要： rQdia通过在基于像素的深度强化学习中使用增强图像来对Q值分布进行正则化。通过一个简单的辅助损失，该损失通过均方误差使这些分布相等，rQdia在MuJoCo连续控制套件的9/12和10/12任务中分别提升了DrQ和SAC，并在Data-Efficient Rainbow的18/26个Atari街机环境中取得了提升。收益在样本效率和长期训练中都有体现。此外，添加rQdia最终使无模型的连续控制从像素中超越了状态编码基线。

摘要： rQdia regularizes Q-value distributions with augmented images in pixel-based deep reinforcement learning. With a simple auxiliary loss, that equalizes these distributions via MSE, rQdia boosts DrQ and SAC on 9/12 and 10/12 tasks respectively in the MuJoCo Continuous Control Suite from pixels, and Data-Efficient Rainbow on 18/26 Atari Arcade environments. Gains are measured in both sample efficiency and longer-term training. Moreover, the addition of rQdia finally propels model-free continuous control from pixels over the state encoding baseline.

主题：	机器学习 (cs.LG) ; 人工智能 (cs.AI)
引用方式：	arXiv:2506.21367 [cs.LG]
	(或者 arXiv:2506.21367v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2506.21367

提交历史

来自： Jing Bi [查看电子邮件]
[v1] 星期四， 2025 年 6 月 26 日 15:16:35 UTC (3,302 KB)

计算机科学 > 机器学习

标题： rQdia：使用图像增强正则化Q值分布

标题： rQdia: Regularizing Q-Value Distributions With Image Augmentation

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： rQdia：使用图像增强正则化Q值分布 显示英文标题

标题： rQdia: Regularizing Q-Value Distributions With Image Augmentation

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题： rQdia：使用图像增强正则化Q值分布