Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

Scheller, Christian; Schraner, Yanick; Vogel, Manfred

计算机科学 > 机器学习

arXiv:2003.06066 (cs)

[提交于 2020年3月12日 ]

标题：通过在Minecraft中从示范中学习实现高效的强化学习

标题： Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

Authors:Christian Scheller, Yanick Schraner, Manfred Vogel

摘要：深度强化学习方法的样本效率低下是其在现实世界应用中的主要障碍。在本工作中，我们展示了如何通过人类演示来提高代理在Minecraft小游戏ObtainDiamond中的最终性能，仅需800万帧的环境交互。我们提出了一种训练过程，其中策略网络首先在人类数据上进行训练，然后通过强化学习进行微调。使用策略利用机制、经验回放和对抗灾难性遗忘的额外损失，我们的最佳代理能够达到平均分数48。我们提出解决方案在NeurIPS MineRL竞赛中获得了样本高效强化学习的第三名。

摘要： Sample inefficiency of deep reinforcement learning methods is a major obstacle for their use in real-world applications. In this work, we show how human demonstrations can improve final performance of agents on the Minecraft minigame ObtainDiamond with only 8M frames of environment interaction. We propose a training procedure where policy networks are first trained on human data and later fine-tuned by reinforcement learning. Using a policy exploitation mechanism, experience replay and an additional loss against catastrophic forgetting, our best agent was able to achieve a mean score of 48. Our proposed solution placed 3rd in the NeurIPS MineRL Competition for Sample-Efficient Reinforcement Learning.

评论：	10页，2图
主题：	机器学习 (cs.LG) ; 机器学习 (stat.ML)
引用方式：	arXiv:2003.06066 [cs.LG]
	(或者 arXiv:2003.06066v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2003.06066

提交历史

来自： Christian Scheller [查看电子邮件]
[v1] 星期四， 2020 年 3 月 12 日 23:46:16 UTC (1,214 KB)

计算机科学 > 机器学习

标题：通过在Minecraft中从示范中学习实现高效的强化学习

标题： Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 通过在Minecraft中从示范中学习实现高效的强化学习 显示英文标题

标题： Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：通过在Minecraft中从示范中学习实现高效的强化学习