Robustness Guarantees for Mode Estimation with an Application to Bandits

Pacchiano, Aldo; Jiang, Heinrich; Jordan, Michael I.

计算机科学 > 机器学习

arXiv:2003.02932 (cs)

[提交于 2020年3月5日 ]

标题：带有应用的博弈论模式估计的鲁棒性保证

标题： Robustness Guarantees for Mode Estimation with an Application to Bandits

Authors:Aldo Pacchiano, Heinrich Jiang, Michael I. Jordan

摘要：模式估计是统计学中的一个经典问题，在机器学习中有广泛的应用。尽管如此，对于在可能的对抗性数据污染下的鲁棒性特性，了解仍然有限。在本文中，我们在简单的随机化下给出了精确的鲁棒性保证以及隐私保证。然后，我们引入了一个多臂老虎机理论，其中值是奖励分布的模式而不是均值。我们证明了顶级臂识别、顶级m臂识别、上下文模式老虎机和无限连续臂顶级臂恢复问题的遗憾保证。我们在模拟中展示了我们的算法对由对抗性噪声序列引起的臂的扰动具有鲁棒性，因此使得模式老虎机在奖励可能包含异常值或对抗性破坏的情况下成为一个有吸引力的选择。

摘要： Mode estimation is a classical problem in statistics with a wide range of applications in machine learning. Despite this, there is little understanding in its robustness properties under possibly adversarial data contamination. In this paper, we give precise robustness guarantees as well as privacy guarantees under simple randomization. We then introduce a theory for multi-armed bandits where the values are the modes of the reward distributions instead of the mean. We prove regret guarantees for the problems of top arm identification, top m-arms identification, contextual modal bandits, and infinite continuous arms top arm recovery. We show in simulations that our algorithms are robust to perturbation of the arms by adversarial noise sequences, thus rendering modal bandits an attractive choice in situations where the rewards may have outliers or adversarial corruptions.

评论：	12页，7图，14附录页
主题：	机器学习 (cs.LG) ; 机器学习 (stat.ML)
引用方式：	arXiv:2003.02932 [cs.LG]
	(或者 arXiv:2003.02932v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2003.02932

提交历史

来自： Aldo Pacchiano [查看电子邮件]
[v1] 星期四， 2020 年 3 月 5 日 21:29:27 UTC (377 KB)

计算机科学 > 机器学习

标题：带有应用的博弈论模式估计的鲁棒性保证

标题： Robustness Guarantees for Mode Estimation with an Application to Bandits

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 带有应用的博弈论模式估计的鲁棒性保证 显示英文标题

标题： Robustness Guarantees for Mode Estimation with an Application to Bandits

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：带有应用的博弈论模式估计的鲁棒性保证