Hardness of Learning Fixed Parities with Neural Networks

Shoshani, Itamar; Shamir, Ohad

计算机科学 > 机器学习

arXiv:2501.00817 (cs)

[提交于 2025年1月1日 (v1) ，最后修订 2025年1月8日 (此版本， v2)]

标题：神经网络学习固定奇偶性的问题难度

标题： Hardness of Learning Fixed Parities with Neural Networks

Authors:Itamar Shoshani, Ohad Shamir

摘要：学习奇偶函数是学习理论中的一个经典问题，尽管在计算上是可行的，但不适用于梯度相关的标准学习算法。这种难度通常通过统计查询下界来解释[Kearns, 1998]。然而，这些界限只意味着对于任何给定的算法，存在某些最坏情况下的奇偶函数难以学习。因此，它们无法解释为什么固定的奇偶函数——例如所有坐标上的完整奇偶函数——在实践中，至少使用标准预测器和梯度下降方法时，难以学习[Abbe and Boix-Adsera, 2022]。在本文中，我们通过证明对于任何固定大小的最小奇偶函数，将其作为目标函数来训练带有扰动梯度下降的一层隐藏层ReLU网络将无法产生任何有意义的结果，从而解决了这个开放问题。为了建立这一点，我们证明了一个关于线性阈值（或加权多数）函数的傅里叶系数衰减的新结果，这可能具有独立的兴趣。

摘要： Learning parity functions is a canonical problem in learning theory, which although computationally tractable, is not amenable to standard learning algorithms such as gradient-based methods. This hardness is usually explained via statistical query lower bounds [Kearns, 1998]. However, these bounds only imply that for any given algorithm, there is some worst-case parity function that will be hard to learn. Thus, they do not explain why fixed parities - say, the full parity function over all coordinates - are difficult to learn in practice, at least with standard predictors and gradient-based methods [Abbe and Boix-Adsera, 2022]. In this paper, we address this open problem, by showing that for any fixed parity of some minimal size, using it as a target function to train one-hidden-layer ReLU networks with perturbed gradient descent will fail to produce anything meaningful. To establish this, we prove a new result about the decay of the Fourier coefficients of linear threshold (or weighted majority) functions, which may be of independent interest.

评论：	一个更新的版本被上传以修正定理2陈述中的拼写错误
主题：	机器学习 (cs.LG) ; 机器学习 (stat.ML)
引用方式：	arXiv:2501.00817 [cs.LG]
	(或者 arXiv:2501.00817v2 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2501.00817

提交历史

来自： Itamar Shoshani [查看电子邮件]
[v1] 星期三， 2025 年 1 月 1 日 12:04:06 UTC (33 KB)
[v2] 星期三， 2025 年 1 月 8 日 10:42:53 UTC (33 KB)

计算机科学 > 机器学习

标题：神经网络学习固定奇偶性的问题难度

标题： Hardness of Learning Fixed Parities with Neural Networks

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 神经网络学习固定奇偶性的问题难度 显示英文标题

标题： Hardness of Learning Fixed Parities with Neural Networks

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：神经网络学习固定奇偶性的问题难度