Improving the Backpropagation Algorithm with Consequentialism Weight Updates over Mini-Batches

Paeedeh, Naeem; Ghiasi-Shirazi, Kamaledin

arXiv:2003.05164 (cs)

[Submitted on 11 Mar 2020 (v1), last revised 2 Jan 2021 (this version, v2)]

Authors: Naeem Paeedeh, Kamaledin Ghiasi-Shirazi

Abstract: Many attempts have been made to improve adaptive filters, and these can also be useful for improving backpropagation (BP). Normalized least mean squares (NLMS) is one of the most successful algorithms derived from least mean squares (LMS). However, it has not previously been extended to multi-layer neural networks. Here, we first show that a multi-layer neural network can be considered a stack of adaptive filters. Additionally, we introduce interpretations of NLMS for a single fully-connected (FC) layer that are more comprehensible than the complicated geometric interpretation in the affine projection algorithm (APA), that generalize easily to, for instance, convolutional neural networks, and that also work better with mini-batch training. With this new viewpoint, we introduce a better algorithm that predicts and then corrects the adverse consequences of the actions that take place in BP, even before they happen. Finally, the proposed method is compatible with stochastic gradient descent (SGD) and applicable to momentum-based derivatives such as RMSProp, Adam, and NAG. Our experiments show the usefulness of our algorithm in the training of deep neural networks.
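As background for the LMS and NLMS terms used in the abstract, the following is a minimal sketch of the textbook updates for a single linear filter. It is not the algorithm proposed in the paper; the function names, step sizes, and data are illustrative assumptions. It shows how normalizing by the input energy keeps one update from overshooting the target on the sample it was computed from.

```python
import random


def predict(w, x):
    """Output of a linear filter with weights w on input x."""
    return sum(wi * xi for wi, xi in zip(w, x))


def lms_step(w, x, d, mu):
    """Textbook LMS update: w <- w + mu * e * x, where e = d - w.x."""
    e = d - predict(w, x)
    return [wi + mu * e * xi for wi, xi in zip(w, x)]


def nlms_step(w, x, d, mu, eps=1e-8):
    """Textbook NLMS update: the LMS step divided by ||x||^2 (+ eps for safety)."""
    e = d - predict(w, x)
    norm_sq = sum(xi * xi for xi in x) + eps
    return [wi + mu * e * xi / norm_sq for wi, xi in zip(w, x)]


def fit_nlms(w_true, steps=200, seed=0):
    """Identify a noiseless linear system with NLMS (hypothetical demo data)."""
    rng = random.Random(seed)
    w = [0.0] * len(w_true)
    for _ in range(steps):
        x = [rng.uniform(-10.0, 10.0) for _ in w_true]
        d = predict(w_true, x)  # desired output of the unknown system
        w = nlms_step(w, x, d, mu=1.0)
    return w


if __name__ == "__main__":
    x, d = [3.0, 4.0], 10.0
    w0 = [0.0, 0.0]
    # Error on the same sample after a single update with mu = 1.
    print("LMS  posterior error:", d - predict(lms_step(w0, x, d, 1.0), x))
    print("NLMS posterior error:", d - predict(nlms_step(w0, x, d, 1.0), x))
    print("NLMS estimate:", fit_nlms([0.5, -1.0, 2.0]))
```

With a large-magnitude input, the unnormalized LMS step with the same step size overshoots badly on the very sample it was computed from, while the NLMS step lands almost exactly on the target. This is only the classical single-filter case that the abstract takes as its starting point.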

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2003.05164 [cs.LG]
 	(or arXiv:2003.05164v2 [cs.LG] for this version)
 	https://doi.org/10.48550/arXiv.2003.05164

Submission history

From: Naeem Paeedeh
[v1] Wed, 11 Mar 2020 08:45:36 UTC (4,784 KB)
[v2] Sat, 2 Jan 2021 03:41:00 UTC (4,436 KB)
