Incorporating Preconditioning into Accelerated Approaches: Theoretical Guarantees and Practical Improvement

Trifonov, Stepan; Levin, Leonid; Chezhegov, Savelii; Beznosikov, Aleksandr

数学 > 优化与控制

arXiv:2505.23510 (math)

[提交于 2025年5月29日 ]

标题：将预处理纳入加速方法：理论保证与实际改进

标题： Incorporating Preconditioning into Accelerated Approaches: Theoretical Guarantees and Practical Improvement

Authors:Stepan Trifonov, Leonid Levin, Savelii Chezhegov, Aleksandr Beznosikov

摘要：机器学习和深度学习是广泛研究的领域，为许多现代问题提供了解决方案。由于与数据集规模相关的新问题的复杂性，高效的处理方法是必要的。在优化理论中，Heavy Ball 和 Nesterov 方法在其模型权重更新中使用了 \textit{动量}。另一方面，所考虑的最小化问题可能是病态条件问题，这影响了上述技术的应用有效性和有效性。对此问题的一种解决方案是 \textit{预处理}，它已经在诸如 \textsc{AdaGrad}、\textsc{RMSProp}、\textsc{亚当}等方法中进行了研究。尽管如此，动量加速和预条件设置尚未完全结合在一起进行探索。因此，我们提出了带预处理的加速梯度下降法（\textsc{PHB}）和带预处理的Nesterov方法（\textsc{PN}），并在假设\textit{统一的}成立时，给出了其收敛性的理论保证。此外，我们还提供了数值实验，证明了与未使用缩放的技术相比，在迭代复杂度和oracle复杂度方面具有更优越的性能。

摘要： Machine learning and deep learning are widely researched fields that provide solutions to many modern problems. Due to the complexity of new problems related to the size of datasets, efficient approaches are obligatory. In optimization theory, the Heavy Ball and Nesterov methods use \textit{momentum} in their updates of model weights. On the other hand, the minimization problems considered may be poorly conditioned, which affects the applicability and effectiveness of the aforementioned techniques. One solution to this issue is \textit{preconditioning}, which has already been investigated in approaches such as \textsc{AdaGrad}, \textsc{RMSProp}, \textsc{Adam} and others. Despite this, momentum acceleration and preconditioning have not been fully explored together. Therefore, we propose the Preconditioned Heavy Ball (\textsc{PHB}) and Preconditioned Nesterov method (\textsc{PN}) with theoretical guarantees of convergence under \textit{unified} assumption on the scaling matrix. Furthermore, we provide numerical experiments that demonstrate superior performance compared to the unscaled techniques in terms of iteration and oracle complexities.

评论：	18页，2幅图
主题：	优化与控制 (math.OC)
引用方式：	arXiv:2505.23510 [math.OC]
	(或者 arXiv:2505.23510v1 [math.OC] 对于此版本)
	https://doi.org/10.48550/arXiv.2505.23510

提交历史

来自： Savelii Chezhegov [查看电子邮件]
[v1] 星期四， 2025 年 5 月 29 日 14:53:30 UTC (202 KB)

数学 > 优化与控制

标题：将预处理纳入加速方法：理论保证与实际改进

标题： Incorporating Preconditioning into Accelerated Approaches: Theoretical Guarantees and Practical Improvement

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

数学 > 优化与控制

标题： 将预处理纳入加速方法：理论保证与实际改进 显示英文标题

标题： Incorporating Preconditioning into Accelerated Approaches: Theoretical Guarantees and Practical Improvement

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：将预处理纳入加速方法：理论保证与实际改进