On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent

Kohler, Michael

Mathematics > Statistics Theory

arXiv:2504.03405 (math)

[Submitted on 4 Apr 2025 ]

Title: On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent

Title: 关于通过梯度下降学习的过参数化深度神经网络回归估计的收敛速度

Authors:Michael Kohler

Abstract: Nonparametric regression with random design is considered. The $L_2$ error with integration with respect to the design measure is used as the error criterion. An over-parametrized deep neural network regression estimate with logistic activation function is defined, where all weights are learned by gradient descent. It is shown that the estimate achieves a nearly optimal rate of convergence in case that the regression function is $(p,C)$--smooth.

Abstract: 考虑了具有随机设计的非参数回归问题。以设计测度关于积分的 $L_2$ 误差作为误差准则。定义了一个使用逻辑激活函数的过参数化深度神经网络回归估计量，其中所有权重都通过梯度下降法学习。结果表明，当回归函数是 $(p,C)$--光滑时，该估计量可以达到接近最优的收敛速度。

Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:2504.03405 [math.ST]
	(or arXiv:2504.03405v1 [math.ST] for this version)
	https://doi.org/10.48550/arXiv.2504.03405

Submission history

From: Michael Kohler [view email]
[v1] Fri, 4 Apr 2025 12:28:54 UTC (32 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math

< prev | next >

new | recent | 2025-04

Change to browse by:

math.ST
stat
stat.TH

References & Citations

export BibTeX citation

Mathematics > Statistics Theory

Title: On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent

Title: 关于通过梯度下降学习的过参数化深度神经网络回归估计的收敛速度

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Statistics Theory

Title: On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent Show Chinese title

Title: 关于通过梯度下降学习的过参数化深度神经网络回归估计的收敛速度

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Title: On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent