Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > math > arXiv:2504.03405

Help | Advanced Search

Mathematics > Statistics Theory

arXiv:2504.03405 (math)
[Submitted on 4 Apr 2025 ]

Title: On the rate of convergence of an over-parametrized deep neural network regression estimate learned by gradient descent

Title: 关于通过梯度下降学习的过参数化深度神经网络回归估计的收敛速度

Authors:Michael Kohler
Abstract: Nonparametric regression with random design is considered. The $L_2$ error with integration with respect to the design measure is used as the error criterion. An over-parametrized deep neural network regression estimate with logistic activation function is defined, where all weights are learned by gradient descent. It is shown that the estimate achieves a nearly optimal rate of convergence in case that the regression function is $(p,C)$--smooth.
Abstract: 考虑了具有随机设计的非参数回归问题。 以设计测度关于积分的 $L_2$ 误差作为误差准则。 定义了一个使用逻辑激活函数的过参数化深度神经网络回归估计量, 其中所有权重都通过梯度下降法学习。 结果表明,当回归函数是 $(p,C)$--光滑时,该估计量可以达到接近最优的收敛速度。
Subjects: Statistics Theory (math.ST)
Cite as: arXiv:2504.03405 [math.ST]
  (or arXiv:2504.03405v1 [math.ST] for this version)
  https://doi.org/10.48550/arXiv.2504.03405
arXiv-issued DOI via DataCite

Submission history

From: Michael Kohler [view email]
[v1] Fri, 4 Apr 2025 12:28:54 UTC (32 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
  • Other Formats
license icon view license
Current browse context:
math
< prev   |   next >
new | recent | 2025-04
Change to browse by:
math.ST
stat
stat.TH

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号