Consistency of cross validation for comparing regression procedures

Yang, Yuhong

doi:10.1214/009053607000000514

数学 > 统计理论

arXiv:0803.2963 (math)

[提交于 2008年3月20日 ]

标题：交叉验证在比较回归方法中的一致性

标题： Consistency of cross validation for comparing regression procedures

Authors:Yuhong Yang

摘要：交叉验证（CV）的理论发展主要集中在从一组有限维模型（例如，线性回归中的子集或阶数选择）中选择一个，或者选择一个平滑参数（例如，核平滑的带宽）。然而，当应用于比较参数方法和非参数方法之间或在非参数方法内部时，关于交叉验证的一致性知之甚少。我们证明，在某些条件下，通过适当选择数据分割比例，交叉验证在选择更好程序方面是一致的，即概率趋近于1。我们的结果揭示了交叉验证有趣的行为。当比较两个以相同非参数速率收敛的模型（程序）时，与参数情况相反，交叉验证中用于评估的数据比例不需要在规模上占主导地位。此外，它甚至可以比用于估计的比例更小，而不会影响一致性属性。

摘要： Theoretical developments on cross validation (CV) have mainly focused on selecting one among a list of finite-dimensional models (e.g., subset or order selection in linear regression) or selecting a smoothing parameter (e.g., bandwidth for kernel smoothing). However, little is known about consistency of cross validation when applied to compare between parametric and nonparametric methods or within nonparametric methods. We show that under some conditions, with an appropriate choice of data splitting ratio, cross validation is consistent in the sense of selecting the better procedure with probability approaching 1. Our results reveal interesting behavior of cross validation. When comparing two models (procedures) converging at the same nonparametric rate, in contrast to the parametric case, it turns out that the proportion of data used for evaluation in CV does not need to be dominating in size. Furthermore, it can even be of a smaller order than the proportion for estimation while not affecting the consistency property.

评论：	发表于 http://dx.doi.org/10.1214/009053607000000514 的《统计学年鉴》（http://www.imstat.org/aos/）由数理统计学会（http://www.imstat.org）出版
主题：	统计理论 (math.ST)
MSC 类：	62G07, 62B10 (Primary) 62C20 (Secondary)
引用方式：	arXiv:0803.2963 [math.ST]
	(或者 arXiv:0803.2963v1 [math.ST] 对于此版本)
	https://doi.org/10.48550/arXiv.0803.2963
期刊参考：	IMS-AOS-AOS0284
相关 DOI:	https://doi.org/10.1214/009053607000000514

提交历史

来自： Yuhong Yang [查看电子邮件]
[v1] 星期四， 2008 年 3 月 20 日 10:52:01 UTC (232 KB)

数学 > 统计理论

标题：交叉验证在比较回归方法中的一致性

标题： Consistency of cross validation for comparing regression procedures

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

数学 > 统计理论

标题： 交叉验证在比较回归方法中的一致性 显示英文标题

标题： Consistency of cross validation for comparing regression procedures

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：交叉验证在比较回归方法中的一致性