Multi-Method Analysis of Mathematics Placement Assessments: Classical, Machine Learning, and Clustering Approaches

Allagan, Julian D.; Singleton, Dasia A.; Perry, Shanae N.; Morgan, Gabrielle C.; Morgan, Essence A.

计算机科学 > 机器学习

arXiv:2511.04667 (cs)

[提交于 2025年11月6日 ]

标题：多方法分析数学分班评估：经典方法、机器学习和聚类方法

标题： Multi-Method Analysis of Mathematics Placement Assessments: Classical, Machine Learning, and Clustering Approaches

Authors:Julian D. Allagan, Dasia A. Singleton, Shanae N. Perry, Gabrielle C. Morgan, Essence A. Morgan

摘要：这项研究使用多方法框架结合经典测验理论、机器学习和无监督聚类，评估了对198名学生进行的40题数学分班考试。经典测验理论分析显示，55%的题目达到优秀区分度（$D \geq 0.40$），而30%的题目表现出较差的区分度（$D < 0.20$），需要替换。第6题（图表解读）成为考试中最有效的区分题，达到完美区分度（$D = 1.000$）、最高方差分析F统计量（$F = 4609.1$）和最大随机森林特征重要性（0.206），占预测力的20.6%。机器学习算法表现出卓越的性能，随机森林和梯度提升分别达到97.5%和96.0%的交叉验证准确率。 K均值聚类识别出一个自然的二元能力结构，边界为42.5%，与机构设定的55%阈值不同，表明可能存在过度归入补救类别的现象。两簇解决方案表现出极高的稳定性（引导程序ARI = 0.855），下簇纯度完美。各方法的汇聚证据支持具体的改进措施：替换区分度较差的题目，实施两阶段评估，并将随机森林预测与透明机制结合。这些发现表明，多方法整合为基于证据的数学分班优化提供了坚实的实证基础。

摘要： This study evaluates a 40-item mathematics placement examination administered to 198 students using a multi-method framework combining Classical Test Theory, machine learning, and unsupervised clustering. Classical Test Theory analysis reveals that 55\% of items achieve excellent discrimination ($D \geq 0.40$) while 30\% demonstrate poor discrimination ($D < 0.20$) requiring replacement. Question 6 (Graph Interpretation) emerges as the examination's most powerful discriminator, achieving perfect discrimination ($D = 1.000$), highest ANOVA F-statistic ($F = 4609.1$), and maximum Random Forest feature importance (0.206), accounting for 20.6\% of predictive power. Machine learning algorithms demonstrate exceptional performance, with Random Forest and Gradient Boosting achieving 97.5\% and 96.0\% cross-validation accuracy. K-means clustering identifies a natural binary competency structure with a boundary at 42.5\%, diverging from the institutional threshold of 55\% and suggesting potential overclassification into remedial categories. The two-cluster solution exhibits exceptional stability (bootstrap ARI = 0.855) with perfect lower-cluster purity. Convergent evidence across methods supports specific refinements: replace poorly discriminating items, implement a two-stage assessment, and integrate Random Forest predictions with transparency mechanisms. These findings demonstrate that multi-method integration provides a robust empirical foundation for evidence-based mathematics placement optimization.

评论：	28页，8表，4图，NAM会议
主题：	机器学习 (cs.LG)
MSC 类：	97C70, 62P25, 62H30, 68T05
引用方式：	arXiv:2511.04667 [cs.LG]
	(或者 arXiv:2511.04667v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2511.04667

提交历史

来自： Julian Allagan [查看电子邮件]
[v1] 星期四， 2025 年 11 月 6 日 18:53:07 UTC (31 KB)

计算机科学 > 机器学习

标题：多方法分析数学分班评估：经典方法、机器学习和聚类方法

标题： Multi-Method Analysis of Mathematics Placement Assessments: Classical, Machine Learning, and Clustering Approaches

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 多方法分析数学分班评估：经典方法、机器学习和聚类方法 显示英文标题

标题： Multi-Method Analysis of Mathematics Placement Assessments: Classical, Machine Learning, and Clustering Approaches

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：多方法分析数学分班评估：经典方法、机器学习和聚类方法