Diagonally-Weighted Generalized Method of Moments Estimation for Gaussian Mixture Modeling

Zhang, Liu; Mickelin, Oscar; Xu, Sheng; Singer, Amit

计算机科学 > 机器学习

arXiv:2507.20459 (cs)

[提交于 2025年7月28日 ]

标题：对角加权广义矩方法在高斯混合建模中的估计

标题： Diagonally-Weighted Generalized Method of Moments Estimation for Gaussian Mixture Modeling

Authors:Liu Zhang, Oscar Mickelin, Sheng Xu, Amit Singer

摘要：自皮尔逊[《皇家学会哲学汇刊》A, 185 (1894), 第71-110页]首次将矩方法(MM)应用于将数据建模为一维高斯混合以来，基于矩的估计方法已经大量出现。在这些方法中，广义矩方法(GMM)通过适当加权矩来提高MM的统计效率。然而，MM和GMM的计算复杂度和存储复杂度随着维度的增加呈指数增长，这使得这些方法在高维数据或需要更高阶矩的情况下变得不切实际。这种计算瓶颈在GMM中更为严重，因为它还需要估计一个大的权重矩阵。为了克服这些瓶颈，我们提出了对角加权GMM(DGMM)，它在统计效率、计算复杂度和数值稳定性之间取得了平衡。我们将DGMM应用于研究弱分离异方差低秩高斯混合的参数估计问题，并设计了一种计算高效且数值稳定的算法，该算法无需显式计算或存储矩张量即可获得DGMM估计量。我们实现了所提出的算法，并通过实验验证了DGMM的优势：在数值研究中，DGMM在要求显著更短运行时间的同时获得了更小的估计误差。代码和数据将在发表后在https://github.com/liu-lzhang/dgmm上提供。

摘要： Since Pearson [Philosophical Transactions of the Royal Society of London. A, 185 (1894), pp. 71-110] first applied the method of moments (MM) for modeling data as a mixture of one-dimensional Gaussians, moment-based estimation methods have proliferated. Among these methods, the generalized method of moments (GMM) improves the statistical efficiency of MM by weighting the moments appropriately. However, the computational complexity and storage complexity of MM and GMM grow exponentially with the dimension, making these methods impractical for high-dimensional data or when higher-order moments are required. Such computational bottlenecks are more severe in GMM since it additionally requires estimating a large weighting matrix. To overcome these bottlenecks, we propose the diagonally-weighted GMM (DGMM), which achieves a balance among statistical efficiency, computational complexity, and numerical stability. We apply DGMM to study the parameter estimation problem for weakly separated heteroscedastic low-rank Gaussian mixtures and design a computationally efficient and numerically stable algorithm that obtains the DGMM estimator without explicitly computing or storing the moment tensors. We implement the proposed algorithm and empirically validate the advantages of DGMM: in numerical studies, DGMM attains smaller estimation errors while requiring substantially shorter runtime than MM and GMM. The code and data will be available upon publication at https://github.com/liu-lzhang/dgmm.

主题：	机器学习 (cs.LG) ; 数值分析 (math.NA); 统计理论 (math.ST); 方法论 (stat.ME); 机器学习 (stat.ML)
MSC 类：	62F12, 62H30, 15A69, 65Y20
引用方式：	arXiv:2507.20459 [cs.LG]
	(或者 arXiv:2507.20459v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2507.20459

提交历史

来自： Liu Zhang [查看电子邮件]
[v1] 星期一， 2025 年 7 月 28 日 01:24:55 UTC (3,523 KB)

计算机科学 > 机器学习

标题：对角加权广义矩方法在高斯混合建模中的估计

标题： Diagonally-Weighted Generalized Method of Moments Estimation for Gaussian Mixture Modeling

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 对角加权广义矩方法在高斯混合建模中的估计 显示英文标题

标题： Diagonally-Weighted Generalized Method of Moments Estimation for Gaussian Mixture Modeling

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：对角加权广义矩方法在高斯混合建模中的估计