Semi-blind source separation with multichannel variational autoencoder

Kameoka, Hirokazu; Li, Li; Inoue, Shota; Makino, Shoji

统计学 > 机器学习

arXiv:1808.00892v3 (stat)

[提交于 2018年8月2日 (v1) ，最后修订 2018年8月26日 (此版本， v3)]

标题：基于多通道变分自编码器的半盲源分离

标题： Semi-blind source separation with multichannel variational autoencoder

Authors:Hirokazu Kameoka, Li Li, Shota Inoue, Shoji Makino

摘要：本文提出了一种多通道源分离技术，称为多通道变分自编码器（MVAE）方法，该方法使用条件变分自编码器（CVAE）来建模和估计混合中的源的功率谱图。通过使用带有源类别标签的训练样本的谱图来训练CVAE，我们可以将训练好的解码器分布用作通用生成模型，能够根据指定的类别标签生成谱图。通过将潜在空间变量和类别标签视为该生成模型的未知参数，我们可以开发出一种收敛保证的半盲源分离算法，该算法包括迭代估计底层源的功率谱图以及分离矩阵。在实验评估中，我们的MVAE比基线方法产生了更好的分离性能。

摘要： This paper proposes a multichannel source separation technique called the multichannel variational autoencoder (MVAE) method, which uses a conditional VAE (CVAE) to model and estimate the power spectrograms of the sources in a mixture. By training the CVAE using the spectrograms of training examples with source-class labels, we can use the trained decoder distribution as a universal generative model capable of generating spectrograms conditioned on a specified class label. By treating the latent space variables and the class label as the unknown parameters of this generative model, we can develop a convergence-guaranteed semi-blind source separation algorithm that consists of iteratively estimating the power spectrograms of the underlying sources as well as the separation matrices. In experimental evaluations, our MVAE produced better separation performance than a baseline method.

主题：	机器学习 (stat.ML) ; 机器学习 (cs.LG)
引用方式：	arXiv:1808.00892 [stat.ML]
	(或者 arXiv:1808.00892v3 [stat.ML] 对于此版本)
	https://doi.org/10.48550/arXiv.1808.00892

提交历史

来自： Hirokazu Kameoka [查看电子邮件]
[v1] 星期四， 2018 年 8 月 2 日 16:30:51 UTC (2,669 KB)
[v2] 星期五， 2018 年 8 月 3 日 23:04:04 UTC (3,818 KB)
[v3] 星期日， 2018 年 8 月 26 日 07:29:03 UTC (3,438 KB)

统计学 > 机器学习

标题：基于多通道变分自编码器的半盲源分离

标题： Semi-blind source separation with multichannel variational autoencoder

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

统计学 > 机器学习

标题： 基于多通道变分自编码器的半盲源分离 显示英文标题

标题： Semi-blind source separation with multichannel variational autoencoder

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：基于多通道变分自编码器的半盲源分离