Quantum-Enhanced Analysis and Grading of Vocal Performance

Agarwal, Rohan

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2509.00106v1 (eess)

[Submitted on 28 Aug 2025]

Title: Quantum-Enhanced Analysis and Grading of Vocal Performance

Authors: Rohan Agarwal

Abstract: We present QuantumMelody, a hybrid quantum-classical method for objective singing assessment. Grouped vocal features (pitch stability, dynamics, timbre) are encoded into a small simulated quantum circuit: each of the nine qubits is initialized with a Hadamard gate and then receives Rx, Ry, and Rz rotations, with intra-group and cross-group entanglement. The circuit's measurement probabilities are fused with spectrogram transformer embeddings to estimate a grade on a 2-5 label scale and to surface technique-level feedback. On 168 labeled 20-second excerpts, the hybrid reaches 74.29% agreement with expert graders, a gain of 12.86 percentage points over a classical-features baseline. Processing takes under a minute per recording on a laptop-class Qiskit simulator; we do not claim hardware speedups. This is a feasibility step toward interpretable, objective singing assessment in applied audio signal processing.
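The circuit described in the abstract can be illustrated with a minimal, dependency-free statevector sketch. It is not the author's implementation: the paper uses a Qiskit simulator, whereas this uses plain Python so that it runs anywhere. The abstract does not specify several details, so the following are assumptions made only for illustration: three qubits per feature group, one feature value in [0, 1] per qubit, the angle mapping `theta = pi * x` shared by all three rotations, CNOT as the entangling gate, and a simple chain topology within and between groups. The name `circuit_probabilities` is hypothetical.

```python
import cmath
import math

N_QUBITS = 9          # nine qubits, as stated in the abstract
GROUP_SIZE = 3        # ASSUMPTION: pitch, dynamics, timbre get three qubits each

def apply_1q(state, q, m):
    """Apply a 2x2 gate m to qubit q (qubit 0 is the least significant bit)."""
    bit = 1 << q
    out = list(state)
    for i in range(len(state)):
        if not i & bit:
            a0, a1 = state[i], state[i | bit]
            out[i] = m[0][0] * a0 + m[0][1] * a1
            out[i | bit] = m[1][0] * a0 + m[1][1] * a1
    return out

def apply_cnot(state, control, target):
    """Swap the target-bit amplitudes wherever the control bit is 1."""
    cbit, tbit = 1 << control, 1 << target
    out = list(state)
    for i in range(len(state)):
        if i & cbit and not i & tbit:
            out[i], out[i | tbit] = state[i | tbit], state[i]
    return out

S = 1 / math.sqrt(2)
H = [[S, S], [S, -S]]

def rx(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -1j * s], [-1j * s, c]]

def ry(t):
    c, s = math.cos(t / 2), math.sin(t / 2)
    return [[c, -s], [s, c]]

def rz(t):
    return [[cmath.exp(-1j * t / 2), 0], [0, cmath.exp(1j * t / 2)]]

def circuit_probabilities(features):
    """Return the 2**9 measurement probabilities for nine features in [0, 1]."""
    if len(features) != N_QUBITS:
        raise ValueError("expected one feature per qubit")
    state = [0j] * (1 << N_QUBITS)
    state[0] = 1 + 0j
    for q, x in enumerate(features):
        theta = math.pi * x  # ASSUMPTION: angle mapping is not given in the abstract
        for gate in (H, rx(theta), ry(theta), rz(theta)):
            state = apply_1q(state, q, gate)
    # ASSUMPTION: intra-group entanglement as a CNOT chain inside each group
    for g in range(N_QUBITS // GROUP_SIZE):
        base = g * GROUP_SIZE
        for k in range(GROUP_SIZE - 1):
            state = apply_cnot(state, base + k, base + k + 1)
    # ASSUMPTION: cross-group entanglement links the last qubit of a group to the first of the next
    for g in range(N_QUBITS // GROUP_SIZE - 1):
        state = apply_cnot(state, g * GROUP_SIZE + GROUP_SIZE - 1, (g + 1) * GROUP_SIZE)
    return [abs(a) ** 2 for a in state]

# Hypothetical feature values, three per group (pitch, dynamics, timbre).
probs = circuit_probabilities([0.8, 0.6, 0.7, 0.4, 0.5, 0.3, 0.9, 0.2, 0.6])
print(len(probs), round(sum(probs), 6))
```

The returned list is the kind of probability vector that, per the abstract, would then be fused with spectrogram transformer embeddings; how that fusion is done is not stated there and is left out of this sketch.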

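Comments:	4 pages, 5 figures. Hybrid quantum-classical feasibility study; simulator-only results
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
ACM classes:	H.5.5; I.2.6; I.5.4
Cite as:	arXiv:2509.00106 [eess.AS]
	(or arXiv:2509.00106v1 [eess.AS] for this version)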
	https://doi.org/10.48550/arXiv.2509.00106

Submission history

From: Rohan Agarwal
[v1] Thu, 28 Aug 2025 01:34:33 UTC (224 KB)
