Skip to main content
CenXiv.org
This website is in trial operation, support us!
We gratefully acknowledge support from all contributors.
Contribute
Donate
cenxiv logo > cs > arXiv:2510.16192

Help | Advanced Search

Computer Science > Human-Computer Interaction

arXiv:2510.16192 (cs)
[Submitted on 17 Oct 2025 (v1) , last revised 22 Oct 2025 (this version, v2)]

Title: VoiceMorph: How AI Voice Morphing Reveals the Boundaries of Auditory Self-Recognition

Title: VoiceMorph:人工智能语音变形如何揭示听觉自我识别的边界

Authors:Kye Shimizu, Minghan Gao, Ananya Ganesh, Pattie Maes
Abstract: This study investigated auditory self-recognition boundaries using AI voice morphing technology, examining when individuals cease recognizing their own voice. Through controlled morphing between participants' voices and demographically matched targets at 1% increments using a mixed-methods design, we measured self-identification ratings and response times among 21 participants aged 18-64. Results revealed a critical recognition threshold at 35.2% morphing (95% CI [31.4, 38.1]). Older participants tolerated significantly higher morphing levels before losing self-recognition ($\beta$ = 0.617, p = 0.048), suggesting age-related vulnerabilities. Greater acoustic embedding distances predicted slower decision-making ($r \approx 0.5-0.53, p < 0.05$), with the longest response times for cloned versions of participants' own voices. Qualitative analysis revealed prosodic-based recognition strategies, universal voice manipulation discomfort, and awareness of applications spanning assistive technology to security risks. These findings establish foundational evidence for individual differences in voice morphing detection, with implications for AI ethics and vulnerable population protection as voice synthesis becomes accessible.
Abstract: 本研究利用人工智能语音变形技术探讨了听觉自我识别的边界,考察个体何时停止识别自己的声音。 通过使用混合方法设计,在参与者的声音和人口统计学匹配的目标之间以1%的增量进行受控变形,我们测量了21名年龄在18至64岁之间的参与者的自我认同评分和反应时间。 结果揭示了一个关键的识别阈值,为35.2%的变形(95% CI [31.4, 38.1])。 年长的参与者在失去自我识别前能容忍显著更高的变形水平($\beta$= 0.617,p = 0.048),这表明存在与年龄相关的脆弱性。 更大的声学嵌入距离预测了更慢的决策过程($r \approx 0.5-0.53, p < 0.05$),其中对参与者自己声音的克隆版本反应时间最长。 定性分析揭示了基于语调的识别策略、普遍的语音操纵不适感以及对从辅助技术到安全风险的应用的认识。 这些发现为语音变形检测中的个体差异提供了基础证据,对人工智能伦理和易受影响人群的保护具有意义,随着语音合成的普及,这一点尤为重要。
Subjects: Human-Computer Interaction (cs.HC)
Cite as: arXiv:2510.16192 [cs.HC]
  (or arXiv:2510.16192v2 [cs.HC] for this version)
  https://doi.org/10.48550/arXiv.2510.16192
arXiv-issued DOI via DataCite

Submission history

From: Minghan Gao [view email]
[v1] Fri, 17 Oct 2025 20:02:18 UTC (1,205 KB)
[v2] Wed, 22 Oct 2025 17:33:22 UTC (1,205 KB)
Full-text links:

Access Paper:

    View a PDF of the paper titled
  • View Chinese PDF
  • View PDF
  • HTML (experimental)
  • TeX Source
license icon view license
Current browse context:
cs.HC
< prev   |   next >
new | recent | 2025-10
Change to browse by:
cs

References & Citations

  • NASA ADS
  • Google Scholar
  • Semantic Scholar
a export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo

Bibliographic and Citation Tools

Bibliographic Explorer (What is the Explorer?)
Connected Papers (What is Connected Papers?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)

Code, Data and Media Associated with this Article

alphaXiv (What is alphaXiv?)
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Hugging Face (What is Huggingface?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)

Demos

Replicate (What is Replicate?)
Hugging Face Spaces (What is Spaces?)
TXYZ.AI (What is TXYZ.AI?)

Recommenders and Search Tools

Influence Flower (What are Influence Flowers?)
CORE Recommender (What is CORE?)
IArxiv Recommender (What is IArxiv?)
  • Author
  • Venue
  • Institution
  • Topic

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack

京ICP备2025123034号