Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers

Huang, Shih-Hong; Huang, Chieh-Yang; Deng, Yuxin; Shen, Hua; Kuan, Szu-Chi; Huang, Ting-Hao 'Kenneth'

arXiv:2212.03969v1 (cs)

[Submitted on 7 Dec 2022]

Authors: Shih-Hong Huang, Chieh-Yang Huang, Yuxin Deng, Hua Shen, Szu-Chi Kuan, Ting-Hao 'Kenneth' Huang

Abstract: Real-time crowd-powered systems, such as Chorus/Evorus, VizWiz, and Apparition, have shown how incorporating humans into automated systems could supplement where the automatic solutions fall short. However, one unspoken bottleneck of applying such architectures to more scenarios is the longer latency of including humans in the loop of automated systems. For the applications that have hard constraints in turnaround times, human-operated components' longer latency and large speed variation seem to be apparent deal breakers. This paper explicates and quantifies these limitations by using a human-powered text-based backend to hold conversations with users through a voice-only smart speaker. Smart speakers must respond to users' requests within seconds, so the workers behind the scenes only have a few seconds to compose answers. We measured the end-to-end system latency and the conversation quality with eight pairs of participants, showing the challenges and superiority of such systems.

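Comments:	This document is an extended technical report of the authors' paper "Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers", which was accepted by the Works-in-Progress and Demonstration track of the 10th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2022 WiP/Demo). https://youtu.be/iMDsX52VWGY
Subjects:	Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2212.03969 [cs.HC]
	(or arXiv:2212.03969v1 [cs.HC] for this version)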
	https://doi.org/10.48550/arXiv.2212.03969

Submission history

From: Shih-Hong Huang
[v1] Wed, 7 Dec 2022 21:57:02 UTC (12,553 KB)
