An Active Inference Model of Covert and Overt Visual Attention

Mišić, Tin; Koledić, Karlo; Bonsignorio, Fabio; Petrović, Ivan; Marković, Ivan

计算机科学 > 计算机视觉与模式识别

arXiv:2505.03856 (cs)

[提交于 2025年5月6日 ]

标题：一个关于显性和隐性视觉注意力的主动推理模型

标题： An Active Inference Model of Covert and Overt Visual Attention

Authors:Tin Mišić, Karlo Koledić, Fabio Bonsignorio, Ivan Petrović, Ivan Marković

摘要：选择性地关注相关刺激并过滤掉干扰对于处理复杂高维感官输入的代理来说至关重要。本文通过主动推理框架引入了一个关于显性和隐性视觉注意的模型，利用感觉精度的动态优化来最小化自由能。该模型根据当前环境信念和感官输入确定视觉感官精度，影响显性和隐性模式下的注意力分配。为了测试模型的有效性，我们分析了它在波斯纳提示任务和使用二维视觉数据的简单目标聚焦任务中的行为。测量反应时间以研究外源性和内源性注意力以及有效和无效提示之间的相互作用。结果显示，外源性和有效提示通常比内源性和无效提示导致更快的反应时间。此外，该模型表现出类似于返回抑制的行为，在特定的提示-目标启动异步间隔后，先前注意的位置会受到抑制。最后，我们调查了显性注意的不同方面，结果显示不自主的反射性扫视比有意的扫视发生得更快，但代价是适应性较差。

摘要： The ability to selectively attend to relevant stimuli while filtering out distractions is essential for agents that process complex, high-dimensional sensory input. This paper introduces a model of covert and overt visual attention through the framework of active inference, utilizing dynamic optimization of sensory precisions to minimize free-energy. The model determines visual sensory precisions based on both current environmental beliefs and sensory input, influencing attentional allocation in both covert and overt modalities. To test the effectiveness of the model, we analyze its behavior in the Posner cueing task and a simple target focus task using two-dimensional(2D) visual data. Reaction times are measured to investigate the interplay between exogenous and endogenous attention, as well as valid and invalid cueing. The results show that exogenous and valid cues generally lead to faster reaction times compared to endogenous and invalid cues. Furthermore, the model exhibits behavior similar to inhibition of return, where previously attended locations become suppressed after a specific cue-target onset asynchrony interval. Lastly, we investigate different aspects of overt attention and show that involuntary, reflexive saccades occur faster than intentional ones, but at the expense of adaptability.

评论：	7页，7幅图。代码可在 <https://github.com/unizgfer-lamor/ainf-visual-attention> 获取。
主题：	计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
ACM 类：	I.2.6; I.2.10
引用方式：	arXiv:2505.03856 [cs.CV]
	(或者 arXiv:2505.03856v1 [cs.CV] 对于此版本)
	https://doi.org/10.48550/arXiv.2505.03856

提交历史

来自： Tin Mišić [查看电子邮件]
[v1] 星期二， 2025 年 5 月 6 日 09:26:00 UTC (1,160 KB)

计算机科学 > 计算机视觉与模式识别

标题：一个关于显性和隐性视觉注意力的主动推理模型

标题： An Active Inference Model of Covert and Overt Visual Attention

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 计算机视觉与模式识别

标题： 一个关于显性和隐性视觉注意力的主动推理模型 显示英文标题

标题： An Active Inference Model of Covert and Overt Visual Attention

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：一个关于显性和隐性视觉注意力的主动推理模型