Computer Vision and Pattern Recognition

Authors and titles for June 2025

Total of 3129 entries : 1-25 ... 101-125 126-150 151-175 176-200 201-225 226-250 251-275 ... 3126-3129


[176] arXiv:2506.02014 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Research on Driving Scenario Technology Based on Multimodal Large Language Model Optimization

Wang Mengjie, Zhu Huiping, Li Jian, Shi Wenxiu, Zhang Song

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[177] arXiv:2506.02015 (cross-listed from cs.CV) [Chinese pdf, pdf, other]

Title: Object-centric Self-improving Preference Optimization for Text-to-Image Generation

Yoonjin Oh, Yongjin Kim, Hyomin Kim, Donghwan Chi, Sungwoong Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[178] arXiv:2506.02016 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Are classical deep neural networks weakly adversarially robust?

Nuolin Sun, Linyuan Wang, Dongyang Li, Bin Yan, Lei Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[179] arXiv:2506.02017 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Fairness through Feedback: Addressing Algorithmic Misgendering in Automatic Gender Recognition

Camilla Quaresmini, Giacomo Zanotti

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[180] arXiv:2506.02020 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Improve Multi-Modal Embedding Learning via Explicit Hard Negative Gradient Amplifying

Youze Xue, Dian Li, Gang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[181] arXiv:2506.02021 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics

Yinjie Zhao, Heng Zhao, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[182] arXiv:2506.02022 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Do You See Me: A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs

Aditya Kanade, Tanuja Ganu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[183] arXiv:2506.02095 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

Hyojin Bahng, Caroline Chan, Fredo Durand, Phillip Isola

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[184] arXiv:2506.02112 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: SAB3R: Semantic-Augmented Backbone in 3D Reconstruction

Xuweiyi Chen, Tian Xia, Sihan Xu, Jianing Yang, Joyce Chai, Zezhou Cheng

Comments: 3D-LLM/VLA @ CVPR2025 | Project page: https://uva-computer-vision-lab.github.io/sab3r/

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2506.02150 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Implicit Deformable Medical Image Registration with Learnable Kernels

Stefano Fogarollo, Gregor Laimer, Reto Bale, Matthias Harders

Comments: Pre-accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[186] arXiv:2506.02161 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

Xinyu Wei, Jinrui Zhang, Zeqing Wang, Hongyang Wei, Zhen Guo, Lei Zhang

Comments: 23 pages, 12 figures, 11 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[187] arXiv:2506.02164 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Quantifying task-relevant representational similarity using decision variable correlation

Yu (Eric) Qian, Wilson S. Geisler, Xue-Xin Wei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM)
[188] arXiv:2506.02167 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360-Degree Firefighting Videos

Aditi Tiwari, Farzaneh Masoud, Dac Trong Nguyen, Jill Kraft, Heng Ji, Klara Nahrstedt

Comments: 20 pages, 9 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[189] arXiv:2506.02221 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment

Johannes Schusterbauer, Ming Gui, Frank Fundel, Björn Ommer

Comments: Accepted by CVPR 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[190] arXiv:2506.02229 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: VLCD: Vision-Language Contrastive Distillation for Accurate and Efficient Automatic Placenta Analysis

Manas Mehta, Yimu Pan, Kelly Gallagher, Alison D. Gernand, Jeffery A. Goldstein, Delia Mwinyelle, Leena Mithal, James Z. Wang

Comments: Proceedings of the 9th International Workshop on Health Intelligence, held in conjunction with the Annual AAAI Conference on Artificial Intelligence, Philadelphia, Pennsylvania, March 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[191] arXiv:2506.02244 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Motion aware video generative model

Bowen Xue, Giuseppe Claudio Guarnera, Shuang Zhao, Zahra Montazeri

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[192] arXiv:2506.02247 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: PAIR-Net: Enhancing Egocentric Speaker Detection via Pretrained Audio-Visual Fusion and Alignment Loss

Yu Wang, Juhyung Ha, David J. Crandall

Comments: 4 pages, 1 figure, 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[193] arXiv:2506.02265 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction

Samuel Li, Pujith Kachana, Prajwal Chidananda, Saurabh Nair, Yasutaka Furukawa, Matthew Brown

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[194] arXiv:2506.02291 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Entity Image and Mixed-Modal Image Retrieval Datasets

Cristian-Ioan Blaga, Paul Suganthan, Sahil Dua, Krishna Srinivasan, Enrique Alfonseca, Peter Dornbach, Tom Duerig, Imed Zitouni, Zhe Dong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[195] arXiv:2506.02294 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Improving Knowledge Distillation Under Unknown Covariate Shift Through Confidence-Guided Data Augmentation

Niclas Popp, Kevin Alexander Laube, Matthias Hein, Lukas Schott

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[196] arXiv:2506.02295 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation

Ahmed Wasfy, Omer Nacar, Abdelakreem Elkhateb, Mahmoud Reda, Omar Elshehy, Adel Ammar, Wadii Boulila

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[197] arXiv:2506.02327 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Yijun Yang, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang, Rama Chellappa, Zongwei Zhou, Alan Yuille, Lei Zhu, Yu-Dong Zhang, Jieneng Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[198] arXiv:2506.02334 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: Generalized Category Discovery via Reciprocal Learning and Class-Wise Distribution Regularization

Duo Liu, Zhiquan Tan, Linglan Zhao, Zhongqiang Zhang, Xiangzhong Fang, Weiran Huang

Comments: ICML2025 poster

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2506.02354 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models

Junjie Li, Nan Zhang, Xiaoyang Qu, Kai Lu, Guokuan Li, Jiguang Wan, Jianzong Wang

Comments: Accepted by the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2506.02356 (cross-listed from cs.CV) [Chinese pdf, pdf, html, other]

Title: InterRVOS: Interaction-aware Referring Video Object Segmentation

Woojeong Jin, Seongchan Kim, Seungryong Kim

Subjects: Computer Vision and Pattern Recognition (cs.CV)
