Computer Vision and Pattern Recognition

Authors and titles for June 2025

Total of 3129 entries


[101] arXiv:2506.01300 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

Yiyang Zhou, Yangfan He, Yaofeng Su, Siwei Han, Joel Jang, Gedas Bertasius, Mohit Bansal, Huaxiu Yao

Comments: 31 pages, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2506.01304 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost

Haiyang Mei, Pengyu Zhang, Mike Zheng Shou

Comments: CVPR 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2506.01331 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation

Jinjin Zhang, Qiuyu Huang, Junjie Liu, Xiefan Guo, Di Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2506.01338 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: A 2-Stage Model for Vehicle Class and Orientation Detection with Photo-Realistic Image Generation

Youngmin Kim, Donghwa Kang, Hyeongboo Baek

Comments: Accepted at the 2022 IEEE Big Data conference

Journal reference: 2022 IEEE International Conference on Big Data (Big Data)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2506.01346 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Rethinking Image Histogram Matching for Image Classification

Rikuto Otsuka, Yuho Shoji, Yuka Ogino, Takahiro Toizumi, Atsushi Ito

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2506.01349 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Target Driven Adaptive Loss For Infrared Small Target Detection

Yuho Shoji, Takahiro Toizumi, Atsushi Ito

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2506.01366 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: CLIP-driven rain perception: Adaptive deraining with pattern-aware network routing and mask-guided cross-attention

Cong Guan, Osamu Yoshie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2506.01368 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Synthetic Data Augmentation using Pre-trained Diffusion Models for Long-tailed Food Image Classification

GaYeon Koh, Hyun-Jic Oh, Jeonghyun Noh, Won-Ki Jeong

Comments: 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2506.01370 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: PointT2I: LLM-based text-to-image generation via keypoints

Taekyung Lee, Donggyu Lee, Myungjoo Kang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[110] arXiv:2506.01371 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: SVQA-R1: Reinforcing Spatial Reasoning in MLLMs via View-Consistent Reward Optimization

Peiyao Wang, Haibin Ling

Comments: 9 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2506.01373 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: No Train Yet Gain: Towards Generic Multi-Object Tracking in Sports and Beyond

Tomasz Stanczyk, Seongro Yoon, Francois Bremond

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2506.01379 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes

Pou-Chun Kung, Skanda Harisha, Ram Vasudevan, Aline Eid, Katherine A. Skinner

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2506.01380 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Playing with Transformer at 30+ FPS via Next-Frame Diffusion

Xinle Cheng, Tianyu He, Jiayi Xu, Junliang Guo, Di He, Jiang Bian

Comments: Project page: https://nextframed.github.io/

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[114] arXiv:2506.01388 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: VRD-IU: Lessons from Visually Rich Document Intelligence and Understanding

Yihao Ding, Soyeon Caren Han, Yan Li, Josiah Poon

Comments: Accepted to the IJCAI 2025 Demo Track

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[115] arXiv:2506.01389 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: Neural shape reconstruction from multiple views with static pattern projection

Ryo Furukawa, Kota Nishihara, Hiroshi Kawasaki

Comments: 6 pages, CVPR 2025 Workshop on Neural Fields Beyond Conventional Cameras

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2506.01411 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition

Minjeong Park, Hongbeen Park, Jinkyu Kim

Comments: Accepted to IEEE ICIP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[117] arXiv:2506.01413 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models

Yulei Qin, Gang Li, Zongyi Li, Zihan Xu, Yuchen Shi, Zhekai Lin, Xiao Cui, Ke Li, Xing Sun

Comments: 13 pages of main text, 3 tables, 5 figures, 45 pages of appendix

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[118] arXiv:2506.01430 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing

Chenxi Xie, Minghan Li, Shuai Li, Yuhui Wu, Qiaosi Yi, Lei Zhang

Comments: Project URL: https://xiechenxi99.github.io/DNAEdit

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2506.01441 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Semantic Palette-Guided Color Propagation

Zi-Yu Zhang, Bing-Feng Seng, Ya-Feng Du, Kang Li, Zhe-Cheng Wang, Zheng-Jun Du

Comments: 6 pages, 5 figures, IEEE ICME 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[120] arXiv:2506.01443 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow

Jakob Schmid, Azin Jahedi, Noah Berenguel Senn, Andrés Bruhn

Comments: ICIP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2506.01445 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: A Novel Context-Adaptive Fusion of Shadow and Highlight Regions for Efficient Sonar Image Classification

Kamal Basha S, Anukul Kiran B, Athira Nambiar, Suresh Rajendran

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2506.01454 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion

Geunmin Hwang, Hyun-kyu Ko, Younghyun Kim, Seungryong Lee, Eunbyung Park

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2506.01466 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark

Shuyu Yang, Yilun Wang, Yaxiong Wang, Li Zhu, Zhedong Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124] arXiv:2506.01468 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Sheep Facial Pain Assessment Under Weighted Graph Neural Networks

Alam Noor, Luis Almeida, Mohamed Daoudi, Kai Li, Eduardo Tovar

Comments: 2025 19th International Conference on Automatic Face and Gesture Recognition (FG)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2506.01471 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: SemiVT-Surge: Semi-Supervised Video Transformer for Surgical Phase Recognition

Yiping Li, Ronald de Jong, Sahar Nasirihaghighi, Tim Jaspers, Romy van Jaarsveld, Gino Kuiper, Richard van Hillegersberg, Fons van der Sommen, Jelle Ruurda, Marcel Breeuwer, Yasmina Al Khalil

Comments: Accepted by MICCAI 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
