计算机视觉与模式识别

2025年06月的作者和标题

总共 3129 条目 : 1-100 101-200 176-275 201-300 301-400 401-500 ... 3101-3129

显示最多 100 每页条目：较少 | 更多 | 所有

[176] arXiv:2506.02014 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于多模态大语言模型优化的驾驶场景技术研究

标题： Research on Driving Scenario Technology Based on Multimodal Large Lauguage Model Optimization

Wang Mengjie, Zhu Huiping, Li Jian, Shi Wenxiu, Zhang Song

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[177] arXiv:2506.02015 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：面向对象的自改进偏好优化用于文本到图像生成

标题： Object-centric Self-improving Preference Optimization for Text-to-Image Generation

Yoonjin Oh, Yongjin Kim, Hyomin Kim, Donghwan Chi, Sungwoong Kim

主题：计算机视觉与模式识别 (cs.CV)
[178] arXiv:2506.02016 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：经典的深度神经网络是否具有弱对抗鲁棒性？

标题： Are classical deep neural networks weakly adversarially robust?

Nuolin Sun, Linyuan Wang, Dongyang Li, Bin Yan, Lei Li

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[179] arXiv:2506.02017 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过反馈实现公平：解决自动性别识别中的算法冒犯性性别认定问题

标题： Fairness through Feedback: Addressing Algorithmic Misgendering in Automatic Gender Recognition

Camilla Quaresmini, Giacomo Zanotti

主题：计算机视觉与模式识别 (cs.CV)
[180] arXiv:2506.02020 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过显式硬负样本梯度放大改进多模态嵌入学习

标题： Improve Multi-Modal Embedding Learning via Explicit Hard Negative Gradient Amplifying

Youze Xue, Dian Li, Gang Liu

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[181] arXiv:2506.02021 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：动态感知视频蒸馏：基于视频语义优化时间分辨率

标题： Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics

Yinjie Zhao, Heng Zhao, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[182] arXiv:2506.02022 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：《你看见我了吗？》：一个多维度基准来评估多模态大型语言模型中的视觉感知

标题： Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs

Aditya Kanade, Tanuja Ganu

主题：计算机视觉与模式识别 (cs.CV)
[183] arXiv:2506.02095 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：循环一致性作为奖励：无需人类偏好即可学习图像-文本对齐

标题： Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences

Hyojin Bahng, Caroline Chan, Fredo Durand, Phillip Isola

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[184] arXiv:2506.02112 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SAB3R：三维重建中的语义增强主干网络

标题： SAB3R: Semantic-Augmented Backbone in 3D Reconstruction

Xuweiyi Chen, Tian Xia, Sihan Xu, Jianing Yang, Joyce Chai, Zezhou Cheng

评论： 3D-LLM/VLA @ CVPR2025 | 项目页面：https://uva-computer-vision-lab.github.io/sab3r/

主题：计算机视觉与模式识别 (cs.CV)
[185] arXiv:2506.02150 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：具有可学习核的隐式可变形医学图像配准

标题： Implicit Deformable Medical Image Registration with Learnable Kernels

Stefano Fogarollo, Gregor Laimer, Reto Bale, Matthias Harders

评论：预接受

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[186] arXiv:2506.02161 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： TIIF-Bench：你的文本到图像模型如何遵循您的指令？

标题： TIIF-Bench: How Does Your T2I Model Follow Your Instructions?

Xinyu Wei, Jinrui Zhang, Zeqing Wang, Hongyang Wei, Zhen Guo, Lei Zhang

评论： 23页，12图，11表

主题：计算机视觉与模式识别 (cs.CV)
[187] arXiv:2506.02164 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：利用决策变量相关性量化与任务相关的表征相似性

标题： Quantifying task-relevant representational similarity using decision variable correlation

Yu (Eric)Qian, Wilson S. Geisler, Xue-Xin Wei

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG) ; 神经与认知 (q-bio.NC) ; 定量方法 (q-bio.QM)
[188] arXiv:2506.02167 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Fire360：一种用于退化全景消防视频中鲁棒感知和情景记忆的基准

标题： Fire360: A Benchmark for Robust Perception and Episodic Memory in Degraded 360-Degree Firefighting Videos

Aditi Tiwari, Farzaneh Masoud, Dac Trong Nguyen, Jill Kraft, Heng Ji, Klara Nahrstedt

评论： 20页，9个图，6个表格

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[189] arXiv:2506.02221 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Diff2Flow：通过扩散模型对齐训练流匹配模型

标题： Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment

Johannes Schusterbauer, Ming Gui, Frank Fundel, Björn Ommer

评论：被CVPR 2025接受

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[190] arXiv:2506.02229 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VLCD：用于准确且高效的自动胎盘分析的视觉-语言对比蒸馏

标题： VLCD: Vision-Language Contrastive Distillation for Accurate and Efficient Automatic Placenta Analysis

Manas Mehta, Yimu Pan, Kelly Gallagher, Alison D. Gernand, Jeffery A. Goldstein, Delia Mwinyelle, Leena Mithal, James Z. Wang

评论：第九届国际健康智能研讨会会议录与美国人工智能协会年度会议同期举办，宾夕法尼亚州费城，2025年3月

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 计算与语言 (cs.CL) ; 机器学习 (cs.LG)
[191] arXiv:2506.02244 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：运动感知视频生成模型

标题： Motion aware video generative model

Bowen Xue, Giuseppe Claudio Guarnera, Shuang Zhao, Zahra Montazeri

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[192] arXiv:2506.02247 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： EgoVIS@CVPR：PAIR-Net：通过预训练的音频视觉融合和对齐损失增强第一视角说话人检测

标题： EgoVIS@CVPR: PAIR-Net: Enhancing Egocentric Speaker Detection via Pretrained Audio-Visual Fusion and Alignment Loss

Yu Wang, Juhyung Ha, David J. Crandall

评论： 4页，1图，1表

主题：计算机视觉与模式识别 (cs.CV)
[193] arXiv:2506.02265 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Rig3R：针对学习的3D重建的骨架感知条件化

标题： Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction

Samuel Li, Pujith Kachana, Prajwal Chidananda, Saurabh Nair, Yasutaka Furukawa, Matthew Brown

主题：计算机视觉与模式识别 (cs.CV)
[194] arXiv:2506.02291 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：实体图像与多模态图像检索数据集

标题： Entity Image and Mixed-Modal Image Retrieval Datasets

Cristian-Ioan Blaga, Paul Suganthan, Sahil Dua, Krishna Srinivasan, Enrique Alfonseca, Peter Dornbach, Tom Duerig, Imed Zitouni, Zhe Dong

主题：计算机视觉与模式识别 (cs.CV) ; 信息检索 (cs.IR)
[195] arXiv:2506.02294 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过置信度引导的数据增强改善未知协变量偏移下的知识蒸馏

标题： Improving Knowledge Distillation Under Unknown Covariate Shift Through Confidence-Guided Data Augmentation

Niclas Popp, Kevin Alexander Laube, Matthias Hein, Lukas Schott

主题：计算机视觉与模式识别 (cs.CV)
[196] arXiv:2506.02295 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： QARI-OCR：通过多模态大型语言模型适应的高保真阿拉伯文文本识别

标题： QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation

Ahmed Wasfy, Omer Nacar, Abdelakreem Elkhateb, Mahmoud Reda, Omar Elshehy, Adel Ammar, Wadii Boulila

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[197] arXiv:2506.02327 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：医学世界模型：治疗规划的肿瘤演化生成模拟

标题： Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning

Yijun Yang, Zhao-Yang Wang, Qiuping Liu, Shuwen Sun, Kang Wang, Rama Chellappa, Zongwei Zhou, Alan Yuille, Lei Zhu, Yu-Dong Zhang, Jieneng Chen

主题：计算机视觉与模式识别 (cs.CV)
[198] arXiv:2506.02334 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过互学习和类分布正则化的广义类别发现

标题： Generalized Category Discovery via Reciprocal Learning and Class-Wise Distribution Regularization

Duo Liu, Zhiquan Tan, Linglan Zhao, Zhongqiang Zhang, Xiangzhong Fang, Weiran Huang

评论： ICML2025海报

主题：计算机视觉与模式识别 (cs.CV)
[199] arXiv:2506.02354 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： RATE-Nav：基于区域感知的零样本物体导航终止增强视觉语言模型

标题： RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models

Junjie Li, Nan Zhang, Xiaoyang Qu, Kai Lu, Guokuan Li, Jiguang Wan, Jianzong Wang

评论：被第63届计算语言学协会年会（ACL 2025）接受

主题：计算机视觉与模式识别 (cs.CV)
[200] arXiv:2506.02356 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： InterRVOS：基于交互感知的指代视频对象分割

标题： InterRVOS: Interaction-aware Referring Video Object Segmentation

Woojeong Jin, Seongchan Kim, Seungryong Kim

主题：计算机视觉与模式识别 (cs.CV)
[201] arXiv:2506.02358 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： RoadFormer：自动驾驶道路表面分类中的局部-全局特征融合

标题： RoadFormer : Local-Global Feature Fusion for Road Surface Classification in Autonomous Driving

Tianze Wang, Zhang Zhang, Chao Sun

主题：计算机视觉与模式识别 (cs.CV)
[202] arXiv:2506.02359 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：目标检测的自动标注数据

标题： Auto-Labeling Data for Object Detection

Brent A. Griffin, Manushree Gangwar, Jacob Sela, Jason J. Corso

主题：计算机视觉与模式识别 (cs.CV)
[203] arXiv:2506.02364 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于TRPCA启发的深度展开网络通过阈值t-SVD和Top-K稀疏Transformer进行高光谱图像去噪

标题： A TRPCA-Inspired Deep Unfolding Network for Hyperspectral Image Denoising via Thresholded t-SVD and Top-K Sparse Transformer

Liang Li, Jianli Zhao, Sheng Fang, Siyu Chen, Hui Sun

评论： 11页，6个图

主题：计算机视觉与模式识别 (cs.CV)
[204] arXiv:2506.02366 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于粒球的分类任务近似边界采样

标题： Approximate Borderline Sampling using Granular-Ball for Classification Tasks

Qin Xie, Qinghua Zhang, Shuyin Xia

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[205] arXiv:2506.02367 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ViTNF：利用神经场提升视觉变换器在广义类别发现中的表现

标题： ViTNF: Leveraging Neural Fields to Boost Vision Transformers in Generalized Category Discovery

Jiayi Su, Dequan Jin

评论： 22页，3幅图

主题：计算机视觉与模式识别 (cs.CV)
[206] arXiv:2506.02382 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：多层级与多模态动作预测

标题： Multi-level and Multi-modal Action Anticipation

Seulgi Kim, Ghazal Kaviani, Mohit Prabhushankar, Ghassan AlRegib

评论：已被2025年IEEE图像处理国际会议（ICIP）接受

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[207] arXiv:2506.02393 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： RRCANet：用于红外小目标检测的循环可重用卷积注意力网络

标题： RRCANet: Recurrent Reusable-Convolution Attention Network for Infrared Small Target Detection

Yongxian Liu, Boyang Li, Ting Liu, Zaiping Lin, Wei An

评论：我们更正了图表中的一些注释错误

主题：计算机视觉与模式识别 (cs.CV)
[208] arXiv:2506.02395 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：魔鬼藏于黑暗：基于亮度感知的夜间去雾扩散模型

标题： The Devil is in the Darkness: Diffusion-Based Nighttime Dehazing Anchored in Brightness Perception

Xiaofeng Cong, Yu-Xin Zhang, Haoran Wei, Yeying Jin, Junming Hou, Jie Gui, Jing Zhang, Dacheng Tao

主题：计算机视觉与模式识别 (cs.CV)
[209] arXiv:2506.02396 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：面向恶劣天气下通用激光雷达分割的显式几何-反射率协作研究

标题： Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather

Longyu Yang, Ping Hu, Shangbo Yuan, Lu Zhang, Jun Liu, Hengtao Shen, Xiaofeng Zhu

主题：计算机视觉与模式识别 (cs.CV)
[210] arXiv:2506.02405 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：模型归因：追踪生成模型中的多阶段操作

标题： Modelship Attribution: Tracing Multi-Stage Manipulations Across Generative Models

Zhiya Tan, Xin Zhang, Joey Tianyi Zhou

主题：计算机视觉与模式识别 (cs.CV)
[211] arXiv:2506.02408 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：回顾计算病理学中的端到端学习与幻灯片级监督

标题： Revisiting End-to-End Learning with Slide-level Supervision in Computational Pathology

Wenhao Tang, Rong Qin, Heng Fang, Fengtao Zhou, Hao Chen, Xiang Li, Ming-Ming Cheng

主题：计算机视觉与模式识别 (cs.CV)
[212] arXiv:2506.02419 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用预训练扩散模型的涌现相似性引导注册

标题： Guiding Registration with Emergent Similarity from Pre-Trained Diffusion Models

Nurislam Tursynbek, Hastings Greer, Basar Demir, Marc Niethammer

评论：国际医学图像计算和计算机辅助干预会议 2025

主题：计算机视觉与模式识别 (cs.CV)
[213] arXiv:2506.02433 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：赋能功能神经影像：一种用于神经信号统一表示的预训练生成框架

标题： Empowering Functional Neuroimaging: A Pre-trained Generative Framework for Unified Representation of Neural Signals

Weiheng Yao, Xuhang Chen, Shuqiang Wang

主题：计算机视觉与模式识别 (cs.CV)
[214] arXiv:2506.02439 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于视频的语言驱动的可见光-红外视频人再识别

标题： Video-Level Language-Driven Video-Based Visible-Infrared Person Re-Identification

Shuang Li, Jiaxu Leng, Changjiang Kuang, Mingpi Tan, Xinbo Gao

评论：已被IEEE TIFS接受

主题：计算机视觉与模式识别 (cs.CV)
[215] arXiv:2506.02444 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SViMo：手部与物体交互场景下用于视频和运动生成的同步扩散模型

标题： SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios

Lingwei Dang, Ruizhi Shao, Hongwen Zhang, Wei Min, Yebin Liu, Qingyao Wu

主题：计算机视觉与模式识别 (cs.CV)
[216] arXiv:2506.02448 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VidEvent：一个用于理解视频中事件动态演化的大型数据集

标题： VidEvent: A Large Dataset for Understanding Dynamic Evolution of Events in Videos

Baoyu Liang, Qile Su, Shoutai Zhu, Yuchen Liang, Chao Tong

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[217] arXiv:2506.02452 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：自适应神经时间感知文本到运动模型

标题： ANT: Adaptive Neural Temporal-Aware Text-to-Motion Model

Wenshuo Chen, Kuimou Yu, Haozhe Jia, Kaishen Yuan, Bowen Tian, Songning Lai, Hongru Xiao, Erhang Zhang, Lei Wang, Yutao Yue

主题：计算机视觉与模式识别 (cs.CV)
[218] arXiv:2506.02453 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： PAID：用于持续测试时适应的成对角度不变分解

标题： PAID: Pairwise Angular-Invariant Decomposition for Continual Test-Time Adaptation

Kunyu Wang, Xueyang Fu, Yuanfei Bao, Chengjie Ge, Chengzhi Cao, Wei Zhai, Zheng-Jun Zha

主题：计算机视觉与模式识别 (cs.CV)
[219] arXiv:2506.02459 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ReSpace：基于文本的带偏好对齐的3D场景合成与编辑

标题： ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment

Martin JJ. Bucher, Iro Armeni

评论： 20页，17幅图（包括附录）

主题：计算机视觉与模式识别 (cs.CV)
[220] arXiv:2506.02462 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过灵敏度引导剪枝的高效测试时自适应目标检测

标题： Efficient Test-time Adaptive Object Detection via Sensitivity-Guided Pruning

Kunyu Wang, Xueyang Fu, Xin Lu, Chengjie Ge, Chengzhi Cao, Wei Zhai, Zheng-Jun Zha

评论：被CVPR 2025接收为口头报告论文

主题：计算机视觉与模式识别 (cs.CV)
[221] arXiv:2506.02472 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： HRTR：用于中风康复细粒度亚秒级动作分割的单阶段Transformer

标题： HRTR: A Single-stage Transformer for Fine-grained Sub-second Action Segmentation in Stroke Rehabilitation

Halil Ismail Helvaci, Justin Philip Huber, Jihye Bae, Sen-ching Samson Cheung

主题：计算机视觉与模式识别 (cs.CV)
[222] arXiv:2506.02473 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于微分运动的形状与材质生成感知

标题： Generative Perception of Shape and Material from Differential Motion

Xinran Nicole Han, Ko Nishino, Todd Zickler

主题：计算机视觉与模式识别 (cs.CV)
[223] arXiv:2506.02477 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过雨天特征记忆与重放实现更好的去雨泛化

标题： Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay

Kunyu Wang, Xueyang Fu, Chengzhi Cao, Chengjie Ge, Wei Zhai, Zheng-Jun Zha

主题：计算机视觉与模式识别 (cs.CV)
[224] arXiv:2506.02488 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题： Flexiffusion：无需训练的分段式神经架构搜索用于高效的扩散模型

标题： Flexiffusion: Training-Free Segment-Wise Neural Architecture Search for Efficient Diffusion Models

Hongtao Huang, Xiaojun Chang, Lina Yao

评论：本文原本是打算作为我之前论文（arXiv:2409.17566）的v2版本，但错误地以新论文的形式提交了。

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[225] arXiv:2506.02492 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于信息量的共证据融合在医学图像分割中的应用

标题： Co-Evidential Fusion with Information Volume for Medical Image Segmentation

Yuanpeng He, Lijian Li, Tianxiang Zhan, Chi-Man Pun, Wenpin Jiao, Zhi Jin

主题：计算机视觉与模式识别 (cs.CV)
[226] arXiv:2506.02493 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：面向野外单幅图像的3D平面重建

标题： Towards In-the-wild 3D Plane Reconstruction from a Single Image

Jiachen Liu, Rui Yu, Sili Chen, Sharon X. Huang, Hengkai Guo

评论： CVPR 2025 高亮论文

主题：计算机视觉与模式识别 (cs.CV)
[227] arXiv:2506.02497 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LumosFlow: 基于运动引导的长视频生成

标题： LumosFlow: Motion-Guided Long Video Generation

Jiahao Chen, Hangjie Yuan, Yichen Qian, Jingyun Liang, Jiazheng Xing, Pengwei Liu, Weihua Chen, Fan Wang, Bing Su

主题：计算机视觉与模式识别 (cs.CV)
[228] arXiv:2506.02528 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：关系适配器：使用扩散变换学习和迁移视觉关系

标题： RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers

Yan Gong, Yiren Song, Yicheng Li, Chenglin Li, Yin Zhang

主题：计算机视觉与模式识别 (cs.CV)
[229] arXiv:2506.02534 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：利用不完美标签的弱监督增强单目高度估计

标题： Enhancing Monocular Height Estimation via Weak Supervision from Imperfect Labels

Sining Chen, Yilei Shi, Xiao Xiang Zhu

主题：计算机视觉与模式识别 (cs.CV)
[230] arXiv:2506.02535 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：内存耗尽：基于多模态稀疏滤波网络学习主特征用于半监督视频异常检测

标题： MemoryOut: Learning Principal Features via Multimodal Sparse Filtering Network for Semi-supervised Video Anomaly Detection

Juntong Li, Lingwei Dang, Yukun Su, Yun Hao, Qingxin Xiao, Yongwei Nie, Qingyao Wu

主题：计算机视觉与模式识别 (cs.CV)
[231] arXiv:2506.02537 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VisuRiddles: 细粒度感知是多模态大型语言模型在抽象视觉推理中的主要瓶颈

标题： VisuRiddles: Fine-grained Perception is a Primary Bottleneck for Multimodal Large Language Models in Abstract Visual Reasoning

Hao Yan, Handong Zheng, Hao Wang, Liang Yin, Xingchen Liu, Zhenbiao Cao, Xinxing Su, Zihao Chen, Jihao Wu, Minghui Liao, Chao Weng, Wei Chen, Yuliang Liu, Xiang Bai

评论： 13页，4幅图

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[232] arXiv:2506.02547 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：概率在线事件降采样

标题： Probabilistic Online Event Downsampling

Andreu Girbau-Xalabarder, Jun Nagata, Shinichi Sumiyoshi

评论： accepted for the CVPR 2025 Events-Vision Workshop

主题：计算机视觉与模式识别 (cs.CV) ; 新兴技术 (cs.ET)
[233] arXiv:2506.02550 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：技术报告：Ego4D 长期动作预测挑战赛 2025

标题： Technical Report for Ego4D Long-Term Action Anticipation Challenge 2025

Qiaohui Chu, Haoyu Zhang, Yisen Feng, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie

评论： CVPR EgoVis Workshop 2025上Ego4D长期动作预测挑战的冠军解决方案

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[234] arXiv:2506.02555 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SurgVLM：一种用于手术智能的大规模视觉-语言模型和系统性评估基准

标题： SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence

Zhitao Zeng, Zhu Zhuo, Xiaojun Jia, Erli Zhang, Junde Wu, Jiaan Zhang, Yuxuan Wang, Chang Han Low, Jian Jiang, Zilong Zheng, Xiaochun Cao, Yutong Ban, Qi Dou, Yang Liu, Yueming Jin

评论： 29页，5幅图

主题：计算机视觉与模式识别 (cs.CV)
[235] arXiv:2506.02557 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于核的无监督嵌入对齐以增强视觉-语言模型中的视觉表示

标题： Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models

Shizhan Gong, Yankai Jiang, Qi Dou, Farzan Farnia

评论： ICML 2025

主题：计算机视觉与模式识别 (cs.CV)
[236] arXiv:2506.02560 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DCI：用于增强基于扩散的图像编辑的双条件反转

标题： DCI: Dual-Conditional Inversion for Boosting Diffusion-Based Image Editing

Zixiang Li, Haoyu Wang, Wei Wang, Chuangchuang Tan, Yunchao Wei, Yao Zhao

主题：计算机视觉与模式识别 (cs.CV)
[237] arXiv:2506.02571 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：对比与压缩：学习短轨迹的轻量级嵌入

标题： Contrast & Compress: Learning Lightweight Embeddings for Short Trajectories

Abhishek Vivekanandan, Christian Hubschneider, J. Marius Zöllner

评论：投稿同行评审

主题：计算机视觉与模式识别 (cs.CV)
[238] arXiv:2506.02587 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： BEVCALIB：基于几何引导的鸟瞰视图表示的激光雷达-相机标定

标题： BEVCALIB: LiDAR-Camera Calibration via Geometry-Guided Bird's-Eye View Representations

Weiduo Yuan, Jerry Li, Justin Yue, Divyank Shah, Konstantinos Karydis, Hang Qiu

主题：计算机视觉与模式识别 (cs.CV) ; 机器人技术 (cs.RO)
[239] arXiv:2506.02601 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：利用 unmixing 引导的扩散模型生成高光谱图像

标题： Hyperspectral Image Generation with Unmixing Guided Diffusion Model

Shiyu Shen, Bin Pan, Ziye Zhang, Zhenwei Shi

主题：计算机视觉与模式识别 (cs.CV) ; 图像与视频处理 (eess.IV)
[240] arXiv:2506.02604 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：卷积神经网络在图像超分辨率中的应用

标题： Application of convolutional neural networks in image super-resolution

Chunwei Tian, Mingjian Song, Wangmeng Zuo, Bo Du, Yanning Zhang, Shichao Zhang

评论：已被CAAI《智能系统学报》接受，使用中文撰写。

主题：计算机视觉与模式识别 (cs.CV) ; 图像与视频处理 (eess.IV)
[241] arXiv:2506.02605 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于一步扩散的现实世界图像超分辨率与视觉感知蒸馏

标题： One-Step Diffusion-based Real-World Image Super-Resolution with Visual Perception Distillation

Xue Wu, Jingwei Xin, Zhijun Tu, Jie Hu, Jie Li, Nannan Wang, Xinbo Gao

主题：计算机视觉与模式识别 (cs.CV)
[242] arXiv:2506.02614 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：大规模数据集下复杂天空背景中的高精度空间碎片跟踪

标题： High Performance Space Debris Tracking in Complex Skylight Backgrounds with a Large-Scale Dataset

Guohang Zhuang, Weixi Song, Jinyang Huang, Chenwei Yang, Wanli OuYang, Yan Lu

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[243] arXiv:2506.02615 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于视觉语言模型的驾驶场景理解分层问答

标题： Hierarchical Question-Answering for Driving Scene Understanding Using Vision-Language Models

Safaa Abdullahi Moallim Mohamud, Minjin Baek, Dong Seog Han

评论：这项工作已被提交至IEEE，以供可能的发表。

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[244] arXiv:2506.02626 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：合成虹膜图像数据库与身份泄露：风险与缓解策略

标题： Synthetic Iris Image Databases and Identity Leakage: Risks and Mitigation Strategies

Ada Sawilska, Mateusz Trokielewicz

主题：计算机视觉与模式识别 (cs.CV)
[245] arXiv:2506.02633 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ControlMambaIR：基于状态空间模型的图像恢复条件控制

标题： ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration

Cheng Yang, Lijing Liang, Zhixun Su

主题：计算机视觉与模式识别 (cs.CV)
[246] arXiv:2506.02671 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：小帮助，大飞跃：使用 AdaptNet 的视觉-语言模型高效测试时自适应

标题： Small Aid, Big Leap: Efficient Test-Time Adaptation for Vision-Language Models with AdaptNet

Xiao Chen, Jiazhen Huang, Qinting Jiang, Fanding Huang, Xianghua Fu, Jingyan Jiang, Zhi Wang

主题：计算机视觉与模式识别 (cs.CV)
[247] arXiv:2506.02677 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：自解缠与再组合用于跨域少量样本分割

标题： Self-Disentanglement and Re-Composition for Cross-Domain Few-Shot Segmentation

Jintao Tong, Yixiong Zou, Guangyao Chen, Yuhua Li, Ruixuan Li

评论：被ICML 2025接受

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器学习 (cs.LG)
[248] arXiv:2506.02680 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：用FLAIR求解反问题

标题： Solving Inverse Problems with FLAIR

Julius Erbach, Dominik Narnhofer, Andreas Dombos, Bernt Schiele, Jan Eric Lenssen, Konrad Schindler

主题：计算机视觉与模式识别 (cs.CV) ; 图像与视频处理 (eess.IV)
[249] arXiv:2506.02690 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：面向大模型时代的几何问题求解：一项调查

标题： Towards Geometry Problem Solving in the Large Model Era: A Survey

Yurui Zhao, Xiang Wang, Jiahong Liu, Irwin King, Zhitao Huang

评论： 8页，4个图，会议投稿

主题：计算机视觉与模式识别 (cs.CV) ; 几何拓扑 (math.GT)
[250] arXiv:2506.02692 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：大规模自监督视频基础模型用于智能手术

标题： Large-scale Self-supervised Video Foundation Model for Intelligent Surgery

Shu Yang, Fengtao Zhou, Leon Mayer, Fuxiang Huang, Yiliang Chen, Yihui Wang, Sunan He, Yuxiang Nie, Xi Wang, Ömer Sümer, Yueming Jin, Huihui Sun, Shuchang Xu, Alex Qinyang Liu, Zheng Li, Jing Qin, Jeremy YuenChun Teoh, Lena Maier-Hein, Hao Chen

主题：计算机视觉与模式识别 (cs.CV)
[251] arXiv:2506.02695 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FaceSleuth：基于学习的单一方向注意力机制验证微观表情识别中的垂直优势

标题： FaceSleuth: Learning-Driven Single-Orientation Attention Verifies Vertical Dominance in Micro-Expression Recognition

Linquan Wu, Tianxiang Jiang, Wenhao Duan, Yini Fang, Jacky Keung

评论： 12页，2幅图

主题：计算机视觉与模式识别 (cs.CV)
[252] arXiv:2506.02697 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LayoutRAG：用于内容无关条件布局生成的检索增强模型

标题： LayoutRAG: Retrieval-Augmented Model for Content-agnostic Conditional Layout Generation

Yuxuan Wu, Le Wang, Sanping Zhou, Mengnan Liu, Gang Hua, Haoxiang Li

评论： 12页，5幅图

主题：计算机视觉与模式识别 (cs.CV)
[253] arXiv:2506.02698 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过重噪声反转的平滑偏好优化以对齐具有不同人类偏好的扩散模型

标题： Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences

Yunhong Lu, Qichao Wang, Hengyuan Cao, Xiaoyin Xu, Min Zhang

评论：已被ICML 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[254] arXiv:2506.02702 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ToothForge：使用同步谱嵌入的自动牙齿形状生成

标题： ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings

Tibor Kubík, François Guibault, Michal Španěl, Hervé Lombaert

评论：医学影像信息处理（IPMI2025）

主题：计算机视觉与模式识别 (cs.CV)
[255] arXiv:2506.02708 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：视觉语言模型在图像评分和自我解释方面的迭代自改进

标题： Iterative Self-Improvement of Vision Language Models for Image Scoring and Self-Explanation

Naoto Tanji, Toshihiko Yamasaki

评论：录用为ICIP2025

主题：计算机视觉与模式识别 (cs.CV) ; 计算与语言 (cs.CL)
[256] arXiv:2506.02733 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LinkTo-Anime：来自3D模型渲染的2D动画光流数据集

标题： LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering

Xiaoyi Feng, Kaifeng Zou, Caichun Cen, Tao Huang, Hui Guo, Zizhou Huang, Yingli Zhao, Mingqing Zhang, Diwei Wang, Yuntao Zou, Dagang Li

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[257] arXiv:2506.02736 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： GeneA-SLAM2：具有自动编码器预处理遗传关键点重采样和深度方差引导的动态区域去除的动态SLAM

标题： GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal

Shufan Qing, Anzhen Li, Qiandi Wang, Yuefeng Niu, Mingchen Feng, Guoliang Hu, Jinqiao Wu, Fengtao Nan, Yingchun Fan

主题：计算机视觉与模式识别 (cs.CV) ; 机器人技术 (cs.RO)
[258] arXiv:2506.02738 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Open-PMC-18M：用于多模态表示学习的高保真大规模医学数据集

标题： Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning

Negin Baghbanzadeh, Sajad Ashkezari, Elham Dolatabadi, Arash Afkanpour

评论： 15页

主题：计算机视觉与模式识别 (cs.CV)
[259] arXiv:2506.02741 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VTGaussian-SLAM：具有散射视图绑定3D高斯分布的大规模场景RGBD SLAM

标题： VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians

Pengchong Hu, Zhizhong Han

评论： ICML 2025

主题：计算机视觉与模式识别 (cs.CV)
[260] arXiv:2506.02751 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：鲁棒点积：解耦密度化和动态以实现无瞬态的3DGS

标题： RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao, Guanying Chen, Yuan Xiong, Chuan Huang, Shuguang Cui, Xiaochun Cao

评论： ICCV 2025。项目页面：https://fcyycf.github.io/RobustSplat/

主题：计算机视觉与模式识别 (cs.CV)
[261] arXiv:2506.02764 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过共享表示的自由视图和视觉搜索统一注意力建模的高效方法

标题： Unified Attention Modeling for Efficient Free-Viewing and Visual Search via Shared Representations

Fatma Youssef Mohammed, Kostas Alexis

评论：已被2025年IEEE国际发展与学习会议（ICDL）接受

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[262] arXiv:2506.02765 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：一种用于车辆检测的动态变换器网络

标题： A Dynamic Transformer Network for Vehicle Detection

Chunwei Tian, Kai Liu, Bob Zhang, Zhixiang Huang, Chia-Wen Lin, David Zhang

评论： 8页，5幅图。本文已被接受发表在《IEEE消费电子汇刊》上。

主题：计算机视觉与模式识别 (cs.CV)
[263] arXiv:2506.02781 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FreeScene：基于自由提示的3D场景合成的混合图扩散

标题： FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts

Tongyuan Bai, Wangyuanfan Bai, Dong Chen, Tieru Wu, Manyi Li, Rui Ma

评论：被CVPR 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[264] arXiv:2506.02783 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SAMJ：通过Segment Anything模型在ImageJ/Fiji上的快速图像标注

标题： SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model

Carlos Garcia-Lopez-de-Haro, Caterina Fuster-Barcelo, Curtis T. Rueden, Jonathan Heras, Vladimir Ulman, Daniel Franco-Barranco, Adrian Ines, Kevin W. Eliceiri, Jean-Christophe Olivo-Marin, Jean-Yves Tinevez, Daniel Sage, Arrate Munoz-Barrutia

主题：计算机视觉与模式识别 (cs.CV)
[265] arXiv:2506.02789 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用眼科超声视频自动测量视神经鞘直径

标题： Automated Measurement of Optic Nerve Sheath Diameter Using Ocular Ultrasound Video

Renxing Li, Weiyi Tang, Peiqi Li, Qiming Huang, Jiayuan She, Shengkai Li, Haoran Xu, Yeyun Wan, Jing Liu, Hailong Fu, Xiang Li, Jiangang Chen

评论： 17页，9幅图

主题：计算机视觉与模式识别 (cs.CV)
[266] arXiv:2506.02843 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：跨域少量学习的随机寄存器

标题： Random Registers for Cross-Domain Few-Shot Learning

Shuai Yi, Yixiong Zou, Yuhua Li, Ruixuan Li

评论：被ICML 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[267] arXiv:2506.02845 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：超越地球：理解微重力环境中的人类行为与场景

标题： Go Beyond Earth: Understanding Human Actions and Scenes in Microgravity Environments

Di Wen, Lei Qi, Kunyu Peng, Kailun Yang, Fei Teng, Ao Luo, Jia Fu, Yufan Chen, Ruiping Liu, Yitian Shi, M. Saquib Sarfraz, Rainer Stiefelhagen

评论： 15页，3个图，代码可在https://github.com/LEI-QI-233/HAR-in-Space获取

主题：计算机视觉与模式识别 (cs.CV)
[268] arXiv:2506.02846 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： PBR-SR：基于2D图像先验的网格PBR纹理超分辨率

标题： PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors

Yujin Chen, Yinyu Nie, Benjamin Ummenhofer, Reiner Birkl, Michael Paulitsch, Matthias Nießner

评论：项目页面：https://terencecyj.github.io/projects/PBR-SR/，视频：https://youtu.be/eaM5S3Mt1RM

主题：计算机视觉与模式识别 (cs.CV)
[269] arXiv:2506.02850 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：多阶段基于事件的令牌压缩方法METok用于高效长视频理解

标题： METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding

Mengyue Wang, Shuo Chen, Kristian Kersting, Volker Tresp, Yunpu Ma

评论： 14页，10幅图

主题：计算机视觉与模式识别 (cs.CV)
[270] arXiv:2506.02853 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：学习金字塔结构的长程依赖关系用于三维人体姿态估计

标题： Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation

Mingjie Wei, Xuemei Xie, Yutong Zhong, Guangming Shi

评论：已被IEEE多媒体汇刊（TMM）接受

主题：计算机视觉与模式识别 (cs.CV)
[271] arXiv:2506.02854 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：分层自提示SAM：一种无需提示的医学图像分割框架

标题： Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework

Mengmeng Zhang, Xingyuan Dai, Yicheng Sun, Jing Wang, Yueyang Yao, Xiaoyan Gong, Fuze Cong, Feiyue Wang, Yisheng Lv

主题：计算机视觉与模式识别 (cs.CV)
[272] arXiv:2506.02857 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：增强异常识别：深度伪造检测的鲁棒分布外策略

标题： Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection

Luca Maiano, Fabrizio Casadei, Irene Amerini

主题：计算机视觉与模式识别 (cs.CV)
[273] arXiv:2506.02866 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MVTD：一个用于海上视觉目标跟踪的基准数据集

标题： MVTD: A Benchmark Dataset for Maritime Visual Object Tracking

Ahsan Baidar Bakht, Muhayy Ud Din, Sajid Javed, Irfan Hussain

评论：投稿至《自然·科学数据》

主题：计算机视觉与模式识别 (cs.CV)
[274] arXiv:2506.02868 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：基于视觉变换器和位置嵌入的泛北极永久冻土地貌与人类建筑基础设施特征检测

标题： Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings

Amal S. Perera, David Fernandez, Chandi Witharana, Elias Manos, Michael Pimenta, Anna K. Liljedahl, Ingmar Nitze, Yili Yang, Todd Nicholson, Chia-Yu Hsu, Wenwen Li, Guido Grosse

评论： 20页，两栏IEEE格式，13幅图

主题：计算机视觉与模式识别 (cs.CV)
[275] arXiv:2506.02875 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： NTIRE 2025 图像质量评估挑战赛：方法与结果

标题： NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results

Xiaohong Liu, Xiongkuo Min, Qiang Hu, Xiaoyun Zhang, Jie Guo, Guangtao Zhai, Shushi Wang, Yingjie Zhou, Lu Liu, Jingxin Li, Liu Yang, Farong Wen, Li Xu, Yanwei Jiang, Xilei Zhu, Chunyi Li, Zicheng Zhang, Huiyu Duan, Xiele Wu, Yixuan Gao, Yuqin Cao, Jun Jia, Wei Sun, Jiezhang Cao, Radu Timofte, Baojun Li, Jiamian Huang, Dan Luo, Tao Liu, Weixia Zhang, Bingkun Zheng, Junlin Chen, Ruikai Zhou, Meiya Chen, Yu Wang, Hao Jiang, Xiantao Li, Yuxiang Jiang, Jun Tang, Yimeng Zhao, Bo Hu, Zelu Qi, Chaoyang Zhang, Fei Zhao, Ping Shi, Lingzhi Fu, Heng Cong, Shuai He, Rongyu Zhang, Jiarong He, Zongyao Hu, Wei Luo, Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen, Mengjing Su, Yi Wang, Tuo Chen, Chunxiao Li, Shuaiyu Zhao, Jiaxin Wen, Chuyi Lin, Sitong Liu, Ningxin Chu, Jing Wan, Yu Zhou, Baoying Chen, Jishen Zeng, Jiarui Liu, Xianjin Liu, Xin Chen, Lanzhi Zhou, Hangyu Li, You Han, Bibo Xiang, Zhenjie Liu, Jianzhang Lu, Jialin Gui, Renjie Lu, Shangfei Wang, Donghao Zhou, Jingyu Lin, Quanjian Song, Jiancheng Huang, Yufeng Yang, Changwei Wang, Shupeng Zhong, Yang Yang, Lihuo He, Jia Liu, Yuting Xing, Tida Fang, Yuchun Jin

评论： NTIRE 2025 XGC质量评估挑战报告。arXiv管理员注：文本与arXiv:2404.16687有重叠。

主题：计算机视觉与模式识别 (cs.CV)

总共 3129 条目 : 1-100 101-200 176-275 201-300 301-400 401-500 ... 3101-3129

显示最多 100 每页条目：较少 | 更多 | 所有

计算机视觉与模式识别

2025年06月 的作者和标题

2025年06月的作者和标题