计算机视觉与模式识别

2025年06月的作者和标题

总共 3129 条目 : 1-50 101-150 151-200 201-250 251-300 301-350 351-400 401-450 ... 3101-3129

显示最多 50 每页条目：较少 | 更多 | 所有

[251] arXiv:2506.02695 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FaceSleuth：基于学习的单一方向注意力机制验证微观表情识别中的垂直优势

标题： FaceSleuth: Learning-Driven Single-Orientation Attention Verifies Vertical Dominance in Micro-Expression Recognition

Linquan Wu, Tianxiang Jiang, Wenhao Duan, Yini Fang, Jacky Keung

评论： 12页，2幅图

主题：计算机视觉与模式识别 (cs.CV)
[252] arXiv:2506.02697 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LayoutRAG：用于内容无关条件布局生成的检索增强模型

标题： LayoutRAG: Retrieval-Augmented Model for Content-agnostic Conditional Layout Generation

Yuxuan Wu, Le Wang, Sanping Zhou, Mengnan Liu, Gang Hua, Haoxiang Li

评论： 12页，5幅图

主题：计算机视觉与模式识别 (cs.CV)
[253] arXiv:2506.02698 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过重噪声反转的平滑偏好优化以对齐具有不同人类偏好的扩散模型

标题： Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences

Yunhong Lu, Qichao Wang, Hengyuan Cao, Xiaoyin Xu, Min Zhang

评论：已被ICML 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[254] arXiv:2506.02702 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ToothForge：使用同步谱嵌入的自动牙齿形状生成

标题： ToothForge: Automatic Dental Shape Generation using Synchronized Spectral Embeddings

Tibor Kubík, François Guibault, Michal Španěl, Hervé Lombaert

评论：医学影像信息处理（IPMI2025）

主题：计算机视觉与模式识别 (cs.CV)
[255] arXiv:2506.02708 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：视觉语言模型在图像评分和自我解释方面的迭代自改进

标题： Iterative Self-Improvement of Vision Language Models for Image Scoring and Self-Explanation

Naoto Tanji, Toshihiko Yamasaki

评论：录用为ICIP2025

主题：计算机视觉与模式识别 (cs.CV) ; 计算与语言 (cs.CL)
[256] arXiv:2506.02733 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LinkTo-Anime：来自3D模型渲染的2D动画光流数据集

标题： LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering

Xiaoyi Feng, Kaifeng Zou, Caichun Cen, Tao Huang, Hui Guo, Zizhou Huang, Yingli Zhao, Mingqing Zhang, Diwei Wang, Yuntao Zou, Dagang Li

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[257] arXiv:2506.02736 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： GeneA-SLAM2：具有自动编码器预处理遗传关键点重采样和深度方差引导的动态区域去除的动态SLAM

标题： GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal

Shufan Qing, Anzhen Li, Qiandi Wang, Yuefeng Niu, Mingchen Feng, Guoliang Hu, Jinqiao Wu, Fengtao Nan, Yingchun Fan

主题：计算机视觉与模式识别 (cs.CV) ; 机器人技术 (cs.RO)
[258] arXiv:2506.02738 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Open-PMC-18M：用于多模态表示学习的高保真大规模医学数据集

标题： Open-PMC-18M: A High-Fidelity Large Scale Medical Dataset for Multimodal Representation Learning

Negin Baghbanzadeh, Sajad Ashkezari, Elham Dolatabadi, Arash Afkanpour

评论： 15页

主题：计算机视觉与模式识别 (cs.CV)
[259] arXiv:2506.02741 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： VTGaussian-SLAM：具有散射视图绑定3D高斯分布的大规模场景RGBD SLAM

标题： VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians

Pengchong Hu, Zhizhong Han

评论： ICML 2025

主题：计算机视觉与模式识别 (cs.CV)
[260] arXiv:2506.02751 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：鲁棒点积：解耦密度化和动态以实现无瞬态的3DGS

标题： RobustSplat: Decoupling Densification and Dynamics for Transient-Free 3DGS

Chuanyu Fu, Yuqi Zhang, Kunbin Yao, Guanying Chen, Yuan Xiong, Chuan Huang, Shuguang Cui, Xiaochun Cao

评论： ICCV 2025。项目页面：https://fcyycf.github.io/RobustSplat/

主题：计算机视觉与模式识别 (cs.CV)
[261] arXiv:2506.02764 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：通过共享表示的自由视图和视觉搜索统一注意力建模的高效方法

标题： Unified Attention Modeling for Efficient Free-Viewing and Visual Search via Shared Representations

Fatma Youssef Mohammed, Kostas Alexis

评论：已被2025年IEEE国际发展与学习会议（ICDL）接受

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[262] arXiv:2506.02765 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：一种用于车辆检测的动态变换器网络

标题： A Dynamic Transformer Network for Vehicle Detection

Chunwei Tian, Kai Liu, Bob Zhang, Zhixiang Huang, Chia-Wen Lin, David Zhang

评论： 8页，5幅图。本文已被接受发表在《IEEE消费电子汇刊》上。

主题：计算机视觉与模式识别 (cs.CV)
[263] arXiv:2506.02781 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FreeScene：基于自由提示的3D场景合成的混合图扩散

标题： FreeScene: Mixed Graph Diffusion for 3D Scene Synthesis from Free Prompts

Tongyuan Bai, Wangyuanfan Bai, Dong Chen, Tieru Wu, Manyi Li, Rui Ma

评论：被CVPR 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[264] arXiv:2506.02783 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SAMJ：通过Segment Anything模型在ImageJ/Fiji上的快速图像标注

标题： SAMJ: Fast Image Annotation on ImageJ/Fiji via Segment Anything Model

Carlos Garcia-Lopez-de-Haro, Caterina Fuster-Barcelo, Curtis T. Rueden, Jonathan Heras, Vladimir Ulman, Daniel Franco-Barranco, Adrian Ines, Kevin W. Eliceiri, Jean-Christophe Olivo-Marin, Jean-Yves Tinevez, Daniel Sage, Arrate Munoz-Barrutia

主题：计算机视觉与模式识别 (cs.CV)
[265] arXiv:2506.02789 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：使用眼科超声视频自动测量视神经鞘直径

标题： Automated Measurement of Optic Nerve Sheath Diameter Using Ocular Ultrasound Video

Renxing Li, Weiyi Tang, Peiqi Li, Qiming Huang, Jiayuan She, Shengkai Li, Haoran Xu, Yeyun Wan, Jing Liu, Hailong Fu, Xiang Li, Jiangang Chen

评论： 17页，9幅图

主题：计算机视觉与模式识别 (cs.CV)
[266] arXiv:2506.02843 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：跨域少量学习的随机寄存器

标题： Random Registers for Cross-Domain Few-Shot Learning

Shuai Yi, Yixiong Zou, Yuhua Li, Ruixuan Li

评论：被ICML 2025接受

主题：计算机视觉与模式识别 (cs.CV)
[267] arXiv:2506.02845 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：超越地球：理解微重力环境中的人类行为与场景

标题： Go Beyond Earth: Understanding Human Actions and Scenes in Microgravity Environments

Di Wen, Lei Qi, Kunyu Peng, Kailun Yang, Fei Teng, Ao Luo, Jia Fu, Yufan Chen, Ruiping Liu, Yitian Shi, M. Saquib Sarfraz, Rainer Stiefelhagen

评论： 15页，3个图，代码可在https://github.com/LEI-QI-233/HAR-in-Space获取

主题：计算机视觉与模式识别 (cs.CV)
[268] arXiv:2506.02846 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： PBR-SR：基于2D图像先验的网格PBR纹理超分辨率

标题： PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors

Yujin Chen, Yinyu Nie, Benjamin Ummenhofer, Reiner Birkl, Michael Paulitsch, Matthias Nießner

评论：项目页面：https://terencecyj.github.io/projects/PBR-SR/，视频：https://youtu.be/eaM5S3Mt1RM

主题：计算机视觉与模式识别 (cs.CV)
[269] arXiv:2506.02850 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：多阶段基于事件的令牌压缩方法METok用于高效长视频理解

标题： METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding

Mengyue Wang, Shuo Chen, Kristian Kersting, Volker Tresp, Yunpu Ma

评论： 14页，10幅图

主题：计算机视觉与模式识别 (cs.CV)
[270] arXiv:2506.02853 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：学习金字塔结构的长程依赖关系用于三维人体姿态估计

标题： Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation

Mingjie Wei, Xuemei Xie, Yutong Zhong, Guangming Shi

评论：已被IEEE多媒体汇刊（TMM）接受

主题：计算机视觉与模式识别 (cs.CV)
[271] arXiv:2506.02854 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：分层自提示SAM：一种无需提示的医学图像分割框架

标题： Hierarchical Self-Prompting SAM: A Prompt-Free Medical Image Segmentation Framework

Mengmeng Zhang, Xingyuan Dai, Yicheng Sun, Jing Wang, Yueyang Yao, Xiaoyan Gong, Fuze Cong, Feiyue Wang, Yisheng Lv

主题：计算机视觉与模式识别 (cs.CV)
[272] arXiv:2506.02857 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：增强异常识别：深度伪造检测的鲁棒分布外策略

标题： Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection

Luca Maiano, Fabrizio Casadei, Irene Amerini

主题：计算机视觉与模式识别 (cs.CV)
[273] arXiv:2506.02866 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MVTD：一个用于海上视觉目标跟踪的基准数据集

标题： MVTD: A Benchmark Dataset for Maritime Visual Object Tracking

Ahsan Baidar Bakht, Muhayy Ud Din, Sajid Javed, Irfan Hussain

评论：投稿至《自然·科学数据》

主题：计算机视觉与模式识别 (cs.CV)
[274] arXiv:2506.02868 (交叉列表自 cs.CV) [中文pdf, pdf, 其他]: 标题：基于视觉变换器和位置嵌入的泛北极永久冻土地貌与人类建筑基础设施特征检测

标题： Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings

Amal S. Perera, David Fernandez, Chandi Witharana, Elias Manos, Michael Pimenta, Anna K. Liljedahl, Ingmar Nitze, Yili Yang, Todd Nicholson, Chia-Yu Hsu, Wenwen Li, Guido Grosse

评论： 20页，两栏IEEE格式，13幅图

主题：计算机视觉与模式识别 (cs.CV)
[275] arXiv:2506.02875 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： NTIRE 2025 图像质量评估挑战赛：方法与结果

标题： NTIRE 2025 XGC Quality Assessment Challenge: Methods and Results

Xiaohong Liu, Xiongkuo Min, Qiang Hu, Xiaoyun Zhang, Jie Guo, Guangtao Zhai, Shushi Wang, Yingjie Zhou, Lu Liu, Jingxin Li, Liu Yang, Farong Wen, Li Xu, Yanwei Jiang, Xilei Zhu, Chunyi Li, Zicheng Zhang, Huiyu Duan, Xiele Wu, Yixuan Gao, Yuqin Cao, Jun Jia, Wei Sun, Jiezhang Cao, Radu Timofte, Baojun Li, Jiamian Huang, Dan Luo, Tao Liu, Weixia Zhang, Bingkun Zheng, Junlin Chen, Ruikai Zhou, Meiya Chen, Yu Wang, Hao Jiang, Xiantao Li, Yuxiang Jiang, Jun Tang, Yimeng Zhao, Bo Hu, Zelu Qi, Chaoyang Zhang, Fei Zhao, Ping Shi, Lingzhi Fu, Heng Cong, Shuai He, Rongyu Zhang, Jiarong He, Zongyao Hu, Wei Luo, Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen, Mengjing Su, Yi Wang, Tuo Chen, Chunxiao Li, Shuaiyu Zhao, Jiaxin Wen, Chuyi Lin, Sitong Liu, Ningxin Chu, Jing Wan, Yu Zhou, Baoying Chen, Jishen Zeng, Jiarui Liu, Xianjin Liu, Xin Chen, Lanzhi Zhou, Hangyu Li, You Han, Bibo Xiang, Zhenjie Liu, Jianzhang Lu, Jialin Gui, Renjie Lu, Shangfei Wang, Donghao Zhou, Jingyu Lin, Quanjian Song, Jiancheng Huang, Yufeng Yang, Changwei Wang, Shupeng Zhong, Yang Yang, Lihuo He, Jia Liu, Yuting Xing, Tida Fang, Yuchun Jin

评论： NTIRE 2025 XGC质量评估挑战报告。arXiv管理员注：文本与arXiv:2404.16687有重叠。

主题：计算机视觉与模式识别 (cs.CV)
[276] arXiv:2506.02882 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： GaRA-SAM：使用门控秩适应增强Segment Anything模型的鲁棒性

标题： GaRA-SAM: Robustifying Segment Anything Model with Gated-Rank Adaptation

Sohyun Lee, Yeho Gwon, Lukas Hoyer, Suha Kwak

主题：计算机视觉与模式识别 (cs.CV)
[277] arXiv:2506.02891 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： OpenFace 3.0：用于全面面部行为分析的轻量级多任务系统

标题： OpenFace 3.0: A Lightweight Multitask System for Comprehensive Facial Behavior Analysis

Jiewen Hu, Leena Mathur, Paul Pu Liang, Louis-Philippe Morency

评论： IEEE FG 2025, \copyright 2025 IEEE。个人使用本文材料是允许的。但对于任何当前或未来的媒体形式，包括重新印刷/重新发布本文材料用于广告或促销目的、创建新的汇编作品、再销售或分发到服务器或列表，以及重复使用本作品中受版权保护的任何部分，必须获得IEEE的许可。

主题：计算机视觉与模式识别 (cs.CV)
[278] arXiv:2506.02893 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：密集匹配摘要以加速双视图估计

标题： Dense Match Summarization for Faster Two-view Estimation

Jonathan Astermark, Anders Heyden, Viktor Larsson

评论：已被计算机视觉与模式识别会议（CVPR）2025接受

主题：计算机视觉与模式识别 (cs.CV)
[279] arXiv:2506.02896 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FlySearch：探索视觉-语言模型如何探索

标题： FlySearch: Exploring how vision-language models explore

Adam Pardyl, Dominik Matuszek, Mateusz Przebieracz, Marek Cygan, Bartosz Zieliński, Maciej Wołczyk

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG) ; 机器人技术 (cs.RO)
[280] arXiv:2506.02914 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于标注指南的自动标注：通过3D激光雷达检测的基准测试

标题： Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection

Yechi Ma, Wei Hua, Shu Kong

主题：计算机视觉与模式识别 (cs.CV)
[281] arXiv:2506.02938 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： MIND：基于UDFs的非流形曲面重建材料界面生成

标题： MIND: Material Interface Generation from UDFs for Non-Manifold Surface Reconstruction

Xuhui Chen, Fei Hou, Wencheng Wang, Hong Qin, Ying He

主题：计算机视觉与模式识别 (cs.CV)
[282] arXiv:2506.02964 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FORLA：基于槽注意力的面向对象联邦表示学习

标题： FORLA:Federated Object-centric Representation Learning with Slot Attention

Guiqiu Liao, Matjaz Jogan, Eric Eaton, Daniel A. Hashimoto

评论： 24页，6个图

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[283] arXiv:2506.02975 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： HaploOmni：用于多模态视频理解和生成的统一单Transformer

标题： HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation

Yicheng Xiao, Lin Song, Rui Yang, Cheng Cheng, Zunnan Xu, Zhaoyang Zhang, Yixiao Ge, Xiu Li, Ying Shan

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[284] arXiv:2506.02976 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：深度学习在视网膜退行性疾病评估中的应用：MARIO 年龄相关性黄斑变性进展挑战的综合分析

标题： Deep Learning for Retinal Degeneration Assessment: A Comprehensive Analysis of the MARIO AMD Progression Challenge

Rachid Zeghlache, Ikram Brahim, Pierre-Henri Conze, Mathieu Lamard, Mohammed El Amine Lazouni, Zineb Aziza Elaouaber, Leila Ryma Lazouni, Christopher Nielsen, Ahmad O. Ahsan, Matthias Wilms, Nils D. Forkert, Lovre Antonio Budimir, Ivana Matovinović, Donik Vršnak, Sven Lončarić, Philippe Zhang, Weili Jiang, Yihao Li, Yiding Hao, Markus Frohmann, Patrick Binder, Marcel Huber, Taha Emre, Teresa Finisterra Araújo, Marzieh Oghbaie, Hrvoje Bogunović, Amerens A. Bekkers, Nina M. van Liebergen, Hugo J. Kuijf, Abdul Qayyum, Moona Mazher, Steven A. Niederer, Alberto J. Beltrán-Carrero, Juan J. Gómez-Valverde, Javier Torresano-Rodríquez, Álvaro Caballero-Sastre, María J. Ledesma Carbayo, Yosuke Yamagishi, Yi Ding, Robin Peretzke, Alexandra Ertl, Maximilian Fischer, Jessica Kächele, Sofiane Zehar, Karim Boukli Hacene, Thomas Monfort, Béatrice Cochener, Mostafa El Habib Daho, Anas-Alexis Benyoussef, Gwenolé Quellec

评论：马里奥-MICCAI-挑战赛 2024

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[285] arXiv:2506.02981 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于生成模型的天文摄影湍流抑制

标题： Astrophotography turbulence mitigation via generative models

Joonyeoup Kim, Yu Yuan, Xingguang Zhang, Xijun Wang, Stanley Chan

主题：计算机视觉与模式识别 (cs.CV) ; 图像与视频处理 (eess.IV)
[286] arXiv:2506.03007 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DFBench：大型多模态模型深度伪造图像检测能力基准测试

标题： DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models

Jiarui Wang, Huiyu Duan, Juntong Wang, Ziheng Jia, Woo Yi Yang, Xiaorong Zhu, Yu Zhao, Jiaying Qian, Yuke Xing, Guangtao Zhai, Xiongkuo Min

主题：计算机视觉与模式识别 (cs.CV)
[287] arXiv:2506.03022 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： Smartflow：实现可扩展的空间时间地理研究

标题： Smartflow: Enabling Scalable Spatiotemporal Geospatial Research

David McVicar, Brian Avant, Adrian Gould, Diego Torrejon, Charles Della Porta, Ryan Mukherjee

期刊参考：国际地球科学与遥感会议 2023

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[288] arXiv:2506.03065 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：稀疏-vDiT：释放稀疏注意力的力量以加速视频扩散变换器

标题： Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Pengtao Chen, Xianfang Zeng, Maosen Zhao, Peng Ye, Mingzhu Shen, Wei Cheng, Gang Yu, Tao Chen

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI) ; 机器学习 (cs.LG)
[289] arXiv:2506.03067 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：编辑器：文本到图像扩散模型的有效且可解释的提示反演

标题： EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models

Mingzhe Li, Gehao Zhang, Zhenting Wang, Shiqing Ma, Siqi Pan, Richard Cartwright, Juan Zhai

主题：计算机视觉与模式识别 (cs.CV)
[290] arXiv:2506.03073 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： LEG-SLAM：实时语言增强的高斯点云拼接用于SLAM

标题： LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM

Roman Titkov, Egor Zubkov, Dmitry Yudin, Jaafar Mahmoud, Malik Mohrat, Gennady Sidorov

主题：计算机视觉与模式识别 (cs.CV)
[291] arXiv:2506.03079 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：占用中心的机器人视频生成

标题： ORV: 4D Occupancy-centric Robot Video Generation

Xiuyu Yang, Bohan Li, Shaocong Xu, Nan Wang, Chongjie Ye, Zhaoxi Chen, Minghan Qin, Yikang Ding, Xin Jin, Hang Zhao, Hao Zhao

评论：项目页面: https://orangesodahub.github.io/ORV/；代码: https://github.com/OrangeSodahub/ORV

主题：计算机视觉与模式识别 (cs.CV)
[292] arXiv:2506.03082 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： SG2VID：场景图实现视频合成的精细控制

标题： SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis

Ssharvien Kumar Sivakumar, Yannik Frisch, Ghazal Ghazaei, Anirban Mukhopadhyay

主题：计算机视觉与模式识别 (cs.CV)
[293] arXiv:2506.03084 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： InterMamba：具有自适应时空马步的高效人机交互生成

标题： InterMamba: Efficient Human-Human Interaction Generation with Adaptive Spatio-Temporal Mamba

Zizhao Wu, Yingying Sun, Yiming Chen, Xiaoling Gu, Ruyu Liu, Jiazhou Chen

主题：计算机视觉与模式识别 (cs.CV)
[294] arXiv:2506.03089 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：具有神经启发前端的显式建模子皮层视觉可提高CNN鲁棒性

标题： Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness

Lucas Piper, Arlindo L. Oliveira, Tiago Marques

主题：计算机视觉与模式识别 (cs.CV) ; 神经与认知 (q-bio.NC)
[295] arXiv:2506.03096 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： FuseLIP: 通过离散标记的早期融合进行多模态嵌入

标题： FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens

Christian Schlarmann, Francesco Croce, Nicolas Flammarion, Matthias Hein

评论：代码和模型可在 https://github.com/chs20/fuselip 获取

主题：计算机视觉与模式识别 (cs.CV) ; 机器学习 (cs.LG)
[296] arXiv:2506.03097 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： EgoVLM：第一人称视频理解的策略优化

标题： EgoVLM: Policy Optimization for Egocentric Video Understanding

Ashwin Vinod, Shrey Pandit, Aditya Vavre, Linshen Liu

评论：我们的代码可以在 https://github.com/adityavavre/VidEgoVLM 找到。

主题：计算机视觉与模式识别 (cs.CV) ; 人工智能 (cs.AI)
[297] arXiv:2506.03103 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： DyTact：捕捉手-物操作中的动态接触

标题： DyTact: Capturing Dynamic Contacts in Hand-Object Manipulation

Xiaoyan Cong, Angela Xing, Chandradeep Pokhariya, Rao Fu, Srinath Sridhar

主题：计算机视觉与模式识别 (cs.CV)
[298] arXiv:2506.03107 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题： ByteMorph：基准测试带有非刚性运动的指令引导图像编辑

标题： ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions

Di Chang, Mingdeng Cao, Yichun Shi, Bo Liu, Shengqu Cai, Shijie Zhou, Weilin Huang, Gordon Wetzstein, Mohammad Soleymani, Peng Wang

评论：网站: https://boese0601.github.io/bytemorph 数据集: https://huggingface.co/datasets/ByteDance-Seed/BM-6M 评估基准: https://huggingface.co/datasets/ByteDance-Seed/BM-Bench 代码: https://github.com/ByteDance-Seed/BM-code 在线演示: https://huggingface.co/spaces/Boese0601/ByteMorph-Demo

主题：计算机视觉与模式识别 (cs.CV)
[299] arXiv:2506.03110 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：重温跨域小样本学习中图像标记的连续性

标题： Revisiting Continuity of Image Tokens for Cross-domain Few-shot Learning

Shuai Yi, Yixiong Zou, Yuhua Li, Ruixuan Li

评论：被ICML 2025（亮点论文）接受

主题：计算机视觉与模式识别 (cs.CV)
[300] arXiv:2506.03114 (交叉列表自 cs.CV) [中文pdf, pdf, html, 其他]: 标题：基于航空林地影像的零样本树检测与分割

标题： Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery

Michelle Chen, David Russell, Amritha Pallavoor, Derek Young, Jane Wu

评论：代码：https://github.com/open-forest-observatory/tree-detection-framework

主题：计算机视觉与模式识别 (cs.CV)

总共 3129 条目 : 1-50 101-150 151-200 201-250 251-300 301-350 351-400 401-450 ... 3101-3129

显示最多 50 每页条目：较少 | 更多 | 所有

计算机视觉与模式识别

2025年06月 的作者和标题

2025年06月的作者和标题