Computer Vision and Pattern Recognition

Authors and titles for recent submissions

Total of 754 entries: 1-50 151-200 201-250 251-300 301-350 351-400 401-450 451-500 ... 751-754

[301] arXiv:2507.05376 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries

Aquino Joctum, John Kandiri

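Comments: Published in the International Journal of Computer Trends and Technology (IJCTT), vol. 73, no. 6, 2024. The final version of record is available at: https://doi.org/10.14445/22312803/IJCTT-V73I6P108

Journal-ref: International Journal of Computer Trends and Technology (IJCTT), vol. 73, no. 6, pp. 58-74, 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302] arXiv:2507.05302 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: CorrDetail: Visual Detail Enhanced Self-Correction for Face Forgery Detection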

Binjia Zhou, Hengrui Lou, Lizhe Chen, Haoyuan Li, Dawei Luo, Shuai Chen, Jie Lei, Zunlei Feng, Yijun Bei

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[303] arXiv:2507.05300 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Structured Captions Improve Prompt Adherence in Text-to-Image Models (Re-LAION-Caption 19M)

Nicholas Merchant, Haitz Sáez de Ocáriz Borde, Andrei Cristian Popescu, Carlos Garcia Jurado Suarez

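Comments: 7-page main paper + appendix, 18 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[304] arXiv:2507.06167 (cross-list from cs.CL) [Chinese pdf, pdf, other]

Title: Skywork-R1V3 Technical Report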

Wei Shen, Jiangbo Pei, Yi Peng, Xuchen Song, Yang Liu, Jian Peng, Haofeng Sun, Yunzhuo Hao, Peiyu Wang, Jianhao Zhang, Yahui Zhou

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[305] arXiv:2507.06140 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: LangMamba: A Language-driven Mamba Framework for Low-dose CT Denoising with Vision-language Models

Zhihao Chen, Tao Chen, Chenhui Wang, Qi Gao, Huidong Xie, Chuang Niu, Ge Wang, Hongming Shan

Comments: 11 pages, 8 figures

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[306] arXiv:2507.06137 (cross-list from cs.CL) [Chinese pdf, pdf, html, other]

Title: NeoBabel: A Multilingual Open Tower for Visual Generation

Mohammad Mahdi Derakhshani, Dheeraj Varghese, Marzieh Fadaee, Cees G. M. Snoek

Comments: 34 pages, 12 figures

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[307] arXiv:2507.06109 (cross-list from cs.GR) [Chinese pdf, pdf, html, other]

Title: LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures

Seungoh Han, Jaehoon Jang, Hyunsu Kim, Jaeheung Surh, Junhyung Kwak, Hyowon Ha, Kyungdon Joo

Comments: Preprint

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[308] arXiv:2507.06067 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Enhancing Synthetic CT from CBCT via Multimodal Fusion and End-To-End Registration

Maximilian Tschuchnig, Lukas Lamminger, Philipp Steininger, Michael Gadermayr

Comments: Accepted at CAIP 2025. arXiv admin note: substantial text overlap with arXiv:2506.08716

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[309] arXiv:2507.06011 (cross-list from cs.DC) [Chinese pdf, pdf, html, other]

Title: ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge

Daghash K. Alqahtani, Maria A. Rodriguez, Muhammad Aamir Cheema, Hamid Rezatofighi, Adel N. Toosi

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2507.05932 (cross-list from cs.SE) [Chinese pdf, pdf, html, other]

Title: TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems

You Lu, Dingji Wang, Kaifeng Huang, Bihuan Chen, Xin Peng

Subjects: Software Engineering (cs.SE); Computer Vision and Pattern Recognition (cs.CV)
[311] arXiv:2507.05883 (cross-list from eess.IV) [Chinese pdf, pdf, other]

Title: A novel framework for fully-automated co-registration of intravascular ultrasound and optical coherence tomography imaging data

Xingwei He, Kit Mills Bransby, Ahmet Emir Ulutas, Thamil Kumaran, Nathan Angelo Lecaros Yap, Gonul Zeren, Hesong Zeng, Yaojun Zhang, Andreas Baumbach, James Moon, Anthony Mathur, Jouke Dijkstra, Qianni Zhang, Lorenz Raber, Christos V Bourantas

Comments: Preprint

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[312] arXiv:2507.05823 (cross-list from cs.LG) [Chinese pdf, pdf, html, other]

Title: Fair Domain Generalization: An Information-Theoretic View

Tangzheng Lian, Guanyu Hu, Dimitrios Kollias, Xinyu Yang, Oya Celiktutan

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[313] arXiv:2507.05810 (cross-list from cs.LG) [Chinese pdf, pdf, html, other]

Title: Concept-Based Mechanistic Interpretability Using Structured Knowledge Graphs

Sofiia Chorna, Kateryna Tarelkina, Eloïse Berthier, Gianni Franchi

Comments: 15 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[314] arXiv:2507.05742 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Tissue Concepts v2: A Supervised Foundation Model For Whole Slide Images

Till Nicke, Daniela Schacherer, Jan Raphael Schäfer, Natalia Artysh, Antje Prasse, André Homeyer, Andrea Schenk, Henning Höfener, Johannes Lotz

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[315] arXiv:2507.05661 (cross-list from cs.RO) [Chinese pdf, pdf, other]

Title: 3DGS_LSR: Large_Scale Relocation for Autonomous Driving Based on 3D Gaussian Splatting

Haitao Lu, Haijier Chen, Haoze Liu, Shoujian Zhang, Bo Xu, Ziao Liu

Comments: 13 pages, 7 figures, 4 tables

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2507.05656 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: ADPv2: A Hierarchical Histological Tissue Type-Annotated Dataset for Potential Biomarker Discovery of Colorectal Disease

Zhiyuan Yang, Kai Li, Sophia Ghamoshi Ramandi, Patricia Brassard, Hakim Khellaf, Vincent Quoc-Huy Trinh, Jennifer Zhang, Lina Chen, Corwyn Rowsell, Sonal Varma, Kostas Plataniotis, Mahdi S. Hosseini

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[317] arXiv:2507.05647 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Diffusion-Based Limited-Angle CT Reconstruction under Noisy Conditions

Jiaqi Guo, Santiago López-Tapia

Comments: Accepted at the 2025 IEEE International Conference on Image Processing (ICIP) Workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2507.05627 (cross-list from cs.RO) [Chinese pdf, pdf, html, other]

Title: DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation

Young Hun Kim, Seungyeon Kim, Yonghyeon Lee, Frank Chongwoo Park

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2507.05582 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Learning Segmentation from Radiology Reports

Pedro R. A. S. Bassi, Wenxuan Li, Jieneng Chen, Zheren Zhu, Tianyu Lin, Sergio Decherchi, Andrea Cavalli, Kang Wang, Yang Yang, Alan L. Yuille, Zongwei Zhou

Comments: Accepted to MICCAI 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2507.05515 (cross-list from cs.AI) [Chinese pdf, pdf, html, other]

Title: Fine-Grained Vision-Language Modeling for Multimodal Training Assistants in Augmented Reality

Haochen Huang, Jiahuan Pei, Mohammad Aliannejadi, Xin Sun, Moonisa Ahsan, Pablo Cesar, Chuang Yu, Zhaochun Ren, Junxiao Wang

Comments: 20 pages

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[321] arXiv:2507.05451 (cross-list from eess.IV) [Chinese pdf, pdf, other]

Title: Self-supervised Deep Learning for Denoising in Ultrasound Microvascular Imaging

Lijie Huang, Jingyi Yin, Jingke Zhang, U-Wai Lok, Ryan M. DeRuiter, Jieyang Jin, Kate M. Knoll, Kendra E. Petersen, James D. Krier, Xiang-yang Zhu, Gina K. Hesley, Kathryn A. Robinson, Andrew J. Bentall, Thomas D. Atwell, Andrew D. Rule, Lilach O. Lerman, Shigao Chen, Chengwu Huang

Comments: 12 pages, 10 figures. Supplementary materials are available at https://zenodo.org/records/15832003

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[322] arXiv:2507.05447 (cross-list from cs.HC) [Chinese pdf, pdf, html, other]

Title: NRXR-ID: Two-Factor Authentication (2FA) in VR Using Near-Range Extended Reality and Smartphones

Aiur Nanzatov, Lourdes Peña-Castillo, Oscar Meruvia-Pastor

Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[323] arXiv:2507.05317 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: PWD: Prior-Guided and Wavelet-Enhanced Diffusion Model for Limited-Angle CT

Yi Liu, Yiyang Wen, Zekun Zhou, Junqi Ma, Linghang Wang, Yucheng Yao, Liu Shi, Qiegen Liu

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[324] arXiv:2507.05315 (cross-list from cs.LG) [Chinese pdf, pdf, html, other]

Title: Conditional Graph Neural Network for Predicting Soft Tissue Deformation and Forces

Madina Kojanazarova, Florentin Bieder, Robin Sandkühler, Philippe C. Cattin

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2507.05314 (cross-list from eess.IV) [Chinese pdf, pdf, html, other]

Title: Dual-Attention U-Net++ with Class-Specific Ensembles and Bayesian Hyperparameter Optimization for Precise Wound and Scale Marker Segmentation

Daniel Cieślak, Miriam Reca, Olena Onyshchenko, Jacek Rumiński

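Comments: 11 pages, conference: Joint 20th Nordic-Baltic Conference on Biomedical Engineering and 24th Polish Conference on Biocybernetics and Biomedical Engineering; 6 figures, 2 tables, 11 references

Journal-ref: Joint Conference NBC 2025 and PCBBE 2025, June 16-18, 2025, Warsaw, Poland

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2507.05304 (cross-list from cs.GR) [Chinese pdf, pdf, other]

Title: Self-Attention Based Multi-Scale Graph Auto-Encoder Network of 3D Meshes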

Saqib Nazir, Olivier Lézoray, Sébastien Bougleux (UNICAEN)

Journal-ref: International Joint Conference on Neural Networks, June 2025, Rome, Italy

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[327] arXiv:2507.05268 (cross-list from q-bio.NC) [Chinese pdf, pdf, html, other]

Title: Cross-Subject DD: A Cross-Subject Brain-Computer Interface Algorithm

Xiaoyuan Li, Xinru Xue, Bohan Zhang, Ye Sun, Shoushuo Xi, Gang Liu

Comments: 20 pages, 9 figures

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)

[328] arXiv:2507.05260 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations

Xiang Xu, Lingdong Kong, Song Wang, Chuanwei Zhou, Qingshan Liu

Comments: ICCV 2025; 26 pages, 12 figures, 10 tables; code at http://github.com/Xiangxu-0103/LiMA

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[329] arXiv:2507.05259 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing

Chun-Hsiao Yeh, Yilin Wang, Nanxuan Zhao, Richard Zhang, Yuheng Li, Yi Ma, Krishna Kumar Singh

Comments: Project page: https://danielchyeh.github.io/x-planner/

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2507.05258 (cross-list from cs.CV) [Chinese pdf, pdf, other]

Title: Spatio-Temporal LLM: Reasoning about Environments and Actions

Haozhen Zheng, Beitong Tian, Mingyuan Wu, Zhenggang Tang, Klara Nahrstedt, Alex Schwing

Comments: Code and data are available at https://zoezheng126.github.io/STLLM-website/

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[331] arXiv:2507.05256 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Jiahao Zhu, Zixuan Chen, Guangcong Wang, Xiaohua Xie, Yi Zhou

Comments: Accepted by ICCV 2025, project page: https://zjhjojo.github.io/

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332] arXiv:2507.05255 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning

Yana Wei, Liang Zhao, Jianjian Sun, Kangheng Lin, Jisheng Yin, Jingcheng Hu, Yinmin Zhang, En Yu, Haoran Lv, Zejia Weng, Jia Wang, Chunrui Han, Yuang Peng, Qi Han, Zheng Ge, Xiangyu Zhang, Daxin Jiang, Vishal M. Patel

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[333] arXiv:2507.05254 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: From Marginal to Joint Predictions: Evaluating Scene-Consistent Trajectory Prediction Approaches for Automated Driving

Fabian Konstantinidis, Ariel Dallari Guerreiro, Raphael Trumpp, Moritz Sackmann, Ulrich Hofmann, Marco Caccamo, Christoph Stiller

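Comments: Accepted at the International Conference on Intelligent Transportation Systems 2025 (ITSC 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Robotics (cs.RO)
[334] arXiv:2507.05249 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Physics-Guided Dual Implicit Neural Representations for Source Separation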

Yuan Ni, Zhantao Chen, Alexander N. Petsch, Edmund Xu, Cheng Peng, Alexander I. Kolesnikov, Sugata Chowdhury, Arun Bansil, Jana B. Thayer, Joshua J. Turner

Subjects: Computer Vision and Pattern Recognition (cs.CV); Strongly Correlated Electrons (cond-mat.str-el); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an)
[335] arXiv:2507.05229 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Self-Supervised Real-Time Tracking of Military Vehicles in Low-FPS UAV Footage

Markiyan Kostiv, Anatolii Adamovskyi, Yevhen Cherniavskyi, Mykyta Varenyk, Ostap Viniavskyi, Igor Krashenyi, Oles Dobosevych

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2507.05221 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: CTA: Cross-Task Alignment for Better Test Time Training

Samuel Barbeau, Pedram Fekri, David Osowiechi, Ali Bahri, Moslem Yazdanpanah, Masih Aminbeidokhti, Christian Desrosiers

Comments: Preprint, under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[337] arXiv:2507.05211 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: All in One: Visual-Description-Guided Unified Point Cloud Segmentation

Zongyan Han, Mohamed El Amine Boudjoghra, Jiahua Dong, Jinhong Wang, Rao Muhammad Anwer

Comments: Accepted by ICCV2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[338] arXiv:2507.05189 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Satellite-based Rabi rice paddy field mapping in India: a case study on Telangana state

Prashanth Reddy Putta, Fabio Dell'Acqua (University of Pavia)

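Comments: 60 pages, 17 figures. Intended for submission to Remote Sensing Applications: Society and Environment (RSASE). Funded by the European Union (NextGenerationEU), Mission 4 Component 1.5.

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2507.05184 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: $\varphi$-Adapt: A Physics-Informed Adaptation Learning Approach to 2D Quantum Material Discovery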

Hoang-Quan Nguyen, Xuan Bac Nguyen, Sankalp Pandey, Tim Faltermeier, Nicholas Borys, Hugh Churchill, Khoa Luu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[340] arXiv:2507.05173 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Semantic Frame Interpolation

Yijia Hong, Jiangning Zhang, Ran Yi, Yuji Wang, Weijian Cao, Xiaobin Hu, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lizhuang Ma

Comments: https://github.com/hyj542682306/语义帧插值

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2507.05165 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Differential Attention for Multimodal Crisis Event Analysis

Nusrat Munia, Junfeng Zhu, Olfa Nasraoui, Abdullah-Al-Zubaer Imran

Comments: Presented at CVPRw 2025, MMFM3

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2507.05163 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: 4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture

Yutian Chen, Shi Guo, Tianshuo Yang, Lihe Ding, Xiuyuan Yu, Jinwei Gu, Tianfan Xue

Comments: Webpage: https://openimaginglab.github.io/4DSloMo/

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2507.05162 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: LAID: Lightweight AI-Generated Image Detection in Spatial and Spectral Domains

Nicholas Chivaran, Jianbing Ni

Comments: To appear in the proceedings of PST2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[344] arXiv:2507.05146 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems

Aadi Srivastava, Vignesh Natarajkumar, Utkarsh Bheemanaboyna, Devisree Akashapu, Nagraj Gaonkar, Archit Joshi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[345] arXiv:2507.05116 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: VOTE: Vision-Language-Action Optimization with Trajectory Ensemble Voting

Juyi Lin, Amir Taherin, Arash Akbari, Arman Akbari, Lei Lu, Guangyu Chen, Taskin Padir, Xiaomeng Yang, Weiwei Chen, Yiqian Li, Xue Lin, David Kaeli, Pu Zhao, Yanzhi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[346] arXiv:2507.05108 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration

Yuyi Zhang, Peirong Zhang, Zhenhua Yang, Pengyu Yan, Yongxin Shi, Pengwei Liu, Fengjun Guo, Lianwen Jin

Journal-ref: ACL 2025 Main Conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[347] arXiv:2507.05092 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: MoDiT: Learning Highly Consistent 3D Motion Coefficients with Diffusion Transformer for Talking Head Generation

Yucheng Wang, Dan Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2507.05068 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: ICAS: Detecting Training Data from Autoregressive Image Generative Models

Hongyao Yu, Yixiang Qiu, Yiheng Yang, Hao Fang, Tianqu Zhuang, Jiaxin Hong, Bin Chen, Hao Wu, Shu-Tao Xia

Comments: ACM MM 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
[349] arXiv:2507.05063 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics

Jan Carreras Boada, Rao Muhammad Umer, Carsten Marr

Comments: 8 pages, 6 figures, 2 tables. Final Degree Project (TFG) submitted at ESCI-UPF and conducted at Helmholtz Munich

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[350] arXiv:2507.05056 (cross-list from cs.CV) [Chinese pdf, pdf, html, other]

Title: INTER: Mitigating Hallucination in Large Vision-Language Models by Interaction Guidance Sampling

Xin Dong, Shichao Dong, Jin Wang, Jing Huang, Li Zhou, Zenghui Sun, Lihua Jing, Jingsong Lan, Xiaoyong Zhu, Bo Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)

Wed, 9 Jul 2025 (continued, showing last 27 of 105 entries)

Tue, 8 Jul 2025 (showing first 23 of 328 entries)