CenXiv.org

All available Chinese PDFs

Note: All Chinese full-text PDFs on this website are translated by AI and may contain translation errors. We are continuously improving them. If in doubt, please refer to the original document. Please go here to report issues and suggest improvements.

Total of 10728 entries : 1-100 101-200 201-300 301-400 ... 10701-10728
Showing up to 100 entries per page: fewer | more | all
[1] arXiv:2509.15225 [cn-pdf, pdf]
Title: Lost in Translation? Vocabulary Alignment for Source-Free Domain Adaptation in Open-Vocabulary Semantic Segmentation
Title: 翻译中的迷失? 用于开放词汇语义分割的无源域适应词汇对齐
Silvio Mazzucco, Carl Persson, Mattia Segu, Pier Luigi Dovesi, Federico Tombari, Luc Van Gool, Matteo Poggi
Comments: BMVC 2025 - Project Page: https://thegoodailab.org/blog/vocalign - Code: https://github.com/Sisso16/VocAlign
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[2] arXiv:2509.15192 [cn-pdf, pdf]
Title: Channel Prediction under Network Distribution Shift Using Continual Learning-based Loss Regularization
Title: 基于持续学习的损失正则化的网络分布偏移下的信道预测
Muhammad Ahmed Mohsin, Muhammad Umer, Ahsan Bilal, Muhammad Ibtsaam Qadir, Muhammad Ali Jamshed, Dean F. Hougen, John M. Cioffi
Comments: ICASSP 2026
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
[3] arXiv:2509.15182 [cn-pdf, pdf]
Title: Conditional Prior-based Non-stationary Channel Estimation Using Accelerated Diffusion Models
Title: 基于条件先验的非平稳信道估计使用加速扩散模型
Muhammad Ahmed Mohsin, Ahsan Bilal, Muhammad Umer, Asad Aali, Muhammad Ali Jamshed, Dean F. Hougen, John M. Cioffi
Comments: ICASSP 2026
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
[4] arXiv:2509.15167 [cn-pdf, pdf]
Title: Semi-Supervised 3D Medical Segmentation from 2D Natural Images Pretrained Model
Title: 从2D自然图像预训练模型进行半监督的3D医学分割
Pak-Hei Yeung, Jayroop Ramesh, Pengfei Lyu, Ana Namburete, Jagath Rajapakse
Comments: Machine Learning in Medical Imaging (MLMI) 2025 Oral
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Artificial Intelligence (cs.AI) ; Machine Learning (cs.LG)
[5] arXiv:2509.15160 [cn-pdf, pdf]
Title: An Evaluation-Centric Paradigm for Scientific Visualization Agents
Title: 以评估为中心的科学可视化代理范式
Kuangshi Ai, Haichao Miao, Zhimin Li, Chaoli Wang, Shusen Liu
Journal-ref: 1st Workshop on GenAI, Agents, and the Future of VIS (IEEE VIS Conference 2025)
Subjects: Human-Computer Interaction (cs.HC) ; Computation and Language (cs.CL) ; Graphics (cs.GR)
[6] arXiv:2509.15157 [cn-pdf, pdf]
Title: Mind the Gap: Data Rewriting for Stable Off-Policy Supervised Fine-Tuning
Title: 注意差距:稳定离策略监督微调的数据重写
Shiwan Zhao, Xuyang Zhao, Jiaming Zhou, Aobo Kong, Qicheng Li, Yong Qin
Subjects: Machine Learning (cs.LG) ; Computation and Language (cs.CL)
[7] arXiv:2509.15154 [cn-pdf, pdf]
Title: MedFact-R1: Towards Factual Medical Reasoning via Pseudo-Label Augmentation
Title: MedFact-R1:通过伪标签增强实现事实性医学推理
Gengliang Li, Rongyu Chen, Bin Li, Linlin Yang, Guodong Ding
Comments: Tech report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8] arXiv:2509.15152 [cn-pdf, pdf]
Title: Asymptotic Study of In-context Learning with Random Transformers through Equivalent Models
Title: 基于等效模型的上下文学习的渐近研究与随机变换器
Samet Demir, Zafer Dogan
Comments: MLSP 2025, 6 pages 2 figures
Subjects: Machine Learning (stat.ML) ; Machine Learning (cs.LG)
[9] arXiv:2509.15151 [cn-pdf, pdf]
Title: Exploring How Audio Effects Alter Emotion with Foundation Models
Title: 探索基础模型如何改变情感的音频效果
Stelios Katsis, Vassilis Lyberatos, Spyridon Kantarelis, Edmund Dervakos, Giorgos Stamou
Subjects: Sound (cs.SD) ; Artificial Intelligence (cs.AI)
[10] arXiv:2509.15147 [cn-pdf, pdf]
Title: Who to Trust? Aggregating Client Knowledge in Logit-Based Federated Learning
Title: 谁可以信任? 基于Logit的联邦学习中聚合客户端知识
Viktor Kovalchuk, Nikita Kotelevskii, Maxim Panov, Samuel Horváth, Martin Takáč
Subjects: Machine Learning (cs.LG)
[11] arXiv:2509.15141 [cn-pdf, pdf]
Title: Benefits of Online Tilted Empirical Risk Minimization: A Case Study of Outlier Detection and Robust Regression
Title: 在线倾斜经验风险最小化的优点:异常值检测与稳健回归的案例研究
Yigit E. Yildirim, Samet Demir, Zafer Dogan
Comments: MLSP 2025, 6 pages, 3 figures
Subjects: Machine Learning (stat.ML) ; Machine Learning (cs.LG)
[12] arXiv:2509.15140 [cn-pdf, pdf]
Title: FCPE: A Fast Context-based Pitch Estimation Model
Title: FCPE:一种快速基于上下文的音高估计模型
Yuxin Luo, Ruoyi Zhang, Lu-Chuan Liu, Tianyu Li, Hangyu Liu
Comments: Under review
Subjects: Sound (cs.SD) ; Computation and Language (cs.CL)
[13] arXiv:2509.15136 [cn-pdf, pdf]
Title: Nonlinear Cooperative Salvo Guidance with Seeker-Limited Interceptors
Title: 非线性协同齐射制导与导引头有限的拦截器
Lohitvel Gopikannan, Shashi Ranjan Kumar, Abhinav Sinha
Subjects: Systems and Control (eess.SY) ; Multiagent Systems (cs.MA) ; Robotics (cs.RO)
[14] arXiv:2509.15129 [cn-pdf, pdf]
Title: Doppler Radiance Field-Guided Antenna Selection for Improved Generalization in Multi-Antenna Wi-Fi-based Human Activity Recognition
Title: 多普勒辐射场引导的天线选择用于多天线Wi-Fi人体活动识别中的泛化改进
Navid Hasanzadeh, Shahrokh Valaee
Subjects: Signal Processing (eess.SP) ; Computer Vision and Pattern Recognition (cs.CV)
[15] arXiv:2509.15127 [cn-pdf, pdf]
Title: Learning Rate Should Scale Inversely with High-Order Data Moments in High-Dimensional Online Independent Component Analysis
Title: 学习率应在高维在线独立成分分析中与高阶数据矩成反比变化
M. Oguzhan Gultekin, Samet Demir, Zafer Dogan
Comments: MLSP 2025, 6 pages, 3 figures
Subjects: Machine Learning (stat.ML) ; Machine Learning (cs.LG)
[16] arXiv:2509.15124 [cn-pdf, pdf]
Title: Learning Mechanistic Subtypes of Neurodegeneration with a Physics-Informed Variational Autoencoder Mixture Model
Title: 用物理信息变分自编码器混合模型学习神经退行性机制亚型
Sanduni Pinnawala, Annabelle Hartanto, Ivor J. A. Simpson, Peter A. Wijeratne
Comments: 13 pages, 5 figures, accepted at SASHIMI workshop, MICCAI 2025
Subjects: Image and Video Processing (eess.IV) ; Computer Vision and Pattern Recognition (cs.CV) ; Machine Learning (cs.LG)
[17] arXiv:2509.15120 [cn-pdf, pdf]
Title: Efficient Conformal Prediction for Regression Models under Label Noise
Title: 标签噪声下的回归模型高效共形预测
Yahav Cohen, Jacob Goldberger, Tom Tirer
Subjects: Machine Learning (cs.LG)
[18] arXiv:2509.15116 [cn-pdf, pdf]
Title: The mechanization of science illustrated by the Lean formalization of the multi-graded Proj construction
Title: 科学的机械化通过多分级 Proj 构造的 Lean 形式化来说明
Arnaud Mayeux, Jujian Zhang
Comments: Short note
Subjects: Logic in Computer Science (cs.LO) ; Artificial Intelligence (cs.AI) ; Algebraic Geometry (math.AG)
[19] arXiv:2509.15099 [cn-pdf, pdf]
Title: Digital Twin-based Cooperative Autonomous Driving in Smart Intersections: A Multi-Agent Reinforcement Learning Approach
Title: 基于数字孪生的智能交叉口协同自动驾驶:一种多智能体强化学习方法
Taoyuan Yu, Kui Wang, Zongdian Li, Tao Yu, Kei Sakaguchi, Walid Saad
Subjects: Systems and Control (eess.SY)
[20] arXiv:2509.15095 [cn-pdf, pdf]
Title: Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs
Title: 聆听、想象与优化:一种基于大语言模型的启发式自动语音识别纠正框架
Yutong Liu, Ziyue Zhang, Yongbin Yu, Xiangxiang Wang, Yuqing Cai, Nyima Tashi
Subjects: Audio and Speech Processing (eess.AS) ; Artificial Intelligence (cs.AI)
[21] arXiv:2509.15085 [cn-pdf, pdf]
Title: Real-Time Streaming Mel Vocoding with Generative Flow Matching
Title: 基于生成流匹配的实时流式梅尔声码器
Simon Welker, Tal Peer, Timo Gerkmann
Comments: (C) 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Audio and Speech Processing (eess.AS) ; Machine Learning (cs.LG) ; Sound (cs.SD) ; Signal Processing (eess.SP)
[22] arXiv:2509.15084 [cn-pdf, pdf]
Title: From Sea to System: Exploring User-Centered Explainable AI for Maritime Decision Support
Title: 从海洋到系统:探索以用户为中心的可解释人工智能在海事决策支持中的应用
Doreen Jirak, Pieter Maes, Armeen Saroukanoff, Dirk van Rooy
Comments: Paper accepted at Human Learning and Decision-Making Workshop @ECML-PKDD Conference 2025, Porto, Portugal
Subjects: Artificial Intelligence (cs.AI) ; Computers and Society (cs.CY) ; Human-Computer Interaction (cs.HC)
[23] arXiv:2509.15083 [cn-pdf, pdf]
Title: Transplant-Ready? Evaluating AI Lung Segmentation Models in Candidates with Severe Lung Disease
Title: 移植可用? 评估严重肺部疾病患者中的AI肺分割模型
Jisoo Lee, Michael R. Harowicz, Yuwen Chen, Hanxue Gu, Isaac S. Alderete, Lin Li, Maciej A. Mazurowski, Matthew G. Hartwig
Comments: 24 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2509.15072 [cn-pdf, pdf]
Title: Improving Internet Traffic Matrix Prediction via Time Series Clustering
Title: 通过时间序列聚类改进互联网流量矩阵预测
Martha Cash, Alexander Wyglinski
Comments: Accepted to ICMLA 2025
Subjects: Machine Learning (cs.LG)
[25] arXiv:2509.15062 [cn-pdf, pdf]
Title: Energy-Constrained Navigation for Planetary Rovers under Hybrid RTG-Solar Power
Title: 行星探测车在混合RTG-太阳能供电下的能量约束导航
Tianxin Hu, Weixiang Guo, Ruimeng Liu, Xinhang Xu, Rui Qian, Jinyu Chen, Shenghai Yuan, Lihua Xie
Subjects: Robotics (cs.RO)
[26] arXiv:2509.15048 [cn-pdf, pdf]
Title: Can maiBERT Speak for Maithili?
Title: 可以使用maiBERT表示马尔蒂利语吗?
Sumit Yadav, Raju Kumar Yadav, Utsav Maskey, Gautam Siddharth Kashyap, Md Azizul Hoque, Ganesh Gautam
Comments: Preprint
Subjects: Computation and Language (cs.CL)
[27] arXiv:2509.15042 [cn-pdf, pdf]
Title: Reinforcement Learning Agent for a 2D Shooter Game
Title: 用于2D射击游戏的强化学习智能体
Thomas Ackermann, Moritz Spang, Hamza A. A. Gardi
Subjects: Machine Learning (cs.LG) ; Artificial Intelligence (cs.AI)
[28] arXiv:2509.15036 [cn-pdf, pdf]
Title: NEURAL: An Elastic Neuromorphic Architecture with Hybrid Data-Event Execution and On-the-fly Attention Dataflow
Title: 神经:一种具有混合数据-事件执行和实时注意力数据流的弹性类脑架构
Yuehai Chen, Farhad Merchant
Comments: Accepted by ASP-DAC 2026; 7 pages, 10 figures
Subjects: Hardware Architecture (cs.AR)
[29] arXiv:2509.15032 [cn-pdf, pdf]
Title: Sample Efficient Experience Replay in Non-stationary Environments
Title: 样本高效的非平稳环境经验回放
Tianyang Duan, Zongyuan Zhang, Songxiao Guo, Yuanye Zhao, Zheng Lin, Zihan Fang, Yi Liu, Dianxin Luan, Dong Huang, Heming Cui, Yong Cui
Comments: 5 pages, 3 figures
Subjects: Machine Learning (cs.LG) ; Artificial Intelligence (cs.AI) ; Networking and Internet Architecture (cs.NI)
[30] arXiv:2509.15026 [cn-pdf, pdf]
Title: Undersampled Phase Retrieval with Image Priors
Title: 带有图像先验的欠采样相位恢复
Stanislas Ducotterd, Zhiyuan Hu, Michael Unser, Jonathan Dong
Subjects: Image and Video Processing (eess.IV) ; Machine Learning (cs.LG)
[31] arXiv:2509.15001 [cn-pdf, pdf]
Title: BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings
Title: BabyHuBERT:多语言自监督学习用于以儿童为中心的长格式录音中的说话人分割
Théo Charlot, Tarek Kunze, Maxime Poli, Alejandrina Cristia, Emmanuel Dupoux, Marvin Lavechin
Comments: 5 pages, 1 figure
Subjects: Audio and Speech Processing (eess.AS) ; Machine Learning (cs.LG) ; Sound (cs.SD)
[32] arXiv:2509.14987 [cn-pdf, pdf]
Title: Blockchain-Enabled Explainable AI for Trusted Healthcare Systems
Title: 基于区块链的可解释人工智能用于可信医疗系统
Md Talha Mohsin
Comments: 6 Pages, 4 Figures
Journal-ref: 2nd International Conference on Electrical and Computer Engineering Researches (ICECER), 2025
Subjects: Cryptography and Security (cs.CR) ; Artificial Intelligence (cs.AI) ; Machine Learning (cs.LG)
[33] arXiv:2509.14976 [cn-pdf, pdf]
Title: Dynamics of conductive nonmagnetic objects in presence of the Lenz effect
Title: 传导非磁性物体在楞次效应存在下的动力学
Alessandro Arduino, Oriano Bottauscio, Michael Steckner, Umberto Zanovello, Luca Zilberti
Comments: 10 pages, 5 figures
Subjects: Computational Engineering, Finance, and Science (cs.CE)
[34] arXiv:2509.14975 [cn-pdf, pdf]
Title: Beyond Random Masking: A Dual-Stream Approach for Rotation-Invariant Point Cloud Masked Autoencoders
Title: 超越随机遮蔽:一种双流方法用于旋转不变点云遮蔽自编码器
Xuanhua Yin, Dingxin Zhang, Yu Feng, Shunqi Mao, Jianhui Yu, Weidong Cai
Comments: 8 pages, 4 figures, accepted by DICTA 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2509.14967 [cn-pdf, pdf]
Title: Affordance-Based Disambiguation of Surgical Instructions for Collaborative Robot-Assisted Surgery
Title: 基于功能的外科手术指令消歧用于协作机器人辅助手术
Ana Davila, Jacinto Colan, Yasuhisa Hasegawa
Comments: To be presented at the 1st Workshop on Intelligent Cobodied Assistance and Robotic Empowerment (iCARE). 2025 Conference on Robot Learning (CoRL)
Subjects: Robotics (cs.RO) ; Human-Computer Interaction (cs.HC)
[36] arXiv:2509.14965 [cn-pdf, pdf]
Title: Brain-HGCN: A Hyperbolic Graph Convolutional Network for Brain Functional Network Analysis
Title: Brain-HGCN:用于脑功能网络分析的双曲图卷积网络
Junhao Jia, Yunyou Liu, Cheng Yang, Yifei Sun, Feiwei Qin, Changmiao Wang, Yong Peng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2509.14959 [cn-pdf, pdf]
Title: Discrete optimal transport is a strong audio adversarial attack
Title: 离散最优传输是一种强大的音频对抗攻击
Anton Selitskiy, Akib Shahriyar, Jishnuraj Prakasan
Subjects: Audio and Speech Processing (eess.AS) ; Artificial Intelligence (cs.AI)
[38] arXiv:2509.14954 [cn-pdf, pdf]
Title: Exploratory Movement Strategies for Texture Discrimination with a Neuromorphic Tactile Sensor
Title: 基于类脑触觉传感器的纹理辨识探索性运动策略
Xingchen Xu, Ao Li, Benjamin Ward-Cherrier
Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems 2025. Please cite the proceedings version
Subjects: Robotics (cs.RO)
[39] arXiv:2509.14949 [cn-pdf, pdf]
Title: Human Interaction for Collaborative Semantic SLAM using Extended Reality
Title: 基于扩展现实的协作语义SLAM的人机交互
Laura Ribeiro, Muhammad Shaheer, Miguel Fernandez-Cortizas, Ali Tourani, Holger Voos, Jose Luis Sanchez-Lopez
Comments: 7 pages, 5 figures, 3 tables
Subjects: Robotics (cs.RO) ; Human-Computer Interaction (cs.HC)
[40] arXiv:2509.14946 [cn-pdf, pdf]
Title: SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding
Title: SynParaSpeech:用于语音生成和理解的副语言数据集自动合成
Bingsong Bai, Qihang Lu, Wenbing Yang, Zihan Sun, YueRan Hou, Peilei Jia, Songbai Pu, Ruibo Fu, Yingming Gao, Ya Li, Jun Gao
Comments: submitted to ICASSP 2026
Subjects: Audio and Speech Processing (eess.AS) ; Computation and Language (cs.CL)
[41] arXiv:2509.14944 [cn-pdf, pdf]
Title: Estimating Respiratory Effort from Nocturnal Breathing Sounds for Obstructive Sleep Apnoea Screening
Title: 从夜间呼吸声音估计呼吸努力用于阻塞性睡眠呼吸暂停筛查
Xiaolei Xu, Chaoyue Niu, Guy J. Brown, Hector Romero, Ning Ma
Comments: Submitted to ICASSP 2026
Subjects: Sound (cs.SD) ; Artificial Intelligence (cs.AI) ; Audio and Speech Processing (eess.AS)
[42] arXiv:2509.14943 [cn-pdf, pdf]
Title: Explicit vs. Implicit Biographies: Evaluating and Adapting LLM Information Extraction on Wikidata-Derived Texts
Title: 显式与隐式传记:在维基数据派生文本上评估和适应大型语言模型的信息抽取
Alessandra Stramiglio, Andrea Schimmenti, Valentina Pasqual, Marieke van Erp, Francesco Sovrano, Fabio Vitali
Subjects: Computation and Language (cs.CL)
[43] arXiv:2509.14936 [cn-pdf, pdf]
Title: A Comparative Analysis of Transformer Models in Social Bot Detection
Title: 一种Transformer模型在社交机器人检测中的比较分析
Rohan Veit, Michael Lones
Comments: To appear in proceedings of UKCI 2025
Subjects: Machine Learning (cs.LG)
[44] arXiv:2509.14935 [cn-pdf, pdf]
Title: CAD-Driven Co-Design for Flight-Ready Jet-Powered Humanoids
Title: 基于CAD的飞行级喷气式人形机器人协同设计
Punith Reddy Vanteddu, Davide Gorbani, Giuseppe L'Erario, Hosameldin Awadalla Omer Mohamed, Fabio Bergonti, Daniele Pucci
Subjects: Robotics (cs.RO)
[45] arXiv:2509.14934 [cn-pdf, pdf]
Title: Mitigating data replication in text-to-audio generative diffusion models through anti-memorization guidance
Title: 通过反记忆引导减轻文本到音频生成扩散模型中的数据复制
Francisco Messina, Francesca Ronchini, Luca Comanducci, Paolo Bestagini, Fabio Antonacci
Subjects: Audio and Speech Processing (eess.AS) ; Machine Learning (cs.LG) ; Sound (cs.SD) ; Signal Processing (eess.SP)
[46] arXiv:2509.14930 [cn-pdf, pdf]
Title: Cross-Modal Knowledge Distillation for Speech Large Language Models
Title: 跨模态知识蒸馏用于语音大语言模型
Enzhi Wang, Qicheng Li, Zhiyuan Tang, Yuhang Jia
Subjects: Computation and Language (cs.CL) ; Artificial Intelligence (cs.AI)
[47] arXiv:2509.14927 [cn-pdf, pdf]
Title: GenKOL: Modular Generative AI Framework For Scalable Virtual KOL Generation
Title: GenKOL:可扩展虚拟KOL生成的模块化生成人工智能框架
Tan-Hiep To, Duy-Khang Nguyen, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[48] arXiv:2509.14925 [cn-pdf, pdf]
Title: Self-Explaining Reinforcement Learning for Mobile Network Resource Allocation
Title: 基于自我解释强化学习的移动网络资源分配
Konrad Nowosadko, Franco Ruggeri, Ahmad Terra
Subjects: Machine Learning (cs.LG) ; Networking and Internet Architecture (cs.NI)
[49] arXiv:2509.14920 [cn-pdf, pdf]
Title: Cost-Performance Analysis: A Comparative Study of CPU-Based Serverless and GPU-Based Training Architectures
Title: 成本性能分析:基于CPU的无服务器和基于GPU的训练架构的比较研究
Amine Barrak, Fabio Petrillo, Fehmi Jaafar
Journal-ref: The 26th International Conference on Parallel and Distributed Computing, Applications and Technologies 2025
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC)
[50] arXiv:2509.14915 [cn-pdf, pdf]
Title: PERAL: Perception-Aware Motion Control for Passive LiDAR Excitation in Spherical Robots
Title: PERAL:球形机器人中被动LiDAR激励的感知感知运动控制
Shenghai Yuan, Jason Wai Hao Yee, Weixiang Guo, Zhongyuan Liu, Thien-Minh Nguyen, Lihua Xie
Subjects: Robotics (cs.RO)
[51] arXiv:2509.14912 [cn-pdf, pdf]
Title: Back to Ear: Perceptually Driven High Fidelity Music Reconstruction
Title: 回到耳朵:感知驱动的高保真音乐重建
Kangdi Wang, Zhiyue Wu, Dinghao Zhou, Rui Lin, Junyu Dai, Tao Jiang
Comments: Check the Code here: https://github.com/Eps-Acoustic-Revolution-Lab/EAR_VAE and Model Weights here: https://huggingface.co/earlab/EAR_VAE
Subjects: Sound (cs.SD) ; Artificial Intelligence (cs.AI)
[52] arXiv:2509.14907 [cn-pdf, pdf]
Title: Artificial Intelligence and Market Entrant Game Developers
Title: 人工智能与市场进入者游戏开发者
Seonbin Jo, Woo-Sung Jung, Jisung Yoon, Hyunuk Kim
Subjects: Computers and Society (cs.CY)
[53] arXiv:2509.14901 [cn-pdf, pdf]
Title: Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track
Title: 伪标签增强级联框架:LSVOS 2025 VOS赛道第二份技术报告
An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2509.14893 [cn-pdf, pdf]
Title: Temporally Heterogeneous Graph Contrastive Learning for Multimodal Acoustic event Classification
Title: 时间异构图对比学习用于多模态声音事件分类
Yuanjian Chen, Yang Xiao, Jinjie Huang
Subjects: Sound (cs.SD) ; Audio and Speech Processing (eess.AS)
[55] arXiv:2509.14886 [cn-pdf, pdf]
Title: A Multi-To-One Interview Paradigm for Efficient MLLM Evaluation
Title: 一种多对一访谈范式用于高效MLLM评估
Ye Shen, Junying Wang, Farong Wen, Yijin Guo, Qi Jia, Zicheng Zhang, Guangtao Zhai
Comments: 5 pages, 2 figures
Subjects: Computation and Language (cs.CL) ; Artificial Intelligence (cs.AI)
[56] arXiv:2509.14882 [cn-pdf, pdf]
Title: Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Title: Llama-Mimi:具有交错语义和声学标记的语音语言模型
Issa Sugiura, Shuhei Kurita, Yusuke Oda, Ryuichiro Higashinaka
Comments: 5 pages, 1 figure
Subjects: Computation and Language (cs.CL)
[57] arXiv:2509.14880 [cn-pdf, pdf]
Title: From Hype to Insight: Rethinking Large Language Model Integration in Visual Speech Recognition
Title: 从炒作到洞察:重新思考大型语言模型在视觉语音识别中的整合
Rishabh Jain, Naomi Harte
Comments: submitted to ICASSP 2026. This work has been submitted to the IEEE for possible publication
Subjects: Sound (cs.SD)
[58] arXiv:2509.14872 [cn-pdf, pdf]
Title: Temporal Representation Learning of Phenotype Trajectories for pCR Prediction in Breast Cancer
Title: 乳腺癌新辅助治疗后病理完全缓解预测的表型轨迹时间表示学习
Ivana Janíčková, Yen Y. Tan, Thomas H. Helbich, Konstantin Miloserdov, Zsuzsanna Bago-Horvath, Ulrike Heber, Georg Langs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[59] arXiv:2509.14868 [cn-pdf, pdf]
Title: DPANet: Dual Pyramid Attention Network for Multivariate Time Series Forecasting
Title: DPANet:多变量时间序列预测的双金字塔注意力网络
Qianyang Li, Xingjun Zhang, Shaoxun Wang, Jia Wei
Subjects: Machine Learning (cs.LG) ; Artificial Intelligence (cs.AI)
[60] arXiv:2509.14866 [cn-pdf, pdf]
Title: Controllable Localized Face Anonymization Via Diffusion Inpainting
Title: 通过扩散修复的可控局部人脸匿名化
Ali Salar, Qing Liu, Guoying Zhao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2509.14860 [cn-pdf, pdf]
Title: MARIC: Multi-Agent Reasoning for Image Classification
Title: MARIC:图像分类的多智能体推理
Wonduk Seo, Minhyeong Yu, Hyunjin An, Seunghyun Lee
Comments: Preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL) ; Multiagent Systems (cs.MA)
[62] arXiv:2509.14858 [cn-pdf, pdf]
Title: MeanFlowSE: one-step generative speech enhancement via conditional mean flow
Title: MeanFlowSE:通过条件均值流的一步生成语音增强
Duojia Li, Shenghui Lu, Hongchen Pan, Zongyi Zhan, Qingyang Hong, Lin Li
Subjects: Sound (cs.SD) ; Artificial Intelligence (cs.AI)
[63] arXiv:2509.14832 [cn-pdf, pdf]
Title: Diffusion-Based Scenario Tree Generation for Multivariate Time Series Prediction and Multistage Stochastic Optimization
Title: 基于扩散的多变量时间序列预测和多阶段随机优化的情景树生成
Stelios Zarifis, Ioannis Kordonis, Petros Maragos
Comments: 5 pages, 2 figures, 2 tables, and 1 algorithm. This version is submitted to the 51st IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2026), to be held in Barcelona, Spain, on May 4-8, 2026
Subjects: Machine Learning (cs.LG) ; Artificial Intelligence (cs.AI) ; Systems and Control (eess.SY)
[64] arXiv:2509.14827 [cn-pdf, pdf]
Title: Template-Based Cortical Surface Reconstruction with Minimal Energy Deformation
Title: 基于模板的皮层表面重建与最小能量变形
Patrick Madlindl, Fabian Bongratz, Christian Wachinger
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Artificial Intelligence (cs.AI) ; Machine Learning (cs.LG) ; Neurons and Cognition (q-bio.NC) ; Machine Learning (stat.ML)
[65] arXiv:2509.14824 [cn-pdf, pdf]
Title: Confirmation Bias as a Cognitive Resource in LLM-Supported Deliberation
Title: 确认偏误作为LLM支持的审议中的认知资源
Sander de Jong, Rune Møberg Jacobsen, Niels van Berkel
Subjects: Human-Computer Interaction (cs.HC)
[66] arXiv:2509.14806 [cn-pdf, pdf]
Title: SINAI at eRisk@CLEF 2022: Approaching Early Detection of Gambling and Eating Disorders with Natural Language Processing
Title: SINAI 在 eRisk@CLEF 2022 中:使用自然语言处理方法接近赌博和饮食障碍的早期检测
Alba Maria Marmol-Romero, Salud Maria Jimenez-Zafra, Flor Miriam Plaza-del-Arco, M. Dolores Molina-Gonzalez, Maria-Teresa Martin-Valdivia, Arturo Montejo-Raez
Comments: 11 pages, 1 figure, 4 tables. CLEF (Working Notes). 2022
Journal-ref: CEUR Workshop Proceedings 2022, vol. 3180, pp. 961-971
Subjects: Computation and Language (cs.CL)
[67] arXiv:2509.14804 [cn-pdf, pdf]
Title: Towards Building Speech Large Language Models for Multitask Understanding in Low-Resource Languages
Title: 面向低资源语言多任务理解的语音大语言模型构建
Mingchen Shao, Bingshen Mu, Chengyou Wang, Hai Li, Ying Yan, Zhonghua Fu, Lei Xie
Subjects: Sound (cs.SD)
[68] arXiv:2509.14803 [cn-pdf, pdf]
Title: OnlineMate: An LLM-Based Multi-Agent Companion System for Cognitive Support in Online Learning
Title: 在线伴侣:一种基于大语言模型的多智能体同伴系统,用于在线学习中的认知支持
Xian Gao, Zongyun Zhang, Ting Liu, Yuzhuo Fu
Subjects: Computers and Society (cs.CY) ; Artificial Intelligence (cs.AI)
[69] arXiv:2509.14797 [cn-pdf, pdf]
Title: SINAI at eRisk@CLEF 2023: Approaching Early Detection of Gambling with Natural Language Processing
Title: SINAI 在 eRisk@CLEF 2023 中:使用自然语言处理方法进行赌博的早期检测
Alba Maria Marmol-Romero, Flor Miriam Plaza-del-Arco, Arturo Montejo-Raez
Comments: 9 pages, 2 figures, 4 tables. CLEF (Working Notes). 2023
Journal-ref: CEUR Workshop Proceedings 2023, vol. 3497, pp. 743-751
Subjects: Computation and Language (cs.CL)
[70] arXiv:2509.14789 [cn-pdf, pdf]
Title: Acoustic Simulation Framework for Multi-channel Replay Speech Detection
Title: 多通道回放语音检测的声学仿真框架
Michael Neri, Tuomas Virtanen
Comments: Submitted to ICASSP 2026
Subjects: Audio and Speech Processing (eess.AS) ; Cryptography and Security (cs.CR) ; Sound (cs.SD) ; Signal Processing (eess.SP)
[71] arXiv:2509.14788 [cn-pdf, pdf]
Title: Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery
Title: 基于细粒度结合表示的结构感知对比学习用于药物发现
Jing Lan, Hexiao Ding, Hongzhao Chen, Yufeng Jiang, Nga-Chun Ng, Gwing Kei Yip, Gerald W. Y. Cheng, Yunlin Mao, Jing Cai, Liang-ting Lin, Jung Sun Yoo
Subjects: Machine Learning (cs.LG) ; Artificial Intelligence (cs.AI) ; Biomolecules (q-bio.BM)
[72] arXiv:2509.14778 [cn-pdf, pdf]
Title: OpenLens AI: Fully Autonomous Research Agent for Health Infomatics
Title: OpenLens AI:健康信息学的全自主研究代理
Yuxiao Cheng, Jinli Suo
Subjects: Artificial Intelligence (cs.AI) ; Multiagent Systems (cs.MA)
[73] arXiv:2509.14777 [cn-pdf, pdf]
Title: Dataset Distillation for Super-Resolution without Class Labels and Pre-trained Models
Title: 无类别标签和预训练模型的超分辨率数据集浓缩
Sunwoo Cho, Yejin Jung, Nam Ik Cho, Jae Woong Soh
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2509.14769 [cn-pdf, pdf]
Title: Frame Sampling Strategies Matter: A Benchmark for small vision language models
Title: 框架采样策略很重要:小视觉语言模型的基准
Marija Brkic, Anas Filali Razzouki, Yannis Tevissen, Khalil Guetari, Mounim A. El Yacoubi
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Computation and Language (cs.CL)
[75] arXiv:2509.14764 [cn-pdf, pdf]
Title: Efficient Solutions for Mitigating Initialization Bias in Unsupervised Self-Adaptive Auditory Attention Decoding
Title: 无监督自适应听觉注意力解码中减轻初始化偏差的高效解决方案
Yuanyuan Yao, Simon Geirnaert, Tinne Tuytelaars, Alexander Bertrand
Subjects: Signal Processing (eess.SP) ; Sound (cs.SD)
[76] arXiv:2509.14755 [cn-pdf, pdf]
Title: Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks
Title: 通过潜在扩散模型的数据增强用于检测历史艺术品中的与气味相关的物体
Ahmed Sheta, Mathias Zinnen, Aline Sindel, Andreas Maier, Vincent Christlein
Comments: Appeared at the 4th International Workshop on Fine Art Pattern Extraction and Recognition (FAPER 2025), in conjunction with ICIAP 2025; proceedings forthcoming in ICIAP 2025 Workshops (LNCS, Springer)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2509.14752 [cn-pdf, pdf]
Title: KAIO: A Collection of More Challenging Korean Questions
Title: KAIO:更多具有挑战性的韩语问题集
Nahyun Lee, Guijin Son, Hyunwoo Ko, Kyubeen Han
Comments: 4 pages paper
Subjects: Computation and Language (cs.CL)
[78] arXiv:2509.14746 [cn-pdf, pdf]
Title: Chain-of-Thought Re-ranking for Image Retrieval Tasks
Title: 思维链重排序用于图像检索任务
Shangrong Wu, Yanghong Zhou, Yang Chen, Feng Zhang, P. Y. Mok
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Information Retrieval (cs.IR)
[79] arXiv:2509.14744 [cn-pdf, pdf]
Title: On the Use of Agentic Coding Manifests: An Empirical Study of Claude Code
Title: 关于使用代理编码清单的实证研究:Claude代码
Worawalan Chatlatanagulchai, Kundjanasith Thonglek, Brittany Reid, Yutaro Kashiwa, Pattara Leelaprute, Arnon Rungsawang, Bundit Manaskasemsak, Hajimu Iida
Subjects: Software Engineering (cs.SE)
[80] arXiv:2509.14737 [cn-pdf, pdf]
Title: Pushing the Limits of End-to-End Diarization
Title: 推动端到端说话人分离的极限
Samuel J. Broughton, Lahiru Samarakoon
Comments: As presented at Interspeech 2025
Journal-ref: In Proc. Interspeech 2025 (pp. 5218-5222)
Subjects: Sound (cs.SD)
[81] arXiv:2509.14723 [cn-pdf, pdf]
Title: Transcoder-based Circuit Analysis for Interpretable Single-Cell Foundation Models
Title: 基于编解码器的电路分析用于可解释的单细胞基础模型
Sosuke Hosokawa, Toshiharu Kawakami, Satoshi Kodera, Masamichi Ito, Norihiko Takeda
Subjects: Machine Learning (cs.LG)
[82] arXiv:2509.14710 [cn-pdf, pdf]
Title: Mitigating the Impact of Location Uncertainty on Radio Map-Based Predictive Rate Selection via Noisy-Input Gaussian Process
Title: 通过噪声输入高斯过程减轻位置不确定性对基于无线电信号图的预测速率选择的影响
Koya Sato
Comments: 6 pages, 8 figures. Accepted for presentation at 2025 IEEE GLOBECOM Workshops: Workshop on Radio Maps for Communications and Sensing
Subjects: Signal Processing (eess.SP) ; Networking and Internet Architecture (cs.NI)
[83] arXiv:2509.14698 [cn-pdf, pdf]
Title: Wohlhart's Three-Loop Mechanism: An Overconstrained and Shaky Linkage
Title: 沃赫尔特的三环机构:一个过约束且不稳定的连杆机构
Andreas Mueller
Journal-ref: Lenarčič J., Siciliano B. (eds), Advances in Robot Kinematics 2020 (ARK 2020). Springer Proceedings in Advanced Robotics, Vol 15. Springer, Cham, pp. 125-132
Subjects: Robotics (cs.RO) ; Differential Geometry (math.DG) ; Group Theory (math.GR) ; Numerical Analysis (math.NA)
[84] arXiv:2509.14693 [cn-pdf, pdf]
Title: RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning
Title: 理性异常:通过思维链和强化学习进行合理性日志异常检测
Song Xu, Yilun Liu, Minggui He, Mingchen Dai, Ziang Chen, Chunguang Zhao, Jingzhou Du, Shimin Tao, Weibin Meng, Shenglin Zhang, Yongqian Sun, Boxing Chen, Daimeng Wei
Comments: 5 pages, 3 figures
Subjects: Artificial Intelligence (cs.AI)
[85] arXiv:2509.14689 [cn-pdf, pdf]
Title: HARNESS: Lightweight Distilled Arabic Speech Foundation Models
Title: HARNESS:轻量级蒸馏阿拉伯语语音基础模型
Vrunda N. Sukhadia, Shammur Absar Chowdhury
Comments: 5 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[86] arXiv:2509.14687 [cn-pdf, pdf]
Title: RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Title: RealMirror:一个全面的、开源的视觉-语言-行动平台用于具身人工智能
Cong Tai, Zhaoyu Zheng, Haixu Long, Hansheng Wu, Haodong Xiang, Zhengbin Long, Jun Xiong, Rong Shi, Shizhuang Zhang, Gang Qiu, He Wang, Ruifeng Li, Jun Huang, Bin Chang, Shuai Feng, Tao Shen
Subjects: Robotics (cs.RO)
[87] arXiv:2509.14684 [cn-pdf, pdf]
Title: DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis
Title: DAIEN-TTS:环境感知文本到语音合成的分离音频填充
Ye-Xin Lu, Yu Gu, Kun Wei, Hui-Peng Du, Yang Ai, Zhen-Hua Ling
Comments: Submitted to ICASSP 2026
Subjects: Audio and Speech Processing (eess.AS) ; Sound (cs.SD)
[88] arXiv:2509.14680 [cn-pdf, pdf]
Title: LEED: A Highly Efficient and Scalable LLM-Empowered Expert Demonstrations Framework for Multi-Agent Reinforcement Learning
Title: LEED:一种高效且可扩展的多智能体强化学习LLM增强专家演示框架
Tianyang Duan, Zongyuan Zhang, Songxiao Guo, Dong Huang, Yuanye Zhao, Zheng Lin, Zihan Fang, Dianxin Luan, Heming Cui, Yong Cui
Comments: 5 pages, 4 figures
Subjects: Multiagent Systems (cs.MA) ; Machine Learning (cs.LG)
[89] arXiv:2509.14675 [cn-pdf, pdf]
Title: How Does Instrumental Music Help SingFake Detection?
Title: 乐器音乐如何帮助声纹伪造检测?
Xuanjun Chen, Chia-Yu Hu, I-Ming Lin, Yi-Cheng Lin, I-Hsiang Chiu, You Zhang, Sung-Feng Huang, Yi-Hsuan Yang, Haibin Wu, Hung-yi Lee, Jyh-Shing Roger Jang
Comments: Work in progress
Subjects: Sound (cs.SD) ; Audio and Speech Processing (eess.AS) ; Signal Processing (eess.SP)
[90] arXiv:2509.14668 [cn-pdf, pdf]
Title: DeepAssert: An LLM-Aided Verification Framework with Fine-Grained Assertion Generation for Modules with Extracted Module Specifications
Title: DeepAssert:一种带有细粒度断言生成的LLM辅助验证框架,用于具有提取模块规范的模块
Yonghao Wang, Jiaxin Zhou, Hongqin Lyu, Zhiteng Chao, Tiancheng Wang, Huawei Li
Comments: 7 pages, 8 figures
Subjects: Hardware Architecture (cs.AR)
[91] arXiv:2509.14666 [cn-pdf, pdf]
Title: Spatial Audio Motion Understanding and Reasoning
Title: 空间音频运动理解与推理
Arvind Krishna Sridhar, Yinyi Guo, Erik Visser
Comments: 5 pages, 2 figures, 3 tables
Subjects: Sound (cs.SD) ; Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL)
[92] arXiv:2509.14659 [cn-pdf, pdf]
Title: Aligning Audio Captions with Human Preferences
Title: 对齐音频字幕与人类偏好
Kartik Hegde, Rehana Mahfuz, Yinyi Guo, Erik Visser
Comments: Submitted to ICASSP 2026
Subjects: Audio and Speech Processing (eess.AS) ; Machine Learning (cs.LG) ; Sound (cs.SD)
[93] arXiv:2509.14653 [cn-pdf, pdf]
Title: UMA-Split: unimodal aggregation for both English and Mandarin non-autoregressive speech recognition
Title: UMA-Split:用于英语和普通话非自回归语音识别的单模态聚合
Ying Fang, Xiaofei Li
Comments: Submitted to ICASSP 2026
Subjects: Computation and Language (cs.CL)
[94] arXiv:2509.14647 [cn-pdf, pdf]
Title: AgentCompass: Towards Reliable Evaluation of Agentic Workflows in Production
Title: AgentCompass:面向生产环境中代理工作流的可靠评估
NVJK Kartik, Garvit Sapra, Rishav Hada, Nikhil Pareek
Subjects: Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL)
[95] arXiv:2509.14643 [cn-pdf, pdf]
Title: Chameleon: A Surface-Anchored Smartphone AR Prototype with Visually Blended Mobile Display
Title: 变色龙:一种具有视觉融合移动显示的表面固定智能手机AR原型
Seungwon Yang, Suwon Yoon, Jeongwon Choi, Inseok Hwang
Comments: 3 pages, 1 figure, ACM UIST 2025 Poster
Subjects: Human-Computer Interaction (cs.HC)
[96] arXiv:2509.14640 [cn-pdf, pdf]
Title: DyWPE: Signal-Aware Dynamic Wavelet Positional Encoding for Time Series Transformers
Title: DyWPE:时间序列Transformer的信号感知动态小波位置编码
Habib Irani, Vangelis Metsis
Subjects: Machine Learning (cs.LG)
[97] arXiv:2509.14632 [cn-pdf, pdf]
Title: Mitigating Intra-Speaker Variability in Diarization with Style-Controllable Speech Augmentation
Title: 通过风格可控制的语音增强减轻说话人内部变化的说话人分割方法
Miseul Kim, Soo Jin Park, Kyungguen Byun, Hyeon-Kyeong Shin, Sunkuk Moon, Shuhua Zhang, Erik Visser
Comments: Submitted to ICASSP 2026
Subjects: Audio and Speech Processing (eess.AS) ; Artificial Intelligence (cs.AI) ; Signal Processing (eess.SP)
[98] arXiv:2509.14627 [cn-pdf, pdf]
Title: Towards Human-like Multimodal Conversational Agent by Generating Engaging Speech
Title: 通过生成吸引人的语音来实现类人多模态对话代理
Taesoo Kim, Yongsik Jo, Hyunmin Song, Taehwan Kim
Comments: Published in Interspeech 2025
Subjects: Human-Computer Interaction (cs.HC) ; Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL)
[99] arXiv:2509.14619 [cn-pdf, pdf]
Title: LSTC-MDA: A Unified Framework for Long-Short Term Temporal Convolution and Mixed Data Augmentation in Skeleton-Based Action Recognition
Title: LSTC-MDA:基于骨架动作识别的长期短期时间卷积和混合数据增强的统一框架
Feng Ding, Haisheng Fu, Soroush Oraki, Jie Liang
Comments: Submitted to ICASSP
Subjects: Computer Vision and Pattern Recognition (cs.CV) ; Artificial Intelligence (cs.AI)
[100] arXiv:2509.14609 [cn-pdf, pdf]
Title: HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation
Title: 混合Mamba:用于3D医学图像分割的双域Mamba
Weitong Wu, Zhaohu Xing, Jing Gong, Qin Peng, Lei Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV)