CenXiv.org

Databases

Authors and titles for June 2025

Total of 111 entries : 1-50 51-100 101-111
[1] arXiv:2506.00812 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: VecFlow: A High-Performance Vector Data Management System for Filtered-Search on GPUs
Jingyi Xi, Chenghao Mo, Benjamin Karsin, Artem Chirkin, Mingqin Li, Minjia Zhang
Subjects: Databases (cs.DB)
[2] arXiv:2506.01173 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: SIFBench: An Extensive Benchmark for Fatigue Analysis
Tushar Gautam, Robert M. Kirby, Jacob Hochhalter, Shandian Zhe
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[3] arXiv:2506.01232 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Retrieval-Augmented Generation of Ontologies from Relational Databases
Mojtaba Nayyeri, Athish A Yogi, Nadeen Fathallah, Ratan Bahadur Thapa, Hans-Michael Tautenhahn, Anton Schnurpel, Steffen Staab
Comments: Under review
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI)
[4] arXiv:2506.01576 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: All You Need Is Binary Search! A Practical View on Lightweight Database Indexing on GPUs
Justus Henneberg, Felix Schuhknecht
Subjects: Databases (cs.DB)
[5] arXiv:2506.02345 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: PandasBench: A Benchmark for the Pandas API
Alex Broihier, Stefanos Baziotis, Daniel Kang, Charith Mendis
Subjects: Databases (cs.DB) ; Software Engineering (cs.SE)
[6] arXiv:2506.02509 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration
Jiajie Fu, Haitong Tang, Arijit Khan, Sharad Mehrotra, Xiangyu Ke, Yunjun Gao
Comments: Accepted by SIGMOD 2026
Subjects: Databases (cs.DB)
[7] arXiv:2506.02802 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: A Learned Cost Model-based Cross-engine Optimizer for SQL Workloads
András Strausz, Niels Pardon, Ioana Giurgiu
Comments: 6 pages
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[8] arXiv:2506.03826 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: SigSPARQL: Signals as a First-Class Citizen When Querying Knowledge Graphs
Tobias Schwarzinger, Gernot Steindl, Thomas Frühwirth, Thomas Preindl, Konrad Diwold, Katrin Ehrenmüller, Fajar J. Ekaputra
Subjects: Databases (cs.DB)
[9] arXiv:2506.04006 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: TransClean: Finding False Positives in Multi-Source Entity Matching under Real-World Conditions via Transitive Consistency
Fernando de Meer Pardo, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI) ; Machine Learning (cs.LG)
[10] arXiv:2506.04230 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: Computationally Intensive Research: Advancing a Role for Secondary Analysis of Qualitative Data
Kaveh Mohajeri, Amir Karami
Comments: 20 Pages
Journal-ref: Journal of the Association for Information Systems (2025)
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI) ; Digital Libraries (cs.DL)
[11] arXiv:2506.04286 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: OxO2 -- A SSSOM mapping browser for logically sound crosswalks
Henriette Harmse, Haider Iqbal, Helen Parkinson, James McLaughlin
Comments: 12 pages, 2 figures and 2 tables. Also submitted to FOIS Demonstration track and awaiting feedback
Subjects: Databases (cs.DB)
[12] arXiv:2506.04678 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: BVLSM: Write-Efficient LSM-Tree Storage via WAL-Time Key-Value Separation
Ming Li, Wendi Cheng, Jiahe Wei, Xueqiang Shan, Weikai Liu, Xiaonan Zhao, Xiao Zhang
Subjects: Databases (cs.DB)
[13] arXiv:2506.05071 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Memory Hierarchy Design for Caching Middleware in the Age of NVM
Shahram Ghandeharizadeh, Sandy Irani, Jenny Lam
Comments: A shorter version appeared in the IEEE 34th International Conference on Data Engineering (ICDE), Paris, France, 2018, pp. 1380-1383, doi: 10.1109/ICDE.2018.00155
Subjects: Databases (cs.DB) ; Hardware Architecture (cs.AR) ; Data Structures and Algorithms (cs.DS)
[14] arXiv:2506.05853 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Training-Free Query Optimization via LLM-Based Plan Similarity
Nikita Vasilenko, Alexander Demin, Vladimir Boorlakov
Comments: 18 pages, 5 figures
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[15] arXiv:2506.06147 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Stream DaQ: Stream-First Data Quality Monitoring
Vasileios Papastergios, Anastasios Gounaris
Subjects: Databases (cs.DB)
[16] arXiv:2506.06541 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes
Eugenie Lai, Gerardo Vitagliano, Ziyu Zhang, Sivaprasad Sudhir, Om Chabra, Anna Zeng, Anton A. Zabreyko, Chenning Li, Ferdi Kossmann, Jialin Ding, Jun Chen, Markos Markakis, Matthew Russo, Weiyang Wang, Ziniu Wu, Michael J. Cafarella, Lei Cao, Samuel Madden, Tim Kraska
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI) ; Multiagent Systems (cs.MA)
[17] arXiv:2506.07675 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: QUITE: A Query Rewrite System Beyond Rules with LLM Agents
Yuyang Song, Hanxu Yan, Jiale Lao, Yibo Wang, Yufei Li, Yuanchun Zhou, Jianguo Wang, Mingjie Tang
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI)
[18] arXiv:2506.08249 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: RADAR: Benchmarking Language Models on Imperfect Tabular Data
Ken Gu, Zhihan Zhang, Kate Lin, Yuwei Zhang, Akshay Paruchuri, Hong Yu, Mehran Kazemi, Kumar Ayush, A. Ali Heydari, Maxwell A. Xu, Girish Narayanswamy, Yun Liu, Ming-Zher Poh, Yuzhe Yang, Mark Malhotra, Shwetak Patel, Hamid Palangi, Xuhai Xu, Daniel McDuff, Tim Althoff, Xin Liu
Subjects: Databases (cs.DB) ; Computation and Language (cs.CL)
[19] arXiv:2506.08276 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: LEANN: A Low-Storage Vector Index
Yichuan Wang, Shu Liu, Zhifei Li, Yongji Wu, Ziming Mao, Yilong Zhao, Xiao Yan, Zhiying Xu, Yang Zhou, Ion Stoica, Sewon Min, Matei Zaharia, Joseph E. Gonzalez
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[20] arXiv:2506.08671 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Evaluating Learned Indexes in LSM-tree Systems: Benchmarks, Insights and Design Choices
Junfeng Liu, Jiarui Ye, Mengshi Chen, Meng Li, Siqiang Luo
Comments: 14 pages, 12 figures
Subjects: Databases (cs.DB) ; Data Structures and Algorithms (cs.DS)
[21] arXiv:2506.09226 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Terabyte-Scale Analytics in the Blink of an Eye
Bowen Wu, Wei Cui, Carlo Curino, Matteo Interlandi, Rathijit Sen
Subjects: Databases (cs.DB) ; Distributed, Parallel, and Cluster Computing (cs.DC) ; Performance (cs.PF)
[22] arXiv:2506.09467 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: ArcNeural: A Multi-Modal Database for the Gen-AI Era
Wu Min, Qiao Yuncong, Yu Tan, Chenghu Yang
Subjects: Databases (cs.DB)
[23] arXiv:2506.10092 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: GPU Acceleration of SQL Analytics on Compressed Data
Zezhou Huang, Krystian Sakowski, Hans Lehnert, Wei Cui, Carlo Curino, Matteo Interlandi, Marius Dumitru, Rathijit Sen
Subjects: Databases (cs.DB)
[24] arXiv:2506.10238 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: A Unifying Algorithm for Hierarchical Queries
Mahmoud Abo Khamis, Jesse Comer, Phokion Kolaitis, Sudeepa Roy, Val Tannen
Subjects: Databases (cs.DB)
[25] arXiv:2506.10422 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: A Hybrid Heuristic Framework for Resource-Efficient Querying of Scientific Experiments Data
Mayank Patel, Minal Bhise
Subjects: Databases (cs.DB) ; Distributed, Parallel, and Cluster Computing (cs.DC) ; Emerging Technologies (cs.ET) ; Performance (cs.PF)
[26] arXiv:2506.10886 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: S3Mirror: Making Genomic Data Transfers Fast, Reliable, and Observable with DBOS
Steven Vasquez-Grinnell, Alex Poliakov
Subjects: Databases (cs.DB) ; Genomics (q-bio.GN)
[27] arXiv:2506.11298 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Jelly: a Fast and Convenient RDF Serialization Format
Piotr Sowinski, Karolina Bogacka, Anastasiya Danilenka, Nikita Kozlov
Comments: Developers Workshop, co-located with SEMANTiCS'25: International Conference on Semantic Systems, September 3-5, 2025, Vienna, Austria
Subjects: Databases (cs.DB) ; Networking and Internet Architecture (cs.NI)
[28] arXiv:2506.11541 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: OCPQ: Object-Centric Process Querying & Constraints
Aaron Küsters, Wil M.P. van der Aalst
Subjects: Databases (cs.DB)
[29] arXiv:2506.11870 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection
Ce Lyu, Minghao Zhao, Yanhao Wang, Liang Jie
Comments: 5 pages
Subjects: Databases (cs.DB)
[30] arXiv:2506.12234 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Datrics Text2SQL: A Framework for Natural Language to SQL Query Generation
Tetiana Gladkykh, Kyrylo Kirykov
Comments: 28 pages, 6 figures, initial whitepaper version 1.0, submitted March 2025
Subjects: Databases (cs.DB) ; Artificial Intelligence (cs.AI) ; Computation and Language (cs.CL)
[31] arXiv:2506.12238 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: CPN-Py: A Python-Based Tool for Modeling and Analyzing Colored Petri Nets
Alessandro Berti, Wil M.P. van der Aalst
Subjects: Databases (cs.DB)
[32] arXiv:2506.12488 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Redbench: A Benchmark Reflecting Real Workloads
Skander Krid, Mihail Stoian, Andreas Kipf
Comments: Eighth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management (aiDM 2025)
Subjects: Databases (cs.DB)
[33] arXiv:2506.12837 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Towards Visualizing Electronic Medical Records via Natural Language Queries
Haodi Zhang, Siqi Ning, Qiyong Zheng, Jinyin Nie, Liangjie Zhang, Weicheng Wang, Yuanfeng Song
Subjects: Databases (cs.DB)
[34] arXiv:2506.12990 (cross-list from cs.DB) [cn-pdf, pdf, other]
Title: Humans, Machine Learning, and Language Models in Union: A Cognitive Study on Table Unionability
Sreeram Marimuthu, Nina Klimenkova, Roee Shraga
Comments: 6 Pages, 4 figures, ACM SIGMOD HILDA '25 (Status-Accepted)
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[35] arXiv:2506.13144 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: EnhanceGraph: A Continuously Enhanced Graph-based Index for High-dimensional Approximate Nearest Neighbor Search
Xiaoyao Zhong, Jiabao Jin, Peng Cheng, Mingyu Yang, Haoyang Li, Zhitao Shen, Heng Tao Shen, Jingkuan Song
Subjects: Databases (cs.DB)
[36] arXiv:2506.13670 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Parachute: Single-Pass Bi-Directional Information Passing
Mihail Stoian, Andreas Zimmerer, Skander Krid, Amadou Latyr Ngom, Jialin Ding, Tim Kraska, Andreas Kipf
Comments: To appear at VLDB 2025
Subjects: Databases (cs.DB)
[37] arXiv:2506.13785 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: LLM-Driven Data Generation and a Novel Soft Metric for Evaluating Text-to-SQL in Aviation MRO
Patrick Sutanto, Jonathan Kenrick, Max Lorenz, Joan Santoso
Subjects: Databases (cs.DB) ; Information Retrieval (cs.IR)
[38] arXiv:2506.14034 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Sketched Sum-Product Networks for Joins
Brian Tsan, Abylay Amanbayev, Asoke Datta, Florin Rusu
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[39] arXiv:2506.14707 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: HARMONY: A Scalable Distributed Vector Database for High-Throughput Approximate Nearest Neighbor Search
Qian Xu, Feng Zhang, Chengxi Li, Lei Cao, Zheng Chen, Jidong Zhai, Xiaoyong Du
Subjects: Databases (cs.DB)
[40] arXiv:2506.14772 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: SimBank: from Simulation to Solution in Prescriptive Process Monitoring
Jakob De Moor, Hans Weytjens, Johannes De Smedt, Jochen De Weerdt
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[41] arXiv:2506.15831 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Adaptive Anomaly Detection in the Presence of Concept Drift: Extended Report
Jongjun Park, Fei Chiang, Mostafa Milani
Comments: Extended version (to be updated)
Subjects: Databases (cs.DB)
[42] arXiv:2506.15848 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Delta: A Learned Mixed Cost-based Query Optimization Framework
Jiazhen Peng, Zheng Qu, Xiaoye Miao, Rong Zhu
Subjects: Databases (cs.DB)
[43] arXiv:2506.15986 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Empowering Graph-based Approximate Nearest Neighbor Search with Adaptive Awareness Capabilities
Jiancheng Ruan, Tingyang Chen, Renchi Yang, Xiangyu Ke, Yunjun Gao
Comments: Accepted by KDD 2025
Subjects: Databases (cs.DB) ; Information Retrieval (cs.IR)
[44] arXiv:2506.15987 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Filter-Centric Vector Indexing: Geometric Transformation for Efficient Filtered Vector Search
Alireza Heidari, Wei Zhang
Comments: 9 pages
Subjects: Databases (cs.DB) ; Metric Geometry (math.MG)
[45] arXiv:2506.16007 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Data-Agnostic Cardinality Learning from Imperfect Workloads
Peizhi Wu, Rong Kang, Tieying Zhang, Jianjun Chen, Ryan Marcus, Zachary G. Ives
Comments: 14 pages. Technical Report (Extended Version)
Subjects: Databases (cs.DB) ; Machine Learning (cs.LG)
[46] arXiv:2506.16379 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
Yan Zhou, Chunwei Liu, Bhuvan Urgaonkar, Zhengle Wang, Magnus Mueller, Chao Zhang, Songyue Zhang, Pascal Pfeil, Dominik Horn, Zhengchun Liu, Davide Pagano, Tim Kraska, Samuel Madden, Ju Fan
Subjects: Databases (cs.DB)
[47] arXiv:2506.16616 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: LDI: Localized Data Imputation
Soroush Omidvartehrani, Davood Rafiei
Subjects: Databases (cs.DB)
[48] arXiv:2506.16923 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: Advancing Fact Attribution for Query Answering: Aggregate Queries and Novel Algorithms
Omer Abramovich, Daniel Deutch, Nave Frost, Ahmet Kara, Dan Olteanu
Subjects: Databases (cs.DB)
[49] arXiv:2506.16976 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: PUL: Pre-load in Software for Caches Wouldn't Always Play Along
Arthur Bernhardt, Sajjad Tamimi, Florian Stock, Andreas Koch, Ilia Petrov
Subjects: Databases (cs.DB)
[50] arXiv:2506.17226 (cross-list from cs.DB) [cn-pdf, pdf, html, other]
Title: DCMF: A Dynamic Context Monitoring and Caching Framework for Context Management Platforms
Ashish Manchanda, Prem Prakash Jayaraman, Abhik Banerjee, Kaneez Fizza, Arkady Zaslavsky
Subjects: Databases (cs.DB)