Hardware

Applied Filters

People

Publications

Reproducibility Badges

Publication Date

Searched The ACM Guide to Computing Literature (3,842,466 records)|Limit your search to The ACM Full-Text Collection (774,529 records)

Showing 1 - 20of52 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
August 2024
LeanStore: A High-Performance Storage Engine for NVMe SSDs
- Viktor Leis
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4536–4545https://doi.org/10.14778/3685800.3685915

Neither traditional disk-based database systems nor modern inmemory database systems are capable of fully exploiting modern servers with multiple NVMe SSDs. LeanStore is a high-performance OLTP storage engine specifically optimized for NVMe SSDs and ...
0
134
Metrics
Total Citations0
Total Downloads134
Last 12 Months134
Last 6 weeks64
Get Access
research-article
August 2024
Vector Databases: What's Really New and What's Next? (VLDB 2024 Panel)
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4505–4506https://doi.org/10.14778/3685800.3685911

Vector databases have recently emerged as a hot topic in the field of databases, especially in industry. This is due to the widespread interest in Large Language Models (LLMs), where vector databases provide the relevant context for LLMs to produce more ...
0
129
Metrics
Total Citations0
Total Downloads129
Last 12 Months129
Last 6 weeks65
Get Access
research-article
August 2024
X-Stor: A Cloud-Native NoSQL Database Service with Multi-Model Support
- Hongyu Lei,
- Chunhua Li,
- Ke Zhou,
- Jianping Zhu,
- Kezhou Yan,
- Fen Xiao,
- Ming Xie,
- Jiang Wang,
- Shiyu Di
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4025–4037https://doi.org/10.14778/3685800.3685824

In recent years at Tencent, we have observed that the use of multiple NoSQL databases for storing business data with diverse models has led to increased programming and deployment costs, as well as inefficient maintenance and underutilized resources. In ...
0
31
Metrics
Total Citations0
Total Downloads31
Last 12 Months31
Last 6 weeks17
Get Access
research-article
August 2024
TDSQL: Tencent Distributed Database System
- Yuxing Chen,
- Anqun Pan,
- Hailin Lei,
- Anda Ye,
- Shuo Han,
- Yan Tang,
- Wei Lu,
- Yunpeng Chai,
- Feng Zhang,
- Xiaoyong Du
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3869–3882https://doi.org/10.14778/3685800.3685812

Distributed databases have become indispensable in contemporary computing and data processing, owing to their pivotal role in ensuring high availability and scalability. They effectively cater to the requirements of data management and high-concurrency ...
0
85
Metrics
Total Citations0
Total Downloads85
Last 12 Months85
Last 6 weeks40
Get Access
research-article
August 2024
Db2une: Tuning Under Pressure via Deep Learning
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3855–3868https://doi.org/10.14778/3685800.3685811

Modern database systems including IBM Db2 have numerous parameters, "knobs," that require precise configuration to achieve optimal workload performance. Even for experts, manually "tuning" these knobs is a challenging process. We present Db2une, an ...
0
44
Metrics
Total Citations0
Total Downloads44
Last 12 Months44
Last 6 weeks22
Get Access
research-article
August 2024
An Examination of CXL Memory Use Cases for In-Memory Database Management Systems Using SAP HANA
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3827–3840https://doi.org/10.14778/3685800.3685809

CXL-based disaggregated memory systems offer options to expand the memory beyond the limits of a single server via cache-coherent memory expansion cards or memory pools. Especially, In-Memory Database Management Systems (IMDBMSs) can benefit from ...
1
126
Metrics
Total Citations1
Total Downloads126
Last 12 Months126
Last 6 weeks51
Get Access
research-article
August 2024
Artifacts Available / v1.1
Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance
- Yixin Wu,
- Xiuqi Huang,
- Zhongjia Wei,
- Hang Cheng,
- Chaohui Xin,
- Zuzhi Chen,
- Binbin Chen,
- Yufei Wu,
- Hao Wang,
- Tieying Zhang,
- Rui Shi,
- Xiaofeng Gao,
- Yuming Liang,
- Pengwei Zhao,
- Guihai Chen
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3759–3771https://doi.org/10.14778/3685800.3685804

At ByteDance, where we execute over a million Spark jobs and handle 500PB of shuffled data daily, ensuring resource efficiency is paramount for cost savings. However, achieving optimization of resource efficiency in large-scale production environments ...
0
72
Metrics
Total Citations0
Total Downloads72
Last 12 Months72
Last 6 weeks26
Get Access
research-article
July 2024
Artifacts Available / v1.1
OLAP on Modern Chiplet-Based Processors
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3428–3441https://doi.org/10.14778/3681954.3682011

Chiplet-based CPUs, which combine multiple independent dies on a single package, allow hardware to scale to higher CPU core counts at the cost of more memory heterogeneity and performance variability. This introduces challenges when existing query ...
0
98
Metrics
Total Citations0
Total Downloads98
Last 12 Months98
Last 6 weeks22
Get Access
research-article
July 2024
Artifacts Available / v1.1
The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3373–3387https://doi.org/10.14778/3681954.3682007

Existing machine learning (ML) approaches to automatically optimize database management systems (DBMSs) only target a single configuration space at a time (e.g., knobs, query hints, indexes). Simultaneously tuning multiple configuration spaces is ...
0
110
Metrics
Total Citations0
Total Downloads110
Last 12 Months110
Last 6 weeks21
Get Access
research-article
July 2024
nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3283–3289https://doi.org/10.14778/3681954.3682000

In this paper, we propose nsDB, a novel neuro-symbolic database system that integrates neural and symbolic system architectures natively to address the weaknesses of each, providing a strong database capable of data managing, model learning, and complex ...
0
137
Metrics
Total Citations0
Total Downloads137
Last 12 Months137
Last 6 weeks9
Get Access
research-article
July 2024
Artifacts Available / v1.1
Agile-Ant: Self-Managing Distributed Cache Management for Cost Optimization of Big Data Applications
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 11Pages 3151–3164https://doi.org/10.14778/3681954.3681990

Distributed in-memory processing frameworks accelerate application runs by caching important datasets in memory. Allocating a suitable cluster configuration for caching these datasets plays a crucial role in achieving minimal cost. We present Agile-ant, ...
0
72
Metrics
Total Citations0
Total Downloads72
Last 12 Months72
Last 6 weeks14
Get Access
research-article
December 2023
Artifacts Available / v1.1
BonsaiKV: Towards Fast, Scalable, and Persistent Key-Value Stores with Tiered, Heterogeneous Memory System
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 4Pages 726–739https://doi.org/10.14778/3636218.3636228

Emerging NUMA/CXL-based tiered memory systems with heterogeneous memory devices such as DRAM and NVMM deliver ultrafast speed, large capacity, and data persistence all at once, offering great promise to high-performance in-memory key-value stores. To ...
1
336
Metrics
Total Citations1
Total Downloads336
Last 12 Months336
Last 6 weeks33
Get Access
research-article
November 2023
Artifacts Available / v1.1
GPU Database Systems Characterization and Optimization
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 3Pages 441–454https://doi.org/10.14778/3632093.3632107

GPUs offer massive parallelism and high-bandwidth memory access, making them an attractive option for accelerating data analytics in database systems. However, while modern GPUs possess more resources than ever before (e.g., higher DRAM bandwidth), ...
3
681
Metrics
Total Citations3
Total Downloads681
Last 12 Months681
Last 6 weeks51
Get Access
research-article
November 2023
Artifacts Available / v1.1
SmartLite: A DBMS-Based Serving System for DNN Inference in Resource-Constrained Environments
- Qiuru Lin,
- Sai Wu,
- Junbo Zhao,
- Jian Dai,
- Meng Shi,
- Gang Chen,
- Feifei Li
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 3Pages 278–291https://doi.org/10.14778/3632093.3632095

Many IoT applications require the use of multiple deep neural networks (DNNs) to perform various tasks on low-cost edge devices with limited computation resources. However, existing DNN model serving platforms, such as TensorFlow Serving and TorchServe, ...
2
85
Metrics
Total Citations2
Total Downloads85
Last 12 Months85
Last 6 weeks7
Get Access
research-article
September 2023
Artifacts Available / v1.1
Catalyst: Optimizing Cache Management for Large In-memory Key-value Systems
- Kefei Wang,
- Feng Chen
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 13Pages 4339–4352https://doi.org/10.14778/3625054.3625068

In-memory key-value cache systems, such as Memcached and Redis, are essential in today's data centers. A key mission of such cache systems is to identify the most valuable data for caching. To achieve this, the current system design keeps track of each ...
1
182
Metrics
Total Citations1
Total Downloads182
Last 12 Months151
Last 6 weeks15
Get Access
research-article
September 2023
Artifacts Available / v1.1
AMNES: Accelerating the Computation of Data Correlation Using FPGAs
Proceedings of the VLDB Endowment (PVLDB), Volume 16, Issue 13Pages 4174–7187https://doi.org/10.14778/3625054.3625056

A widely used approach to characterize input data in both databases and ML is computing the correlation between attributes. The operation is supported by all major database engines and ML platforms. However, it is an expensive operation as the number of ...
1
48
Metrics
Total Citations1
Total Downloads48
Last 12 Months40
Last 6 weeks5
Get Access
research-article
July 2021
The art of balance: a RateupDB™ experience of building a CPU/GPU hybrid database product
Proceedings of the VLDB Endowment (PVLDB), Volume 14, Issue 12Pages 2999–3013https://doi.org/10.14778/3476311.3476378

GPU-accelerated database systems have been studied for more than 10 years, ranging from prototyping development to industry products serving in multiple domains of data applications. Existing GPU database research solutions are often focused on specific ...
24
398
Metrics
Total Citations24
Total Downloads398
Last 12 Months137
Last 6 weeks5
Get Access
research-article
July 2021
The end of Moore's law and the rise of the data processor
Proceedings of the VLDB Endowment (PVLDB), Volume 14, Issue 12Pages 2932–2944https://doi.org/10.14778/3476311.3476373

With the end of Moore's Law, database architects are turning to hardware accelerators to offload computationally intensive tasks from the CPU. In this paper, we show that accelerators can facilitate far more than just computation: they enable algorithms ...
0
423
Metrics
Total Citations0
Total Downloads423
Last 12 Months115
Last 6 weeks11
Get Access
research-article
July 2021
Robust voice querying with MUVE: optimally visualizing results of phonetically similar queries
Proceedings of the VLDB Endowment (PVLDB), Volume 14, Issue 11Pages 2397–2409https://doi.org/10.14778/3476249.3476289

Recently proposed voice query interfaces translate voice input into SQL queries. Unreliable speech recognition on top of the intrinsic challenges of text-to-SQL translation makes it hard to reliably interpret user input. We present MUVE (Multiplots for ...
0
78
Metrics
Total Citations0
Total Downloads78
Last 12 Months15
Last 6 weeks1
Get Access
research-article
July 2021
SKT: a one-pass multi-sketch data analytics accelerator
Proceedings of the VLDB Endowment (PVLDB), Volume 14, Issue 11Pages 2369–2382https://doi.org/10.14778/3476249.3476287

Data analysts often need to characterize a data stream as a first step to its further processing. Some of the initial insights to be gained include, e.g., the cardinality of the data set and its frequency distribution. Such information is typically ...
3
42
Metrics
Total Citations3
Total Downloads42
Last 12 Months18
Last 6 weeks1
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

All Publications

Content Type

Media Formats

Publisher

Reproducibility Badges

Publication Date

Results

LeanStore: A High-Performance Storage Engine for NVMe SSDs

Vector Databases: What's Really New and What's Next? (VLDB 2024 Panel)

X-Stor: A Cloud-Native NoSQL Database Service with Multi-Model Support

TDSQL: Tencent Distributed Database System

Db2une: Tuning Under Pressure via Deep Learning

An Examination of CXL Memory Use Cases for In-Memory Database Management Systems Using SAP HANA

Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance

OLAP on Modern Chiplet-Based Processors

The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions

nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems

Agile-Ant: Self-Managing Distributed Cache Management for Cost Optimization of Big Data Applications

BonsaiKV: Towards Fast, Scalable, and Persistent Key-Value Stores with Tiered, Heterogeneous Memory System

GPU Database Systems Characterization and Optimization

SmartLite: A DBMS-Based Serving System for DNN Inference in Resource-Constrained Environments

Catalyst: Optimizing Cache Management for Large In-memory Key-value Systems

AMNES: Accelerating the Computation of Data Correlation Using FPGAs

The art of balance: a RateupDB™ experience of building a CPU/GPU hybrid database product

The end of Moore's law and the rise of the data processor

Robust voice querying with MUVE: optimally visualizing results of phonetically similar queries

SKT: a one-pass multi-sketch data analytics accelerator