Database query processing and optimization (theory)

Applied Filters

People

Publications

Reproducibility Badges

Publication Date

Searched The ACM Guide to Computing Literature (3,842,883 records)|Limit your search to The ACM Full-Text Collection (774,939 records)

Showing 1 - 20of238 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
August 2024
Databases Unbound: Querying All of the World's Bytes with AI
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4546–4554https://doi.org/10.14778/3685800.3685916

Over the past five decades, the relational database model has proven to be a scaleable and adaptable model for querying a variety of structured data, with use cases in analytics, transactions, graphs, streaming and more. However, most of the world's data ...
0
218
Metrics
Total Citations0
Total Downloads218
Last 12 Months218
Last 6 weeks86
Get Access
research-article
August 2024
Artifacts Available / v1.1
PrismX: A Single-Machine System for Querying Big Graphs
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4485–4488https://doi.org/10.14778/3685800.3685906

We demonstrate PrismX (PRAM with SSDs as Memory eXtension), a single-machine system for graph analytics. PrismX allows users to make practical use of existing PRAM algorithms without any change. To cope with the limited DRAM capacity, it employs NVMe ...
0
8
Metrics
Total Citations0
Total Downloads8
Last 12 Months8
Last 6 weeks4
Get Access
research-article
August 2024
Pyneapple-G: Scalable Spatial Grouping Queries
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4469–4472https://doi.org/10.14778/3685800.3685902

This paper demonstrates Pynapple-G, an open-source library for scalable spatial grouping queries based on Apache Sedona (formerly known as GeoSpark). We demonstrate two modules, namely, SGPAC and DDCEL, that support grouping points, grouping lines, and ...
1
9
Metrics
Total Citations1
Total Downloads9
Last 12 Months9
Last 6 weeks3
Get Access
research-article
August 2024
Artifacts Available / v1.1
Catcher: A Cache Analysis System for Top-k Pub/Sub Service
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4389–4392https://doi.org/10.14778/3685800.3685882

Top-k Publish/Subscribe (TkPS) service is widely studied in spatial database, with various cache-based methods proposed to address its efficiency challenge in top-k result maintenance. These methods require in-depth exploration of relationships between ...
0
14
Metrics
Total Citations0
Total Downloads14
Last 12 Months14
Last 6 weeks9
Get Access
research-article
August 2024
Artifacts Available / v1.1
SEER: An End-to-End Toolkit for Benchmarking Time Series Database Systems in Monitoring Applications
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4361–4364https://doi.org/10.14778/3685800.3685875

Time series database systems (TSDBs) are prevalent in many applications ranging from monitoring and IoT devices to scientific research. Those systems are specifically designed to efficiently manage data indexed by time. Because of the variety of ...
0
24
Metrics
Total Citations0
Total Downloads24
Last 12 Months24
Last 6 weeks9
Get Access
research-article
August 2024
Artifacts Available / v1.1
UniView: A Unified Autonomous Materialized View Management System for Various Databases
- Zhenrong Xu,
- Pengfei Wang,
- Guoze Xue,
- Qitong Yan,
- Shenghao Gong,
- Yelan Jiang,
- Yuren Mao,
- Yunjun Gao,
- Shu Shen,
- Wei Zhang,
- Dan Luo,
- Lu Chen
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4353–4356https://doi.org/10.14778/3685800.3685873

Materialized views (MVs) are critical for improving query performance of database systems, especially in online analytical processing (OLAP) databases. Typically, MVs are maintained by DBAs, which relies on prior knowledge and manual operations. Recently,...
0
35
Metrics
Total Citations0
Total Downloads35
Last 12 Months35
Last 6 weeks17
Get Access
research-article
August 2024
QPJVis Demo: Quality-Boost Progressive Join Query Processing System
- Xin Zhang,
- Ahmed Eldawy
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4345–4348https://doi.org/10.14778/3685800.3685871

Progressive query processing enables data scientists to efficiently analyze and explore large datasets. Data scientists can start further analyses earlier if the progressive result can represent the complete results well. Most progressive processing ...
0
13
Metrics
Total Citations0
Total Downloads13
Last 12 Months13
Last 6 weeks8
Get Access
research-article
August 2024
Artifacts Available / v1.1
Rodeo: Making Refinements for Diverse Top-K Queries
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4341–4344https://doi.org/10.14778/3685800.3685870

Database queries are commonly used to select and rank items. With the increasing awareness of diversity, ensuring a diverse output (i.e., the representation of different groups in the top-k positions) becomes essential. To address this challenge, we ...
0
16
Metrics
Total Citations0
Total Downloads16
Last 12 Months16
Last 6 weeks6
Get Access
research-article
August 2024
DBG-PT: A Large Language Model Assisted Query Performance Regression Debugger
- Victor Giannakouris,
- Immanuel Trummer
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4337–4340https://doi.org/10.14778/3685800.3685869

In this paper we explore the ability of Large Language Models (LLMs) in analyzing and comparing query plans, and resolving query performance regressions. We present DBG-PT, a query regression debugging framework powered by LLMs. DBG-PT keeps track of ...
0
40
Metrics
Total Citations0
Total Downloads40
Last 12 Months40
Last 6 weeks21
Get Access
research-article
August 2024
Spatial Query Optimization With Learning
- Xin Zhang,
- Ahmed Eldawy
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4245–4248https://doi.org/10.14778/3685800.3685846

Query optimization is a key component in database management systems (DBMS) and distributed data processing platforms. Recent research in the database community incorporated techniques from artificial intelligence to enhance query optimization. Various ...
0
94
Metrics
Total Citations0
Total Downloads94
Last 12 Months94
Last 6 weeks43
Get Access
research-article
August 2024
Native Distributed Databases: Problems, Challenges and Opportunities
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4217–4220https://doi.org/10.14778/3685800.3685839

Native distributed databases, crucial for scalable applications, offer transactional and analytical prowess but face data intricacies and network challenges. Under the CAP theorem's constraints, latency and replication issues necessitate creative ...
0
61
Metrics
Total Citations0
Total Downloads61
Last 12 Months61
Last 6 weeks29
Get Access
research-article
August 2024
Artifacts Available / v1.1
LLM for Data Management
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4213–4216https://doi.org/10.14778/3685800.3685838

Machine learning techniques have been verified to be effective in optimizing data management systems and are widely researched in recent years. However, traditional small-sized ML models often struggle to generalize to new scenarios, and have limited ...
1
266
Metrics
Total Citations1
Total Downloads266
Last 12 Months266
Last 6 weeks126
Get Access
research-article
August 2024
Grouping, Subsumption, and Disjunctive Join Optimizations in Oracle
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4200–4212https://doi.org/10.14778/3685800.3685837

Query optimization must evolve with new workloads. As analytic and data warehouse workloads become more ubiquitous, optimization techniques that reduce the amount of data processed during query execution, enable shared computation and avoid expensive ...
0
16
Metrics
Total Citations0
Total Downloads16
Last 12 Months16
Last 6 weeks3
Get Access
research-article
August 2024
Artifacts Available / v1.1
Petabyte-Scale Row-Level Operations in Data Lakehouses
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4159–4172https://doi.org/10.14778/3685800.3685834

Data lakehouses combine the almost infinite scale and diverse tooling of a data lake with the reliability and functionality of a data warehouse. This paper presents extensions that enhance data lake-houses using Apache Iceberg and Apache Spark with ...
0
33
Metrics
Total Citations0
Total Downloads33
Last 12 Months33
Last 6 weeks16
Get Access
research-article
August 2024
Lindorm-UWC: An Ultra-Wide-Column Database for Internet of Vehicles
- Qianyu Ouyang,
- Chunhui Shen,
- Wenlong Yang,
- Peng Yu,
- Qiang Xiao,
- Jianhui Lei,
- Yadong Chen,
- Qilu Zhong,
- Xiang Wang,
- Yong Lin,
- Qingyi Meng,
- Zhicheng Ji,
- Wei Meng,
- Cen Zheng,
- Sheng Wang,
- Dan Pei,
- Wei Zhang,
- Feifei Li,
- Jingren Zhou
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4117–4129https://doi.org/10.14778/3685800.3685831

In the Internet of Vehicle (IoV) systems, intelligent vehicles generate huge amounts of data that supports diverse services and applications. In practice, database systems are deployed in the cloud to manage data uploaded from the vehicle side and ...
0
57
Metrics
Total Citations0
Total Downloads57
Last 12 Months57
Last 6 weeks10
Get Access
research-article
August 2024
Artifacts Available / v1.1
Presto's History-Based Query Optimizer
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4077–4089https://doi.org/10.14778/3685800.3685828

An important feature of modern query optimizers is the ability to produce a query plan that is optimal for the underlying data set. This requires the ability to estimate cardinalities and computational costs of intermediate query plan nodes, which is ...
0
22
Metrics
Total Citations0
Total Downloads22
Last 12 Months22
Last 6 weeks9
Get Access
research-article
August 2024
SQL Has Problems. We Can Fix Them: Pipe Syntax In SQL
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 4051–4063https://doi.org/10.14778/3685800.3685826

SQL has been extremely successful as the de facto standard language for working with data. Virtually all mainstream database-like systems use SQL as their primary query language. But SQL is an old language with significant design problems, making it ...
0
63
Metrics
Total Citations0
Total Downloads63
Last 12 Months63
Last 6 weeks28
Get Access
research-article
August 2024
Adaptive and Robust Query Execution for Lakehouses at Scale
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3947–3959https://doi.org/10.14778/3685800.3685818

Many organizations have embraced the "Lakehouse" data management paradigm, which involves constructing structured data warehouses on top of open, unstructured data lakes. This approach stands in stark contrast to traditional, closed, relational databases ...
0
69
Metrics
Total Citations0
Total Downloads69
Last 12 Months69
Last 6 weeks40
Get Access
research-article
August 2024
Db2une: Tuning Under Pressure via Deep Learning
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3855–3868https://doi.org/10.14778/3685800.3685811

Modern database systems including IBM Db2 have numerous parameters, "knobs," that require precise configuration to achieve optimal workload performance. Even for experts, manually "tuning" these knobs is a challenging process. We present Db2une, an ...
0
44
Metrics
Total Citations0
Total Downloads44
Last 12 Months44
Last 6 weeks20
Get Access
research-article
August 2024
ClickHouse - Lightning Fast Analytics for Everyone
Proceedings of the VLDB Endowment (PVLDB), Volume 17, Issue 12Pages 3731–3744https://doi.org/10.14778/3685800.3685802

Over the past several decades, the amount of data being stored and analyzed has increased exponentially. Businesses across industries and sectors have begun relying on this data to improve products, evaluate performance, and make business-critical ...
0
87
Metrics
Total Citations0
Total Downloads87
Last 12 Months87
Last 6 weeks37
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

All Publications

Content Type

Media Formats

Publisher

Reproducibility Badges

Publication Date

Results

Databases Unbound: Querying All of the World's Bytes with AI

PrismX: A Single-Machine System for Querying Big Graphs

Pyneapple-G: Scalable Spatial Grouping Queries

Catcher: A Cache Analysis System for Top-k Pub/Sub Service

SEER: An End-to-End Toolkit for Benchmarking Time Series Database Systems in Monitoring Applications

UniView: A Unified Autonomous Materialized View Management System for Various Databases

QPJVis Demo: Quality-Boost Progressive Join Query Processing System

Rodeo: Making Refinements for Diverse Top-K Queries

DBG-PT: A Large Language Model Assisted Query Performance Regression Debugger

Spatial Query Optimization With Learning

Native Distributed Databases: Problems, Challenges and Opportunities

LLM for Data Management

Grouping, Subsumption, and Disjunctive Join Optimizations in Oracle

Petabyte-Scale Row-Level Operations in Data Lakehouses

Lindorm-UWC: An Ultra-Wide-Column Database for Internet of Vehicles

Presto's History-Based Query Optimizer

SQL Has Problems. We Can Fix Them: Pipe Syntax In SQL

Adaptive and Robust Query Execution for Lakehouses at Scale

Db2une: Tuning Under Pressure via Deep Learning

ClickHouse - Lightning Fast Analytics for Everyone