research-article

The uncracked pieces in database cracking

Editors: H. V. Jagadish, Aoying Zhou Authors:

Felix Martin Schuhknecht,

Jens DittrichAuthors Info & Claims

Proceedings of the VLDB Endowment, Volume 7, Issue 2

Pages 97 - 108

https://doi.org/10.14778/2732228.2732229

Published: 01 October 2013 Publication History

Abstract

Database cracking has been an area of active research in recent years. The core idea of database cracking is to create indexes adaptively and incrementally as a side-product of query processing. Several works have proposed different cracking techniques for different aspects including updates, tuple-reconstruction, convergence, concurrency-control, and robustness. However, there is a lack of any comparative study of these different methods by an independent group. In this paper, we conduct an experimental study on database cracking. Our goal is to critically review several aspects, identify the potential, and propose promising directions in database cracking. With this study, we hope to expand the scope of database cracking and possibly leverage cracking in database engines other than MonetDB.

We repeat several prior database cracking works including the core cracking algorithms as well as three other works on convergence (hybrid cracking), tuple-reconstruction (sideways cracking), and robustness (stochastic cracking) respectively. We evaluate these works and show possible directions to do even better. We further test cracking under a variety of experimental settings, including high selectivity queries, low selectivity queries, and multiple query access patterns. Finally, we compare cracking against different sorting algorithms as well as against different main-memory optimised indexes, including the recently proposed Adaptive Radix Tree (ART). Our results show that: (i) the previously proposed cracking algorithms are repeatable, (ii) there is still enough room to significantly improve the previously proposed cracking algorithms, (iii) cracking depends heavily on query selectivity, (iv) cracking needs to catch up with modern indexing trends, and (v) different indexing algorithms have different indexing signatures.

References

[1]

Generalized Heap Implementation. https://github.com/valyala/gheap.

[2]

O. R. Birkeland. Searching Large Data Volumes with MISD Processing. PhD thesis.

[3]

T. Finch. Incremental calculation of weighted mean and variance. University of Cambridge Computing Service, 2009.

[4]

G. Graefe, F. Halim, S. Idreos, et al. Concurrency Control for Adaptive Indexing. In PVLDB, pages 656--667, 2012.

Digital Library

[5]

G. Graefe and H. Kuno. Self-selecting, Self-tuning, Incrementally Optimized Indexes. In EDBT, pages 371--381, 2010.

Digital Library

[6]

F. Halim, S. Idreos, et al. Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores. In PVLDB, pages 502--513, 2012.

Digital Library

[7]

P. Hildebrandt and H. Isbitz. Radix Exchange - An Internal Sorting Method for Digital Computers. J. ACM, pages 156--163, 1959.

Digital Library

[8]

S. Idreos et al. Database Cracking. In CIDR, pages 68--78, 2007.

[9]

S. Idreos, M. Kersten, et al. Self-organizing Tuple Reconstruction In Column-stores. In SIGMOD, pages 297--308, 2009.

Digital Library

[10]

S. Idreos, M. Kersten, and S. Manegold. Updating a Cracked Database. In SIGMOD, pages 413--424, 2007.

Digital Library

[11]

S. Idreos, S. Manegold, H. Kuno, and G. Graefe. Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores. In PVLDB, pages 585--597, 2011.

Digital Library

[12]

A. Jindal, E. Palatinus, V. Pavlov, and J. Dittrich. A Comparison of Knives for Bread Slicing. In PVLDB, pages 361--372, 2013.

Digital Library

[13]

M. Kersten et al. Cracking the Database Store. In CIDR, pages 213--224, 2005.

[14]

C. Kim et al. FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs. In SIGMOD, pages 339--350, 2010.

Digital Library

[15]

V. Leis et al. The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases. In ICDE, pages 38--49, 2013.

Digital Library

[16]

X. Martinez-Palau, D. Dominguez-Sal, and J. L. Larriba-Pey. Two-way Replacement Selection. In PVLDB, pages 871--881, 2010.

Digital Library

[17]

J. Rao and K. A. Ross. Making B+-Trees Cache Conscious in Main Memory. In SIGMOD, pages 475--486, 2000.

Digital Library

Cited By

Mozaffari MDignös AGamper JStörl U(2024)Self-tuning Database Systems: A Systematic Literature Review of Automatic Database Schema Design and TuningACM Computing Surveys10.1145/366532356:11(1-37)Online publication date: 17-May-2024
https://dl.acm.org/doi/10.1145/3665323
Lampropoulos KZardbani FMamoulis NKarras P(2023)Adaptive Indexing in High-Dimensional Metric SpacesProceedings of the VLDB Endowment10.14778/3603581.360359216:10(2525-2537)Online publication date: 1-Jun-2023
https://dl.acm.org/doi/10.14778/3603581.3603592
Schuhknecht FHenneberg J(2023)Accelerating Main-Memory Table Scans with Partial Virtual ViewsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595315(89-93)Online publication date: 18-Jun-2023
https://dl.acm.org/doi/10.1145/3592980.3595315
Show More Cited By

Recommendations

An experimental evaluation and analysis of database cracking

Database cracking has been an area of active research in recent years. The core idea of database cracking is to create indexes adaptively and incrementally as a side product of query processing. Several works have proposed different cracking techniques ...
Cracking behavior of RC panels subject to biaxial tensile stresses

An analytical model which can simulate the post-cracking nonlinear behavior of reinforced concrete (RC) members such as bars and panels subject to uniaxial and biaxial tensile stresses is presented. The proposed model includes the description of biaxial ...
Cracking in-memory database index: A case study for Adaptive Radix Tree index
Abstract
Indexes provide a method to access data in databases quickly. It can improve the response speed of subsequent queries by building a complete index in advance. However, it also leads to a huge overhead of the continuous updating during ...
Highlights
- In-memory database indexes have more extensive research and application space.
- ...

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment

Proceedings of the VLDB Endowment Volume 7, Issue 2

October 2013

36 pages

ISSN:2150-8097

Editors:
H. V. Jagadish
University of Michigan
,
Aoying Zhou
East Normal University, China

Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 October 2013

Published in PVLDB Volume 7, Issue 2

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

26
Total Citations
View Citations
228
Total Downloads

Downloads (Last 12 months)29
Downloads (Last 6 weeks)3

Reflects downloads up to 11 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Mozaffari MDignös AGamper JStörl U(2024)Self-tuning Database Systems: A Systematic Literature Review of Automatic Database Schema Design and TuningACM Computing Surveys10.1145/366532356:11(1-37)Online publication date: 17-May-2024
https://dl.acm.org/doi/10.1145/3665323
Lampropoulos KZardbani FMamoulis NKarras P(2023)Adaptive Indexing in High-Dimensional Metric SpacesProceedings of the VLDB Endowment10.14778/3603581.360359216:10(2525-2537)Online publication date: 1-Jun-2023
https://dl.acm.org/doi/10.14778/3603581.3603592
Schuhknecht FHenneberg J(2023)Accelerating Main-Memory Table Scans with Partial Virtual ViewsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595315(89-93)Online publication date: 18-Jun-2023
https://dl.acm.org/doi/10.1145/3592980.3595315
Dittrich JNix JSchön C(2022)The next 50 years in database indexing orProceedings of the VLDB Endowment10.14778/3494124.349413615:3(527-540)Online publication date: 4-Feb-2022
https://dl.acm.org/doi/10.14778/3494124.3494136
Zhang JZhou KLi GLiu YXie MCheng BXing J(2021): An efficient deep reinforcement learning-based automatic cloud database tuning systemThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00670-930:6(959-987)Online publication date: 5-Jun-2021
https://dl.acm.org/doi/10.1007/s00778-021-00670-9
Rong KLu YBailis PKandula SLevis P(2020)Approximate partition selection for big-data workloads using summary statisticsProceedings of the VLDB Endowment10.14778/3407790.340784813:12(2606-2619)Online publication date: 14-Sep-2020
https://dl.acm.org/doi/10.14778/3407790.3407848
Idreos SCallaghan MMaier DPottinger RDoan ATan WAlawini ANgo H(2020)Key-Value Storage EnginesProceedings of the 2020 ACM SIGMOD International Conference on Management of Data10.1145/3318464.3383133(2667-2672)Online publication date: 11-Jun-2020
https://dl.acm.org/doi/10.1145/3318464.3383133
Kandula SOrr LChaudhuri S(2019)Pushing data-induced predicates through joins in big-data clustersProceedings of the VLDB Endowment10.14778/3368289.336829213:3(252-265)Online publication date: 1-Nov-2019
https://dl.acm.org/doi/10.14778/3368289.3368292
Balakrishnan DZiarek LKennedy OCheung ANguyễn K(2019)Fluid data structuresProceedings of the 17th ACM SIGPLAN International Symposium on Database Programming Languages10.1145/3315507.3330197(3-17)Online publication date: 23-Jun-2019
https://dl.acm.org/doi/10.1145/3315507.3330197
Idreos SKraska TBoncz PManegold SAilamaki ADeshpande AKraska T(2019)From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive SystemsProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3314034(2054-2059)Online publication date: 25-Jun-2019
https://dl.acm.org/doi/10.1145/3299869.3314034
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents