Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

The uncracked pieces in database cracking

Published: 01 October 2013 Publication History

Abstract

Database cracking has been an area of active research in recent years. The core idea of database cracking is to create indexes adaptively and incrementally as a side-product of query processing. Several works have proposed different cracking techniques for different aspects including updates, tuple-reconstruction, convergence, concurrency-control, and robustness. However, there is a lack of any comparative study of these different methods by an independent group. In this paper, we conduct an experimental study on database cracking. Our goal is to critically review several aspects, identify the potential, and propose promising directions in database cracking. With this study, we hope to expand the scope of database cracking and possibly leverage cracking in database engines other than MonetDB.
We repeat several prior database cracking works including the core cracking algorithms as well as three other works on convergence (hybrid cracking), tuple-reconstruction (sideways cracking), and robustness (stochastic cracking) respectively. We evaluate these works and show possible directions to do even better. We further test cracking under a variety of experimental settings, including high selectivity queries, low selectivity queries, and multiple query access patterns. Finally, we compare cracking against different sorting algorithms as well as against different main-memory optimised indexes, including the recently proposed Adaptive Radix Tree (ART). Our results show that: (i) the previously proposed cracking algorithms are repeatable, (ii) there is still enough room to significantly improve the previously proposed cracking algorithms, (iii) cracking depends heavily on query selectivity, (iv) cracking needs to catch up with modern indexing trends, and (v) different indexing algorithms have different indexing signatures.

References

[1]
Generalized Heap Implementation. https://github.com/valyala/gheap.
[2]
O. R. Birkeland. Searching Large Data Volumes with MISD Processing. PhD thesis.
[3]
T. Finch. Incremental calculation of weighted mean and variance. University of Cambridge Computing Service, 2009.
[4]
G. Graefe, F. Halim, S. Idreos, et al. Concurrency Control for Adaptive Indexing. In PVLDB, pages 656--667, 2012.
[5]
G. Graefe and H. Kuno. Self-selecting, Self-tuning, Incrementally Optimized Indexes. In EDBT, pages 371--381, 2010.
[6]
F. Halim, S. Idreos, et al. Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores. In PVLDB, pages 502--513, 2012.
[7]
P. Hildebrandt and H. Isbitz. Radix Exchange - An Internal Sorting Method for Digital Computers. J. ACM, pages 156--163, 1959.
[8]
S. Idreos et al. Database Cracking. In CIDR, pages 68--78, 2007.
[9]
S. Idreos, M. Kersten, et al. Self-organizing Tuple Reconstruction In Column-stores. In SIGMOD, pages 297--308, 2009.
[10]
S. Idreos, M. Kersten, and S. Manegold. Updating a Cracked Database. In SIGMOD, pages 413--424, 2007.
[11]
S. Idreos, S. Manegold, H. Kuno, and G. Graefe. Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores. In PVLDB, pages 585--597, 2011.
[12]
A. Jindal, E. Palatinus, V. Pavlov, and J. Dittrich. A Comparison of Knives for Bread Slicing. In PVLDB, pages 361--372, 2013.
[13]
M. Kersten et al. Cracking the Database Store. In CIDR, pages 213--224, 2005.
[14]
C. Kim et al. FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs. In SIGMOD, pages 339--350, 2010.
[15]
V. Leis et al. The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases. In ICDE, pages 38--49, 2013.
[16]
X. Martinez-Palau, D. Dominguez-Sal, and J. L. Larriba-Pey. Two-way Replacement Selection. In PVLDB, pages 871--881, 2010.
[17]
J. Rao and K. A. Ross. Making B+-Trees Cache Conscious in Main Memory. In SIGMOD, pages 475--486, 2000.

Cited By

View all
  • (2024)Self-tuning Database Systems: A Systematic Literature Review of Automatic Database Schema Design and TuningACM Computing Surveys10.1145/366532356:11(1-37)Online publication date: 17-May-2024
  • (2023)Adaptive Indexing in High-Dimensional Metric SpacesProceedings of the VLDB Endowment10.14778/3603581.360359216:10(2525-2537)Online publication date: 1-Jun-2023
  • (2023)Accelerating Main-Memory Table Scans with Partial Virtual ViewsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595315(89-93)Online publication date: 18-Jun-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 7, Issue 2
October 2013
36 pages
ISSN:2150-8097
  • Editors:
  • H. V. Jagadish,
  • Aoying Zhou
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 October 2013
Published in PVLDB Volume 7, Issue 2

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)3
Reflects downloads up to 11 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Self-tuning Database Systems: A Systematic Literature Review of Automatic Database Schema Design and TuningACM Computing Surveys10.1145/366532356:11(1-37)Online publication date: 17-May-2024
  • (2023)Adaptive Indexing in High-Dimensional Metric SpacesProceedings of the VLDB Endowment10.14778/3603581.360359216:10(2525-2537)Online publication date: 1-Jun-2023
  • (2023)Accelerating Main-Memory Table Scans with Partial Virtual ViewsProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595315(89-93)Online publication date: 18-Jun-2023
  • (2022)The next 50 years in database indexing orProceedings of the VLDB Endowment10.14778/3494124.349413615:3(527-540)Online publication date: 4-Feb-2022
  • (2021): An efficient deep reinforcement learning-based automatic cloud database tuning systemThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-021-00670-930:6(959-987)Online publication date: 5-Jun-2021
  • (2020)Approximate partition selection for big-data workloads using summary statisticsProceedings of the VLDB Endowment10.14778/3407790.340784813:12(2606-2619)Online publication date: 14-Sep-2020
  • (2020)Key-Value Storage EnginesProceedings of the 2020 ACM SIGMOD International Conference on Management of Data10.1145/3318464.3383133(2667-2672)Online publication date: 11-Jun-2020
  • (2019)Pushing data-induced predicates through joins in big-data clustersProceedings of the VLDB Endowment10.14778/3368289.336829213:3(252-265)Online publication date: 1-Nov-2019
  • (2019)Fluid data structuresProceedings of the 17th ACM SIGPLAN International Symposium on Database Programming Languages10.1145/3315507.3330197(3-17)Online publication date: 23-Jun-2019
  • (2019)From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive SystemsProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3314034(2054-2059)Online publication date: 25-Jun-2019
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media