research-article

Row-wise parallel predicate evaluation

Published: 01 August 2008

Abstract

Table scans have become more interesting recently due to greater use of ad-hoc queries and greater availability of multi-core, vector-enabled hardware. Table scan performance is limited by value representation, table layout, and processing techniques. In this paper we propose a new layout and processing technique for efficient one-pass predicate evaluation. Starting with a set of rows with a fixed number of bits per column, we append columns to form a set of banks and then pad each bank to a supported machine word length, typically 16, 32, or 64 bits. We then evaluate partial predicates on the columns of each bank, using a novel evaluation strategy that evaluates column-level equality tests, range tests, IN-list predicates, and conjuncts of these predicates simultaneously on multiple columns within a bank and on multiple rows within a machine register. This approach outperforms pure column stores, which must evaluate the partial predicates one column at a time. We evaluate and compare the performance and representation overhead of this new approach and several proposed alternatives.
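As a rough illustration of the bank idea described in the abstract, the following Python sketch packs fixed-width columns into a padded 16-bit bank, places four such banks in one 64-bit register, and tests a conjunct of equality predicates on several columns and all four rows with a handful of bitwise operations. This is not the paper's algorithm: the column widths, the function names (`pack_row`, `pack_register`, `equality_conjunct`), and the zero-detection trick that relies on a spare padding bit are all assumptions chosen for the example, and range and IN-list predicates are not covered.

```python
# Illustrative sketch only. Widths, names, and the zero-test trick are
# assumptions made for this example, not details taken from the paper.
WIDTHS = [3, 5, 4]      # hypothetical bits per column (12 data bits total)
BANK_BITS = 16          # each bank is padded to a 16-bit machine word
ROWS_PER_REG = 4        # four 16-bit banks fit in one 64-bit register

# The zero test below needs at least one unused padding bit per bank.
assert sum(WIDTHS) < BANK_BITS

# Bit offset of each column inside a bank: [0, 3, 8] for the widths above.
OFFS = [sum(WIDTHS[:i]) for i in range(len(WIDTHS))]


def pack_row(values):
    """Concatenate one row's column values into a single bank word."""
    word = 0
    for v, w, off in zip(values, WIDTHS, OFFS):
        assert 0 <= v < (1 << w)
        word |= v << off
    return word


def replicate(bank_word):
    """Copy a bank-sized bit pattern into every bank slot of a register."""
    return sum(bank_word << (i * BANK_BITS) for i in range(ROWS_PER_REG))


def pack_register(rows):
    """Place up to ROWS_PER_REG packed rows side by side in one register."""
    reg = 0
    for i, row in enumerate(rows):
        reg |= pack_row(row) << (i * BANK_BITS)
    return reg


def equality_conjunct(reg, constants):
    """Evaluate AND of (column == value) for every row in the register.

    constants maps a column index to the value it must equal.
    Returns one boolean per row slot.
    """
    mask = pattern = 0
    for col, val in constants.items():
        mask |= ((1 << WIDTHS[col]) - 1) << OFFS[col]
        pattern |= val << OFFS[col]
    # Bits that differ from the constants, restricted to the tested columns.
    diff = (reg ^ replicate(pattern)) & replicate(mask)
    # Adding 0x7FFF to each bank sets its top bit exactly when diff != 0;
    # the padding bits guarantee no carry crosses into the next bank.
    low = replicate((1 << (BANK_BITS - 1)) - 1)
    high = replicate(1 << (BANK_BITS - 1))
    nonzero = (diff + low) & high
    top = BANK_BITS - 1
    return [((nonzero >> (i * BANK_BITS + top)) & 1) == 0
            for i in range(ROWS_PER_REG)]


rows = [(1, 7, 2), (1, 7, 3), (2, 7, 2), (1, 0, 2)]
reg = pack_register(rows)
print(equality_conjunct(reg, {0: 1, 2: 2}))
```

Here the predicate `col0 == 1 AND col2 == 2` is checked against all four rows in one pass over the register, whereas a column-at-a-time evaluation would visit each column separately and combine the results afterwards.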


Published In

Proceedings of the VLDB Endowment  Volume 1, Issue 1
August 2008
1216 pages

Publisher

VLDB Endowment

