Article (Conference Proceedings), DOI: 10.5555/648315.756320

Bitmap Indices for Speeding Up High-Dimensional Data Analysis

Published: 02 September 2002

Abstract

Bitmap indices have gained wide acceptance in data warehouse applications and are an efficient access method for querying large amounts of read-only data. The main trend in bitmap index research focuses on typical business applications based on discrete attribute values. However, scientific data that is mostly characterised by nondiscrete attributes cannot be queried efficiently by currently supported access methods.

In our previous work [13] we introduced a novel bitmap algorithm called GenericRangeEval for efficiently querying scientific data. We evaluated our approach based primarily on uniformly distributed and independent data. In this paper we analyse the behaviour of our bitmap index algorithm against various queries based on different data distributions.

We have implemented an improved version of one of the most cited bitmap compression algorithms called Byte Aligned Bitmap Compression and adapted it to our bitmap indices. To prove the efficiency of our access method, we carried out high-dimensional queries against real data taken from two different scientific applications, namely High Energy Physics and Astronomy. The results clearly show that depending on the underlying data distribution and the query access patterns, our proposed bitmap indices can significantly improve the response time of high-dimensional queries when compared to conventional access methods.
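The abstract does not describe how GenericRangeEval works, so the following is not that algorithm. It is only a minimal sketch of the general idea of a bitmap index over a nondiscrete attribute, assuming a simple equality encoding with one bitmap per value bin. The class `BinnedBitmapIndex`, the helper `rows`, the bin edges, and the sample data are all hypothetical, and the sketch omits compression entirely.

```python
from bisect import bisect_right


class BinnedBitmapIndex:
    """Hypothetical equality-encoded bitmap index: one bitmap per value bin."""

    def __init__(self, values, edges):
        # edges are sorted bin boundaries; bin i covers [edges[i], edges[i + 1])
        self.values = list(values)
        self.edges = list(edges)
        self.bitmaps = [0] * (len(self.edges) - 1)  # Python ints used as bitsets
        for row, v in enumerate(self.values):
            self.bitmaps[self._bin(v)] |= 1 << row

    def _bin(self, v):
        if not self.edges[0] <= v < self.edges[-1]:
            raise ValueError(f"value {v} lies outside the binned range")
        return bisect_right(self.edges, v) - 1

    def range_query(self, lo, hi):
        """Return a bitmap of the rows whose value satisfies lo <= value < hi."""
        hits = 0
        for i, bitmap in enumerate(self.bitmaps):
            b_lo, b_hi = self.edges[i], self.edges[i + 1]
            if b_hi <= lo or b_lo >= hi:
                continue  # bin lies entirely outside the query range
            if lo <= b_lo and b_hi <= hi:
                hits |= bitmap  # bin lies entirely inside: every row qualifies
            else:
                # edge bin: rows are only candidates, so check the raw values
                hits |= self._check(bitmap, lo, hi)
        return hits

    def _check(self, bitmap, lo, hi):
        confirmed, row = 0, 0
        while bitmap:
            if bitmap & 1 and lo <= self.values[row] < hi:
                confirmed |= 1 << row
            bitmap >>= 1
            row += 1
        return confirmed


def rows(bitmap):
    """List the row numbers whose bit is set."""
    return [i for i in range(bitmap.bit_length()) if (bitmap >> i) & 1]


# Two made-up attributes over six rows, sharing the same bin edges.
energy = [0.5, 3.2, 7.9, 4.4, 9.1, 2.0]
momentum = [1.5, 0.2, 6.0, 5.5, 8.8, 3.3]
edges = [0, 2.5, 5, 7.5, 10]

energy_index = BinnedBitmapIndex(energy, edges)
momentum_index = BinnedBitmapIndex(momentum, edges)

# A two-dimensional range query is the bitwise AND of the per-attribute results.
result = energy_index.range_query(2.0, 8.0) & momentum_index.range_query(3.0, 10.0)
print(rows(result))  # [2, 3, 5]
```

In this sketch, bins that fall completely inside the query range are answered from the bitmaps alone, and only the two edge bins require a look at the raw values. Adding a further query dimension costs one more bitwise AND, which is the property that makes bitmap indices attractive for high-dimensional queries.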

References

[1] G. Antoshenkov, Byte-Aligned Bitmap Compression, Technical Report, Oracle Corp., 1994.
[2] S. Amer-Yahia, T. Johnson, Optimizing Queries On Compressed Bitmaps, Proceedings of 26th International Conference on Very Large Data Bases, Cairo, Egypt, September 2000, Morgan Kaufmann.
[3] S. Berchtold, C. Boehm, H.-P. Kriegel, The Pyramid-Tree: Breaking the Curse of Dimensionality, SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, June 1998.
[4] C. Chan, Y.E. Ioannidis, Bitmap Index Design and Evaluation, Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, June 1998.
[5] C. Chan, Y.E. Ioannidis, An Efficient Bitmap Encoding Scheme for Selection Queries, Proceedings ACM SIGMOD International Conference on Management of Data, Philadelphia, Pennsylvania, USA, June 1999.
[6] T. Johnson, Performance Measurements of Compressed Bitmap Indices, Proceedings of 25th International Conference on Very Large Data Bases, Edinburgh, Scotland, UK, September 1999, Morgan Kaufmann.
[7] N. Koudas, Space Efficient Bitmap Indexing, International Conference on Information and Knowledge Management, McLean, VA, USA, November 2000.
[8] P. O'Neil, D. Quass, Improved Query Performance with Variant Indexes, Proceedings ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997.
[9] A. Shoshani, L.M. Bernardo, H. Nordberg, D. Rotem, A. Sim, Multidimensional Indexing and Query Coordination for Tertiary Storage Management, 11th International Conference on Scientific and Statistical Database Management, Cleveland, Ohio, USA, July 1999.
[10] Sloan Digital Sky Survey, http://www.sdss.org
[11] A. Szalay, P. Kunszt, A. Thakar, J. Gray, D. Slutz, Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey, Proceedings ACM SIGMOD International Conference on Management of Data, Philadelphia, Pennsylvania, USA, June 1999.
[12] K. Stockinger, D. Duellmann, W. Hoschek, E. Schikuta, Improving the Performance of High Energy Physics Analysis through Bitmap Indices, International Conference on Database and Expert Systems Applications, London - Greenwich, UK, September 2000, Springer-Verlag.
[13] K. Stockinger, Design and Implementation of Bitmap Indices for Scientific Data, International Database Engineering & Applications Symposium, Grenoble, France, July 2001, IEEE Computer Society Press.
[14] K. Stockinger, Performance Analysis of Generic vs. Sliced Tags in HepODBMS, International Conference on Computing in High Energy and Nuclear Physics, Beijing, China, September 2001.
[15] K. Stockinger, Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN, Ph.D. Thesis, University of Vienna, Austria, November 2001.
[16] M. Wu, A.P. Buchmann, Encoded Bitmap Indexing for Data Warehouses, Proceedings of the Fourteenth International Conference on Data Engineering, Orlando, Florida, USA, February 1998.
[17] M. Wu, Query Optimization for Selections Using Bitmaps, SIGMOD 1999, Proceedings ACM SIGMOD International Conference on Management of Data, Philadelphia, Pennsylvania, USA, June 1999.
[18] K. Wu, E. J. Otoo, A. Shoshani, A Performance Comparison of Bitmap Indexes, International Conference on Information and Knowledge Management, Atlanta, Georgia, USA, November 2001.
[19] K. Wu, P.S. Yu, Range-Based Bitmap Indexing for High-Cardinality Attributes with Skew, Technical Report, IBM Watson Research Center, May 1996.

Published In

DEXA '02: Proceedings of the 13th International Conference on Database and Expert Systems Applications
September 2002
949 pages

Publisher

Springer-Verlag, Berlin, Heidelberg

Cited By

  • (2017) Frag-shells cube based on hierarchical dimension encoding tree. Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication, pp. 1-9. DOI: 10.1145/3022227.3022229. Online publication date: 5-Jan-2017
  • (2016) UpBit. Proceedings of the 2016 International Conference on Management of Data, pp. 1319-1332. DOI: 10.1145/2882903.2915964. Online publication date: 26-Jun-2016
  • (2016) Design Tradeoffs of Data Access Methods. Proceedings of the 2016 International Conference on Management of Data, pp. 2195-2200. DOI: 10.1145/2882903.2912569. Online publication date: 26-Jun-2016
  • (2012) Efficient content-based indexing of sequential data with bitmaps. Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology, pp. 157-164. DOI: 10.1145/2393216.2393243. Online publication date: 26-Oct-2012
  • (2009) Divide-and-conquer scheme for strictly optimal retrieval of range queries. ACM Transactions on Storage 5:3, pp. 1-32. DOI: 10.1145/1629075.1629077. Online publication date: 30-Nov-2009
  • (2007) Space-efficient structures for detecting port scans. Proceedings of the 18th international conference on Database and Expert Systems Applications, pp. 120-129. DOI: 10.5555/2395856.2395873. Online publication date: 3-Sep-2007
  • (2006) Approximate encoding for direct access and query processing over compressed bitmaps. Proceedings of the 32nd international conference on Very large data bases, pp. 846-857. DOI: 10.5555/1182635.1164200. Online publication date: 1-Sep-2006
  • (2006) Indexing incomplete databases. Proceedings of the 10th international conference on Advances in Database Technology, pp. 884-901. DOI: 10.1007/11687238_52. Online publication date: 26-Mar-2006
  • (2005) Compressing Bitmap Indices by Data Reorganization. Proceedings of the 21st International Conference on Data Engineering, pp. 310-321. DOI: 10.1109/ICDE.2005.35. Online publication date: 5-Apr-2005
  • (2004) On the performance of bitmap indices for high cardinality attributes. Proceedings of the Thirtieth international conference on Very large data bases - Volume 30, pp. 24-35. DOI: 10.5555/1316689.1316694. Online publication date: 31-Aug-2004
