Approximate encoding for direct access and query processing over compressed bitmaps

Published: 01 September 2006


Bitmap indices have been widely and successfully used in scientific and commercial databases. Compression techniques based on run-length encoding are used to reduce their storage cost. However, these techniques introduce significant overhead in query processing, even when only a few rows are queried. We propose a new bitmap encoding scheme based on multiple hashing, in which the bitmap is kept in compressed form and can be accessed directly without decompression. Any subset of rows and/or columns can be retrieved efficiently by reconstructing and processing only the necessary subset of the bitmap. The proposed scheme provides approximate results, with a trade-off between space and accuracy. False misses are guaranteed not to occur, and the false positive rate can be estimated and controlled. We show that query execution is significantly faster than over WAH (Word-Aligned Hybrid) compressed bitmaps, which have previously been shown to achieve the fastest query response times. The proposed scheme achieves accurate results (90%-100%) and speeds up query processing by 1 to 3 orders of magnitude compared to WAH.
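To make the idea of hashing-based approximate encoding concrete, here is a minimal sketch in Python. It is not the paper's actual encoding: it assumes a plain Bloom-filter-style bit array in which each set cell (row, column) of the bitmap is mapped to several bit positions. The class name `ApproximateBitmap`, its methods, and the double-hashing construction over SHA-256 are all hypothetical choices made for illustration. The sketch shows the two properties stated in the abstract: a cell that was set is never reported as unset, and the false positive rate can be estimated from the array size, the number of hash functions, and the number of set cells (here with the standard Bloom filter approximation).

```python
import hashlib
import math


class ApproximateBitmap:
    """Bloom-filter-style stand-in for a bitmap index (illustrative only)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)
        self.num_set_cells = 0  # assumes each cell is inserted at most once

    def _positions(self, row: int, col: int):
        # Derive num_hashes positions from one digest via double hashing
        # (an assumption of this sketch, not taken from the paper).
        digest = hashlib.sha256(f"{row}:{col}".encode()).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def set_cell(self, row: int, col: int) -> None:
        """Record that bitmap cell (row, col) is 1."""
        for pos in self._positions(row, col):
            self.bits[pos >> 3] |= 1 << (pos & 7)
        self.num_set_cells += 1

    def test_cell(self, row: int, col: int) -> bool:
        """True if (row, col) may be set; False means it is definitely not set."""
        return all(
            self.bits[pos >> 3] & (1 << (pos & 7))
            for pos in self._positions(row, col)
        )

    def query_rows(self, rows, col: int):
        """Probe only the requested rows; nothing else is reconstructed."""
        return [r for r in rows if self.test_cell(r, col)]

    def estimated_false_positive_rate(self) -> float:
        # Standard Bloom filter approximation: (1 - e^(-k*n/m))^k.
        k, n, m = self.num_hashes, self.num_set_cells, self.num_bits
        return (1.0 - math.exp(-k * n / m)) ** k


# Example: column 0 is set for every third row out of 300.
bitmap = ApproximateBitmap(num_bits=1 << 16, num_hashes=4)
true_rows = list(range(0, 300, 3))
for r in true_rows:
    bitmap.set_cell(r, 0)

candidates = bitmap.query_rows(range(300), 0)
print(len(true_rows), len(candidates), bitmap.estimated_false_positive_rate())
```

Under these assumptions, `candidates` always contains every row in `true_rows` and may contain a few extra rows. Enlarging `num_bits` lowers the estimated false positive rate at the cost of space, which is the trade-off the abstract describes.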




Published In

VLDB '06: Proceedings of the 32nd international conference on Very large data bases
September 2006
1269 pages


  • SIGMOD: ACM Special Interest Group on Management of Data
  • K.I.S.S. SIG on Databases
  • AJU Information Technology Co., Ltd
  • US Army ITC-PAC Asian Research Office
  • Google Inc.
  • The Database Society of Japan
  • Samsung SOS
  • Advanced Information Technology Research Center
  • Naver
  • Microsoft
  • Korea Information Science Society
  • SK telecom
  • Systems Applications Products
  • International Business Management
  • Air Force Office of Scientific Research/Asian Office of Aerospace R&D
  • Kosef
  • Kaist
  • LG Electronics


VLDB Endowment



  (2016) An efficient method to evaluate intersections on big data sets. Theoretical Computer Science, 10.1016/j.tcs.2016.07.018, 647:C, (1-21). Online publication date: 27-Sep-2016
  (2015) The hyperdyadic index and generalized indexing and query with PIQUE. Proceedings of the 27th International Conference on Scientific and Statistical Database Management, 10.1145/2791347.2791374, (1-12). Online publication date: 29-Jun-2015
  (2015) A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew. Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, 10.1145/2723372.2737787, (1509-1524). Online publication date: 27-May-2015
  (2010) Position list word aligned hybrid. Proceedings of the 13th International Conference on Extending Database Technology, 10.1145/1739041.1739071, (228-239). Online publication date: 22-Mar-2010
  (2009) Correlation maps. Proceedings of the VLDB Endowment, 10.14778/1687627.1687765, 2:1, (1222-1233). Online publication date: 1-Aug-2009
  (2009) Inverted indexes vs. bitmap indexes in decision support systems. Proceedings of the 18th ACM conference on Information and knowledge management, 10.1145/1645953.1646158, (1509-1512). Online publication date: 2-Nov-2009
  (2009) Secondary indexing in one dimension. Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, 10.1145/1559795.1559824, (177-186). Online publication date: 29-Jun-2009
  (2008) Dynamic data organization for bitmap indices. Proceedings of the 3rd international conference on Scalable information systems, 10.5555/1459693.1459733, (1-10). Online publication date: 4-Jun-2008
  (2008) Brighthouse. Proceedings of the VLDB Endowment, 10.14778/1454159.1454174, 1:2, (1337-1345). Online publication date: 1-Aug-2008
  (2007) Space-efficient structures for detecting port scans. Proceedings of the 18th international conference on Database and Expert Systems Applications, 10.5555/2395856.2395873, (120-129). Online publication date: 3-Sep-2007
