Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Multiattribute hashing using Gray codes

Published: 15 June 1986 Publication History
  • Get Citation Alerts
  • Abstract

    Multiattribute hashing and its variations have been proposed for partial match and range queries in the past. The main idea is that each record yields a bitstring @@@@ (“record signature”), according to the values of its attributes. The binary value (@@@@)2 of this string decides the bucket that the record is stored. In this paper we propose to use Gray codes instead of binary codes, in order to map record signatures to buckets. In Gray codes, successive codewords differ in the value of exactly one bit position, thus, successive buckets hold records with similar record signatures. The proposed method achieves better clustering of similar records and avoids some of the (expensive) random disk accesses, replacing them with sequential ones. We develop a mathematical model, derive formulas giving the average performance of both methods and show that the proposed method achieves 0% - 50% relative savings over the binary codes. We also discuss how Gray codes could be applied to some retrieval methods designed for range queries, such as the grid file [Nievergelt84a] and the approach based on the so-called z-ordering [Orenstein84a].

    References

    [1]
    Aho, A V and J D Ullman, "Optimal Partial Match Retrieval When Fields are Independently Specified," A CM TOPS, vol 4, no 2, pp 168- 179, June 1979
    [2]
    Bentley, J L, "Multi&menmonal Binary Search Trees Used for Associative Searching," CA CM, vol 18, no 9, pp 509-517, Sept 1975
    [3]
    Cardenas, A F, "Analysis and Performance of Inverted Data Base Structures," CA CM, vol 18, no 5, pp 253-263, May 1975
    [4]
    Fagln, R, J Nlevergeit, N Plppenger, and H R Strong, "Extendible Hashing- A Fast Access Method for Dynamm Fdes," A CM TOPS, vol 4, no 3, pp 315-344, Sept 1979
    [5]
    Faloutsos, C, "Gray Codes for Partial Match and Range Queries," IEEE Trans on Software Eng,neer:ng, 1985 Submitted for pubhcatlon
    [6]
    Gtlbert, E N, "Gray Codes and Paths on the n- Cube," Bell System Techn,cal Journal, vol 37, no 3, pp 815-826, May 1958
    [7]
    Gray, F, Pulse Code Commumcatsons, US Patent 2632058, March 17, 1953
    [8]
    Larson, P, "Dynamm Hashing," BIT, vol 18, pp 184-201, 1978
    [9]
    Larson, P A, "Performance Analysis of Linear Hashing with Partial Expaasmns," A UM TODS, vol 7, no 4, pp 566-587, Dec 1982
    [10]
    Lltwm, W, "Linear Hashing A new Tool for File and Table Addressing," Proc 6th Interaatsonal Conference on VLDB, pp 212-223, Montreal, Oct 1980
    [11]
    Lloyd, J W, "Optimal Partial-Match Retrieval," BIT, vol 20, pp 406-413, 1980
    [12]
    Lloyd, J W and K Ramamohanarao, "Partial- Match Retrieval for Dynamm Files," BIT, vol 22, pp 150-168, 1982
    [13]
    Martin, G N N, "Spiral Storage Incrementally Augmentable Hash Addressed Storage," Theory of Computatlon, Report No 27, Umv of Warwmk, Coventry, England, March 1979
    [14]
    Nlevergelt, J, H Hmterberger, and K C Sevclk, "The Grid File An Adaptable, Symmetric Multlkey File Structure," A UM TODS, vol 9, no 1, pp 38-71, March 1984
    [15]
    Orenstem, J A and T H Merrett, "A Class of Data Structures for Associative Searching," Proc of SIGA CT-SIGMOD, pp 181-190, Waterloo, Ontario, Canada, April 2-4, 1984
    [16]
    Ramamohanarao, K, JW Lloyd, and JA Thorn, "Partial-Match Retrieval Using Hashing and Descriptors," A UM TODS, vol 8, no 4, pp 552-576, Dec 1983
    [17]
    Remgold, E M, J Nlevergelt, and N Deo, Uomb:natorsal Algorsthms Theory and Praetsee, Prentice-Hall Inc, Englewood Chffs, New Jersey, 1977
    [18]
    Rlvest, R L, "Partial Match Retrieval Algorithms," SIAM J Compet, vol 5, no 1, pp 19-50, March 1976
    [19]
    Robinson, J T, "The k-D-B-Tree A Search Structure for Large Multldlmenmonal Dynamic Indexes," Proe ACM SIGMOD, pp 10-18, 1981
    [20]
    Rothme, J B and T Lozano, "Attribute Based File Orgamzatlon in a Paged Memory Environment," CA CM, vol 17, no 2, pp 63-69, Feb 1974

    Cited By

    View all
    • (2022)Efficient Point Cloud Analysis Using Hilbert CurveComputer Vision – ECCV 202210.1007/978-3-031-20086-1_42(730-747)Online publication date: 11-Nov-2022
    • (2018)FML-kNN: scalable machine learning on Big Data using k-nearest neighbor joinsJournal of Big Data10.1186/s40537-018-0115-x5:1Online publication date: 6-Feb-2018
    • (2018)Onion Curve: A Space Filling Curve with Near-Optimal Clustering2018 IEEE 34th International Conference on Data Engineering (ICDE)10.1109/ICDE.2018.00119(1236-1239)Online publication date: Apr-2018
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM SIGMOD Record
    ACM SIGMOD Record  Volume 15, Issue 2
    June 1986
    407 pages
    ISSN:0163-5808
    DOI:10.1145/16856
    Issue’s Table of Contents
    • cover image ACM Conferences
      SIGMOD '86: Proceedings of the 1986 ACM SIGMOD international conference on Management of data
      June 1986
      407 pages
      ISBN:0897911911
      DOI:10.1145/16894
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 June 1986
    Published in SIGMOD Volume 15, Issue 2

    Check for updates

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)82
    • Downloads (Last 6 weeks)15
    Reflects downloads up to 10 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Efficient Point Cloud Analysis Using Hilbert CurveComputer Vision – ECCV 202210.1007/978-3-031-20086-1_42(730-747)Online publication date: 11-Nov-2022
    • (2018)FML-kNN: scalable machine learning on Big Data using k-nearest neighbor joinsJournal of Big Data10.1186/s40537-018-0115-x5:1Online publication date: 6-Feb-2018
    • (2018)Onion Curve: A Space Filling Curve with Near-Optimal Clustering2018 IEEE 34th International Conference on Data Engineering (ICDE)10.1109/ICDE.2018.00119(1236-1239)Online publication date: Apr-2018
    • (2018)Efficient region of visual interests search for geo-multimedia dataMultimedia Tools and Applications10.1007/s11042-018-6750-678:21(30839-30863)Online publication date: 31-Oct-2018
    • (2014)Optimality of Clustering Properties of Space-Filling CurvesACM Transactions on Database Systems10.1145/255668639:2(1-27)Online publication date: 26-May-2014
    • (2012)Reordering rows for better compressionACM Transactions on Database Systems10.1145/2338626.233863337:3(1-29)Online publication date: 6-Sep-2012
    • (2012)On the optimality of clustering properties of space filling curvesProceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems10.1145/2213556.2213587(215-224)Online publication date: 21-May-2012
    • (2012)A Lower Bound on Proximity Preservation by Space Filling CurvesProceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium10.1109/IPDPS.2012.118(1295-1305)Online publication date: 21-May-2012
    • (2012)Approximate covering detection among content-based subscriptions using space filling curvesJournal of Parallel and Distributed Computing10.1016/j.jpdc.2012.09.00272:12(1591-1602)Online publication date: 1-Dec-2012
    • (2012)Group-Scope query and its access methodProceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications10.1007/978-3-642-29253-8_49(552-559)Online publication date: 11-Apr-2012
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media