Article (DOI: 10.5555/646515.695985)

Efficient Minimal Perfect Hashing in Nearly Minimal Space

Published: 15 February 2001

Abstract

We consider the following problem: Given a subset S of size n of a universe {0, ..., u - 1}, construct a minimal perfect hash function for S, i.e., a bijection h from S to {0, ..., n - 1}. The parameters of interest are the space needed to store h, its evaluation time, and the time required to compute h from S. The number of bits needed for the representation of h, ignoring the other parameters, has been thoroughly studied and is known to be n log e + log log u ± O(log n), where "log" denotes the binary logarithm. A construction by Schmidt and Siegel uses O(n + log log u) bits and offers constant evaluation time, but the time to find h is not discussed. We present a simple randomized scheme that uses n log e + log log u + o(n + log log u) bits and has constant evaluation time and O(n + log log u) expected construction time.
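To make the definitions in the abstract concrete, the following minimal Python sketch checks the bijection property on a toy key set and evaluates the leading terms n log e + log log u of the space bound. It is not the randomized scheme of the paper: the seed search is a naive brute force that is only practical for very small sets, and the names `mphf_space_bits`, `seeded_hash`, `is_minimal_perfect` and `find_seed`, as well as the choice of BLAKE2b as the underlying hash, are assumptions made purely for illustration.

```python
import hashlib
import math


def mphf_space_bits(n: int, u: int) -> float:
    """Leading terms n*log2(e) + log2(log2(u)) of the space bound in the abstract.

    The lower-order O(log n) term is ignored.
    """
    return n * math.log2(math.e) + math.log2(math.log2(u))


def seeded_hash(seed: int, key: int, n: int) -> int:
    """Hypothetical helper: a seeded hash into {0, ..., n-1} built on BLAKE2b."""
    digest = hashlib.blake2b(f"{seed}:{key}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big") % n


def is_minimal_perfect(h, keys) -> bool:
    """True if h maps the (distinct) keys bijectively onto {0, ..., n-1}."""
    return sorted(h(x) for x in keys) == list(range(len(keys)))


def find_seed(keys, max_seed: int = 100_000) -> int:
    """Brute-force search for a seed that yields a bijection (toy sizes only)."""
    n = len(keys)
    for seed in range(max_seed):
        if is_minimal_perfect(lambda x: seeded_hash(seed, x, n), keys):
            return seed
    raise RuntimeError("no seed found within max_seed tries")


if __name__ == "__main__":
    keys = [3, 17, 42, 99, 1000]  # a subset S of the universe {0, ..., u-1}
    seed = find_seed(keys)
    print(sorted(seeded_hash(seed, x, len(keys)) for x in keys))  # [0, 1, 2, 3, 4]
    # For n = 10^6 keys from a 64-bit universe: about 1.44 bits per key.
    print(mphf_space_bits(10**6, 2**64) / 10**6)
```

A uniformly random function on n keys is a bijection with probability n!/n^n, so the expected number of seeds tried grows roughly like e^n. That is why this brute force serves only as an illustration of the definition, while the abstract's contribution is a scheme with O(n + log log u) expected construction time.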

References

[1] J. L. Carter and M. N. Wegman, Universal classes of hash functions, J. Comput. System Sci. 18 (1979), pp. 143-154.
[2] M. Dietzfelbinger and F. Meyer auf der Heide, A new universal class of hash functions and dynamic hashing in real time, Proc. 17th International Colloquium on Automata, Languages and Programming (ICALP 1990), Lecture Notes in Computer Science, Vol. 443, Springer-Verlag, Berlin, pp. 6-19.
[3] M. L. Fredman and J. Komlós, On the size of separating systems and families of perfect hash functions, SIAM J. Alg. Disc. Meth. 5 (1984), pp. 61-68.
[4] M. L. Fredman, J. Komlós and E. Szemerédi, Storing a sparse table with O(1) worst case access time, J. ACM 31 (1984), pp. 538-544.
[5] T. Hagerup, Sorting and searching on the word RAM, Proc. 15th Annual Symposium on Theoretical Aspects of Computer Science (STACS 1998), Lecture Notes in Computer Science, Vol. 1373, Springer-Verlag, Berlin, pp. 366-398.
[6] M. Hofri, Probabilistic Analysis of Algorithms, Springer-Verlag, New York, 1987.
[7] C. P. Kruskal, L. Rudolph and M. Snir, A complexity theory of efficient parallel algorithms, Theoret. Comput. Sci. 71 (1990), pp. 95-132.
[8] K. Mehlhorn, Data Structures and Algorithms, Vol. 1: Sorting and Searching, Springer-Verlag, Berlin, 1984.
[9] M. O. Rabin, Probabilistic algorithm for testing primality, J. Number Theory 12 (1980), pp. 128-138.
[10] J. Radhakrishnan, Improved bounds for covering complete uniform hypergraphs, Inform. Process. Lett. 41 (1992), pp. 203-207.
[11] J. P. Schmidt and A. Siegel, The spatial complexity of oblivious k-probe hash functions, SIAM J. Comput. 19 (1990), pp. 775-786.
[12] A. Siegel, On universal classes of fast high performance hash functions, their time-space tradeoff, and their applications, Proc. 30th Annual IEEE Symposium on Foundations of Computer Science (FOCS 1989), pp. 20-25.


Published In

STACS '01: Proceedings of the 18th Annual Symposium on Theoretical Aspects of Computer Science
February 2001
574 pages
ISBN: 3540416951

Publisher

Springer-Verlag, Berlin, Heidelberg

Cited By

• (2020) Fast hashing with strong concentration bounds. Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pp. 1265-1278. DOI: 10.1145/3357713.3384259. Online publication date: 22-Jun-2020
• (2017) Lens depth function and k-relative neighborhood graph: versatile tools for ordinal data analysis. The Journal of Machine Learning Research 18:1, pp. 1889-1940. DOI: 10.5555/3122009.3153014. Online publication date: 1-Jan-2017
• (2016) Compressed String Dictionary Search with Edit Distance One. Algorithmica 74:3, pp. 1099-1122. DOI: 10.1007/s00453-015-9990-0. Online publication date: 1-Mar-2016
• (2016) Space-Efficient Substring Occurrence Estimation. Algorithmica 74:1, pp. 65-90. DOI: 10.1007/s00453-014-9936-y. Online publication date: 1-Jan-2016
• (2014) Retrieval and Perfect Hashing Using Fingerprinting. Proceedings of the 13th International Symposium on Experimental Algorithms - Volume 8504, pp. 138-149. DOI: 10.1007/978-3-319-07959-2_12. Online publication date: 29-Jun-2014
• (2013) Compressed static functions with applications. Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms, pp. 229-240. DOI: 10.5555/2627817.2627834. Online publication date: 6-Jan-2013
• (2013) Memory efficient sanitization of a deduplicated storage system. Proceedings of the 11th USENIX conference on File and Storage Technologies, pp. 81-94. DOI: 10.5555/2591272.2591282. Online publication date: 12-Feb-2013
• (2012) Worst-case efficient single and multiple string matching on packed texts in the word-RAM model. Journal of Discrete Algorithms 14, pp. 91-106. DOI: 10.1016/j.jda.2011.12.011. Online publication date: 1-Jul-2012
• (2011) Distribution-aware compressed full-text indexes. Proceedings of the 19th European conference on Algorithms, pp. 760-771. DOI: 10.5555/2040572.2040655. Online publication date: 5-Sep-2011
• (2011) The context of coordinating groups in dynamic mobile networks. Proceedings of the 13th international conference on Coordination models and languages, pp. 49-64. DOI: 10.5555/2022052.2022056. Online publication date: 6-Jun-2011
