Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

A new hashing function: statistical behaviour and algorithm

Published: 21 March 1993 Publication History

Abstract

Existing hashing functions have various limitations. In this paper a new hashing function is proposed, which divides the range of the key-values into some equal segments, and maps the key-values in each segment linearly into the whole range of the address. The paper analyzes the statistical behavior of the function, and points out that, theoretically, by increasing the number of segments, the distribution of the resulting hash values can always approach uniform, if the key-values can be regarded as continuous. Two methods for obtaining the number of segments, the deterministic and the probabilistic, along with the algorithm, are also proposed.

References

[1]
R. J. Cichelli, Minimal perfect hash functions made simple, Communications of the ACM, Vol. 23, No. 1, pp 17--19, January 1980
[2]
E. A. Fox, et al, Practical minimal perfect hush functions for large databases, Communications of ACM, Vol. 35, No. 1, pp 105--121, January 1992
[3]
E. Hill, Jr., A comparative study on very large databases, Springer-Verlag, 1978
[4]
G. D. Knott, Hashing functions, The Computer Journal, Vol. 18, No. 1, pp 265--278, 1975
[5]
R. J. Larsen and M. L. Marx, An introduction to mathematical statistics and its applications, Prentice Hall, England Cliffs, New Jersey, 1986
[6]
P. K. Pearson, Fast hashing of variable-length text strings, Communications of ACM, Vol. 33, No. 6, pp 679--680, June 1990
[7]
Z. Y. Tian and S. B. Tong, An easy-to-compute general purpose hashing function, Advances in Modelling Analysis, B, Vol. 25, No. 1, pp 11--18, 1993
[8]
H. L. Wu, Database principle and design (in Chinese), Chinese National Defence Industry Press, 1987

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM SIGIR Forum
ACM SIGIR Forum  Volume 27, Issue 1
Spring 1993
45 pages
ISSN:0163-5840
DOI:10.1145/174263
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 March 1993
Published in SIGIR Volume 27, Issue 1

Check for updates

Author Tags

  1. data structure
  2. direct access file
  3. hash value
  4. hashing function
  5. key-value

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 209
    Total Downloads
  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)7
Reflects downloads up to 18 Aug 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media