Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article
Free access

Efficient Storage and Retrieval by Content and Address of Static Files

Published: 01 April 1974 Publication History

Abstract

We consider a set of static files or inventories, each consisting of the same number of entries, each entry a binary word of the same fixed length selected (with replacement) from the set of all binary sequences of that length, and the entries in each file sorted into lexical order. We also consider several retrieval questions of interest for each such file. One is to find the value of the jth entry, another to find the number of entries of value less than k.
When a binary representation of such a file is stored in computer memory and an algorithm or machine which knows only the file parameters (i.e. number of entries, number of possible values per entry) accesses some of the stored bits to answer a retrieval question, the number of bits stored and the number of bits accessed per retrieval question are two cost measures for the storage and retrieval task which have been used by Minsky and Papert. Bits stored depends on the representation chosen: bits accessed also depends on the retrieval question asked and on the algorithm used.
We give firm lower bounds to minimax measures of bits stored and bits accessed for each of four retrieval questions, and construct representations and algorithms for a bit-addressable machine which come within factors of two or three of attaining all four bounds at once for files of any size. All four factors approach one for large enough files.

References

[1]
CovslL T.M. Enumerative source encoding. IEEE Trans. IT-19 (Jan. 1973), 73-77.
[2]
ELIAs, P. On binary representations of monotone sequences. Proc. Sixth Princeton Conference on Information Sciences and Systems, March 1972, Dep. of Electrical Engineering, Princeton U., Princeton, N. J., 1972, pp. 54-57.
[3]
FANo,R.M. On the number of bits required to implement an associative memory. Memorandum 61, Computer Structures Group, Project MAC, MIT, Cambridge, Mass., n.d.
[4]
FANO, 1{. M. Transmission of Information, MIT Press, Cambridge, Mass., and Wiley, New York, 1961.
[5]
GALLAaE*~, R.G. Information Theory and Reliable Communication. Wiley, New York, 1968.
[6]
LEH~Sn, D.H. Teaching combinatorial tricks to a computer. Proceedings of Symposia in Applied Mathematics, Vol. X, Combinatorial Analysis, Amer. Math. Soc., Providence, R.I., 1960, Ch. 1, pp. 5-31.
[7]
MINSKY, M., AND PAeERT, S. Perceptrons. MIT Press, Cambridge, Mass., 1969, pp. 215-225.
[8]
SCnA~KWIJK, J. P.M. An algorithm for source coding. IEEE Trans. IT-18 (May 1972), 395-399.
[9]
WOZENCRAFT, J. M., AND REIFFEN, B. Sequential Decoding. MIT Press, Cambridge, Mass., 1961, pp. 71-73.
[10]
FLOWER, R.A. Computer updating of a data structure. Quart Progress Rep. 110, Res. Lab. of Electronics, MIT, Cambridge, Mass., July 1973.

Cited By

View all
  • (2024)Fulgor: a fast and compact k-mer index for large-scale matching and color queriesAlgorithms for Molecular Biology10.1186/s13015-024-00251-919:1Online publication date: 22-Jan-2024
  • (2024)Grafite: Taming Adversarial Queries with Optimal Range FiltersProceedings of the ACM on Management of Data10.1145/36392582:1(1-23)Online publication date: 26-Mar-2024
  • (2024)WebGraph: The Next Generation (Is in Rust)Companion Proceedings of the ACM Web Conference 202410.1145/3589335.3651581(686-689)Online publication date: 13-May-2024
  • Show More Cited By

Index Terms

  1. Efficient Storage and Retrieval by Content and Address of Static Files

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Journal of the ACM
      Journal of the ACM  Volume 21, Issue 2
      April 1974
      176 pages
      ISSN:0004-5411
      EISSN:1557-735X
      DOI:10.1145/321812
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 April 1974
      Published in JACM Volume 21, Issue 2

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)147
      • Downloads (Last 6 weeks)17
      Reflects downloads up to 04 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Fulgor: a fast and compact k-mer index for large-scale matching and color queriesAlgorithms for Molecular Biology10.1186/s13015-024-00251-919:1Online publication date: 22-Jan-2024
      • (2024)Grafite: Taming Adversarial Queries with Optimal Range FiltersProceedings of the ACM on Management of Data10.1145/36392582:1(1-23)Online publication date: 26-Mar-2024
      • (2024)WebGraph: The Next Generation (Is in Rust)Companion Proceedings of the ACM Web Conference 202410.1145/3589335.3651581(686-689)Online publication date: 13-May-2024
      • (2024)Parallel and External-Memory Construction of Minimal Perfect Hash Functions With PTHashIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.330334136:3(1249-1259)Online publication date: 1-Mar-2024
      • (2024)Partitioned Inverted Index Compression Using Hierarchical Dirichlet Process2024 4th International Conference on Neural Networks, Information and Communication (NNICE)10.1109/NNICE61279.2024.10499155(1433-1442)Online publication date: 19-Jan-2024
      • (2024)Computation over APT Compressed Data2024 Data Compression Conference (DCC)10.1109/DCC58796.2024.00023(153-162)Online publication date: 19-Mar-2024
      • (2024)BSuccinct: Rust libraries and programs focused on succinct data structuresSoftwareX10.1016/j.softx.2024.10168126(101681)Online publication date: May-2024
      • (2024)A Micro-architecture that supports the Fano–Elias encoding and a hardware accelerator for approximate membership queriesMicroprocessors and Microsystems10.1016/j.micpro.2023.104992105(104992)Online publication date: Mar-2024
      • (2024)TRGST: An enhanced generalized suffix tree for topological relations between pathsInformation Systems10.1016/j.is.2024.102406125(102406)Online publication date: Nov-2024
      • (2024)CoCo-trieInformation Systems10.1016/j.is.2023.102316120:COnline publication date: 1-Feb-2024
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Full Access

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media