A hash code method for detecting and correcting spelling errors

Published: 01 December 1982
  • Abstract

    The most common spelling errors are one extra letter, one missing letter, one wrong letter, or the transposition of two letters. Deletion, exchange, and rotation operators are defined which detect and “mend” such spelling errors and thus permit retrieval despite the errors. These three operators essentially delete a letter of a word, exchange two adjacent letters, and rotate a word cyclically. Moreover, the operators can be used in conjunction with hashing, thus permitting very fast retrieval. Results of experiments run on large databases in Hebrew and in English are briefly indicated.
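
A minimal sketch of how deletion and exchange operators might be combined with hashing to retrieve a word despite any of the four error types named above. It is an illustration only, not the authors' algorithm: the function names (`deletions`, `exchanges`, `rotations`, `build_index`, `candidates`) and the two-table layout are assumptions. The abstract does not say how rotation enters the lookup, so `rotations` shows the bare operator and is not used in retrieval.

```python
from collections import defaultdict


def deletions(word):
    """Strings obtained by deleting exactly one letter of `word`."""
    return {word[:i] + word[i + 1:] for i in range(len(word))}


def exchanges(word):
    """Strings obtained by exchanging two adjacent letters of `word`."""
    return {
        word[:i] + word[i + 1] + word[i] + word[i + 2:]
        for i in range(len(word) - 1)
    }


def rotations(word):
    """All cyclic rotations of `word` (bare operator, unused in lookup here)."""
    return {word[k:] + word[:k] for k in range(len(word))}


def build_index(words):
    """Hash each dictionary word and each of its one-letter deletions."""
    exact = set(words)
    by_deletion = defaultdict(set)  # deletion string -> source words
    for w in exact:
        for d in deletions(w):
            by_deletion[d].add(w)
    return exact, by_deletion


def candidates(query, exact, by_deletion):
    """Dictionary words within one of the four common errors of `query`."""
    if query in exact:
        return {query}
    found = set()
    found |= deletions(query) & exact        # query has one extra letter
    found |= by_deletion.get(query, set())   # query is missing one letter
    found |= exchanges(query) & exact        # two adjacent letters transposed
    for d in deletions(query):               # one wrong letter
        for w in by_deletion.get(d, ()):
            # Same length by construction; require exactly one mismatch.
            if sum(a != b for a, b in zip(w, query)) == 1:
                found.add(w)
    return found


exact, by_deletion = build_index(["hello", "help", "world"])
print(sorted(candidates("helo", exact, by_deletion)))    # ['hello', 'help']
print(sorted(candidates("wrold", exact, by_deletion)))   # ['world']
print(sorted(candidates("helllo", exact, by_deletion)))  # ['hello']
```

Every probe is a hash lookup, so the cost of a query in this sketch grows with the length of the query word rather than with the size of the dictionary. The price is a second table holding one entry per letter of every dictionary word.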

    Published In

    Communications of the ACM, Volume 25, Issue 12
    Dec 1982
    84 pages
    ISSN:0001-0782
    EISSN:1557-7317
    DOI:10.1145/358728
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from ACM.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. deletion
    2. dictionary
    3. exchange
    4. rotation
    5. spelling
    6. spelling errors

    Qualifiers

    • Article
