
Online spelling correction for query completion

Published: 28 March 2011

Abstract

In this paper, we study the problem of online spelling correction for query completions. Misspelling is a common phenomenon among search engine queries. To help users express their information needs effectively, mechanisms for automatically correcting misspelled queries are required. Online spelling correction aims to provide spell-corrected completion suggestions as a query is incrementally entered. Because latency is crucial to the utility of the suggestions, such an algorithm must be not only accurate but also efficient.
To tackle this problem, we propose and study a generative model for input queries, based on a noisy channel transformation of the intended queries. Using spelling correction pairs, we train a Markov n-gram transformation model that captures user spelling behavior in an unsupervised fashion. To find the top spell-corrected completion suggestions in real time, we adapt the A* search algorithm with various pruning heuristics to dynamically expand the search space efficiently. Evaluation of the proposed methods demonstrates a substantial increase in the effectiveness of online spelling correction over existing techniques.
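
The general idea of combining a noisy channel with A* search over candidate completions can be illustrated with a minimal sketch. It is not the model from the paper: the channel below is a fixed, substitution-only character model (the constants `LOG_MATCH` and `LOG_SUB` are made-up assumptions) rather than a Markov n-gram transformation model trained from correction pairs, and no pruning heuristics are applied. The names `Node`, `build_trie`, and `complete` are hypothetical. Each candidate is scored as log prior of the intended query plus log channel probability of the typed prefix, and the best prior reachable below a trie node serves as an admissible bound.

```python
import heapq
import math
from itertools import count

# Assumed channel parameters (illustrative only, not estimated from data).
LOG_MATCH = math.log(0.9)       # typed character equals intended character
LOG_SUB = math.log(0.1 / 25)    # typed character is one specific wrong character


class Node:
    def __init__(self):
        self.children = {}
        self.log_prior = None    # set when a complete query ends at this node
        self.best = -math.inf    # highest log prior of any query in this subtree


def build_trie(query_counts):
    """Build a trie of queries annotated with log priors and subtree maxima."""
    total = sum(query_counts.values())
    root = Node()
    for query, c in query_counts.items():
        lp = math.log(c / total)
        node = root
        node.best = max(node.best, lp)
        for ch in query:
            node = node.children.setdefault(ch, Node())
            node.best = max(node.best, lp)
        node.log_prior = lp
    return root


def complete(root, typed, k=3):
    """Return up to k (query, log score) pairs, best first.

    A state is (trie node, number of typed characters consumed). Its priority
    is the channel score so far plus the best prior below the node, which is
    an upper bound on the score of any completion reachable from that state.
    """
    tie = count()  # unique tie-breaker so heap entries never compare Nodes
    heap = [(-root.best, next(tie), 0.0, root, 0, "", False)]
    results = []
    while heap and len(results) < k:
        neg_f, _, chan, node, i, text, is_goal = heapq.heappop(heap)
        if is_goal:
            results.append((text, -neg_f))
            continue
        if i == len(typed) and node.log_prior is not None:
            # Whole input explained and a full query ends here: exact score.
            exact = chan + node.log_prior
            heapq.heappush(heap, (-exact, next(tie), chan, node, i, text, True))
        for ch, child in node.children.items():
            if i < len(typed):
                step = LOG_MATCH if ch == typed[i] else LOG_SUB
                nxt = i + 1
            else:
                step, nxt = 0.0, i   # pure completion: no channel cost
            c2 = chan + step
            heapq.heappush(
                heap, (-(c2 + child.best), next(tie), c2, child, nxt, text + ch, False)
            )
    return results
```

With a toy query log such as `{"facebook": 50, "face paint": 5, "fast food": 20, "google": 25}`, calling `complete(build_trie(counts), "fece")` ranks the completions that need only one substitution ahead of those that need several. Because the bound never underestimates the final score, candidates are popped in exact score order; a production system would add insertions, deletions, and pruning to keep latency low.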

    Published In

    WWW '11: Proceedings of the 20th international conference on World wide web
    March 2011
    840 pages
    ISBN:9781450306324
    DOI:10.1145/1963405

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. A* search
    2. query completion
    3. spelling correction
    4. transformation model

    Qualifiers

    • Research-article

    Conference

    WWW '11
    WWW '11: 20th International World Wide Web Conference
    March 28 - April 1, 2011
    Hyderabad, India

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
