DOI: 10.1145/2187980.2188130
poster

CloudSpeller: query spelling correction by using a unified hidden markov model with web-scale resources

Published: 16 April 2012

Abstract

Query spelling correction is an important component of modern search engines: it helps users express an information need more accurately and thus improves search quality. In this work we propose and implement an end-to-end spelling correction system, CloudSpeller. CloudSpeller uses a Hidden Markov Model to model the major types of spelling errors in a unified framework that integrates a large-scale lexicon constructed from Wikipedia, an error model trained on high-confidence correction pairs, and the Microsoft Web N-gram service. The system reaches F1 scores of 0.960 on the TREC dataset and 0.937 on the MSN dataset, two search query spelling correction datasets.
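
To make the abstract's framing concrete, the following is a minimal sketch of HMM-based query correction with Viterbi decoding. It is not the authors' implementation: the hidden states are intended words drawn from a lexicon, the emission score stands in for an error model, and the transition score stands in for an n-gram language model. The tiny `UNIGRAMS` and `BIGRAMS` tables, the fixed per-edit `penalty`, and all function names are assumptions made for illustration, and the sketch only handles word-for-word corrections.

```python
import math

# Hypothetical toy lexicon with counts (the paper builds its lexicon from Wikipedia).
UNIGRAMS = {"new": 50, "york": 30, "times": 40, "now": 20, "time": 35, "yolk": 2}
# Hypothetical bigram counts, standing in for a web-scale n-gram language model.
BIGRAMS = {("new", "york"): 25, ("york", "times"): 15, ("new", "times"): 1}


def edit_distance(a, b):
    """Levenshtein distance computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def log_emission(observed, intended, penalty=3.0):
    # Crude stand-in for an error model: a fixed log-penalty per edit.
    return -penalty * edit_distance(observed, intended)


def log_start(word):
    # Add-one smoothed unigram probability for the first word.
    total = sum(UNIGRAMS.values())
    return math.log((UNIGRAMS.get(word, 0) + 1) / (total + len(UNIGRAMS)))


def log_transition(prev_word, word):
    # Add-one smoothed bigram probability P(word | prev_word).
    count = BIGRAMS.get((prev_word, word), 0) + 1
    return math.log(count / (UNIGRAMS.get(prev_word, 0) + len(UNIGRAMS)))


def candidates(observed, max_dist=2):
    # Candidate hidden states: lexicon words close to the observed token.
    # Unknown tokens with no close match are kept as they are.
    close = [w for w in UNIGRAMS if edit_distance(observed, w) <= max_dist]
    return close or [observed]


def correct(query):
    """Viterbi decoding over candidate words for each query token."""
    words = query.lower().split()
    if not words:
        return ""
    # best[w] = (log score of the best path ending in w, that path)
    best = {
        w: (log_start(w) + log_emission(words[0], w), [w])
        for w in candidates(words[0])
    }
    for obs in words[1:]:
        nxt = {}
        for w in candidates(obs):
            score, prev_path = max(
                ((s + log_transition(p, w), pp) for p, (s, pp) in best.items()),
                key=lambda t: t[0],
            )
            nxt[w] = (score + log_emission(obs, w), prev_path + [w])
        best = nxt
    return " ".join(max(best.values(), key=lambda t: t[0])[1])


print(correct("nwe yrok tmies"))  # new york times
```

In this toy setup the language-model term is what separates "york" from "yolk": both are two edits away from "yrok", but the bigram ("new", "york") is far more frequent.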

Cited By

  • (2023) Improving Query Correction Using Pre-train Language Model In Search Engines. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pp. 2999-3008. DOI: 10.1145/3583780.3614930. Online publication date: 21-Oct-2023
  • (2020) Towards the Natural Language Processing as Spelling Correction for Offline Handwritten Text Recognition Systems. Applied Sciences 10:21 (7711). DOI: 10.3390/app10217711. Online publication date: 31-Oct-2020
  • (2019) Query Error Correction Algorithm Based on Fusion Sequence to Sequence Model. Computational Collective Intelligence, pp. 13-25. DOI: 10.1007/978-3-030-28374-2_2. Online publication date: 9-Aug-2019

    Published In

    WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
    April 2012
    1250 pages
    ISBN: 9781450312301
    DOI: 10.1145/2187980

    Sponsors

    • Univ. de Lyon: Universite de Lyon

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. cloudspeller
    2. query spelling correction

    Qualifiers

    • Poster

    Conference

    WWW 2012
    Sponsor:
    • Univ. de Lyon
    WWW 2012: 21st World Wide Web Conference 2012
    April 16 - 20, 2012
    Lyon, France

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
