DOI: 10.1145/2187980.2188147
poster

Enabling accent resilient speech based information retrieval

Published: 16 April 2012

Abstract

Voice interfaces to browsers and mobile applications are becoming popular because typing on touch screens is cumbersome. The main challenge for practical speech-based interfaces is overcoming speech recognition errors. The problem is more severe when users are non-native speakers of English, because their pronunciations differ from those the recognizer expects. In this paper, we describe a novel, intelligent speech interface design approach for information retrieval (IR) tasks that is significantly robust to accent variations. Our solution uses phonemic-similarity-based word spreading and semantic-information-based filtering to boost the accuracy of any automatic speech recognition (ASR) system. We evaluated our solution with Google Voice as the ASR for a web question-answering system developed in-house, and the results are very encouraging.
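
The abstract does not spell out how word spreading and semantic filtering are implemented, so the following is only a rough sketch of the general idea under stated assumptions. Each recognized word is expanded into phonemically similar vocabulary words, and the candidate query whose words fit together best is kept. The toy lexicon, the `RELATED` table, the 0.4 threshold, and all function names are hypothetical, and plain Levenshtein distance over phoneme sequences stands in for whatever similarity metric the authors actually used.

```python
from itertools import product

# Hypothetical toy pronunciation lexicon (ARPAbet-style phonemes, stress omitted).
LEXICON = {
    "capital": ["K", "AE", "P", "AH", "T", "AH", "L"],
    "cattle": ["K", "AE", "T", "AH", "L"],
    "france": ["F", "R", "AE", "N", "S"],
    "friends": ["F", "R", "EH", "N", "D", "Z"],
    "trance": ["T", "R", "AE", "N", "S"],
}

# Hypothetical semantic knowledge: words that plausibly occur in the same query.
RELATED = {"capital": {"france"}, "cattle": {"farm"}}


def edit_distance(a, b):
    """Levenshtein distance between two phoneme sequences."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, 1):
        cur = [i]
        for j, pb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (pa != pb)))
        prev = cur
    return prev[-1]


def phonemic_similarity(w1, w2):
    """Similarity in [0, 1]; 1.0 means identical pronunciations."""
    p1, p2 = LEXICON[w1], LEXICON[w2]
    return 1 - edit_distance(p1, p2) / max(len(p1), len(p2))


def spread(word, threshold=0.4):
    """Word spreading: return lexicon words that sound like `word`, closest first."""
    if word not in LEXICON:
        return [word]  # unknown words pass through unchanged
    scored = [(phonemic_similarity(word, c), c) for c in LEXICON]
    kept = [(s, c) for s, c in scored if s >= threshold]
    kept.sort(key=lambda sc: -sc[0])  # the word itself (similarity 1.0) comes first
    return [c for _, c in kept]


def semantic_score(words):
    """Count word pairs that the RELATED table marks as belonging together."""
    return sum(
        1
        for i, a in enumerate(words)
        for b in words[i + 1:]
        if b in RELATED.get(a, ()) or a in RELATED.get(b, ())
    )


def correct(hypothesis):
    """Semantic filtering: pick the spread candidate with the best semantic score."""
    words = hypothesis.split()
    candidates = product(*(spread(w) for w in words))
    # max() keeps the first best candidate, so ties favor the original words.
    return " ".join(max(candidates, key=semantic_score))


print(correct("capital of friends"))
```

Here the misrecognized "friends" is spread to include "france", and the pair ("capital", "france") is the only one the toy table marks as related, so that candidate is selected. A real system would need a full pronunciation dictionary or grapheme-to-phoneme model and a learned ranking function rather than a hand-written table.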

    Published In

    WWW '12 Companion: Proceedings of the 21st International Conference on World Wide Web
    April 2012
    1250 pages
    ISBN:9781450312301
    DOI:10.1145/2187980

    Sponsors

    • Univ. de Lyon: Universite de Lyon

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. phonemic
    2. qa
    3. semantic feedback
    4. speech based ir

    Qualifiers

    • Poster

    Conference

    WWW 2012
    Sponsor:
    • Univ. de Lyon
    WWW 2012: 21st World Wide Web Conference 2012
    April 16 - 20, 2012
    Lyon, France

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
