skip to main content
10.1145/3077257.3077264acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

SpeakQL: Towards Speech-driven Multi-modal Querying

Published:14 May 2017Publication History

ABSTRACT

Natural language and touch-based interfaces are making data querying significantly easier. But typed SQL remains the gold standard for query sophistication although it is painful in many querying environments. Recent advancements in automatic speech recognition raise the tantalizing possibility of bridging this gap by enabling spoken SQL queries. In this work, we outline our vision of one such new query interface and system for regular SQL that is primarily speech-driven. We propose an end-to-end architecture for making spoken SQL querying effective and efficient and present initial empirical results to understand the feasibility of such an approach. We identify several open research questions and propose alternative solutions that we plan to explore.

References

  1. Google Cloud Speech API. cloud.google.com/speech.Google ScholarGoogle Scholar
  2. Nuance MagicSpeech. australia.nuance.com/products/speechmagic/index.htm.Google ScholarGoogle Scholar
  3. Oracle SQL Developer. oracle.com/technetwork/issue-archive/2008/08-mar/o28sql-100636.html.Google ScholarGoogle Scholar
  4. D. Amodei et al. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. In ICML, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. C. Chelba and F. Jelinek. Exploiting Syntactic Structure for Language Modeling. In ACL, 2008.Google ScholarGoogle Scholar
  6. A. Crotty et al. Vizdom: Interactive Analytics through Pen and Touch. In VLDB Demo, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. Hinton et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition. Signal Processing Magazine, 2012.Google ScholarGoogle Scholar
  8. S. Lajoie et al. Application of Spoken and Natural Language Technologies to Lotus Notes Based Messaging and Communication, 2002. dtic.mil/dtic/tr/fulltext/u2/a402014.pdf.Google ScholarGoogle Scholar
  9. F. Li et al. Constructing an Interactive Natural Language Interface for Relational Databases. In VLDB, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. Lyons et al. Making the Case for Query-by-Voice with EchoQuery. In SIGMOD Demo, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Matsuzaki et al. Probabilistic CFG with Latent Annotations. In ACL, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Nandi et al. Gestural Query Specification. In VLDB, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Rabiner and B.-H. Juang. Fundamentals of Speech Recognition. Prentice-Hall, Inc., 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Ruan et al. Speech Is 3x Faster than Typing for English and Mandarin Text Entry on Mobile Devices. CoRR, abs/1608.07323.Google ScholarGoogle Scholar
  15. M. M. Zloof. Query by Example. In National Computer Conference and Exposition, 1975. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics
    May 2017
    89 pages
    ISBN:9781450350297
    DOI:10.1145/3077257

    Copyright © 2017 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 14 May 2017

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate28of56submissions,50%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader