research-article

SpeakQL: Towards Speech-driven Multi-modal Querying

Authors:
Dharmil Chandarana

University of California, San Diego

University of California, San Diego
View Profile

,
Vraj Shah

University of California, San Diego

University of California, San Diego
View Profile

,
Arun Kumar

University of California, San Diego

University of California, San Diego
View Profile

,
Lawrence Saul

University of California, San Diego

University of California, San Diego
View Profile

HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data AnalyticsMay 2017Article No.: 11Pages 1–6https://doi.org/10.1145/3077257.3077264

Published:14 May 2017Publication History

HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics

Pages 1–6

ABSTRACT

Natural language and touch-based interfaces are making data querying significantly easier. But typed SQL remains the gold standard for query sophistication although it is painful in many querying environments. Recent advancements in automatic speech recognition raise the tantalizing possibility of bridging this gap by enabling spoken SQL queries. In this work, we outline our vision of one such new query interface and system for regular SQL that is primarily speech-driven. We propose an end-to-end architecture for making spoken SQL querying effective and efficient and present initial empirical results to understand the feasibility of such an approach. We identify several open research questions and propose alternative solutions that we plan to explore.

References

Google Cloud Speech API. cloud.google.com/speech.Google Scholar
Nuance MagicSpeech. australia.nuance.com/products/speechmagic/index.htm.Google Scholar
Oracle SQL Developer. oracle.com/technetwork/issue-archive/2008/08-mar/o28sql-100636.html.Google Scholar
D. Amodei et al. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. In ICML, 2016. Google ScholarDigital Library
C. Chelba and F. Jelinek. Exploiting Syntactic Structure for Language Modeling. In ACL, 2008.Google Scholar
A. Crotty et al. Vizdom: Interactive Analytics through Pen and Touch. In VLDB Demo, 2014. Google ScholarDigital Library
G. Hinton et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition. Signal Processing Magazine, 2012.Google Scholar
S. Lajoie et al. Application of Spoken and Natural Language Technologies to Lotus Notes Based Messaging and Communication, 2002. dtic.mil/dtic/tr/fulltext/u2/a402014.pdf.Google Scholar
F. Li et al. Constructing an Interactive Natural Language Interface for Relational Databases. In VLDB, 2015. Google ScholarDigital Library
G. Lyons et al. Making the Case for Query-by-Voice with EchoQuery. In SIGMOD Demo, 2016. Google ScholarDigital Library
T. Matsuzaki et al. Probabilistic CFG with Latent Annotations. In ACL, 2005. Google ScholarDigital Library
A. Nandi et al. Gestural Query Specification. In VLDB, 2014. Google ScholarDigital Library
L. Rabiner and B.-H. Juang. Fundamentals of Speech Recognition. Prentice-Hall, Inc., 1993. Google ScholarDigital Library
S. Ruan et al. Speech Is 3x Faster than Typing for English and Mandarin Text Entry on Mobile Devices. CoRR, abs/1608.07323.Google Scholar
M. M. Zloof. Query by Example. In National Computer Conference and Exposition, 1975. Google ScholarDigital Library

Recommendations

SpeakQL: Towards Speech-driven Multimodal Querying of Structured Data
SIGMOD '20: Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data

Speech-driven querying is becoming popular in new device environments such as smartphones, tablets, and even conversational assistants. However, such querying is largely restricted to natural language. Typed SQL remains the gold standard for ...
Read More
SpeakQL: Towards Speech-driven Multimodal Querying
SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data

Speech-based inputs have become popular in many applications on constrained device environments such as smartphones and tablets, and even personal conversational assistants such as Siri, Alexa, and Cortana. Inspired by this recent success of speech-...
Read More
Demonstration of SpeakQL: Speech-driven Multimodal Querying of Structured Data
SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data

In this demonstration, we present SpeakQL, a speech-driven query system and interface for structured data. SpeakQL supports a tractable and practically useful subset of regular SQL, allowing users to query in any domain with unbounded vocabulary with ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics
May 2017
89 pages
ISBN:9781450350297
DOI:10.1145/3077257

Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 May 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate28of56submissions,50%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 144
  Total Downloads
- Downloads (Last 12 months)8
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

SpeakQL: Towards Speech-driven Multi-modal Querying

HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics

ABSTRACT

References

Cited By

Recommendations

SpeakQL: Towards Speech-driven Multimodal Querying of Structured Data

SpeakQL: Towards Speech-driven Multimodal Querying

Demonstration of SpeakQL: Speech-driven Multimodal Querying of Structured Data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

SpeakQL: Towards Speech-driven Multi-modal Querying

HILDA '17: Proceedings of the 2nd Workshop on Human-In-the-Loop Data Analytics

ABSTRACT

References

Cited By

Recommendations

SpeakQL: Towards Speech-driven Multimodal Querying of Structured Data

SpeakQL: Towards Speech-driven Multimodal Querying

Demonstration of SpeakQL: Speech-driven Multimodal Querying of Structured Data

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media