A Speech Interface to the PENG\(^{ASP}\) System

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9767)

Abstract

The increased presence and accessibility of online speech recognition services have encouraged an investigation of how effectively such a service lets users speak a textual specification in controlled natural language instead of typing it. Google’s Web Speech API provides an accessible and portable speech recognition service that integrates well with web-based interfaces. Using Google’s Web Speech API, we present the design and implementation of a speech-based interface for the PENG\(^{ASP}\) system. We do this in order to examine the usefulness of speech-based input for controlled natural language processing and to explore potential synergies between speech-based and text-based input.
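
As a rough illustration of how a web-based front end might obtain a spoken sentence through the Web Speech API, consider the TypeScript sketch below. It is not the authors' implementation: the helper names `bestTranscript` and `startDictation`, the `en-US` language setting, and the choice of three alternatives are assumptions made here for illustration. Only the `SpeechRecognition` members used (`lang`, `interimResults`, `maxAlternatives`, `onresult`, `start`) come from the Web Speech API itself.

```typescript
// One recognition hypothesis, mirroring the `transcript` and `confidence`
// fields of SpeechRecognitionAlternative in the Web Speech API.
interface Alternative {
  transcript: string;
  confidence: number;
}

// Hypothetical helper: choose the most confident hypothesis and collapse
// stray whitespace so the text can be passed on to a text-based editor.
function bestTranscript(alternatives: Alternative[]): string {
  if (alternatives.length === 0) return "";
  const best = alternatives.reduce((a, b) => (b.confidence > a.confidence ? b : a));
  return best.transcript.trim().replace(/\s+/g, " ");
}

// Hypothetical wrapper: start recognition if the environment exposes the
// API and report each recognised utterance through `onSentence`.
// Returns false when no recogniser is available (e.g. outside a browser).
function startDictation(onSentence: (text: string) => void): boolean {
  const g = globalThis as any;
  // Chrome exposes the constructor under a vendor prefix.
  const Recognition = g.SpeechRecognition ?? g.webkitSpeechRecognition;
  if (!Recognition) return false;

  const recognition = new Recognition();
  recognition.lang = "en-US";          // assumed input language
  recognition.interimResults = false;  // deliver final results only
  recognition.maxAlternatives = 3;     // assumed number of hypotheses

  recognition.onresult = (event: any) => {
    const result = event.results[event.resultIndex];
    const alternatives: Alternative[] = [];
    for (let i = 0; i < result.length; i++) {
      alternatives.push({
        transcript: result[i].transcript,
        confidence: result[i].confidence,
      });
    }
    onSentence(bestTranscript(alternatives));
  };

  recognition.start();
  return true;
}
```

In a browser, a call such as `startDictation(text => console.log(text))` would log each recognised utterance; handing that text to the same input path as typed text is one way speech-based and text-based input could be combined.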

Notes

  1. http://developer.att.com/apis/speech.
  2. https://www.google.com/intl/en/chrome/demos/speech.html.
  3. https://speech-to-text-demo.mybluemix.net/.
  4. http://www.swi-prolog.org/.
  5. http://www.json.org/.
  6. http://www.ecma-international.org/publications/standards/Ecma-262.htm.
  7. http://api.jquery.com/.
  8. https://www.w3.org/TR/html5/.
  9. https://cloud.google.com/speech/.

Author information

Corresponding author

Correspondence to Rolf Schwitter.

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Nalbandian, C., Schwitter, R. (2016). A Speech Interface to the PENG\(^{ASP}\) System. In: Davis, B., Pace, G., Wyner, A. (eds) Controlled Natural Language. CNL 2016. Lecture Notes in Computer Science, vol 9767. Springer, Cham. https://doi.org/10.1007/978-3-319-41498-0_5

  • DOI: https://doi.org/10.1007/978-3-319-41498-0_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-41497-3

  • Online ISBN: 978-3-319-41498-0

  • eBook Packages: Computer Science, Computer Science (R0)
