Skip to main content
Log in

Design and implementation of a VoiceXML-driven wiki application for assistive environments on the web

  • Original Article
  • Published:
Personal and Ubiquitous Computing Aims and scope Submit manuscript

Abstract

In this paper, we describe the design and implementation of an audio wiki application accessible via both the Public Switched Telephone Network and the Internet. The application exploits mature World Wide Web Consortium standards, such as VoiceXML, Speech Synthesis Markup Language, and Speech Recognition Grammar Specification toward achieving our goals. The purpose of such an application is to assist visually impaired, technologically uneducated, and underprivileged people in accessing information originally intended to be accessed visually via a personal computer (PC). Users may access wiki content via fixed or mobile phones, or via a PC using a Web Browser or a Voice over IP service. This feature promotes pervasiveness to collaboratively created content to an extremely large population, i.e., those who simply own a telephone line.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Abbreviations

ABNF:

Augmented Backus–Naur Form

ASR:

Automatic Speech Recognition

CGI:

Common Gateway Interface

DTB:

Digital Talking Book

DTMF:

Dual-Tone Multi-Frequency

GrXML:

Grammar XML

HTML:

Hypertext Markup Language

IT:

Information Technology

JSGF:

Java Speech Grammar Format

JSML:

Java Speech API Markup Language

PC:

Personal Computer

PSTN:

Public Switched Telephone Network

SIP:

Session Initiation Protocol

SMS:

Short Message Service

SRGS:

Speech Recognition Grammar Specification

SSML:

Speech Synthesis Markup Language

TTS:

Text to Speech

URI:

Uniform Resource Identifier

VoiceXML:

Voice eXtensible Markup Language

VoIP:

Voice over IP

WML:

Wireless Markup Language

W3C:

World Wide Web Consortium

XHTML:

Extensible Hyper Text Markup Language

XML:

eXtensible Markup Language

References

  1. http://www.wikipedia.org

  2. Kolias C, Demertzis S, Kambourakis G (2008) Design and implementation of a secure mobile wiki system. In: Uskov V (ed) 7th IASTED international conference on web-based education (WBE 2008). March 2008, Innsbruck, Austria, pp 212–217

  3. WML 2.0 Specification. http://www1.wapforum.org/tech/terms.asp?doc=WAP-238-WML-20010911-a.pdf

  4. i-mode Compatible 7.1 Specification. http://www.nttdocomo.co.jp/english/service/imode/make/content/html/version/index.html#p071

  5. Voice Extensible Markup Language 2.0 Specification. http://www.w3.org/TR/voicexml20/. Retrieved on 30 Dec 2008

  6. Leinonen T, Aucamp FN, Sari ER (2006) Audio Wiki for mobile communities: information system for the rest of Us. In: Workshop on speech in mobile and pervasive environments, Mobile HCI 06 conference, 12 Sept 2006, pp 3

  7. MediaWiki. http://www.mediawiki.org/wiki/MediaWiki. Retrieved on 5 May 2008

  8. Werner S, Wolff M, Eichner M, Hoffmann R (2004) Integrating speech enabled services in a web-based e-learning environment. In: Proceedings of international conference on information technology: coding and computing, vol 2. ITCC 2004, 5–7 April 2004, pp 303–307

  9. Wang L, Roe P, Pham B, Tjondronegoro D (2008) An audio wiki supporting mobile collaboration. In: Proceedings of the 2008 ACM symposium on applied computing (Fortaleza, Ceara, Brazil, 16–20 March 2008). SAC ‘08. ACM, New York, NY, pp 1889–1896

  10. Borodin Y, Mahmud J, Ramakrishman IV, Stent A (2007) The hearsay non-visual web browser. In: ACM international conference proceeding series, proceedings of the 2007 international cross-disciplinary conference on web accessibility (W4A), vol 225. Banff, Canada, pp 128–129

  11. The CMU Sphinx Group Open Source Speech Recognition Engines. http://cmusphinx.sourceforge.net/html/cmusphinx.php. Retrieved on 30 Dec 2008

  12. The DAISY Consortium. http://www.daisy.org/. Retrieved on 30 Dec 2008

  13. Speech Recognition Grammar 1.0 Specification. http://www.w3.org/TR/speech-grammar/. Retrieved on 30 Dec 2008

  14. Speech Synthesis Markup Language. http://www.w3.org/TR/speech-synthesis/. Retrieved on 30 Dec 2008

  15. JSpeech Grammar Format. http://www.w3.org/TR/2000/NOTE-jsgf-20000605/. Retrieved on 30 Dec 2008

  16. JSpeech Markup Language. http://www.w3.org/TR/jsml/. Retrieved on 30 Dec 2008

  17. Session Initiation Protocol. http://www.cs.columbia.edu/sip/drafts.html. Retrieved on 30 Dec 2008

  18. MS SQL Server 2005. http://www.microsoft.com/Sqlserver/2005/en/us/express.aspx. Retrieved on 30 Dec 2008

  19. Vocalocity’s openVXI 3.0. http://www.speech.cs.cmu.edu/openvxi/. Retrieved on 30 Dec 2008

  20. The Festival Speech Synthesis System. http://www.cstr.ed.ac.uk/projects/festival/. Retrieved on 30 Dec 2008

  21. Ding L (2009) Learn about VoIP quality measurements. http://www.embeddeddesignindia.co.in/STATIC/PDF/200903. EE Times-India, white paper Retrieved on 4 May

  22. ITU-T Rec G.107 (2005) The E-Model, a computational model for use in transmission planning. March 2005

  23. Spirent Communications (2007) Measuring jitter accurately. http://www.spirent.com/documents/4814.pdf, white paper Retrieved on 4 May 2009

  24. WikyBlog. http://www.wikyblog.com/. Retrieved on 20 July 2009

  25. Twitter. http://twitter.com/. Retrieved on 20 July 2009

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Constantinos Kolias.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kolias, C., Kolias, V., Anagnostopoulos, I. et al. Design and implementation of a VoiceXML-driven wiki application for assistive environments on the web. Pers Ubiquit Comput 14, 527–539 (2010). https://doi.org/10.1007/s00779-009-0274-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00779-009-0274-z

Keywords

Navigation