Abstract
In this paper, we describe the design and implementation of an audio wiki application accessible via both the Public Switched Telephone Network (PSTN) and the Internet. The application builds on mature World Wide Web Consortium (W3C) standards, such as VoiceXML, the Speech Synthesis Markup Language (SSML), and the Speech Recognition Grammar Specification (SRGS). Its purpose is to assist visually impaired, technologically uneducated, and underprivileged people in accessing information that was originally intended to be accessed visually via a personal computer (PC). Users may access wiki content via fixed or mobile phones, or via a PC using a Web browser or a Voice over IP (VoIP) service. This makes collaboratively created content available to an extremely large population, i.e., anyone who simply has a telephone line.
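To make the role of VoiceXML and SSML more concrete, the following is a minimal sketch of how wiki text might be rendered as a VoiceXML 2.0 document that a voice browser could read aloud over the phone. It is not the authors' implementation: the function name `article_to_vxml`, the form id, and the 500 ms pause are assumptions chosen for illustration, and speech recognition grammars (SRGS) are left out entirely.

```python
import xml.etree.ElementTree as ET

# Namespace of VoiceXML 2.0 documents.
VXML_NS = "http://www.w3.org/2001/vxml"


def article_to_vxml(title: str, paragraphs: list[str]) -> str:
    """Render a wiki article as a minimal VoiceXML document.

    Hypothetical helper for illustration only; a real system would also
    need navigation menus, grammars, and error handling.
    """
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(vxml, "form", {"id": "article"})
    block = ET.SubElement(form, "block")

    # Announce the article title first.
    title_prompt = ET.SubElement(block, "prompt")
    title_prompt.text = title

    # Read each paragraph as its own prompt, followed by a short pause.
    for text in paragraphs:
        prompt = ET.SubElement(block, "prompt")
        prompt.text = text
        # SSML <break/> element: asks the synthesizer to pause (assumed 500 ms).
        ET.SubElement(prompt, "break", {"time": "500ms"})

    # ElementTree escapes characters such as & and < in the text for us.
    return ET.tostring(vxml, encoding="unicode")


if __name__ == "__main__":
    print(article_to_vxml("Athens", ["Athens is the capital of Greece."]))
```

In a deployment along the lines the abstract describes, a document like this would be served to a VoiceXML interpreter, which hands the prompts to a text-to-speech engine; the sketch only covers the markup generation step.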
Abbreviations
- ABNF: Augmented Backus–Naur Form
- ASR: Automatic Speech Recognition
- CGI: Common Gateway Interface
- DTB: Digital Talking Book
- DTMF: Dual-Tone Multi-Frequency
- GrXML: Grammar XML
- HTML: Hypertext Markup Language
- IT: Information Technology
- JSGF: Java Speech Grammar Format
- JSML: Java Speech API Markup Language
- PC: Personal Computer
- PSTN: Public Switched Telephone Network
- SIP: Session Initiation Protocol
- SMS: Short Message Service
- SRGS: Speech Recognition Grammar Specification
- SSML: Speech Synthesis Markup Language
- TTS: Text to Speech
- URI: Uniform Resource Identifier
- VoiceXML: Voice eXtensible Markup Language
- VoIP: Voice over IP
- WML: Wireless Markup Language
- W3C: World Wide Web Consortium
- XHTML: Extensible Hypertext Markup Language
- XML: eXtensible Markup Language
Cite this article
Kolias, C., Kolias, V., Anagnostopoulos, I. et al. Design and implementation of a VoiceXML-driven wiki application for assistive environments on the web. Pers Ubiquit Comput 14, 527–539 (2010). https://doi.org/10.1007/s00779-009-0274-z