Abstract
A specialised small Web browser with a voice-driven dialogue manager and a text-to-speech screen reader is presented. The Web browser was built from the GTK Web browser Dillo, which is a free software project in the terms of the GNU general public license. The new built-in screen reader is now triggered by pointing the mouse and uses the text-to-speech module for its output. A dialogue module together with a spoken-command input was also introduced into the browser. It can be used for navigation through a structure of common Web pages. The developed browser is primarily intended to be used with the new Web portal, exclusively dedicated to blind and visually impaired users. All the Web pages at the portal or at sites that are linked from this portal are expected to be arranged as common HTML/XML pages, which complies with the basic recommendations set by the Web Access Initiative.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Dobrišek, S., Gros, J., Mihelič, F., Pavešić, N. (1998). ‘Recording and Labelling of the GOPOLIS Slovenian Speech Database’. Proceedings ot the First International Conference on Language Resources and Evaluation. Granada, Spain. 2: pp. 1089–1096.
Dobrišek, S. (2001). Analysis and Recognition of Phones in Speech Signal. Ph.D. Thesis (in Slovene), University of Ljubljana.
Gros, J., Pavešić, N. and Mihelič, F. (1997). ‘Text-to-speech synthesis: A complete system for the Slovenian language’. Journal of Computing and Information Technology. CIT-5, 1:11–19.
Huang, X.D., Ariki, Y. and Jack, M.A. (1990). Hidden Markov Models for Speech Recognition. Edinburg Information Technology Series. Redwood Press Limited, London.
Ipšić, I., Mihelič, F., Dobrišek, S., Gros, J. and Pavešć, N. (1995). ‘Overview of the Spoken Queries in European Languages Project: The Slovenian Spoken Dialogue System’. Proceedings of the Scientific Conference on Artificial Intelligence in Industry. High Tatras, Slovakia. pp. 431–438.
Jelinek, F. (1998). Statistical Methods for Speech Recognition. The MIT Press. Cambridge, Massachusetts.
Moulines, E. and Charpentier F. (1990). ‘Pitch-SynchronousWaveform Processing Techniques for Text-to-Speech Synthesis Using Diphones’. Proceedings of the National Academy of Sciences of the United States of America. 92. 22:, pp. 9999–10006.
Zajicek, M., Powell, C. and Reeves, C. (1999). ‘Ergonomic factors for a speaking computer interface’. In M. A. Hanson, E. J. Lovesey and S. A. Robertson (Eds.), Contemporary Ergonomics-The proceedings of the 50th Ergonomics Society Conference, Leicester University. Taylor and Francis, London, pp. 484–488.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dobrišek, S., Gros, J., Vesnicer, B., Mihelič, F., Pavešić, N. (2002). A Voice-Driven Web Browser for Blind People. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_65
Download citation
DOI: https://doi.org/10.1007/3-540-46154-X_65
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive