A Voice-Driven Web Browser for Blind People

Dobrišek, Simon; Gros, Jerneja; Vesnicer, Boštjan; Mihelič, France; Pavešić, Nikola

doi:10.1007/3-540-46154-X_65

Simon Dobrišek³,
Jerneja Gros³,
Boštjan Vesnicer³,
France Mihelič³ &
…
Nikola Pavešić³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

584 Accesses

Abstract

A specialised small Web browser with a voice-driven dialogue manager and a text-to-speech screen reader is presented. The Web browser was built from the GTK Web browser Dillo, which is a free software project in the terms of the GNU general public license. The new built-in screen reader is now triggered by pointing the mouse and uses the text-to-speech module for its output. A dialogue module together with a spoken-command input was also introduced into the browser. It can be used for navigation through a structure of common Web pages. The developed browser is primarily intended to be used with the new Web portal, exclusively dedicated to blind and visually impaired users. All the Web pages at the portal or at sites that are linked from this portal are expected to be arranged as common HTML/XML pages, which complies with the basic recommendations set by the Web Access Initiative.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Dobrišek, S., Gros, J., Mihelič, F., Pavešić, N. (1998). ‘Recording and Labelling of the GOPOLIS Slovenian Speech Database’. Proceedings ot the First International Conference on Language Resources and Evaluation. Granada, Spain. 2: pp. 1089–1096.
Google Scholar
Dobrišek, S. (2001). Analysis and Recognition of Phones in Speech Signal. Ph.D. Thesis (in Slovene), University of Ljubljana.
Google Scholar
Gros, J., Pavešić, N. and Mihelič, F. (1997). ‘Text-to-speech synthesis: A complete system for the Slovenian language’. Journal of Computing and Information Technology. CIT-5, 1:11–19.
Google Scholar
Huang, X.D., Ariki, Y. and Jack, M.A. (1990). Hidden Markov Models for Speech Recognition. Edinburg Information Technology Series. Redwood Press Limited, London.
Google Scholar
Ipšić, I., Mihelič, F., Dobrišek, S., Gros, J. and Pavešć, N. (1995). ‘Overview of the Spoken Queries in European Languages Project: The Slovenian Spoken Dialogue System’. Proceedings of the Scientific Conference on Artificial Intelligence in Industry. High Tatras, Slovakia. pp. 431–438.
Google Scholar
Jelinek, F. (1998). Statistical Methods for Speech Recognition. The MIT Press. Cambridge, Massachusetts.
Google Scholar
Moulines, E. and Charpentier F. (1990). ‘Pitch-SynchronousWaveform Processing Techniques for Text-to-Speech Synthesis Using Diphones’. Proceedings of the National Academy of Sciences of the United States of America. 92. 22:, pp. 9999–10006.
Google Scholar
Zajicek, M., Powell, C. and Reeves, C. (1999). ‘Ergonomic factors for a speaking computer interface’. In M. A. Hanson, E. J. Lovesey and S. A. Robertson (Eds.), Contemporary Ergonomics-The proceedings of the 50th Ergonomics Society Conference, Leicester University. Taylor and Francis, London, pp. 484–488.
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory of Artificial Perception, Systems and Cybernetics, University of Ljubljana, Faculty of Electrical Engineering, Ljubljana, Slovenia
Simon Dobrišek, Jerneja Gros, Boštjan Vesnicer, France Mihelič & Nikola Pavešić

Authors

Simon Dobrišek
View author publications
You can also search for this author in PubMed Google Scholar
Jerneja Gros
View author publications
You can also search for this author in PubMed Google Scholar
Boštjan Vesnicer
View author publications
You can also search for this author in PubMed Google Scholar
France Mihelič
View author publications
You can also search for this author in PubMed Google Scholar
Nikola Pavešić
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics Department of Programming Systems and Communication, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Petr Sojka
Faculty of Informatics Department of Information Technologies, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Ivan Kopeček & Karel Pala &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dobrišek, S., Gros, J., Vesnicer, B., Mihelič, F., Pavešić, N. (2002). A Voice-Driven Web Browser for Blind People. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_65

Download citation

DOI: https://doi.org/10.1007/3-540-46154-X_65
Published: 23 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics