Abstract
We have developed a computing system that takes a LATEX source as input and speaks it. The system is interactive in that the user can browse the document to listen to the parts that most interest him. Special attention has been given to the speaking of mathematical formulas; in this realm, the system outperforms humans. The system is designed primarily for the sight-impaired, but it has a much broader potential. AFL, the audio analogue of PostScript (Adobe Systems), for paper output is smaller than PostScript and consists of a simple block-structured language in which one writes commands that cause words to be spoken and sounds to be played. AFL is used to vary output parameters such as the speed of the spoken word, the pitch of the voice, and the length of pauses. AFL also synchronizes various sound components. The presence of AFL has allowed us to experiment extensively with various ways of speaking mathematics to arrive at effective audio renderings. The design of AFL is the focus of this paper.
Similar content being viewed by others
References
Boston Children's Hospital (1986) The MultiVoice User's Manual, Boston
Brown MH (1992) Zeus: a system for algorithm animation and multiview editing. Technical report, DEC Systems Research Center, Palo Alto, Calif
Burgess DA (1992) Techniques for low cost spatial audio. Proceedings of the ACM UIST92, pp 53–59
Klatt DH (1987) Review of text-to-speech conversion for English. Acoustic Soc Am J 82:737–783
Mynatt ED, Edwards WK (1992) Mapping GUIs to auditory interfaces. Proceedings of the ACM, UIST92, pp 61–70
Raman TV (1992a) An audio view of (LA)TEX documents. Proceedings of the TEX Users Group 13:372–379
Raman TV (1992b) Documents are not just for printing. Proceedings of the 1 st Workshop on the Principles of Document Processing
Raman TV (1994) Audio system for technial readings. PhD thesis, Cornell University, Ithaca, NY, htpp://www.research. digital.com/CRL/personal/raman/raman.thml.
Raman TV, Gries D (1994a) Documents mean more than just paper! Proceedings of the 2nd International Workshop on the Principles of Document Processing
Raman TV, Gries D (1994b) Interactive audio documents. Proceedings of the 1st Annual ACM/SIGCAPH Conference on Assistive Technology
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Raman, T.V., Gries, D. Audio formatting — presenting structured information aurally. Multimedia Systems 3, 116–125 (1995). https://doi.org/10.1007/BF01542863
Issue Date:
DOI: https://doi.org/10.1007/BF01542863