Audio formatting — presenting structured information aurally

Multimedia Systems

Abstract

We have developed a computing system that takes LaTeX source as input and speaks it. The system is interactive: the user can browse the document and listen to the parts of greatest interest. Special attention has been given to speaking mathematical formulas; in this realm, the system outperforms humans. The system is designed primarily for the sight-impaired, but it has much broader potential. AFL, the audio analogue of PostScript (Adobe Systems) for paper output, is smaller than PostScript and consists of a simple block-structured language in which one writes commands that cause words to be spoken and sounds to be played. AFL is used to vary output parameters such as the speed of the spoken word, the pitch of the voice, and the length of pauses. AFL also synchronizes the various sound components. Having AFL has allowed us to experiment extensively with different ways of speaking mathematics and to arrive at effective audio renderings. The design of AFL is the focus of this paper.
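The idea of a block-structured language for audio output can be illustrated with a minimal Python sketch. It is not AFL syntax and not the authors' implementation: the class names, parameter names, units, and the pitch-raising rule for superscripts are all assumptions chosen for illustration. The sketch shows only the scoping behaviour the abstract describes, where parameters such as speed, pitch, and pause length are changed inside a block and restored when the block ends.

```python
from contextlib import contextmanager
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SpeechState:
    """Hypothetical output parameters; names and units are assumptions."""
    rate_wpm: int = 180      # speaking rate, words per minute
    pitch_hz: float = 120.0  # average voice pitch
    pause_ms: int = 0        # pause inserted before each spoken item


class AudioFormatter:
    """Records (text, state) events instead of driving a real synthesizer."""

    def __init__(self):
        self._stack = [SpeechState()]
        self.events = []

    @property
    def state(self):
        # The innermost open block determines the current parameters.
        return self._stack[-1]

    @contextmanager
    def block(self, **changes):
        """Open a block; its parameter changes end when the block closes."""
        self._stack.append(replace(self.state, **changes))
        try:
            yield self
        finally:
            self._stack.pop()

    def speak(self, text):
        self.events.append((text, self.state))


def render_superscript(fmt, base, exponent):
    """Illustrative rule (an assumption): speak exponents at a higher pitch."""
    fmt.speak(base)
    with fmt.block(pitch_hz=fmt.state.pitch_hz * 1.25, pause_ms=50):
        fmt.speak(exponent)


fmt = AudioFormatter()
render_superscript(fmt, "x", "2")
fmt.speak("plus one")  # spoken with the original parameters again
```

Because each block pushes a modified copy of the current state and pops it on exit, a rendering rule can change the voice locally without having to undo its changes by hand.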

Author information

Correspondence to T. V. Raman.

About this article

Cite this article

Raman, T.V., Gries, D. Audio formatting — presenting structured information aurally. Multimedia Systems 3, 116–125 (1995). https://doi.org/10.1007/BF01542863

  • DOI: https://doi.org/10.1007/BF01542863
