Visual Attention in Auditory Display

Mahler, Thorsten; Bayerl, Pierre; Neumann, Heiko; Weber, Michael

doi:10.1007/11768029_7

Visual Attention in Auditory Display

Thorsten Mahler²³,
Pierre Bayerl²⁴,
Heiko Neumann²⁴ &
…
Michael Weber²³

Conference paper

736 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4021))

Abstract

The interdisciplinary field of image sonification aims at the transformation of images to auditory signals. It brings together researchers from different fields of computer science like sound synthesizing, data mining and human computer interaction. Its goal is the use of sound and all its attributes to display the data sets itself and thus making the highly developed human aural system usable for data analysis. Unlike previous approaches we aim to sonify images of any kind. We propose that models of visual attention and visual grouping can be utilized to dynamically select relevant visual information to be sonified. For the auditory synthesis we employ an approach, which takes advantage of the sparseness of the selected input data. The presented approach proposes a combination of data sonification approaches, such as auditory scene generation, and models of human visual perception. It extends previous pixel-based transformation algorithms by incorporating mid-level vision coding and high-level control. The mapping utilizes elaborated sound parameters that allow non-trivial orientation and positioning in 3D space.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fischer, Bayerl, Neumann, Christobal, Redondo: Are iterations and curvature useful for tensor voting? In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 158–169. Springer, Heidelberg (2004)
Chapter Google Scholar
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison-Wesley Longman Publishing Co. Inc., Boston (2001)
Google Scholar
Hermann, T., Meinicke, P., Ritter, H.: Principal curve sonification. In: Cook, P.R. (ed.) Proc. of the Int. Conf. on Auditory Display, pp. 81–86. Int. Community for Auditory Display (2000a)
Google Scholar
Hermann, T., Nattkemper, T., Schubert, W., Ritter, H.: Sonification of multi-channel image data. In: Falavar, V. (ed.) Proc. of the Mathematical and Engineering Techniques in Medical and Biological Sciences (METMBS 2000), pp. 745–750. CSREA Press (2000b)
Google Scholar
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Machine Intell. 20(11), 1254–1259 (1998)
Article Google Scholar
Krüger, N., Wörgötter, F.: Symbolic pointillism: Computer art motivated by human brain structures. Leonardo 38(4), 337–340 (2005)
Article Google Scholar
Marr, D.: Vision. W.H. Freeman and Company, New York (1982)
Google Scholar
Martins, A.C.G., Rangayyan, R.M., Ruschioni, R.A.: Audification and sonification of texture in images. Journal of Electronic Imaging 10(3), 690–705 (2001)
Article Google Scholar
Meijer, P.B.: An experimental system for auditory image representation. IEEE Transactions on Biomedical Engineering 39(2), 112–121 (1992)
Article Google Scholar
Rath, M., Rocchesso, D.: Continuous sonic feedback from a rolling ball. IEEE Multimedia 12(2), 60–69 (2005)
Article Google Scholar
Simoncelli, E.P., Heeger, D.J.: A model of neuronal responses in visual area MT. Vision Research 38(5), 743–761 (1998)
Article Google Scholar
Trucco, E., Verri, A.: Introductory Techniques for 3-D Computer Vision. Prentice Hall PTR, Upper Saddle River (1998)
Google Scholar
Weidenbacher, U., Bayerl, P., Fleming, R., Neumann, H.: Extracting and depicting the 3d shape of specular surfaces. In: Siggraph Symposium on Applied Perception and Graphics in Visualization, pp. 83–86. ACM, New York (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Media Informatics,
Thorsten Mahler & Michael Weber
Department of Neuro Informatics, University of Ulm, Ulm, Germany
Pierre Bayerl & Heiko Neumann

Authors

Thorsten Mahler
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Bayerl
View author publications
You can also search for this author in PubMed Google Scholar
Heiko Neumann
View author publications
You can also search for this author in PubMed Google Scholar
Michael Weber
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Augsburg, Germany
Elisabeth André
Natural Interactive Systems Laboratory (NISLab), University of Southern Denmark,, Campusvej 55, 5230, Odense, Denmark
Laila Dybkjær
Department of Information Technology, University of Ulm, Ulm, Germany
Wolfgang Minker
Institute of Neural Information Processing, University of Ulm, 89069, Ulm
Heiko Neumann
Institut für Medieninformatik, Universität Ulm, Ulm, Germany
Michael Weber

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mahler, T., Bayerl, P., Neumann, H., Weber, M. (2006). Visual Attention in Auditory Display. In: André, E., Dybkjær, L., Minker, W., Neumann, H., Weber, M. (eds) Perception and Interactive Technologies. PIT 2006. Lecture Notes in Computer Science(), vol 4021. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11768029_7

Download citation

DOI: https://doi.org/10.1007/11768029_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34743-9
Online ISBN: 978-3-540-34744-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics