Skip to main content

Visual Attention in Auditory Display

  • Conference paper
  • 736 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4021))

Abstract

The interdisciplinary field of image sonification aims at the transformation of images to auditory signals. It brings together researchers from different fields of computer science like sound synthesizing, data mining and human computer interaction. Its goal is the use of sound and all its attributes to display the data sets itself and thus making the highly developed human aural system usable for data analysis. Unlike previous approaches we aim to sonify images of any kind. We propose that models of visual attention and visual grouping can be utilized to dynamically select relevant visual information to be sonified. For the auditory synthesis we employ an approach, which takes advantage of the sparseness of the selected input data. The presented approach proposes a combination of data sonification approaches, such as auditory scene generation, and models of human visual perception. It extends previous pixel-based transformation algorithms by incorporating mid-level vision coding and high-level control. The mapping utilizes elaborated sound parameters that allow non-trivial orientation and positioning in 3D space.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fischer, Bayerl, Neumann, Christobal, Redondo: Are iterations and curvature useful for tensor voting? In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 158–169. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  2. Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison-Wesley Longman Publishing Co. Inc., Boston (2001)

    Google Scholar 

  3. Hermann, T., Meinicke, P., Ritter, H.: Principal curve sonification. In: Cook, P.R. (ed.) Proc. of the Int. Conf. on Auditory Display, pp. 81–86. Int. Community for Auditory Display (2000a)

    Google Scholar 

  4. Hermann, T., Nattkemper, T., Schubert, W., Ritter, H.: Sonification of multi-channel image data. In: Falavar, V. (ed.) Proc. of the Mathematical and Engineering Techniques in Medical and Biological Sciences (METMBS 2000), pp. 745–750. CSREA Press (2000b)

    Google Scholar 

  5. Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Machine Intell. 20(11), 1254–1259 (1998)

    Article  Google Scholar 

  6. Krüger, N., Wörgötter, F.: Symbolic pointillism: Computer art motivated by human brain structures. Leonardo 38(4), 337–340 (2005)

    Article  Google Scholar 

  7. Marr, D.: Vision. W.H. Freeman and Company, New York (1982)

    Google Scholar 

  8. Martins, A.C.G., Rangayyan, R.M., Ruschioni, R.A.: Audification and sonification of texture in images. Journal of Electronic Imaging 10(3), 690–705 (2001)

    Article  Google Scholar 

  9. Meijer, P.B.: An experimental system for auditory image representation. IEEE Transactions on Biomedical Engineering 39(2), 112–121 (1992)

    Article  Google Scholar 

  10. Rath, M., Rocchesso, D.: Continuous sonic feedback from a rolling ball. IEEE Multimedia 12(2), 60–69 (2005)

    Article  Google Scholar 

  11. Simoncelli, E.P., Heeger, D.J.: A model of neuronal responses in visual area MT. Vision Research 38(5), 743–761 (1998)

    Article  Google Scholar 

  12. Trucco, E., Verri, A.: Introductory Techniques for 3-D Computer Vision. Prentice Hall PTR, Upper Saddle River (1998)

    Google Scholar 

  13. Weidenbacher, U., Bayerl, P., Fleming, R., Neumann, H.: Extracting and depicting the 3d shape of specular surfaces. In: Siggraph Symposium on Applied Perception and Graphics in Visualization, pp. 83–86. ACM, New York (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mahler, T., Bayerl, P., Neumann, H., Weber, M. (2006). Visual Attention in Auditory Display. In: André, E., Dybkjær, L., Minker, W., Neumann, H., Weber, M. (eds) Perception and Interactive Technologies. PIT 2006. Lecture Notes in Computer Science(), vol 4021. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11768029_7

Download citation

  • DOI: https://doi.org/10.1007/11768029_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34743-9

  • Online ISBN: 978-3-540-34744-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics