
A Multi-channel Audio Compression Method with Virtual Source Location Information

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3767)

Abstract

Binaural cue coding (BCC) was introduced as an efficient representation method for MPEG-4 SAC (Spatial Audio Coding). However, in low bit-rate environments, the spectrum of BCC output signals is perceptually degraded. The system proposed in this paper estimates virtual source location information (VSLI) as the side information. VSLI represents the spatial image between channels as an angle on the playback layout. Subjective assessment results show that the proposed method provides better audio quality than the BCC method for encoding multi-channel signals.
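The abstract does not say how the angle is computed. As a rough illustration of what "an angle representation of the spatial image between two channels" could look like, the sketch below uses the well-known stereophonic tangent panning law. This is an assumption made for illustration, not the estimator described in the paper. The function names, the 30-degree half aperture, and the sign convention (positive angles toward the left loudspeaker) are all hypothetical.

```python
import math


def estimate_pan_angle(left_gain, right_gain, half_aperture_deg=30.0):
    """Map a pair of channel amplitude gains to one angle (tangent law).

    Assumed relation: tan(theta) / tan(theta0) = (gL - gR) / (gL + gR),
    where theta0 is half the angle between the two loudspeakers.
    Positive angles point toward the left loudspeaker (assumed convention).
    """
    total = left_gain + right_gain
    if total <= 0.0:
        return 0.0  # silent band: fall back to the centre position
    ratio = (left_gain - right_gain) / total
    theta0 = math.radians(half_aperture_deg)
    return math.degrees(math.atan(ratio * math.tan(theta0)))


def gains_from_angle(angle_deg, half_aperture_deg=30.0):
    """Invert the tangent law into a constant-power gain pair."""
    theta0 = math.radians(half_aperture_deg)
    ratio = math.tan(math.radians(angle_deg)) / math.tan(theta0)
    g_left, g_right = 1.0 + ratio, 1.0 - ratio
    norm = math.hypot(g_left, g_right)  # enforce gL^2 + gR^2 = 1
    return g_left / norm, g_right / norm


if __name__ == "__main__":
    angle = estimate_pan_angle(0.8, 0.6)
    print(f"estimated angle: {angle:.2f} degrees")
    print("reconstructed gains:", gains_from_angle(angle))
```

In a scheme of this kind, a decoder would receive a downmix plus one angle per frequency band and channel pair, and would redistribute the downmix energy using the recovered gains. Per-band amplitude gains could be taken as the square root of band energy; that choice is likewise an assumption.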




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Moon, Hg., Seo, Ji., Beak, S., Sung, KM. (2005). A Multi-channel Audio Compression Method with Virtual Source Location Information. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_65


  • DOI: https://doi.org/10.1007/11581772_65

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30027-4

  • Online ISBN: 978-3-540-32130-9

  • eBook Packages: Computer Science, Computer Science (R0)
