DOI: 10.1145/2166966.2167017
poster

Audio cloud: creation and rendering

Published: 14 February 2012

ABSTRACT

Word clouds are widely used on the World Wide Web to summarize the prominent words in a document. Such clouds give the user an idea of the document's content. In this paper, we present a mechanism to create and render an audio cloud for audio content. Such audio clouds are expected to provide a similar summary of audio documents. They have wide applicability in various domains, especially for low-literate users who currently do not use the Internet but interact with audio-based systems.

Detecting words in audio content is challenging, especially if the audio is in a language for which no speech recognition system exists. We present a language-independent mechanism to detect frequently occurring words within an audio document. We then present four ways to render the words that form an audio cloud. The four rendering prototypes are based on varying the amplitude, the voice quality, the echo, and the repetition of audio words. An evaluation study conducted with 32 users suggests that both literate and low-literate users easily understand the concept of an audio cloud.
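The abstract names the rendering dimensions but not how each is driven. As a rough sketch only, the Python below assumes that a word's occurrence count sets its playback gain and its number of repetitions, and that echo is a single delayed, attenuated copy of the signal. The function names, the linear mappings, and the default parameters are all hypothetical and are not taken from the paper. Voice-quality rendering and the word-detection step are not covered.

```python
from typing import Dict, List


def amplitude_gains(counts: Dict[str, int], min_gain: float = 0.3) -> Dict[str, float]:
    """Map each word's count linearly to a playback gain in [min_gain, 1.0].

    Assumption: more frequent words are played louder.
    """
    lo, hi = min(counts.values()), max(counts.values())
    span = hi - lo
    if span == 0:
        # All words are equally frequent, so play them all at full gain.
        return {word: 1.0 for word in counts}
    return {
        word: min_gain + (1.0 - min_gain) * (count - lo) / span
        for word, count in counts.items()
    }


def repeat_counts(counts: Dict[str, int], max_repeats: int = 3) -> Dict[str, int]:
    """Map each word's count to a number of repetitions in 1..max_repeats.

    Assumption: more frequent words are repeated more often.
    """
    hi = max(counts.values())
    return {
        word: max(1, round(max_repeats * count / hi))
        for word, count in counts.items()
    }


def scale(samples: List[float], gain: float) -> List[float]:
    """Apply a constant gain to a list of audio samples."""
    return [gain * s for s in samples]


def add_echo(samples: List[float], delay: int, decay: float) -> List[float]:
    """Mix one copy of the signal, delayed by `delay` samples and scaled by `decay`."""
    out = list(samples) + [0.0] * delay
    for i, s in enumerate(samples):
        out[i + delay] += decay * s
    return out
```

With hypothetical counts such as `{"water": 9, "crop": 5, "rain": 1}`, the most frequent word gets the highest gain and the most repetitions, and the least frequent word gets the floor values. Whether louder, repeated, or echoed playback is the right cue for prominence is exactly the kind of question the paper's user evaluation addresses.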


    • Published in

      IUI '12: Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
      February 2012
      436 pages
      ISBN: 9781450310482
      DOI: 10.1145/2166966

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • poster

      Acceptance Rates

      Overall Acceptance Rate: 746 of 2,811 submissions, 27%
