skip to main content
10.1145/3459212.3459223acmotherconferencesArticle/Chapter ViewAbstractPublication PagesivspConference Proceedingsconference-collections
research-article

Proposal of the Aesthetic Experience-Oriented Evaluation Framework for Field-recording Sound Retrieval System: Experiments using Acoustic Feature Signatures Based on Multiscale Fractal Dimension

Authors Info & Claims
Published:20 July 2021Publication History

ABSTRACT

Sound designers and musicians often need to retrieve sound materials based on their similarity to aesthetic hearing experiences from sound databases such as Freesound. This study proposes an aesthetic experience-oriented evaluation framework for a field-recording sound retrieval system, using the sound clips extracted from Freesound. Furthermore, we discuss the features of the framework by analyzing the performance of the similarity search system for field-recording sound material using acoustic feature signatures that are based on the multiscale fractal dimension.

References

  1. V. Akkermans, F. Font, J. Funollet, B. de Jong, G. Roma, S. Togias, and X. Serra, “FREESOUND 2.0: An Improved Platformfor Sharing Audio Clips,” 12th Int. Soc. Music Inf. Retr. Conf., (2011).Google ScholarGoogle Scholar
  2. Music Technology Group of Universitat Pompeu Fabra, “The Freesound Project.,” https://www.freesound.org/Google ScholarGoogle Scholar
  3. Distributed Creation Inc., "Splice - Royalty-Free Sounds & Rent-to-Own Plugins," Retrieved Feb 4, 2021 from https://splice.com/Google ScholarGoogle Scholar
  4. S. Chachada and C. C. J. Kuo, “Environmental sound recognition: A survey,” APSIPA Trans. Signal Inf. Process., vol. 3, (2014).Google ScholarGoogle ScholarCross RefCross Ref
  5. D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, “Detection and Classification of Acoustic Scenes and Events,” IEEE Trans. Multimed., vol. 17, no. 10, pp. 1733–1746, (2015).Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Chu, S. Narayanan, and C.-C. Kuo, “Environmental Sound Recognition With Time-Frequency Audio Features,” IEEE Trans. Audio. Speech. Lang. Processing, vol. 17, no. 6, pp. 1142–1158, (2009).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Solomos, “A Phenomenological Experience of Sound: Notes on Francisco López,” Contemp. Music Rev., vol. 38, no. 1–2, pp. 94–106, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  8. M. Sunouchi, and Y. Tanaka, “Similarity Search of Freesound Environmental Sound Based on Their Enhanced Multiscale Fractal Dimension,” Sound Music Comput. Conf. 2013, SMC 2013, pp. 715–721, (2013).Google ScholarGoogle Scholar
  9. M. Sunouchi, and M. Yoshioka. "Diversity-Robust Acoustic Feature Signatures Based on Multiscale Fractal Dimension for Similarity Search of Environmental Sounds." arXiv preprint arXiv:2102.02964 (2021).Google ScholarGoogle Scholar
  10. "Sound Dataset extracted from Freesound," Online available. https://labs.43d.jp/fs3000_dataset/fs3000_dataset.tar.bz2 (17GB)Google ScholarGoogle Scholar
  11. M. Porter, “An algorithm for suffix stripping,” Progr. Electron. Libr. Inf. Syst., vol. 14, no. 3, pp. 130–137, (1980).Google ScholarGoogle ScholarCross RefCross Ref
  12. P. Maragos and A. Potamianos, “Fractal dimensions of speech sounds: computation and application to automatic speech recognition.” J. Acoust. Soc. Am., vol. 105, no. 3, pp. 1925–1932, (1999).Google ScholarGoogle ScholarCross RefCross Ref
  13. A. Zlatintsi and P. Maragos, “Multiscale Fractal Analysis of Musical Instrument Signals With Application to Recognition,” IEEE Trans. Audio. Speech. Lang. Processing, vol. 21, no. 4, pp. 737–748, (2013).Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. SPTK working group, “Speech Signal Processing Toolkit (SPTK),” http://sp-tk.sourceforge.net/, (Retrieved 2021-2-4).Google ScholarGoogle Scholar
  15. Y. Wang, L. Wang, Y. Li, D. He, T.-Y. Liu, and W. Chen, “A Theoretical Analysis of NDCG Type Ranking Measures,” Proc. 26th Annu. Conf. Learn. Theory, pp. 1–30, 2013.Google ScholarGoogle Scholar

Index Terms

  1. Proposal of the Aesthetic Experience-Oriented Evaluation Framework for Field-recording Sound Retrieval System: Experiments using Acoustic Feature Signatures Based on Multiscale Fractal Dimension
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        IVSP '21: Proceedings of the 2021 3rd International Conference on Image, Video and Signal Processing
        March 2021
        132 pages
        ISBN:9781450388917
        DOI:10.1145/3459212

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 20 July 2021

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)5
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format