skip to main content
research-article

Audio Musical Dice Game: A User-Preference-Aware Medley Generating System

Published:02 June 2015Publication History
Skip Abstract Section

Abstract

This article proposes a framework for creating user-preference-aware music medleys from users' music collections. We treat the medley generation process as an audio version of a musical dice game. Once the user's collection has been analyzed, the system is able to generate various pleasing medleys. This flexibility allows users to create medleys according to the specified conditions, such as the medley structure or the must-use clips. Even users without musical knowledge can compose medley songs from their favorite tracks. The effectiveness of the system has been evaluated through both objective and subjective experiments on individual components in the system.

References

  1. Claudio Baccigalupo and Enric Plaza. 2006. Case-based sequential ordering of songs for playlist recommendation. In Advances in Case-Based Reasoning, Lecture Notes in Computer Science, vol. 4106. 286--300. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Luke Barrington, Antoni B. Chan, and Gert Lanckriet. 2010. Modeling music as a dynamic texture. IEEE Trans. Audio Speech Lang. Process. 18, 3, 602--612. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Sumit Basu. 2004. Mixing with Mozart. In Proceedings of the International Computer Music Conference.Google ScholarGoogle Scholar
  4. Gilberto Bernardes, Carlos Guedes, and Bruce Pennycook. 2012. EarGram: An application for interactive exploration of large databases of audio snippets for creative purposes. In Proceedings of the Symposium on Computer Music Multidisciplinary Research. 19--22.Google ScholarGoogle Scholar
  5. Charles Burkhart. 2005. The phrase rhythm of Chopin's A-Flat Major Mazurka, Op. 59, No. 2. In Engaging Music: Essays in Music Analysis, Deborah Jane Stein (Ed.), Oxford University Press, 3--12.Google ScholarGoogle Scholar
  6. Luca Chiarandini, Massimiliano Zanoni, and Augusto Sarti. 2011. A system for dynamic playlist generation driven by multimodal control signals and descriptors. In Proceedings of the International Workshop on Multimedia Signal Processing.Google ScholarGoogle ScholarCross RefCross Ref
  7. Dave Cliff. 2000. Hang the DJ: Automatic sequencing and seamless mixing of dance-music tracks. Tech. Rep. HP Labs.Google ScholarGoogle Scholar
  8. Richard Cole and Ed Schwartz. 2012. Virginia Tech Multimedia Music Dictionary. http://www.music.vt.edu/musicdictionary/.Google ScholarGoogle Scholar
  9. David Cope. 1987. Experiments in music intelligence. In Proceedings of the International Computer Music Conference.Google ScholarGoogle Scholar
  10. David Cope. 1996. Experiments in Musical Intelligence. A-R Editions, Madison, WI.Google ScholarGoogle Scholar
  11. Roger B. Dannenberg. 2006. Concatenative synthesis using score-aligned transcriptions music analysis and segmentation. In Proceedings of the International Computer Music Conference. 352--355.Google ScholarGoogle Scholar
  12. Matthew E. P. Davies, Philippe Hamel, Kazuyoshi Yoshii, and Masataka Goto. 2013. AutoMashUpper: An automatic multi-song mashup system. In Proceedings of the International Society for Music Information Retrieval.Google ScholarGoogle Scholar
  13. Steven B. Davis and Paul Mermelstein. 1980. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Audio Speech Lang. Process. 28, 4, 357--366.Google ScholarGoogle ScholarCross RefCross Ref
  14. Simon Dixon. 2007. Evaluation of the audio beat tracking system BeatRoot. J. New Music Res. 36, 1, 39--50.Google ScholarGoogle ScholarCross RefCross Ref
  15. Mark Dolson. 1986. The phase vocoder: A tutorial. Computer Music J. 10, 4, 14--27.Google ScholarGoogle ScholarCross RefCross Ref
  16. Arthur Flexer, Dominik Schnitzer, Martin Gasser, and Gerhard Widmer. 2008. Playlist generation using start and end songs. In Proceedings of the International Society for Music Information Retrieval. 173--178.Google ScholarGoogle Scholar
  17. Jonathan Foote. 1999. Visualizing music and audio using self-similarity. In Proceedings of the. ACM Multimedia Conference. 77--80. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. David Forney Jr. 1973. The Viterbi algorithm. Proc. IEEE 61, 3, 302--309.Google ScholarGoogle Scholar
  19. Garth Griffin, Y. E. Kim, and Douglas Turnbull. 2010. Beat-Sync-Mash-Coder: A web application for real-time creation of beat-synchronous music mashups. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.Google ScholarGoogle ScholarCross RefCross Ref
  20. Pierre Hanna, Pascal Ferraro, and Matthias Robine. 2007. On optimizing the editing algorithms for evaluating similarity between monophonic musical sequences. J. New Music Res. 36, 4, 267--279.Google ScholarGoogle ScholarCross RefCross Ref
  21. Pierre Hanna, Matthias Robine, and Thomas Rocher. 2009. An alignment based system for chord sequence retrieval. In Proceedings of the Joint Conference on Digital Libraries. 101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Hiromi Ishizaki, Keiichiro Hoashi, and Yasuhiro Takishima. 2009. Full-automatic DJ mixing system with optimal tempo adjustment based on measurement function of user discomfort. In Proceedings of the International Society for Music Information Retrieval. 135--140.Google ScholarGoogle Scholar
  23. Tristan Jehan. 2005. Creating music by listening. PhD Dissertation, Massachusetts Institute of Technology.Google ScholarGoogle Scholar
  24. Jesper Højvang Jensen, Mads Græsbø ll Christensen, Daniel P. W. Ellis, and Søren Holdt Jensen. 2009. Quantitative analysis of a common audio similarity measure. IEEE Trans. Audio Speech Lang. Process. 17, 4, 693--703. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Mohsen Kamalzadeh, Dominikus Baur, and Torsten Möller. 2012. A survey on music listening and management behaviors. In Proceedings of the International Society for Music Information Retrieval. 373--378.Google ScholarGoogle Scholar
  26. Ryoho Kobayashi. 2003. Sound clustering synthesis using spectral data. In Proceedings of the International Computer Music Conference.Google ScholarGoogle Scholar
  27. Alison Latham. 2011. The Oxford Companion to Music. http://www.oxfordmusiconline.com/subscriber/book/omo t114.Google ScholarGoogle Scholar
  28. Ioannidis Leonidas and Jean-Luc Rouas. 2012. Exploiting semantic content for singing voice detection. In Proceedings of the IEEE International Conference on Semantic Computing (ICSC'12). 134--137. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Yipeng Li and DeLiang Wang. 2007. Separation of singing voice from music accompaniment for monaural recordings. IEEE Trans. Audio Speech Lang. Process. 15, 4, 1475--1487. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Rensis Likert. 1932. A technique for the measurement of attitudes. Archives Psychol. 22, 140, 1--55.Google ScholarGoogle Scholar
  31. Heng-Yi Lin, Yin-Tzu Lin, Ming-Chun Tien, and Ja-Ling Wu. 2009. Music paste: Concatenating music clips based on chroma and rhythm features. In Proceedings of the International Society for Music Information Retrieval. Kobe.Google ScholarGoogle Scholar
  32. Qian Lin, Lie Lu, ChristopherWeare, and Frank Seide. 2010. Music rhythm characterization with application to workout-mix generation. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. 69--72.Google ScholarGoogle ScholarCross RefCross Ref
  33. I-Ting Liu, Yin-Tzu Lin, and Ja-Ling Wu. 2013. Music cut and paste: A personalized musical medley generating system. In Proceedings of the International Society for Music Information Retrieval.Google ScholarGoogle Scholar
  34. Zhang Liu, Chaokun Wang, Jianmin Wang, Hao Wang, and Yiyuan Bai. 2012. Adaptive music resizing with stretching, cropping and insertion. Multimedia Syst. 19, 4, 359--380. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Beth Logan. 2002. Content-based playlist generation: Exploratory experiments. In Proceedings of the International Society for Music Information Retrieval. 2--3.Google ScholarGoogle Scholar
  36. Beth Logan and Stephen Chu. 2000. Music summarization using key phrases. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Vol. 2. 749--752. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Gareth Loy. 2006. Musimathics. Vol. 1, MIT Press, 295--296,347--350.Google ScholarGoogle Scholar
  38. Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli, and Xi Shao. 2004. Content-based music structure analysis with applications to music semantics understanding. In Proceedings of the. ACM Multimedia Conference. 112--119. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Yizhao Ni, Matt McVicar, Paul Santos-Rodriguez, and Tijl De Bie. 2012. An end-to-end machine learning system for harmonic analysis of music. IEEE Trans. Audio Speech Lang. Process. 20, 6, 1771--1783. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Oriol Nieto, M. M. Farbood, Tristan Jehan, and J. P. Bello. 2014. Perceptual analysis of the f-measure for evaluating section boundaries in music. In Proceedings of the International Society for Music Information Retrieval. 265--270.Google ScholarGoogle Scholar
  41. Tin Lay Nwe, Arun Shenoy, and Ye Wang. 2004. Singing voice detection in popular music. In Proceedings of the. ACM Multimedia Conference. 324--327. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Jouni Paulus, Meinard Müller, and Anssi Klapuri. 2010. Audio-based music structure analysis. In Proceedings of the International Society for Music Information Retrieval. 625--636.Google ScholarGoogle Scholar
  43. Johan Pauwels, Florian Kaiser, and Geoffroy Peeters. 2013. Combining harmony-based and novelty-based approaches for structural segmentation. In Proceedings of the International Society for Music Information Retrieval.Google ScholarGoogle Scholar
  44. Don Michael Randel. 2003. The Harvard Dictionary of Music. Belknap Press.Google ScholarGoogle Scholar
  45. Lise Regnier and Geoffroy Peeters. 2009. Singing voice detection in music tracks using direct voice vibrato detection. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. 1685--1688. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Diemo Schwarz. 2005. Current research in concatenative sound synthesis. In Proceedings of the International Computer Music Conference.Google ScholarGoogle Scholar
  47. Diemo Schwarz. 2007. Corpus-based concatenative synthesis. IEEE Signal Process Mag. 24, 2, 92--104.Google ScholarGoogle ScholarCross RefCross Ref
  48. Diemo Schwarz, Roland Cahen, and Sam Britton. 2008. Principles and applications of interactive corpus based concatenative synthesis. Journées d'Informatique Musicale.Google ScholarGoogle Scholar
  49. Man-Kwan Shan and Shih-Chuan Chiu. 2010. Algorithmic compositions based on discovered musical patterns. Multimedia Tools Appl. 46, 1, 1--23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. Mohammad Soleymani, Micheal N. Caro, Erik M. Schmidt, Cheng-Ya Sha, and Yi-Hsuan Yang. 2013. 1000 songs for emotional analysis of music. In Proceedings of the International ACM Workshop on Crowdsourcing for Multimedia. 1--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Min-Yian Su, Yi-Hsuan Yang, Yu-Ching Lin, and Homer H. Chen. 2009. An integrated approach to music boundary detection. In Proceedings of the International Society for Music Information Retrieval. 705--710.Google ScholarGoogle Scholar
  52. Douglas Turnbull and Gert Lanckriet. 2007. A supervised approach for detecting boundaries in music using difference features and boosting. In Proceedings of the International Society for Music Information Retrieval. 42--49.Google ScholarGoogle Scholar
  53. Stephen Webber. 2007. DJ Skills: The Essential Guide to Mixing and Scratching. Focal Press. 212--239.Google ScholarGoogle Scholar
  54. Ron J. Weiss and Juan Pablo Bello. 2010. Identifying repeated patterns in music using sparse convolutive non-negative matrix factorization. In Proceedings of the International Society for Music Information Retrieval.Google ScholarGoogle Scholar
  55. Stephan Wenger and Marcus Magnor. 2011. Constrained example-based audio synthesis. In Proceedings of the International Conference on Multimedia and Expo. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Robert B. Zajonc. 1968. Attitudinal effects of mere exposure. J. Personality Social Psych. 9, 2, Part 2, 1--27.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Audio Musical Dice Game: A User-Preference-Aware Medley Generating System

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 11, Issue 4
        April 2015
        231 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/2788342
        Issue’s Table of Contents

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 June 2015
        • Accepted: 1 December 2014
        • Revised: 1 July 2014
        • Received: 1 February 2014
        Published in tomm Volume 11, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader