Abstract
This article proposes a framework for creating user-preference-aware music medleys from users' music collections. We treat the medley generation process as an audio version of a musical dice game. Once the user's collection has been analyzed, the system is able to generate various pleasing medleys. This flexibility allows users to create medleys according to the specified conditions, such as the medley structure or the must-use clips. Even users without musical knowledge can compose medley songs from their favorite tracks. The effectiveness of the system has been evaluated through both objective and subjective experiments on individual components in the system.
- Claudio Baccigalupo and Enric Plaza. 2006. Case-based sequential ordering of songs for playlist recommendation. In Advances in Case-Based Reasoning, Lecture Notes in Computer Science, vol. 4106. 286--300. Google ScholarDigital Library
- Luke Barrington, Antoni B. Chan, and Gert Lanckriet. 2010. Modeling music as a dynamic texture. IEEE Trans. Audio Speech Lang. Process. 18, 3, 602--612. Google ScholarDigital Library
- Sumit Basu. 2004. Mixing with Mozart. In Proceedings of the International Computer Music Conference.Google Scholar
- Gilberto Bernardes, Carlos Guedes, and Bruce Pennycook. 2012. EarGram: An application for interactive exploration of large databases of audio snippets for creative purposes. In Proceedings of the Symposium on Computer Music Multidisciplinary Research. 19--22.Google Scholar
- Charles Burkhart. 2005. The phrase rhythm of Chopin's A-Flat Major Mazurka, Op. 59, No. 2. In Engaging Music: Essays in Music Analysis, Deborah Jane Stein (Ed.), Oxford University Press, 3--12.Google Scholar
- Luca Chiarandini, Massimiliano Zanoni, and Augusto Sarti. 2011. A system for dynamic playlist generation driven by multimodal control signals and descriptors. In Proceedings of the International Workshop on Multimedia Signal Processing.Google ScholarCross Ref
- Dave Cliff. 2000. Hang the DJ: Automatic sequencing and seamless mixing of dance-music tracks. Tech. Rep. HP Labs.Google Scholar
- Richard Cole and Ed Schwartz. 2012. Virginia Tech Multimedia Music Dictionary. http://www.music.vt.edu/musicdictionary/.Google Scholar
- David Cope. 1987. Experiments in music intelligence. In Proceedings of the International Computer Music Conference.Google Scholar
- David Cope. 1996. Experiments in Musical Intelligence. A-R Editions, Madison, WI.Google Scholar
- Roger B. Dannenberg. 2006. Concatenative synthesis using score-aligned transcriptions music analysis and segmentation. In Proceedings of the International Computer Music Conference. 352--355.Google Scholar
- Matthew E. P. Davies, Philippe Hamel, Kazuyoshi Yoshii, and Masataka Goto. 2013. AutoMashUpper: An automatic multi-song mashup system. In Proceedings of the International Society for Music Information Retrieval.Google Scholar
- Steven B. Davis and Paul Mermelstein. 1980. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Audio Speech Lang. Process. 28, 4, 357--366.Google ScholarCross Ref
- Simon Dixon. 2007. Evaluation of the audio beat tracking system BeatRoot. J. New Music Res. 36, 1, 39--50.Google ScholarCross Ref
- Mark Dolson. 1986. The phase vocoder: A tutorial. Computer Music J. 10, 4, 14--27.Google ScholarCross Ref
- Arthur Flexer, Dominik Schnitzer, Martin Gasser, and Gerhard Widmer. 2008. Playlist generation using start and end songs. In Proceedings of the International Society for Music Information Retrieval. 173--178.Google Scholar
- Jonathan Foote. 1999. Visualizing music and audio using self-similarity. In Proceedings of the. ACM Multimedia Conference. 77--80. Google ScholarDigital Library
- G. David Forney Jr. 1973. The Viterbi algorithm. Proc. IEEE 61, 3, 302--309.Google Scholar
- Garth Griffin, Y. E. Kim, and Douglas Turnbull. 2010. Beat-Sync-Mash-Coder: A web application for real-time creation of beat-synchronous music mashups. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.Google ScholarCross Ref
- Pierre Hanna, Pascal Ferraro, and Matthias Robine. 2007. On optimizing the editing algorithms for evaluating similarity between monophonic musical sequences. J. New Music Res. 36, 4, 267--279.Google ScholarCross Ref
- Pierre Hanna, Matthias Robine, and Thomas Rocher. 2009. An alignment based system for chord sequence retrieval. In Proceedings of the Joint Conference on Digital Libraries. 101. Google ScholarDigital Library
- Hiromi Ishizaki, Keiichiro Hoashi, and Yasuhiro Takishima. 2009. Full-automatic DJ mixing system with optimal tempo adjustment based on measurement function of user discomfort. In Proceedings of the International Society for Music Information Retrieval. 135--140.Google Scholar
- Tristan Jehan. 2005. Creating music by listening. PhD Dissertation, Massachusetts Institute of Technology.Google Scholar
- Jesper Højvang Jensen, Mads Græsbø ll Christensen, Daniel P. W. Ellis, and Søren Holdt Jensen. 2009. Quantitative analysis of a common audio similarity measure. IEEE Trans. Audio Speech Lang. Process. 17, 4, 693--703. Google ScholarDigital Library
- Mohsen Kamalzadeh, Dominikus Baur, and Torsten Möller. 2012. A survey on music listening and management behaviors. In Proceedings of the International Society for Music Information Retrieval. 373--378.Google Scholar
- Ryoho Kobayashi. 2003. Sound clustering synthesis using spectral data. In Proceedings of the International Computer Music Conference.Google Scholar
- Alison Latham. 2011. The Oxford Companion to Music. http://www.oxfordmusiconline.com/subscriber/book/omo t114.Google Scholar
- Ioannidis Leonidas and Jean-Luc Rouas. 2012. Exploiting semantic content for singing voice detection. In Proceedings of the IEEE International Conference on Semantic Computing (ICSC'12). 134--137. Google ScholarDigital Library
- Yipeng Li and DeLiang Wang. 2007. Separation of singing voice from music accompaniment for monaural recordings. IEEE Trans. Audio Speech Lang. Process. 15, 4, 1475--1487. Google ScholarDigital Library
- Rensis Likert. 1932. A technique for the measurement of attitudes. Archives Psychol. 22, 140, 1--55.Google Scholar
- Heng-Yi Lin, Yin-Tzu Lin, Ming-Chun Tien, and Ja-Ling Wu. 2009. Music paste: Concatenating music clips based on chroma and rhythm features. In Proceedings of the International Society for Music Information Retrieval. Kobe.Google Scholar
- Qian Lin, Lie Lu, ChristopherWeare, and Frank Seide. 2010. Music rhythm characterization with application to workout-mix generation. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. 69--72.Google ScholarCross Ref
- I-Ting Liu, Yin-Tzu Lin, and Ja-Ling Wu. 2013. Music cut and paste: A personalized musical medley generating system. In Proceedings of the International Society for Music Information Retrieval.Google Scholar
- Zhang Liu, Chaokun Wang, Jianmin Wang, Hao Wang, and Yiyuan Bai. 2012. Adaptive music resizing with stretching, cropping and insertion. Multimedia Syst. 19, 4, 359--380. Google ScholarDigital Library
- Beth Logan. 2002. Content-based playlist generation: Exploratory experiments. In Proceedings of the International Society for Music Information Retrieval. 2--3.Google Scholar
- Beth Logan and Stephen Chu. 2000. Music summarization using key phrases. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. Vol. 2. 749--752. Google ScholarDigital Library
- Gareth Loy. 2006. Musimathics. Vol. 1, MIT Press, 295--296,347--350.Google Scholar
- Namunu C. Maddage, Changsheng Xu, Mohan S. Kankanhalli, and Xi Shao. 2004. Content-based music structure analysis with applications to music semantics understanding. In Proceedings of the. ACM Multimedia Conference. 112--119. Google ScholarDigital Library
- Yizhao Ni, Matt McVicar, Paul Santos-Rodriguez, and Tijl De Bie. 2012. An end-to-end machine learning system for harmonic analysis of music. IEEE Trans. Audio Speech Lang. Process. 20, 6, 1771--1783. Google ScholarDigital Library
- Oriol Nieto, M. M. Farbood, Tristan Jehan, and J. P. Bello. 2014. Perceptual analysis of the f-measure for evaluating section boundaries in music. In Proceedings of the International Society for Music Information Retrieval. 265--270.Google Scholar
- Tin Lay Nwe, Arun Shenoy, and Ye Wang. 2004. Singing voice detection in popular music. In Proceedings of the. ACM Multimedia Conference. 324--327. Google ScholarDigital Library
- Jouni Paulus, Meinard Müller, and Anssi Klapuri. 2010. Audio-based music structure analysis. In Proceedings of the International Society for Music Information Retrieval. 625--636.Google Scholar
- Johan Pauwels, Florian Kaiser, and Geoffroy Peeters. 2013. Combining harmony-based and novelty-based approaches for structural segmentation. In Proceedings of the International Society for Music Information Retrieval.Google Scholar
- Don Michael Randel. 2003. The Harvard Dictionary of Music. Belknap Press.Google Scholar
- Lise Regnier and Geoffroy Peeters. 2009. Singing voice detection in music tracks using direct voice vibrato detection. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing. 1685--1688. Google ScholarDigital Library
- Diemo Schwarz. 2005. Current research in concatenative sound synthesis. In Proceedings of the International Computer Music Conference.Google Scholar
- Diemo Schwarz. 2007. Corpus-based concatenative synthesis. IEEE Signal Process Mag. 24, 2, 92--104.Google ScholarCross Ref
- Diemo Schwarz, Roland Cahen, and Sam Britton. 2008. Principles and applications of interactive corpus based concatenative synthesis. Journées d'Informatique Musicale.Google Scholar
- Man-Kwan Shan and Shih-Chuan Chiu. 2010. Algorithmic compositions based on discovered musical patterns. Multimedia Tools Appl. 46, 1, 1--23. Google ScholarDigital Library
- Mohammad Soleymani, Micheal N. Caro, Erik M. Schmidt, Cheng-Ya Sha, and Yi-Hsuan Yang. 2013. 1000 songs for emotional analysis of music. In Proceedings of the International ACM Workshop on Crowdsourcing for Multimedia. 1--6. Google ScholarDigital Library
- Min-Yian Su, Yi-Hsuan Yang, Yu-Ching Lin, and Homer H. Chen. 2009. An integrated approach to music boundary detection. In Proceedings of the International Society for Music Information Retrieval. 705--710.Google Scholar
- Douglas Turnbull and Gert Lanckriet. 2007. A supervised approach for detecting boundaries in music using difference features and boosting. In Proceedings of the International Society for Music Information Retrieval. 42--49.Google Scholar
- Stephen Webber. 2007. DJ Skills: The Essential Guide to Mixing and Scratching. Focal Press. 212--239.Google Scholar
- Ron J. Weiss and Juan Pablo Bello. 2010. Identifying repeated patterns in music using sparse convolutive non-negative matrix factorization. In Proceedings of the International Society for Music Information Retrieval.Google Scholar
- Stephan Wenger and Marcus Magnor. 2011. Constrained example-based audio synthesis. In Proceedings of the International Conference on Multimedia and Expo. Google ScholarDigital Library
- Robert B. Zajonc. 1968. Attitudinal effects of mere exposure. J. Personality Social Psych. 9, 2, Part 2, 1--27.Google ScholarCross Ref
Index Terms
- Audio Musical Dice Game: A User-Preference-Aware Medley Generating System
Recommendations
Mozart Musical Dice Game
SIGCSE 2023: Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 2A Musikalisches Würfelspiel (musical dice game) is a process for generating music by concatenating a random sequence of precomposed musical fragments. In this programming assignment, students "compose" a Viennese waltz in the style of Mozart by playing ...
Music Key Detection for Musical Audio
MMM '05: Proceedings of the 11th International Multimedia Modelling ConferenceThe key or the scale information of a piece of music provides important clues on its high level musical content, like harmonic and melodic context, which can be useful for music classification, retrieval or further content analysis. Researchers have ...
Precise pitch profile feature extraction from musical audio for key detection
The majority of pieces of music, including classical and popular music,are composed using music scales, such as keys. The key or the scale information of a piece provides important clues on its high level musical content, like harmonic and melodic ...
Comments