DOI: 10.1145/3001773.3001782
research-article

DJ-MVP: An Automatic Music Video Producer

Published: 09 November 2016

ABSTRACT

A music video (MV) is a videotaped performance of a recorded popular song, usually accompanied by dancing and visual images. In this paper, we outline the design of a generative music video system that automatically produces an audio-video mashup for a given target audio track. The system first segments the target song based on beat detection. Next, it obtains video segments using audio similarity analysis and color heuristic selection methods. These video segments are then truncated to match the lengths of the audio segments and concatenated into the final music video. An evaluation of our system has shown that users are receptive to this novel presentation of music videos and are interested in future developments.
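To make the pipeline in the abstract more concrete, here is a minimal Python sketch of two of its steps: beat-based segmentation of the target track, and truncation of candidate clips to the segment lengths. It is an illustration under stated assumptions, not the authors' implementation. The function names, the local-energy beat heuristic, the parameter defaults, and the round-robin clip order are all hypothetical. The audio similarity and color heuristic selection steps are not modeled at all; the clips are assumed to have been chosen already.

```python
from typing import List, Tuple


def detect_beats(samples: List[float], sample_rate: int,
                 window: int = 1024, history: int = 43,
                 threshold: float = 1.3) -> List[float]:
    """Return beat times in seconds using a simple local-energy heuristic.

    Assumption: a window counts as a beat when its energy exceeds
    `threshold` times the mean energy of the preceding `history` windows.
    """
    # Energy of each non-overlapping window of samples.
    energies = [
        sum(x * x for x in samples[i:i + window])
        for i in range(0, len(samples) - window + 1, window)
    ]
    beats = []
    previous_was_beat = False
    for k in range(1, len(energies)):
        past = energies[max(0, k - history):k]
        average = sum(past) / len(past)
        is_beat = energies[k] > threshold * average
        # Report only the first window of a run of loud windows.
        if is_beat and not previous_was_beat:
            beats.append(k * window / sample_rate)
        previous_was_beat = is_beat
    return beats


def segment_by_beats(beat_times: List[float], duration: float,
                     beats_per_segment: int = 4) -> List[Tuple[float, float]]:
    """Cut the track at every `beats_per_segment`-th beat."""
    inner = [t for t in beat_times if 0.0 < t < duration]
    cuts = [0.0] + inner[beats_per_segment - 1::beats_per_segment] + [duration]
    return list(zip(cuts, cuts[1:]))


def truncate_clips(segments: List[Tuple[float, float]],
                   clip_lengths: List[float]) -> List[Tuple[int, float, float]]:
    """Plan the output as (clip index, start time in output, seconds used).

    Assumption: clips are taken round-robin in the given order; a real
    system would pick them by similarity and color heuristics instead.
    """
    plan = []
    for index, (start, end) in enumerate(segments):
        clip = index % len(clip_lengths)
        # Truncate the clip so it is never longer than the audio segment.
        used = min(end - start, clip_lengths[clip])
        plan.append((clip, start, used))
    return plan


if __name__ == "__main__":
    # Synthetic 4-second signal at 1 kHz with a short burst every 0.5 s.
    rate = 1000
    signal = [1.0 if (i // 100) % 5 == 0 and i >= 500 else 0.0
              for i in range(4000)]
    beats = detect_beats(signal, rate, window=100)
    segments = segment_by_beats(beats, duration=4.0, beats_per_segment=2)
    for entry in truncate_clips(segments, clip_lengths=[2.5, 0.8]):
        print(entry)
```

In this sketch a clip shorter than its audio segment is simply used in full, which leaves a gap in the output; how the actual system handles that case is not stated in the abstract.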

Published in

    ACE '16: Proceedings of the 13th International Conference on Advances in Computer Entertainment Technology
    November 2016
    373 pages
    ISBN: 9781450347730
    DOI: 10.1145/3001773

    Copyright © 2016 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate: 36 of 90 submissions, 40%