DOI: 10.1145/2393347.2396495

Automatic music video generation: cross matching of music and image

Published: 29 October 2012

ABSTRACT

Music and images are two of the most popular media on the Internet, and human perception of the two is highly correlated. The music video is one product of this correlation, in which music and images complement each other. In this paper, we present a system that automatically generates a music video for a given song. The challenge for such a system lies in selecting relevant images and aligning them with the song. We address this challenge by leveraging the lyrics (if they exist) and the semantic similarity between music and images. We retrieve related images from the Internet using lyrics keywords as queries, and use a learning-based method to estimate a semantic score between an image and a music segment. Finally, we construct the music video after quality filtering and refinement. Our system also allows users to upload their own images and re-pick recommended images to personalize the music video.
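
The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Candidate` type, the `select_images` function, the greedy one-image-per-segment strategy, and the `quality_threshold` value are all assumptions made for illustration. The semantic and quality scores are taken as given inputs, standing in for the learning-based scorer and the quality filter that the abstract mentions.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    """A retrieved image with precomputed scores (hypothetical structure)."""
    image_id: str
    semantic_score: float  # assumed output of a learned music-image matcher
    quality_score: float   # assumed output of a photo quality assessor


def select_images(
    segments: List[List[Candidate]],
    quality_threshold: float = 0.5,  # assumed cutoff, not from the paper
) -> List[Optional[str]]:
    """Greedily pick one image per music segment, in song order.

    Candidates below the quality threshold are filtered out, already used
    images are skipped, and the remaining candidate with the highest
    semantic score is chosen. A segment with no usable candidate gets None.
    """
    used = set()
    timeline: List[Optional[str]] = []
    for candidates in segments:
        usable = [
            c for c in candidates
            if c.quality_score >= quality_threshold and c.image_id not in used
        ]
        if not usable:
            timeline.append(None)
            continue
        best = max(usable, key=lambda c: c.semantic_score)
        used.add(best.image_id)
        timeline.append(best.image_id)
    return timeline


if __name__ == "__main__":
    song = [
        [Candidate("a", 0.90, 0.8), Candidate("b", 0.95, 0.2)],
        [Candidate("a", 0.99, 0.9), Candidate("c", 0.40, 0.7)],
        [Candidate("d", 0.90, 0.1)],
    ]
    print(select_images(song))
```

In this sketch, a user re-picking a recommended image would amount to overriding one entry of the returned timeline; a refinement pass over neighboring segments is left out for brevity.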

Supplemental Material

d318.mp4 (MP4, 39.6 MB)
