MusicMixer: Automatic DJ System Considering Beat and Latent Topic Similarity

Hirai, Tatsunori; Doi, Hironori; Morishima, Shigeo

doi:10.1007/978-3-319-27671-7_59

Tatsunori Hirai¹⁹,
Hironori Doi²⁰ &
Shigeo Morishima^21,22

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9516))

Included in the following conference series:

International Conference on Multimedia Modeling

3114 Accesses
1 Citations

Abstract

This paper presents MusicMixer, an automatic DJ system that mixes songs in a seamless manner. MusicMixer mixes songs based on audio similarity calculated via beat analysis and latent topic analysis of the chromatic signal in the audio. The topic represents latent semantics about how chromatic sounds are generated. Given a list of songs, a DJ selects a song with beat and sounds similar to a specific point of the currently playing song to seamlessly transition between songs. By calculating the similarity of all existing pairs of songs, the proposed system can retrieve the best mixing point from innumerable possibilities. Although it is comparatively easy to calculate beat similarity from audio signals, it has been difficult to consider the semantics of songs as a human DJ considers. To consider such semantics, we propose a method to represent audio signals to construct topic models that acquire latent semantics of audio. The results of a subjective experiment demonstrate the effectiveness of the proposed latent semantic analysis method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

From raw audio to a seamless mix: creating an automated DJ system for Drum and Bass

Article Open access 24 September 2018

A Hierarchical Harmonic Mixing Method

AI-based Chinese-style music generation from video content: a study on cross-modal analysis and generation methods

Article Open access 12 February 2025

Notes

1.
The word “mix” here refers to the gradual transiton of one song to another.

References

Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Ishizaki, H., Hoashi, K., Takishima, Y.: Full-automatic DJ mixing system with optimal tempo adjustment based on measurement function of user discomfort. In: Proceedings of ISMIR, pp. 135–140 (2009)
Google Scholar
Platt, J. Burges, C., Swenson, S., Weare, C., Zheng, A.: Learning a gaussian process prior for automatically generating music playlists. In: Proceedings of NIPS, pp. 1425–1432 (2001)
Google Scholar
Aucouturier, J.J., Pachet, F.: Scaling up music playlist generation. In: Proceedings of ICME, pp. 105–108 (2002)
Google Scholar
Pampalk, E., Pohle, T., Widmer, G.: Dynamic playlist generation based on skipping behavior. In: Proceedings of ISMIR, pp. 634–637 (2005)
Google Scholar
Ragno, R., Burges, C., Herley, C.: Inferring similarity between music objects with application to playlist generation. In: Proceedings of MIR, pp. 73–80 (2005)
Google Scholar
Goto, M., Goto, T.: Musicream: integrated music-listening interface for active, flexible, and unexpected encounters with musical pieces. Inf. Media Technol. 5(1), 139–152 (2010)
Google Scholar
Davies, M., Hamel, P., Yoshii, K., Goto, M.: AutoMashUpper: automatic creation of multi-song music mashups. Trans. Audio Speech Lang. Process. 22(12), 1726–1737 (2014)
Article Google Scholar
Tokui, N.: Massh!: a web-based collective music mashup system. In: Proceedings of DIMEA, pp. 526–527 (2009)
Google Scholar
Sasaki, S., Yoshii, K., Nakano, T., Goto, M., Morishima, S.: LyricsRadar: a lyrics retrieval system based on latent topics of lyrics. In: Proceedings of ISMIR, pp. 585–590 (2014)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings of ICCV, pp. 1470–1477 (2003)
Google Scholar
Nakano, T., Yoshii, K., Goto, M.: Vocal timbre analysis using latent Dirichlet allocation and cross-gender vocal timbre similarity. In: Proceedings of ICASSP, pp. 5202–5206 (2014)
Google Scholar
Hu, D., Saul, L.: A probabilistic topic model for unsupervised learning of musical key-profiles. In: Proceedings of ISMIR, pp. 441–446 (2009)
Google Scholar
Hu, D., Saul, L.: A probabilistic topic model for music analysis. In: Proceedings of NIPS (2009)
Google Scholar
Goto, M.: Development of the RWC music database. In: Proceedings of ICA, pp. 553–556 (2004)
Google Scholar
Hirai, T., Sasaki, S., Morishima, S.: MusicMean: fusion-based music generation. In: Proceedings of SMC, pp. 323–327 (2015)
Google Scholar

Download references

Acknowledgments

This work was supported by OngaCREST, CREST, JST and JSPS Grant-in-Aid for JSPS Fellows. This work was inspired by Tonkatsu DJ Agetaro.

Author information

Authors and Affiliations

Waseda University, Tokyo, Japan
Tatsunori Hirai
Dwango, Tokyo, Japan
Hironori Doi
Waseda Research Institute for Science and Engineering, Tokyo, Japan
Shigeo Morishima
JST CREST, Tokyo, Japan
Shigeo Morishima

Authors

Tatsunori Hirai
View author publications
You can also search for this author in PubMed Google Scholar
Hironori Doi
View author publications
You can also search for this author in PubMed Google Scholar
Shigeo Morishima
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tatsunori Hirai .

Editor information

Editors and Affiliations

University of Texas at San Antonio, San Antonio, USA
Qi Tian
Dept. of Information Engineering, University of Trento, Povo, Trento, Italy
Nicu Sebe
EECS, University of Central Florida, Orlando, Florida, USA
Guo-Jun Qi
EURECOM, Sophia-Antipolis, France
Benoit Huet
Hefei University of Technology, Hefei, Anhui, China
Richang Hong
School of Computing and Information, Hefei University of Technology, Hefei, Anhui, China
Xueliang Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hirai, T., Doi, H., Morishima, S. (2016). MusicMixer: Automatic DJ System Considering Beat and Latent Topic Similarity. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9516. Springer, Cham. https://doi.org/10.1007/978-3-319-27671-7_59

Download citation

DOI: https://doi.org/10.1007/978-3-319-27671-7_59
Published: 03 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27670-0
Online ISBN: 978-3-319-27671-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics