ABSTRACT
In this paper, we present madmom, an open-source audio processing and music information retrieval (MIR) library written in Python. madmom features a concise, NumPy-compatible, object oriented design with simple calling conventions and sensible default values for all parameters, which facilitates fast prototyping of MIR applications. Prototypes can be seamlessly converted into callable processing pipelines through madmom's concept of Processors, callable objects that run transparently on multiple cores. Processors can also be serialised, saved, and re-run to allow results to be easily reproduced anywhere. Apart from low-level audio processing, madmom puts emphasis on musically meaningful high-level features. Many of these incorporate machine learning techniques and madmom provides a module that implements some methods commonly used in MIR such as hidden Markov models and neural networks. Additionally, madmom comes with several state-of-the-art MIR algorithms for onset detection, beat, downbeat and meter tracking, tempo estimation, and chord recognition. These can easily be incorporated into bigger MIR systems or run as stand-alone programs.
- S. Behnel, R. Bradshaw, C. Citro, L. Dalcin, D. Seljebotn, and K. Smith. Cython: The Best of Both Worlds. Computing in Science Engineering, 13(2), 2011. Google ScholarDigital Library
- S. Böck, F. Krebs, and G. Widmer. A Multi-model Approach to Beat Tracking considering Heterogeneous Music Styles. In Proc. of the 15th Int. Society for Music Information Retrieval Conf. (ISMIR), 2014.Google Scholar
- S. Böck, F. Krebs, and G. Widmer. Accurate Tempo Estimation based on Recurrent Neural Networks and Resonating Comb Filters. In Proc. of the 16th Int. Society for Music Information Retrieval Conf. (ISMIR), 2015.Google Scholar
- S. Böck, F. Krebs, and G. Widmer. Joint Beat and Downbeat Tracking with Recurrent Neural Networks. In Proc. of the 17th Int. Society for Music Information Retrieval Conf. (ISMIR), 2016.Google Scholar
- S. Böck and M. Schedl. Enhanced Beat Tracking with Context-Aware Neural Networks. In Proc. of the 14th Int. Conf. on Digital Audio Effects (DAFx), 2011.Google Scholar
- D. Bogdanov, N. Wack, E. Gómez, S. Gulati, P. Herrera, O. Mayor, G. Roma, J. Salamon, J. Zapata, and X. Serra. Essentia: an open source library for sound and music analysis. In In Proc. of ACM Multimedia, 2013. Google ScholarDigital Library
- S. Dieleman, J. Schlüter, C. Raffel, E. Olson, S. K. Sønderby, D. Nouri, E. Battenberg, A. van den Oord, et al. Lasagne: First release., 2015.Google Scholar
- F. Eyben, S. Böck, B. Schuller, and A. Graves. Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks. In Proc. of the 11th Int. Society for Music Information Retrieval Conf. (ISMIR), 2010.Google Scholar
- F. Eyben, F. Weninger, F. Gross, and B. Schuller. Recent Developments in openSMILE, the Munich Open-Source Multimedia Feature Extractor. In In Proc. of ACM Multimedia, Barcelona, Spain, 2013. Google ScholarDigital Library
- E. Jones, T. Oliphant, P. Peterson, et al. SciPy: Open source scientific tools for Python, 2001--. {Online; accessed 2016-05--20}.Google Scholar
- F. Korzeniowski, S. Böck, and G. Widmer. Probabilistic Extraction of Beat Positions from a Beat Activation Function. In Proc. of the 15th Int. Society for Music Information Retrieval Conf. (ISMIR), 2014.Google Scholar
- F. Korzeniowski and G. Widmer. Feature learning for chord recognition: The deep chroma extractor. In Proc. of the 17th Int. Society for Music Information Retrieval Conf. (ISMIR), 2016.Google Scholar
- F. Korzeniowski and G. Widmer. A fully convolutional deep auditory model for musical chord recognition. In Proc. of the IEEE Int. Workshop on Machine Learning for Signal Processing (MLSP), 2016.Google ScholarCross Ref
- F. Krebs, S. Böck, and G. Widmer. Rhythmic Pattern Modeling for Beat and Downbeat Tracking in Musical Audio. In Proc. of the 14th Int. Society for Music Information Retrieval Conf. (ISMIR), 2013.Google Scholar
- O. Lartillot and P. Toiviainen. A Matlab toolbox for musical feature extraction from audio. In Proc. of the 10th Int. Conf. on Digital Audio Effects (DAFx), 2007.Google Scholar
- B. Mathieu, S. Essid, T. Fillon, and J. Prado. YAAFE, an easy to use and efficient audio feature extraction software. In Proc. of the 11th Int. Society for Music Information Retrieval Conf. (ISMIR), 2010.Google Scholar
- B. McFee, C. Raffel, D. Liang, D. Ellis, M. McVicar, E. Battenberg, and O. Nieto. librosa: Audio and Music Signal Analysis in Python. In Proc. of the 14th Python in Science Conf. (SCIPY), 2015.Google ScholarCross Ref
- F. Pérez and B. E. Granger. IPython: A System for Interactive Scientific Computing. Computing in Science Engineering, 9(3), 2007. Google ScholarDigital Library
- J. Schlüter and S. Böck. Musical Onset Detection with Convolutional Neural Networks. In Proceedings of the 6th International Workshop on Machine Learning and Music, Prague, Czech Republic, 2013.Google Scholar
- J. Schlüter and S. Böck. Improved musical onset detection with convolutional neural networks. In Proc. of the 39th Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), 2014.Google ScholarCross Ref
- G. Tzanetakis and P. Cook. MARSYAS: a framework for audio analysis. Organised Sound, 4, 2000. Google ScholarDigital Library
- S. van der Walt, S. C. Colbert, and G. Varoquaux. The NumPy Array: A Structure for Efficient Numerical Computation. Computing in Science Engineering, 13(2), 2011. Google ScholarDigital Library
Index Terms
- madmom: A New Python Audio and Music Signal Processing Library
Recommendations
ESSENTIA: an open-source library for sound and music analysis
MM '13: Proceedings of the 21st ACM international conference on MultimediaWe present Essentia 2.0, an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license. It contains an extensive collection of reusable algorithms which implement audio input/output ...
On the suitability of state-of-the-art music information retrieval methods for analyzing, categorizing and accessing non-Western and ethnic music collections
With increasing amounts of music being available in digital form, research in music information retrieval has turned into a dominant field to support organization of and easy access to large collections of music. Yet, most research is focussed ...
Chord Progressions Selection Based on Song Audio Features
Hybrid Artificial Intelligent SystemsAbstractA chord progression is an essential building block in music. In the field of music theory is usually assumed that these progressions influence the mood, emotion, genre or other critical aspects of the songs, and also in the perception that they ...
Comments