Skip to main content

A Matlab Toolbox for Music Information Retrieval

  • Conference paper

Abstract

We present MIRToolbox, an integrated set of functions written in Matlab, dedicated to the extraction from audio files of musical features related, among others, to timbre, tonality, rhythm or form. The objective is to offer a state of the art of computational approaches in the area of Music Information Retrieval (MIR). The design is based on a modular framework: the different algorithms are decomposed into stages, formalized using a minimal set of elementary mechanisms, and integrating different variants proposed by alternative approaches — including new strategies we have developed —, that users can select and parametrize. These functions can adapt to a large area of objects as input.

This paper offers an overview of the set of features that can be extracted with MIRToolbox, illustrated with the description of three particular musical features. The toolbox also includes functions for statistical analysis, segmentation and clustering.

One of our main motivations for the development of the toolbox is to facilitate investigation of the relation between musical features and music-induced emotion. Preliminary results show that the variance in emotion ratings can be explained by a small set of acoustic features.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • EEROLA, T. and TOIVIAINEN, P. (2004): MIR in Matlab: The MIDI Toolbox. Proceedings of 5th International Conference on Music Information Retrieval, 22-27, Barcelona.

    Google Scholar 

  • FOOTE, J. and COOPER, M. (2003): Media segmentation using self-similarity decomposi-tion. In Proceedings of SPIE Storage and Retrieval for Multimedia Databases, 5021, 167-75.

    Google Scholar 

  • GOMEZ, E. (2006): Tonal description of polyphonic audio for music content processing. IN-FORMS Journal on Computing, 18-3, 294-304.

    Google Scholar 

  • JUSLIN, P. N. (1997): Emotional communication in music performance: A functionalist per-spective and some data. Music Perception, 14, 383-418.

    Google Scholar 

  • JUSLIN, P. N. and LAUKKA, P. (2003): Communication of emotions in vocal expression and music performance: Different channels, same code? Psychological Bulletin (129), 770-814.

    Article  Google Scholar 

  • KRUMHANSL, C. (1990): Cognitive Foundations of Musical Pitch. Oxford University Press, New York.

    Google Scholar 

  • KRUMHANSL, C. and KESSLER, E. J. (1982): Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. Psychological Review, 89, 334-368.

    Article  Google Scholar 

  • NABNEY, I. (2002): NETLAB: Algorithms for Pattern Recognition. Springer Advances In Pattern Recognition Series, Springer-Verlag, New-York.

    Google Scholar 

  • RABINER, L. and JUANG, B. H. (1993): Fundamentals of Speech Recognition. Prentice-Hall. SCHERER, K. R. and OSHINSKY J. S. (1977): Cue utilization in emotion attribution from auditory stimuli. Motivation and Emotion, 1-4, 331-346.

    Google Scholar 

  • SLANEY, M. (1998): Auditory Toolbox Version 2. Technical Report 1998-010, Interval Re-search Corporation.

    Google Scholar 

  • TOIVIAINEN, P. and KRUMHANSL, C. (2003): Measuring and modeling real-time re-sponses to music: The dynamics of tonality induction, Perception, 32-6, 741-766.

    Article  Google Scholar 

  • TOIVIAINEN, P. and SNYDER J. S. (2003): Tapping to Bach: Resonance-based modeling of pulse. Music Perception, 21(1), 43-80.

    Article  Google Scholar 

  • TZANETAKIS, G and COOK, P. (1999): Multifeature audio segmentation for browsing and annotation. Proceedings of the 1999 IEEE Workshop on Applications of Signal Process-ing to Audio and Acoustics. New-York.

    Google Scholar 

  • TZANETAKIS, G. and COOK, P. (2002): Musical genre classification of audio signals. IEEE Transactions on Speech and Audio Processing, 10(5), 293Ð302.

    Article  Google Scholar 

  • VESANTO, J. (1999): Self-organizing map in Matlab: the SOM Toolbox. Proceedings of the Matlab DSP Conference 1999. Espoo, Finland,35-40.

    Google Scholar 

  • WITTEN, I. H. and FRANK, E. (2005): Data Mining: Practical Machine Learning Tools and Techniques, 2nd Edition. Morgan Kaufmann, San Francisco, 2005.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lartillot, O., Toiviainen, P., Eerola, T. (2008). A Matlab Toolbox for Music Information Retrieval. In: Preisach, C., Burkhardt, H., Schmidt-Thieme, L., Decker, R. (eds) Data Analysis, Machine Learning and Applications. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78246-9_31

Download citation

Publish with us

Policies and ethics