Automatic self-similarity based form labelling of classical-period piano sonata movements from audio recordings

Publisher

Institute of Electrical and Electronics Engineers

Abstract

Musical form refers to the overall structure or organisation of a musical composition. It is a complex, high-level property of music that requires musical training to identify. A review of previous research indicates that the focus has been on detecting section boundaries, and that automatic audio-based form label recognition remains largely unexplored. This study explores the complex task of automatically determining musical form from audio. It demonstrates the ability of a novel methodology to label eight different form types that occur in the movements of Classical-period piano sonatas. The methodology uses self-similarity matrices, generated from features extracted from raw audio, as input to a convolutional neural network. The superiority of our approach was confirmed by evaluating it against a neural network model based on state-of-the-art features. We also report an evaluation of self-similarity matrices based on automatically transcribed piano rolls for the task of form recognition. Piano rolls are demonstrated to be superior for this application when compared to a range of other feature representations. Additionally, the performance of the model is shown to be robust to variations in performer choices. These range from different interpretations of the same score to actual deviations from the score, where performers may elect to play or omit notated repeats, highlighting the model's ability to generalise across different performances of the same piece.
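
As a rough illustration of the self-similarity idea mentioned in the abstract, the following minimal sketch computes a cosine self-similarity matrix from a frame-wise feature sequence. It is not the authors' implementation: the function name `self_similarity_matrix`, the cosine measure, the 88-key binary toy "piano roll", and the section lengths are all assumptions chosen for illustration, and the convolutional network that would consume the matrix is omitted.

```python
import numpy as np


def self_similarity_matrix(features: np.ndarray) -> np.ndarray:
    """Cosine self-similarity of a (frames, dims) feature sequence.

    Hypothetical helper for illustration; the similarity measure used
    in the paper is not stated in the abstract.
    """
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    # Guard against division by zero for silent (all-zero) frames.
    unit = features / np.maximum(norms, 1e-12)
    return unit @ unit.T


# Toy binary "piano roll" with an A-B-A layout (assumed data, not from the paper).
rng = np.random.default_rng(0)
section_a = (rng.random((20, 88)) > 0.9).astype(float)
section_b = (rng.random((20, 88)) > 0.9).astype(float)
piano_roll = np.vstack([section_a, section_b, section_a])

ssm = self_similarity_matrix(piano_roll)
print(ssm.shape)
```

In this toy case the repeated A section shows up as an off-diagonal block that matches the top-left block of the matrix, which is the kind of repetition pattern an image-style classifier could plausibly pick up on.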

Keywords

Music, Annotations, Labeling, Feature extraction, Neural networks, Training, Timbre, Convolutional neural networks, Audio recording, Classical music, Deep learning, Form recognition, Music information retrieval, Music structure analysis

Sustainable Development Goals

SDG-04: Quality Education
SDG-09: Industry, Innovation and Infrastructure

Citation

P.A. Burger and J.P. Jacobs, "Automatic Self-Similarity Based Form Labelling of Classical-Period Piano Sonata Movements From Audio Recordings," in IEEE Transactions on Audio, Speech and Language Processing, vol. 33, pp. 3414-3427, 2025, doi: 10.1109/TASLPRO.2025.3594301.