Chroma binary similarity and local alignment applied to cover song identification

We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the Music Information Re...

Descripción completa

Detalles Bibliográficos
Autores: Serrà Julià, Joan, Gómez Gutiérrez, Emilia, 1975-, Herrera Boyer, Perfecto, 1964-, Serra, Xavier
Tipo de recurso: artículo
Estado:Versión aceptada para publicación
Fecha de publicación:2008
País:España
Institución:Universitat Pompeu Fabra
Repositorio:Repositorio Digital de la UPF
OAI Identifier:oai:repositori.upf.edu:10230/16277
Acceso en línea:http://hdl.handle.net/10230/16277
http://dx.doi.org/10.1109/TASL.2008.924595
Access Level:acceso abierto
Palabra clave:Tonalitat (Música)
Música
So -- Enregistrament i reproducció -- Tècniques digitals
Acoustic signal analysis
Dynamic programming
Information retrieval
Multidimensional sequences
Music
Descripción
Sumario:We present a new technique for audio signal comparison based on tonal subsequence alignment and its application to detect cover versions (i.e., different performances of the same underlying musical piece). Cover song identification is a task whose popularity has increased in the Music Information Retrieval (MIR) community along in the past, as it provides a direct and objective way to evaluate music similarity algorithms./nThis article first presents a series of experiments carried out/nwith two state-of-the-art methods for cover song identification./nWe have studied several components of these (such as chroma resolution and similarity, transposition, beat tracking or Dynamic Time Warping constraints), in order to discover which characteristics would be desirable for a competitive cover song identifier. After analyzing many cross-validated results, the importance of these characteristics is discussed, and the best-performing ones are finally applied to the newly proposed method. Multiple/nevaluations of this one confirm a large increase in identification/naccuracy when comparing it with alternative state-of-the-art/napproaches.