Using jitter and shimmer in speaker verification

Bibliographic Details
Authors: Farrús, Mireia; Hernando, Javier
Resource type: article
Status: Accepted version for publication
Publication date: 2009
Country: Spain
Institution: Various* (Consorci de Biblioteques Universitàries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repository: Recercat. Dipòsit de la Recerca de Catalunya
OAI Identifier: oai:recercat.cat:10230/32737
Online access: http://hdl.handle.net/10230/32737
http://dx.doi.org/10.1049/iet-spr.2008.0147
Access level: open access
Keywords: Support vector machines
Jitter
Speaker recognition
Description
Summary: Jitter and shimmer are measures of the cycle-to-cycle variations of fundamental frequency and amplitude, respectively. Both features have been widely used to describe pathological voices, and since they characterise some aspects of particular voices, they are expected to have a certain degree of speaker specificity. In the current work, jitter and shimmer are successfully used in a speaker verification experiment. Moreover, both measures are combined with spectral and prosodic features using several types of normalisation and fusion techniques in order to obtain better verification results. The overall speaker verification system is also improved by using histogram equalisation as a normalisation technique prior to fusing the features by SVM.
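
To make the definitions in the summary concrete, the following is a minimal sketch of one common formulation of the two measures: the mean absolute difference between consecutive cycles, divided by the mean value over all cycles. It assumes the pitch periods and per-cycle peak amplitudes have already been extracted; the function names and the sample values are hypothetical, and the exact jitter and shimmer variants used in the article may differ.

```python
def relative_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values[:-1], values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return mean_diff / mean_value


def relative_jitter(periods):
    """Cycle-to-cycle variation of the pitch period (assumed input: periods in seconds)."""
    return relative_perturbation(periods)


def relative_shimmer(amplitudes):
    """Cycle-to-cycle variation of the peak amplitude (assumed input: one amplitude per cycle)."""
    return relative_perturbation(amplitudes)


# Illustrative values only, not taken from the article.
periods = [0.010, 0.011, 0.010, 0.011]
amplitudes = [0.50, 0.52, 0.49, 0.51]
jitter = relative_jitter(periods)
shimmer = relative_shimmer(amplitudes)
```

Both measures are dimensionless ratios, so a perfectly periodic signal with constant amplitude yields zero for each.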