Using jitter and shimmer in speaker verification

Jitter and shimmer are measures of the fundamental frequency and amplitude cycle-to-cycle variations, respectively. Both features have been largely used for the description of pathological voices, and since they characterise some aspects concerning particular voices, they are expected to have a cert...

ver descrição completa

Detalhes bibliográficos
Autores: Farrús, Mireia, Hernando, Javier
Tipo de documento: artigo
Estado:Versión aceptada para publicación
Data de publicação:2009
País:España
Recursos:Universitat Pompeu Fabra
Repositório:Repositorio Digital de la UPF
OAI Identifier:oai:repositori.upf.edu:10230/32737
Acesso em linha:http://hdl.handle.net/10230/32737
http://dx.doi.org/10.1049/iet-spr.2008.0147
Access Level:Acceso aberto
Palavra-chave:Support vector machines
Jitter
Speaker recognition
Descrição
Resumo:Jitter and shimmer are measures of the fundamental frequency and amplitude cycle-to-cycle variations, respectively. Both features have been largely used for the description of pathological voices, and since they characterise some aspects concerning particular voices, they are expected to have a certain degree of speaker specificity. In the current work, jitter and shimmer are successfully used in a speaker verification experiment. Moreover, both measures are combined with spectral and prosodic features using several types of normalisation and fusion techniques in order to obtain better verification results. The overall speaker verification system is also improved by using histogram equalisation as a normalisation technique previous to fusing the features by SVM.