Voice quality modelling for expressive speech synthesis

Bibliographic details
Authors: Monzo Sánchez, Carlos; Iriondo Sanz, Ignasi; Socoró Carrié, Joan Claudi
Document type: article
Status: Published version
Publication date: 2014
Country: Spain
Resources: Various* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repository: Recercat. Dipósit de la Recerca de Catalunya
OAI Identifier:oai:recercat.cat:20.500.14342/3439
Online access: http://hdl.handle.net/20.500.14342/3439
https://doi.org/10.1155/2014/627189
Access level: Open access
Keywords: Parla (speech)
81
Description
Abstract: This paper presents the perceptual experiments carried out to validate a methodology for transforming speech from a neutral style into a number of expressive ones by modelling voice quality (VoQ) parameters along with the well-known prosodic parameters (F0, duration, and energy). The main goal was to validate the usefulness of VoQ in enhancing expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated that the expressive speech styles obtained improved when VoQ modelling was used along with prosodic characteristics.