Voice Quality Modelling for Expressive Speech Synthesis

This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of express...

Descripción completa

Detalles Bibliográficos
Autores: Monzo Sánchez, Carlos, Iriondo Sanz, Ignasi, Socoró Carrié, Joan Claudi
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2013
País:España
Institución:Universitat Ramon Llull (URL)
Repositorio:DAU Arxiu Digital de la Universitat Ramon Llull
OAI Identifier:oai:dau.url.edu:20.500.14342/3425
Acceso en línea:http://hdl.handle.net/20.500.14342/3425
http://dx.doi.org/10.1155/2014/627189
Access Level:acceso abierto
Palabra clave:Processament de la parla
Anàlisi prosòdica (Lingüística)
62
id ES_22b899dfdc661fbcd4bd3e97249863fa
oai_identifier_str oai:dau.url.edu:20.500.14342/3425
network_acronym_str ES
network_name_str España
repository_id_str
spelling Voice Quality Modelling for Expressive Speech SynthesisMonzo Sánchez, CarlosIriondo Sanz, IgnasiSocoró Carrié, Joan ClaudiProcessament de la parlaAnàlisi prosòdica (Lingüística)62This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.HindawiUniversitat Ramon Llull. La SalleUniversitat Oberta de Catalunya202020232020202320132014info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersion13 p.application/pdfhttp://hdl.handle.net/20.500.14342/3425http://dx.doi.org/10.1155/2014/627189RECERCAT (Dipòsit de la Recerca de Catalunya)reponame:DAU Arxiu Digital de la Universitat Ramon Llullinstname:Universitat Ramon Llull (URL)InglésScientific World Journal, 2014, Vol. 2014 (Gener)© L'autor/aAttribution 4.0 Internationalhttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessoai:dau.url.edu:20.500.14342/34252026-06-21T06:40:37Z
dc.title.none.fl_str_mv Voice Quality Modelling for Expressive Speech Synthesis
title Voice Quality Modelling for Expressive Speech Synthesis
spellingShingle Voice Quality Modelling for Expressive Speech Synthesis
Monzo Sánchez, Carlos
Processament de la parla
Anàlisi prosòdica (Lingüística)
62
title_short Voice Quality Modelling for Expressive Speech Synthesis
title_full Voice Quality Modelling for Expressive Speech Synthesis
title_fullStr Voice Quality Modelling for Expressive Speech Synthesis
title_full_unstemmed Voice Quality Modelling for Expressive Speech Synthesis
title_sort Voice Quality Modelling for Expressive Speech Synthesis
dc.creator.none.fl_str_mv Monzo Sánchez, Carlos
Iriondo Sanz, Ignasi
Socoró Carrié, Joan Claudi
author Monzo Sánchez, Carlos
author_facet Monzo Sánchez, Carlos
Iriondo Sanz, Ignasi
Socoró Carrié, Joan Claudi
author_role author
author2 Iriondo Sanz, Ignasi
Socoró Carrié, Joan Claudi
author2_role author
author
dc.contributor.none.fl_str_mv Universitat Ramon Llull. La Salle
Universitat Oberta de Catalunya
dc.subject.none.fl_str_mv Processament de la parla
Anàlisi prosòdica (Lingüística)
62
topic Processament de la parla
Anàlisi prosòdica (Lingüística)
62
description This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics.
publishDate 2013
dc.date.none.fl_str_mv 2013
2014
2020
2020
2023
2023
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/20.500.14342/3425
http://dx.doi.org/10.1155/2014/627189
url http://hdl.handle.net/20.500.14342/3425
http://dx.doi.org/10.1155/2014/627189
dc.language.none.fl_str_mv Inglés
language_invalid_str_mv Inglés
dc.relation.none.fl_str_mv Scientific World Journal, 2014, Vol. 2014 (Gener)
dc.rights.none.fl_str_mv © L'autor/a
Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv © L'autor/a
Attribution 4.0 International
http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 13 p.
application/pdf
dc.publisher.none.fl_str_mv Hindawi
publisher.none.fl_str_mv Hindawi
dc.source.none.fl_str_mv RECERCAT (Dipòsit de la Recerca de Catalunya)
reponame:DAU Arxiu Digital de la Universitat Ramon Llull
instname:Universitat Ramon Llull (URL)
instname_str Universitat Ramon Llull (URL)
reponame_str DAU Arxiu Digital de la Universitat Ramon Llull
collection DAU Arxiu Digital de la Universitat Ramon Llull
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869404598332030976
score 15,300724