Speech Recognition and Noise Adaptation in Realistic Noises

[EN] The recognition of isolated words in noise improves as words are delayed from the noise onset. This phenomenon, known as adaptation to noise, has been mostly investigated using synthetic noises. The aim here was to investigate whether adaptation occurs for realistic noises and to what extent it...

Descripción completa

Detalles Bibliográficos
Autores: Marrufo Pérez, Miriam Isabel, Lopez Poveda, Enrique A.
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2025
País:España
Institución:Universidad de Salamanca (USAL)
Repositorio:GREDOS. Repositorio Institucional de la Universidad de Salamanca
OAI Identifier:oai:gredos.usal.es:10366/167676
Acceso en línea:http://hdl.handle.net/10366/167676
Access Level:acceso abierto
Palabra clave:noise
sound-level statistics
speech intelligibility
6106.09 Procesos de Percepción
2411.13 Fisiología de la Audición
2490.01 Neurofisiología
id ES_34e9fc0bf3cce082b639de5dfd91cf81
oai_identifier_str oai:gredos.usal.es:10366/167676
network_acronym_str ES
network_name_str España
repository_id_str
spelling Speech Recognition and Noise Adaptation in Realistic NoisesMarrufo Pérez, Miriam IsabelLopez Poveda, Enrique A.noisesound-level statisticsspeech intelligibility6106.09 Procesos de Percepción2411.13 Fisiología de la Audición2490.01 Neurofisiología[EN] The recognition of isolated words in noise improves as words are delayed from the noise onset. This phenomenon, known as adaptation to noise, has been mostly investigated using synthetic noises. The aim here was to investigate whether adaptation occurs for realistic noises and to what extent it depends on the spectrum and level fluctuations of the noise. Forty-nine different realistic and synthetic noises were analyzed and classified according to how much they fluctuated in level over time and how much their spectra differed from the speech spectrum. Six representative noises were chosen that covered the observed range of level fluctuations and spectral differences but could still mask speech. For the six noises, speech reception thresholds (SRTs) were measured for natural and tone-vocoded words delayed 50 (early condition) and 800 ms (late condition) from the noise onset. Adaptation was calculated as the SRT improvement in the late relative to the early condition. Twenty-two adults with normal hearing participated in the experiments. For natural words, adaptation was small overall (mean = 0.5 dB) and similar across the six noises. For vocoded words, significant adaptation occurred for all six noises (mean = 1.3 dB) and was not statistically different across noises. For the tested noises, the amount of adaptation was independent of the spectrum and level fluctuations of the noise. The results suggest that adaptation in speech recognition can occur in realistic noisy environments.The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the European Regional Development Fund and Ministerio de Ciencia e Innovaci\u00F3n (grant number PID2019-108985GB-I00). A portion of the data was presented as the master thesis of Briggitte Escobar Campuzano (2021, Universidad de Salamanca). The authors thank her and Milagros J. Fumero for help with data collection. The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the European Regional Development Fund and Ministerio de Ciencia e Innovaci\u00F3n (grant number PID2019-108985GB-I00).202520252025info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://hdl.handle.net/10366/167676reponame:GREDOS. Repositorio Institucional de la Universidad de Salamancainstname:Universidad de Salamanca (USAL)InglésPID2019-108985GB-I00PID2019-108985GB-I00Attribution-NonCommercial-NoDerivatives 4.0 Internacionalhttp://creativecommons.org/licenses/by-nc-nd/4.0/info:eu-repo/semantics/openAccessoai:gredos.usal.es:10366/1676762026-06-07T06:28:51Z
dc.title.none.fl_str_mv Speech Recognition and Noise Adaptation in Realistic Noises
title Speech Recognition and Noise Adaptation in Realistic Noises
spellingShingle Speech Recognition and Noise Adaptation in Realistic Noises
Marrufo Pérez, Miriam Isabel
noise
sound-level statistics
speech intelligibility
6106.09 Procesos de Percepción
2411.13 Fisiología de la Audición
2490.01 Neurofisiología
title_short Speech Recognition and Noise Adaptation in Realistic Noises
title_full Speech Recognition and Noise Adaptation in Realistic Noises
title_fullStr Speech Recognition and Noise Adaptation in Realistic Noises
title_full_unstemmed Speech Recognition and Noise Adaptation in Realistic Noises
title_sort Speech Recognition and Noise Adaptation in Realistic Noises
dc.creator.none.fl_str_mv Marrufo Pérez, Miriam Isabel
Lopez Poveda, Enrique A.
author Marrufo Pérez, Miriam Isabel
author_facet Marrufo Pérez, Miriam Isabel
Lopez Poveda, Enrique A.
author_role author
author2 Lopez Poveda, Enrique A.
author2_role author
dc.subject.none.fl_str_mv noise
sound-level statistics
speech intelligibility
6106.09 Procesos de Percepción
2411.13 Fisiología de la Audición
2490.01 Neurofisiología
topic noise
sound-level statistics
speech intelligibility
6106.09 Procesos de Percepción
2411.13 Fisiología de la Audición
2490.01 Neurofisiología
description [EN] The recognition of isolated words in noise improves as words are delayed from the noise onset. This phenomenon, known as adaptation to noise, has been mostly investigated using synthetic noises. The aim here was to investigate whether adaptation occurs for realistic noises and to what extent it depends on the spectrum and level fluctuations of the noise. Forty-nine different realistic and synthetic noises were analyzed and classified according to how much they fluctuated in level over time and how much their spectra differed from the speech spectrum. Six representative noises were chosen that covered the observed range of level fluctuations and spectral differences but could still mask speech. For the six noises, speech reception thresholds (SRTs) were measured for natural and tone-vocoded words delayed 50 (early condition) and 800 ms (late condition) from the noise onset. Adaptation was calculated as the SRT improvement in the late relative to the early condition. Twenty-two adults with normal hearing participated in the experiments. For natural words, adaptation was small overall (mean = 0.5 dB) and similar across the six noises. For vocoded words, significant adaptation occurred for all six noises (mean = 1.3 dB) and was not statistically different across noises. For the tested noises, the amount of adaptation was independent of the spectrum and level fluctuations of the noise. The results suggest that adaptation in speech recognition can occur in realistic noisy environments.
publishDate 2025
dc.date.none.fl_str_mv 2025
2025
2025
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/10366/167676
url http://hdl.handle.net/10366/167676
dc.language.none.fl_str_mv Inglés
language_invalid_str_mv Inglés
dc.relation.none.fl_str_mv PID2019-108985GB-I00
PID2019-108985GB-I00
dc.rights.none.fl_str_mv Attribution-NonCommercial-NoDerivatives 4.0 Internacional
http://creativecommons.org/licenses/by-nc-nd/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivatives 4.0 Internacional
http://creativecommons.org/licenses/by-nc-nd/4.0/
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:GREDOS. Repositorio Institucional de la Universidad de Salamanca
instname:Universidad de Salamanca (USAL)
instname_str Universidad de Salamanca (USAL)
reponame_str GREDOS. Repositorio Institucional de la Universidad de Salamanca
collection GREDOS. Repositorio Institucional de la Universidad de Salamanca
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1869405847808901120
score 15,81155