Normal hearing and verbal discrimination in real sounds environments

Lodeiro Colatosti, Adriana; Pla Gil, Ignacio; Morant Ventura, Antonio; Latorre Monteagudo, Emilia; Chacón Aranda, Lucía; Marco Algarra, Jaime

doi:10.1016/j.otoeng.2024.05.005

Tables (3)

Table 1. Phonetic transcription in Spanish of Marrero and Cárdenas’ lists of words used in each sound environment.

Table 2. Demographic and audiological characteristics.

Table 3. Affected phonemes and allophones according to the sound environment studied.

Abstract

Introduction

Human beings are exposed daily to complex acoustic environments that pose challenges even for individuals with normal hearing. Speech perception does not rely solely on fixed elements within the acoustic wave; it is also influenced by speech intensity, environmental noise, the presence of other speakers, individual-specific characteristics, spatial separation of sound sources, ambient reverberation, and audiovisual cues. The objective of this study is twofold: to determine the ability of normal-hearing individuals to discriminate spoken words in real-life acoustic conditions, and to perform a phonetic analysis of the misunderstood words.

Materials and methods

This is a descriptive, observational, cross-sectional study involving 20 normal-hearing individuals. Speech audiometry was conducted in a free-field setting, with the words masked by simulated real-world acoustic environments at various sound intensity levels. To reinforce the sounds, related 2D images were displayed on a television. We analyzed the percentage of correct answers and performed a phonetic analysis of the misunderstood Spanish disyllabic words in each environment.
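The two analyses described above (percentage of correct answers and a tally of affected segments in misunderstood words) can be sketched as follows. This is a minimal illustration, not the authors' procedure: the function names, the representation of words as tuples of phonetic segments, the example words, and the simple rule "a segment is counted as missed if it does not appear anywhere in the response" are all assumptions made for this sketch.

```python
from collections import Counter


def discrimination_score(presented, responses):
    """Return the percentage of presented words that were repeated correctly."""
    if len(presented) != len(responses):
        raise ValueError("presented and responses must have the same length")
    if not presented:
        raise ValueError("word list is empty")
    correct = sum(p == r for p, r in zip(presented, responses))
    return 100.0 * correct / len(presented)


def missed_segments(presented, responses):
    """Count segments of each misunderstood word that are absent from the response.

    Assumption: this is a crude, unaligned comparison chosen for brevity;
    a real phonetic analysis would align target and response segment by segment.
    """
    missed = Counter()
    for target, heard in zip(presented, responses):
        if target != heard:
            missed.update(seg for seg in target if seg not in heard)
    return missed


# Hypothetical word list and listener responses (not data from the study).
presented = [
    ("k", "a", "s", "a"),
    ("m", "a", "n", "o"),
    ("b", "o", "k", "a"),
    ("p", "e", "r", "o"),
]
responses = [
    ("k", "a", "s", "a"),
    ("m", "a", "l", "o"),
    ("b", "o", "k", "a"),
    ("p", "e", "l", "o"),
]

score = discrimination_score(presented, responses)
errors = missed_segments(presented, responses)
print(f"{score:.1f}% correct")
print(dict(errors))
```

Running the same two functions once per sound environment would give one score and one segment tally per environment, which is the shape of data the results below summarize.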

Results

The sample comprised 14 women (70%) and 6 men (30%), with a mean age of 26 ± 5.4 years and a mean air-conduction hearing threshold of 10.56 ± 3.52 dB SPL in the right ear and 10.12 ± 2.49 dB SPL in the left ear. Verbal discrimination was 97.2 ± 5.04% in the “Ocean” sound environment, 94 ± 4.58% in “Restaurant”, and 86.2 ± 9.94% in “Traffic” (p < 0.001). In the phonetic analysis, the allophones that exhibited statistically significant differences were: [o] (p = 0.002) among the vowel phonemes, [n] (p < 0.001) among the voiced nasal consonants, [r] (p = 0.0016) among the voiced vibrants, and [b] (p < 0.001) and [g] (p = 0.045) among the voiced stops.

Conclusion

The dynamic properties of the acoustic environment can impair the ability of a normal-hearing individual to extract information from a voice signal. Our study shows that this ability decreases when the voice signal is masked by one or more simultaneous interfering voices, as in the “Restaurant” environment, and when it is masked by continuous, intense noise, as in the “Traffic” environment. In the phonetic analysis, when the sound environment consisted of continuous low-frequency noise, nasal consonants were particularly difficult to identify. Furthermore, in situations with distracting verbal signals, vowels and vibrant consonants showed the worst intelligibility.

Keywords:

Speech in noise

Auditory outcomes

Realistic environment

Normal hearing

Phonetic

Phoneme recognition

Intelligibility

Resumen (English translation)

Introduction

Human beings are exposed to everyday acoustic environments that are complex and become a challenge even for people whose hearing is within normal limits. Speech perception does not depend simply on invariant elements directly accessible in the acoustic wave; it is conditioned by speech intensity, environmental noise, the number of people taking part in a conversation, individual-specific factors, the spatial separation of sound sources, ambient reverberation, and audiovisual cues, among others. The objective of this study is to determine the ability of normal-hearing people to discriminate the spoken word under acoustic conditions found in real life and to carry out a phonetic analysis of the erroneous words.

Materials and methods

This is a descriptive, observational, cross-sectional study of 20 normal-hearing people who underwent free-field speech audiometry masked with simulated real sound environments at different sound intensity ranges. The sounds were reinforced with visual support from related 2D images shown on a television. The percentage of correct responses for the disyllabic words presented was analyzed, and a phonetic analysis of the erroneous words was carried out in each environment.

Results

There were 14 women (70%) and 6 men (30%), with a mean age of 26 ± 5.4 years and a mean air-conduction hearing threshold of 10.56 ± 3.52 dB SPL in the right ear and 10.12 ± 2.49 dB SPL in the left ear. Verbal discrimination was 97.2 ± 5.04% in the “Ocean” sound environment, 94 ± 4.58% in “Restaurant”, and 86.2 ± 9.94% in “Traffic” (p < 0.001). In the phonetic analysis, the allophones that showed statistically significant differences were: the allophone [o] (p = 0.002) in the group of vowel phonemes, the allophone [n] (p < 0.001) of the voiced nasal consonants, the allophone [r] (p = 0.016) of the voiced vibrants, and the allophones [b] (p < 0.001) and [g] (p = 0.045) of the voiced stops.

Conclusions

The dynamic properties of the acoustic environment can affect the ability of a person with normal hearing to understand a verbal signal. Our study shows that this ability decreases when the verbal signal is masked by one or more simultaneous voices, as in the “Restaurant” environment, and when it is masked by continuous, intense noise, as in the “Traffic” environment. In the phonetic analysis, when the sound environment consisted of continuous low-frequency noise, the voiced nasal consonants [m] [ɲ] [n] were the hardest to identify, and when distracting verbal stimuli were present, the vowels and the voiced vibrant consonants [ɾ] [ɾɾ] showed the worst intelligibility.

Keywords:

Hearing in noise

Normal hearing

Phonetics

Intelligibility

Sound environments

Publication: Acta Otorrinolaringológica Española