Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency

Marion A. David; Mathieu Lavandier; Nicolas Grimault; Andrew J. Oxenham

doi:10.1016/j.heares.2016.11.016

Article Dans Une Revue Hearing Research Année : 2017

Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency

(1) , (2, 3) , (4) , (5)

1
2
3
4
5

Marion A. David

Fonction : Auteur

ToxAlim

Mathieu Lavandier

Fonction : Auteur
PersonId : 867776
IdHAL : mathieu-lavandier
IdRef : 115714294

École Nationale des Travaux Publics de l'État

Laboratoire Génie Civil et Bâtiment

Nicolas Grimault

Fonction : Auteur
PersonId : 738564
IdHAL : nicolas-grimault
ORCID : 0000-0003-3586-4426
IdRef : 131828835

Neurosciences Sensorielles Comportement Cognition

Andrew J. Oxenham

Fonction : Auteur

Auditory Perception and Cognition Laboratory

Résumé

Differences in fundamental frequency (F0) between voiced sounds are known to be a strong cue for stream segregation. However, speech consists of both voiced and unvoiced sounds, and less is known about whether and how the unvoiced portions are segregated. This study measured listeners' ability to integrate or segregate sequences of consonant-vowel tokens, comprising a voiceless fricative and a vowel, as a function of the F0 difference between interleaved sequences of tokens. A performance-based measure was used, in which listeners detected the presence of a repeated token either within one sequence or between the two sequences (measures of voluntary and obligatory streaming, respectively). The results showed a systematic increase of voluntary stream segregation as the F0 difference between the two interleaved sequences increased from 0 to 13 semitones, suggesting that F0 differences allowed listeners to segregate speech sounds, including the unvoiced portions. In contrast to the consistent effects of voluntary streaming, the trend towards obligatory stream segregation at large F0 differences failed to reach significance. Listeners were no longer able to perform the voluntary-streaming task reliably when the unvoiced portions were removed from the stimuli, suggesting that the unvoiced portions were used and correctly segregated in the original task. The results demonstrate that streaming based on F0 differences occurs for natural speech sounds, and that the unvoiced portions are correctly assigned to the corresponding voiced portions.

Mots clés

Stream segregation Fundamental frequency Speech sounds Stream segregation Fundamental frequency Speech sounds

Domaines

Acoustique [physics.class-ph]

Fichier principal

preprint.pdf (1.1 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Mathieu Lavandier : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01690720

Soumis le : vendredi 6 décembre 2024-10:12:22

Dernière modification le : lundi 16 décembre 2024-17:00:03

Dates et versions

hal-01690720 , version 1 (06-12-2024)

Identifiants

HAL Id : hal-01690720 , version 1
DOI : 10.1016/j.heares.2016.11.016
PUBMEDCENTRAL : PMC5239743

Citer

Marion A. David, Mathieu Lavandier, Nicolas Grimault, Andrew J. Oxenham. Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency. Hearing Research, 2017, 344, pp.235 - 243. ⟨10.1016/j.heares.2016.11.016⟩. ⟨hal-01690720⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-LYON1 ENTPE INRA UDL INRAE ANR TOXALIM INRAEOCCITANIETOULOUSE TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP ENVT EIPURPAN

83 Consultations

0 Téléchargements

Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager