Medical Articles: Phonetics

Tuesday, 28 May 2019

Phonetics

Statistical distributions of consonant variants in infant-directed speech: Evidence that /t/ may be exceptional

Publication date: July 2019

Source: Journal of Phonetics, Volume 75

Author(s): Laura Dilley, Jessica Gamache, Yuanyuan Wang, Derek M. Houston, Tonya R. Bergeson

Abstract

Statistical distributions of phonetic variants in spoken language influence speech perception for both language learners and mature users. We theorized that patterns of phonetic variant processing of consonants demonstrated by adults might stem in part from patterns of early exposure to statistics of phonetic variants in infant-directed (ID) speech. In particular, we hypothesized that ID speech might involve greater proportions of canonical /t/ pronunciations compared to adult-directed (AD) speech in at least some phonological contexts. This possibility was tested using a corpus of spontaneous speech of mothers speaking to other adults, or to their typically-developing infant. Tokens of word-final alveolar stops – including /t/, /d/, and the nasal stop /n/ – were examined in assimilable contexts (i.e., those followed by a word-initial labial and/or velar); these were classified as canonical, assimilated, deleted, or glottalized. Results confirmed that there were significantly more canonical pronunciations in assimilable contexts in ID compared with AD speech, an effect which was driven by the phoneme /t/. These findings suggest that at least in phonological contexts involving possible assimilation, children are exposed to more canonical /t/ variant pronunciations than adults are. This raises the possibility that perceptual processing of canonical /t/ may be partly attributable to exposure to canonical /t/ variants in ID speech. Results support the need for further research into how statistics of variant pronunciations in early language input may shape speech processing across the lifespan.
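The comparison described in this abstract amounts to computing, for each speech register, the share of tokens labelled canonical. A minimal sketch of that tally follows, assuming each token is stored as a (register, variant) pair. The token list is hypothetical toy data for illustration only, not the study's corpus, and the function name `canonical_proportion` is likewise an assumption.

```python
from collections import Counter

# Hypothetical toy tokens: (register, variant label). Not data from the study.
TOY_TOKENS = [
    ("ID", "canonical"), ("ID", "canonical"), ("ID", "glottalized"), ("ID", "canonical"),
    ("AD", "assimilated"), ("AD", "canonical"), ("AD", "deleted"), ("AD", "glottalized"),
]

def canonical_proportion(tokens, register):
    """Return the fraction of tokens in `register` labelled 'canonical'."""
    labels = [label for reg, label in tokens if reg == register]
    if not labels:
        raise ValueError(f"no tokens for register {register!r}")
    counts = Counter(labels)
    return counts["canonical"] / len(labels)

if __name__ == "__main__":
    for reg in ("ID", "AD"):
        print(reg, canonical_proportion(TOY_TOKENS, reg))
```

A real analysis would also test whether the difference between registers is statistically significant, which this sketch does not attempt.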

Spontaneous nasalization after glottal consonants in Thai

Publication date: July 2019

Source: Journal of Phonetics, Volume 75

Author(s): Sarah E. Johnson, Marissa Barlaz, Ryan K. Shosted, Brad P. Sutton

Abstract

Spontaneous nasalization is the emergence of distinctive nasalization in contexts lacking an historical etymological nasal. In Thai, low and mid-low vowels nasalize after /h/ and to a lesser degree after /ʔ/. It has been reasoned that nasalization after /h/ may occur because breathiness and nasalization are acoustically similar; both introduce higher energy at low frequencies and increase spectral tilt. Glottal consonants may generally facilitate nasalization because aerodynamically they do not require velopharyngeal closure. We investigated velopharyngeal opening (VPO) during vowels after /h/ and /ʔ/ and measured spectral tilt (H1–H2). We measured VPO by processing oblique ultra-fast magnetic resonance images of the velopharyngeal port. Four Thai speakers exhibited a complex system of VPO that varied based on vowel height and preceding consonant. Low vowels after /h/ manifested more physiological nasalization than low vowels after /ʔ/, while the former were often produced with higher spectral tilt, which may be indicative of either increased breathiness or nasalization. While VPO is likely responsible for impressions of greater nasalization after /h/, our findings suggest that breathiness and VPO may interact in the spontaneously nasalized vowels of Thai.

Influence of coda stop features on perceived vowel duration

Publication date: July 2019

Source: Journal of Phonetics, Volume 75

Author(s): Chelsea Sanker

Abstract

Four experiments tested what cues contribute to English speakers' perception of vowel duration. Listeners categorized the duration of vowels as 'long' or 'short' for stimuli produced with voiced, voiceless, breathy voiced, or voiceless aspirated stop codas. Listeners demonstrated a strong ability to perceive vowel duration, though perception was continuous rather than categorical. There were several interacting factors influencing perceived vowel duration, based on expectations set by the presence of particular codas and also acoustic effects of the coda on the vowel. When the coda was removed, vowels that had been produced before voiced codas were perceived as longer than vowels produced before voiceless codas, though they exhibited the opposite effect when codas were present. Vowels were also perceived as longer when produced before breathy voiced stops, regardless of whether or not the stop was present. The steeper f0 falls associated with voiced codas within these stimuli likely contributed to the longer perceived duration of vowels from this environment; manipulating f0 contours eliminated effects of the original coda on perceived vowel duration. The effects of the production environment on perceived vowel duration suggest a possible perceptual pathway for the voicing effect on vowel duration.

Cue-shifting between acoustic cues: Evidence for directional asymmetry

Publication date: July 2019

Source: Journal of Phonetics, Volume 75

Author(s): Meng Yang, Megha Sundara

Abstract

Previous research shows that experience with co-varying cues is neither sufficient nor necessary for listeners to integrate them perceptually. Auditory Enhancement theorists explain this by positing that listeners integrate two cues more readily if the cues enhance each other's percept. To isolate the role of enhancement from that of experience, we forced English adult listeners to shift attention between two enhancing cues that they do not use phonemically, pitch and breathiness, by reversing the informativeness of the two cues in a cue weighting experiment. Listeners were able to shift attention from pitch to breathiness and vice versa if the two cues were in an enhancing relation. When this relationship was reversed, listeners could shift attention from pitch to breathiness but not in the opposite direction. Clearly, both the change in informativeness and the enhancing properties of the cues influenced the listeners' re-weighting of these cues. However, the directional asymmetry was not predicted. Moreover, the same asymmetry was observed in two new groups of listeners who have native language experience with either pitch or breathiness. We discuss the consequences of such asymmetric enhancement effects, arising from either processing limitations or articulatory contingencies, for language change.

Formant dynamics of Spanish vocalic sequences in related speakers: A forensic-voice-comparison investigation

Publication date: July 2019

Source: Journal of Phonetics, Volume 75

Author(s): Eugenia San Segundo, Junjie Yang

Abstract

This study investigates the dynamic acoustic properties of 19 vocalic sequences of Standard Peninsular Spanish, showing their potential for forensic voice comparison. Parametric curves (polynomials and discrete cosine transform) were fitted to the formant trajectories of the 19 Spanish vocalic sequences of 54 male speakers, comprising monozygotic (MZ) and dizygotic (DZ) twin pairs, non-twin brothers and unrelated speakers. Using the curve-fitting estimated coefficients as input to a multivariate-kernel-density formula, cross-validated likelihood ratios were calculated to express the probability of obtaining the observed difference between two speech samples under the hypothesis that the samples were produced by the same speaker and under the hypothesis that they were produced by a different speaker. The results show that the best-performing system is one that fuses the 19 vocalic sequences with a geometric-mean fusion method. When challenging the system with related speakers, the results show that MZ twin pairs affect performance but, more importantly, that non-twin sibling pairs can deteriorate performance too. This suggests that more investigations are necessary into a range of similar-sounding speakers beyond MZ twins. Several nurture aspects are highlighted as explanatory factors for the strikingly high similarity of a specific non-twin sibling pair.
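Two steps named in this abstract can be illustrated with a short sketch: summarising a formant trajectory with a few discrete cosine transform (DCT) coefficients, and fusing per-sequence likelihood ratios with a geometric mean. The sketch rests on assumptions. It uses a plain DCT-II scaled so that coefficient 0 is the trajectory mean, it reads "geometric-mean fusion" as the geometric mean of the individual likelihood ratios, and both function names are hypothetical. The multivariate-kernel-density step that produces the likelihood ratios is not shown.

```python
import math

def dct_coefficients(trajectory, n_coeffs):
    """Return the first n_coeffs DCT-II coefficients of a formant trajectory.

    Scaling is chosen (an assumption of this sketch) so that
    coefficient 0 equals the mean of the trajectory.
    """
    n = len(trajectory)
    coeffs = []
    for k in range(n_coeffs):
        total = sum(
            x * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, x in enumerate(trajectory)
        )
        scale = 1.0 / n if k == 0 else 2.0 / n
        coeffs.append(total * scale)
    return coeffs

def geometric_mean_fusion(likelihood_ratios):
    """Fuse positive likelihood ratios by taking their geometric mean."""
    if not likelihood_ratios:
        raise ValueError("need at least one likelihood ratio")
    # Averaging in the log domain avoids overflow with many large ratios.
    log_sum = sum(math.log(lr) for lr in likelihood_ratios)
    return math.exp(log_sum / len(likelihood_ratios))

if __name__ == "__main__":
    # Hypothetical F2 values (Hz) sampled along one vocalic sequence.
    f2 = [1200.0, 1350.0, 1500.0, 1650.0, 1800.0]
    print(dct_coefficients(f2, 3))
    # Hypothetical likelihood ratios from three vocalic sequences.
    print(geometric_mean_fusion([2.0, 8.0, 4.0]))
```

A likelihood ratio above 1 favours the same-speaker hypothesis and one below 1 favours the different-speaker hypothesis. Averaging in the log domain means a single extreme sequence does not dominate the fused value the way it would in a plain product.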

Alignment of f0 peak in different pitch accent types affects perception of metrical stress

Publication date: May 2019