Linguistic Phonetics Fall 2005

Similar documents
Consonants: articulation and transcription

Phonetics. The Sound of Language

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

age, Speech and Hearii

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

Proceedings of Meetings on Acoustics

Mandarin Lexical Tone Recognition: The Gating Paradigm

Consonant-Vowel Unity in Element Theory*

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

Body-Conducted Speech Recognition and its Application to Speech Support System

THE RECOGNITION OF SPEECH BY MACHINE

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Segregation of Unvoiced Speech from Nonspeech Interference

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Radical CV Phonology: the locational gesture *

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

Audible and visible speech

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

A comparison of spectral smoothing methods for segment concatenation based speech synthesis

Evaluation of Various Methods to Calculate the EGG Contact Quotient

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions

Speaker Recognition. Speaker Diarization and Identification

Phonological and Phonetic Representations: The Case of Neutralization

Universal contrastive analysis as a learning principle in CAPT

Speech Emotion Recognition Using Support Vector Machine

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

On the Formation of Phoneme Categories in DNN Acoustic Models

Beginning primarily with the investigations of Zimmermann (1980a),

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Speaking Rate and Speech Movement Velocity Profiles

9 Sound recordings: acoustic and articulatory data

Quarterly Progress and Status Report. Sound symbolism in deictic words

Contrasting English Phonology and Nigerian English Phonology

Rhythm-typology revisited.

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

WHEN THERE IS A mismatch between the acoustic

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

Markedness and Complex Stops: Evidence from Simplification Processes 1. Nick Danis Rutgers University

VIEW: An Assessment of Problem Solving Style

Speaker recognition using universal background model on YOHO database

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

Klaus Zuberbühler c) School of Psychology, University of St. Andrews, St. Andrews, Fife KY16 9JU, Scotland, United Kingdom

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

Different Task Type and the Perception of the English Interdental Fricatives

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen

Summary results (year 1-3)

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

Complexity in Second Language Phonology Acquisition

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Manner assimilation in Uyghur

Word Stress and Intonation: Introduction

Expressive speech synthesis: a review

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

On the Combined Behavior of Autonomous Resource Management Agents

U IVERSIDADE FEDERAL DE SA TA CATARI A PROGRAMA DE PÓS-GRADUAÇÃO EM LETRAS/I GLÊS E LITERATURA CORRESPO DE TE. Mariane Antero Alves

Seminar - Organic Computing

Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions

Collecting dialect data and making use of them an interim report from Swedia 2000

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

Progress Monitoring for Behavior: Data Collection Methods & Procedures

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Human Factors Engineering Design and Evaluation Checklist

Phonological Processing for Urdu Text to Speech System

Speaker Identification by Comparison of Smart Methods. Abstract

Online Publication Date: 01 May 1981 PLEASE SCROLL DOWN FOR ARTICLE

Software Maintenance

D Road Maps 6. A Guide to Learning System Dynamics. System Dynamics in Education Project

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Probability and Statistics Curriculum Pacing Guide

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Voice conversion through vector quantization

Learning Methods in Multilingual Speech Recognition

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

5. Margi (Chadic, Nigeria): H, L, R (Williams 1973, Hoffmann 1963)

PDF hosted at the Radboud Repository of the Radboud University Nijmegen

Grade 6: Correlated to AGS Basic Math Skills

Major Milestones, Team Activities, and Individual Deliverables

CSC200: Lecture 4. Allan Borodin

Learners Use Word-Level Statistics in Phonetic Category Acquisition

INTRODUCTION J. Acoust. Soc. Am. 102 (3), September /97/102(3)/1891/7/$ Acoustical Society of America 1891

Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5

Language Change: Progress or Decay?

NCEO Technical Report 27

The Strong Minimalist Thesis and Bounded Optimality

Transcription:

MIT OpenCourseWare http://ocw.mit.edu 24.963 Linguistic Phonetics Fall 25 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

24.963 Linguistic Phonetics Quantal Theory Acoustic parameter I II III Articulatory parameter Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Reading for week 7: Johnson chapters 7 and 8. Assignments: 3rd acoustics assignment

Quantal Theory Quantal relationship between articulatory and acoustic parameters (Stevens 1972, 1989, etc) The acoustic difference between I and III is large - qualitatively different (Johnson s example: glottal aperture and voicing). The acoustic parameter is relatively insensitive to change in the articulatory parameter within regions I and II, hence: articulation need not be precise. continuous movement through the region will yield acoustic steady states. Acoustic parameter I II III Articulatory parameter Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Quantal Theory Claim: Linguistic contrasts involve differences between regions I and III. More specifically, quantal relations provide the basis for distinctive features: The articulatory and acoustic attributes that occur within the plateau-like regions of the relations are, in effect, the correlates of the distinctive features (p.5)

Voicing and glottal aperture This example is not Stevens s, but it s a nice illustration of the insight behind the quantal theory and the potential complications it faces: Gradual change in articulatory parameters can result in abrupt, qualitative change in acoustic output - voicing is qualitatively different from voicelessness. Glottal aperture is only one of many parameters that affects voicing - glottal tension and pressure drop across the glottis are relevant also. How does this affect the identification of quantal regions (particularly as a basis for features)? Languages also contrast breathy vs. modal voice vs. creaky voice. Are these quantal distinctions?

Voicing and glottal aperture Glottal aperture is only one of many parameters that affects voicing - glottal tension and pressure drop across the glottis are relevant also. How does this affect the identification of quantal regions (particularly as a basis for features)?.6 4 Phonation threshold pressure, P th (kpa).5.4 Oscillation onset Oscillation offset P L (Pa) 2 2 1 3 4.3-1 1 2 3 Prephonatory glottal halfwidth, ξ o (mm).5.1 ξ o (cm) Image by MIT OpenCourseWare. Adapted from Titze, Ingo R., Sheila S. Schmidt, Image by MIT OpenCourseWare. Adapted from Lucero, J. C. The Minimum and Michael R. Titze. Phonation Threshold Pressure in a Physical Lung Pressure to Sustain Vocal Fold Oscillation. Journal of the Model of the Vocal Fold Mucosa. Journal of the Acoustical Society of Acoustical Society of America 98 (1995): 779-784. America 97 (1995).

Quantal theory applied to vowels Regions of stability (quantal regions) for vowel formant frequencies occur where two formants converge. 4 4 3 3 Frequency 2 A c=.5 cm 2 Frequency 2 A =.5 cm 2 1 1.2 1 A = 1 2 4 6 8 Length of back cavity 1 12 14 2 4 6 8 1 Length of back cavity 12 14 A 1 A c A 2 A 2 A 1 l 1 l c l 2 1 2 Images by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Quantal theory applied to vowels The three most common vowels cross-linguistically are [i. a. u]. Stevens argues that these are quantal vowels. Frequency 4 3 2 1 A c=.5 cm 2.2 High front [i] is produced at the convergence of F2 and F3 created by a narrowconstriction in the palatal region. 2 4 6 8 Length of back cavity 1 12 14 A 1 A c A 2 l 1 l c l 2 Images by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Quantal theory applied to vowels Regions of stability (quantal regions) for vowel formant frequencies occur where two formants converge. 4 Low [#] is produced at the convergence of F1 and F2 created by a narrow back cavity and a wide front cavity of equal length. Frequency 3 2 1 A = 1 A =.5 cm 2 1 2 4 6 8 1 12 14 Length of back cavity Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46. A 1 A 2 1 2 Image by MIT OpenCourseWare.

Quantal theory applied to vowels Regions of stability (quantal regions) for vowel formant frequencies occur where two formants converge. Frequency (khz) 4 3 2 High back rounded [u] is produced near a minimum in F2, in a region where F1 isrelatively stable. 1 2 4 6 8 1 Length of back cavity (cm) 12 14 Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Quantal theory applied to vowels Are all convergences of F1 & F2 or F2 & F3 quantal regions? 4 Some are not anatomically feasible - e.g. convergence of F2 and F3 at 12cm. Frequency 3 2 A =.5 cm 2 1 Convergence of F2 and F3 at 4cm is said to be quantal vowel [3]. 1 2 4 A = 1 6 8 1 Length of back cavity 12 14 Note that this vowel is crosslinguistically relatively unusual. A 1 A 2 1 2 Images by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Quantal theory applied to vowels Are mid vowels [e, o] quantal? We have suggested that the difference between high and mid vowels can be modeled as an increase in the area of the constriction. What is the effect on the formants? c 2 A c c A c ΔF = 2π 2 l c l 2 F n X A, F 1 = 2π Vl c A 1 A c A 2 l 1 l c l 2 Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Stability with respect to multiple parameters There is no quantal relationship between constriction area and formant frequencies In fact formants are maximally sensitive to constriction area at the points of formant stability. Is a quantal relationship between one articulatory parameter and one acoustic parameter sufficient? Stevens seems concerned about this case - argues that: although there is no minimum, the relationship between formants and constriction area is a shallow slope. there may be non-monotonicity in the relationship between muscle atcivity and constriction area (p.15).

F f = F b c 4l f c 2l b Stability with respect to multiple parameters What is the effect on F2 and F3 of varying constriction length? Consider the configuration where F2 and F3 converge. = A 1 A c A 2 l 1 l c l 2 Cavity length (cm) 1 5 Back cavity Front cavity Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46. Frequency (khz) 2.5 2. F 3 F 2 2 3 4 5 Constriction length, l c (cm) Image by MIT OpenCourseWare. Adapted from Lindblom, B., and O. Engstrand. "In what Sense is Speech Quantal?" Journal of Phonetics 17 (1989): 17-121.

Stability with respect to multiple parameters Vowel formants vary monotonically with degree of lip constriction. How undesirable is articulatory precision? Languages do not appear to take full advantage of the imprecision that quantal regions allow - e.g. differences between Danish and English [i]. Note that Stevens has recently suggested that the quantal distinction between front and back vowels is based on the frequency of F2 relative to the first sub-glottal zero, not on convergence of F2 with F1/F3.

Lindblom s Theory of Adaptive Dispersion Liljencrants and Lindblom (1972), Lindblom 1986, 199a,b An alternative explanation for the cross-linguistic preference for the vowels [i, a, u]: these vowels are at the extremes of the formant space of physiologically possible vowels. These vowels are maximally distinct from each other and therefore less likely to be confused by a listener.

Lindblom s Theory of Adaptive Dispersion A shift in perspective: preferred systems of contrasts vs. preferred sounds. There are many generalizations about possible inventories of contrasting sounds.

Lindblom s Theory of Adaptive Dispersion Common vowel inventories: i u i u i u e o e o a a a Arabic, Spanish, Italian, Nyangumata, Swahili, Yoruba, Aleut, etc. Cherokee, etc. Tunica, etc. Unattested vowel inventories: i i i u e e a a ɔ

Lindblom s Theory of Adaptive Dispersion Lindblom s approach takes these generalizations as prior to generalizations about sounds: a preferred speech sound is one that appears in many preferred inventories. Specifically, sounds in a language are selected so as to best satisfy requirements that derive from the communicative function of language: Maximize perceptual distinctiveness Minimize effort

Liljencrants and Lindblom (1972) The role of perceptual contrast in predicting vowel inventories. The space of articulatorily possible vowels: Third Formant (F 3 ) khz 1.5 2. 3. 4. Second Formant (M 2 ) 2.5 1.5 1..5 MEL 15 2 Third Formant (F ) 3 MEL 15 1 5 First Formant (F 1 ) khz 2.5.5.75 2.5 1.5 1..5 MEL 25 5 75 First Formant (F 1 ) MEL 15 1 5 Second Formant (M 2 ) Images by MIT OpenCourseWare. Adapted from Liljencrants, Johan, and Bjorn Lindblom. Numerical Simulation of Vowel Quality Systems: The Role of Perceptual Contrast. Language 48, no. 4 (December 1972): 839-862.

Liljencrants and Lindblom (1972) Perceptual distinctiveness of contrast between V i and V j : distance between vowels in perceptual vowel space r ij = (x i x j ) 2 +(y i y j ) 2 where x n is F2 of V n in mel y n is F1 of V n in mel Maximize distinctiveness: select N vowels so as to minimize E E = n 1 i =1 i 1 j = 1 r ij 2

Predicted optimal inventories Reasonable approximations to typical 3 and 5 vowel inventories are derived. Preference for [i, a, u] is derived. Problem: Too many high, nonperipheral vowels. Not enough mid non-peripheral vowels. Second Formant (khz) 2.5 2. 1.5 1..5 2.5 2. 1.5 1..5 2.5 2. 1.5 1. 3 4 5 6 7 8 9 1 11 12.5.2.4.6.8.2.4.6.8.2.4.6.8.2.4.6.8 First Formant (khz) Image by MIT OpenCourseWare. Adapted from Liljencrants, Johan, and Bjorn Lindblom. Numerical Simulation of Vowel Quality Systems: The Role of Perceptual Contrast. Language 48, no. 4 (December 1972): 839-862.

Liljencrants and Lindblom (1972) The excess of central vowels arise because measuring distinctiveness in terms of distance in formant space gives too much weight to differences in F2 (even after mel scaling). Recent work by Diehl, Lindblom and Creeger (23) suggests that the greater perceptual significance of F1 probably follows from the higher intensity of F1 relative to F2. Second formant frequency (khz) 2.5 2. 1.5 1. 7 7.5.2.4.6.8.2.4.6.8 First formant frequency (khz) Image by MIT OpenCourseWare. Adatped from Diehl, R. L., B. Lindblom, and C. P. Creeger. "Increasing Realism of Auditory Representations Yields Further Insights into Vowel Phonetics." Proceedings of the 15th International Congress of Phonetic Sciences. Vol. 2. Adelaide, Australia: Causal Publications, 23, pp.1381-1384.

Liljencrants and Lindblom (1972) The absence of interior vowels [, ø] is a result of the way in which overall distinctiveness is calculated. Each vowel contributes to E based on its distance from every other vowel. Interior vowels have a high cost because they are relatively close to all the peripheral vowels. One possible alternative is to maximize the minimum distance (Flemming 25).

Problems with Adaptive Dispersion Specific instantiations of the model have made specific incorrect predictions (but some of the broad predictions are correct and models are improving). The model answers an inobvious question: Given N vowels, what should they be? - what determines the size of inventories? TAD predicts a single best inventory for each inventory size. Why would languages have sub-optimal inventories?

24.963 Linguistic Phonetics Source-filter analysis of fricatives

Noise source Turbulence noise - random pressure fluctuations. Turbulence can result when a jet of air flows out of a constriction into a wider channel (or open space). Relative level (db) 3 2 1.5 1 2 3 5 1 Frequency (khz) Image by MIT OpenCourseWare. Adapted from Stevens, K. N. On the Quantal Nature of Speech. Journal of Phonetics 17 (1989): 3-46.

Noise source Turbulence can result when a jet of air flows out of a constriction into a wider channel (or open space). The intensity of turbulence noise depends on particle velocity. For a given volume velocity, particle velocity will be greater if the channel is narrower, so for a given volume velocity, narrower constrictions yield louder frication noise. source at glottis source at glottis Relative Level (db) 2 1 source at supraglottal constriction source at supraglottal constriction Ag =.2 cm 2 Ag =.3 cm 2.1.2.3.4.5.1.2.3.4.5 Area of Supraglottal Constriction A c (cm 2 ) Image by MIT OpenCourseWare. Stevens, K. N. Acoustics Phonetics. Cambridge, MA: MIT Press, 1999.

Noise source Turbulence is also produced when an airstream strikes an obstacle (e.g. the teeth in [s]). The orientation of the obstacle to the direction of flow affects the amount of turbulence produced - the teeth are more or less perpendicular to the airflow in [s] and thus produce significant turbulence. The louder noise of strident fricatives is a result of downstream obstacles. Image by MIT OpenCourseWare. Stevens, K. N. Acoustic Phonetics. Cambridge, MA: MIT Press, 1999.

Filter characteristics The noise sources are filtered by the cavity in front of the constriction. In [h] the noise source is at the glottis, so the entire supralaryngeal vocal tract filters the source, just as in a vowel. So [h] has formants at the same frequency as a vowel with the same vocal tract shape, but the formants are excited by a noise source instead of voicing. The noise source generated at the glottis has lower intensity at low frequencies, so F1 generally has low intensity in [h]. 7 SPL in 3 - Hz Bands (db re.2 dyne/cm 2 ) 6 5 4 3 Periodic source Overall noise source 2 1 2 3 4 5 Frequency (khz) Image by MIT OpenCourseWare.

[h] The noise source generated at the glottis has lower intensity at low frequencies, so F1 generally has low intensity in [h]. SPL in 3 - Hz Bands (db re.2 dyne/cm 2 ) 7 6 5 4 3 Periodic source Overall noise source 2 1 2 3 4 5 Frequency (khz) Image by MIT OpenCourseWare. MAG (db) MAG (db) 6 4 2 6 4 2 [e] [h] [h] [o] [he] [ho] 1 2 3 4 FREQ (khz) Image by MIT OpenCourseWare. Stevens, K. N. Acoustic Phonetics. Cambridge, MA: MIT Press, 1999.

[h] 5 5 heed hoed 13.686 14.1873 17.5452 18.68 5 Time (s) 5 Time (s) Hoyd hoard 15.467 15.9137 Time (s) 19.5239 2.263 Time (s)

Filter characteristics As the place of articulation shifts forward, the cavity in front of the noise source is progressively smaller. A smaller cavity has higher resonances, so other things being equal, the concentration of energy in the fricative spectrum is higher the closer the place of articulation is to the lips. 6 x s 4 Frequency (khz) 4 2 6 Relative level (db) 3 2 1 5.9 3.5 2.2 1.5 4 2-2 s 1 2 3 4 6 Frequency (khz) Image by MIT OpenCourseWare. Image by MIT OpenCourseWare. Adapted from Stevens, K. N. "On the Quantal Nature of Speech." Journal of Phonetics 17 (1989): 3-46.

Filter characteristics The front cavity of a labial is so short (first resonance ~1 khz) that it has little effect on the fricative spectrum, resulting in fricative noise spread over a wide range of frequencies with a broad low-frequency peak. This picture can be complicated by acoustic coupling with back cavity. 5 2 2 8 Frequency (Hz) 12.111 12.7989 Time (s)

Filter characteristics Lip rounding lowers the resonant frequencies of the front cavity, just as in vowels. In coronals, the presence or absence of a sublingual cavity has a significant effect on the size of the front cavity. 6 4 S 2 db khz 2 4 6 8 1 khz 2 4 6 8 1 Image by MIT OpenCourseWare.

Source-filter analysis of stops 4 3 2 1 i b d g Image by MIT OpenCourseWare.

Stops Stops are complicated in that they involve a series of rapid changes in acoustic properties, but each component can be analyzed in similar terms to vowels and fricatives. A stop can consist of four phases: implosion (closure) transitions - closure - burst - release transitions

Closure Only source of sound is voicing, propagated through the walls of the vocal tract. The walls of the vocal tract resonate at low frequencies, so only low-freqeuncy sound is transmitted ( voice bar ).

Burst Consists of a transient, due to abrupt increase in pressure at release, followed by a short period of frication as air flows at high velocity through the narrow (but widening) constriction. Transient source is an impulse (flat spectrum) filtered by the front cavity. The frication is essentially the same as a fricative made at the same place of articulation. Alveolars have high freqeuncy, high intensity bursts. Velar bursts are concentrated at the frequency of F2 and/or F3 at release. Labial bursts are of low intensity, with energy over a wide range of freqeuncies, with a broad, low-frequency peak.

Release transitions As the constriction becomes more open, frication ceases. The source at this time is at the glottis - either voicing or aspiration noise. This source excites the entire vocal tract as in a vowel (or [h]). The shape of the vocal tract, and thus the formants, during this phase are basically determined by the location of the stop constriction and the quality of adjacent vowels. The formants move rapidly as the articulators move from the position of the stop to the position for the vowel. The formant movements are usually called formant transitions.

4 3 b d g 2 1 i 4 3 2 1 4 3 2 1 4 3 2 1 a 4 3 2 1 u ms 5 1 15 ms 5 1 15 ms 5 1 15 Image by MIT OpenCourseWare. Adapted from Ladefoged, Peter. Phonetic Data Analysis. Malden, MA: Blackwell, 23.

Release transitions In alveolar stops the formant transitions due to the tongue tip constriction are probably very rapid (Manuel and Stevens 1995), so the observed formant transitions appear to be due to tongue body movements. The tongue body is generally relatively front to facilitate placement of the tongue tip//blade, thus there is a relatively high F2 at release (~18-2 Hz) and high F3. Labial stops involve a constriction at the lips. The tongue position is determined by adjacent vowels, so the exact formant frequencies at release depend on these vowel qualities. The labial constriction always lowers formants, so F2 and F3 are generally lower at release of a labial than in the following vowel.

Release transitions Velar stops involve a dorsal constriction, but the exact location of this constriction depends on the neighbouring vowels. So the formant transitions of velars vary substantially, approximately tracking F2 of the adjacent vowel. F2 and F3 are often said to converge at velar closure. Under what conditions should this occur? Similar transitions are observed during the formation of a stop closure. Similar transitions are observed into and out of any consonant with a narrow constriction, e.g. fricatives, nasal stops.

Locus equations Typically F2 at the release of a consonant is a linear function of F2 at the midpoint of the adjacent vowel (Lindblom 1963, Klatt 1987, etc). The slope and intercept of this function depend on the consonant. 5 bid 5 b d 1.96743 2.4278 Time (s) 11.3647 11.8142 Time (s)

Locus equations The slope and intercept of this function depend on the consonant. a. c. /b/ F2 onset (Hz) y = 228.32 +.79963x R^2 =.968 y = 778.64 +.71259x R^2 =.93 28 26 24 22 2 18 16 14 12 1 1 14 18 22 26 3 28 26 24 22 2 18 16 14 12 1 1 14 18 22 26 3 /b/ F2 vowel (Hz) /g/ F2 vowel (Hz) /b/ F2 onset (Hz) b. /d/ F2 onset (Hz) y = 199.2 +.47767x R^2 =.954 28 26 24 22 2 18 16 14 12 1 1 14 18 22 26 3 /d/ F2 vowel (Hz) Image by MIT OpenCourseWare. Adpated from Fowler, C. A. Invariants, Specifiers, Cues: An Investigation of Locus Equations as Information for Place of Articulation. Perception and Psychophysics 55, no. 6 (1994): 597-61.

Affricates The frication portion of the release of the stop is prolonged to form a full-fledged fricative. The fricative portion of an affricate is distinguished from a regular fricative by its shorter duration, and perhaps by the rapid increase in intensity at its onset (short rise time).