Phonetics. The Sound of Language

Similar documents
Consonants: articulation and transcription

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Consonant-Vowel Unity in Element Theory*

Universal contrastive analysis as a learning principle in CAPT

Contrasting English Phonology and Nigerian English Phonology

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

Audible and visible speech

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions

On the Formation of Phoneme Categories in DNN Acoustic Models

Radical CV Phonology: the locational gesture *

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Word Stress and Intonation: Introduction

age, Speech and Hearii

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

Language Change: Progress or Decay?

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

Phonological and Phonetic Representations: The Case of Neutralization

THE RECOGNITION OF SPEECH BY MACHINE

Proceedings of Meetings on Acoustics

Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

9 Sound recordings: acoustic and articulatory data

Quarterly Progress and Status Report. Sound symbolism in deictic words

First Grade Curriculum Highlights: In alignment with the Common Core Standards

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

The Indian English of Tibeto-Burman language speakers*

Journal of Phonetics

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Rhythm-typology revisited.

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

Complexity in Second Language Phonology Acquisition

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

On the nature of voicing assimilation(s)

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

NIH Public Access Author Manuscript Lang Speech. Author manuscript; available in PMC 2011 January 1.

Clinical Application of the Mean Babbling Level and Syllable Structure Level

U IVERSIDADE FEDERAL DE SA TA CATARI A PROGRAMA DE PÓS-GRADUAÇÃO EM LETRAS/I GLÊS E LITERATURA CORRESPO DE TE. Mariane Antero Alves

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

Taught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,

DIBELS Next BENCHMARK ASSESSMENTS

Edinburgh Research Explorer

Speaker Identification by Comparison of Smart Methods. Abstract

Speaker Recognition. Speaker Diarization and Identification

Demonstration of problems of lexical stress on the pronunciation Turkish English teachers and teacher trainees by computer

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

Perceived speech rate: the effects of. articulation rate and speaking style in spontaneous speech. Jacques Koreman. Saarland University

Case study Norway case 1

Multilingual Speech Data Collection for the Assessment of Pronunciation and Prosody in a Language Learning System

Mandarin Lexical Tone Recognition: The Gating Paradigm

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Speaking Rate and Speech Movement Velocity Profiles

Different Task Type and the Perception of the English Interdental Fricatives

Beginning primarily with the investigations of Zimmermann (1980a),

Speech/Language Pathology Plan of Treatment

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

Patricia Velasco, Ed.D. Bilingual Education Program Queens College, CUNY November 1, 2016

Speech Emotion Recognition Using Support Vector Machine

Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Learning Methods in Multilingual Speech Recognition

EUROPEAN DAY OF LANGUAGES

A Believable Accent: The Phonology of the Pink Panther

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Sounds of Infant-Directed Vocabulary: Learned from Infants Speech or Part of Linguistic Knowledge?

A Neural Network GUI Tested on Text-To-Phoneme Mapping

GEMINATION STRATEGIES IN L1 AND ENGLISH PRONUNCIATION OF POLISH LEARNERS

Florida Reading Endorsement Alignment Matrix Competency 1

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Body-Conducted Speech Recognition and its Application to Speech Support System

Markedness and Complex Stops: Evidence from Simplification Processes 1. Nick Danis Rutgers University

CEFR Overall Illustrative English Proficiency Scales

Manner assimilation in Uyghur

Career Series Interview with Dr. Dan Costa, a National Program Director for the EPA

Automatic intonation assessment for computer aided language learning

English Language and Applied Linguistics. Module Descriptions 2017/18

UKLO Round Advanced solutions and marking schemes. 6 The long and short of English verbs [15 marks]

CROSS-LANGUAGE MAPPING FOR SMALL-VOCABULARY ASR IN UNDER-RESOURCED LANGUAGES: INVESTIGATING THE IMPACT OF SOURCE LANGUAGE CHOICE

Procedia - Social and Behavioral Sciences 146 ( 2014 )

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

L1 Influence on L2 Intonation in Russian Speakers of English

Similarity Avoidance in the Proto-Indo-European Root

GENERAL COMMENTS Some students performed well on the 2013 Tamil written examination. However, there were some who did not perform well.

Underlying Representations

Transcription:

Phonetics. The Sound of Language 1

The Description of Sounds Fromkin & Rodman: An Introduction to Language. Fort Worth etc., Harcourt Brace Jovanovich Read: Chapter 5, (p. 176ff.) (or the corresponding chapter) Phonetics: The Sounds of Language 2

The Concepts "Phonetics" und "Phonology" Phonetics Spoken language (substance) Articulation Acoustic signal Perception Phonology The sound system (abstract units) Sound regularities (which sounds; sound patterns) What are Phonetis und Phonology They are not completely separable because each depends on the other. Looking at the nature of the sounds of (a) language without taking into consideration that they work as the basis of a system of communication is meaningless. Looking at the sound system of a language without knowing is behind it in terms of articulation, acoustics and perception is useless. In principle, phonetics deals with concrete utterances, whereas phonology deals with the systematic workings of the structure (i.e., it is more abstract) Plan: We start by explaining how phonetic events can be described. Then we shall discuss the relationship between phonetic and phonological description. Finally (tomorrow), we shall deal with some principles of phonological description. 3

Areas of phonetics Speech production Speech acoustics Speech perception 4

Basic questions in phonetics I. What do we do to produce speech sounds (and utterances made up of speech sounds)? II. How can we describe the sounds? (phonetic classification und sound symbols) III. How can we describe the melodic und rhythmic aspects? (utterances are not just a sequence of sounds) 5

How do we produce speech sounds? i) We produce an airstream (Energy source) ii) iii) We transform the airstream (kinetic energy) into acoustic energy (Excitation signal) We modifiy the excitation signal, to produce different speech sounds (Speech signal) (Differences = information different words) 6

i) Airstream Excess pressure in the lungs results in an egressive pulmonic airstream, Other types of airstream are also used in speech: - egressive glottal airstream (Ejectives) - ingressive glottal airstream (Implosives) - ingressive "velic" airstream (Clicks) I.i) Airstream Excess pressure in the lungs results in an egressive pulmonic airstream, which is the normal airstream for speaking, (normal for German, English, French, etc.) but it is not the only airstream used in the languages of the world: - egressive glottale airstream (Ejectives) (The pressure is built up by the upward movement of the closed larynx) - ingressive glottale airstream (Implosives) (The pressure is reduced by the downward movement of the closed larynx) - ingressive "velic airstream (clicks) (The pressure is reduced by increasing the size of the cavity between tongue and palate). 7

ii) Excitation Transformation of kinetic energy into acoustic energy (= excitation) - At the glottis (=gap between the vocal folds): vocal fold vibration (= voicing; phonation) - At a point of constriction somewhere in the vocal tract (= noise, friction) - When a closure in the mouth is released (= release impulse, explosion) I.ii) Excitation The airstream (kinetic energy = energy in the laminar movement of air particles) is transformed into acoustic energy (vibrating air particles) - At the glottis: Vocal fold vibration. (= voicing; phonation) defining the class of sounds we call (= Vokale (i, a, u, usw.), Sonoranten (m, n, l, r, usw.) - If the vocal folds don t vibrate, the excitation has to take place in a different way. - AN articulatory constriction somewhere in the mouth or throat makes the airstream turbulent = acoustic energy perceived as friction noise.. ( = fricatives (f, s, sch, ch etc.)) - An articulatory occlusion (blockage) leads to a build-up of air-pressure in the mouth. The quick release of the stoppage causes a small explosion as the excess air escapes. (= plosives (p, t, k etc.) 8

iii) Modification How is the excitation signal modified to produce different speech sounds? By shaping the vocal tract. E.g. acoustic filtering which changes the quality ( colour of the sound ) [i] [y] (change of lip shape), [u] [y] (change of tongue position) I.iii) Modification The excitation (the sound produced) can be modified further because different sizes and shapes of the cavities in the mouth have different resonance properties. So they filter the sound and colour it. 9

How can we describe sounds? Consonants Excitation type (±voiced), e.g. [s z] Manner of articulation (from closure to narrow constriction to almost vowel-like), e.g. [b, v, w] Place of articulation (from the lips to the Glottis), e.g. [p, t, k, ] Vowels Degree of opening (tongue height/jaw opening), Tongue position (from front to back), Lip shape (rounded, neutral, spread) II. How can we describe speech sounds? The first distinction we make is between vowels and consonants: Consonants: We have already mentioned different types of excitation (voiced and voiceless); that gives us one Classification criterion. When we described excitation, we mentioned the narrow constriction (which produces friction) und complete closures (which leads to an explosion when released). These are two of the several Manners of articulation, which result in different types of sound, thus giving us a second criterion for classification.: Plosives, Fricatives, Sonorants (Nasals, Trills, Approximants = Laterals, Glides), The place in the mouth where the articulation takes place provides the third criterion: Place of Articulation. Different places of articulation change the shape of the oral cavities, i.e. of the resonances (filter) and differentiates the colour of the sounds produced: There are labial (lips), dental (teeth), alveolar (upper teeth ridge), palatal (hard palate), velar (soft palate), uvular (uvula), pharyngeal (throat), glottal (vocal folds) sounds. The distinction voiced vs. voiceless doesn t occur with the sonorant consonants (nasals, liquids, approximants). They are, by definition, always voiced. Vowels are also always voiced. In addition, they have no point of contact, so that the normal distinction according to manner of articulation isn t used. Since they are all vocalic they all belong to the same manner of articulation. But vowels are still differentiated and categorized according to similar principles: Similar to the manner of articulation, we have closed and open vowels. Similar to der place of articulation, we have front and back vowels (and central vowels between them) Also, vowels can be produced with rounded or unrounded lips, more or less independent of what the tongue is doing. They can also be produced short and long. 10

Places of Articulation Hard palate Alveolar ridge Lips Soft palate (velum) Uvula Tongue Teeth (dental) Vocal folds This sagittal cross-section of someone s head (x-ray on the left, schematic line drawing on the right) is the traditional way of portraying our vocal tract. You need to learn the places of articulation that are labeled. The organs of speech are also given in the book. The articulators are essentially the bottom lip and the tongue (which can move both independently and with the jaw to get close to (or make contact with) various places of articulation). Being two-dimensional, it doesn t show anything of the teeth except the incisors, nor does it show much of the complexity of the tongue shape. 11

Post-alveolar Upper boundary: Place of articulation Lower boundary: Active articulator 12

German Consonants Manner/place lab. alv. p-alv. pal. vel. uvul. glot. Plos: pb td Affric: pf ts Fric: Nas.son: Approx: kg ( ) t ( ) German consonants An additional, more complex category of sound is quite common in German, namely the affricate, which is a combination of a stop + a fricative. It is also described as a stop with a slow, fricative release. 13

English Consonants Man./place lab. dent. alv. p-alv. pal. vel. uvul. glot. Plos: pb td t Affric: Fric: kg Nas.son: Approx: English consonants English has less affricates than German, the fricatives are more fronted (additional dental fricatives and no palatal, velar or uvular fricatives), And English has more approximants than German 14

Vowels... Do not have a clear place of contact (constriction or closure) The tongue body (dorsum) changes shape and moves its centre of gravity around the vocal tract. 15

Traditional classificaton Acoustically important 16

German Vowels front central back close close-mid open-mid ( :) open German Vowels German vowels are distributed very systematically around the vowel space long vowels: : i: (bieten), y: (Tüte) short vowels: (bitten) Y (Hütte) (Busch) u: (tuten) long vowels e: (beten), : (Höhle) unstressed central vowel: (bitte) o: (boten Short vowels (Betten) (long vowel) : (bäten) (Socken) (bitter) a: (baten) open short and long a (hatten) Different languages often use the same symbols for their vowel descriptions without the vowels in question necessarily having the same quality. A transcription like that is called a broad transcription Example: German: "bieten" /bi:tn/ vs. English: "beaten" /bi:tn/ German also has 3 diphthongs (long vowels with changing quality) : as in Wein, as in Haus, and as in Heu 17

English Vowels front close close-mid open-mid open central back English vowles: English vowels are less systematically distributed. And there are less rounded vowels. Long: short: i: (beaten), short: long e (bet) short (bat) u: (tooting) (bush) (bitten) : (bitter) long (cut) (law) (hot) : (hard) English has similar diphthongs to German ( as in wine, as in house, and as in boy. It also has some diphthongs which start close to the quality of German long monophthongs: as in eight, late, bay, stay, etc. as in coat, bowl, note, etc. 18

(See www2.arts.gla.ac.uk/ipa/) 19

Do we really need transcription? Orthography cannot capture how we pronounce things. Transcription allows us to record deviations from standard pronunciations If we are only interested in e.g. speech recognition, or speech synthesis, we need to know how things are actually said. 20

Hearing what's said vs. listening to how it's said. Primarily, we listen to someone to hear WHAT she/he is saying. What did the person say? Ich bin in den Laden reingegangen...? Bin in den Laden reingegangen...? Bin in n Laden reingegangen...? Bin in n Lad n reingegang ng...? Orthography is not VERY good at capturing the details of the pronunciation: Hearing WHAT is said is the primary goal of speech communication (for a listener). We re mostly very good at it --- in fact so good that when we often reconstruct something that wasn t actually pronounced. Of course that is easy in our native langusge We don t have to hear the exact pronunciation because we speak as we learned when we were growing up. We do all the stylistic reductions that suit the situation quite automatically, because we know HOW to speak WHEN... well in principle anyway (can you remember a situation where you have heard someone speaking wrongly for the situation?) 21

Which sounds convey the most information? Consonants or vowels? What do you think? Hearing WHAT is said is the primary goal of speech communication (for a listener). We re mostly very good at it --- in fact so good that when we often reconstruct something that wasn t actually pronounced. Of course that is easy in our native langusge We don t have to hear the exact pronunciation because we speak as we learned when we were growing up. We do all the stylistic reductions that suit the situation quite automatically, because we know HOW to speak WHEN... well in principle anyway (can you remember a situation where you have heard someone speaking wrongly for the situation?) 22

Vowels and Consonants Letters as Information carriers: Orthography separates words the acoustic signal doesn't. Vowels only o e a e + ü + e i + a + i o + e i + e + a + i + e e a u e + u + ö + a. Consonants only V h rs g + f r + B rl n + m + M ttw ch + b g nnt + d r + T g + m t + T mp r t r n + m + zw lf + Gr d. Institut für Phonetik, Universität des Saarlandes (IPUS) 23

Vowels and Consonants Sounds as Information carriers 1. Consonants only 2. Vowels only German 3. Everything 1. Consonants only 2. Vowels only English 3. Everything Institut für Phonetik, Universität des Saarlandes (IPUS) English text: The weather forecast for tomorrow: rather cloudy in the morning with a few sunny spells in the afternoon. 24

Phonetics is more than just vowels & consonants Things happen to words when they are put together. The mouth prepares for the next word at the end of this one. Some words are more important than others In general: Lexical words > function words For "economy of effort" we don't invest much in function words. 25

Connected speech The president will be elected for a period of four years. connected speech with silences between words as chain of isolated words as chain of isolated without silences function words: isolated vs. connected 26

For Speech Recordings & Analysis www.praat.org (by Paul Boersma & David Weenink, Phonetics Amsterdam) Don't forget to do the exercises (on the web page for the course) 27