Anatomical Structures for Speech Production

Size: px
Start display at page:

Download "Anatomical Structures for Speech Production"

Transcription

1 Acoustic Properties of Speech Sounds Speech production Signal processing Properties of speech sounds of American English Microphone variations Spectrographic Examples CLSP Workshop 2 Acoustic Properties of Speech Sounds 1 Anatomical Structures for Speech Production Soft Palate (Velum) Soft Palate (Velum) Hyoid Bone Epiglottis Cricoid Cartilage Esophagus Lung Nasal Cavity Nasal Cavity Hard Palate Tongue Tongue Thyroid Cartilage Sternum Hard Palate Thyroid Cartilage Vocal Cords Vocal Folds Trachea Trachea Lung Jaw CLSP Workshop 2 Acoustic Properties of Speech Sounds 2

2 Sub-Word Linguistic Units The phoneme is one of the most basic linguistic units used to represent pronunciations of words ASR systems typically represent words as phoneme sequences English contains approximately 4 phonemes which can be grouped by manner and place of articulation Manner Class Number Vowels 16 Fricatives 8 Stops 6 Semivowels 4 Nasals 3 Affricates 2 Aspirant 1 CLSP Workshop 2 Acoustic Properties of Speech Sounds 3 Phonemes in American English IPA AB Word IPA AB Word IPA AB Word /i/ iy beat /s/ s see /w/ w wet /I/ ih bit /S/ sh she /r/ r red /e/ ey bait /f/ f fee /l/ l let /E/ eh bet /T/ th thief /y/ y yet /@/ ae bat /z/ z z /m/ m meet /a/ aa bob /Z/ zh Gigi /n/ n neat /O/ ao bought /v/ v v /4/ ng sing /^/ ah but /D/ dh thee /C/ ch church /o/ ow boat /p/ p pea /J/ jh judge /U/ uh book /t/ t tea /h/ hh heat /u/ uw boot /k/ k key /5/ er bird /b/ b bay /a / ay bite /d/ d day /O / oy Boyd /g/ g geese /a / aw bout /{/ ax about CLSP Workshop 2 Acoustic Properties of Speech Sounds 4

3 Places of Articulation for Speech Production Alveopalatal Alveolar Labial Dental Palatal Velar Uvular CLSP Workshop 2 Acoustic Properties of Speech Sounds 5 A Speech Waveform Two plus seven is less than ten CLSP Workshop 2 Acoustic Properties of Speech Sounds 6

4 Spectral Representations Speech waveforms are usually sampled at rates varying from 8K (telephone) to 2K (wide-band) samples/sec ASR systems typically transform the waveform into a spectrum: a sequence of frequency-based analyses usually performed at regular intervals (e.g., 1 ms) A short-time Fourier transform (STFT) performs a spectral analysis on waveform segments small enough to be able to assume that the speech signal is quasi-stationary The waveform segment is created by a moving window, whose type (e.g., Hamming) and duration (e.g., 5-25ms) have a significant impact on the resulting spectrum A spectrogram is an image computed from the resulting spectrum, which is often used to examine the waveform CLSP Workshop 2 Acoustic Properties of Speech Sounds 7 Short-Time Fourier Transform w [ 5 - m ] w [ 1 - m ] w [ 2 - m ] x [ m ] m n = 5 n = 1 n = 2 X n (e jω )= + m= w[n m]x[m]e jωm If n is fixed, then it can be shown that: X n (e jω )= 1 π 2π W(e jθ )e jθn X(e j(ω+θ) )dθ π The above equation is meaningful only if we assume that X(e jω ) represents the Fourier transform of a signal whose properties continue outside the window, or simply that the signal is zero outside the window. In order for X n (e jω ) to correspond to X(e jω ), W(e jω ) must resemble an impulse with respect to X(e jω ). CLSP Workshop 2 Acoustic Properties of Speech Sounds 8

5 Comparison of Windows CLSP Workshop 2 Acoustic Properties of Speech Sounds 9 Comparison of Windows (cont d) CLSP Workshop 2 Acoustic Properties of Speech Sounds 1

6 A Wide-Band Speech Spectrogram Two plus seven is less than ten CLSP Workshop 2 Acoustic Properties of Speech Sounds 11 A Narrow-Band Speech Spectrogram Two plus seven is less than ten CLSP Workshop 2 Acoustic Properties of Speech Sounds 12

7 Spectral Averages: Corpus and Representation TIMIT acoustic-phonetic corpus phonetic transcription aligned with waveform native speakers of American English (8 dialects) 8 sentences/speaker (dialect sentences excluded) 136 female, 326 male speakers (NIST train set) 3,696 utterances, 142,91 tokens Mel-Frequency Spectral Coefficients (MFSC s) Mel-frequency scale (linear < 1kHz,log> 1kHz) 4 channels (2 Hz khz) 25 ms Hamming window, 5 ms frame-rate Average computed over entire phonetic token (for stops spectral slice at release was used) CLSP Workshop 2 Acoustic Properties of Speech Sounds 13 Happy Little Vowel Chart "So inaccurate, yet so useful." Rob's F2 Increases FRONT BACK i I uú U u HIGH e E Think ^,{ O o MID F 3 is mighty a LOW Your pal 5 is the way TENSE = Towards Edges tends to be longer LAX = Towards Center tends to be shorter to go! SCHWAS: Plain ({) About /{ba t/ Front ( ) Roses /ro z z/ Retroflex (}) Forever /f}ev5/ F 1 Increases CLSP Workshop 2 Acoustic Properties of Speech Sounds 14

8 Friendly Little Consonant Chart "Somewhat more accurate, yet somewhat less useful." The Semi-vowels: Manner of Articulation Nasal Fricative Stop Place of Articulation Labial Dental Alveolar Palatal Velar p b f v m T D Weak (Non-strident) t d s z S Z Strong (Strident) n 4 Voicing: Unvoiced Voiced k g y w l is like an extreme is like an extreme is like an extreme i u o r is like an extreme 5 The Odds and Ends: h (unvoiced h) H (voiced h) F (flap)? (glottal stop) The Affricates: C J is like is like t+s d+z FÊ (nasalized flap) CLSP Workshop 2 Acoustic Properties of Speech Sounds 15 Vowel Production No significant constriction in the vocal tract Usually produced with periodic excitation Acoustic characteristics depend on the position of the jaw, tongue, and lips [i] [@] [a] [u] CLSP Workshop 2 Acoustic Properties of Speech Sounds 16

9 Vowels of American English There are approximately 18 vowels in American English made up of monothongs, diphthongs, and reduced vowels (schwa s) They are often described by the articulatory features: High/Low, Front/Back, Retroflexed, Rounded, andtense/lax /i/ iy beat /O/ ao bought /a / ay bite /I/ ih bit /^/ ah but /O / oy Boyd /e/ ey bait /o/ ow boat /a / aw bout /E/ eh bet /U/ uh book [{] ax about /@/ ae bat /u/ uw boot [ ] ix roses /a/ aa Bob /5/ er Bert [}] axr butter CLSP Workshop 2 Acoustic Properties of Speech Sounds 17 Vowel Formant Averages Vowels are often characterized by F1, F2, and F3 High/Low is correlated with F1 Front/Back is correlated with F2 Retroflexion is marked by a low F3 35 Female Speakers 35 Male Speakers 3 F 3 F 2 F 1 3 F 3 F 2 F 1 Average Frequency (Hz) Average Frequency (Hz) i I e a O ^ o U u 5 { Vowel i I e a O ^ o U u 5 { Vowel CLSP Workshop 2 Acoustic Properties of Speech Sounds 18

10 Vowel Formant Trajectories Diphthongs can have significant formant motion Most vowels in American English are somewhat diphthongized F Female Speakers i e I E a 5 ^ U u { o O O a F Male Speakers i e I a a 5 U ^ u o a O { O F F 1 CLSP Workshop 2 Acoustic Properties of Speech Sounds 19 Vowel Durations Each vowel has a different intrinsic duration Schwa s have distinctly shorter durations (5ms) /I, E, ^, U/ are the shortest monothongs Context can greatly influence vowel duration 25 Female Speakers 25 Male Speakers Average Duration (ms) Average Duration (ms) i I e a O ^ o U u 5 { a o a u Vowel i I e a O ^ o U u 5 { a o a u Vowel CLSP Workshop 2 Acoustic Properties of Speech Sounds 2

11 Fricative Production Turbulence produced at narrow constriction Constriction position determines acoustic characteristics Can be produced with periodic excitation [f] [T] [s] [S] CLSP Workshop 2 Acoustic Properties of Speech Sounds 21 Fricatives of American English There are 8 fricatives in American English They are often described by the features Strident/Non-Strident (Strong/Weak), Voiced/Unvoiced Four places of articulation: Labial, Dental, Alveolar, and Palatal Type Unvoiced Voiced Labial /f/ f fee /v/ v v Dental /T/ th thief /D/ dh thee Alveolar /s/ s see /z/ z z Palatal /S/ sh she /Z/ zh Gigi CLSP Workshop 2 Acoustic Properties of Speech Sounds 22

12 Fricative Energy NON-STRIDENT STRIDENT Probability Density unadjusted for frequency Average Total Energy Strident fricatives tend to be stronger than non-strident CLSP Workshop 2 Acoustic Properties of Speech Sounds 23 Fricative Durations UNVOICED VOICED Probability Density unadjusted for frequency Duration Voiced fricatives tend to be shorter than unvoiced CLSP Workshop 2 Acoustic Properties of Speech Sounds 24

13 Nasal Production Velum lowering results in airflow through nasal cavity Consonants produced with closure in oral cavity Nasalized vowels have output through oral and nasal cavities Nasal murmurs have similar spectral characteristics [m] [n] [4] CLSP Workshop 2 Acoustic Properties of Speech Sounds 25 Nasal Consonants of American English Three places of articulation: Labial, Alveolar, and Velar Always attached to a vowel, though can form an entire syllable in unstressed environments ([ní ], [mí ], [4Í ]) /4/ is always post-vocalic Place identified by neighboring formant transitions Type Nasal Labial /m/ m me Dental /n/ n knee Velar /4/ ng sing CLSP Workshop 2 Acoustic Properties of Speech Sounds 26

14 Nasal Durations Duration (ms) Singleton Unvoiced Cluster Voiced Cluster Nasal consonants tend to be shorter in clusters with unvoiced consonants, and longer with voiced consonants CLSP Workshop 2 Acoustic Properties of Speech Sounds 27 Semivowel Production Constriction in vocal tract, no turbulence Slower articulatory motion than other consonants Laterals form complete closure with tongue tip, airflow via sides of constriction [w] [y] [r] [l] CLSP Workshop 2 Acoustic Properties of Speech Sounds 28

15 Semivowels of American English There are 4 semivowels in American English Always attached to a vowel, though /l/ can form an entire syllable in unstressed environments ([lí]) Extreme articulation of a corresponding vowel Similar formant positions Generally weaker due to constriction Type Semivowel Nearest Vowel Glides /w/ w wet /u/ /y/ y yet /i/ Liquids /r/ r red /5/ /l/ l let /o/ CLSP Workshop 2 Acoustic Properties of Speech Sounds 29 Acoustic Properties of Semivowels /w/ is characterized by a very low F1, F2 Typically a rapid spectral falloff above F2 /y/ is characterized by very low F1, very high F2 /r/ is characterized by a very low F3 Prevocalic F3 < medial F3 < postvocalic F3 /l/ is characterized by a low F1 and F2 Often presence of high frequency energy Postvocalic /l/ characterized by minimal spectral discontinuity, gradual motion of formants CLSP Workshop 2 Acoustic Properties of Speech Sounds 3

16 Aspirant Production /h/ inamericanenglish Turbulence excitation at glottis No constriction in the vocal tract, normal formant excitation Coupling with subglottal system results in little energy in F1 region Periodic excitation can be present in medial position CLSP Workshop 2 Acoustic Properties of Speech Sounds 31 Stop Production Complete closure in the vocal tract, pressure build up Sudden release of the constriction, turbulence noise Can have periodic excitation during closure [b] [d] [g] CLSP Workshop 2 Acoustic Properties of Speech Sounds 32

17 Stops of American English There are 6 stop consonants in American English Same places of articulation as nasal consonants Unvoiced stops are typically aspirated Voiced stops usually exhibit a voice-bar during closure Information about formant transitions and release useful for classification Type Voiced Unvoiced Labial /b/ b bee /p/ p pea Dental /d/ d Dee /t/ t tea Velar /g/ g geese /k/ k key CLSP Workshop 2 Acoustic Properties of Speech Sounds 33 Singleton Stop Durations VOT Duration (ms) b d g p t k The voice onset time (VOT) of unvoiced stops is longer than that of voiced stops CLSP Workshop 2 Acoustic Properties of Speech Sounds 34

18 /s/-stop Durations VOT Duration (ms) p t k Unvoiced stops are unaspirated in /s/ stop sequences CLSP Workshop 2 Acoustic Properties of Speech Sounds 35 Stop-Semivowel Durations VOT Duration (ms) Singletons [Stop][Semivowel] Clusters b d g p t k Semivowels are partially devoiced in stop semivowel sequences CLSP Workshop 2 Acoustic Properties of Speech Sounds 36

19 Voicing Cues for Stops There are many voicing cues for a stop CLSP Workshop 2 Acoustic Properties of Speech Sounds 37 Affricate Production Alveolar-stop palatal-fricative pairs Sudden release of the constriction, turbulence noise Can have periodic excitation during closure Affricates of American English There are two affricates in American English Voiced Unvoiced /J/ jh judge /C/ ch church CLSP Workshop 2 Acoustic Properties of Speech Sounds 38

20 Speech from a Close-Talking Microphone Time (seconds) Zero Crossing Rate 16 khz 8 8 khz Total Energy 8 Energy Hz to 75 Hz Wide Band Spectrogram khz 4 4 khz Waveform The Thinker is a famous sculpture File: /server/users/jwc/latex/sum97/sennheiser.wav Printed by jwc on Wed Jul 16 11:58: Page: 1 CLSP Workshop 2 Acoustic Properties of Speech Sounds 39 Speech from a Omni-Directional Microphone Time (seconds) Zero Crossing Rate 16 khz 8 8 khz Total Energy 8 Energy Hz to 75 Hz Wide Band Spectrogram khz 4 4 khz Waveform The Thinker is a famous sculpture File: /server/users/jwc/latex/sum97/bk.wav Printed by jwc on Wed Jul 16 11:57: Page: 1 CLSP Workshop 2 Acoustic Properties of Speech Sounds 4

21 Speech over a Telephone Channel Time (seconds) Zero Crossing Rate 16 khz 8 8 khz Total Energy 8 Energy Hz to 75 Hz Wide Band Spectrogram khz 4 4 khz Waveform The Thinker is a famous sculpture File: /server/users/jwc/latex/sum97/telephone.wav Printed by jwc on Wed Jul 16 11:59: Page: 1 CLSP Workshop 2 Acoustic Properties of Speech Sounds 41

On the Formation of Phoneme Categories in DNN Acoustic Models

On the Formation of Phoneme Categories in DNN Acoustic Models On the Formation of Phoneme Categories in DNN Acoustic Models Tasha Nagamine Department of Electrical Engineering, Columbia University T. Nagamine Motivation Large performance gap between humans and state-

More information

Consonants: articulation and transcription

Consonants: articulation and transcription Phonology 1: Handout January 20, 2005 Consonants: articulation and transcription 1 Orientation phonetics [G. Phonetik]: the study of the physical and physiological aspects of human sound production and

More information

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers October 31, 2003 Amit Juneja Department of Electrical and Computer Engineering University of Maryland, College Park,

More information

Phonetics. The Sound of Language

Phonetics. The Sound of Language Phonetics. The Sound of Language 1 The Description of Sounds Fromkin & Rodman: An Introduction to Language. Fort Worth etc., Harcourt Brace Jovanovich Read: Chapter 5, (p. 176ff.) (or the corresponding

More information

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech Dept. for Speech, Music and Hearing Quarterly Progress and Status Report VCV-sequencies in a preliminary text-to-speech system for female speech Karlsson, I. and Neovius, L. journal: STL-QPSR volume: 35

More information

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Amit Juneja and Carol Espy-Wilson Department of Electrical and Computer Engineering University of Maryland,

More information

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Hua Zhang, Yun Tang, Wenju Liu and Bo Xu National Laboratory of Pattern Recognition Institute of Automation, Chinese

More information

age, Speech and Hearii

age, Speech and Hearii age, Speech and Hearii 1 Speech Commun cation tion 2 Sensory Comm, ection i 298 RLE Progress Report Number 132 Section 1 Speech Communication Chapter 1 Speech Communication 299 300 RLE Progress Report

More information

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Nord, L. and Hammarberg, B. and Lundström, E. journal:

More information

Speaker Recognition. Speaker Diarization and Identification

Speaker Recognition. Speaker Diarization and Identification Speaker Recognition Speaker Diarization and Identification A dissertation submitted to the University of Manchester for the degree of Master of Science in the Faculty of Engineering and Physical Sciences

More information

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Prof. Ch.Srinivasa Kumar Prof. and Head of department. Electronics and communication Nalanda Institute

More information

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

On Developing Acoustic Models Using HTK. M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. Delft, December 2004 Copyright c 2004 M.A. Spaans BSc. December, 2004. Faculty of Electrical

More information

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations Post-vocalic spirantization: Typology and phonetic motivations Alan C-L Yu University of California, Berkeley 0. Introduction Spirantization involves a stop consonant becoming a weak fricative (e.g., B,

More information

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all Human Communication Science Chandler House, 2 Wakefield Street London WC1N 1PF http://www.hcs.ucl.ac.uk/ ACOUSTICS OF SPEECH INTELLIGIBILITY IN DYSARTHRIA EUROPEAN MASTER S S IN CLINICAL LINGUISTICS UNIVERSITY

More information

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical Database Structure 1 This database, compiled by Merritt Ruhlen, contains certain kinds of linguistic and nonlinguistic information for the world s roughly 5,000 languages. This introduction will discuss

More information

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH Mietta Lennes Most of the phonetic knowledge that is currently available on spoken Finnish is based on clearly pronounced speech: either readaloud

More information

Universal contrastive analysis as a learning principle in CAPT

Universal contrastive analysis as a learning principle in CAPT Universal contrastive analysis as a learning principle in CAPT Jacques Koreman, Preben Wik, Olaf Husby, Egil Albertsen Department of Language and Communication Studies, NTNU, Trondheim, Norway jacques.koreman@ntnu.no,

More information

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA LANGUAGE AND SPEECH, 2009, 52 (4), 391 413 391 Variability in Word Duration as a Function of Probability, Speech Style, and Prosody Rachel E. Baker, Ann R. Bradlow Northwestern University, Evanston, IL,

More information

Lecture 9: Speech Recognition

Lecture 9: Speech Recognition EE E6820: Speech & Audio Processing & Recognition Lecture 9: Speech Recognition 1 Recognizing speech 2 Feature calculation Dan Ellis Michael Mandel 3 Sequence

More information

Segregation of Unvoiced Speech from Nonspeech Interference

Segregation of Unvoiced Speech from Nonspeech Interference Technical Report OSU-CISRC-8/7-TR63 Department of Computer Science and Engineering The Ohio State University Columbus, OH 4321-1277 FTP site: ftp.cse.ohio-state.edu Login: anonymous Directory: pub/tech-report/27

More information

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015 Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development Indiana, November, 2015 Louisa C. Moats, Ed.D. (louisa.moats@gmail.com) meaning (semantics) discourse structure morphology

More information

Body-Conducted Speech Recognition and its Application to Speech Support System

Body-Conducted Speech Recognition and its Application to Speech Support System Body-Conducted Speech Recognition and its Application to Speech Support System 4 Shunsuke Ishimitsu Hiroshima City University Japan 1. Introduction In recent years, speech recognition systems have been

More information

Contrasting English Phonology and Nigerian English Phonology

Contrasting English Phonology and Nigerian English Phonology Contrasting English Phonology and Nigerian English Phonology Saleh, A. J. Rinji, D.N. ABSTRACT The thrust of this work is the fact that phonology plays a vital role in language and communication both in

More information

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver.1 (Mar - Apr.2015), PP 55-61 www.iosrjournals.org Analysis of Emotion

More information

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence INTERSPEECH September,, San Francisco, USA Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence Bidisha Sharma and S. R. Mahadeva Prasanna Department of Electronics

More information

Consonant-Vowel Unity in Element Theory*

Consonant-Vowel Unity in Element Theory* Consonant-Vowel Unity in Element Theory* Phillip Backley Tohoku Gakuin University Kuniya Nasukawa Tohoku Gakuin University ABSTRACT. This paper motivates the Element Theory view that vowels and consonants

More information

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access Joyce McDonough 1, Heike Lenhert-LeHouiller 1, Neil Bardhan 2 1 Linguistics

More information

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin Stromswold & Rifkin, Language Acquisition by MZ & DZ SLI Twins (SRCLD, 1996) 1 Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin Dept. of Psychology & Ctr. for

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 2aSC: Linking Perception and Production

More information

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS Natalia Zharkova 1, William J. Hardcastle 1, Fiona E. Gibbon 2 & Robin J. Lickley 1 1 CASL Research Centre, Queen Margaret University, Edinburgh

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 11/2007, ISSN 1642-6037 Marek WIŚNIEWSKI *, Wiesława KUNISZYK-JÓŹKOWIAK *, Elżbieta SMOŁKA *, Waldemar SUSZYŃSKI * HMM, recognition, speech, disorders

More information

Journal of Phonetics

Journal of Phonetics Journal of Phonetics 40 (2012) 595 607 Contents lists available at SciVerse ScienceDirect Journal of Phonetics journal homepage: www.elsevier.com/locate/phonetics How linguistic and probabilistic properties

More information

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

The analysis starts with the phonetic vowel and consonant charts based on the dataset: Ling 113 Homework 5: Hebrew Kelli Wiseth February 13, 2014 The analysis starts with the phonetic vowel and consonant charts based on the dataset: a) Given that the underlying representation for all verb

More information

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch The pronunciation of /7i/ by male and female speakers of avant-garde Dutch Vincent J. van Heuven, Loulou Edelman and Renée van Bezooijen Leiden University/ ULCL (van Heuven) / University of Nijmegen/ CLS

More information

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Perceptual scaling of voice identity: common dimensions for different vowels and speakers DOI 10.1007/s00426-008-0185-z ORIGINAL ARTICLE Perceptual scaling of voice identity: common dimensions for different vowels and speakers Oliver Baumann Æ Pascal Belin Received: 15 February 2008 / Accepted:

More information

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin 1 Title: Jaw and order Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin Short title: Production of coronal consonants Acknowledgements This work was partially supported

More information

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny By the End of Year 8 All Essential words lists 1-7 290 words Commonly Misspelt Words-55 working out more complex, irregular, and/or ambiguous words by using strategies such as inferring the unknown from

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan James White & Marc Garellek UCLA 1 Introduction Goals: To determine the acoustic correlates of primary and secondary

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Linguistic Portfolios Volume 6 Article 10 2017 An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Cassy Lundy St. Cloud State University, casey.lundy@gmail.com

More information

Speaker recognition using universal background model on YOHO database

Speaker recognition using universal background model on YOHO database Aalborg University Master Thesis project Speaker recognition using universal background model on YOHO database Author: Alexandre Majetniak Supervisor: Zheng-Hua Tan May 31, 2011 The Faculties of Engineering,

More information

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION by Adam B. Buchwald A dissertation submitted to The Johns Hopkins University in conformity with the requirements

More information

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 Ranniery Maia 1,2, Jinfu Ni 1,2, Shinsuke Sakai 1,2, Tomoki Toda 1,3, Keiichi Tokuda 1,4 Tohru Shimizu 1,2, Satoshi Nakamura 1,2 1 National

More information

The Indian English of Tibeto-Burman language speakers*

The Indian English of Tibeto-Burman language speakers* The Indian English of Tibeto-Burman language speakers* Caroline R. Wiltshire University of Florida English as spoken as a second language in India (IE) has developed different sound patterns from other

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

Richardson, J., The Next Step in Guided Writing, Ohio Literacy Conference, 2010

Richardson, J., The Next Step in Guided Writing, Ohio Literacy Conference, 2010 1 Procedures and Expectations for Guided Writing Procedures Context: Students write a brief response to the story they read during guided reading. At emergent levels, use dictated sentences that include

More information

A comparison of spectral smoothing methods for segment concatenation based speech synthesis

A comparison of spectral smoothing methods for segment concatenation based speech synthesis D.T. Chappell, J.H.L. Hansen, "Spectral Smoothing for Speech Segment Concatenation, Speech Communication, Volume 36, Issues 3-4, March 2002, Pages 343-373. A comparison of spectral smoothing methods for

More information

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System ARCHIVES OF ACOUSTICS Vol. 42, No. 3, pp. 375 383 (2017) Copyright c 2017 by PAN IPPT DOI: 10.1515/aoa-2017-0039 Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

More information

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University Linguistics 220 Phonology: distributions and the concept of the phoneme John Alderete, Simon Fraser University Foundations in phonology Outline 1. Intuitions about phonological structure 2. Contrastive

More information

A study of speaker adaptation for DNN-based speech synthesis

A study of speaker adaptation for DNN-based speech synthesis A study of speaker adaptation for DNN-based speech synthesis Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King The Centre for Speech Technology Research (CSTR) University of Edinburgh,

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** **Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** REANALYZING THE JAPANESE CODA NASAL IN OPTIMALITY THEORY 1 KATSURA AOYAMA University

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

Voice conversion through vector quantization

Voice conversion through vector quantization J. Acoust. Soc. Jpn.(E)11, 2 (1990) Voice conversion through vector quantization Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, and Hisao Kuwabara A TR Interpreting Telephony Research Laboratories,

More information

Demonstration of problems of lexical stress on the pronunciation Turkish English teachers and teacher trainees by computer

Demonstration of problems of lexical stress on the pronunciation Turkish English teachers and teacher trainees by computer Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 46 ( 2012 ) 3011 3016 WCES 2012 Demonstration of problems of lexical stress on the pronunciation Turkish English teachers

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence Bistra Andreeva 1, William Barry 1, Jacques Koreman 2 1 Saarland University Germany 2 Norwegian University of Science and

More information

Multilingual Speech Data Collection for the Assessment of Pronunciation and Prosody in a Language Learning System

Multilingual Speech Data Collection for the Assessment of Pronunciation and Prosody in a Language Learning System Multilingual Speech Data Collection for the Assessment of Pronunciation and Prosody in a Language Learning System O. Jokisch 1, A. Wagner 2, R. Sabo 3, R. Jäckel 1, N. Cylwik 2, M. Rusko 3, A. Ronzhin

More information

WHEN THERE IS A mismatch between the acoustic

WHEN THERE IS A mismatch between the acoustic 808 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition Jeih-Weih Hung, Member,

More information

Phonological and Phonetic Representations: The Case of Neutralization

Phonological and Phonetic Representations: The Case of Neutralization Phonological and Phonetic Representations: The Case of Neutralization Allard Jongman University of Kansas 1. Introduction The present paper focuses on the phenomenon of phonological neutralization to consider

More information

Perceived speech rate: the effects of. articulation rate and speaking style in spontaneous speech. Jacques Koreman. Saarland University

Perceived speech rate: the effects of. articulation rate and speaking style in spontaneous speech. Jacques Koreman. Saarland University 1 Perceived speech rate: the effects of articulation rate and speaking style in spontaneous speech Jacques Koreman Saarland University Institute of Phonetics P.O. Box 151150 D-66041 Saarbrücken Germany

More information

Audible and visible speech

Audible and visible speech Building sensori-motor prototypes from audiovisual exemplars Gérard BAILLY Institut de la Communication Parlée INPG & Université Stendhal 46, avenue Félix Viallet, 383 Grenoble Cedex, France web: http://www.icp.grenet.fr/bailly

More information

Radical CV Phonology: the locational gesture *

Radical CV Phonology: the locational gesture * Radical CV Phonology: the locational gesture * HARRY VAN DER HULST 1 Goals 'Radical CV Phonology' is a variant of Dependency Phonology (Anderson and Jones 1974, Anderson & Ewen 1980, Ewen 1980, Lass 1984,

More information

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION The Journey to Vowelerria An adventure across familiar territory child speech intervention leading to uncommon terrain vowel errors, Ph.D., CCC-SLP 03-15-14

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Pobrane z czasopisma New Horizons in English Studies  Data: 18/11/ :52:20. New Horizons in English Studies 1/2016 LANGUAGE Maria Curie-Skłodowska University () in Lublin k.laidler.umcs@gmail.com Online Adaptation of Word-initial Ukrainian CC Consonant Clusters by Native Speakers of English Abstract. The phenomenon

More information

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and

More information

Word Stress and Intonation: Introduction

Word Stress and Intonation: Introduction Word Stress and Intonation: Introduction WORD STRESS One or more syllables of a polysyllabic word have greater prominence than the others. Such syllables are said to be accented or stressed. Word stress

More information

9 Sound recordings: acoustic and articulatory data

9 Sound recordings: acoustic and articulatory data 9 Sound recordings: acoustic and articulatory data Robert J. Podesva and Elizabeth Zsiga 1 Introduction Linguists, across the subdisciplines of the field, use sound recordings for a great many purposes

More information

NIH Public Access Author Manuscript Lang Speech. Author manuscript; available in PMC 2011 January 1.

NIH Public Access Author Manuscript Lang Speech. Author manuscript; available in PMC 2011 January 1. NIH Public Access Author Manuscript Published in final edited form as: Lang Speech. 2010 ; 53(Pt 1): 49 69. Spatial and Temporal Properties of Gestures in North American English /R/ Fiona Campbell, University

More information

Modern TTS systems. CS 294-5: Statistical Natural Language Processing. Types of Modern Synthesis. TTS Architecture. Text Normalization

Modern TTS systems. CS 294-5: Statistical Natural Language Processing. Types of Modern Synthesis. TTS Architecture. Text Normalization CS 294-5: Statistical Natural Language Processing Speech Synthesis Lecture 22: 12/4/05 Modern TTS systems 1960 s first full TTS Umeda et al (1968) 1970 s Joe Olive 1977 concatenation of linearprediction

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling 2008 Intermediate Level Skills Workbook Group 2 Groups 1 & 2 The ABCs of O-G The Flynn System by Emi Flynn Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling The ABCs of O-G

More information

Quarterly Progress and Status Report. Sound symbolism in deictic words

Quarterly Progress and Status Report. Sound symbolism in deictic words Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Sound symbolism in deictic words Traunmüller, H. journal: TMH-QPSR volume: 37 number: 2 year: 1996 pages: 147-150 http://www.speech.kth.se/qpsr

More information

Rhythm-typology revisited.

Rhythm-typology revisited. DFG Project BA 737/1: "Cross-language and individual differences in the production and perception of syllabic prominence. Rhythm-typology revisited." Rhythm-typology revisited. B. Andreeva & W. Barry Jacques

More information

A Believable Accent: The Phonology of the Pink Panther

A Believable Accent: The Phonology of the Pink Panther William Pickett California State University, Fullerton A Believable Accent: The Phonology of the Pink Panther If the empirical data employed by a linguist is defined as that which is verifiable or provable

More information

DIBELS Next BENCHMARK ASSESSMENTS

DIBELS Next BENCHMARK ASSESSMENTS DIBELS Next BENCHMARK ASSESSMENTS Click to edit Master title style Benchmark Screening Benchmark testing is the systematic process of screening all students on essential skills predictive of later reading

More information

Self-Supervised Acquisition of Vowels in American English

Self-Supervised Acquisition of Vowels in American English Self-Supervised Acquisition of Vowels in American English Michael H. Coen MIT Computer Science and Artificial Intelligence Laboratory 32 Vassar Street Cambridge, MA 2139 mhcoen@csail.mit.edu Abstract This

More information

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish Carmen Lie-Lahuerta Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish I t is common knowledge that foreign learners struggle when it comes to producing the sounds of the target language

More information

SARDNET: A Self-Organizing Feature Map for Sequences

SARDNET: A Self-Organizing Feature Map for Sequences SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu

More information

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language Z.HACHKAR 1,3, A. FARCHI 2, B.MOUNIR 1, J. EL ABBADI 3 1 Ecole Supérieure de Technologie, Safi, Morocco. zhachkar2000@yahoo.fr.

More information

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project Phonetic- and Speaker-Discriminant Features for Speaker Recognition by Lara Stoll Research Project Submitted to the Department of Electrical Engineering and Computer Sciences, University of California

More information

Different Task Type and the Perception of the English Interdental Fricatives

Different Task Type and the Perception of the English Interdental Fricatives Different Task Type and the Perception of the English Interdental Fricatives Mara Silvia Reis, Denise Cristina Kluge, Melissa Bettoni-Techio Federal University of Santa Catarina marasreis@hotmail.com,

More information

Automatic segmentation of continuous speech using minimum phase group delay functions

Automatic segmentation of continuous speech using minimum phase group delay functions Speech Communication 42 (24) 429 446 www.elsevier.com/locate/specom Automatic segmentation of continuous speech using minimum phase group delay functions V. Kamakshi Prasad, T. Nagarajan *, Hema A. Murthy

More information

Learners Use Word-Level Statistics in Phonetic Category Acquisition

Learners Use Word-Level Statistics in Phonetic Category Acquisition Learners Use Word-Level Statistics in Phonetic Category Acquisition Naomi Feldman, Emily Myers, Katherine White, Thomas Griffiths, and James Morgan 1. Introduction * One of the first challenges that language

More information

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION Han Shu, I. Lee Hetherington, and James Glass Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Cambridge,

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

Automatic intonation assessment for computer aided language learning

Automatic intonation assessment for computer aided language learning Available online at www.sciencedirect.com Speech Communication 52 (2010) 254 267 www.elsevier.com/locate/specom Automatic intonation assessment for computer aided language learning Juan Pablo Arias a,

More information

Language Change: Progress or Decay?

Language Change: Progress or Decay? Language Change: Progress or Decay? Fourth edition How and why do languages change? Where does the evidence of language change come from? How do languages begin and end? This introduction to language change

More information

Manner assimilation in Uyghur

Manner assimilation in Uyghur Manner assimilation in Uyghur Suyeon Yun (suyeon@mit.edu) 10th Workshop on Altaic Formal Linguistics (1) Possible patterns of manner assimilation in nasal-liquid sequences (a) Regressive assimilation lateralization:

More information

Speaker Identification by Comparison of Smart Methods. Abstract

Speaker Identification by Comparison of Smart Methods. Abstract Journal of mathematics and computer science 10 (2014), 61-71 Speaker Identification by Comparison of Smart Methods Ali Mahdavi Meimand Amin Asadi Majid Mohamadi Department of Electrical Department of Computer

More information

PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS

PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS Akella Amarendra Babu 1 *, Ramadevi Yellasiri 2 and Akepogu Ananda Rao 3 1 JNIAS, JNT University Anantapur, Ananthapuramu,

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1 , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

Automatic English-Chinese name transliteration for development of multilingual resources

Automatic English-Chinese name transliteration for development of multilingual resources Automatic English-Chinese name transliteration for development of multilingual resources Stephen Wan and Cornelia Maria Verspoor Microsoft Research Institute Macquarie University Sydney NSW 2109, Australia

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer The magnetic resonance imaging subset of the mngu0 articulatory corpus Citation for published version: Steiner, I, Richmond, K, Marshall, I & Gray, C 2012, 'The magnetic resonance

More information

Characterizing and Processing Robot-Directed Speech

Characterizing and Processing Robot-Directed Speech Characterizing and Processing Robot-Directed Speech Paulina Varchavskaia, Paul Fitzpatrick, Cynthia Breazeal AI Lab, MIT, Cambridge, USA [paulina,paulfitz,cynthia]@ai.mit.edu Abstract. Speech directed

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information