This lecture. Speech production and articulatory phonetics. Mel-frequency cepstral coefficients (i.e., the input to ASR systems) Next week: 3 lectures

Size: px
Start display at page:

Download "This lecture. Speech production and articulatory phonetics. Mel-frequency cepstral coefficients (i.e., the input to ASR systems) Next week: 3 lectures"

Transcription

1

2 This lecture Speech production and articulatory phonetics. Mel-frequency cepstral coefficients (i.e., the input to ASR systems) Next week: 3 lectures Some images from Jim Glass course (MIT), the Jurafsky & Martin textbook, the Rolling Stones. CSC401/2511 Spring

3 The vocal tract Nasal cavity Many physical structures are co-ordinated in the production of speech. Velum Tongue Lips Generally, sound is generated by passing air through the vocal tract. Glottis Lungs Jaw Sound is modified by constricting airflow in particular ways. CSC401/2511 Spring

4 Place of articulation The location of the primary constriction can be: Alveolar: constriction near the alveolar ridge (e.g., /t/) Bilabial: touching of the lips together (e.g., /m/, /p/) Dental: constriction of/at the teeth (e.g., /th/) Labiodental: constriction between lip and teeth (e.g., /f/) Velar: constriction at or near the velum (e.g., /k/). CSC401/2511 Spring

5 Phonetic features Phonemes can have several features, e.g.: Place: Location of primary constriction. Voicing: Manner: High/low: Front/back: {"#$%&#"', )*#")*"#, +%,-"#, #")*&+%,-"#, $%#"', } Whether the glottis is vibrating. {+$&*1%, $&*1%, } The class of articulation type. {3-&4, $&5%#,,"3"#, 6'*1"-*$%, "44'&7*8",-, '%-'&6#%7, } Anterior position of the tongue. {h*;h, 8*+, #&5, } Ventral position of the tongue. {6'&,-, 1%,-'"#, )"1<, } voicing as in silence CSC401/2511 Spring

6 Spectrograms Spectrograms are 3D representations of any acoustic signal where a co-ordinate represents the amplitude of a particular sinusoidal frequency at a particular time. CSC401/2511 Spring

7 Formants and phonemes Formant: n. A large concentration of energy within a band of frequencies (e.g., = >, =?, ). =? = > CSC401/2511 Spring

8 Phonemic alphabets There are several alphabets that categorize the sounds of speech. The International Phonetic Alphabet (IPA) is popular, but it uses non-ascii symbols. The TIMIT phonemic alphabet will be used by default in this course. Other popular alphabets include ARPAbet, Worldbet, and OGIbet, usually adding special cases. E.g., /pcl/ is the period of silence immediately before a /p/. TIMIT IPA e.g. /iy/ /i y / beat /ih/ /ɪ/ bit /eh/ /ɛ/ bet /ae/ /æ/ bat /aa/ /ɑ/ Bob /ah/ /ʌ/ but /ao/ /ɔ/ bought /uh/ /ʊ/ book /uw/ /u/ boot /ux/ /u/ suit /ax/ /ə/ about CSC401/2511 Spring

9 Phonemes Now we re going to discuss each of these manners of articulation: Vowels: open vocal tract, no nasal air. Fricatives: noisy, with air passing through a tight constriction (e.g., shift ). Stops/plosives: complete vocal tract constriction and burst of energy (e.g., papa ). Nasals: involve air passing through the nasal cavity (e.g., mama ). Semivowels: similar to vowels, but typically with more constriction (e.g., wall ). Affricates: Alveolar stop followed by fricative. CSC401/2511 Spring

10 Vowels (1/6) There are approximately 19 vowels in Canadian English, including diphthongs in which the articulators move over time. Vowels are distinguished primarily by their formants. other /er/ /axr/ e.g. Bert butter diphthong /ey/ /ow/ /ay/ /oy/ /aw/ /ux/ e.g. bait boat bite boy bout suit Monophthong /iy/ /ih/ /eh/ /ae/ /aa/ /ao/ /ah/ /uh/ /uw/ /ax/ /ix/ e.g. beat bit bet bat Bob bought but book boot about roses CSC401/2511 Spring

11 The uniform tube Closed, vibrating end glottis 17 cm Open, radiating end lips The positions of the tongue, jaw, and lips change the shape and cross-sectional area of the vocal tract. CSC401/2511 Spring

12 Uniform tubes in practice Many musical instruments are based on the idea of uniform (or, in many cases, bent) tubes. Longer tubes produce deeper sounds (lower frequencies). A tube ½ the length of another will be 1 octave higher. CSC401/2511 Spring

13 Vowels as concatenated tubes The vocal tract can be modelled as the concatenation of dozens, hundreds, or thousands of tubes. /iy/ /uw/ CSC401/2511 Spring

14 Aside waves in concatenated tubes We model the volume velocity A B and the pressure variation 4 B at position 7 in the < CD lossless tube (whose area is E B ) at time - A B 7, - = A B G A B H B 7, - = I1 E B A B G A B H where 1 is the speed of sound, I is the density of air. CSC401/2511 Spring

15 Waves in concatenated tubes Because of partial wave reflections that occur at tube boundaries, we can generate spectra with particular resonances. CSC401/2511 Spring

16 Spectrograms of vowels Tongue height is correlated with the first formant, = >. Tongue frontness is correlated with the second formant, =?. CSC401/2511 Spring

17 The vowel trapezoid = > increases =? increases CSC401/2511 Spring

18 Formants and front/back, high/low Front/ low Front/ high Back/ high CSC401/2511 Spring

19 Fricatives (2/6) Fricatives are caused by acoustic turbulence at a narrow constriction whose position determines the sound. Fricatives can be voiced (i.e., the glottis can be vibrating). Labiodental dental alveolar palatal /f/ /th/ /s/ /sh/ CSC401/2511 Spring

20 Fricatives Fricatives have four places of articulation: Labio-dental ( labial ) Interdental ( dental ) Alveolar Palato-alveolar ( palatal ) Unvoiced Every place of articulation has both a voiced and an unvoiced version. Voiced Labial /f/ fee /v/ Vendetta Dental /th/ thief /dh/ Thee Alveolar /s/ see /z/ Zardoz Palatal /sh/ she /zh/ Zha-zha CSC401/2511 Spring

21 Unvoiced fricatives Note in these examples that /f/ has more energy in the word-final position, and /sh/ excites more of the spectrum than /s/ when word-initial. CSC401/2511 Spring

22 Voiced versus unvoiced fricatives CSC401/2511 Spring

23 Voiced versus unvoiced fricatives Spectra here look similar, but voiced fricatives often include significant energy < 150 Hz. CSC401/2511 Spring

24 Plosives (3/6) Plosives build pressure behind a complete closure in the vocal tract. This closure can be associated with voiced excitation. A sudden release of this constriction results in brief noise. labial /b/ alveolar /d/ velar /g/ CSC401/2511 Spring

25 Plosives Plosives have three places of articulation: Unvoiced Voiced Labial /p/ porpoise /b/ baboon Alveolar /t/ tort /d/ dodo Velar /k/ kick /g/ Google Voiced stops are usually characterized by a voice bar during closure, indicating the vibrating glottis. Formant transitions are very informative in classification. CSC401/2511 Spring

26 Voicing contrasts The voice bar CSC401/2511 Spring

27 Formant transitions among plosives Despite a common vowel, the motion of =? and into (and out of) the vowel helps identify the plosive. CSC401/2511 Spring

28 Voice onset time and voicing cues Voice onset time: n. the time from stop release (i.e., the start of the sound burst) to the start of vocal periodicity. There are at least 6 features that indicate voicing in plosives. CSC401/2511 Spring

29 Voice onset time among plosives Unvoiced plosives have longer voice onset times CSC401/2511 Spring

30 Nasals (4/6) Nasals involve lowering the velum so that air passes through the nasal cavity. Closures in the oral cavity (at same positions as plosives) change the resonant characteristics of the nasal sonorant. labial alveolar velar /m/ /n/ /ng/ CSC401/2511 Spring

31 Formant transitions among nasals Nasals often appear as two formants Despite a common vowel, the motion of =? and before and after each nasal helps to identify it. CSC401/2511 Spring

32 Semivowels (5/6) Semivowels act as consonants in syllables and involve constriction in the vocal tract, but there is less turbulence. They also involve slower articulatory motion. Laterals involve airflow around the sides of the tongue. /w/ /y/ /r/ /l/ CSC401/2511 Spring

33 Semivowels Semivowels are often sub-classified as glides or liquids. Glides Liquids Semivowel Nearest vowel /w/ Wow /uw/ /y/ yoyo /iy/ /r/ rear /er/ /l/ Lulu /ow/ Semivowels are more constricted versions of corresponding vowels. Similar formants, though generally weaker. CSC401/2511 Spring

34 Semivowels Note the drastic formant transitions which are more typical of semivowels. CSC401/2511 Spring

35 Affricates and aspirants (6/6) There are two affricates: /jh/ (voiced; e.g., judge) and /ch/ (unvoiced; e.g., church). These involve an alveolar stop followed by a fricative. Voicing in /jh/ is normally indicated by voice bars, as with plosives. There s only one aspirant in Canadian English: /h/ (e.g., hat) This involves turbulence generated at the glottis, In Canadian English, there is no constriction in the vocal tract. CSC401/2511 Spring

36 Affricates and aspirants CSC401/2511 Spring

37 Putting it all together Phonemes are grouped together in syllables which must contain a non-obstruent nucleus. Consonants can be clustered in particular ways in the onset and coda, but with constraints. E.g., you usually can t start a word with /kt/, nor end one with /tk/. CSC401/2511 Spring

38 Alternative pronunciations Pronunciations of words can vary significantly, but with observable frequencies. The Switchboard corpus is a phonetically annotated database of speech recorded in telephone conversations. CSC401/2511 Spring

39 Known effects of pronunciation Speakers tend to drop or change pronunciations in predictable ways in order to reduce the effort required to co-ordinate the various articulators. Palatalization generally refers to a conflation of phonemes closer to the frontal palate than they should be. Final t/d deletion is simply the omission of alveolar plosives from the ends of words. CSC401/2511 Spring

40 Variation at syllable boundaries CSC401/2511 Spring

41 Recall our input to ASR Frame Spectrum Amplitude We want to transform the spectrum into something more useful Frequency (Hz) CSC401/2511 Spring

42 1. The Mel-scale filter bank To mimic the response of the human ear (and because it empirically improves speech recognition), we often discretize the spectrum using J triangular filters. Uniform spacing before 1 khz, logarithmic after 1 khz CSC401/2511 Spring

43 2. Source and filter The acoustics of speech are produced by a glottal pulse waveform (the source) passing through a vocal tract whose shape modifies that wave (the filter). The shape of the vocal tract is more important to phoneme recognition. We to separate the source from the filter in the acoustics. CSC401/2511 Spring

44 2. Source and filter (aside) Since speech is assumed to be the output of a linear time invariant system, it can be described as a convolution. Convolution, 7 L, is beyond the scope of this course, but can be conceived as the modification of one signal by another. For speech signal 7[,], glottal signal ;,, and vocal tract transfer $[,] with spectra O[P], Q[P], and R[P], respectively : 7, = ;, $, O P = Q P R P log O[P] = log Q P + log R[P] We ve separated the source and filter into two terms! CSC401/2511 Spring

45 2. The cepstrum We separate the source and the filter by pretending the log of the spectrum is actually a time domain signal. the log spectrum log O[P] is a sum of the log spectra of the source and filter, i.e., a superposition; finding its spectrum will allow us to isolate these components. Cepstrum: n. the spectrum of the log of the spectrum. Fun fact: ceps is the reverse of spec. Instead of filters we have lifters log log log CSC401/2511 Spring

46 2. The cepstrum Spectrum Log spectrum Cepstrum The domain of the cepstrum is quefrency (a play on the word frequency ). CSC401/2511 Spring

47 2. The cepstrum Spectrum Pictures from John Coleman (2005) Cepstrum This is due to the vocal tract shape This is due to the glottis CSC401/2511 Spring

48 Mel-frequency cepstral coefficients Mel-frequency cepstral coefficients (MFCCs) are the most popular representation of speech used in ASR. They are the spectra of the logarithms of the mel-scaled filtered spectra of the windows of the waveform. Speech signal window DFT Mel filterbank log DFT MFCC CSC401/2511 Spring

49 Advantages of MFCCs The cepstrum produces highly uncorrelated features (every dimension is useful). This includes a separation of the source and filter. In practice, the cepstrum has been easier to learn than the spectrum for phoneme recognition. There is an efficient method to compute cepstra called the discrete cosine transform. CSC401/2511 Spring

50 MFCCs in practice An observation vector of MFCCs often consists of The first 13 cepstral coefficients (i.e., the first 13 dimensions produced by this method), An additional overall energy measure, The velocities (V) of each of those 14 dimensions, i.e., the rate of change of each coefficient at a given time The accelerations (VV) of each of original 14 dimensions. The result is that at a timeframe - we have an observation MFCC vector of (13+1)*3=42 dimensions. This vector is what is used by our ASR systems CSC401/2511 Spring

51 Next week

On the Formation of Phoneme Categories in DNN Acoustic Models

On the Formation of Phoneme Categories in DNN Acoustic Models On the Formation of Phoneme Categories in DNN Acoustic Models Tasha Nagamine Department of Electrical Engineering, Columbia University T. Nagamine Motivation Large performance gap between humans and state-

More information

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers October 31, 2003 Amit Juneja Department of Electrical and Computer Engineering University of Maryland, College Park,

More information

Consonants: articulation and transcription

Consonants: articulation and transcription Phonology 1: Handout January 20, 2005 Consonants: articulation and transcription 1 Orientation phonetics [G. Phonetik]: the study of the physical and physiological aspects of human sound production and

More information

Phonetics. The Sound of Language

Phonetics. The Sound of Language Phonetics. The Sound of Language 1 The Description of Sounds Fromkin & Rodman: An Introduction to Language. Fort Worth etc., Harcourt Brace Jovanovich Read: Chapter 5, (p. 176ff.) (or the corresponding

More information

Speaker Recognition. Speaker Diarization and Identification

Speaker Recognition. Speaker Diarization and Identification Speaker Recognition Speaker Diarization and Identification A dissertation submitted to the University of Manchester for the degree of Master of Science in the Faculty of Engineering and Physical Sciences

More information

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Amit Juneja and Carol Espy-Wilson Department of Electrical and Computer Engineering University of Maryland,

More information

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Prof. Ch.Srinivasa Kumar Prof. and Head of department. Electronics and communication Nalanda Institute

More information

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Hua Zhang, Yun Tang, Wenju Liu and Bo Xu National Laboratory of Pattern Recognition Institute of Automation, Chinese

More information

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

On Developing Acoustic Models Using HTK. M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. Delft, December 2004 Copyright c 2004 M.A. Spaans BSc. December, 2004. Faculty of Electrical

More information

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech Dept. for Speech, Music and Hearing Quarterly Progress and Status Report VCV-sequencies in a preliminary text-to-speech system for female speech Karlsson, I. and Neovius, L. journal: STL-QPSR volume: 35

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 2aSC: Linking Perception and Production

More information

age, Speech and Hearii

age, Speech and Hearii age, Speech and Hearii 1 Speech Commun cation tion 2 Sensory Comm, ection i 298 RLE Progress Report Number 132 Section 1 Speech Communication Chapter 1 Speech Communication 299 300 RLE Progress Report

More information

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 11/2007, ISSN 1642-6037 Marek WIŚNIEWSKI *, Wiesława KUNISZYK-JÓŹKOWIAK *, Elżbieta SMOŁKA *, Waldemar SUSZYŃSKI * HMM, recognition, speech, disorders

More information

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH Mietta Lennes Most of the phonetic knowledge that is currently available on spoken Finnish is based on clearly pronounced speech: either readaloud

More information

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015 Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development Indiana, November, 2015 Louisa C. Moats, Ed.D. (louisa.moats@gmail.com) meaning (semantics) discourse structure morphology

More information

Universal contrastive analysis as a learning principle in CAPT

Universal contrastive analysis as a learning principle in CAPT Universal contrastive analysis as a learning principle in CAPT Jacques Koreman, Preben Wik, Olaf Husby, Egil Albertsen Department of Language and Communication Studies, NTNU, Trondheim, Norway jacques.koreman@ntnu.no,

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

Speaker recognition using universal background model on YOHO database

Speaker recognition using universal background model on YOHO database Aalborg University Master Thesis project Speaker recognition using universal background model on YOHO database Author: Alexandre Majetniak Supervisor: Zheng-Hua Tan May 31, 2011 The Faculties of Engineering,

More information

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver.1 (Mar - Apr.2015), PP 55-61 www.iosrjournals.org Analysis of Emotion

More information

Contrasting English Phonology and Nigerian English Phonology

Contrasting English Phonology and Nigerian English Phonology Contrasting English Phonology and Nigerian English Phonology Saleh, A. J. Rinji, D.N. ABSTRACT The thrust of this work is the fact that phonology plays a vital role in language and communication both in

More information

Segregation of Unvoiced Speech from Nonspeech Interference

Segregation of Unvoiced Speech from Nonspeech Interference Technical Report OSU-CISRC-8/7-TR63 Department of Computer Science and Engineering The Ohio State University Columbus, OH 4321-1277 FTP site: ftp.cse.ohio-state.edu Login: anonymous Directory: pub/tech-report/27

More information

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION by Adam B. Buchwald A dissertation submitted to The Johns Hopkins University in conformity with the requirements

More information

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

The analysis starts with the phonetic vowel and consonant charts based on the dataset: Ling 113 Homework 5: Hebrew Kelli Wiseth February 13, 2014 The analysis starts with the phonetic vowel and consonant charts based on the dataset: a) Given that the underlying representation for all verb

More information

Lecture 9: Speech Recognition

Lecture 9: Speech Recognition EE E6820: Speech & Audio Processing & Recognition Lecture 9: Speech Recognition 1 Recognizing speech 2 Feature calculation Dan Ellis Michael Mandel 3 Sequence

More information

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all Human Communication Science Chandler House, 2 Wakefield Street London WC1N 1PF http://www.hcs.ucl.ac.uk/ ACOUSTICS OF SPEECH INTELLIGIBILITY IN DYSARTHRIA EUROPEAN MASTER S S IN CLINICAL LINGUISTICS UNIVERSITY

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Nord, L. and Hammarberg, B. and Lundström, E. journal:

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations Post-vocalic spirantization: Typology and phonetic motivations Alan C-L Yu University of California, Berkeley 0. Introduction Spirantization involves a stop consonant becoming a weak fricative (e.g., B,

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

Phonological Processing for Urdu Text to Speech System

Phonological Processing for Urdu Text to Speech System Phonological Processing for Urdu Text to Speech System Sarmad Hussain Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, B Block, Faisal Town, Lahore,

More information

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical Database Structure 1 This database, compiled by Merritt Ruhlen, contains certain kinds of linguistic and nonlinguistic information for the world s roughly 5,000 languages. This introduction will discuss

More information

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence INTERSPEECH September,, San Francisco, USA Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence Bidisha Sharma and S. R. Mahadeva Prasanna Department of Electronics

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project Phonetic- and Speaker-Discriminant Features for Speaker Recognition by Lara Stoll Research Project Submitted to the Department of Electrical Engineering and Computer Sciences, University of California

More information

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language Z.HACHKAR 1,3, A. FARCHI 2, B.MOUNIR 1, J. EL ABBADI 3 1 Ecole Supérieure de Technologie, Safi, Morocco. zhachkar2000@yahoo.fr.

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

Body-Conducted Speech Recognition and its Application to Speech Support System

Body-Conducted Speech Recognition and its Application to Speech Support System Body-Conducted Speech Recognition and its Application to Speech Support System 4 Shunsuke Ishimitsu Hiroshima City University Japan 1. Introduction In recent years, speech recognition systems have been

More information

Journal of Phonetics

Journal of Phonetics Journal of Phonetics 40 (2012) 595 607 Contents lists available at SciVerse ScienceDirect Journal of Phonetics journal homepage: www.elsevier.com/locate/phonetics How linguistic and probabilistic properties

More information

A comparison of spectral smoothing methods for segment concatenation based speech synthesis

A comparison of spectral smoothing methods for segment concatenation based speech synthesis D.T. Chappell, J.H.L. Hansen, "Spectral Smoothing for Speech Segment Concatenation, Speech Communication, Volume 36, Issues 3-4, March 2002, Pages 343-373. A comparison of spectral smoothing methods for

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Pobrane z czasopisma New Horizons in English Studies  Data: 18/11/ :52:20. New Horizons in English Studies 1/2016 LANGUAGE Maria Curie-Skłodowska University () in Lublin k.laidler.umcs@gmail.com Online Adaptation of Word-initial Ukrainian CC Consonant Clusters by Native Speakers of English Abstract. The phenomenon

More information

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING Master s thesis Aino Saarelainen University of Jyväskylä Department of Languages English September 2016 JYVÄSKYLÄN YLIOPISTO

More information

WHEN THERE IS A mismatch between the acoustic

WHEN THERE IS A mismatch between the acoustic 808 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition Jeih-Weih Hung, Member,

More information

Speaker Identification by Comparison of Smart Methods. Abstract

Speaker Identification by Comparison of Smart Methods. Abstract Journal of mathematics and computer science 10 (2014), 61-71 Speaker Identification by Comparison of Smart Methods Ali Mahdavi Meimand Amin Asadi Majid Mohamadi Department of Electrical Department of Computer

More information

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access Joyce McDonough 1, Heike Lenhert-LeHouiller 1, Neil Bardhan 2 1 Linguistics

More information

Consonant-Vowel Unity in Element Theory*

Consonant-Vowel Unity in Element Theory* Consonant-Vowel Unity in Element Theory* Phillip Backley Tohoku Gakuin University Kuniya Nasukawa Tohoku Gakuin University ABSTRACT. This paper motivates the Element Theory view that vowels and consonants

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1 , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin 1 Title: Jaw and order Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin Short title: Production of coronal consonants Acknowledgements This work was partially supported

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Audible and visible speech

Audible and visible speech Building sensori-motor prototypes from audiovisual exemplars Gérard BAILLY Institut de la Communication Parlée INPG & Université Stendhal 46, avenue Félix Viallet, 383 Grenoble Cedex, France web: http://www.icp.grenet.fr/bailly

More information

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION The Journey to Vowelerria An adventure across familiar territory child speech intervention leading to uncommon terrain vowel errors, Ph.D., CCC-SLP 03-15-14

More information

Richardson, J., The Next Step in Guided Writing, Ohio Literacy Conference, 2010

Richardson, J., The Next Step in Guided Writing, Ohio Literacy Conference, 2010 1 Procedures and Expectations for Guided Writing Procedures Context: Students write a brief response to the story they read during guided reading. At emergent levels, use dictated sentences that include

More information

The Indian English of Tibeto-Burman language speakers*

The Indian English of Tibeto-Burman language speakers* The Indian English of Tibeto-Burman language speakers* Caroline R. Wiltshire University of Florida English as spoken as a second language in India (IE) has developed different sound patterns from other

More information

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin Stromswold & Rifkin, Language Acquisition by MZ & DZ SLI Twins (SRCLD, 1996) 1 Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin Dept. of Psychology & Ctr. for

More information

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish Carmen Lie-Lahuerta Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish I t is common knowledge that foreign learners struggle when it comes to producing the sounds of the target language

More information

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Perceptual scaling of voice identity: common dimensions for different vowels and speakers DOI 10.1007/s00426-008-0185-z ORIGINAL ARTICLE Perceptual scaling of voice identity: common dimensions for different vowels and speakers Oliver Baumann Æ Pascal Belin Received: 15 February 2008 / Accepted:

More information

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS Natalia Zharkova 1, William J. Hardcastle 1, Fiona E. Gibbon 2 & Robin J. Lickley 1 1 CASL Research Centre, Queen Margaret University, Edinburgh

More information

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University Linguistics 220 Phonology: distributions and the concept of the phoneme John Alderete, Simon Fraser University Foundations in phonology Outline 1. Intuitions about phonological structure 2. Contrastive

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

9 Sound recordings: acoustic and articulatory data

9 Sound recordings: acoustic and articulatory data 9 Sound recordings: acoustic and articulatory data Robert J. Podesva and Elizabeth Zsiga 1 Introduction Linguists, across the subdisciplines of the field, use sound recordings for a great many purposes

More information

Vowel mispronunciation detection using DNN acoustic models with cross-lingual training

Vowel mispronunciation detection using DNN acoustic models with cross-lingual training INTERSPEECH 2015 Vowel mispronunciation detection using DNN acoustic models with cross-lingual training Shrikant Joshi, Nachiket Deo, Preeti Rao Department of Electrical Engineering, Indian Institute of

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

RP ENGLISH AND CASTILIAN SPANISH DIPHTHONGS REVISITED FROM THE BEATS-AND-BINDING PERSPECTIVE

RP ENGLISH AND CASTILIAN SPANISH DIPHTHONGS REVISITED FROM THE BEATS-AND-BINDING PERSPECTIVE Poznań Studies in Contemporary Linguistics 44(1), March 2008, pp. 37 60 School of English, Adam Mickiewicz University, Poznań, Poland doi:10.2478/v10010-008-0003-1 RP ENGLISH AND CASTILIAN SPANISH DIPHTHONGS

More information

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 Ranniery Maia 1,2, Jinfu Ni 1,2, Shinsuke Sakai 1,2, Tomoki Toda 1,3, Keiichi Tokuda 1,4 Tohru Shimizu 1,2, Satoshi Nakamura 1,2 1 National

More information

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA LANGUAGE AND SPEECH, 2009, 52 (4), 391 413 391 Variability in Word Duration as a Function of Probability, Speech Style, and Prosody Rachel E. Baker, Ann R. Bradlow Northwestern University, Evanston, IL,

More information

Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition

Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition Seltzer, M.L.; Raj, B.; Stern, R.M. TR2004-088 December 2004 Abstract

More information

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES MODELING IMPROVED AMHARIC SYLLBIFICATION ALGORITHM

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES MODELING IMPROVED AMHARIC SYLLBIFICATION ALGORITHM ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES MODELING IMPROVED AMHARIC SYLLBIFICATION ALGORITHM BY NIRAYO HAILU GEBREEGZIABHER A THESIS SUBMITED TO THE SCHOOL OF GRADUATE STUDIES OF ADDIS ABABA UNIVERSITY

More information

Language Change: Progress or Decay?

Language Change: Progress or Decay? Language Change: Progress or Decay? Fourth edition How and why do languages change? Where does the evidence of language change come from? How do languages begin and end? This introduction to language change

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Self-Supervised Acquisition of Vowels in American English

Self-Supervised Acquisition of Vowels in American English Self-Supervised cquisition of Vowels in merican English Michael H. Coen MIT Computer Science and rtificial Intelligence Laboratory 32 Vassar Street Cambridge, M 2139 mhcoen@csail.mit.edu bstract This paper

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

Beginning primarily with the investigations of Zimmermann (1980a),

Beginning primarily with the investigations of Zimmermann (1980a), Orofacial Movements Associated With Fluent Speech in Persons Who Stutter Michael D. McClean Walter Reed Army Medical Center, Washington, D.C. Stephen M. Tasko Western Michigan University, Kalamazoo, MI

More information

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION Mitchell McLaren 1, Yun Lei 1, Luciana Ferrer 2 1 Speech Technology and Research Laboratory, SRI International, California, USA 2 Departamento

More information

On the nature of voicing assimilation(s)

On the nature of voicing assimilation(s) On the nature of voicing assimilation(s) Wouter Jansen Clinical Language Sciences Leeds Metropolitan University W.Jansen@leedsmet.ac.uk http://www.kuvik.net/wjansen March 15, 2006 On the nature of voicing

More information

Word Stress and Intonation: Introduction

Word Stress and Intonation: Introduction Word Stress and Intonation: Introduction WORD STRESS One or more syllables of a polysyllabic word have greater prominence than the others. Such syllables are said to be accented or stressed. Word stress

More information

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System ARCHIVES OF ACOUSTICS Vol. 42, No. 3, pp. 375 383 (2017) Copyright c 2017 by PAN IPPT DOI: 10.1515/aoa-2017-0039 Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer The magnetic resonance imaging subset of the mngu0 articulatory corpus Citation for published version: Steiner, I, Richmond, K, Marshall, I & Gray, C 2012, 'The magnetic resonance

More information

Phonological and Phonetic Representations: The Case of Neutralization

Phonological and Phonetic Representations: The Case of Neutralization Phonological and Phonetic Representations: The Case of Neutralization Allard Jongman University of Kansas 1. Introduction The present paper focuses on the phenomenon of phonological neutralization to consider

More information

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch The pronunciation of /7i/ by male and female speakers of avant-garde Dutch Vincent J. van Heuven, Loulou Edelman and Renée van Bezooijen Leiden University/ ULCL (van Heuven) / University of Nijmegen/ CLS

More information

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Linguistic Portfolios Volume 6 Article 10 2017 An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Cassy Lundy St. Cloud State University, casey.lundy@gmail.com

More information

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction INTERSPEECH 2015 Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction Akihiro Abe, Kazumasa Yamamoto, Seiichi Nakagawa Department of Computer

More information

ABSTRACT. Some children with speech sound disorders (SSD) have difficulty with literacyrelated

ABSTRACT. Some children with speech sound disorders (SSD) have difficulty with literacyrelated ABSTRACT Some children with speech sound disorders (SSD) have difficulty with literacyrelated skills. In particular, they often have trouble with phonological processing, which is a robust predictor of

More information

Radical CV Phonology: the locational gesture *

Radical CV Phonology: the locational gesture * Radical CV Phonology: the locational gesture * HARRY VAN DER HULST 1 Goals 'Radical CV Phonology' is a variant of Dependency Phonology (Anderson and Jones 1974, Anderson & Ewen 1980, Ewen 1980, Lass 1984,

More information

A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY-AWARE DEEP NEURAL NETWORK. Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren

A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY-AWARE DEEP NEURAL NETWORK. Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY-AWARE DEEP NEURAL NETWORK Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren Speech Technology and Research Laboratory, SRI International,

More information

Automatic segmentation of continuous speech using minimum phase group delay functions

Automatic segmentation of continuous speech using minimum phase group delay functions Speech Communication 42 (24) 429 446 www.elsevier.com/locate/specom Automatic segmentation of continuous speech using minimum phase group delay functions V. Kamakshi Prasad, T. Nagarajan *, Hema A. Murthy

More information

SARDNET: A Self-Organizing Feature Map for Sequences

SARDNET: A Self-Organizing Feature Map for Sequences SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu

More information

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Koshi Odagiri 1, and Yoichi Muraoka 1 1 Graduate School of Fundamental/Computer Science and Engineering, Waseda University,

More information

Manner assimilation in Uyghur

Manner assimilation in Uyghur Manner assimilation in Uyghur Suyeon Yun (suyeon@mit.edu) 10th Workshop on Altaic Formal Linguistics (1) Possible patterns of manner assimilation in nasal-liquid sequences (a) Regressive assimilation lateralization:

More information

Understanding and Supporting Dyslexia Godstone Village School. January 2017

Understanding and Supporting Dyslexia Godstone Village School. January 2017 Understanding and Supporting Dyslexia Godstone Village School January 2017 By then end of the session I will: Have a greater understanding of Dyslexia and the ways in which children can be affected by

More information

Rhythm-typology revisited.

Rhythm-typology revisited. DFG Project BA 737/1: "Cross-language and individual differences in the production and perception of syllabic prominence. Rhythm-typology revisited." Rhythm-typology revisited. B. Andreeva & W. Barry Jacques

More information

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling 2008 Intermediate Level Skills Workbook Group 2 Groups 1 & 2 The ABCs of O-G The Flynn System by Emi Flynn Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling The ABCs of O-G

More information

Evaluation of Various Methods to Calculate the EGG Contact Quotient

Evaluation of Various Methods to Calculate the EGG Contact Quotient Diploma Thesis in Music Acoustics (Examensarbete 20 p) Evaluation of Various Methods to Calculate the EGG Contact Quotient Christian Herbst Mozarteum, Salzburg, Austria Work carried out under the ERASMUS

More information

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence Bistra Andreeva 1, William Barry 1, Jacques Koreman 2 1 Saarland University Germany 2 Norwegian University of Science and

More information

Complexity in Second Language Phonology Acquisition

Complexity in Second Language Phonology Acquisition Complexity in Second Language Phonology Acquisition Complexidade na aquisição da fonologia de segunda língua Ronaldo Mangueira Lima Júnior* Universidade de Brasília (UnB) Brasília/DF Brasil ABSTRACT: This

More information

International Journal of Advanced Networking Applications (IJANA) ISSN No. :

International Journal of Advanced Networking Applications (IJANA) ISSN No. : International Journal of Advanced Networking Applications (IJANA) ISSN No. : 0975-0290 34 A Review on Dysarthric Speech Recognition Megha Rughani Department of Electronics and Communication, Marwadi Educational

More information

Quarterly Progress and Status Report. Sound symbolism in deictic words

Quarterly Progress and Status Report. Sound symbolism in deictic words Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Sound symbolism in deictic words Traunmüller, H. journal: TMH-QPSR volume: 37 number: 2 year: 1996 pages: 147-150 http://www.speech.kth.se/qpsr

More information

PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS

PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS PHONETIC DISTANCE BASED ACCENT CLASSIFIER TO IDENTIFY PRONUNCIATION VARIANTS AND OOV WORDS Akella Amarendra Babu 1 *, Ramadevi Yellasiri 2 and Akepogu Ananda Rao 3 1 JNIAS, JNT University Anantapur, Ananthapuramu,

More information

Vimala.C Project Fellow, Department of Computer Science Avinashilingam Institute for Home Science and Higher Education and Women Coimbatore, India

Vimala.C Project Fellow, Department of Computer Science Avinashilingam Institute for Home Science and Higher Education and Women Coimbatore, India World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 2, No. 1, 1-7, 2012 A Review on Challenges and Approaches Vimala.C Project Fellow, Department of Computer Science

More information