David Weenink. First semester 2007

Size: px
Start display at page:

Download "David Weenink. First semester 2007"

Transcription

1 Institute of Phonetic Sciences University of Amsterdam First semester 2007

2

3 The Speech Organs

4 Speech Speech production organs determine acoustic characteristics of speech sounds. Motivation Find explanations for acoustical attributes of sounds Relation vocal tract shape and formants Why are female formants higher than males Characteristics of nasal and oral sounds Fricative sounds Spectral slope of vowels

5 Speech Speech production organs determine acoustic characteristics of speech sounds. Motivation Find explanations for acoustical attributes of sounds Relation vocal tract shape and formants Why are female formants higher than males Characteristics of nasal and oral sounds Fricative sounds Spectral slope of vowels

6 Speech Speech production organs determine acoustic characteristics of speech sounds. Motivation Find explanations for acoustical attributes of sounds Relation vocal tract shape and formants Why are female formants higher than males Characteristics of nasal and oral sounds Fricative sounds Spectral slope of vowels

7 Speech Speech production organs determine acoustic characteristics of speech sounds. Motivation Find explanations for acoustical attributes of sounds Relation vocal tract shape and formants Why are female formants higher than males Characteristics of nasal and oral sounds Fricative sounds Spectral slope of vowels

8 Speech Speech production organs determine acoustic characteristics of speech sounds. Motivation Find explanations for acoustical attributes of sounds Relation vocal tract shape and formants Why are female formants higher than males Characteristics of nasal and oral sounds Fricative sounds Spectral slope of vowels

9 Production of Speech Main processes in production of speech Sound source (glottis +/ turbulent airstream) Shape of vocal tract Radiation from the mouth Energy losses

10 Production of Speech Main processes in production of speech Sound source (glottis +/ turbulent airstream) Shape of vocal tract Radiation from the mouth Energy losses

11 Production of Speech Main processes in production of speech Sound source (glottis +/ turbulent airstream) Shape of vocal tract Radiation from the mouth Energy losses

12 Production of Speech Main processes in production of speech Sound source (glottis +/ turbulent airstream) Shape of vocal tract Radiation from the mouth Energy losses

13 Production of Speech Main processes in production of speech Sound source (glottis +/ turbulent airstream) Shape of vocal tract Radiation from the mouth Energy losses Model: These processes are independent

14 Source-Filter Theory The Source-Filter theory models the production aparatus as two independent units: The source (the glottal source or noise generated at a constriction) The filter (resonances in the cavities of the vocal tract)

15 Source-Filter Theory The Source-Filter theory models the production aparatus as two independent units: The source (the glottal source or noise generated at a constriction) The filter (resonances in the cavities of the vocal tract)

16 Source-Filter Theory The Source-Filter theory models the production aparatus as two independent units: The source (the glottal source or noise generated at a constriction) The filter (resonances in the cavities of the vocal tract) A speech sound is the result of a source signal being filtered

17 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

18 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

19 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

20 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

21 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

22 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

23 From: (Rosenberg, 1971) Excitation of the vocal tract By volume velocity at glottis Is pulse-like (open and closed phase) Primarily because of rapid closure of glottis Slope at closure increases with increasing vocal effort Pitch or intensity then t open Waveform more sinusoidal Pitch or intensity then t open and slope-at-closure Damping of formants higher on open phase

24 Glottal flow Glottal flow derivative Time (normalized) Time (normalized) Open phase Closed phase

25 Creating a Source from Pitch Targets Create PitchTier... source Add point Add point To PointProcess To Sound (phonation) x1=1000 y1=40 x2=8000 y2=40-3*12 b = 1/(x2-x1)*ln(y1/y2) Draw function... x1 x *exp(-b*(x-x1)) Time (s) Sound pressure level (db/hz) Frequency (Hz)

26 Creating a Noise Source For fricatives we need a noise source: Create Sound from formula... noise Mono randomGauss(0,0.2) Time (s) Sound pressure level (db/hz) Frequency (Hz)

27 Tube Models for Vowels Curvature of tract can be neglected! Only cross-sectional area Diameters equal over large lengths of vocal tract Lossless Number of segments 1...

28 Tube Models for Vowels Curvature of tract can be neglected! Only cross-sectional area Diameters equal over large lengths of vocal tract Lossless Number of segments 1...

29 Tube Models for Vowels Curvature of tract can be neglected! Only cross-sectional area Diameters equal over large lengths of vocal tract Lossless Number of segments 1...

30 Tube Models for Vowels Curvature of tract can be neglected! Only cross-sectional area Diameters equal over large lengths of vocal tract Lossless Number of segments 1...

31 Tube Models for Vowels Curvature of tract can be neglected! Only cross-sectional area Diameters equal over large lengths of vocal tract Lossless Number of segments 1...

32 The 1-tube closed end: anti-node open end: node Closed-open tube For length l: (2n 1) λ 4 We use λf = c and obtain F n = (2n 1)c 4l Shorter length, higher formants! c = 340 m/s Male l = 0.17 m F n = 500, 1500, 2500, 3500,... Female l = m F n = 586, 1759, 2931, 4103,...

33 Deductions from a Straight Tube Constriction at node/anti-node decreases/increases resonance frequency. at lip-end always a node so rounding causes lowering of all formants velar constriction at node of F 2 : lowering u

34 The 2-tube Equal or unequal section lengths: not all vowels can be simulated only some peripheral ones

35 The 4-tube of Fant (1960) the constriction area: 1 segment (fixed length: 0.05m) before and after the constriction: 2 segments the lips: 1 segment (fixed length: 0.01m) Three parameter model of Fant (1960) 1 1 distance of constriction from glottis 2 constriction area 3 lip opening Effect of lip-rounding was a lowering of F 2 : [i] vs [y] 1 G. Fant (1960),The Speech Production,Mouton: The Hague.

36 Fricatives Relation between vocal tract shape and speech waveform is obscure. Noise source location varies in vocal tract Limited resources of X-ray data for model validation Depend less on tract shape than vowels

37 Nasals Opening of an extra cavity: difficult to model Nasal formants: fixed tube somewhat longer than vocal tract Uvular + post-velar: Nasal formants at 300/400 +k800 Palatal to labial: anti-formants at c 4l mouth

38 [Rosenberg, 1971] A.E. Rosenberg (1971), Effect of glottal pulse shape on the quality of natural vowels, J. Acoust. Soc. Am. (49),

Phonetics. The Sound of Language

Phonetics. The Sound of Language Phonetics. The Sound of Language 1 The Description of Sounds Fromkin & Rodman: An Introduction to Language. Fort Worth etc., Harcourt Brace Jovanovich Read: Chapter 5, (p. 176ff.) (or the corresponding

More information

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech Dept. for Speech, Music and Hearing Quarterly Progress and Status Report VCV-sequencies in a preliminary text-to-speech system for female speech Karlsson, I. and Neovius, L. journal: STL-QPSR volume: 35

More information

Consonants: articulation and transcription

Consonants: articulation and transcription Phonology 1: Handout January 20, 2005 Consonants: articulation and transcription 1 Orientation phonetics [G. Phonetik]: the study of the physical and physiological aspects of human sound production and

More information

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers October 31, 2003 Amit Juneja Department of Electrical and Computer Engineering University of Maryland, College Park,

More information

age, Speech and Hearii

age, Speech and Hearii age, Speech and Hearii 1 Speech Commun cation tion 2 Sensory Comm, ection i 298 RLE Progress Report Number 132 Section 1 Speech Communication Chapter 1 Speech Communication 299 300 RLE Progress Report

More information

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

On Developing Acoustic Models Using HTK. M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. Delft, December 2004 Copyright c 2004 M.A. Spaans BSc. December, 2004. Faculty of Electrical

More information

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Perceptual scaling of voice identity: common dimensions for different vowels and speakers DOI 10.1007/s00426-008-0185-z ORIGINAL ARTICLE Perceptual scaling of voice identity: common dimensions for different vowels and speakers Oliver Baumann Æ Pascal Belin Received: 15 February 2008 / Accepted:

More information

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Prof. Ch.Srinivasa Kumar Prof. and Head of department. Electronics and communication Nalanda Institute

More information

Speaker recognition using universal background model on YOHO database

Speaker recognition using universal background model on YOHO database Aalborg University Master Thesis project Speaker recognition using universal background model on YOHO database Author: Alexandre Majetniak Supervisor: Zheng-Hua Tan May 31, 2011 The Faculties of Engineering,

More information

Speaker Recognition. Speaker Diarization and Identification

Speaker Recognition. Speaker Diarization and Identification Speaker Recognition Speaker Diarization and Identification A dissertation submitted to the University of Manchester for the degree of Master of Science in the Faculty of Engineering and Physical Sciences

More information

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Hua Zhang, Yun Tang, Wenju Liu and Bo Xu National Laboratory of Pattern Recognition Institute of Automation, Chinese

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

On the Formation of Phoneme Categories in DNN Acoustic Models

On the Formation of Phoneme Categories in DNN Acoustic Models On the Formation of Phoneme Categories in DNN Acoustic Models Tasha Nagamine Department of Electrical Engineering, Columbia University T. Nagamine Motivation Large performance gap between humans and state-

More information

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations Post-vocalic spirantization: Typology and phonetic motivations Alan C-L Yu University of California, Berkeley 0. Introduction Spirantization involves a stop consonant becoming a weak fricative (e.g., B,

More information

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Amit Juneja and Carol Espy-Wilson Department of Electrical and Computer Engineering University of Maryland,

More information

A comparison of spectral smoothing methods for segment concatenation based speech synthesis

A comparison of spectral smoothing methods for segment concatenation based speech synthesis D.T. Chappell, J.H.L. Hansen, "Spectral Smoothing for Speech Segment Concatenation, Speech Communication, Volume 36, Issues 3-4, March 2002, Pages 343-373. A comparison of spectral smoothing methods for

More information

Consonant-Vowel Unity in Element Theory*

Consonant-Vowel Unity in Element Theory* Consonant-Vowel Unity in Element Theory* Phillip Backley Tohoku Gakuin University Kuniya Nasukawa Tohoku Gakuin University ABSTRACT. This paper motivates the Element Theory view that vowels and consonants

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Study questions , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

Audible and visible speech

Audible and visible speech Building sensori-motor prototypes from audiovisual exemplars Gérard BAILLY Institut de la Communication Parlée INPG & Université Stendhal 46, avenue Félix Viallet, 383 Grenoble Cedex, France web: http://www.icp.grenet.fr/bailly

More information

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver.1 (Mar - Apr.2015), PP 55-61 www.iosrjournals.org Analysis of Emotion

More information

9 Sound recordings: acoustic and articulatory data

9 Sound recordings: acoustic and articulatory data 9 Sound recordings: acoustic and articulatory data Robert J. Podesva and Elizabeth Zsiga 1 Introduction Linguists, across the subdisciplines of the field, use sound recordings for a great many purposes

More information

Radical CV Phonology: the locational gesture *

Radical CV Phonology: the locational gesture * Radical CV Phonology: the locational gesture * HARRY VAN DER HULST 1 Goals 'Radical CV Phonology' is a variant of Dependency Phonology (Anderson and Jones 1974, Anderson & Ewen 1980, Ewen 1980, Lass 1984,

More information

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all Human Communication Science Chandler House, 2 Wakefield Street London WC1N 1PF http://www.hcs.ucl.ac.uk/ ACOUSTICS OF SPEECH INTELLIGIBILITY IN DYSARTHRIA EUROPEAN MASTER S S IN CLINICAL LINGUISTICS UNIVERSITY

More information

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Nord, L. and Hammarberg, B. and Lundström, E. journal:

More information

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical Database Structure 1 This database, compiled by Merritt Ruhlen, contains certain kinds of linguistic and nonlinguistic information for the world s roughly 5,000 languages. This introduction will discuss

More information

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS Natalia Zharkova 1, William J. Hardcastle 1, Fiona E. Gibbon 2 & Robin J. Lickley 1 1 CASL Research Centre, Queen Margaret University, Edinburgh

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 2aSC: Linking Perception and Production

More information

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH Mietta Lennes Most of the phonetic knowledge that is currently available on spoken Finnish is based on clearly pronounced speech: either readaloud

More information

Body-Conducted Speech Recognition and its Application to Speech Support System

Body-Conducted Speech Recognition and its Application to Speech Support System Body-Conducted Speech Recognition and its Application to Speech Support System 4 Shunsuke Ishimitsu Hiroshima City University Japan 1. Introduction In recent years, speech recognition systems have been

More information

Klaus Zuberbühler c) School of Psychology, University of St. Andrews, St. Andrews, Fife KY16 9JU, Scotland, United Kingdom

Klaus Zuberbühler c) School of Psychology, University of St. Andrews, St. Andrews, Fife KY16 9JU, Scotland, United Kingdom Published in The Journal of the Acoustical Society of America, Vol. 114, Issue 2, 2003, p. 1132-1142 which should be used for any reference to this work 1 The relationship between acoustic structure and

More information

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence INTERSPEECH September,, San Francisco, USA Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence Bidisha Sharma and S. R. Mahadeva Prasanna Department of Electronics

More information

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin 1 Title: Jaw and order Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin Short title: Production of coronal consonants Acknowledgements This work was partially supported

More information

Quarterly Progress and Status Report. Sound symbolism in deictic words

Quarterly Progress and Status Report. Sound symbolism in deictic words Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Sound symbolism in deictic words Traunmüller, H. journal: TMH-QPSR volume: 37 number: 2 year: 1996 pages: 147-150 http://www.speech.kth.se/qpsr

More information

Perceptual Auditory Aftereffects on Voice Identity Using Brief Vowel Stimuli

Perceptual Auditory Aftereffects on Voice Identity Using Brief Vowel Stimuli Perceptual Auditory Aftereffects on Voice Identity Using Brief Vowel Stimuli Marianne Latinus 1,3 *, Pascal Belin 1,2 1 Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, United

More information

Segregation of Unvoiced Speech from Nonspeech Interference

Segregation of Unvoiced Speech from Nonspeech Interference Technical Report OSU-CISRC-8/7-TR63 Department of Computer Science and Engineering The Ohio State University Columbus, OH 4321-1277 FTP site: ftp.cse.ohio-state.edu Login: anonymous Directory: pub/tech-report/27

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

Expressive speech synthesis: a review

Expressive speech synthesis: a review Int J Speech Technol (2013) 16:237 260 DOI 10.1007/s10772-012-9180-2 Expressive speech synthesis: a review D. Govind S.R. Mahadeva Prasanna Received: 31 May 2012 / Accepted: 11 October 2012 / Published

More information

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch The pronunciation of /7i/ by male and female speakers of avant-garde Dutch Vincent J. van Heuven, Loulou Edelman and Renée van Bezooijen Leiden University/ ULCL (van Heuven) / University of Nijmegen/ CLS

More information

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1

Affricates. Affricates, nasals, laterals and continuants. Affricates. Affricates. Affricates. Affricates 11/20/2015. Phonetics of English 1 , nasals, laterals and continuants Phonetics of English 1 1. Tip artikulacije (type of articulation) /tʃ, dʒ/ su suglasnici (consonants) 2. Način artikulacije (manner of articulation) /tʃ, dʒ/ su afrikati

More information

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access Joyce McDonough 1, Heike Lenhert-LeHouiller 1, Neil Bardhan 2 1 Linguistics

More information

Rhythm-typology revisited.

Rhythm-typology revisited. DFG Project BA 737/1: "Cross-language and individual differences in the production and perception of syllabic prominence. Rhythm-typology revisited." Rhythm-typology revisited. B. Andreeva & W. Barry Jacques

More information

Voice conversion through vector quantization

Voice conversion through vector quantization J. Acoust. Soc. Jpn.(E)11, 2 (1990) Voice conversion through vector quantization Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, and Hisao Kuwabara A TR Interpreting Telephony Research Laboratories,

More information

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence Bistra Andreeva 1, William Barry 1, Jacques Koreman 2 1 Saarland University Germany 2 Norwegian University of Science and

More information

Online Publication Date: 01 May 1981 PLEASE SCROLL DOWN FOR ARTICLE

Online Publication Date: 01 May 1981 PLEASE SCROLL DOWN FOR ARTICLE This article was downloaded by:[university of Sussex] On: 15 July 2008 Access Details: [subscription number 776502344] Publisher: Psychology Press Informa Ltd Registered in England and Wales Registered

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Evaluation of Various Methods to Calculate the EGG Contact Quotient

Evaluation of Various Methods to Calculate the EGG Contact Quotient Diploma Thesis in Music Acoustics (Examensarbete 20 p) Evaluation of Various Methods to Calculate the EGG Contact Quotient Christian Herbst Mozarteum, Salzburg, Austria Work carried out under the ERASMUS

More information

Universal contrastive analysis as a learning principle in CAPT

Universal contrastive analysis as a learning principle in CAPT Universal contrastive analysis as a learning principle in CAPT Jacques Koreman, Preben Wik, Olaf Husby, Egil Albertsen Department of Language and Communication Studies, NTNU, Trondheim, Norway jacques.koreman@ntnu.no,

More information

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015 Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development Indiana, November, 2015 Louisa C. Moats, Ed.D. (louisa.moats@gmail.com) meaning (semantics) discourse structure morphology

More information

Learners Use Word-Level Statistics in Phonetic Category Acquisition

Learners Use Word-Level Statistics in Phonetic Category Acquisition Learners Use Word-Level Statistics in Phonetic Category Acquisition Naomi Feldman, Emily Myers, Katherine White, Thomas Griffiths, and James Morgan 1. Introduction * One of the first challenges that language

More information

Speaker Identification by Comparison of Smart Methods. Abstract

Speaker Identification by Comparison of Smart Methods. Abstract Journal of mathematics and computer science 10 (2014), 61-71 Speaker Identification by Comparison of Smart Methods Ali Mahdavi Meimand Amin Asadi Majid Mohamadi Department of Electrical Department of Computer

More information

U IVERSIDADE FEDERAL DE SA TA CATARI A PROGRAMA DE PÓS-GRADUAÇÃO EM LETRAS/I GLÊS E LITERATURA CORRESPO DE TE. Mariane Antero Alves

U IVERSIDADE FEDERAL DE SA TA CATARI A PROGRAMA DE PÓS-GRADUAÇÃO EM LETRAS/I GLÊS E LITERATURA CORRESPO DE TE. Mariane Antero Alves U IVERSIDADE FEDERAL DE SA TA CATARI A PROGRAMA DE PÓS-GRADUAÇÃO EM LETRAS/I GLÊS E LITERATURA CORRESPO DE TE Mariane Antero Alves PRODUCTIO OF E GLISH A D PORTUGUESE VOICELESS STOPS BY BRAZILIA EFL SPEAKERS

More information

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan James White & Marc Garellek UCLA 1 Introduction Goals: To determine the acoustic correlates of primary and secondary

More information

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab Revisiting the role of prosody in early language acquisition Megha Sundara UCLA Phonetics Lab Outline Part I: Intonation has a role in language discrimination Part II: Do English-learning infants have

More information

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Linguistic Portfolios Volume 6 Article 10 2017 An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Cassy Lundy St. Cloud State University, casey.lundy@gmail.com

More information

Provisional. Using ambulatory voice monitoring to investigate common voice disorders: Research update

Provisional. Using ambulatory voice monitoring to investigate common voice disorders: Research update Using ambulatory voice monitoring to investigate common voice disorders: Research update Daryush D. Mehta 1, 2, 3*, Jarrad H. Van Stan 1, 3, Matías Zañartu 4, Marzyeh Ghassemi 5, John V. Guttag 5, Víctor

More information

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 Ranniery Maia 1,2, Jinfu Ni 1,2, Shinsuke Sakai 1,2, Tomoki Toda 1,3, Keiichi Tokuda 1,4 Tohru Shimizu 1,2, Satoshi Nakamura 1,2 1 National

More information

Contrasting English Phonology and Nigerian English Phonology

Contrasting English Phonology and Nigerian English Phonology Contrasting English Phonology and Nigerian English Phonology Saleh, A. J. Rinji, D.N. ABSTRACT The thrust of this work is the fact that phonology plays a vital role in language and communication both in

More information

Speech/Language Pathology Plan of Treatment

Speech/Language Pathology Plan of Treatment Caring for Your Quality of Life Patient s Last Name First Name MI HICN Speech/Language Pathology Plan of Treatment Provider Name LifeCare of Florida Primary Diagnosis(es) Provider No Onset Date SOC Date

More information

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish Carmen Lie-Lahuerta Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish I t is common knowledge that foreign learners struggle when it comes to producing the sounds of the target language

More information

Self-Supervised Acquisition of Vowels in American English

Self-Supervised Acquisition of Vowels in American English Self-Supervised Acquisition of Vowels in American English Michael H. Coen MIT Computer Science and Artificial Intelligence Laboratory 32 Vassar Street Cambridge, MA 2139 mhcoen@csail.mit.edu Abstract This

More information

NIH Public Access Author Manuscript Lang Speech. Author manuscript; available in PMC 2011 January 1.

NIH Public Access Author Manuscript Lang Speech. Author manuscript; available in PMC 2011 January 1. NIH Public Access Author Manuscript Published in final edited form as: Lang Speech. 2010 ; 53(Pt 1): 49 69. Spatial and Temporal Properties of Gestures in North American English /R/ Fiona Campbell, University

More information

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language Z.HACHKAR 1,3, A. FARCHI 2, B.MOUNIR 1, J. EL ABBADI 3 1 Ecole Supérieure de Technologie, Safi, Morocco. zhachkar2000@yahoo.fr.

More information

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen

MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING. Master s thesis Aino Saarelainen MASTERY OF PHONEMIC SYMBOLS AND STUDENT EXPERIENCES IN PRONUNCIATION TEACHING Master s thesis Aino Saarelainen University of Jyväskylä Department of Languages English September 2016 JYVÄSKYLÄN YLIOPISTO

More information

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION Han Shu, I. Lee Hetherington, and James Glass Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Cambridge,

More information

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System ARCHIVES OF ACOUSTICS Vol. 42, No. 3, pp. 375 383 (2017) Copyright c 2017 by PAN IPPT DOI: 10.1515/aoa-2017-0039 Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

More information

Different Task Type and the Perception of the English Interdental Fricatives

Different Task Type and the Perception of the English Interdental Fricatives Different Task Type and the Perception of the English Interdental Fricatives Mara Silvia Reis, Denise Cristina Kluge, Melissa Bettoni-Techio Federal University of Santa Catarina marasreis@hotmail.com,

More information

Complexity in Second Language Phonology Acquisition

Complexity in Second Language Phonology Acquisition Complexity in Second Language Phonology Acquisition Complexidade na aquisição da fonologia de segunda língua Ronaldo Mangueira Lima Júnior* Universidade de Brasília (UnB) Brasília/DF Brasil ABSTRACT: This

More information

A Privacy-Sensitive Approach to Modeling Multi-Person Conversations

A Privacy-Sensitive Approach to Modeling Multi-Person Conversations A Privacy-Sensitive Approach to Modeling Multi-Person Conversations Danny Wyatt Dept. of Computer Science University of Washington danny@cs.washington.edu Jeff Bilmes Dept. of Electrical Engineering University

More information

Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions

Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions 26 24th European Signal Processing Conference (EUSIPCO) Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions Emma Jokinen Department

More information

Language Change: Progress or Decay?

Language Change: Progress or Decay? Language Change: Progress or Decay? Fourth edition How and why do languages change? Where does the evidence of language change come from? How do languages begin and end? This introduction to language change

More information

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION by Adam B. Buchwald A dissertation submitted to The Johns Hopkins University in conformity with the requirements

More information

Self-Supervised Acquisition of Vowels in American English

Self-Supervised Acquisition of Vowels in American English Self-Supervised cquisition of Vowels in merican English Michael H. Coen MIT Computer Science and rtificial Intelligence Laboratory 32 Vassar Street Cambridge, M 2139 mhcoen@csail.mit.edu bstract This paper

More information

Journal of Phonetics

Journal of Phonetics Journal of Phonetics 41 (2013) 297 306 Contents lists available at SciVerse ScienceDirect Journal of Phonetics journal homepage: www.elsevier.com/locate/phonetics The role of intonation in language and

More information

Speak with Confidence The Art of Developing Presentations & Impromptu Speaking

Speak with Confidence The Art of Developing Presentations & Impromptu Speaking Speak with Confidence The Art of Developing Presentations & Impromptu Speaking Use this system as a guide, but don't be afraid to modify it to fit your needs. Remember the keys to delivering a successful

More information

WHEN THERE IS A mismatch between the acoustic

WHEN THERE IS A mismatch between the acoustic 808 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition Jeih-Weih Hung, Member,

More information

Markedness and Complex Stops: Evidence from Simplification Processes 1. Nick Danis Rutgers University

Markedness and Complex Stops: Evidence from Simplification Processes 1. Nick Danis Rutgers University Markedness and Complex Stops: Evidence from Simplification Processes 1 Nick Danis Rutgers University nick.danis@rutgers.edu WOCAL 8 Kyoto, Japan August 21-24, 2015 1 Introduction (1) Complex segments:

More information

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA LANGUAGE AND SPEECH, 2009, 52 (4), 391 413 391 Variability in Word Duration as a Function of Probability, Speech Style, and Prosody Rachel E. Baker, Ann R. Bradlow Northwestern University, Evanston, IL,

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 11/2007, ISSN 1642-6037 Marek WIŚNIEWSKI *, Wiesława KUNISZYK-JÓŹKOWIAK *, Elżbieta SMOŁKA *, Waldemar SUSZYŃSKI * HMM, recognition, speech, disorders

More information

Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5

Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5 Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5 Prajima Ingkapak BA*, Benjamas Prathanee PhD** * Curriculum and Instruction in Special Education, Faculty of Education,

More information

A study of speaker adaptation for DNN-based speech synthesis

A study of speaker adaptation for DNN-based speech synthesis A study of speaker adaptation for DNN-based speech synthesis Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King The Centre for Speech Technology Research (CSTR) University of Edinburgh,

More information

COMMUNICATION DISORDERS. Speech Production Process

COMMUNICATION DISORDERS. Speech Production Process Communication Disorders 165 implementing the methods selected; monitoring and evaluating the learning process to make sure progress is being made toward the goal; modifying or replacing strategies if they

More information

VIEW: An Assessment of Problem Solving Style

VIEW: An Assessment of Problem Solving Style 1 VIEW: An Assessment of Problem Solving Style Edwin C. Selby, Donald J. Treffinger, Scott G. Isaksen, and Kenneth Lauer This document is a working paper, the purposes of which are to describe the three

More information

Phonological and Phonetic Representations: The Case of Neutralization

Phonological and Phonetic Representations: The Case of Neutralization Phonological and Phonetic Representations: The Case of Neutralization Allard Jongman University of Kansas 1. Introduction The present paper focuses on the phenomenon of phonological neutralization to consider

More information

Automatic segmentation of continuous speech using minimum phase group delay functions

Automatic segmentation of continuous speech using minimum phase group delay functions Speech Communication 42 (24) 429 446 www.elsevier.com/locate/specom Automatic segmentation of continuous speech using minimum phase group delay functions V. Kamakshi Prasad, T. Nagarajan *, Hema A. Murthy

More information

Edinburgh Research Explorer

Edinburgh Research Explorer Edinburgh Research Explorer The magnetic resonance imaging subset of the mngu0 articulatory corpus Citation for published version: Steiner, I, Richmond, K, Marshall, I & Gray, C 2012, 'The magnetic resonance

More information

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** **Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** REANALYZING THE JAPANESE CODA NASAL IN OPTIMALITY THEORY 1 KATSURA AOYAMA University

More information

Journal of Phonetics

Journal of Phonetics Journal of Phonetics 40 (2012) 595 607 Contents lists available at SciVerse ScienceDirect Journal of Phonetics journal homepage: www.elsevier.com/locate/phonetics How linguistic and probabilistic properties

More information

Beginning primarily with the investigations of Zimmermann (1980a),

Beginning primarily with the investigations of Zimmermann (1980a), Orofacial Movements Associated With Fluent Speech in Persons Who Stutter Michael D. McClean Walter Reed Army Medical Center, Washington, D.C. Stephen M. Tasko Western Michigan University, Kalamazoo, MI

More information

Modern TTS systems. CS 294-5: Statistical Natural Language Processing. Types of Modern Synthesis. TTS Architecture. Text Normalization

Modern TTS systems. CS 294-5: Statistical Natural Language Processing. Types of Modern Synthesis. TTS Architecture. Text Normalization CS 294-5: Statistical Natural Language Processing Speech Synthesis Lecture 22: 12/4/05 Modern TTS systems 1960 s first full TTS Umeda et al (1968) 1970 s Joe Olive 1977 concatenation of linearprediction

More information

Sound and Meaning in Auditory Data Display

Sound and Meaning in Auditory Data Display Sound and Meaning in Auditory Data Display THOMAS HERMANN AND HELGE RITTER Invited Paper Auditory data display is an interdisciplinary field linking auditory perception research, sound engineering, data

More information

COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION

COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION Session 3532 COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION Thad B. Welch, Brian Jenkins Department of Electrical Engineering U.S. Naval Academy, MD Cameron H. G. Wright Department of Electrical

More information

Statistical Parametric Speech Synthesis

Statistical Parametric Speech Synthesis Statistical Parametric Speech Synthesis Heiga Zen a,b,, Keiichi Tokuda a, Alan W. Black c a Department of Computer Science and Engineering, Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya,

More information

The Indian English of Tibeto-Burman language speakers*

The Indian English of Tibeto-Burman language speakers* The Indian English of Tibeto-Burman language speakers* Caroline R. Wiltshire University of Florida English as spoken as a second language in India (IE) has developed different sound patterns from other

More information

Speaking Rate and Speech Movement Velocity Profiles

Speaking Rate and Speech Movement Velocity Profiles Journal of Speech and Hearing Research, Volume 36, 41-54, February 1993 Speaking Rate and Speech Movement Velocity Profiles Scott G. Adams The Toronto Hospital Toronto, Ontario, Canada Gary Weismer Raymond

More information

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project Phonetic- and Speaker-Discriminant Features for Speaker Recognition by Lara Stoll Research Project Submitted to the Department of Electrical Engineering and Computer Sciences, University of California

More information

Standards for Members of the American Handwriting Analysis Foundation

Standards for Members of the American Handwriting Analysis Foundation Standards for Members of the American Handwriting Analysis Foundation A. Purpose The purpose of this document is to provide a foundation for the development and evaluation of a set of standards for education,

More information

Inquiry Space: Using Graphs as a Tool to Understand Experiments

Inquiry Space: Using Graphs as a Tool to Understand Experiments Inquiry Space: Using Graphs as a Tool to Understand Experiments Introduction In our experience, high school students often see graphs as the result of an assignment, a final product to be constructed correctly

More information