Monitoring Soft Palate Movements in Speech

Similar documents
Consonants: articulation and transcription

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Phonetics. The Sound of Language

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Phonological and Phonetic Representations: The Case of Neutralization

Consonant-Vowel Unity in Element Theory*

THE RECOGNITION OF SPEECH BY MACHINE

Prevalence of Oral Reading Problems in Thai Students with Cleft Palate, Grades 3-5

Title:A Flexible Simulation Platform to Quantify and Manage Emergency Department Crowding

Quarterly Progress and Status Report. Sound symbolism in deictic words

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Proceedings of Meetings on Acoustics

age, Speech and Hearii

Phonological Processing for Urdu Text to Speech System

Contrasting English Phonology and Nigerian English Phonology

How to Judge the Quality of an Objective Classroom Test

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Learning Methods in Multilingual Speech Recognition

Measurement. When Smaller Is Better. Activity:

Tools to SUPPORT IMPLEMENTATION OF a monitoring system for regularly scheduled series

Body-Conducted Speech Recognition and its Application to Speech Support System

Student Perceptions of Reflective Learning Activities

On the Formation of Phoneme Categories in DNN Acoustic Models

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

Evidence for Reliability, Validity and Learning Effectiveness

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Speech Emotion Recognition Using Support Vector Machine

STUDIES WITH FABRICATED SWITCHBOARD DATA: EXPLORING SOURCES OF MODEL-DATA MISMATCH

Mandarin Lexical Tone Recognition: The Gating Paradigm

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Readyman Activity Badge Outline -- Community Group

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

9 Sound recordings: acoustic and articulatory data

Speak with Confidence The Art of Developing Presentations & Impromptu Speaking

MADERA SCIENCE FAIR 2013 Grades 4 th 6 th Project due date: Tuesday, April 9, 8:15 am Parent Night: Tuesday, April 16, 6:00 8:00 pm

Course Law Enforcement II. Unit I Careers in Law Enforcement

Livermore Valley Joint Unified School District. B or better in Algebra I, or consent of instructor

Introduction to Yearbook / Newspaper Course Syllabus

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

Running head: DELAY AND PROSPECTIVE MEMORY 1

Preprint.

Parent Information Welcome to the San Diego State University Community Reading Clinic

Rottenberg, Annette. Elements of Argument: A Text and Reader, 7 th edition Boston: Bedford/St. Martin s, pages.

Universal contrastive analysis as a learning principle in CAPT

TU-E2090 Research Assignment in Operations Management and Services

Major Milestones, Team Activities, and Individual Deliverables

GCSE English Language 2012 An investigation into the outcomes for candidates in Wales

The Ti-Mandi window: a time-management tool for managers

Phys4051: Methods of Experimental Physics I

English Language Arts Summative Assessment

How to make successful presentations in English Part 2

Phonology Revisited: Sor3ng Out the PH Factors in Reading and Spelling Development. Indiana, November, 2015

Five Challenges for the Collaborative Classroom and How to Solve Them

Alberta Police Cognitive Ability Test (APCAT) General Information

Syllabus Foundations of Finance Summer 2014 FINC-UB

Creating Travel Advice

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Application of Virtual Instruments (VIs) for an enhanced learning environment

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

Leader s Guide: Dream Big and Plan for Success

source or where they are needed to distinguish two forms of a language. 4. Geographical Location. I have attempted to provide a geographical

How to Read the Next Generation Science Standards (NGSS)

PhD project description. <Working title of the dissertation>

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -

Speech Recognition at ICSI: Broadcast News and beyond

Guidelines for the Master s Thesis Project in Biomedicine BIMM60 (30 hp): planning, writing and presentation.

Measuring physical factors in the environment

A Note on Structuring Employability Skills for Accounting Students

Speaker Recognition. Speaker Diarization and Identification

The Good Judgment Project: A large scale test of different methods of combining expert predictions

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

First Grade Curriculum Highlights: In alignment with the Common Core Standards

Measurement and statistical modeling of the urban heat island of the city of Utrecht (the Netherlands)

Audible and visible speech

Radical CV Phonology: the locational gesture *

INSTRUCTIONAL TECHNIQUES. Teaching by Lecture

GEMINATION STRATEGIES IN L1 AND ENGLISH PRONUNCIATION OF POLISH LEARNERS

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

Word Stress and Intonation: Introduction

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Educational Attainment

Contrastiveness and diachronic variation in Chinese nasal codas. Tsz-Him Tsui The Ohio State University

SARDNET: A Self-Organizing Feature Map for Sequences

All Systems Go! Using a Systems Approach in Elementary Science

JiED EARLY ACCESS: Under final review by author(s). NOTE: PAGE NUMBERS AND MEDIA PLACEMENT ARE NOT FINAL

This curriculum is brought to you by the National Officer Team.

5 th Grade Language Arts Curriculum Map

WiggleWorks Software Manual PDF0049 (PDF) Houghton Mifflin Harcourt Publishing Company

Transcription:

Paper Delivered at the 8lst Meeting of The Acoustical Society of America Washington, D.C. April 23, 1971 Monitoring Soft Palate Movements in Speech JOHN J. OHALA, Department of Linguistics, University of California, Berkeley. Abstract A simple photoelectric device has been constructed which permits easy monitoring of soft palate movements in speech. Since its output signal cannot easily be calibrated it is most useful for detecting the timing of such movements and the relative rather than the absolute amount of velopharyngeal opening. Using it, it has been possible to verify others' findings that: (l) The soft palate is lowered more during vowels preceding a nasal consonant than it is during vowels following a nasal consonant. (2) The soft palate begins to lower for an upcoming nasal consonant as soon as it can, i.e., as soon as it is no longer required to be closed for an obstruent. (3) Other things being equal, the soft palate is typically lowered during the production of socalled low "oral" vowels such as [ ]. (Supported by the National Science Foundation.) 444h333 Most of the popular methods used by speech researchers to monitor the movements of the soft palate in speech, for example cinefluorography, although quite exact, necessitate a huge amount of labor (and a certain amount of danger) for the analysis of relatively short bits of speech. And there is no way of knowing in advance, before laboring over the data, whether or not the sample will prove interesting or not. There is a need, then, for a simple technique for measuring palatal movements a technique whereby rough-cut data may be gathered quickly in great volume so that a preliminary analysis may indicate which speech gestures may be more profitably studied in more detail using, perhaps, cinefluorography. The technique described in this paper meets this need to a certain extent. It monitors movements of the soft palate in the same way as a glottograph monitors variations in glottal area: a DC light is placed on one side of the opening and a light sensor on the other side (Sonesson 1959, Ohala 1966). The amount of light impinging on the light sensor and thus the voltage developed by it are taken as an indication of the size of the opening. 1

(FIRST SLIDE) In this case I inserted both light and light sensor in the same flexible transparent plastic catheter which is sealed at one end and has the leads of the sensor and the light emerging at the other end and connecting to amplifier and dry-cell, respectively. The sealed end is swallowed into the esophagus for increased stabilization and for the subject's comfort. Light and light sensor are separated by about 9 cm. or enough so that the light sensor rests approximately over the junction between the hard and soft palate and the light is well down into the pharynx, approximately at the level of the epiglottis or slightly above. The optimum placement is determined by trial and error. The outer diameter of the tube is about 4 mm; it should eventually be possible to find an even smaller tube. Dedicated subjects report no great discomfort or disruption of natural speech articulation. Due to the similarity of this device to the glottograph, I refer to it as the nasograph". 2

The NEXT SLIDE (#2) shows the nasograph in use in one of the two subjects who have used it so far. It has been observed that when in use it makes the subject look like a "jack-o-lantern." Calibration is a problem. I know of no simple way to calibrate it except to compare its output to that obtained by some other technique: cinefluorography, recording nasal air flow, etc. I have not done this. No doubt it is non-linear. The variability of the light impinging directly or by reflection on the light sensor, the possible movement of the plastic catheter itself during movements of the soft palate, etc., probably introduce various errors into the signal. In spite of all these problems it is surprising how well it does work. At any rate it ought to give a fairly reliable indication of relative palatal opening and of the timing of palatal movements. The obvious way to test it is to produce speech gestures known to require a raised velum or a 3

lowered velum and to see that it gives the expected signal. Obstruents (stops, fricatives, and affricates) require an air-tight oral cavity and thus must have a raised velum. Nasal consonants and nasalized vowels require a lowered velum. NEXT SLIDE (#3). This slide shows at top the output of the nasograph and the microphone signal recorded simultaneously during the production of the utterance "Say bo twice." The bottom of the slide shows the same parameters recorded during the phrase "Say bõ twice." The nasograph signal indicates raised velum by a line near the top, lowered velum by a line near the bottom of its range. The first utterance is devoid of nasals and accordingly the nasograph correctly indicates the velum is raised throughout the utterance. Only before and after the utterance is the velum open while the subject breathes. The second utterance is identical to the first except for the nasalized vowel in the second word thus the nasograph indicates the velum is lowered on that vowel and is raised again for the following stop /t/. A slight deflection in the signal in the middle of the word 'twice' is apparently not an artifact and indicates that the palate may be lowered slightly for low vowels. 4

NEXT SLIDE(#4). This slide shows the same two parameters for the utterances "Say mitt twice" and "Say Tim twice." As expected the velum is lowered during the nasal consonant /m/ in both cases. In 'mitt' the velum is raised during the vowel following the /m/. Thus this vowel is nasalized. In the word 'Tim' the velum lowers immediately after the /t/ during the vowel preceding the /m/ so this vowel is nasalized, too. This phenomenon, of course, is well known among phoneticians. It also appears that on the average the palate lowers more for word-final nasal consonants than for word-initial nasals. Thus one would expect vowels preceding nasals to be slightly more nasalized than vowels following nasals. This tendency was noted previously by the Indian linguist, A. Kelkar (1968). 5

NEXT SLIDE (#5). This slide shows the two utterances "Say mim twice" and "Say nun twice." (At Berkeley the word 'mim' is not a nonsense word; it is the pronounceable acronym for our lab's "Monthly Internal Memorandum.") Again, the general tendency for the velum to lower more for final nasals than for initial nasals can be seen. Curiously, after opening for the first nasal, there is a slight closing movement. This merits further study. 6

NEXT SLIDE (#6). This shows the two utterances: "See Sue" on top and "See Pa" on the bottom. The velum is raised throughout the first utterance. During the last vowel in the second utterance, however, it is apparent that the velum lowers a bit. A lowering of the palate for low vowels, that is, vowels with a high first formant, as in this case, was quite common. This is, again, a well-known phenomenon (Moll 1962). 7

NEXT SLIDE (#7). In this slide the two utterances "Pam Pan is here" and "Pam Ban is here" are shown. As expected the velum is raised during the four bilabial stops but opens for the up-coming nasals as soon as these stops are released. This pattern was observed in the previous slides, too, with an exception to be noted. 8

NEXT SLIDE (#8). Here the consonants /n/ and /h/ were inserted into the same frame as that used in the previous slide. The velum is, of course, open during the /n/. It also remains open during the glottal fricative /h/. This pattern was also observed during the glottal stop (that is, in a nasalized environment). Evidently a lowered velum in no way interferes with the correct production of these glottal consonants. /l/ and /r/ likewise could tolerate a somewhat lowered palate. The combination /h/ plus low vowel has been observed to give rise to nasalization in many languages, including some Semitic languages in Ethiopia (Hetzron 1969), many in the Lolo-Burmese and Sino-Tibetan families (James Matisoff, personal communication; Matthew Chen, personal communication), not to mention affected British R. P. in which 'half is sometimes rendered with a nasalized vowel, as in "Meet me in [h f] 'n 'our" (pointed out to me by James Matisoff). 9

NEXT SLIDE (#9). This shows the two utterances "Sam made that" and "Pa made that." I suggested previously that the palate begins to lower for an upcoming nasal consonant as soon as it can. Thus the earliest it can lower for the first nasal /m/ in the first utterance is immediately after the obstruent /s/ and this is what it does. Thus it lowers some 200 msec before the nasal proper. However in the second utterance the first nasal is in the second word and although the velum lowers after the /p/ it seems to lower no more than it ordinarily would for a low vowel. The lowering for the nasal consonant proper seems to start only about 50-60 msec before the nasal. Thus, how soon the anticipatory lowering of the velum begins before a nasal consonant seems to depend on whether a word boundary precedes the nasal. If this observation is correct it contradicts some previous findings of Moll and Daniloff (1968), who claimed the presence of word boundaries makes no difference for this kind of anticipatory lowering of the velum. LIGHTS. These findings, which were identical for both of the subjects studied, are, of course, highly tentative, considering the primitive state of development of the nasograph. However, the data do point out areas for further, hopefully fruitful study and this was all that I hoped to accomplish with the device. Thank you. 10

References Hetzron, R. (1969) Two notes on Semitic laryngeals in East Gurage. Phonetica 19:69-81. Kelkar, A.R. (1968) Studies in Hindi and Urdu, I. Poona. Moll, K. L. (1962) Velopharyngeal closure on vowels. Journal of Speech and Hearing Research 5:30-37. Moll, K.L. and R.G. Daniloff. (1968) An investigation of the timing of velar movements during speech. Paper presented at the 44th Annual Convention of the American Speech and Hearing Association. (Forthcoming in Journal of the Acoustical Society of America). Ohala, J. (1966) A new photo-electric glottograph. Working Papers in Phonetics 4:40-52, UCLA. Sonesson, B. (1959) A method for studying the vibratory movements of the vocal cords. A preliminary report. Journal of Laryngology 73:732-737. 11