Theories of Speech Perception


Motor Theory (Liberman)
Close link between the perception and production of speech.
Listeners use motor information to compensate for the lack of invariants in the speech signal.
They determine which articulatory gesture was made, then infer the phoneme.
Human speech perception is an innate, species-specific skill.
Because only humans can produce speech, only humans can perceive it as a sequence of phonemes.
Speech is special.

Auditory Theory
Speech perception derives from general properties of the auditory system.
Speech perception is not species-specific.

Wilson et al., 2004
Perception: /pa/, /gi/, bell, burst of white noise.
Production: /pa/, /gi/, tap alternate thumbs.

Wilson et al., 2004
Black areas are premotor and primary motor cortex, activated when subjects produced the syllables.
White arrows indicate the central sulcus.
Orange represents areas activated by listening to speech.
Extensive activation in the superior temporal gyrus.
Activation in motor areas involved in speech production (!)

Wilson and colleagues, 2004

Is categorical perception innate?

Manipulate VOT, Monitor Sucking

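4-month-old infants: Eimas et al. (1971)
20 ms VOT difference, stimuli on different sides of the phoneme boundary.
20 ms VOT difference, stimuli on the same side of the boundary.
0 ms difference (control).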
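The logic of the three conditions can be sketched in a few lines. This is only an illustration of what strictly categorical perception would predict, not the analysis used in the study: the 25 ms boundary, the VOT values in each pair, and the function names are all assumptions chosen for the example.

```python
# Illustrative sketch: what strictly categorical perception predicts
# for the three conditions. All numbers below are assumed, not taken
# from the slides.

BOUNDARY_MS = 25.0  # assumed ba/pa boundary on the VOT continuum


def label(vot_ms, boundary=BOUNDARY_MS):
    """Assign a stimulus to a phoneme category by its VOT."""
    return "ba" if vot_ms < boundary else "pa"


def predicted_discriminable(vot_a, vot_b, boundary=BOUNDARY_MS):
    """Two stimuli are predicted to be told apart only if their labels differ."""
    return label(vot_a, boundary) != label(vot_b, boundary)


# Hypothetical stimulus pairs for each condition (VOT in ms).
CONDITIONS = {
    "20 ms, different sides": (20, 40),
    "20 ms, same side": (60, 80),
    "0 ms, control": (20, 20),
}

if __name__ == "__main__":
    for name, (a, b) in CONDITIONS.items():
        print(name, "->", predicted_discriminable(a, b))
```

Under these assumptions only the pair that straddles the boundary is predicted to be discriminated, even though the same-side pair differs by the same 20 ms.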

Is categorical perception species-specific?
Chinchillas exhibit categorical perception as well.

Chinchilla experiment (Kuhl & Miller)
Stimuli: ba ba ba ba pa pa pa pa

Train on end-point stimuli: ba (good), pa (bad).
Test on intermediate stimuli.
Results: chinchillas switched from staying to running at about the same location as the English b/p phoneme boundary.

VOT identification by chinchillas (Kuhl & Miller, 1981)

Categorical perception, Take 2
Natural discontinuities exist in many sensory systems; many of these are common across mammalian species.
Some stimulus differences are hard to detect; others are easy.
Language takes advantage of natural boundaries.

Categorical Perception & Auditory Theory
Categorical perception may arise from rapid decay of auditory memory, which is not unique to speech.
People have some ability to discriminate sounds within a phoneme category.
Judgments may reflect a decision process rather than perception.

Motor Theory versus Auditory Theory
Close link between speech perception and speech production systems: Motor Theory is right.
Some properties of speech perception (e.g. categorical perception) reflect general auditory properties: Auditory Theory is right.
Speech perception is probably not innate and species-specific: Motor Theory is wrong.

Comprehension
Recognize word: phonological info, visual info.
Retrieve information: syntactic info, semantic/pragmatic info.
Integrate syntactic and semantic/pragmatic info.
Store gist representation.

Word Recognition
Serial: comprehension involves analysis at several different levels in turn.
Interactive: various sources of information interact and combine to produce efficient analysis.

Bottom-up Processes
Acoustic info -> phonetic info -> phonemic info -> words & sentences

Top-Down Processes
To what extent does knowledge of what the speaker is saying affect the processes necessary for understanding speech?

Phonemic Restoration Effect Legislature Sentences

McGurk Effect

McGurk Effect
Lips say /ga/ (velar).
Sound signal is /ba/ (bilabial).
Subjects hear /da/ (alveolar), a place of articulation between the two.

What's the relevance?
What does this have to do with interactive vs. serial models?
Context effects.
Interactive models use all sources of information for rapid word identification.
Serial models are inefficient and slow.

Marslen-Wilson's Cohort Model
Mental representations of words are activated (in parallel) on the basis of bottom-up input (sounds).
They can be de-activated by subsequent input, both bottom-up (phonological) and top-down (contextual).

Uniqueness and Recognition
When we hear the beginning of a word, this activates ALL words beginning with the same sound: the word-initial cohort.
Subsequent sounds eliminate candidates from the cohort until only one remains (failure to fit with context can also eliminate candidates).
t - tea, tree, trick, tread, trestle, trespass, top, tick, etc.
tr - tree, trick, tread, trestle, trespass, etc.
tre - tread, trestle, trespass, etc.
tres - trestle, trespass, etc.
tresp - trespass.
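The narrowing of the cohort can be sketched as repeated prefix filtering over a lexicon. This is a minimal illustration, assuming a toy lexicon built from the example words; the rough phoneme codes (such as "i:" and "I") and the function name `cohort_trace` are made up for the example and are not part of the model's specification.

```python
# Toy lexicon: each word maps to a rough, hypothetical phoneme sequence.
LEXICON = {
    "tea": ("t", "i:"),
    "tree": ("t", "r", "i:"),
    "trick": ("t", "r", "I", "k"),
    "tread": ("t", "r", "e", "d"),
    "trestle": ("t", "r", "e", "s", "l"),
    "trespass": ("t", "r", "e", "s", "p", "a", "s"),
    "top": ("t", "o", "p"),
    "tick": ("t", "I", "k"),
}


def cohort_trace(heard, lexicon=LEXICON):
    """Return (input so far, surviving cohort) after each incoming segment."""
    steps = []
    for i in range(1, len(heard) + 1):
        prefix = tuple(heard[:i])
        # A word survives only while its first i segments match the input.
        cohort = [w for w, phones in lexicon.items() if phones[:i] == prefix]
        steps.append(("".join(prefix), cohort))
    return steps


if __name__ == "__main__":
    for prefix, cohort in cohort_trace(("t", "r", "e", "s", "p")):
        print(prefix, "-", ", ".join(cohort))
```

Only bottom-up elimination is modelled here; elimination by context would need an extra filter on top of the prefix match.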

Uniqueness and Recognition
The uniqueness point (UP) is the point at which a word becomes uniquely identifiable from its initial sound sequence.
E.g. dial /dayl/; crocodile /krokod|ayl/ (UP at |).
For non-words there is a deviation point (DP): the point at which the cohort is reduced to zero.
E.g. zn|owble would be rejected with a faster RT than thousaj|ining (DP at |).
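Both points can be computed from a lexicon by counting how many entries still match the input. The sketch below assumes a small toy lexicon with hypothetical phoneme codes; the function names are illustrative, and the resulting positions depend entirely on which words the lexicon contains.

```python
# Toy lexicon with rough, hypothetical phoneme sequences.
LEXICON = {
    "tea": ("t", "i:"),
    "tree": ("t", "r", "i:"),
    "tread": ("t", "r", "e", "d"),
    "trestle": ("t", "r", "e", "s", "l"),
    "trespass": ("t", "r", "e", "s", "p", "a", "s"),
}


def uniqueness_point(word, lexicon=LEXICON):
    """1-based segment position at which `word` is the only candidate left.

    Returns None if the word never becomes unique (for example, when it
    is a prefix of a longer word in the lexicon).
    """
    target = lexicon[word]
    for i in range(1, len(target) + 1):
        cohort = [w for w, phones in lexicon.items() if phones[:i] == target[:i]]
        if cohort == [word]:
            return i
    return None


def deviation_point(segments, lexicon=LEXICON):
    """1-based segment position at which the cohort becomes empty.

    Returns None if the input stays consistent with at least one word.
    """
    for i in range(1, len(segments) + 1):
        prefix = tuple(segments[:i])
        if not any(phones[:i] == prefix for phones in lexicon.values()):
            return i
    return None


if __name__ == "__main__":
    print(uniqueness_point("trespass"))         # unique once "p" arrives
    print(deviation_point(("z", "n", "o")))     # no word starts with "z"
    print(deviation_point(("t", "r", "u")))     # cohort empties at "u"
```

An early deviation point corresponds to the fast rejection described for the first non-word example, while a late one corresponds to the slower rejection of the second.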

Uniqueness and Recognition
The recognition point is the point at which, empirically, a word is actually identified.
Empirical studies show that the recognition point correlates closely with the uniqueness point.
Phoneme monitoring latencies correlate with a priori cohort analysis (one way to recognise a word-initial phoneme is to recognise the word and to know that it begins with, e.g., /p/).

Cohort Model (Marslen-Wilson & Tyler)
Words consistent with the input become active.
Cohort: the set of words consistent with the first syllable.
Words in the cohort are eliminated when they become inconsistent with the input.
Words are also eliminated due to contextual incongruity.
Processing ends when there is one word left in the cohort.
/ka/ - cat, captain, catch, capitalism
/kap/ - captain, capitalism
"Communism is slightly different from /kap/..." - capitalism

Marslen-Wilson & Tyler
Normal: The church was broken into last night. Some thieves stole most of the lead off the roof.
Syntactic: The power was located in green water. No buns puzzle some in the lead off the text.
Random: In was great power water the located. Some the no puzzle buns in lead text the off.

Marslen-Wilson & Tyler
[Bar chart: monitoring time by condition (axis 0 to 300). Normal: 200; Syntactic: 260; Random: 285.]

Activation in the Revised Cohort Model
[Figure: activation plotted against time as the input "c a p t i n" unfolds. Words shown: dog, energise, elephant, wombat, elegant, captain, captive.]

TRACE
Like the interactive-activation model of printed word recognition, TRACE has three sets of interconnected detectors:
Feature detectors
Phoneme detectors
Word detectors
These detectors span different stretches of the input (feature detectors span small parts, word detectors span larger parts).
The input is divided into time slices, which are processed sequentially.

Phoneme boundary
Stimulus     1 2 3 4 5 6 7 8 9
P detector   P P P P P P P P .
B detector   . . . . . . B B B

If there are feature detectors, can we tire one of them out?

Selective adaptation
1. Do a phoneme identification test (e.g., ba-pa continuum).
2. Play a stimulus from one of the endpoints many times (e.g., 100 times).
3. Repeat the phoneme identification test.
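The three steps can be mimicked with a toy two-detector model in which repeated exposure to ba lowers the gain of the ba detector. This is a sketch under assumptions, not a model from the lecture: the logistic detector shape, the 25 ms centre, the 10 ms slope, and the halved gain after adaptation are all invented for illustration.

```python
import math


def p_ba(vot_ms, gain_ba=1.0, gain_pa=1.0, centre=25.0, slope=10.0):
    """Proportion of "ba" responses for a stimulus with the given VOT.

    The ba detector responds strongly at short VOTs, the pa detector at
    long VOTs; the response is each detector's gain-weighted share.
    """
    ba = 1.0 / (1.0 + math.exp((vot_ms - centre) / slope))
    pa = 1.0 - ba
    return gain_ba * ba / (gain_ba * ba + gain_pa * pa)


def boundary(gain_ba=1.0, gain_pa=1.0, centre=25.0, slope=10.0):
    """VOT at which "ba" and "pa" responses are equally likely (p_ba == 0.5)."""
    return centre - slope * math.log(gain_pa / gain_ba)


if __name__ == "__main__":
    # Step 1: identification test before adaptation.
    pre = boundary()
    # Step 2: repeated ba exposure, modelled as a fatigued ba detector.
    fatigued_gain = 0.5
    # Step 3: identification test after adaptation.
    post = boundary(gain_ba=fatigued_gain)
    print("pre-adaptation boundary (ms):", round(pre, 1))
    print("post-adaptation boundary (ms):", round(post, 1))
    for vot in range(-100, 61, 10):
        print(vot, round(100 * p_ba(vot)), round(100 * p_ba(vot, gain_ba=fatigued_gain)))
```

With these settings the boundary moves toward the ba end of the continuum, so fewer stimuli are labelled ba after adaptation.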

Selective adaptation
[Figure: % ba identification (0 to 100) along a Voice Onset Time continuum from -100 to +60, running from ba to pa, with the pre-adaptation and post-adaptation phoneme boundaries marked. Adaptor: ba at -100, repeated 100 times for one minute.]