Harmonic Cues in Speech Segmentation: A cross-linguistic Corpus Study on Child-directed Speech

Similar documents
Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

LEXICAL CATEGORY ACQUISITION VIA NONADJACENT DEPENDENCIES IN CONTEXT: EVIDENCE OF DEVELOPMENTAL CHANGE AND INDIVIDUAL DIFFERENCES.

Learners Use Word-Level Statistics in Phonetic Category Acquisition

Mandarin Lexical Tone Recognition: The Gating Paradigm

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

Infants learn phonotactic regularities from brief auditory experience

Phonological and Phonetic Representations: The Case of Neutralization

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

Phonological encoding in speech production

A joint model of word segmentation and meaning acquisition through crosssituational

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

SENSITIVITY TO VISUAL PROSODIC CUES IN SIGNERS AND NONSIGNERS. Diane Brentari, Carolina González, Amanda Seidl, and Ronnie Wilbur

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

Abstract Rule Learning for Visual Sequences in 8- and 11-Month-Olds

Processing Lexically Embedded Spoken Words

Probabilistic principles in unsupervised learning of visual structure: human data and a model

Rhythm-typology revisited.

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

Eyebrows in French talk-in-interaction

A Bayesian Model of Stress Assignment in Reading

Linking Task: Identifying authors and book titles in verbose queries

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

A Statistical Model for Word Discovery in Transcribed Speech

Cross Language Information Retrieval

Constructing Parallel Corpus from Movie Subtitles

JSLHR. Research Article. Lexical Characteristics of Expressive Vocabulary in Toddlers With Autism Spectrum Disorder

Dyslexia/dyslexic, 3, 9, 24, 97, 187, 189, 206, 217, , , 367, , , 397,

Lexical Collocations (Verb + Noun) Across Written Academic Genres In English

LING 329 : MORPHOLOGY

Communicative signals promote abstract rule learning by 7-month-old infants

Proceedings of Meetings on Acoustics

Speech Recognition at ICSI: Broadcast News and beyond

Demonstration of problems of lexical stress on the pronunciation Turkish English teachers and teacher trainees by computer

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

A Stochastic Model for the Vocabulary Explosion

Cambridgeshire Community Services NHS Trust: delivering excellence in children and young people s health services

The Bruins I.C.E. School

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Modeling function word errors in DNN-HMM based LVCSR systems

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

The role of word-word co-occurrence in word learning

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

raıs Factors affecting word learning in adults: A comparison of L2 versus L1 acquisition /r/ /aı/ /s/ /r/ /aı/ /s/ = individual sound

Corpus Linguistics (L615)

Using computational modeling in language acquisition research

Degeneracy results in canalisation of language structure: A computational model of word learning

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

On the nature of voicing assimilation(s)

Possessive have and (have) got in New Zealand English Heidi Quinn, University of Canterbury, New Zealand

The Strong Minimalist Thesis and Bounded Optimality

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

Matching Similarity for Keyword-Based Clustering

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

NUMBERS AND OPERATIONS

Characterizing and Processing Robot-Directed Speech

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

Liaison acquisition, word segmentation and construction in French: A usage based account

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

Word Stress and Intonation: Introduction

English Language and Applied Linguistics. Module Descriptions 2017/18

2,1 .,,, , %, ,,,,,,. . %., Butterworth,)?.(1989; Levelt, 1989; Levelt et al., 1991; Levelt, Roelofs & Meyer, 1999

Lexical category induction using lexically-specific templates

Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty

Effects of Vocabulary and Phonotactic Probability on 2-Year-Olds Nonword Repetition

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

Fluency Disorders. Kenneth J. Logan, PhD, CCC-SLP

BSID-II-NL project. Heidelberg March Selma Ruiter, University of Groningen

Review in ICAME Journal, Volume 38, 2014, DOI: /icame

The Prosodic (Re)organization of Determiners

Figuration & Frequency: A Usage-Based Approach to Metaphor

Modeling function word errors in DNN-HMM based LVCSR systems

Perceived speech rate: the effects of. articulation rate and speaking style in spontaneous speech. Jacques Koreman. Saarland University

LEARNING A SEMANTIC PARSER FROM SPOKEN UTTERANCES. Judith Gaspers and Philipp Cimiano

Universal contrastive analysis as a learning principle in CAPT

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard

Bridging Lexical Gaps between Queries and Questions on Large Online Q&A Collections with Compact Translation Models

SYNTACTIC ADAPTATION 1. Rapid Expectation Adaptation During Syntactic Comprehension. Alex B. Fine * T. Florian Jaeger. Thomas A. Farmer.

Florida Reading Endorsement Alignment Matrix Competency 1

Testing claims of a usage-based phonology with Liverpool English t-to-r 1

Linguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis

Arabic Orthography vs. Arabic OCR

arxiv: v1 [cs.cl] 2 Apr 2017

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Journal of Phonetics

Formulaic Language and Fluency: ESL Teaching Applications

Stages of Literacy Ros Lugg

Sources of difficulties in cross-cultural communication and ELT: The case of the long-distance but in Chinese discourse

STUDIES WITH FABRICATED SWITCHBOARD DATA: EXPLORING SOURCES OF MODEL-DATA MISMATCH

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

A Bootstrapping Model of Frequency and Context Effects in Word Learning

To appear in The TESOL encyclopedia of ELT (Wiley-Blackwell) 1 RECASTING. Kazuya Saito. Birkbeck, University of London

Phonological Encoding in Sentence Production

Portuguese Vowel Harmony: A Comparative Analysis and the Superiority of Autosegmental Representations

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)

The influence of metrical constraints on direct imitation across French varieties

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading

Transcription:

Harmonic Cues in Speech Segmentation: A cross-linguistic Corpus Study on Child-directed Speech F. Nihan Ketrez İstanbul Bilgi University 1. Introduction Research on speech segmentation has shown that infants rely heavily on prosodic cues when they are segmenting words in a string of speech (Cutler & Norris 1988; Jusczyk, Houston & Newsome 1999; Morgan 1996; Jusczyk 1997, 1999; Curtin et al. 25, among others). Statistical probabilities and phonotactic constraints, i.e., constraints in the order of phones are also observed to be helpful in word segmentation (Brent & Cartwright 1996; Safran, Newport & Aslin 1996; Aslin, Saffran & Newport 1998, among others). It seems natural to assume that other phonological regularities, for instance, vowel harmony in harmonic languages, may contribute to word segmentation as well. Assuming that a majority of words are harmonic in a language (having only back or only front vowels within word boundaries), vowel shifts from [-back] to [+back] or vice versa may signal a word boundary. In (1) below, which is the first utterance of the Turkish corpus (Aksu, 1aa.cha) at CHILDES, shifts from [-back] to [+back] and from [+back] to [-back] overlap with word boundaries which are marked with #. 1. e rel cim bu ne [-back] [-back][-back] # [+back] # [-back] Indeed, studies conducted with artificial languages based on Finnish and Turkish vowel harmony rules showed that adult speakers could recognize words on the basis of such harmony cues (Suomi, McQueen & Cutler 1997; Vroomen, Tuomainen & Gelder 1998; Kabak, Maniwa &Kazanina 21). In these studies, artificial languages are created based on natural vowel harmony rules where word boundaries and harmony shifts perfectly overlap and adults were observed to be sensitive to such shifts. Results of headturn experiments with children further showed that seven-month-old infants were sensitive to vowel shifts in long strings of CV sequences and recognized harmonic sequences as words (Mintz &Walker 26). These findings suggest that when harmonic information is available, children can use such information to assign word boundaries. The question that remains to be raised is whether natural speech has such reliable cues. Harmonic languages are never perfectly harmonic, that is, there are always some exceptions on vowel harmony rules, or the lexicon of harmonic languages always have

non-harmonic words as well. Therefore it is possible that the harmonic languages are not as harmonic as it has been assumed in the literature. The goal of this study is then to figure out whether it is reasonable to assume that children acquiring harmonic languages receive reliable harmonic cues that could be used in speech segmentation. To this end, two harmonic languages are compared to each other, and contrasted with two non-harmonic languages. The study does not only provide information regarding the availability of harmonic cues, it also shows how generalizable the results are through a cross-linguistic comparison. 2. Procedure Parallel analyses were conducted on child-directed speech in two harmonic (Turkish, Hungarian) and two non-harmonic languages (Farsi, Polish) that are available at CHILDES. Similar amount of data (in terms of number of utterances) from each language were analyzed individually. Details of the data are reported in Table 1 below. Table 1: Corpus information for Turkish, Hungarian, Farsi and Polish. Language type Harmonic Language Turkish Hungarian Farsi Polish CHILDES Corpus Aksu MacWhinney Family (Leila) Weist Age range of CHI 2;-4;8 2;3-2;1 1;11-2;1 1;7-3;2 No. of utterances 1,232 11,478 13,325 13,258 No. of word tokens 34,391 41,514 4,472 13,778 At first, the most frequent 2 multisyllabic words were selected from each language and were coded as harmonic or non-harmonic. The reason for this selection was to exclude very low frequency words. Those words that have only back vowels or only front vowels were coded as harmonic. Those words that have a mixture of front or back vowels were categorized as non-harmonic. In the second analysis, which was run on the whole corpora, word boundaries were coded as harmonic or non-harmonic. word boundaries were those cases where a word that has a front vowel at the last syllable is followed by a word that has a back vowel in the initial syllable, or vice versa. An example of a non-harmonic word boundary is seen in (1) above. Harmonic word boundaries have the same type of vowel (either back or front) on both sides of the word boundary. For the last phase of the analysis, which looks at the same data from a different angle, all possible vowel pairs (/aa/, /ai/, /ao/ etc.) were coded as harmonic or nonharmonic and then the frequency of these sequences within words (VV) and across words (V#V) were compared.

3. Results The results of the first analysis, based on the most frequent 2 words in each language, suggest that a great majority of words are harmonic in harmonic languages. Although non-harmonic words are found in both Hungarian and Turkish, just as predicted, harmonic words are in majority. No such tendency is observed in Polish, where about half of the words were harmonic and the other half were not. An opposite pattern is observed in Farsi, where non-harmonic words were more frequent. Figure-1 below shows all four languages together for comparison. The scores displayed in the Figure are the percentages. Figure 1: Harmonic vs. non-harmonic words in harmonic and non-harmonic languages 1.53 89.47 22.5 77.4 67.6 32.4 45.4 54.6 Turkish Hungarian Farsi Polish Harmonic The second analysis that examines the word boundaries suggest that about % of word boundaries are harmonic in both harmonic and non-harmonic languages, as seen in Figure 2. These results, the latter one in particular, are problematic for a word segmentation mechanism based on vowel harmony cues. Before we give up on the idea of speech segmentation mechanism based on vowel harmony, we look at the same data from another perspective.

Figure 2: Harmonic vs. non-harmonic word boundaries in harmonic and non-harmonic languages 51 49 48 53 49 51 52 47 Turkish Hungarian Farsi Polish Harmonic The results suggest that harmonic sequences are more likely to appear within words (VV), and non-harmonic sequences are mostly found at boundaries (V#V) in harmonic languages. Figure 3 presents the results of the harmonic languages. The results in harmonic languages contrast with the results in non-harmonic languages. As clearly seen in Figure 4, there is no difference between within word (VV) and across word (V#V) occurrences in non-harmonic languages. Figure 3: Harmony within (VV) versus across (V#V) word boundary in harmonic languages. 23 42 73 71 77 57 27 28 Harmonic (Tur) (Tur) VV Harmonic (Hun) V#V (Hun)

Figure 4: Harmony within (VV) versus across (V#V) word boundary in non-harmonic languages 4 39 52 59 61 48 Harmonic (Far) (Far) Harmonic (Pol) (Pol) VV V#V Conclusion Natural harmonic languages, but not the non-harmonic ones, provide a learner with harmonic cues for word segmentation, although these cues are not as perfect as the ones created in the experiments. Therefore, harmonic cues, especially when they are used together with other cues, such as word stress, distributional properties of words, or morphemes, could potentially be useful in word segmentation. These results correctly predict that speakers of harmonic languages, but not the non-harmonic ones, rely on harmony cues in speech segmentation (Kabak, Maniwa & Kazanina 21). References Aslin, R.N., Saffran, J.R., & Newport, E.L. (1998). Computation of probability statistics by 8- month-old infants. Psychological Science, 9, 321-324. Brent, M.R., & Cartwright, T.A. (1996). Distributional regularity and phonotactic constraints are useful for segmentation. Cognition, 61, 93-125. Curtin, S., T. Mintz, & M. Christiansen. (25). Stress changes the representational landscape: Evidence from word segmentation. Cognition 97(3). 233-262. Cutler, A., & Norris, D.G.. (1988). The role of strong syllables in segmentation for lexical access. Journal of Experimental Psychology: Human Perception & Performance, 14, 113-121. Jusczyk, P. W., Houston, D. M., & Newsome, M. (1999). The beginnings of word segmentation in English-learning infants. Cognitive Psychology, 39, 159 27.

Jusczyk, P. W. (1997). The discovery of spoken language. Cambridge, MA: MIT Press. Jusczyk, P.W. (1999). How infants begin to extract words from fluent speech. Trends in Cognitive Science, 3, 323-328. Kabak, B., Maniwa, K. & Kazanina, N. (21) Listeners use vowel harmony and wordfinal stress to spot nonsense words: A study of Turkish and French. Journal of Laboratory Phonology, 1. 27-224. Morgan, J.L. (1996). A rhythmic bias in preverbal speech segmentation. Journal of Memory and Language, 35, 666-688. Mintz, T. and Walker, R. (26). Infant's sensitivity to vowel harmony and its role in word segmentation. Paper presented at the annual meeting of the LSA, Albuquerque, NM, January 7, 26. Saffran J, R., E. Newport & R. Aslin. (1996). Word segmentation: The role of distributional cues. Journal of Memory and Language 35. 66-621 Suomi, K. McQueen, J. M.& Cutler, A. (1997). Vowel Harmony and Speech Segmentation in Finnish. Journal of Memory and Language 36, 422-444. Vroomen, J., Tuomainen, J. & Gelder, B. (1998). The roles of word stress and vowel harmony in speech segmentation. Journal of Memory and Language 38. 133-149.