The production and perception of word-level prosody in Korean

Similar documents
Mandarin Lexical Tone Recognition: The Gating Paradigm

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

Rhythm-typology revisited.

Word Stress and Intonation: Introduction

The Acquisition of English Intonation by Native Greek Speakers

Phonological Processing for Urdu Text to Speech System

L1 Influence on L2 Intonation in Russian Speakers of English

Phonological and Phonetic Representations: The Case of Neutralization

Copyright by Niamh Eileen Kelly 2015

Journal of Phonetics

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

Proceedings of Meetings on Acoustics

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

THE PERCEPTION AND PRODUCTION OF STRESS AND INTONATION BY CHILDREN WITH COCHLEAR IMPLANTS

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

A survey of intonation systems

Demonstration of problems of lexical stress on the pronunciation Turkish English teachers and teacher trainees by computer

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Phonological encoding in speech production

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

The influence of metrical constraints on direct imitation across French varieties

5. Margi (Chadic, Nigeria): H, L, R (Williams 1973, Hoffmann 1963)

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

Infants Perception of Intonation: Is It a Statement or a Question?

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Perceived speech rate: the effects of. articulation rate and speaking style in spontaneous speech. Jacques Koreman. Saarland University

18 The syntax phonology interface

Discourse Structure in Spoken Language: Studies on Speech Corpora

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

DEVELOPMENT OF LINGUAL MOTOR CONTROL IN CHILDREN AND ADOLESCENTS

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

Sounds of Infant-Directed Vocabulary: Learned from Infants Speech or Part of Linguistic Knowledge?

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Modern TTS systems. CS 294-5: Statistical Natural Language Processing. Types of Modern Synthesis. TTS Architecture. Text Normalization

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Bitonal lexical pitch accents in the Limburgian dialect of Borgloon

On the nature of voicing assimilation(s)

Unit Selection Synthesis Using Long Non-Uniform Units and Phonemic Identity Matching

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES MODELING IMPROVED AMHARIC SYLLBIFICATION ALGORITHM

The Odd-Parity Parsing Problem 1 Brett Hyde Washington University May 2008

Automatic intonation assessment for computer aided language learning

One major theoretical issue of interest in both developing and

Voice conversion through vector quantization

Speech Recognition at ICSI: Broadcast News and beyond

Eyebrows in French talk-in-interaction

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

Running head: DELAY AND PROSPECTIVE MEMORY 1

Universal contrastive analysis as a learning principle in CAPT

2,1 .,,, , %, ,,,,,,. . %., Butterworth,)?.(1989; Levelt, 1989; Levelt et al., 1991; Levelt, Roelofs & Meyer, 1999

Learners Use Word-Level Statistics in Phonetic Category Acquisition

GOLD Objectives for Development & Learning: Birth Through Third Grade

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

LING 329 : MORPHOLOGY

TEKS Comments Louisiana GLE

REVIEW OF CONNECTED SPEECH

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Local and Global Acoustic Correlates of Information Structure in Bulgarian

Large Kindergarten Centers Icons

Concept Acquisition Without Representation William Dylan Sabo

Linking object names and object categories: Words (but not tones) facilitate object categorization in 6- and 12-month-olds

Degeneracy results in canalisation of language structure: A computational model of word learning

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

SOUND STRUCTURE REPRESENTATION, REPAIR AND WELL-FORMEDNESS: GRAMMAR IN SPOKEN LANGUAGE PRODUCTION. Adam B. Buchwald

(De-)Accentuation and the Processing of Information Status: Evidence from Event- Related Brain Potentials

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

Consonant-Vowel Unity in Element Theory*

DIBELS Next BENCHMARK ASSESSMENTS

Does the Difficulty of an Interruption Affect our Ability to Resume?

An argument from speech pathology

The Journey to Vowelerria VOWEL ERRORS: THE LOST WORLD OF SPEECH INTERVENTION. Preparation: Education. Preparation: Education. Preparation: Education

Guidelines for blind and partially sighted candidates

The phonological grammar is probabilistic: New evidence pitting abstract representation against analogy

AN EXPERIMENTAL APPROACH TO NEW AND OLD INFORMATION IN TURKISH LOCATIVES AND EXISTENTIALS

THE SURFACE-COMPOSITIONAL SEMANTICS OF ENGLISH INTONATION MARK STEEDMAN. University of Edinburgh

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Rote rehearsal and spacing effects in the free recall of pure and mixed lists. By: Peter P.J.L. Verkoeijen and Peter F. Delaney

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

High Tone in Moro: Effects of Prosodic Categories and Morphological Domains * Peter Jenks (Harvard University) and Sharon Rose (UC San Diego)

Prosody in Speech Interaction Expression of the Speaker and Appeal to the Listener

Segregation of Unvoiced Speech from Nonspeech Interference

Considerations for Aligning Early Grades Curriculum with the Common Core

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

Copyright and moral rights for this thesis are retained by the author

Phonological Encoding in Sentence Production

Sample Goals and Benchmarks

Speech Emotion Recognition Using Support Vector Machine

SARDNET: A Self-Organizing Feature Map for Sequences

PERSONAL STATEMENTS and STATEMENTS OF PURPOSE

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

Regional variation in the realization of intonation contours in the Netherlands

Manner assimilation in Uyghur

IEEE Proof Print Version

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Transcription:

The production and perception of word-level prosody in Korean Byung-jin Lim Indiana University 1. INTRODUCTION This paper reports the results of an investigation into the production and perception of Korean word-level prosody. The production study examines the effects of syllable weight and syllable position on syllable duration, vowel duration, onset duration, coda duration, and fundamental frequency of Seoul Korean. We also examine what attributes of a syllable cause it to be perceived as prominent. The perception study examines how Korean word prominence is perceived by native speakers of English, Japanese, and Korean. 1.1. Stress as prominence in English In English, stress is defined as "prominence" for relative distinctiveness of a linguistic unit (Jones, 196) or is termed as "accent" for achieving linguistic prominence (Lehiste, 197). According to Couper-Kuehlen (1986), stress denotes the perception of one linguistic unit which is somehow more prominent others. Incorporating previous definitions of stress, Beckman and colleagues (Beckman, 1986; Beckman, Edwards and Fletcher, 1992; and Beckman and Edwards, 1991) have taken the notion stress as prominence by describing stress and accent using one unified representation with four prominence levels (recited from de Jong, 1991: 6). As is well documented in previous studies, prominence is highly correlated with suprasegmental characteristics of speech sounds: prominent syllables or words have a conspicuous fundamental frequency change, longer duration and higher amplitude (Fry, 1955, 1958; Lieberman, 196; Lehiste, 197). Especially, a change in fundamental frequency is reported as the most reliable cue for prosodic prominence (Bolinger, 1958 for English; van Katwijk, 1974 for Dutch). 1.2. Korean word prominence For Korean prosodic prominence, de Jong (1994) suggests that the so-called stress or word-level prominence in Korean can be an interpretation of an edge tone. Along the same line, Jun (199, 1993, 1998) has developed a Korean Prosodic Model based on pitch contours following the Japanese prosodic models by Beckman and Pierrehumbert (1986) and Pierrehumbert and Beckman (1988). According to her model (1993, 1998), the delimitation of prosodic units is primarily determined by pitch contours or tonal patterns. She describes the tonal patterns of two important prosodic units in Seoul Korean, 1) Accentual Phrase (AP) and 2) Intonatational Phrase (IP): the AP has a phrase final rising pattern, a LH as in Figure (1b), and the IP may have one of several boundary For the purpose of this paper, we assume that CVC syllables are heavy syllables and CV syllables are light without any further explanation, since this is not the topic of the current paper: For the phonological status of coda consonant, J. H. Jun (1994) proposes that the Korean coda consonant is moraic; thus CVV and CVC syllables are heavy syllables. On the other hand, Tak (1997) analyzes that CV and CVC syllables are light in Korean. In addition, since the vowel length distinction see to be lost in Seoul Korean, the syllable structure with a long vowel, or CVV syllable structure, is not considered in this study, either. 1

tones such as L%, H%, LH%, HL%, LHL%, and HLH%. However, when the final syllable of the AP is the IP final, the final rise of the AP is replaced by the IP boundary tones as in (1a) 1. (1) Fundamental frequency contours depending on prosodic units a. b. [(L H L%) AP ] IP [(L H) AP (L H L%) AP ] IP ma.mu.ri ma.mu.ri co.a.yo In a phonetic study of Korean stress, Jun (1995a) reports that the second syllable is prosodically stronger, which could be interpreted as stressed. She also mentions that stress exists in the language since a certain syllable within a prosodic unit the AP shows higher fundamental frequency and greater amplitude. Since the prosodically strongest syllable of a word changes depending on its position within the AP, stress is a property of the AP. Korean is quite unique in that clai about stress (or prominence) placement vary considerably from study to study. H. B. Lee (1973, 1985) and H. Y. Lee (199) suggest that Korean is a stress language. They propose that Korean has a duration-based stress system where syllable duration plays a key role in locating stress, while the role of pitch pattern is only marginal in the decision of stress position. Along this line, Kim (1988) clai that Korean has a rightward right-headed iambic foot structure and it has moraic coda consonants. Thus, stress falls on the first syllable, if it is heavy, or it falls on the second syllable if the first syllable is light. Some examples are given in (2). (2) kyo:.do.so prison si:.cang market cam.ca.ri dragonfly H L L H H H L L nak.son rejection i.ya.ki story ke.ul mirror H H L L L L H (H: heavy syllable; L: light syllables; underlined are stressed syllables; data from Kim, 1998) On the other hand, Yu (1989) proposes Accent Placement Rules in Modern Korean. Accent falls on the leftmost heavy syllable, otherwise the rightmost light syllable. Examples are shown in (3). 1 Please see Jun (1993, 1995a, 1995b, and 1998) for the detailed characteristics of the AP and the IP. 2

(3) pa.ram wind sa:.ram people nun.bo.ra snow storm L H H H H L L a.u younger sibling i.ma forehead pi.hang.ki airplane L L L L L H L Previous studies claim that Korean prominence is weight-sensitive and duration based. In addition, syllable position see to play a role in determining prominence of the words. In this study, a production experiment is carried out to fulfill two functions. 1) It examines syllable position and weight effects on duration and fundamental frequency which might indicate the presence of stress and, 2) it determines acoustic correlates to be used in a following perception test. 2. EXPERIMENT I: PRODUCTION The goal of the first experiment was to examine the effects of syllable position and syllable weight on syllable duration, vowel duration, onset duration, coda duration and fundamental frequency of Seoul Korean. 2.1 Method 2.1.1. Subjects Two male native speakers of Seoul Korean participated in the experiment. They were both graduate students at Indiana University. They were both in their early thirties and neither had any known speech or hearing disorders. 2.1.2. Stimuli In a production experiment, stimuli included three-syllable words composed of combinations of heavy and light syllables. Given three-syllable words, it is logically possible to have eight different combinations in ter of syllable weight as in Table 1. All test words had the vowel [a] to minimize intrinsic durational differences. (4) [Table 1] Stimulus material No Syllable weight Word Gloss 1 LLL Ma.na.gwa The capital of Nicaragua 2 LHL Ja.jang.ga Lullaby 3 LHH Ma.gam.nal Deadline 4 LLH Na.ma.dam Ms. Nah 5 HHH Mal.jang.nan Joke 6 HLL Man.da.ra The title of a movie 7 HLH Kang.ma.dam Ms. Kang 8 HHL Man.dam.ga Comedian (H: heavy syllable; L: light syllable,. : syllable boundaries) 2.1.3. Procedure Subjects were asked to read the test words five times at a comfortable speech rate. Test words were placed in two prosodic conditions, in isolation as a single IP, and in a sentential context as the first AP in a larger IP as shown in (5). 3

(5) Test words within two prosodic conditions a. [ (ja.jang.ga) AP ] IP A lullaby b. [ (ja.jang.ga) AP (pul.reo.ju.se.yo) AP ] IP Please, sing a lullaby The list of test words was presented in Korean orthography. Fillers were included to avoid list effects. Before recording, subjects practiced reading randomly ordered test words to familiarize theelves with the materials. All recordings were made in the recording room in the Indiana University Linguistics Department. Recordings were analyzed using the software SoundScope on a Mac II in the Indiana University Phonetics Laboratory. 2.1.4. Measurement Measurements were taken of syllable, vowel, and consonant durations. In addition, the fundamental frequency of each vowel was measured. For the fundamental frequency measurements, we measured three points of vowels: 1) the onset, 2) the steady-state portion and 3) the offset of each vowel. This study only reports the fundamental frequency values of the steady-state portions of vowels. 2.2. Results 2.2.1. Vowel duration The results of the mean vowel duration as a function of syllable position are shown in (7). (7) Mean vowel duration for each speaker a. V-duration (Speaker 1: IP) b. V-duration (Speaker1: AP) 3 3 25 247.9 247.1 25 231.25 2 2 15 1 12.35 18.75 16.1 157.25 15 1 16.35 91.25 96.6 122 155.5 5 5 V1 V2 V3 V1 V2 V3 4

c. V-duration (Speaker 2: IP) d. V-duration (Speaker 2: AP) 3 3 25 25 2 198.1 2 15 1 98 125.5 121.95 153.4 15.3 15 1 114.55 15.5 17.3 144.1 141.75 166.1 5 5 V1 V2 V3 (V1: initial vowel; V2: middle vowel; V3: final vowel; : heavy syllable; : light syllable) V1 V2 V3 As seen in (7), the effect of syllable weight is salient: the combined mean vowel duration of light syllables were significantly longer than that of heavy syllables (t (118)=4.94, p<.1 for Speaker 1; t(118)=4.695, p<.1 for Speaker 2). For the vowel duration of the IP for Speaker 1, a one-way ANOVA revealed that a significant difference among syllable positions for light syllables [F (2, 57)=26.338, p<.1 for light syllables], while heavy syllables show a slightly larger than the.1 alpha level [F (2, 57)=4.488, p=.15]. According to post-hoc tests, initial vowel duration showed the longest value for the IP light syllables. For the vowel duration of the AP for the same speaker, there is a significant durational difference among syllable positions as well [F(2, 57)=128.65, p<.1 for light syllables; F(2, 57)=33.29, p<.1 for heavy syllables]: multiple comparisons showed that initial and medial vowel durations are significantly longer than final vowel durations for both light and heavy syllables. Let us turn to speaker 2. For the IP condition, first, there was a significant durational difference among syllables for both heavy and light syllables [F (2, 57)=55.151, p<.1 for light syllables; F(2, 57)=32.26, p<.1]: final vowels are significantly longer than medial vowels, in turn, which are significantly longer than initial vowels for both light and heavy syllables. For the AP condition, there was also a significant difference in vowel duration among syllable positions across heavy and light syllables [F(2, 57)=3.452, p<.1 for light syllables; F(2, 57)=24.483, p<.1 for heavy syllables]: for light syllables, final vowels are significantly longer than medial vowels, and initial vowels are the shortest. For heavy syllables, however, final vowels are the longest, and there is no significant durational difference between initial and medial vowels. 2.2.2. Syllable duration Like vowel duration shown above, there is a significant difference in syllable duration between light and heavy syllables; unlike vowel duration, however, heavy syllables are much longer than light syllables (t (118) = -8.135, p<.1) as shown in (8). 5

(8) Mean syllable duration for each speaker a. Syll-duration (Speaker 1: IP) b. Syll-duration (Speaker 1: AP) 45 438.1 45 4 4 35 344.6 324.5 35 337.5 3 294.1 267.6 3 284.3 27.85 25 2 235.9 25 2 232.4 173.9 15 15 145.15 1 1 5 5 c. Syll-duration (Speaker 2: IP) d. Syll-duration (Speaker 2: AP) 45 45 4 4 35 3 313.25 296.15 35 3 273.55 269.35 25 2 235.5 161.35 21.25 239.3 25 2 237.8 192.35 199.8 15 15 128.85 1 1 5 5 (S1: initial syllable; S2: middle syllable; S3: final syllable) This is not surprising since heavy syllables have a coda. Let us take a closer look at speaker 1, first. For the IP condition, as shown in (8a), the syllable duration revealed a significant difference among syllable positions [F (2, 57)=26.338, p<.1 for light syllables: F (2, 57)=59.34, p<.1 for heavy syllables]: regardless of syllable weight, medial syllables are longer than other syllables. For the AP condition, both light and heavy syllables showed significant differences in syllable duration as shown in (8b) [F (2, 57)=71.899, p<.1 for light syllables; F (2, 57)=5.377, p<.1 for heavy syllables]. Interestingly, this AP condition has the longest value on final syllables, like vowel durations for the AP, which is different from the IP condition. Speaker 2 showed similar patterns to speaker 1. For the IP condition, as shown in (8c), both light and heavy syllables showed significantly different syllable durations depending on syllable positions [F (2, 57)=34.976, p<.1 for light syllables; F (2, 57)=53.86, p <.1) for heavy syllables]. For light syllables, final syllables are longer 6

than medial syllables, in turn, which are longer than initial syllables. However, for heavy syllables, there is no significant difference in syllable duration between medial and final syllables, which is, however, statistically longer than initial syllables. For the AP condition, both light and heavy syllables showed a significant difference as a function of syllable position as in (8d) [F (2, 57)=33.63, p<.1 for light syllables; F (2, 57)=7.868, p=.1 for heavy syllables]. Multiple comparisons results showed that both medial and final syllables are longer than initial syllables across syllable weight. We have shown so far that there are not necessarily the same patterns between vowel and syllable durations. syllables showed the longest medial syllables with the longest final vowel duration. Thus, we turned to onset and coda durations to see durational patterns for internal structure of syllables. 2.2.3. Onset duration Average onset durations by each speaker are shown in (9). (9) Mean onset duration for each speaker a. Onset-duration (Speaker 1: IP) b. Onset-duration (Speaker 1: AP) 12 12 14.45 1 1 8 84.5 77.4 78.65 8 6 4 41 46.2 6 4 37.6 38.8 6.85 51.9 63.8 39.6 2 2 Onset1 Onset2 Onset3 Onset1 Onset2 Onset3 c. Onset-duration (Speaker 2: IP) d. Onset-duration (Speaker 2: AP) 12 12 1 1 8 6 4 37.4 35.85 69.95 47.85 62.9 41.2 8 6 4 38.95 57.25 48.25 47.9 33.7 2 2 14.3 Onset1 Onset2 Onset3 Onset1 Onset2 Onset3 7

For speaker 1, as shown in (9a), there is a significant positional effect on onset durations regardless of syllable weight for the IP condition [F (2, 57)=12.413, p<.1 for light syllables; F (2, 57)=19.414, p<.1 for heavy syllables]. For the AP condition, heavy syllables showed a significant positional differences in onset duration [F (2, 57)=1.95, p<.1] while light syllables did not [F (2, 57)=3.32, p=.44] as shown in (9b). In general, onset duration in medial syllable position is longer than other syllables across syllable weight and phrasal conditions. (9c) and (9d) showed mean onset durations for speaker 2; all across the board, medial onsets are longer than other positions. For the IP condition, only heavy syllables showed significantly different onset durations depending on syllable positions [F (2, 57)=14.57, p<.1] while light syllables showed no significant differences [F (2, 57)=1.75, p=.191]. On the other hand, the AP condition showed a significant positional effect on onset durations for light syllables [F (2, 57)=9.925, p<.1] and marginally significant differences for heavy syllables [F (2, 57)=4.344, p=.18]. 2.2.4. Coda duration For coda duration, only heavy syllables were considered since light syllables have no codas. Following are results of mean coda duration for each speaker under different phrasal conditions. (1) Mean coda durations for each speaker a. Coda-duration (Speaker 1) b. Coda-duration (Speaker 2) 25 224.9 25 2 183.25 2 15 1 13.55 126.85 77.45 118.2 IP 15 AP 99.65 1 93.35 121.35 19 82.95 79.7 IP AP 5 5 Coda1 Coda2 Coda3 Coda1 Coda2 Coda3 As shown in (1), positional effects were maintained with longest medial codas. A oneway ANOVA revealed that there is a significant difference in coda duration for the IP [F (2, 117)=6.371,p=.2], but not for the AP [F (2, 117)=.357, p=.71] for speaker 1 as shown in (1a). As shown in (1b), however, Speaker 2 did not show any significantly different coda durations as a function of syllable position even though there is a tendency to have longest medial codas regardless of phrasal conditions [F (2, 117)=1.23, p=.296 for the IP; F (2, 117)=.85, p=.449 for the AP]. 8

So far we have shown that durational differences are significant due to the effects of syllable position and syllable weight. In addition, these durational differences seemed to be highly affected by phrasal conditions. Lastly, we examined the effects of syllable position and syllable weight on the fundamental frequency in the following section, which is highly likely to be influenced by phrasal conditions as mentioned in section 1. 2.2.5. Fundamental frequency As previously mentioned, Korean prosodic units seem to be determined by pitch contours or tonal patterns (Jun, 1993, 1998). As shown in (1a & b), the AP has a phrase final rising pattern, or a LH, and the IP may have a LHL boundary tone for declarative sentences. Given this, we turn to the results of fundamental frequency patterns. The fundamental frequency contours by prosodic units and by speakers are shown in (11). (11) Fundamental frequency contours for each speaker a. F (Speaker 1: IP) b. F (Speaker 1: AP) 2 2 16 16 Hz 12 8 Hz 12 8 4 4 c. F (Speaker 2: IP) d. F (Speaker 2: AP) 2 2 16 16 Hz 12 8 Hz 12 8 4 4 As shown in (11), in general, the tonal patterns of each prosodic unit confirmed the previous studies in that the IP has a final fall while the AP has a rise. 9

However, there seemed to be a speaker-specific difference in these tonal patterns: For the IP condition, speaker 1 showed a falling tonal pattern with the highest value on the initial syllable whereas speaker 2 showed Low-High-Low tonal patterns, as in previous studies. For the AP, speaker 1 showed Low-High tonal patterns while speaker 2 showed Low-High-High tonal patterns. There were no syllable weight effects on the fundamental frequency contours across the speakers and phrasal conditions [for speaker 1 (t (58)=-1.89, p=.278 for the IP; t(58)=.638, p=,525 for the AP) and for speaker 2 (t (58)=-.259, p=.796 for the IP; t(58)=-.497, p=.62 for the AP)]. However, there was a significant positional effect on the F. For the IP, speaker 1 showed higher F values on the first syllables [F (2, 57)=7.773, p<.1 for light syllables; F (2, 57)=26.224, p<.1 for heavy syllables]. For speaker 2, the F value on the second syllable is higher than that of initial syllable, which, in turn, was significantly higher than final syllables [F (2, 57)=288.45, p<.1 for light syllables, F (2, 57)=327.34,.p<.1 for heavy syllables]. As mentioned earlier, the AP final tonal pattern showed a rise, a LH, and the results confirmed this tonal pattern with the highest F values for final syllables across speakers [For speaker 1: F (2, 57)=13.636, p<.1 for light syllables & F (2, 57)=2.49, p<.1 for heavy syllables; for speaker 2: F (2, 57)=288.45, p<.1 for light syllables & F (2, 57)= 27.34, p<.1 for heavy syllables]. 2.3. Discussion For durational patterns, the final vowels were significantly longer than vowels in the first two syllable positions except for speaker 1's IP condition 2. This durational difference was turned out to be a final lengthening effect by the post hoc analysis. Interestingly, for heavy syllables, medial syllable durations were noticeably longer than other syllables as far as syllable durations were concerned (except for speaker 1's AP condition). This might indicate that the second syllable position would be prominent or stressed. In addition, the durational patterns for the constituents of a syllable also showed that medial positions were different from other syllable positions. That is, coda durations and onset durations combine to give longer durations. Therefore, it might be possible to say that a second syllable position seemed to be favored for attracting prominence or stress just based on the durational patterns for heavy syllables. This is not the case for vowel durations. Seemingly, there is a conflict in durational patterns between vowels and syllables. Vowel durations indicate an edge effect, that is, a boundary lengthening effect while syllable durations might reflect the prominence pattern of words, a head effect in Korean. Fundamental frequency patterns also seem to support these conclusions. In general, the medial syllable position shows higher F than other syllables, especially for speaker 2. That is, the peak of F is shown near onset of the second syllables. Results of this study, however, seem to be problematic considering previous studies. According to Jun (199, 1994), the AP initial segments were lengthened, which implied that the IP initial segment duration would show greater lengthening since the IP is higher than the AP in a prosodic higher in a prosodic hierarchy. Compared to these studies, our 2 For the IP condition, speaker 1 see to have the so-called 'strengthening strategy' for initial two syllables with longer vowel duration and higher fundamental frequency on those initial two syllables. 1

results seem to be contradictory. Since Jun s argument was based on the durations of lenis stops in the AP and I measured initial sonorant segment durations in the IP, this difference might be a possible reason for the different results. 3. EXPERIMENT II: PERCEPTION In order to examine whether a certain syllable is perceived as prominent, a perception test was conducted. It see reasonable to expect that a listener s linguistic background, to some extent, affects his judgment of prominence. The extent to which judgment is affected depends on the classification of the language in question as a stress, tone, or quantity language. It may be expected that listeners who speak a stress language, such as English, are more sensitive to some acoustic cues in stress judgment, while native speakers of pitch-accent languages like Japanese may use different acoustic cues in prominence perception (cf. Beckman, 1986). In this perception test, native speakers of three different languages, English, Japanese, and Korean, participated as listeners. They were asked to indicate the most prominent syllable of utterance they heard. 3.1. Method 3.1.1. Subjects Five native speakers each of English and Japanese, and six Korean native speakers participated as listeners in this perception test. They were selected from the Indiana University population. They were all naïve to the purpose of the test. They also reported no history of speech and hearing related impairments. 3.1.2. Stimuli and Procedures In the production experiment, two subjects participated. Of the two, speaker 2 s speech data were presented to the listeners using a Sony mini disk cassette player 3. Each word was repeated five times. Then, the listeners were asked to mark prominent syllables on answer sheets. If the listeners were not certain or would fail to mark any syllable, the words were repeated once more. Then, the number of responses for prominence was counted and its distribution was compared by syllable positions and by listeners. 3.2. Results (12) shows the responses for prominence in the IP by listeners. The results indicated that medial syllables heard to be perceived as prominent across listeners. With a relatively high percentage, initial syllables were also marked as prominent following second syllables as shown below. Final syllables, however, did not seem to be favored for prominence for the IP. A few numbers of uncertainties for prominence were also observed, especially, among the Korean listeners. 3 Compared to speaker 1, speaker 2 showed smaller range of variation in data values. Therefore, we used speaker 2's speech data for the perception test. 11

(12) Percentage for prominence by each listener in the IP IP 7% 6% 5% 4% 3% 2% 1% % ENG JPN KOR S1 S2 S3 Not sure (13) Percentage for prominence by each listener in the AP AP 7% 6% 5% 4% 3% 2% 1% % ENG JPN KOR S1 S2 S3 Not sure Turning to the results of the AP, as seen in (13), it was found that a greater number of responses indicated prominence on medial syllable positions regardless of listener group. A closer examination also revealed differences among listener groups: English listeners tended to choose final syllables as prominent, whereas Japanese and Korean listeners showed similar patterns favoring medial syllables. 3.3. Discussion It is not surprising to find that different language speakers used different acoustic cues in perceiving prominence or stress. It is, however, interesting to see that they responded to the same speech data differently. As is well known, stressed syllables show longer duration, higher pitch and greater intensity in English (Fry, 1955, 1958; Lieberman, 196; Lehiste, 197). Based on the results of production and perception experiments, English speakers seem to use vowel lengthening, syllable duration and absolute pitch maxima in prominence perception: for the IP condition, they marked the second syllable as most prominent. Given the longest durations in syllables, onsets, and codas, and highest F values on second syllables, their choice for prominence see to be based on the maximal duration and pitch contour or/and the combination of the two. For the AP, their choice for the final syllable as most prominent could well match the longest vowel duration and syllable duration, and highest F on final syllables. It may be difficult to 12

say which acoustic factor has a primary function in prominence perception for English speakers based on the results of this study. However, it may not be difficult to say that they are very sensitive to these acoustic factors and using these as phonetic cues. Being speakers of an accented language, it is expected that Japanese listeners are more sensitive to pitch movement between syllables. Their perception data imply that they may use a relative change in pitch movement as a cue for prominence. In other words, they perceived as prominent or accented (or stressed), if there is any relatively big increase or decrease in pitch movement available between syllables. For Korean listeners, based on their perception data, they showed a similar pattern to Japanese listeners implying that they might be more sensitive to relative pitch movement rather than to durational change. However, the results did not directly evidence this. Related with recent studies where Korean see to lose vowel length distinction (Ingram and Park, 1997), durational factor may not play an important role in prominence perception. 4. CONCLUSIONS In this study, a production test was carried out to fulfill two functions; 1) to examine syllable positional and weight effects on duration and fundamental frequency and 2) to determine which acoustic correlates could be used as perceptional cues in Korean word perception. Results indicate that syllable weight and syllable position play a key role in affecting durational patterns and intonational contours. The perception study examined the extent to which Korean word prominence is perceived by native speakers of English, Japanese, and Korean. Results show that second syllables of three syllable words seem to be perceived as prominent in Korean. First syllable positions can also be a strong candidate for word prominence perception. However, it is also clear from the present results that the prominence perceived by Korean listeners is not the same as by English listeners. ACKNOWLEDGMENTS I would like to thank Ken de Jong, Stuart Davis, Yongsung Lee, Kwang-Chul Park, Chin Wan Chung, and Minkyung Lee for their valuable comments and encouragement. References Beckman, M. E. 1986. Stress and Non-stress Accent, Netherlands Phonetic Archives 7. Dordrecht: Foris. Beckman, M. E. and J. Edwards. 1991. Prosodic Categories and Duration control. Abstract in Journal of the Acoustical Society of America, 89 (4, pt.2): 1869. Beckman, M. E., J. Edwards, and J. Fletcher. 1992. Prosodic Structure and Tempo in a Sonority Model of Articulatory Dynamics. In Docherty, G. and D. R. Ladd, eds,. Papers in Laboratory Phonology II: Segment, Gesture, and Tone. Cambridge: Cambridge Univ. Press. Beckman, M. E., and J. B. Pierrehumbert. 1986. Intonational Structure in Japanese and English. Phonology Yearbook 3: 255-31. 13

Bolinger, D. L. 1958. A theory of pitch accent in English. Word 14: 19-149. Couper-Keuhlen, E. 1986. An Introduction to English Prosody. London: Edward Arnold. de Jong, K. 1994. Initial Tones and Prominence in Seoul Korean. OSU Working Papers in Linguistics 43: 1-14. Fry, D. B. 1955. Duration and intensity as physical correlates of linguistic stress. Journal of Acoustical Society of America 23: 765-769. Fry, D. B. 1958. Experiments in the perception of stress. Language and Speech 1: 126-152. Ingram, J. C. and S-G Park, 1997. Cross-language vowel perception and production by Japanese and Korean learners of English. Journal of Phonetics, Vol. 25, No. 3, Jul 1997, pp. 343-37 Jun, J. H. 1994. Metrical Weight Consistency in Korean Partial Reduplication. Phonology 11: 69-88. Jun, S. A. 199. The Domains of Laryngeal Feature Lenition Effects in Chonnam Korean. Paper presented at the 119 th ASA meeting. Jun, S. A. 1993. The Phonetics and Phonology of Korean Prosody. Ph. D. dissertation, The Ohio State University. Jun, S. A. 1995a. A Phonetic Study of Stress in Korean. Paper presented at the 13 th ASA meeting, St. Louis, MO. Jun, S.A. 1995b. Asymmetrical Prosodic Effects on the Laryngeal Gesture in Korean. In B. Connel and A. Arvaniti (eds.) Phonology and phonetic evidence: Papers in Laboratory Phonology IV: 235-253. Cambridge University Press, England. Jun. S. A. 1998. The Accentual Phrase in the Korean Prosodic Hierarchy. Phonology 15: 189-226. Kim, J. K. 1998. Anti-Trapping Effects in an Iambic System: Vowel Shortening in Korean. The 8 th Japanese/Korean Linguistics: 289-32. Lee, H. B. 1973. The Accent of Modern Korean (written in Korean), Seoul-tae-hak-kyomul-li-tae-hak-po 19: 113-128. Lee, H. B. 1985. Standard Pronunciation of Korean (written in Korean). Seoul: Phonetic Society of Korea. Lee, H. Y. 199. The Structure of Korean Prosody, Ph. D. dissertation, University of College London. Lehiste, I. 197. Suprasegmentals, MIT, Cambridge, MA. Lieberman, P. 196. Some acoustic correlates of word stress in American English. Journal of Acoustical Society of America 22: 451-454. Pierrehumbert, J. B. and M. E. Beckman. 1988. Japanese Tone Structure, Linguistic Inquiry, Monograph 15. Cambridge, Mass: MIT Press. Tak, J. Y. 1997. Issues in Korean Consonantal Phonology, Ph. D. dissertation, Indiana University. Terken, J. 1991. Fundamental frequency and perceived prominence of accented syllables. Journal of Acoustical Society of America, 89 (4): 1768-1776. Van Katwijk, A. 1974. Accentuation in Dutch. Van Gorkum, Assen, The Netherlands. Yu, J. 1989. A Study of the Accent-Placement Rules in Modern Korean (written in Korean). Sungkoknonchong 19: 293-322. 14