Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech

Size: px
Start display at page:

Download "Quarterly Progress and Status Report. VCV-sequencies in a preliminary text-to-speech system for female speech"

Transcription

1 Dept. for Speech, Music and Hearing Quarterly Progress and Status Report VCV-sequencies in a preliminary text-to-speech system for female speech Karlsson, I. and Neovius, L. journal: STL-QPSR volume: 35 number: 1 year: 1994 pages:

2

3

4 STL-QPSR been added. The LF source pulse is defined by four parameters: the shape parameters RG, RK, FA and the excitation energy EE. The noiselvoice source interaction is specified by the parameters NA and NM. NA specifies the amount of glottal flow modulated noise added to the glottal pulse. NM specifies the degree of glottal flow modulation of the noise source. The higher formants, which was realised as a higher-pole correction in male speech produced with the OVE m model, are now implemented as three separate formants (F5-F7), with constant relative distance. These new features have been proven to be essential for generating natural-sounding female speech. They have also improved the production of emotive synthetic speech (Carlson et al., 1992). A further extension of the GLOVE synthesiser has been tested in this study. The extension consists of a polelzero pair (FP,BP/FZ,BZ) that has been included in the vocal branch. The pole is implemented as a formant filter. The zero consists of an inverse formant filter. Both are controlled by specifying mid frequency and bandwidth. When the polelzero pair is not needed, both are placed at the same frequency and given a very low Q-value and will thus cancel each other. The experiments reported in this study were run on a version of the GLOVE model in a digital signal processor (DSP) simulator implementation (Carlsson & Neovius, 1990). The simulator runs as a separate server in our HPlApollo computer network, as a part of our experimental dialogue system (Blomberg et al., 1993). It can be directly accessed from the RULSYS speech synthesis development program, using the speech command in, for example, HISYS edit mode. Speech stimuli are generated at close to real-time rate on an HPIApollo 730. The simulator can easily be transferred to a standalone application with a floating point DSP for real-time performance in a text-tospeech device. Analysis Detailed analysis of female speech, for example using inverse filtering technique, has improved our knowledge of voiced speech segments (Karlsson, 1990; Karlsson, 1992). The main objective of inverse filtering is to eliminate the vocal tract filter function and to obtain a representation of the glottal flow in speech, but this technique has also proved to be valuable for detailed descriptions of voice source behaviour and of vocal tract characteristics. The technique is to cancel each formant in the speech wave with an inverse formant filter, with the corresponding mid-frequency and bandwidth, see Figure 1. To attain a good inverse filtered voice pulse, it was necessary to cancel extra pole/zero pairs for many consonants and adjoining parts of vowels. These polelzero pairs can have different origins. In nasals and laterals they are due to the geometry of the vocal tract (Nord, 1976). Voiced fricatives contain polelzero pairs that have their origins in the cavity behind the frication source. In aspirated articulations, the vocal cords never close completely. Accordingly, the cavities below the glottis are not acoustically isolated from the vocal tract. This will create polelzero pairs similar to what occurs in leaky or breathy voices (Klatt & Klatt, 1990). For the /h/ shown in Figure 1, the polelzero pair has the frequencies of 1697 Hz and Hz, respectively.

5 STL-QPSR Example of inverse filtering of the consonant /h/ in a VCV sequence. time in seconds speech inverse filtered speech integrated inverse filtered speech (equivalent to glottal air flow) Inverse filter specification the upper value gives the mid-frequency, the lower value the bandwidth in Hz spectrum of one fundamental period of the speech wave spectrum of the same period after inverse filtering khz both spectra are lifted by 6 dbloctave Figure 1. Representation of the formant matching component of the interactive inverse filtering program. The upper window displays the current glottal pulse. The lower window shows the spectrum of the wave before and after inverse filtering. After the initial LPC estimate of poles, the poles (down arrows) and the zeroes (up arrows) are positioned by hand. Mid-frequency is specified along the x-axis and bandwidth along the y-axis. Analysis data on voice source and vocal tract behaviour for this study was accordingly collected using inverse filtering technique with an interactive program. Preliminary formant frequencies and bandwidths were calculated automatically using the Linear Prediction auto correlation method. The formants and bandwidths were then finely tuned by hand, using both time and frequency representations of the speech wave before and after filtering. Zeroes and extra poles in the voice spectrum were also determined. The LF voice source model was then fitted to the inverse filtered voice pulse by hand, see Figure 2. The parameters of the LF-model for consecutive voice pulses were collected in tables together with formant, pole and zero mid-frequency and bandwidth values. These parameter tables were interpreted and formalised to a small number of data points for each VCV speech segment. The parameters were then used to produce handedited synthetic speech stimuli. The quality of these stimuli was modified by interactive editing and listening. Special effort was invested in the parameter transitions. The parameter data were further reduced to make it feasible to write new rules for the speech synthesis. The results of the proposed rules were continuously checked by generating text-to-speech stimuli.

6 time in seconds inverse filtered speech wave with the LF-model superimposed LF-model parameter values: FA= 180Hz FO = 250 Hz RG = 95% RK = 45% EE = 60 db, not calibrated but of interest when comparing different segments spectrum of one fundamental period of the inverse filtered speech wave spectrum of one pulse of the LF-model both spectra are lifted by 6 dbloctave khz Figure 2. Representation of the glottal flow pulse derivative matching component of the interactive inverse filtering program. The upper window displays the glottal source after the current setting of the inverse filter overlaid by the best approximation of the LF-pulse. The lower window displays the spectrum of these pulses. Results The analysis data used for the present experiment are taken from an ongoing and only partly published study of the acoustics of consonants in female speech (Karlsson, 1992). The speech material for that study consists of all Swedish consonants produced in VCV sequences in a carrier phrase. The vowels were short /a/, /u/ or /i/ and the initial and final vowel was the same. The initial vowel was always stressed, Swedish accent 1. In Figure 3, an example of the sequence /ili/ is shown together with two synthesised versions of the same sequence. Data for some consonants in different vowel contexts are given in Table 1. I Glottal parameters for consonants The LF-model (Fant et al., 1985) approximates the derivative of the glottal flow pulse using four parameters. The parameter EE decides the excitation strength, i.e. the amplitude. RG decides the balance between the first and second harmonic, and is normally higher in a more tense spealung mode. RK is lower in a tense voice than in a soft voice. FA is typically low for a soft voice and high for a tense voice. In Table 1 some examples of LF-parameter values for consonants are given. These values are fairly typical for all consonants investigated so far. The excitation strength, EE, was about equal for the nasals and weaker in /h/ and 111 compared to the preceding

7 Figure 3. Spectrograms and spectral sections of /ili/produced by a female speaker and by old and new synthesis rules. The old synthesis does not contain the pole/zero pair, all other parameters are the same for the old and the new synthesis. The spectral sections are taken from the middle of the /l/. stressed vowels. The consonant /V shows a characteristic pattern. During the first few pulses, there was an additional 1-3 db dip in the EE level. This can be seen in the spectrogram of natural speech in Figure 3. The RG parameter is not included in the table as the variation in this parameter was very small and did not seem to be related to the specific segment. The parameter RK was generally higher for the consonants compared to adjacent vowels. In Table 1 the largest differences in RK values are found in /h/ and in /rn/ in /a/-context. A high RK value means that the first harmonic is comparatively strong. The parameter FA was lower, i.e. the return time was longer, in all the consonants shown in Table 1 than in the surrounding vowels. This implies that the voice source in consonants contained relatively less high frequency energy. The lowest FA values were found in the nasals and the voiced plosives. FA was also low in nasalised parts of vowels; compare vowels before nasal in Table 1 with the other vowels. There is also a consistent difference between vowels; high front vowels always have lower FA values than open vowels. In Swedish, /h/ is voiced and aspirated in fully voiced contexts. The higher formants are to a large degree excited by aspirative noise. FA for /h/ was decided by studying the frequency content below approximately 2 khz as FA pertains to properties of the voiced source and, in /h/, the energy above 2 khz is mainly aspiration. Polelzero analysis Polelzero pairs were identified in the voiced consonants and in transitions from vowels to consonants, see Figure 1. Examples of the most distinct poles and zeros in some consonants are listed in Table 1. The polelzero pairs in these consonants are of different origins.

8

9 STL-QPSR compared to the other vowel contexts. This may be due to a different articulation of Ill. The polelzero pair in Iil before Ill is presumably due to incomplete glottal closure. In a transition from vowel to nasal, the passage through the nose is opened before the oral passage is closed (Nord, 1976). This results in poles and zeros in the vowel that are slightly different from those in the nasal. As can be seen in Table 1,1111 and /rn/ have similar polelzero data. The difference in articulation is manifested in F2 and F3 targets, and in back rounded vowel context the target for the zero. In earlier experiments (Nord, 1976), a polelzero pair at about 500 Hz in nasals was identified. In the synthesis experiments performed for this study, this polelzero pair was simulated by a broadening of the first formant bandwidth in combination with a low FA. The Nord study and the present experiment show that the polelzero pair between 1000 and 2000 Hz has a perceptual impact. Text-to-speech rules The analysis data were formulated as definitions and rules for the KTH text-to-speech synthesis, following the guidelines below. Some examples of parameter settings achieved by these rules are shown in Table 2. The rules was used in listening test 1, that will be described later in this paper. /V Basic setting After back rounded vowel FP BP FZ BZ EE EE Iml Basic setting in nasalised vowel in /m/ In back rounded nasalised vowel in /m/ after back rounded vowel FP B P FZ BZ In/ Basic setting in nasalised vowel in In/ In high front nasalised vowel in In/ after high front vowel FP B P FZ BZ Table 2. Values of some synthesis parameters decided by rules. Frequency and bandwidth values for the pole/zero pair (FP,BP / FZ,BZ) are given in Hz. The LF source parameter EE is given in db relative to EE in the preceding vowel. EEl gives the initial value, EE2 the value after 20 ms. For /h/, a polelzero pair was introduced. The frequencies and bandwidths were the same for all vowel contexts, according to analysis data. The LF parameter EE was decreased considerably in Iil and Iul context, less in /a/ context. RK was increased and FA was set to a low value. The aspiration was produced using NA, that introduces a modulation of the noise source. For Ill, two different aspects were formalised. The polelzero pair was specified in one data point. The specifications were the same for lil and la/ contexts, for Iul-context the polelzero pair as well as the formant values were lower. The excitation energy was reduced for the first 20 ms of the segment, see also Figure 3, which improved the naturalness notably.

10 STL-QPSR For the nasals I d and In/, three data points for the polelzero pair were specified. The first and last of these define the nasalisation of the preceding and succeeding vowels, the second defines the nasal. Separate rules modifying the polelzero pair had to be formulated for I d adjacent to a back, rounded vowel and for In/ adjacent to a high, front vowel. The polelzero values for the different contexts are given in Table 2. The nasal branch was not used in this experiment. The excitation energy was set to about 7 db less than for the preceding, stressed vowel. For In/ in Iil-context the excitation energy was set equal to the vowel value. FA was decreased to a low value, typically 400 Hz, for the nasalised part of the vowels, and even lower in the nasal. Listening tests Two tests were performed on different versions of the preliminary rule system. In the first test, a subset of the Swedish consonants were tested. The subjects were asked to mark the better of two versions of a consonant in a VCV context and to rate the difference. We believe that the test gives an estimate on both quality and naturalness. The test was an ABAB-test where A and B were different synthesis-by-rule versions of the same VCV. The intended VCV-stimuli was given on the answering sheet. The stimuli pairs were produced using an old and a new set of synthesis rules. The differences between the old and the new version consisted mainly of inclusion of the polelzero pair and modifications of the source parameters. Six subjects participated in this listening test. They had some experience of male speech synthesis. Test stimuli The consonants I d, In/, Iql, Ill, /h/, /v/ Id/ and If1 in /a/, Iil and Iul-contexts were tested. The context vowels were short and the first vowel was stressed, Swedish accent 1. The differences within the nasal pairs was that the old version contained nasal murmur shaped by the nasal branch and the new nasal was produced using only the vocal branch with the polelzero pair. The new versions of Ill and /v/ contained the polelzero pair as opposed to the old versions. For /h/, in the new version a different aspiration source, NA, was used in combination with the polelzero pair. In the new Id/, the voice bar was produced by setting the LF-source parameters FA very low and RK high, in the old Id/ the nasal branch produced the voice bar. There was no difference between the old and the new If/, the pairs were included to verify the test. Test results The results were encouraging, even though the old version was sometimes preferred. Subjects commented on a high degree of naturalness of the female speech. The new version of I11 was clearly preferred by the subjects. For Ivl there was a weak but clear tendency for a preference of the new version. The nasals showed less clear results. The new I d and In/ were preferred in four context, but for two pairs the old version was strongly preferred. The old Iql was preferred as was the old /h/, but the difference was small. The new Id/ was not as good as the old one. The If1 pairs got equal ratings and some subjects remarked that they were the same.

11 STL-QPSR The results of the first test of the new synthesis version were promising, but improvements both in rules and synthesiser were indicated. A close inspection of the new version nasals showed that many of these contained a large amount of noise that was caused by a less-than-optimal timing of parameter changes. This may in some cases explain the strong preference for the old version. The noise disappeared after a modification of the timing. In the new /dl, the amplitude change was too slow, which meant that it sounded more as a sonorant. After a speeding up of the parameter changes, the new Id/ is greatly improved. In the new /h/, the spectral content of the NA source need to be optimised. Second test Before the second test was run, a major revision of the rule system was performed. The work was focused on getting a complete inventory of consonants for the female textto-speech system as opposed to producing a limited set of VCV-sequences. The modifications indicated by listening test 1 were included in the rules. In the second test, the participants were asked to correctly identify the consonant in a VCV stimuli, following the SAM procedure (Goldstein & Till, 1992). To get data on the complete consonant inventory, the retroflex allophones where included. This resulted in 23 consonant phonemes. Each consonant was presented in three different vowel contexts, /a-a/, /i-i/, /u-u/. In this test, the stimuli were produced with Swedish accent 2. The vowels were short. Stimuli were presented once in a forced-choice test and the vowel context was indicated in he answering form. Ten subjects participated in the test. Overall error rate was 48% for correct identification. When only manner of articulation was considered, the error rate was 20%. Some consonants gave a very high confusion rate, in particular /h/ and the retroflex consonants. One possible reason for the low intelligibility of the consonant /h/ is that we so far lack knowledge on how to control the NA parameter by rule. The retroflex consonants are comparatively rare in the language and the voiced retroflexes hardly ever occur after a short vowel. They are often not included in this type of test. When the retroflexes were excluded, overall error rate decreased to 3 1 % for correct identification. The corresponding overall error rate figure for natural male speech is 5.6%, and for the latest revision of the KTH male speech synthesis 8.7% (Goldstein & Till, 1992). Measuring manner of articulation, the error rate was 18% on average. Only a few comments about the results will be made as this test is meant to be a starting point for future work. The use of glottal source control to produce the voice bar in the voiced plosives gives the desired responses. Very few confusions with other manners of articulation occurred. The production of the voice bar using glottal parameters makes it possible to generate more or less reduced occlusions. The introduction of a polelzero pair and an amplitude dip gave a convincing Ill. There was a trend to perceive the unvoiced plosives as voiced, which was particularly strong for /a/-context. A possible explanation is that the voice off-set of the preceding vowel is too slow. The nasals, that in test 1 were rated equally good in quality to the old nasals, appears to have lost their nasality. Confusions with non-nasals are particularly frequent for the /u/ context. It seems as if the transfer of the hand-

12 STL-QPSR edited rules tested in test 1 into general definitions and rules tested in test 2 has not always been completely successful. Conclusions In this report we have presented a framework for future text-to-speech development for generating more natural sounding synthetic speech. The inclusion of an extended GLOVE model within the RULSYSIHISYS environment provides a powerful tool for generating improved synthetic speech. Inverse filtering with matching against the LFmodel supplies adequate analysis data for the formulation of synthesis rules. The experiments have focused on female speech as the quality of the synthetic female speech can not be improved from the present status without inclusion of a versatile voice source and more complex articulation possibilities. Analysis data on consonants and vowel-consonant transitions suitable for the synthesis model has been presented. The analysis data have been formalised and incorporated within the rule system. The inclusion of an extra polelzero pair in the vocal branch give us a possibility to generate more natural synthetic speech. Results from a listening test with hand-edited VCV-sequences verify this. Formalising the data to rules is still at an initial stage. The second, diagnostic test reported in this study indicates where future efforts are needed. Acknowledgement This work has been supported by FRN, NUTEK, HSFR, KTH and the European CEC- ESPRIT project SPEECH MAPS. References Blomberg, M., Carlson, R., Elenius, K., Granstrom, B., Gustafson, J., Hunnicutt, S., Lindell, R. & Neovius, L. (1993). "An experimental dialogue system: Waxholm", Proc. of Eurospeech '93, pp Carlson, R., Granstrom, B. & Hunnicutt, S. (1990). "Multilingual text-to-speech development and applications", Advances in speech, hearing and language processing (B Ainsworth, ed.), Vol. 1, London: JAI Press Ltd., pp Carlson, R., Granstrom, B. & Karlsson, I. (1991). "Experiments with voice modelling in speech synthesis", Speech Communication, Vol. 10, pp Carlson, R., Granstrom, B. & Nord, L. (1992). "Experiments with emotive speech - acted utterances and synthesized replicas", Proc. of ICSLP '92, pp Carlson, R. & Nord, L. (1991). "Positional variants of some Swedish sonorants in an analysissynthesis scheme", Journal of Phonetics, Vol. 19, pp Carlsson, G. & Neovius, L. (1990). "Implementations of synthesis models for speech and singing." STL-QPSR 2-3/1990, pp Chafcouloff, M. (1985). "The spectral characteristics of the lateral I11 in French", Travaux de I'lnstitut de Phone'tique d'aix, Vol. 10, pp Cranen, B. & Boves, L. (1987). "On subglottal formant analysis"; JASA, Vol. 81, pp Fant, G., Liljencrants, J. & Lin, Q. (1985). "A four-parameter model of glottal flow.", STL- QPSR , pp I!

13

14

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula

Quarterly Progress and Status Report. Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Voiced-voiceless distinction in alaryngeal speech - acoustic and articula Nord, L. and Hammarberg, B. and Lundström, E. journal:

More information

Consonants: articulation and transcription

Consonants: articulation and transcription Phonology 1: Handout January 20, 2005 Consonants: articulation and transcription 1 Orientation phonetics [G. Phonetik]: the study of the physical and physiological aspects of human sound production and

More information

age, Speech and Hearii

age, Speech and Hearii age, Speech and Hearii 1 Speech Commun cation tion 2 Sensory Comm, ection i 298 RLE Progress Report Number 132 Section 1 Speech Communication Chapter 1 Speech Communication 299 300 RLE Progress Report

More information

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence

Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence INTERSPEECH September,, San Francisco, USA Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence Bidisha Sharma and S. R. Mahadeva Prasanna Department of Electronics

More information

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Hua Zhang, Yun Tang, Wenju Liu and Bo Xu National Laboratory of Pattern Recognition Institute of Automation, Chinese

More information

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Amit Juneja and Carol Espy-Wilson Department of Electrical and Computer Engineering University of Maryland,

More information

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers October 31, 2003 Amit Juneja Department of Electrical and Computer Engineering University of Maryland, College Park,

More information

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access

The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access The Perception of Nasalized Vowels in American English: An Investigation of On-line Use of Vowel Nasalization in Lexical Access Joyce McDonough 1, Heike Lenhert-LeHouiller 1, Neil Bardhan 2 1 Linguistics

More information

A comparison of spectral smoothing methods for segment concatenation based speech synthesis

A comparison of spectral smoothing methods for segment concatenation based speech synthesis D.T. Chappell, J.H.L. Hansen, "Spectral Smoothing for Speech Segment Concatenation, Speech Communication, Volume 36, Issues 3-4, March 2002, Pages 343-373. A comparison of spectral smoothing methods for

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

Voice conversion through vector quantization

Voice conversion through vector quantization J. Acoust. Soc. Jpn.(E)11, 2 (1990) Voice conversion through vector quantization Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, and Hisao Kuwabara A TR Interpreting Telephony Research Laboratories,

More information

Phonetics. The Sound of Language

Phonetics. The Sound of Language Phonetics. The Sound of Language 1 The Description of Sounds Fromkin & Rodman: An Introduction to Language. Fort Worth etc., Harcourt Brace Jovanovich Read: Chapter 5, (p. 176ff.) (or the corresponding

More information

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Perceptual scaling of voice identity: common dimensions for different vowels and speakers DOI 10.1007/s00426-008-0185-z ORIGINAL ARTICLE Perceptual scaling of voice identity: common dimensions for different vowels and speakers Oliver Baumann Æ Pascal Belin Received: 15 February 2008 / Accepted:

More information

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm Prof. Ch.Srinivasa Kumar Prof. and Head of department. Electronics and communication Nalanda Institute

More information

CEFR Overall Illustrative English Proficiency Scales

CEFR Overall Illustrative English Proficiency Scales CEFR Overall Illustrative English Proficiency s CEFR CEFR OVERALL ORAL PRODUCTION Has a good command of idiomatic expressions and colloquialisms with awareness of connotative levels of meaning. Can convey

More information

Body-Conducted Speech Recognition and its Application to Speech Support System

Body-Conducted Speech Recognition and its Application to Speech Support System Body-Conducted Speech Recognition and its Application to Speech Support System 4 Shunsuke Ishimitsu Hiroshima City University Japan 1. Introduction In recent years, speech recognition systems have been

More information

Speaker Recognition. Speaker Diarization and Identification

Speaker Recognition. Speaker Diarization and Identification Speaker Recognition Speaker Diarization and Identification A dissertation submitted to the University of Manchester for the degree of Master of Science in the Faculty of Engineering and Physical Sciences

More information

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

The analysis starts with the phonetic vowel and consonant charts based on the dataset: Ling 113 Homework 5: Hebrew Kelli Wiseth February 13, 2014 The analysis starts with the phonetic vowel and consonant charts based on the dataset: a) Given that the underlying representation for all verb

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

Audible and visible speech

Audible and visible speech Building sensori-motor prototypes from audiovisual exemplars Gérard BAILLY Institut de la Communication Parlée INPG & Université Stendhal 46, avenue Félix Viallet, 383 Grenoble Cedex, France web: http://www.icp.grenet.fr/bailly

More information

Online Publication Date: 01 May 1981 PLEASE SCROLL DOWN FOR ARTICLE

Online Publication Date: 01 May 1981 PLEASE SCROLL DOWN FOR ARTICLE This article was downloaded by:[university of Sussex] On: 15 July 2008 Access Details: [subscription number 776502344] Publisher: Psychology Press Informa Ltd Registered in England and Wales Registered

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

Rhythm-typology revisited.

Rhythm-typology revisited. DFG Project BA 737/1: "Cross-language and individual differences in the production and perception of syllabic prominence. Rhythm-typology revisited." Rhythm-typology revisited. B. Andreeva & W. Barry Jacques

More information

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all

1. REFLEXES: Ask questions about coughing, swallowing, of water as fast as possible (note! Not suitable for all Human Communication Science Chandler House, 2 Wakefield Street London WC1N 1PF http://www.hcs.ucl.ac.uk/ ACOUSTICS OF SPEECH INTELLIGIBILITY IN DYSARTHRIA EUROPEAN MASTER S S IN CLINICAL LINGUISTICS UNIVERSITY

More information

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence

A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence A Cross-language Corpus for Studying the Phonetics and Phonology of Prominence Bistra Andreeva 1, William Barry 1, Jacques Koreman 2 1 Saarland University Germany 2 Norwegian University of Science and

More information

Universal contrastive analysis as a learning principle in CAPT

Universal contrastive analysis as a learning principle in CAPT Universal contrastive analysis as a learning principle in CAPT Jacques Koreman, Preben Wik, Olaf Husby, Egil Albertsen Department of Language and Communication Studies, NTNU, Trondheim, Norway jacques.koreman@ntnu.no,

More information

Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions

Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions 26 24th European Signal Processing Conference (EUSIPCO) Noise-Adaptive Perceptual Weighting in the AMR-WB Encoder for Increased Speech Loudness in Adverse Far-End Noise Conditions Emma Jokinen Department

More information

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH

SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH Mietta Lennes Most of the phonetic knowledge that is currently available on spoken Finnish is based on clearly pronounced speech: either readaloud

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 11/2007, ISSN 1642-6037 Marek WIŚNIEWSKI *, Wiesława KUNISZYK-JÓŹKOWIAK *, Elżbieta SMOŁKA *, Waldemar SUSZYŃSKI * HMM, recognition, speech, disorders

More information

Phonological and Phonetic Representations: The Case of Neutralization

Phonological and Phonetic Representations: The Case of Neutralization Phonological and Phonetic Representations: The Case of Neutralization Allard Jongman University of Kansas 1. Introduction The present paper focuses on the phenomenon of phonological neutralization to consider

More information

Cambridgeshire Community Services NHS Trust: delivering excellence in children and young people s health services

Cambridgeshire Community Services NHS Trust: delivering excellence in children and young people s health services Normal Language Development Community Paediatric Audiology Cambridgeshire Community Services NHS Trust: delivering excellence in children and young people s health services Language develops unconsciously

More information

SARDNET: A Self-Organizing Feature Map for Sequences

SARDNET: A Self-Organizing Feature Map for Sequences SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu

More information

Segregation of Unvoiced Speech from Nonspeech Interference

Segregation of Unvoiced Speech from Nonspeech Interference Technical Report OSU-CISRC-8/7-TR63 Department of Computer Science and Engineering The Ohio State University Columbus, OH 4321-1277 FTP site: ftp.cse.ohio-state.edu Login: anonymous Directory: pub/tech-report/27

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Pobrane z czasopisma New Horizons in English Studies Data: 18/11/ :52:20. New Horizons in English Studies 1/2016

Pobrane z czasopisma New Horizons in English Studies  Data: 18/11/ :52:20. New Horizons in English Studies 1/2016 LANGUAGE Maria Curie-Skłodowska University () in Lublin k.laidler.umcs@gmail.com Online Adaptation of Word-initial Ukrainian CC Consonant Clusters by Native Speakers of English Abstract. The phenomenon

More information

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 The NICT/ATR speech synthesis system for the Blizzard Challenge 2008 Ranniery Maia 1,2, Jinfu Ni 1,2, Shinsuke Sakai 1,2, Tomoki Toda 1,3, Keiichi Tokuda 1,4 Tohru Shimizu 1,2, Satoshi Nakamura 1,2 1 National

More information

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA

Rachel E. Baker, Ann R. Bradlow. Northwestern University, Evanston, IL, USA LANGUAGE AND SPEECH, 2009, 52 (4), 391 413 391 Variability in Word Duration as a Function of Probability, Speech Style, and Prosody Rachel E. Baker, Ann R. Bradlow Northwestern University, Evanston, IL,

More information

Automatic segmentation of continuous speech using minimum phase group delay functions

Automatic segmentation of continuous speech using minimum phase group delay functions Speech Communication 42 (24) 429 446 www.elsevier.com/locate/specom Automatic segmentation of continuous speech using minimum phase group delay functions V. Kamakshi Prasad, T. Nagarajan *, Hema A. Murthy

More information

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver.1 (Mar - Apr.2015), PP 55-61 www.iosrjournals.org Analysis of Emotion

More information

On Developing Acoustic Models Using HTK. M.A. Spaans BSc.

On Developing Acoustic Models Using HTK. M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. On Developing Acoustic Models Using HTK M.A. Spaans BSc. Delft, December 2004 Copyright c 2004 M.A. Spaans BSc. December, 2004. Faculty of Electrical

More information

Word Stress and Intonation: Introduction

Word Stress and Intonation: Introduction Word Stress and Intonation: Introduction WORD STRESS One or more syllables of a polysyllabic word have greater prominence than the others. Such syllables are said to be accented or stressed. Word stress

More information

Speaker Identification by Comparison of Smart Methods. Abstract

Speaker Identification by Comparison of Smart Methods. Abstract Journal of mathematics and computer science 10 (2014), 61-71 Speaker Identification by Comparison of Smart Methods Ali Mahdavi Meimand Amin Asadi Majid Mohamadi Department of Electrical Department of Computer

More information

COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION

COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION Session 3532 COMPUTER INTERFACES FOR TEACHING THE NINTENDO GENERATION Thad B. Welch, Brian Jenkins Department of Electrical Engineering U.S. Naval Academy, MD Cameron H. G. Wright Department of Electrical

More information

Appendix L: Online Testing Highlights and Script

Appendix L: Online Testing Highlights and Script Online Testing Highlights and Script for Fall 2017 Ohio s State Tests Administrations Test administrators must use this document when administering Ohio s State Tests online. It includes step-by-step directions,

More information

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin

Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin 1 Title: Jaw and order Christine Mooshammer, IPDS Kiel, Philip Hoole, IPSK München, Anja Geumann, Dublin Short title: Production of coronal consonants Acknowledgements This work was partially supported

More information

Proceedings of Meetings on Acoustics

Proceedings of Meetings on Acoustics Proceedings of Meetings on Acoustics Volume 19, 2013 http://acousticalsociety.org/ ICA 2013 Montreal Montreal, Canada 2-7 June 2013 Speech Communication Session 2aSC: Linking Perception and Production

More information

PROJECT MANAGEMENT AND COMMUNICATION SKILLS DEVELOPMENT STUDENTS PERCEPTION ON THEIR LEARNING

PROJECT MANAGEMENT AND COMMUNICATION SKILLS DEVELOPMENT STUDENTS PERCEPTION ON THEIR LEARNING PROJECT MANAGEMENT AND COMMUNICATION SKILLS DEVELOPMENT STUDENTS PERCEPTION ON THEIR LEARNING Mirka Kans Department of Mechanical Engineering, Linnaeus University, Sweden ABSTRACT In this paper we investigate

More information

Evaluation of Various Methods to Calculate the EGG Contact Quotient

Evaluation of Various Methods to Calculate the EGG Contact Quotient Diploma Thesis in Music Acoustics (Examensarbete 20 p) Evaluation of Various Methods to Calculate the EGG Contact Quotient Christian Herbst Mozarteum, Salzburg, Austria Work carried out under the ERASMUS

More information

Robot manipulations and development of spatial imagery

Robot manipulations and development of spatial imagery Robot manipulations and development of spatial imagery Author: Igor M. Verner, Technion Israel Institute of Technology, Haifa, 32000, ISRAEL ttrigor@tx.technion.ac.il Abstract This paper considers spatial

More information

Consonant-Vowel Unity in Element Theory*

Consonant-Vowel Unity in Element Theory* Consonant-Vowel Unity in Element Theory* Phillip Backley Tohoku Gakuin University Kuniya Nasukawa Tohoku Gakuin University ABSTRACT. This paper motivates the Element Theory view that vowels and consonants

More information

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA

Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan. James White & Marc Garellek UCLA Acoustic correlates of stress and their use in diagnosing syllable fusion in Tongan James White & Marc Garellek UCLA 1 Introduction Goals: To determine the acoustic correlates of primary and secondary

More information

Expressive speech synthesis: a review

Expressive speech synthesis: a review Int J Speech Technol (2013) 16:237 260 DOI 10.1007/s10772-012-9180-2 Expressive speech synthesis: a review D. Govind S.R. Mahadeva Prasanna Received: 31 May 2012 / Accepted: 11 October 2012 / Published

More information

Guidelines for blind and partially sighted candidates

Guidelines for blind and partially sighted candidates Revised August 2006 Guidelines for blind and partially sighted candidates Our policy In addition to the specific provisions described below, we are happy to consider each person individually if their needs

More information

Seminar - Organic Computing

Seminar - Organic Computing Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts

More information

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations

To appear in the Proceedings of the 35th Meetings of the Chicago Linguistics Society. Post-vocalic spirantization: Typology and phonetic motivations Post-vocalic spirantization: Typology and phonetic motivations Alan C-L Yu University of California, Berkeley 0. Introduction Spirantization involves a stop consonant becoming a weak fricative (e.g., B,

More information

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Jana Kitzmann and Dirk Schiereck, Endowed Chair for Banking and Finance, EUROPEAN BUSINESS SCHOOL, International

More information

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and

More information

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 - C.E.F.R. Oral Assessment Criteria Think A F R I C A - 1 - 1. The extracts in the left hand column are taken from the official descriptors of the CEFR levels. How would you grade them on a scale of low,

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

On the Formation of Phoneme Categories in DNN Acoustic Models

On the Formation of Phoneme Categories in DNN Acoustic Models On the Formation of Phoneme Categories in DNN Acoustic Models Tasha Nagamine Department of Electrical Engineering, Columbia University T. Nagamine Motivation Large performance gap between humans and state-

More information

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English

An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Linguistic Portfolios Volume 6 Article 10 2017 An Acoustic Phonetic Account of the Production of Word-Final /z/s in Central Minnesota English Cassy Lundy St. Cloud State University, casey.lundy@gmail.com

More information

WHEN THERE IS A mismatch between the acoustic

WHEN THERE IS A mismatch between the acoustic 808 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition Jeih-Weih Hung, Member,

More information

Kelli Allen. Vicki Nieter. Jeanna Scheve. Foreword by Gregory J. Kaiser

Kelli Allen. Vicki Nieter. Jeanna Scheve. Foreword by Gregory J. Kaiser Kelli Allen Jeanna Scheve Vicki Nieter Foreword by Gregory J. Kaiser Table of Contents Foreword........................................... 7 Introduction........................................ 9 Learning

More information

Phonological Processing for Urdu Text to Speech System

Phonological Processing for Urdu Text to Speech System Phonological Processing for Urdu Text to Speech System Sarmad Hussain Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, B Block, Faisal Town, Lahore,

More information

Learners Use Word-Level Statistics in Phonetic Category Acquisition

Learners Use Word-Level Statistics in Phonetic Category Acquisition Learners Use Word-Level Statistics in Phonetic Category Acquisition Naomi Feldman, Emily Myers, Katherine White, Thomas Griffiths, and James Morgan 1. Introduction * One of the first challenges that language

More information

THE MULTIVOC TEXT-TO-SPEECH SYSTEM

THE MULTIVOC TEXT-TO-SPEECH SYSTEM THE MULTVOC TEXT-TO-SPEECH SYSTEM Olivier M. Emorine and Pierre M. Martin Cap Sogeti nnovation Grenoble Research Center Avenue du Vieux Chene, ZRST 38240 Meylan, FRANCE ABSTRACT n this paper we introduce

More information

Using SAM Central With iread

Using SAM Central With iread Using SAM Central With iread January 1, 2016 For use with iread version 1.2 or later, SAM Central, and Student Achievement Manager version 2.4 or later PDF0868 (PDF) Houghton Mifflin Harcourt Publishing

More information

Quarterly Progress and Status Report. Sound symbolism in deictic words

Quarterly Progress and Status Report. Sound symbolism in deictic words Dept. for Speech, Music and Hearing Quarterly Progress and Status Report Sound symbolism in deictic words Traunmüller, H. journal: TMH-QPSR volume: 37 number: 2 year: 1996 pages: 147-150 http://www.speech.kth.se/qpsr

More information

STUDIES WITH FABRICATED SWITCHBOARD DATA: EXPLORING SOURCES OF MODEL-DATA MISMATCH

STUDIES WITH FABRICATED SWITCHBOARD DATA: EXPLORING SOURCES OF MODEL-DATA MISMATCH STUDIES WITH FABRICATED SWITCHBOARD DATA: EXPLORING SOURCES OF MODEL-DATA MISMATCH Don McAllaster, Larry Gillick, Francesco Scattone, Mike Newman Dragon Systems, Inc. 320 Nevada Street Newton, MA 02160

More information

Speaker recognition using universal background model on YOHO database

Speaker recognition using universal background model on YOHO database Aalborg University Master Thesis project Speaker recognition using universal background model on YOHO database Author: Alexandre Majetniak Supervisor: Zheng-Hua Tan May 31, 2011 The Faculties of Engineering,

More information

The Common European Framework of Reference for Languages p. 58 to p. 82

The Common European Framework of Reference for Languages p. 58 to p. 82 The Common European Framework of Reference for Languages p. 58 to p. 82 -- Chapter 4 Language use and language user/learner in 4.1 «Communicative language activities and strategies» -- Oral Production

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish

Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish Carmen Lie-Lahuerta Fix Your Vowels: Computer-assisted training by Dutch learners of Spanish I t is common knowledge that foreign learners struggle when it comes to producing the sounds of the target language

More information

PROGRESS MONITORING FOR STUDENTS WITH DISABILITIES Participant Materials

PROGRESS MONITORING FOR STUDENTS WITH DISABILITIES Participant Materials Instructional Accommodations and Curricular Modifications Bringing Learning Within the Reach of Every Student PROGRESS MONITORING FOR STUDENTS WITH DISABILITIES Participant Materials 2007, Stetson Online

More information

Student Perceptions of Reflective Learning Activities

Student Perceptions of Reflective Learning Activities Student Perceptions of Reflective Learning Activities Rosalind Wynne Electrical and Computer Engineering Department Villanova University, PA rosalind.wynne@villanova.edu Abstract It is widely accepted

More information

Klaus Zuberbühler c) School of Psychology, University of St. Andrews, St. Andrews, Fife KY16 9JU, Scotland, United Kingdom

Klaus Zuberbühler c) School of Psychology, University of St. Andrews, St. Andrews, Fife KY16 9JU, Scotland, United Kingdom Published in The Journal of the Acoustical Society of America, Vol. 114, Issue 2, 2003, p. 1132-1142 which should be used for any reference to this work 1 The relationship between acoustic structure and

More information

Journal of Phonetics

Journal of Phonetics Journal of Phonetics 40 (2012) 595 607 Contents lists available at SciVerse ScienceDirect Journal of Phonetics journal homepage: www.elsevier.com/locate/phonetics How linguistic and probabilistic properties

More information

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch

The pronunciation of /7i/ by male and female speakers of avant-garde Dutch The pronunciation of /7i/ by male and female speakers of avant-garde Dutch Vincent J. van Heuven, Loulou Edelman and Renée van Bezooijen Leiden University/ ULCL (van Heuven) / University of Nijmegen/ CLS

More information

NCEO Technical Report 27

NCEO Technical Report 27 Home About Publications Special Topics Presentations State Policies Accommodations Bibliography Teleconferences Tools Related Sites Interpreting Trends in the Performance of Special Education Students

More information

Application of Virtual Instruments (VIs) for an enhanced learning environment

Application of Virtual Instruments (VIs) for an enhanced learning environment Application of Virtual Instruments (VIs) for an enhanced learning environment Philip Smyth, Dermot Brabazon, Eilish McLoughlin Schools of Mechanical and Physical Sciences Dublin City University Ireland

More information

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab Revisiting the role of prosody in early language acquisition Megha Sundara UCLA Phonetics Lab Outline Part I: Intonation has a role in language discrimination Part II: Do English-learning infants have

More information

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING Yong Sun, a * Colin Fidge b and Lin Ma a a CRC for Integrated Engineering Asset Management, School of Engineering Systems, Queensland

More information

GROUP COMPOSITION IN THE NAVIGATION SIMULATOR A PILOT STUDY Magnus Boström (Kalmar Maritime Academy, Sweden)

GROUP COMPOSITION IN THE NAVIGATION SIMULATOR A PILOT STUDY Magnus Boström (Kalmar Maritime Academy, Sweden) GROUP COMPOSITION IN THE NAVIGATION SIMULATOR A PILOT STUDY Magnus Boström (Kalmar Maritime Academy, Sweden) magnus.bostrom@lnu.se ABSTRACT: At Kalmar Maritime Academy (KMA) the first-year students at

More information

VIEW: An Assessment of Problem Solving Style

VIEW: An Assessment of Problem Solving Style 1 VIEW: An Assessment of Problem Solving Style Edwin C. Selby, Donald J. Treffinger, Scott G. Isaksen, and Kenneth Lauer This document is a working paper, the purposes of which are to describe the three

More information

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering Document number: 2013/0006139 Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering Program Learning Outcomes Threshold Learning Outcomes for Engineering

More information

Reinforcement Learning by Comparing Immediate Reward

Reinforcement Learning by Comparing Immediate Reward Reinforcement Learning by Comparing Immediate Reward Punit Pandey DeepshikhaPandey Dr. Shishir Kumar Abstract This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate

More information

Dyslexia/dyslexic, 3, 9, 24, 97, 187, 189, 206, 217, , , 367, , , 397,

Dyslexia/dyslexic, 3, 9, 24, 97, 187, 189, 206, 217, , , 367, , , 397, Adoption studies, 274 275 Alliteration skill, 113, 115, 117 118, 122 123, 128, 136, 138 Alphabetic writing system, 5, 40, 127, 136, 410, 415 Alphabets (types of ) artificial transparent alphabet, 5 German

More information

Achievement Level Descriptors for American Literature and Composition

Achievement Level Descriptors for American Literature and Composition Achievement Level Descriptors for American Literature and Composition Georgia Department of Education September 2015 All Rights Reserved Achievement Levels and Achievement Level Descriptors With the implementation

More information

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1) Houghton Mifflin Reading Correlation to the Standards for English Language Arts (Grade1) 8.3 JOHNNY APPLESEED Biography TARGET SKILLS: 8.3 Johnny Appleseed Phonemic Awareness Phonics Comprehension Vocabulary

More information

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System ARCHIVES OF ACOUSTICS Vol. 42, No. 3, pp. 375 383 (2017) Copyright c 2017 by PAN IPPT DOI: 10.1515/aoa-2017-0039 Voiceless Stop Consonant Modelling and Synthesis Framework Based on MISO Dynamic System

More information

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language Z.HACHKAR 1,3, A. FARCHI 2, B.MOUNIR 1, J. EL ABBADI 3 1 Ecole Supérieure de Technologie, Safi, Morocco. zhachkar2000@yahoo.fr.

More information

Beginning primarily with the investigations of Zimmermann (1980a),

Beginning primarily with the investigations of Zimmermann (1980a), Orofacial Movements Associated With Fluent Speech in Persons Who Stutter Michael D. McClean Walter Reed Army Medical Center, Washington, D.C. Stephen M. Tasko Western Michigan University, Kalamazoo, MI

More information

A Reinforcement Learning Variant for Control Scheduling

A Reinforcement Learning Variant for Control Scheduling A Reinforcement Learning Variant for Control Scheduling Aloke Guha Honeywell Sensor and System Development Center 3660 Technology Drive Minneapolis MN 55417 Abstract We present an algorithm based on reinforcement

More information

Making welding simulators effective

Making welding simulators effective Making welding simulators effective Introduction Simulation based training had its inception back in the 1920s. The aviation field adopted this innovation in education when confronted with an increased

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

Developing a College-level Speed and Accuracy Test

Developing a College-level Speed and Accuracy Test Brigham Young University BYU ScholarsArchive All Faculty Publications 2011-02-18 Developing a College-level Speed and Accuracy Test Jordan Gilbert Marne Isakson See next page for additional authors Follow

More information

THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS

THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS ELIZABETH ANNE SOMERS Spring 2011 A thesis submitted in partial

More information