The Structure of the ORD Speech Corpus of Russian Everyday Communication
|
|
- Rose Atkinson
- 6 years ago
- Views:
Transcription
1 The Structure of the ORD Speech Corpus of Russian Everyday Communication Tatiana Sherstinova St. Petersburg State University, St. Petersburg, Universitetskaya nab. 11, , Russia Abstract. The paper presents the structure of the ORD speech corpus of Russian everyday communication, which contains recordings of all spoken episodes recorded during twenty-four hours by a demographically balanced group of people in St. Petersburg. The paper describes the structure of the corpus, consisting of audio files, annotation files and information system and reviews the main communicative episodes presented in the corpus. 1 Introduction: What Is the ORD Corpus? The abbreviation ORD stems from Russian Odin Rechevoj Den, literally translated as one day of speech. The main aim of creating the ORD corpus is to collect recordings of actual speech which we use in our everyday communication. The ORD creating is an interdisciplinary project, in which specialists in many scientific branches are involved. Primarily, they are linguists experts in different aspects of Russian language phoneticians, grammarians, lexicographers, dialectologists. Besides them two psychologists and a sociologist took part in project creation as well as specialists in modern information technologies. For the first series of recordings a demographically balanced group of 30 subjects representing various social and age strata in the population of St. Petersburg was selected. The subjects spent one day with dictaphones dangling around their necks and recording all their communication. So dictaphones should be turned on in the morning recording breakfast at home with family members, then preparation for going to work, the way to work itself, speaking by cellular telephone, then official and informal conversations at work with colleagues (e.g., about problems with children, world financial crisis, yesterday s football match, etc.), lunch time, shopping, recreation and so on up to the moment when subjects went to bed. In the result more then 240 hours of recording were obtained, from which 170 hours contain speech data quite suitable for further linguistic analysis and more than 50 hours of recordings are good enough for further phonetic analysis. The corpus was divided into 2202 communication episodes. 134 episodes are already transcribed in detail. At present, orthographic transcription of the corpus numbers more than wordforms [1]. The corpus presents the unique linguistic material, allowing to perform fundamental research in many aspects including complex behaviour of people in real world. At the V. Matoušek and P. Mautner (Eds.): TSD 2009, LNAI 5729, pp , c Springer-Verlag Berlin Heidelberg 2009
2 The Structure of the ORD Speech Corpus of Russian Everyday Communication 259 same time these utterly natural recordings may be used for practical purposes: for example, for verification of many scientific hypotheses, for adjustment and improvement of speech synthesis and recognition systems, etc. 2 The ORD Corpus Structure The ORD speech corpus consists of three major components: 1) audio files, 2) correspondent annotation files, and 3) information system. 2.1 Audio Files The methodology used for collection of speech for the ORD corpus when subjects were asked to keep dictophones on for many hours gave unique material about our everyday speech behaviour (e.g., we have quite rare recordings of people talking just to themselves or informal communication of cadets in barracks of military school). At the same time it inevitably got a great amount of non-acceptable recordings: fragments without speech and fragments with speech in very noisy environment (e.g., background remarks of metro passengers). It was necessary to separate fragments containing speech from that without speech, and to classify speech fragments according to their quality. Obviously, the archive copy of all recordings is kept in its original form, allowing to reconstitute speech days as they were. Every audio file of the other copy of the corpus was carefully listen to and segmented into fragments, which became the main units of audio corpus. It is supposed that each file is not longer than 30 minutes, contains speech recording of the similar quality and refers to the same or adjacent communication episode(s). All fragments without speech longer than several minutes are cut from ORD files, as well as fragments containing just background sounds of working TV or radio. Information on this segmentation is available in an auxiliary database. The new files have got names, referring to the subject s code and the ordinal number of episode. The phonetic quality of each file was further evaluated and measured in 4-score scale: 1 the best quality, suitable for precise phonetic analysis, 2 rather good quality partially suitable for phonetic analysis, 3 noisy recordings with low quality which is only partially legible (not suitable for phonetic analysis but suitable for other aspects of research), 4 unintelligible conversations or remarks in extreme noise, which could not be understand without noise reduction. It is planned to annotate first the recordings of the best quality, whereas the most noisy audio files are not to be annotated at all. 2.2 Annotation Files The ORD corpus is being annotated by means of two professional annotation tools ELAN [2] and Praat [3]. The main principles of multi-level tagging in the ORD corpus were described in [4]. The annotation formats of ELAN and Praat are fully convertible. ELAN is used for primary and general annotations of the corpus, whereas Praat is used for making real phonetic transcription and other phonetic annotations. The annotations are kept in files of two general types *.eaf (ELAN format) and *.TextGrid (Praat format). Being verified by experts, annotation data are exported into general information system.
3 260 T. Sherstinova 2.3 Information System The information system presents a relational database created on the base of MS Access All tables of the database are divided into 3 general groups. Group I Actual information about speakers, sound files, and communication episodes. Table 1.1 Informants (Speakers/Subjects): actual data about all base speakers, presented by the subjects themselves. For example, speaker s code (S01, S02, etc.), his/her nickname; gender; age; place of birth; social group; education; qualification; current occupation; nationality; number and quality of recorded files; total and usable time of recording; comments; etc. Table 1.2 Communicants (Interlocutors): some actual data about the main people who communicated with the subjects during their speech day: communicator s code; his/her (nick)name; relation to the subject or his/her social role (e.g., mother, friend, shop assistant, etc.); gender; approximate age; and some other possible information provided by the subjects interlocutors place of birth; social group; education; qualification; current occupation; nationality; as well as intelligibility of recorded speech; some comments; etc. Table 1.3 ARCSoundFiles: information about original (archival) sound files including total duration, that of intelligible speech and illegible or noisy fragments. Table 1.4 ORDSoundFiles: information about reformatted sound files including reference to correspondent original files, exact position in the original file, total duration, phonetic quality of recording, annotation priority, and the main communication episode, described in three fields: 1) where 2) doing what 3) who is (are) the main interlocutor(s). Table 1.5 Episodes: concise formal description of main voiced communication episodes. Segmentation into episodes was made by expert linguists, its description includes information on interlocutors, time, duration, place, aim and subject of communication, as well as some possible comments. At first segmentation into episodes was made rather arbitrary. Now we try to standardize both segmentation and its description. Group 2 presents some results of social and psycholinguistic data interpretation. Currently it contains just two tables: Table 2.1 InformantsSocial (Speakers Social Attributions) has the same structure as Table 1.2 (Speakers), but contains subjective evaluation of speaker s social characteristics by linguists who transcribed their speech. It should be noted that the linguists were not allowed to see actual information from Table 1.1 before filling Table 2.1. Table 2.2 InformantsPsycho (Speakers Psychological Portraits). This table is also filled by linguists who work with recordings and contains speaker s psychological rating in ten-point system for the following aspects: neurotizismus, spontaneous aggression, depression, irritation, sociability, tranquility, responsive aggression, self-consciousness, openness, extroversion/introversion, emotional instability, masculinity/feminism. Besides, here you can find a one-page essay about each base speaker written by researchers in a free style.
4 The Structure of the ORD Speech Corpus of Russian Everyday Communication 261 Group 3 contains tables for speech transcripts and multi-level annotations. Filling these tables of the database is still in progress. Table 3.1 MiniEpisodes is used to give reference to smaller real-life episodes within larger episodes described in Table 1.5. For example, Speaker X in Episode N (in the evening at summer house with mother) may have the following mini-episodes: 1) searching for matches (2 minutes), 2) trying to set fire to the oven (3 minites), 3) discussing plans for the rest of the evening (5 minutes), etc. Table 3.2 Timeline: Each (mini-)episode is sequentially subdivided into utterances (remarks/phrases) and pauses. The exact timing of each fragment is given. Table 3.3 Frases contains orthographic transcript of communication episodes made by linguists using a special system of notation. Reference to the speaking person is given in the Speaker field, using the same codes as in Table 1.1 and 1.2. Auxiliary information on starting-ending points of the phrase in milliseconds allows to listen to it in the correspondent database form. Table 3.4 Voice is linked with the previous table and contains information about possible changes of voice quality in some fragment of the speech (either physiologically or functionally e.g., hoarsely, smiling, yawning, exciting, imitating, etc.). Table 3.5 Events describes non-language audio events (squeak of a door, phone ring, etc.). Table 3.6 Notes: refers to auxiliary information which may be given to some period of time (e.g., this fragment contains specific youth slang ). Table 3.7 Words: keeps information about each wordform of the corpus (e.g., POS, grammatical form, syntactic role, phonetic transcription, etc.). Table 3.8 PhonWords: describes phonetic words. Segmentation into phonetic words is being made for sub-corpus of the best quality. The table currently contains orthographic spelling of phonetic words and reference to starting-ending points in correspondent audio files. Phonetic transcription is planned to be added later. It is possible to listen to each segmented phonetic word by means of database utilities. Table 3.9 Morphemes: contains data on morphemes (e.g., general class, ideal transcription, phonetic transcription, etc.). Table 3.10 Sounds: contains phonetic information about individual realization of phonemes or indivisible sound groups (ideal and real phonetic transcription, position, etc.). A number of additional levels of annotation is planned to be included into the database further (e.g., speaker s mental state, emotional connotations, communicational strategies, rhetorical techniques, prosodic models, etc.). Therefore, new tables will appear in the database in the future. On Fig. 1 you may see examples of orthographic transcripts of phrases in Table Frase for the mini-episode a story about speaker s driving lesson (Speaker S05, female, 29 years old, teacher). Basing on obtained annotations different frequency lists may be built for words and phonetic words for any speaker, episode, group of episodes and corpus in the whole. All occurrences of each word may be found and listen to. A flexible search system is currently being created for the database of annotations. Further data processing is based on database requests and special applications, which
5 262 T. Sherstinova Fig. 1. An example of orthographic transcript of phrases in Table Frase are currently being developed. One of such applications is E-Kar utility, which allows complex lexicographic and morphologic data processing. Speech material of the ORD corpus will be constantly increased. Thus the new recordings are currently being made by new groups of subjects. New functional modules are also being made within the ORD speech corpus for example, an audio dictionary of Russian morphemes. 3 Summarization of Communicative Episodes In this section we will briefly describe the content of the ORD corpus from the point of view of episodes typology. The term episode in the ORD terminology means continuous and preferably long-lasting fragment of one-day-of-speech recordings with the common conditions of communication (time, place, action, interlocutors). Episodes may be further subdivided into mini-episodes which refer to shorter periods of time with the common topic of conversation, simultaneous action, etc. All 2202 communication episodes detected in the days of speech were then divided into 22 general categories. The most frequent types are represented on Fig. 2. Percents shown on this chart refer to total duration of recording made in the given conditions and doesn t reflect the number of utterances (or words) recorded in each situation. It can be seen from Fig. 2, that the largest part of speech records (42% or nearly 93,6 hours) refers to communication related with the main occupation of the subjects (at work or at studies). This category is more than 4 times bigger than the group of the second rank - family conversation at home in the evening (9,92%). The types that follows refer to parties in coffee bars or restaurants (9,80%) and to dialogues on the way to the working place or somewhere else (8,77%). The fifth place is occupied by home conversations in
6 The Structure of the ORD Speech Corpus of Russian Everyday Communication 263 the morning (5,41%). The duration of any other category of episodes does not reach 5% of the averaged speech day. Such classification of episodes is rather rough. Evidently, some categories should be further reviewed. For example, the huge category at work may be divided into business meetings, work with clients, business calls, individual work, and personal contacts, including private and nonbusiness conversations. Moreover, some smaller episodes or situations (e.g., phone calls) may take place within practically any main category. Fig. 2. Summarization of main episodes in the ORD according to the total time of original recordings 3.1 Male and Female Speech Days Special investigation was made to compare average male and female speech days. In the majority of the categories of episodes the speech day of men does not differ significantly from that of women the difference is less than 1-2%, such is the case of the dominating category working/studies (see Table 1). There are, however, a few categories in which rather substantial difference can be observed. In particular, men spent more than 9% more time than women attending various (sport, cultural, etc.) events; therefore total time of men s being on a way is nearly 5% longer. As for women, they spent this time at home conversations in the evening (7% more than men on the average), at parties and dinners (both 2% longer than that of men) and in the morning conversations (3% longer). This result is, however, not surprising from both psychological and sociological points of view [5]. When the corpus will be totally annotated and transcribed we can measure more precise the quantity of speech (utterances and words) for each communication episodes and to study its dynamics within days-of-speech. Besides, differentiation of working days and holidays should be also taken into account. Then we can try to built an averaged
7 264 T. Sherstinova Table 1. Comparison of male and female communication episodes in the ORD corpus General types of episodes Difference (%) Male/Female 1 sport and cultural events 8,69 Male 2 on the way to anywhere 4,85 Male 3 at home in the daytime 2,51 Male 4 walk 1,58 Male 5 corporate party 1,47 6 visiting service centers, public institutions, etc. 1,17 7 the main occupation (work or study) 0,87 8 hobbies, leisure, sport 0,10 9 lunch 0,23 10 visiting a doctor 0,31 11 shopping 0,59 12 breakfast 0,79 13 on the way home 0,97 14 visiting other people 1,06 15 in the country 1,20 16 dinner 1,94 17 housework 2,05 Female 18 evening party (in cafe, restaurant) 2,08 Female 19 family talk at home in the morning 3,01 Female 20 family talk at home in the evening 6,92 Female model of peoples twenty-four-hour speech behaviour. Further, having determined the main chains and structures of everyday communication it will be possible to study time series of quantitative variables by means of standard statistical methods and to analyze frequency series (e.g., of lexical, grammatical or semantic units, acoustic phenomena, prosodic contours, etc.) depending on various conditions of communication. Acknowledgements The first recordings and database creating of the ORD corpus were supported by the Russian Foundation for Humanities within the framework of the project Speech Corpus of Russian Everyday Communication One Speaker s Day (project # e/Ya). Nowadays creating of the corpus is supported by the program of the Russian Ministry of Education titled Sound Form of Russian Grammar System in Communicative and Informational Approach and by the grant of the Russian Foundation for Humanities Development of an Information System for Monitoring of Russian Spoken Language (project # v). References 1. Asinovsky, A.S., Bogdanova, N.V., Rusakova, M.V., Stepanova, S.B., Sherstinova, T.Y.: Zvukovoj korpus russkogo yazyka povsednevnogo obschenia Odin rechevoj den : koncepcia i sosytojanie formirovania. In: Kompjuternaya lingvistika i intellektualnye tekhnologii. Vypusk. Po materialam mezhd. konferencii Dialog, Moscow, vol. 7 (14), pp (2008)
8 The Structure of the ORD Speech Corpus of Russian Everyday Communication ELAN - Linguistic Annotator. Version 3.6, 3. Praat: Doing Phonetics by computer, 4. Ryko, A.I., Stepanova, S.B.: Mnogourovnevaya lingvisticheskaya razmetka zvukovogo korpusa russkogo yazyka. In: Kompjuternaya lingvistika i intellektualnye tekhnologii. Vypusk. Po materialam mezhd. konferencii Dialog, Moscow, vol. 7 (14), pp (2008) 5. Sherstinova, T.Y.: Odin rechevoj den na vremennoj shkale: o perspektivakh issledovania dinamicheskikh processov na materiale zvukovogo korpusa. In: Vestnik Sankt-Peterburgskogo universiteta, Chast 2, St. Petersburg. Seria 9: Filologia. Vostokovedenie. Zhurnalistika, vol. 4, pp (2008)
The ORD Speech Corpus of Russian Everyday Communication One Speaker s Day : Creation Principles and Annotation
The ORD Speech Corpus of Russian Everyday Communication One Speaker s Day : Creation Principles and Annotation Alexander Asinovsky, Natalia Bogdanova, Marina Rusakova, Anastassia Ryko, Svetlana Stepanova,
More informationThe Common European Framework of Reference for Languages p. 58 to p. 82
The Common European Framework of Reference for Languages p. 58 to p. 82 -- Chapter 4 Language use and language user/learner in 4.1 «Communicative language activities and strategies» -- Oral Production
More informationProblems of the Arabic OCR: New Attitudes
Problems of the Arabic OCR: New Attitudes Prof. O.Redkin, Dr. O.Bernikova Department of Asian and African Studies, St. Petersburg State University, St Petersburg, Russia Abstract - This paper reviews existing
More informationMandarin Lexical Tone Recognition: The Gating Paradigm
Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition
More informationEnglish Language and Applied Linguistics. Module Descriptions 2017/18
English Language and Applied Linguistics Module Descriptions 2017/18 Level I (i.e. 2 nd Yr.) Modules Please be aware that all modules are subject to availability. If you have any questions about the modules,
More informationEyebrows in French talk-in-interaction
Eyebrows in French talk-in-interaction Aurélie Goujon 1, Roxane Bertrand 1, Marion Tellier 1 1 Aix Marseille Université, CNRS, LPL UMR 7309, 13100, Aix-en-Provence, France Goujon.aurelie@gmail.com Roxane.bertrand@lpl-aix.fr
More informationCEFR Overall Illustrative English Proficiency Scales
CEFR Overall Illustrative English Proficiency s CEFR CEFR OVERALL ORAL PRODUCTION Has a good command of idiomatic expressions and colloquialisms with awareness of connotative levels of meaning. Can convey
More informationSpeech Recognition at ICSI: Broadcast News and beyond
Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI
More informationGetting the Story Right: Making Computer-Generated Stories More Entertaining
Getting the Story Right: Making Computer-Generated Stories More Entertaining K. Oinonen, M. Theune, A. Nijholt, and D. Heylen University of Twente, PO Box 217, 7500 AE Enschede, The Netherlands {k.oinonen
More informationWord Stress and Intonation: Introduction
Word Stress and Intonation: Introduction WORD STRESS One or more syllables of a polysyllabic word have greater prominence than the others. Such syllables are said to be accented or stressed. Word stress
More informationThink A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -
C.E.F.R. Oral Assessment Criteria Think A F R I C A - 1 - 1. The extracts in the left hand column are taken from the official descriptors of the CEFR levels. How would you grade them on a scale of low,
More informationLinking Task: Identifying authors and book titles in verbose queries
Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,
More informationCandidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.
The Test of Interactive English, C2 Level Qualification Structure The Test of Interactive English consists of two units: Unit Name English English Each Unit is assessed via a separate examination, set,
More informationRUSSIAN LANGUAGE, INTERMEDIATE LEVEL
Listening COMPREHENSION RUSSIAN LANGUAGE, INTERMEDIATE LEVEL Course tutor(s) Tatiana Batrakova, native speaker. Teaching experience at Social-Pedagogical Academy in Nizhny Tagil, Russia (2008 2012). Since
More informationSchool Inspection in Hesse/Germany
Hessisches Kultusministerium School Inspection in Hesse/Germany Contents 1. Introduction...2 2. School inspection as a Procedure for Quality Assurance and Quality Enhancement...2 3. The Hessian framework
More informationGeneral syllabus for third-cycle courses and study programmes in
ÖREBRO UNIVERSITY This is a translation of a Swedish document. In the event of a discrepancy, the Swedishlanguage version shall prevail. General syllabus for third-cycle courses and study programmes in
More informationEvidence for Reliability, Validity and Learning Effectiveness
PEARSON EDUCATION Evidence for Reliability, Validity and Learning Effectiveness Introduction Pearson Knowledge Technologies has conducted a large number and wide variety of reliability and validity studies
More informationCourse Law Enforcement II. Unit I Careers in Law Enforcement
Course Law Enforcement II Unit I Careers in Law Enforcement Essential Question How does communication affect the role of the public safety professional? TEKS 130.294(c) (1)(A)(B)(C) Prior Student Learning
More informationIntra-talker Variation: Audience Design Factors Affecting Lexical Selections
Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and
More informationCase study Norway case 1
Case study Norway case 1 School : B (primary school) Theme: Science microorganisms Dates of lessons: March 26-27 th 2015 Age of students: 10-11 (grade 5) Data sources: Pre- and post-interview with 1 teacher
More informationSpecification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments
Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,
More informationWhat s in a Step? Toward General, Abstract Representations of Tutoring System Log Data
What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data Kurt VanLehn 1, Kenneth R. Koedinger 2, Alida Skogsholm 2, Adaeze Nwaigwe 2, Robert G.M. Hausmann 1, Anders Weinstein
More information5. UPPER INTERMEDIATE
Triolearn General Programmes adapt the standards and the Qualifications of Common European Framework of Reference (CEFR) and Cambridge ESOL. It is designed to be compatible to the local and the regional
More informationThe College Board Redesigned SAT Grade 12
A Correlation of, 2017 To the Redesigned SAT Introduction This document demonstrates how myperspectives English Language Arts meets the Reading, Writing and Language and Essay Domains of Redesigned SAT.
More informationGuidelines for the Use of the Continuing Education Unit (CEU)
Guidelines for the Use of the Continuing Education Unit (CEU) The UNC Policy Manual The essential educational mission of the University is augmented through a broad range of activities generally categorized
More informationAttention Getting Strategies : If You Can Hear My Voice Clap Once. By: Ann McCormick Boalsburg Elementary Intern Fourth Grade
McCormick 1 Attention Getting Strategies : If You Can Hear My Voice Clap Once By: Ann McCormick 2008 2009 Boalsburg Elementary Intern Fourth Grade adm5053@psu.edu April 25, 2009 McCormick 2 Table of Contents
More informationDerivational and Inflectional Morphemes in Pak-Pak Language
Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes
More informationGrammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs
Grammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs DIALOGUE: Hi Armando. Did you get a new job? No, not yet. Are you still looking? Yes, I am. Have you had any interviews? Yes. At the
More informationDegree Qualification Profiles Intellectual Skills
Degree Qualification Profiles Intellectual Skills Intellectual Skills: These are cross-cutting skills that should transcend disciplinary boundaries. Students need all of these Intellectual Skills to acquire
More informationUniversity of Pittsburgh Department of Slavic Languages and Literatures. Russian 0015: Russian for Heritage Learners 2 MoWe 3:00PM - 4:15PM G13 CL
1 University of Pittsburgh Department of Slavic Languages and Literatures Russian 0015: Russian for Heritage Learners 2 MoWe 3:00PM - 4:15PM G13 CL Spring 2011 Instructor: Yuliya Basina e-mail basina@pitt.edu
More informationhave to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,
A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994
More informationImproved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form
Orthographic Form 1 Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form The development and testing of word-retrieval treatments for aphasia has generally focused
More informationPrincipal vacancies and appointments
Principal vacancies and appointments 2009 10 Sally Robertson New Zealand Council for Educational Research NEW ZEALAND COUNCIL FOR EDUCATIONAL RESEARCH TE RŪNANGA O AOTEAROA MŌ TE RANGAHAU I TE MĀTAURANGA
More informationParent Information Welcome to the San Diego State University Community Reading Clinic
Parent Information Welcome to the San Diego State University Community Reading Clinic Who Are We? The San Diego State University Community Reading Clinic (CRC) is part of the SDSU Literacy Center in the
More information1.2 Interpretive Communication: Students will demonstrate comprehension of content from authentic audio and visual resources.
Course French I Grade 9-12 Unit of Study Unit 1 - Bonjour tout le monde! & les Passe-temps Unit Type(s) x Topical Skills-based Thematic Pacing 20 weeks Overarching Standards: 1.1 Interpersonal Communication:
More informationMASTER S THESIS GUIDE MASTER S PROGRAMME IN COMMUNICATION SCIENCE
MASTER S THESIS GUIDE MASTER S PROGRAMME IN COMMUNICATION SCIENCE University of Amsterdam Graduate School of Communication Kloveniersburgwal 48 1012 CX Amsterdam The Netherlands E-mail address: scripties-cw-fmg@uva.nl
More informationNewburgh Enlarged City School District Academic. Academic Intervention Services Plan
Newburgh Enlarged City School District Academic Academic Intervention Services Plan Revised September 2016 October 2015 Newburgh Enlarged City School District Elementary Academic Intervention Services
More informationLesson M4. page 1 of 2
Lesson M4 page 1 of 2 Miniature Gulf Coast Project Math TEKS Objectives 111.22 6b.1 (A) apply mathematics to problems arising in everyday life, society, and the workplace; 6b.1 (C) select tools, including
More informationRole of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation
Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Alistair Conkie AT&T abs - Research 180 Park Avenue, Florham Park,
More informationMinistry of Education, Republic of Palau Executive Summary
Ministry of Education, Republic of Palau Executive Summary Student Consultant, Jasmine Han Community Partner, Edwel Ongrung I. Background Information The Ministry of Education is one of the eight ministries
More informationListening and Speaking Skills of English Language of Adolescents of Government and Private Schools
Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools Dr. Amardeep Kaur Professor, Babe Ke College of Education, Mudki, Ferozepur, Punjab Abstract The present
More informationReview in ICAME Journal, Volume 38, 2014, DOI: /icame
Review in ICAME Journal, Volume 38, 2014, DOI: 10.2478/icame-2014-0012 Gaëtanelle Gilquin and Sylvie De Cock (eds.). Errors and disfluencies in spoken corpora. Amsterdam: John Benjamins. 2013. 172 pp.
More information5 Early years providers
5 Early years providers What this chapter covers This chapter explains the action early years providers should take to meet their duties in relation to identifying and supporting all children with special
More informationVOCATIONAL QUALIFICATION IN YOUTH AND LEISURE INSTRUCTION 2009
Requirements for Vocational Qualifications VOCATIONAL QUALIFICATION IN YOUTH AND LEISURE INSTRUCTION 2009 Regulation 17/011/2009 Publications 2013:4 Publications 2013:4 Requirements for Vocational Qualifications
More informationHow to write in essay form >>>CLICK HERE<<<
How to write in essay form >>>CLICK HERE
More informationEntrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany
Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Jana Kitzmann and Dirk Schiereck, Endowed Chair for Banking and Finance, EUROPEAN BUSINESS SCHOOL, International
More informationSpeech Emotion Recognition Using Support Vector Machine
Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,
More informationCorpus Linguistics (L615)
(L615) Basics of Markus Dickinson Department of, Indiana University Spring 2013 1 / 23 : the extent to which a sample includes the full range of variability in a population distinguishes corpora from archives
More informationSARDNET: A Self-Organizing Feature Map for Sequences
SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu
More informationAtypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty
Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty Julie Medero and Mari Ostendorf Electrical Engineering Department University of Washington Seattle, WA 98195 USA {jmedero,ostendor}@uw.edu
More informationPhysics 270: Experimental Physics
2017 edition Lab Manual Physics 270 3 Physics 270: Experimental Physics Lecture: Lab: Instructor: Office: Email: Tuesdays, 2 3:50 PM Thursdays, 2 4:50 PM Dr. Uttam Manna 313C Moulton Hall umanna@ilstu.edu
More information- «Crede Experto:,,,». 2 (09) (http://ce.if-mstuca.ru) '36
- «Crede Experto:,,,». 2 (09). 2016 (http://ce.if-mstuca.ru) 811.512.122'36 Ш163.24-2 505.. е е ы, Қ х Ц Ь ғ ғ ғ,,, ғ ғ ғ, ғ ғ,,, ғ че ые :,,,, -, ғ ғ ғ, 2016 D. A. Alkebaeva Almaty, Kazakhstan NOUTIONS
More informationGlenn County Special Education Local Plan Area. SELPA Agreement
Page 1 of 10 Educational Mental Health Related Services, A Tiered Approach Draft Final March 21, 2012 Introduction Until 6-30-10, special education students with severe socio-emotional problems who did
More informationCreate A City: An Urban Planning Exercise Students learn the process of planning a community, while reinforcing their writing and speaking skills.
Create A City: An Urban Planning Exercise Students learn the process of planning a community, while reinforcing their writing and speaking skills. Author Gale Ekiss Grade Level 4-8 Duration 3 class periods
More informationAQUA: An Ontology-Driven Question Answering System
AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.
More informationDeveloping True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability
Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Shih-Bin Chen Dept. of Information and Computer Engineering, Chung-Yuan Christian University Chung-Li, Taiwan
More informationMODULE 4 Data Collection and Hypothesis Development. Trainer Outline
MODULE 4 Data Collection and Hypothesis Development Trainer Outline The following trainer guide includes estimated times for each section of the module, an overview of the information to be presented,
More informationLING 329 : MORPHOLOGY
LING 329 : MORPHOLOGY TTh 10:30 11:50 AM, Physics 121 Course Syllabus Spring 2013 Matt Pearson Office: Vollum 313 Email: pearsonm@reed.edu Phone: 7618 (off campus: 503-517-7618) Office hrs: Mon 1:30 2:30,
More informationFilms for ESOL training. Section 2 - Language Experience
Films for ESOL training Section 2 - Language Experience Introduction Foreword These resources were compiled with ESOL teachers in the UK in mind. They introduce a number of approaches and focus on giving
More informationAnnotation Pro. annotation of linguistic and paralinguistic features in speech. Katarzyna Klessa. Phon&Phon meeting
Annotation Pro annotation of linguistic and paralinguistic features in speech Katarzyna Klessa Phon&Phon meeting Faculty of English, AMU Poznań, 25 April 2017 annotationpro.org More information: Quick
More informationOutreach Connect User Manual
Outreach Connect A Product of CAA Software, Inc. Outreach Connect User Manual Church Growth Strategies Through Sunday School, Care Groups, & Outreach Involving Members, Guests, & Prospects PREPARED FOR:
More informationApplication Form Master Course Altervilles First Year M1
http://altervilles.universite-lyon.fr/ Application Form Master Course Altervilles 2015-2016 First Year M1 Application form to be sent to: christelle.morel.journel@univ-st-etienne.fr harold.mazoyer@sciencespo-lyon.fr
More informationSEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH
SEGMENTAL FEATURES IN SPONTANEOUS AND READ-ALOUD FINNISH Mietta Lennes Most of the phonetic knowledge that is currently available on spoken Finnish is based on clearly pronounced speech: either readaloud
More informationUsing dialogue context to improve parsing performance in dialogue systems
Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,
More informationLecture Notes in Artificial Intelligence 4343
Lecture Notes in Artificial Intelligence 4343 Edited by J. G. Carbonell and J. Siekmann Subseries of Lecture Notes in Computer Science Christian Müller (Ed.) Speaker Classification I Fundamentals, Features,
More informationThe Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh
The Effect of Discourse Markers on the Speaking Production of EFL Students Iman Moradimanesh Abstract The research aimed at investigating the relationship between discourse markers (DMs) and a special
More informationChapter 9: Conducting Interviews
Chapter 9: Conducting Interviews Chapter 9: Conducting Interviews Chapter Outline: 9.1 Interviewing: A Matter of Styles 9.2 Preparing for the Interview 9.3 Example of a Legal Interview 9.1 INTERVIEWING:
More informationDeveloping Grammar in Context
Developing Grammar in Context intermediate with answers Mark Nettle and Diana Hopkins PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United
More informationENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist
Meeting 2 Chapter 7 (Morphology) and chapter 9 (Syntax) Today s agenda Repetition of meeting 1 Mini-lecture on morphology Seminar on chapter 7, worksheet Mini-lecture on syntax Seminar on chapter 9, worksheet
More informationArabic Orthography vs. Arabic OCR
Arabic Orthography vs. Arabic OCR Rich Heritage Challenging A Much Needed Technology Mohamed Attia Having consistently been spoken since more than 2000 years and on, Arabic is doubtlessly the oldest among
More informationLecturing Module
Lecturing: What, why and when www.facultydevelopment.ca Lecturing Module What is lecturing? Lecturing is the most common and established method of teaching at universities around the world. The traditional
More informationWhat do Medical Students Need to Learn in Their English Classes?
ISSN - Journal of Language Teaching and Research, Vol., No., pp. 1-, May ACADEMY PUBLISHER Manufactured in Finland. doi:.0/jltr...1- What do Medical Students Need to Learn in Their English Classes? Giti
More informationMERRY CHRISTMAS Level: 5th year of Primary Education Grammar:
Level: 5 th year of Primary Education Grammar: Present Simple Tense. Sentence word order (Present Simple). Imperative forms. Functions: Expressing habits and routines. Describing customs and traditions.
More informationAn Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District
An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District Report Submitted June 20, 2012, to Willis D. Hawley, Ph.D., Special
More informationIMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER
IMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER Mohamad Nor Shodiq Institut Agama Islam Darussalam (IAIDA) Banyuwangi
More informationIntroduction to the Common European Framework (CEF)
Introduction to the Common European Framework (CEF) The Common European Framework is a common reference for describing language learning, teaching, and assessment. In order to facilitate both teaching
More informationAbstract. Janaka Jayalath Director / Information Systems, Tertiary and Vocational Education Commission, Sri Lanka.
FEASIBILITY OF USING ELEARNING IN CAPACITY BUILDING OF ICT TRAINERS AND DELIVERY OF TECHNICAL, VOCATIONAL EDUCATION AND TRAINING (TVET) COURSES IN SRI LANKA Janaka Jayalath Director / Information Systems,
More informationInterview Contact Information Please complete the following to be used to contact you to schedule your child s interview.
Cabarrus\Kannapolis Early College High School Interview Contact Information Please complete the following to be used to contact you to schedule your child s interview. Student Name Student Number Middle
More informationWeb as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics
(L615) Markus Dickinson Department of Linguistics, Indiana University Spring 2013 The web provides new opportunities for gathering data Viable source of disposable corpora, built ad hoc for specific purposes
More informationLinguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1
Linguistics 1 Linguistics Matthew Gordon, Chair Interdepartmental Program in the College of Arts and Science 223 Tate Hall (573) 882-6421 gordonmj@missouri.edu Kibby Smith, Advisor Office of Multidisciplinary
More informationAn Analysis of Gender Differences in Minimal Responses in the conversations in the two TV-series Growing Pains and Boy Meets World
An Analysis of Gender Differences in Minimal Responses in the conversations in the two TV-series Growing Pains and Boy Meets World \ Ying He Kristianstad University English department The C-level of English
More informationGuidelines for Writing an Internship Report
Guidelines for Writing an Internship Report Master of Commerce (MCOM) Program Bahauddin Zakariya University, Multan Table of Contents Table of Contents... 2 1. Introduction.... 3 2. The Required Components
More informationSoftware Maintenance
1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories
More informationContent Language Objectives (CLOs) August 2012, H. Butts & G. De Anda
Content Language Objectives (CLOs) Outcomes Identify the evolution of the CLO Identify the components of the CLO Understand how the CLO helps provide all students the opportunity to access the rigor of
More informationLEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)
More informationUniversity of Groningen. Systemen, planning, netwerken Bosman, Aart
University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document
More informationMeasurement. Time. Teaching for mastery in primary maths
Measurement Time Teaching for mastery in primary maths Contents Introduction 3 01. Introduction to time 3 02. Telling the time 4 03. Analogue and digital time 4 04. Converting between units of time 5 05.
More informationSection 7, Unit 4: Sample Student Book Activities for Teaching Listening
Section 7, Unit 4: Sample Student Book Activities for Teaching Listening I. ACTIVITIES TO PRACTICE THE SOUND SYSTEM 1. Listen and Repeat for elementary school students. It could be done as a pre-listening
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationIndividual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION
L I S T E N I N G Individual Component Checklist for use with ONE task ENGLISH VERSION INTRODUCTION This checklist has been designed for use as a practical tool for describing ONE TASK in a test of listening.
More informationGuidelines for drafting the participant observation report
Employment and Women on the 21st century in Europe: From Household economy to SME economy (Small and Medium enterprises) MUPYME Project Guidelines for drafting the participant observation report As agreed,
More informationGeorge Mason University Graduate School of Education Program: Special Education
George Mason University Graduate School of Education Program: Special Education 1 EDSE 590: Research Methods in Special Education Instructor: Margo A. Mastropieri, Ph.D. Assistant: Judy Ericksen Section
More informationNotes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1
Notes on The Sciences of the Artificial Adapted from a shorter document written for course 17-652 (Deciding What to Design) 1 Ali Almossawi December 29, 2005 1 Introduction The Sciences of the Artificial
More informationLANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 11 : 12 December 2011 ISSN
LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume ISSN 1930-2940 Managing Editor: M. S. Thirumalai, Ph.D. Editors: B. Mallikarjun, Ph.D. Sam Mohanlal, Ph.D. B. A. Sharada, Ph.D.
More informationSCU Graduation Occasional Address. Rear Admiral John Lord AM (Rtd) Chairman, Huawei Technologies Australia
SCU Graduation Occasional Address Rear Admiral John Lord AM (Rtd) Chairman, Huawei Technologies Australia 2.00 pm, Saturday, 24 September 2016 Whitebrook Theatre, Lismore Campus Ladies and gentlemen and
More informationMerbouh Zouaoui. Melouk Mohamed. Journal of Educational and Social Research MCSER Publishing, Rome-Italy. 1. Introduction
Acquiring Communication through Conversational Training: The Case Study of 1 st Year LMD Students at Djillali Liabès University Sidi Bel Abbès Algeria Doi:10.5901/jesr.2014.v4n6p353 Abstract Merbouh Zouaoui
More informationAN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)
B. PALTRIDGE, DISCOURSE ANALYSIS: AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC. 2012. PP. VI, 282) Review by Glenda Shopen _ This book is a revised edition of the author s 2006 introductory
More informationBiome I Can Statements
Biome I Can Statements I can recognize the meanings of abbreviations. I can use dictionaries, thesauruses, glossaries, textual features (footnotes, sidebars, etc.) and technology to define and pronounce
More informationCELTA. Syllabus and Assessment Guidelines. Third Edition. University of Cambridge ESOL Examinations 1 Hills Road Cambridge CB1 2EU United Kingdom
CELTA Syllabus and Assessment Guidelines Third Edition CELTA (Certificate in Teaching English to Speakers of Other Languages) is accredited by Ofqual (the regulator of qualifications, examinations and
More informationAP Statistics Summer Assignment 17-18
AP Statistics Summer Assignment 17-18 Welcome to AP Statistics. This course will be unlike any other math class you have ever taken before! Before taking this course you will need to be competent in basic
More information