A hybrid approach to translate Moroccan Arabic dialect

Size: px
Start display at page:

Download "A hybrid approach to translate Moroccan Arabic dialect"

Transcription

1 A hybrid approach to translate Moroccan Arabic dialect Ridouane Tachicart Mohammadia school of Engineers Mohamed Vth Agdal University, Rabat, Morocco Karim Bouzoubaa Mohammadia school of Engineers Mohamed Vth Agdal University, Rabat, Morocco Abstract Today, several tools exist for automatic processing of standard Arabic. In comparison, resources and tools for Arabic dialects especially Moroccan dialect are still lacking. Given the proximity of the Moroccan dialect with standard Arabic, a way is to provide a translation of the first to the second by combining a rule-based approach and a statistical approach, using tools designed for Arabic standard and adapting these tools to Moroccan dialect. We describe in this paper an architecture for such a translation. Keywords Automatic translation, Moroccan dialect, Standard Arabic, language model, translation model, bilingual dictionary. I. INTRODUCTION Arabic-speaking world is characterized by diglossia [15]. On the one hand, modern standard Arabic (MSA) is shared by the entire Arab world but it s not the mother tongue of any Arab country. It is used in the press, in broadcast news, in writing official government documents, etc. On the other hand, several Arabic dialects exist and are mother tongues. These dialects are usually not written and therefore have no standard spelling convention although initiatives have been launched to this end [1]. This particular situation is problematic for the automatic processing of Arabic dialects, and especially machine translation since resources for these languages are almost nonexistent. In addition, direct integration of resources designed for MSA in translation systems of Arabic dialects produces very low performance [1]. This leads to the necessity of either developing new Arabic dialect tools and resources or making adjustments to those existing for MSA. Scientific researches of Arabic dialects translation systems started only in the 2000s and are still in the early stages. Moroccan dialect (Darija) in turn, is not concerned in any research, which makes it isolated and remains difficult to understand for the majority of other Arab countries. In this paper we present an approach that involves translating the Moroccan dialect to the MSA by using tools designed to MSA and adaptating them to Moroccan dialect. On one hand, this translation aims to make a first step towards the automatic processing of Moroccan dialect. On the other hand, due to the increase of the content of this dialect on the web, it may facilitate cultural exchanges with people of the other Arab countries. We present in this paper the architecture of the future translation system combining a rule-based (analysis, transfer and generation) and a statistical approach to benefit from the advantages that these approaches provide. We proceed to the development of this system as soon as its architecture will be validated. In the first step, we use linguistic resources (morphological analyzer and bilingual dictionary) to try to produce possible translations of the source text of the Moroccan dialect, then we will use statistical tools for improving the results of this translation. Finally, evaluation of the quality of our system will be carried out using metrics which compare the result with the correct expected product results. The content of this article is as follows. We describe in section 2 the characteristics of the Moroccan dialect. Then we present in section 3 the existing approaches for the realization of a translation system. In section 4, we discuss previous works concerning automatic translation of Arabic dialects. We present in section 5 our approach. Finally, section 6 concludes the paper and suggests extensions and future work. II. MOROCCAN DIALECT Moroccan dialect, along with the Amazigh language, is the mother tongue of the Moroccan people. This is an evolving dialect as it continues to strongly integrate French, English االبيیسي as: and Spanish words especially in technical areas such /PC/, برووكراامم /program/ etc. We describe the characteristics of the dialect in the following. A. Transcription There are many ways to transcribe the sounds of the Moroccan dialect. First, very often a simplified writing using MSA characters is used.. This writing has sometimes unnecessary cases. For example silent letters such as ةة (end of a word) is pronounced اا in Moroccan dialect since this letter has no phonological or grammatical function, which is not the

2 case for the standard Arabic. In addition, this script cannot transcribe specific phenomena in Darija such as the pronunciation of G, V and P which we transcribe with االبوططو /sit/, جلس such as,بب and,فف,كك respectivly the letters /bar/ and االفيیداانج /drain/. A second method to transcribe Moroccan Arabic uses the Latin script. Its main drawback is that several letters transcribe the same phoneme as in the case of K, Q, and QU which can transcribe the letters قق and.طط Vice versa, ثث and تت can be represented by T. The last transpcription is the alphanumeric writing that involves writing the Darija using the Latin alphabet and some numbers in order to note /عع/ phonemes that do not exist in the French language such as and.قق For example the word عقوبة /sanction/ is transcribed with 3o9oba. This writing, in addition to its aberrant shape, has reading difficulties and cannot represent numbers with emphatic consonants such as emphatic 'S' or emphatic 'D'. The example below illustrates this: script: Arabic ما لقيیتش بحالو Latin script: Malkitch bhalo Alphanumeric script: Mal9itch b7alo B. Phonology Pronunciation of Darija and MSA is similar, but with two major differences. On the one hand, vowels are often omitted in Darija, especially when they are in end of an open syllable. This is probably due to interference with the Amazigh language [17]. For example, the word كتبت /to write/ is pronounced in MSA as 'Katabto', while in Darija it is pronounced as 'Ktbt. On the other hand, Moroccan dialect is characterized by the pronunciation of consonants G, V and W [16] that don t exist in the MSA. For example, 'Gouli', which.قولي means 'tell me', its equivalent in MSA is C. Vocabulary The vocabulary of Moroccan Arabic imports its words from several languages, but it is strongly influenced by the vocabulary of the MSA. The majority of its words are imported from the MSA. We can also find in the Moroccan dialect words of French origin, and a few words of Spanish and Amazigh origins. Table I shows the sources of some words of the Moroccan dialect. TABLE I. VOCABULARY OF MOROCCAN DIALECT Word Origin Darija equivalent شربت MSA شربت ا أتمشى MSA كنتمشى فهھمتني MSA فهھمتني اابجاوو Amazigh اابجاوو موشش Amazigh مش تماررةة Amazigh تماررةة Autobus French ططوبيیس Fromage French فرماجج Fourchette French فوررشيیطا Cuerda Spanish كورردداا Roueda Spanish رروويیداا Cucina Spanish كوززيینة D. Morpho-syntax The morphology of the Moroccan dialect is much less complex compared to the MSA. First, the composition of a sentence in Darija [17] is dominated by the order subject-verbobject. Then the words of MSA origin keep their morphology, or have some modifications in their patterns or their affixes. غا or غادديي in MSA is converted to سس For example, the prefix in Darija. In addition, the conjugation of Moroccan dialect verbs is possible using a list of prefixes and suffixes with a slight modification in the pattern of the word. In addition and unlike the MSA the conjugation of verbs in the female plural and dual don t exist, but it is replaced by the simple plural. Table II compares the conjugation of a verb in both languages. TABLE II. VERBS CONJUGATION MSA MD Dual ضربوهھھھما ضربوهھھھم Plural ضربوهھھھم ضربوهھھھم Female Plural ضربوهھھھن ضربوهھھھم Finally, negation in Moroccan Arabic dialects is similar to the Arabic dialects of North Africa, where the verb is always مانكتبش such as شش and the suffix ما included between the prefix /I don t write/. III. EXISTENT APPROACHES FOR AUTOMATIC TRANSLATION The development of a system for translating the Moroccan dialect requires an approach of machine translation. In fact, there are two basic strategies. On the one hand, the rule-based approach uses a morphological analysis by exploiting linguistic information of the source and target languages. It takes place in three phases (Figure 1), where the first is the analysis that produces a series of units to determine the grammatical structure of each word, knowing that often a morphological analyzer is introduced in this phase. Then each previous analysis is associated with one or more analyzes of the target language using a bilingual dictionary in the transfer phase. Finally, a target language text is produced in the generation phase with an appropriate order.

3 to build these systems. First, some have built their translation systems using rules [6, 7]. Others [8, 9] were based on statistical tools. The third approach, which is a hybrid, served some others for the implementation of their systems [4]. Nizar Habash and Owen Rambow [2] developed the MAGEAD morphological analyzer for Arabic dialects which can output the root, the pattern and the affixes for each entry. In the same principle, the authors of [3] rely on the use of the morphological analyzer Buckwalter [14] to translate Egyptian dialect to the MSA with diacritics combining a statistical approach and knowledge of linguistic rules. Wael Salloum and Fig. 1. Rule-based translation On the other hand, the statistical approach, which is used to reduce the cost of developing translation systems, uses two analyzes: the first is an analysis of a parallel corpus made by a translation model; the second is an analysis of a monolingual corpus made by a language model. These analyses give some parameters to be used by the decoder in order to calculate probabilities of the translation. In fact, the language model introduces the constraints imposed by the syntax of the target language and estimates the probability of a sentence of the language, while the translation model models the process of generating a source sentence from a target sentence. The output of these two models is used by a decoder that maximizes the likelihood of the translation of the source sentence to the target sentence in an acceptable time (Figure 2) Nizar Habash have used ADAM [4], a morphological Arabic dialects analyzer, to create ELISSA a rule-based system, which translates Arabic dialects to the MSA. Finally, Yahya alamlahi [5] introduced a system of translation of Yemeni dialect to the MSA without the use of tools, but only using an algorithm that analyzes the words of this dialect based on the list of affixes. III. APPROACH OF OUR SYSTEM A. Adopted approach A rules-based approach may be particularly suited to certain phrases, while other linguistic phenomena are properly addressed by a corpus-based approach (statistical method). We opt for a hybrid approach to leverage the strengths of each approach and produce a satisfactory translation. Indeed, we proceed through several stages (Figure 3) to reach our goal and translate the Moroccan dialect to the MSA. Fig. 2. Statistical translation II. PREVIOUS WORKS The number of research concerning the automatic translation between Arabic dialects and MSA has recently increased but remains low compared to research on the MSA. Indeed, researchers usually rely on three types of approaches

4 Fig. 3. Architecture of our system dialect corpus. In order to prepare the corpus, we will use the writings of some television productions scenarios and we use some MSA dictionaries to produce a bilingual dictionary. The extension of the bilingual dictionary will be made by collecting other resources on the web to ensure maximum coverage of the vocabulary of the Moroccan dialect. The second phase is to develop the system of translation in several steps. In the first step of identification, our system separates the content of the source text to distinguish the content of the dialect in MSA in order to facilitate the translation. Then, in the step of analyzing, the source text is segmented into annotated dialectal units using the Alkhalil morphological analyzer [10]. In addition of being free and open source, the advantage of this analyzer is its performance (ability to analyze ten words per second). It also provides for each word all possible analyzes based on morphological and syntactic rules. It uses independent database which contain a large linguistic resources, which offers the possibility to adapt the database with our needs without modifying the body of the morphological analyzer. In fact, we extend the list of prefixes and suffixes of Alkhalil to cover the Moroccan dialect and we adapt its dictionary with our dictionary (Moroccan dialect versus MSA) which includes words from several origins (French, Spanish, Amazigh...). In the transfer step, each unit produced above will be linked to one or more corresponding units in the target language (MSA) using our bilingual dictionary. Then, the system uses the previous analyses to generate different MSA phrases in the generation step. After that, it must optimize the result to produce the most fluent sentences using a language model built on the basis of SRILM tool [11]. Let us note that SRLIM is developed under Linux as scripts, it s free to use and quickly allows building language models. These models rely on the analysis of the n-grams model and allow representing and processing texts in several languages. Finally, to measure the quality of the translation of our system, we use usual and conventional indicators such as Recall, Precision, METEOR [12] and BLUE [13]. The latter TABLE III. TRANSLATION EXAMPLE 1 Input دداابا ماشوفتوهھھھمش You did not see them now? 2 ماشوفتوهھھھمش Analysis شش ووهھھھم تت شوفف ما 3 حرفف نفي ضميیرر االغاي بيینن تاء االمخاططب االجذعع حرفف نفي دداابا دداابا ظظرفف ززمانن 4 Transfert االا نن ظظرفف ززمانن لم حرفف نفي ررا أىى االجذعع تت تاء االمخاططب ووهھھھم ضميیرر االغاي بيینن 5 Generation ررا أيیتموهھھھم لم االا نن 6 االا نن لم ررا أيیتموهھھھم 7 Language Model لم ترووهھھھم االا نن 8 Output لم ترووهھھھم االا نن You did not see them now? Our approach is divided into two phases. The first is to collect and produce the linguistic resources necessary for the development of such a translation system, namely: a bilingual dictionary MSA versus Moroccan dialect and a Moroccan has gained the status of automatic measurement of reference within the community of machine translation, and is characterized by a brevity penalty to penalize systems that try

5 to artificially boost their scores by producing deliberately short sentences, while METEOR which is referred to the improvement of the correlation between the translation system and the translation of human translation level segments is characterized by the harmonic mean of the uni-grams of precision and recall. B. Example of translation Table III shows an example of translation of the text in Moroccan dialect دداابا ماشوفتوهھھھمش /you did not see them now/ to MSA according to the approach that we have just explained. In fact, in the analysis phase, the system uses the Alkhalil morphological analyzer to split this sentence in two words: دداابا and ماشوفتوهھھھمش in order to produce annotated units of each word as shown in the third row of table III. Indeed, the word دداابا is analyzed as ظظرفف ززمانن /adverb of time/, while the word is analyzed as a verbal unit and ماشوفتوهھھھمش decomposed to the prefix ما and the suffixes ووهھھھم,تت and,شش and the stem.شوفف Then, in the transfer phase (line 4), each unit produced in the previous phase is linked to its equivalent in MSA using the bilingual dictionary already created. In our ما+شش, دداابا example, the system translates respectively the units.ووهھھھم and االا نن, لم, ررا أىى, تت to its MSA equivalents ووهھھھم and شوفف, تت االا نن لم Then, in the generation phase (line 5 and 6), the phrase analysis. in MSA is generated from the previous ررا أيیتموهھھھم Finally, the component of the language model (line 7) uses the SRILM tool to improve the quality of translation and leads to the final sentence لم ترووهھھھم االا نن that is more fluent than the sentence االا نن لم ررا أيیتموهھھھم produced in the generation phase. IV. CONCLUSION We have presented an approach to translate the Moroccan dialect to MSA by exploiting processing tools for MSA. This is the first translation work concerning the Moroccan dialect. Among its extensions, we plan to introduce in the identification phase a grammar and spelling corrector. REFERENCES [1] Habash N., Diab M., Rambow O. (2011). Conventional Orthography for Dialectal Arabic. In: Proc. Eighth International Conference on Language Resources and Evaluation, LREC 2012, Istanbul, Turkey, May [2] Habash N., Rambow O. (2011). MAGEAD: a morphological analyzer and generator for the Arabic dialects (2006). In: Proc. 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, Stroudsburg, USA, July [3] Abo Bakr H., Shaalan K., Ziedan I. (2008). A Transferring Egyptian Colloquial Dialect into Modern Standard Arabic. In: Proc. International Conference on Recent Advances in Natural Language Processing, RANLP 2007, Borovets, Bulgaria, September [4] Salloum W., Habash N. (2012). Elissa: A Dialectal to Standard Arabic Machine Translation System. In: Proc. 24th International Conference on Computational Linguistics, COLING 2012, Mumbai, India, December [5] Almalahi Y., Fateh, A., (2007). Sana ani Dialect to Modern Standard Arabic: Rule-based Direct Machine Translation. [6] Abo Bakr H., Shaalan K., Ziedan I. (2008). A Hybrid Approach for Converting Written Egyptian Colloquial Dialect into Diacritized Arabic. In: Proc. 6th International Conference on Informatics and Systems, INFOS 2008, Cairo, Egypt, Mars [7] Mohamed E., Mohit B., and Oflazer K., (2012) Transforming Standard Arabic to Colloquial Arabic. In: Proc. 50th Annual Meeting of the Association for Computational Linguistics, ACL 2012, Jeju, Korea, July [8] Zbib R., Malchiod E., Devlin J., Stallard D., Matsoukas S. Schwartz R., Makhoul J., Zaidan O., Callison-Burch C. (2012) Machine translation of Arabic dialects. In: Proc. 12th Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2012, Montreal, Canada, June 3-8. [9] Salloum W., Habash N. Plamondon, L., (2011). Dialectal to Standard Arabic Paraphrasing to Improve Arabic-English Statistical Machine Translation. In: Proc. Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, Edinburgh, Scotland, UK, July [10] Boudlal A., Lakhouaja A., Mazroui A., Meziane A., Ould abdallahi ould bebah A., Shoul M. (2010).Alkhalil Morpho Sys1: A Morphosyntactic analysis system for Arabic texts. In: Proc. The International Arab Conference on Information Technology, ACIT 2010, Benghazi, Libya, December [11] Stolcke A. (2002) SRILM An Extensible language modeling toolkit. In Proc. 7th International Conference on Spoken Language Processing, ICSLP 2002, Denver, Colorado, USA, September [12] Denkowski M., Lavie A. (2011). Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems. In: Proc. 6th Workshop on Statistical Machine Translation, EMNLP 2011, Edinburgh, Scotland, UK, July 30 31, [13] Papenini K., Roukos S., Ward T., Zhu W. (2002). BLEU: a method for automatic evaluation of machine translation. In: Proc. 40th Annual Meeting on Association for Computational Linguistics, ACL 2002, Pennsylvania, Philadelphia, USA, July [14] Tim Buckwalter, "Buckwalter Arabic Morphological Analyzer Version 1.0. ", Tim Buckwalter ed Linguistic Data Consortium: University of Pennsylvania, [15] Ferguson, Charles (1959). "Diglossia". Word 15. [16] Ennaji Moha: Multilingualism, cultural identity, and education in Morocco. p [17] Ait cherif A., Boukbout M., Mahmoudi M., and Ouhmouch A. (2011). Moroccan Arabic Textbook - Peace corps Morocco.

arxiv: v1 [cs.cl] 2 Apr 2017

arxiv: v1 [cs.cl] 2 Apr 2017 Word-Alignment-Based Segment-Level Machine Translation Evaluation using Word Embeddings Junki Matsuo and Mamoru Komachi Graduate School of System Design, Tokyo Metropolitan University, Japan matsuo-junki@ed.tmu.ac.jp,

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data

Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data Maja Popović and Hermann Ney Lehrstuhl für Informatik VI, Computer

More information

Modeling full form lexica for Arabic

Modeling full form lexica for Arabic Modeling full form lexica for Arabic Susanne Alt Amine Akrout Atilf-CNRS Laurent Romary Loria-CNRS Objectives Presentation of the current standardization activity in the domain of lexical data modeling

More information

Florida Reading Endorsement Alignment Matrix Competency 1

Florida Reading Endorsement Alignment Matrix Competency 1 Florida Reading Endorsement Alignment Matrix Competency 1 Reading Endorsement Guiding Principle: Teachers will understand and teach reading as an ongoing strategic process resulting in students comprehending

More information

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011 The Karlsruhe Institute of Technology Translation Systems for the WMT 2011 Teresa Herrmann, Mohammed Mediani, Jan Niehues and Alex Waibel Karlsruhe Institute of Technology Karlsruhe, Germany firstname.lastname@kit.edu

More information

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona Parallel Evaluation in Stratal OT * Adam Baker University of Arizona tabaker@u.arizona.edu 1.0. Introduction The model of Stratal OT presented by Kiparsky (forthcoming), has not and will not prove uncontroversial

More information

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature 1 st Grade Curriculum Map Common Core Standards Language Arts 2013 2014 1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature Key Ideas and Details

More information

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny By the End of Year 8 All Essential words lists 1-7 290 words Commonly Misspelt Words-55 working out more complex, irregular, and/or ambiguous words by using strategies such as inferring the unknown from

More information

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading ELA/ELD Correlation Matrix for ELD Materials Grade 1 Reading The English Language Arts (ELA) required for the one hour of English-Language Development (ELD) Materials are listed in Appendix 9-A, Matrix

More information

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches Yu-Chun Wang Chun-Kai Wu Richard Tzong-Han Tsai Department of Computer Science

More information

Language Model and Grammar Extraction Variation in Machine Translation

Language Model and Grammar Extraction Variation in Machine Translation Language Model and Grammar Extraction Variation in Machine Translation Vladimir Eidelman, Chris Dyer, and Philip Resnik UMIACS Laboratory for Computational Linguistics and Information Processing Department

More information

What the National Curriculum requires in reading at Y5 and Y6

What the National Curriculum requires in reading at Y5 and Y6 What the National Curriculum requires in reading at Y5 and Y6 Word reading apply their growing knowledge of root words, prefixes and suffixes (morphology and etymology), as listed in Appendix 1 of the

More information

Taught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,

Taught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words, First Grade Standards These are the standards for what is taught in first grade. It is the expectation that these skills will be reinforced after they have been taught. Taught Throughout the Year Foundational

More information

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1) Houghton Mifflin Reading Correlation to the Standards for English Language Arts (Grade1) 8.3 JOHNNY APPLESEED Biography TARGET SKILLS: 8.3 Johnny Appleseed Phonemic Awareness Phonics Comprehension Vocabulary

More information

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction CLASSIFICATION OF PROGRAM Critical Elements Analysis 1 Program Name: Macmillan/McGraw Hill Reading 2003 Date of Publication: 2003 Publisher: Macmillan/McGraw Hill Reviewer Code: 1. X The program meets

More information

Opportunities for Writing Title Key Stage 1 Key Stage 2 Narrative

Opportunities for Writing Title Key Stage 1 Key Stage 2 Narrative English Teaching Cycle The English curriculum at Wardley CE Primary is based upon the National Curriculum. Our English is taught through a text based curriculum as we believe this is the best way to develop

More information

Re-evaluating the Role of Bleu in Machine Translation Research

Re-evaluating the Role of Bleu in Machine Translation Research Re-evaluating the Role of Bleu in Machine Translation Research Chris Callison-Burch Miles Osborne Philipp Koehn School on Informatics University of Edinburgh 2 Buccleuch Place Edinburgh, EH8 9LW callison-burch@ed.ac.uk

More information

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.

More information

Noisy SMS Machine Translation in Low-Density Languages

Noisy SMS Machine Translation in Low-Density Languages Noisy SMS Machine Translation in Low-Density Languages Vladimir Eidelman, Kristy Hollingshead, and Philip Resnik UMIACS Laboratory for Computational Linguistics and Information Processing Department of

More information

Age Effects on Syntactic Control in. Second Language Learning

Age Effects on Syntactic Control in. Second Language Learning Age Effects on Syntactic Control in Second Language Learning Miriam Tullgren Loyola University Chicago Abstract 1 This paper explores the effects of age on second language acquisition in adolescents, ages

More information

TINE: A Metric to Assess MT Adequacy

TINE: A Metric to Assess MT Adequacy TINE: A Metric to Assess MT Adequacy Miguel Rios, Wilker Aziz and Lucia Specia Research Group in Computational Linguistics University of Wolverhampton Stafford Street, Wolverhampton, WV1 1SB, UK {m.rios,

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

1. Introduction. 2. The OMBI database editor

1. Introduction. 2. The OMBI database editor OMBI bilingual lexical resources: Arabic-Dutch / Dutch-Arabic Carole Tiberius, Anna Aalstein, Instituut voor Nederlandse Lexicologie Jan Hoogland, Nederlands Instituut in Marokko (NIMAR) In this paper

More information

First Grade Curriculum Highlights: In alignment with the Common Core Standards

First Grade Curriculum Highlights: In alignment with the Common Core Standards First Grade Curriculum Highlights: In alignment with the Common Core Standards ENGLISH LANGUAGE ARTS Foundational Skills Print Concepts Demonstrate understanding of the organization and basic features

More information

English Language and Applied Linguistics. Module Descriptions 2017/18

English Language and Applied Linguistics. Module Descriptions 2017/18 English Language and Applied Linguistics Module Descriptions 2017/18 Level I (i.e. 2 nd Yr.) Modules Please be aware that all modules are subject to availability. If you have any questions about the modules,

More information

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,

More information

Derivational and Inflectional Morphemes in Pak-Pak Language

Derivational and Inflectional Morphemes in Pak-Pak Language Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes

More information

Problems of the Arabic OCR: New Attitudes

Problems of the Arabic OCR: New Attitudes Problems of the Arabic OCR: New Attitudes Prof. O.Redkin, Dr. O.Bernikova Department of Asian and African Studies, St. Petersburg State University, St Petersburg, Russia Abstract - This paper reviews existing

More information

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading Program Requirements Competency 1: Foundations of Instruction 60 In-service Hours Teachers will develop substantive understanding of six components of reading as a process: comprehension, oral language,

More information

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1 Linguistics 1 Linguistics Matthew Gordon, Chair Interdepartmental Program in the College of Arts and Science 223 Tate Hall (573) 882-6421 gordonmj@missouri.edu Kibby Smith, Advisor Office of Multidisciplinary

More information

Coast Academies Writing Framework Step 4. 1 of 7

Coast Academies Writing Framework Step 4. 1 of 7 1 KPI Spell further homophones. 2 3 Objective Spell words that are often misspelt (English Appendix 1) KPI Place the possessive apostrophe accurately in words with regular plurals: e.g. girls, boys and

More information

Test Blueprint. Grade 3 Reading English Standards of Learning

Test Blueprint. Grade 3 Reading English Standards of Learning Test Blueprint Grade 3 Reading 2010 English Standards of Learning This revised test blueprint will be effective beginning with the spring 2017 test administration. Notice to Reader In accordance with the

More information

A Quantitative Method for Machine Translation Evaluation

A Quantitative Method for Machine Translation Evaluation A Quantitative Method for Machine Translation Evaluation Jesús Tomás Escola Politècnica Superior de Gandia Universitat Politècnica de València jtomas@upv.es Josep Àngel Mas Departament d Idiomes Universitat

More information

Phonological Processing for Urdu Text to Speech System

Phonological Processing for Urdu Text to Speech System Phonological Processing for Urdu Text to Speech System Sarmad Hussain Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences, B Block, Faisal Town, Lahore,

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

Sentiment Analysis of Tunisian Dialect: Linguistic Resources and Experiments

Sentiment Analysis of Tunisian Dialect: Linguistic Resources and Experiments Sentiment Analysis of Tunisian Dialect: Linguistic Resources and Experiments Salima Mdhaffar 1,2, Fethi Bougares 1, Yannick Estève 1 and Lamia Hadrich-Belguith 2 1 LIUM Lab, University of Le Mans, France

More information

Arabic Orthography vs. Arabic OCR

Arabic Orthography vs. Arabic OCR Arabic Orthography vs. Arabic OCR Rich Heritage Challenging A Much Needed Technology Mohamed Attia Having consistently been spoken since more than 2000 years and on, Arabic is doubtlessly the oldest among

More information

ARNE - A tool for Namend Entity Recognition from Arabic Text

ARNE - A tool for Namend Entity Recognition from Arabic Text 24 ARNE - A tool for Namend Entity Recognition from Arabic Text Carolin Shihadeh DFKI Stuhlsatzenhausweg 3 66123 Saarbrücken, Germany carolin.shihadeh@dfki.de Günter Neumann DFKI Stuhlsatzenhausweg 3 66123

More information

LING 329 : MORPHOLOGY

LING 329 : MORPHOLOGY LING 329 : MORPHOLOGY TTh 10:30 11:50 AM, Physics 121 Course Syllabus Spring 2013 Matt Pearson Office: Vollum 313 Email: pearsonm@reed.edu Phone: 7618 (off campus: 503-517-7618) Office hrs: Mon 1:30 2:30,

More information

To appear in The TESOL encyclopedia of ELT (Wiley-Blackwell) 1 RECASTING. Kazuya Saito. Birkbeck, University of London

To appear in The TESOL encyclopedia of ELT (Wiley-Blackwell) 1 RECASTING. Kazuya Saito. Birkbeck, University of London To appear in The TESOL encyclopedia of ELT (Wiley-Blackwell) 1 RECASTING Kazuya Saito Birkbeck, University of London Abstract Among the many corrective feedback techniques at ESL/EFL teachers' disposal,

More information

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

The Internet as a Normative Corpus: Grammar Checking with a Search Engine The Internet as a Normative Corpus: Grammar Checking with a Search Engine Jonas Sjöbergh KTH Nada SE-100 44 Stockholm, Sweden jsh@nada.kth.se Abstract In this paper some methods using the Internet as a

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

A heuristic framework for pivot-based bilingual dictionary induction

A heuristic framework for pivot-based bilingual dictionary induction 2013 International Conference on Culture and Computing A heuristic framework for pivot-based bilingual dictionary induction Mairidan Wushouer, Toru Ishida, Donghui Lin Department of Social Informatics,

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Dickinson ISD ELAR Year at a Glance 3rd Grade- 1st Nine Weeks

Dickinson ISD ELAR Year at a Glance 3rd Grade- 1st Nine Weeks 3rd Grade- 1st Nine Weeks R3.8 understand, make inferences and draw conclusions about the structure and elements of fiction and provide evidence from text to support their understand R3.8A sequence and

More information

The development of a new learner s dictionary for Modern Standard Arabic: the linguistic corpus approach

The development of a new learner s dictionary for Modern Standard Arabic: the linguistic corpus approach BILINGUAL LEARNERS DICTIONARIES The development of a new learner s dictionary for Modern Standard Arabic: the linguistic corpus approach Mark VAN MOL, Leuven, Belgium Abstract This paper reports on the

More information

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF) in Language and Literacy

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF) in Language and Literacy 1 Desired Results Developmental Profile (2015) [DRDP (2015)] Correspondence to California Foundations: Language and Development (LLD) and the Foundations (PLF) The Language and Development (LLD) domain

More information

Language. Name: Period: Date: Unit 3. Cultural Geography

Language. Name: Period: Date: Unit 3. Cultural Geography Name: Period: Date: Unit 3 Language Cultural Geography The following information corresponds to Chapters 8, 9 and 10 in your textbook. Fill in the blanks to complete the definition or sentence. Note: All

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

The NICT Translation System for IWSLT 2012

The NICT Translation System for IWSLT 2012 The NICT Translation System for IWSLT 2012 Andrew Finch Ohnmar Htun Eiichiro Sumita Multilingual Translation Group MASTAR Project National Institute of Information and Communications Technology Kyoto,

More information

Using SAM Central With iread

Using SAM Central With iread Using SAM Central With iread January 1, 2016 For use with iread version 1.2 or later, SAM Central, and Student Achievement Manager version 2.4 or later PDF0868 (PDF) Houghton Mifflin Harcourt Publishing

More information

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling

The ABCs of O-G. Materials Catalog. Skills Workbook. Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling 2008 Intermediate Level Skills Workbook Group 2 Groups 1 & 2 The ABCs of O-G The Flynn System by Emi Flynn Lesson Plans for Teaching The Orton-Gillingham Approach in Reading and Spelling The ABCs of O-G

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2 1 CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2 Peter A. Chew, Brett W. Bader, Ahmed Abdelali Proceedings of the 13 th SIGKDD, 2007 Tiago Luís Outline 2 Cross-Language IR (CLIR) Latent Semantic Analysis

More information

TEKS Comments Louisiana GLE

TEKS Comments Louisiana GLE Side-by-Side Comparison of the Texas Educational Knowledge Skills (TEKS) Louisiana Grade Level Expectations (GLEs) ENGLISH LANGUAGE ARTS: Kindergarten TEKS Comments Louisiana GLE (K.1) Listening/Speaking/Purposes.

More information

DIBELS Next BENCHMARK ASSESSMENTS

DIBELS Next BENCHMARK ASSESSMENTS DIBELS Next BENCHMARK ASSESSMENTS Click to edit Master title style Benchmark Screening Benchmark testing is the systematic process of screening all students on essential skills predictive of later reading

More information

CEFR Overall Illustrative English Proficiency Scales

CEFR Overall Illustrative English Proficiency Scales CEFR Overall Illustrative English Proficiency s CEFR CEFR OVERALL ORAL PRODUCTION Has a good command of idiomatic expressions and colloquialisms with awareness of connotative levels of meaning. Can convey

More information

1.2 Interpretive Communication: Students will demonstrate comprehension of content from authentic audio and visual resources.

1.2 Interpretive Communication: Students will demonstrate comprehension of content from authentic audio and visual resources. Course French I Grade 9-12 Unit of Study Unit 1 - Bonjour tout le monde! & les Passe-temps Unit Type(s) x Topical Skills-based Thematic Pacing 20 weeks Overarching Standards: 1.1 Interpersonal Communication:

More information

National Literacy and Numeracy Framework for years 3/4

National Literacy and Numeracy Framework for years 3/4 1. Oracy National Literacy and Numeracy Framework for years 3/4 Speaking Listening Collaboration and discussion Year 3 - Explain information and ideas using relevant vocabulary - Organise what they say

More information

Constructing Parallel Corpus from Movie Subtitles

Constructing Parallel Corpus from Movie Subtitles Constructing Parallel Corpus from Movie Subtitles Han Xiao 1 and Xiaojie Wang 2 1 School of Information Engineering, Beijing University of Post and Telecommunications artex.xh@gmail.com 2 CISTR, Beijing

More information

C a l i f o r n i a N o n c r e d i t a n d A d u l t E d u c a t i o n. E n g l i s h a s a S e c o n d L a n g u a g e M o d e l

C a l i f o r n i a N o n c r e d i t a n d A d u l t E d u c a t i o n. E n g l i s h a s a S e c o n d L a n g u a g e M o d e l C a l i f o r n i a N o n c r e d i t a n d A d u l t E d u c a t i o n E n g l i s h a s a S e c o n d L a n g u a g e M o d e l C u r r i c u l u m S t a n d a r d s a n d A s s e s s m e n t G u i d

More information

IMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER

IMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER IMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER Mohamad Nor Shodiq Institut Agama Islam Darussalam (IAIDA) Banyuwangi

More information

Reading Horizons. A Look At Linguistic Readers. Nicholas P. Criscuolo APRIL Volume 10, Issue Article 5

Reading Horizons. A Look At Linguistic Readers. Nicholas P. Criscuolo APRIL Volume 10, Issue Article 5 Reading Horizons Volume 10, Issue 3 1970 Article 5 APRIL 1970 A Look At Linguistic Readers Nicholas P. Criscuolo New Haven, Connecticut Public Schools Copyright c 1970 by the authors. Reading Horizons

More information

Parsing of part-of-speech tagged Assamese Texts

Parsing of part-of-speech tagged Assamese Texts IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal

More information

Primary English Curriculum Framework

Primary English Curriculum Framework Primary English Curriculum Framework Primary English Curriculum Framework This curriculum framework document is based on the primary National Curriculum and the National Literacy Strategy that have been

More information

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING SISOM & ACOUSTICS 2015, Bucharest 21-22 May THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING MarilenaăLAZ R 1, Diana MILITARU 2 1 Military Equipment and Technologies Research Agency, Bucharest,

More information

8 The Growth of English Language Learning in Morocco: Culture, Class, and Status Competition

8 The Growth of English Language Learning in Morocco: Culture, Class, and Status Competition Elizabeth S. Buckner 8 The Growth of English Language Learning in Morocco: Culture, Class, and Status Competition Abstract English usage has been growing rapidly throughout Morocco, as the country further

More information

Using dialogue context to improve parsing performance in dialogue systems

Using dialogue context to improve parsing performance in dialogue systems Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,

More information

Software Maintenance

Software Maintenance 1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories

More information

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Notebook for PAN at CLEF 2013 Andrés Alfonso Caurcel Díaz 1 and José María Gómez Hidalgo 2 1 Universidad

More information

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80.

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80. CONTENTS FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8 УРОК (Unit) 1 25 1.1. QUESTIONS WITH КТО AND ЧТО 27 1.2. GENDER OF NOUNS 29 1.3. PERSONAL PRONOUNS 31 УРОК (Unit) 2 38 2.1. PRESENT TENSE OF THE

More information

Using a Native Language Reference Grammar as a Language Learning Tool

Using a Native Language Reference Grammar as a Language Learning Tool Using a Native Language Reference Grammar as a Language Learning Tool Stacey I. Oberly University of Arizona & American Indian Language Development Institute Introduction This article is a case study in

More information

BULATS A2 WORDLIST 2

BULATS A2 WORDLIST 2 BULATS A2 WORDLIST 2 INTRODUCTION TO THE BULATS A2 WORDLIST 2 The BULATS A2 WORDLIST 21 is a list of approximately 750 words to help candidates aiming at an A2 pass in the Cambridge BULATS exam. It is

More information

Reading Grammar Section and Lesson Writing Chapter and Lesson Identify a purpose for reading W1-LO; W2- LO; W3- LO; W4- LO; W5-

Reading Grammar Section and Lesson Writing Chapter and Lesson Identify a purpose for reading W1-LO; W2- LO; W3- LO; W4- LO; W5- New York Grade 7 Core Performance Indicators Grades 7 8: common to all four ELA standards Throughout grades 7 and 8, students demonstrate the following core performance indicators in the key ideas of reading,

More information

Acquisition vs. Learning of a Second Language: English Negation

Acquisition vs. Learning of a Second Language: English Negation Interculturalia Acquisition vs. Learning of a Second Language: English Negation Oana BADEA Key-words: acquisition, learning, first/second language, English negation General Remarks on Theories of Second/

More information

Curriculum and Assessment Guide (CAG) Elementary California Treasures First Grade

Curriculum and Assessment Guide (CAG) Elementary California Treasures First Grade Curriculum and Assessment Guide (CAG) Elementary 2012-2013 California Treasures First Grade 1 2 English Language Arts CORE INSTRUCTIONAL MATERIALS 2012-2013 Grade 1 Macmillan/McGraw-Hill California Treasures

More information

The College Board Redesigned SAT Grade 12

The College Board Redesigned SAT Grade 12 A Correlation of, 2017 To the Redesigned SAT Introduction This document demonstrates how myperspectives English Language Arts meets the Reading, Writing and Language and Essay Domains of Redesigned SAT.

More information

Grade 11 Language Arts (2 Semester Course) CURRICULUM. Course Description ENGLISH 11 (2 Semester Course) Duration: 2 Semesters Prerequisite: None

Grade 11 Language Arts (2 Semester Course) CURRICULUM. Course Description ENGLISH 11 (2 Semester Course) Duration: 2 Semesters Prerequisite: None Grade 11 Language Arts (2 Semester Course) CURRICULUM Course Description ENGLISH 11 (2 Semester Course) Duration: 2 Semesters Prerequisite: None Through the integrated study of literature, composition,

More information

A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition

A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition Abir Masmoudi 1,2, Mariem Ellouze Khemakhem 1,Yannick Estève 2, Lamia Hadrich Belguith 1 and Nizar Habash 3 (1) ANLP Research group,

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

MARK 12 Reading II (Adaptive Remediation)

MARK 12 Reading II (Adaptive Remediation) MARK 12 Reading II (Adaptive Remediation) The MARK 12 (Mastery. Acceleration. Remediation. K 12.) courses are for students in the third to fifth grades who are struggling readers. MARK 12 Reading II gives

More information

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s)) Ohio Academic Content Standards Grade Level Indicators (Grade 11) A. ACQUISITION OF VOCABULARY Students acquire vocabulary through exposure to language-rich situations, such as reading books and other

More information

Automatic Assessment of Spoken Modern Standard Arabic

Automatic Assessment of Spoken Modern Standard Arabic Automatic Assessment of Spoken Modern Standard Arabic Jian Cheng, Jared Bernstein, Ulrike Pado, Masanori Suzuki Pearson Knowledge Technologies 299 California Ave, Palo Alto, CA 94306 jian.cheng@pearson.com

More information

Machine Translation on the Medical Domain: The Role of BLEU/NIST and METEOR in a Controlled Vocabulary Setting

Machine Translation on the Medical Domain: The Role of BLEU/NIST and METEOR in a Controlled Vocabulary Setting Machine Translation on the Medical Domain: The Role of BLEU/NIST and METEOR in a Controlled Vocabulary Setting Andre CASTILLA castilla@terra.com.br Alice BACIC Informatics Service, Instituto do Coracao

More information

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 141 ( 2014 ) 124 128 WCLTA 2013 Using Corpus Linguistics in the Development of Writing Blanka Frydrychova

More information

ROSETTA STONE PRODUCT OVERVIEW

ROSETTA STONE PRODUCT OVERVIEW ROSETTA STONE PRODUCT OVERVIEW Method Rosetta Stone teaches languages using a fully-interactive immersion process that requires the student to indicate comprehension of the new language and provides immediate

More information

MARK¹² Reading II (Adaptive Remediation)

MARK¹² Reading II (Adaptive Remediation) MARK¹² Reading II (Adaptive Remediation) Scope & Sequence : Scope & Sequence documents describe what is covered in a course (the scope) and also the order in which topics are covered (the sequence). These

More information

South Carolina English Language Arts

South Carolina English Language Arts South Carolina English Language Arts A S O F J U N E 2 0, 2 0 1 0, T H I S S TAT E H A D A D O P T E D T H E CO M M O N CO R E S TAT E S TA N DA R D S. DOCUMENTS REVIEWED South Carolina Academic Content

More information

SLINGERLAND: A Multisensory Structured Language Instructional Approach

SLINGERLAND: A Multisensory Structured Language Instructional Approach SLINGERLAND: A Multisensory Structured Language Instructional Approach nancycushenwhite@gmail.com Lexicon Reading Center Dubai Teaching Reading IS Rocket Science 5% will learn to read on their own. 20-30%

More information

Grade 4. Common Core Adoption Process. (Unpacked Standards)

Grade 4. Common Core Adoption Process. (Unpacked Standards) Grade 4 Common Core Adoption Process (Unpacked Standards) Grade 4 Reading: Literature RL.4.1 Refer to details and examples in a text when explaining what the text says explicitly and when drawing inferences

More information

Understanding and Supporting Dyslexia Godstone Village School. January 2017

Understanding and Supporting Dyslexia Godstone Village School. January 2017 Understanding and Supporting Dyslexia Godstone Village School January 2017 By then end of the session I will: Have a greater understanding of Dyslexia and the ways in which children can be affected by

More information

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. IV (Nov Dec. 2015), PP 01-07 www.iosrjournals.org Longest Common Subsequence: A Method for

More information

Name of Course: French 1 Middle School. Grade Level(s): 7 and 8 (half each) Unit 1

Name of Course: French 1 Middle School. Grade Level(s): 7 and 8 (half each) Unit 1 Name of Course: French 1 Middle School Grade Level(s): 7 and 8 (half each) Unit 1 Estimated Instructional Time: 15 classes PA Academic Standards: Communication: Communicate in Languages Other Than English

More information

Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools

Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools Dr. Amardeep Kaur Professor, Babe Ke College of Education, Mudki, Ferozepur, Punjab Abstract The present

More information

Author: Fatima Lemtouni, Wayzata High School, Wayzata, MN

Author: Fatima Lemtouni, Wayzata High School, Wayzata, MN Title: Do Greetings Reflect Culture? Language: Arabic Author: Fatima Lemtouni, Wayzata High School, Wayzata, MN Level: Beginning/Novice low When: Semester one Theme: How do we greet and introduce each

More information

Word-based dialect identification with georeferenced rules

Word-based dialect identification with georeferenced rules Word-based dialect identification with georeferenced rules Yves Scherrer LATL Université de Genève Genève, Switzerland yves.scherrer@unige.ch Owen Rambow CCLS Columbia University New York, USA rambow@ccls.columbia.edu

More information

Detecting English-French Cognates Using Orthographic Edit Distance

Detecting English-French Cognates Using Orthographic Edit Distance Detecting English-French Cognates Using Orthographic Edit Distance Qiongkai Xu 1,2, Albert Chen 1, Chang i 1 1 The Australian National University, College of Engineering and Computer Science 2 National

More information

Yoshida Honmachi, Sakyo-ku, Kyoto, Japan 1 Although the label set contains verb phrases, they

Yoshida Honmachi, Sakyo-ku, Kyoto, Japan 1 Although the label set contains verb phrases, they FlowGraph2Text: Automatic Sentence Skeleton Compilation for Procedural Text Generation 1 Shinsuke Mori 2 Hirokuni Maeta 1 Tetsuro Sasada 2 Koichiro Yoshino 3 Atsushi Hashimoto 1 Takuya Funatomi 2 Yoko

More information

Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed.

Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed. Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed. Speaking Standard Language Aspect: Purpose and Context Benchmark S1.1 To exit this

More information