Automatic translation in Chinese and English based on mixed strategy
Advanced Materials Research, Trans Tech Publications, Switzerland

Heng Chen 1,a and Yongsheng Yu 2,b
1 School of Foreign Languages, Wuhan Institute of Technology, Wuhan City, China
2 School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan City, China
a chenchrisheng@foxmail.com, b yuyosh@hotmail.com

Keywords: automatic translation; mixed strategy; regulation identification; combination of words; knowledge database

Abstract. Translation between English and Chinese is an essential part of language teaching, and automatic translation is a focus of language research. In this paper, we analyze the traditional translation system and the intelligent translation system and describe the working model of each. We then discuss the different granularities used in English and Chinese translation and their influence on translation, and propose a sentence construction model and a Chinese and English translation model. The analysis shows the advantages of phrase-based automatic translation. Finally, for the purpose of assisting English language teaching, we work out an automatic translation framework with a mixed strategy based on phrase collection, judgment, statistics and function-word connection. The framework includes building a translation knowledge database by analyzing translation document resources, identifying phrase and sentence models, and translating automatically on the basis of the knowledge database.

Introduction

Because translation is so important, efficient translation tools are always in demand. Machine translation now takes over a growing share of translation work from human translators. From the linguistic perspective, the translation process combines understanding and expression: understanding comes first, and expression follows with a careful choice of words.
Wittgenstein holds that the meaning of a word, the basic component of language, does not exist on its own. The nature of a word lies in its usage, so the same word has different meanings in different contexts. Besides, many distinguished Chinese translators stress "resemblance in spirit" rather than "resemblance in form". Human translation is therefore a complex, high-level thinking process, whereas computer intelligence remains at the low level of logical reasoning. Naturally, we can only require machine translation or computer-aided translation to come close to the original meaning on the basis of suitable word correspondences.

Translation methods can be classified into two categories: rule-based machine translation and corpus-based machine translation. Rule-based machine translation includes literal translation, transfer-based translation, and translation through a special intermediate language. Corpus-based methods include example-based translation and statistical translation.

[1] and [2] clarify the concept and definition of machine translation and analyze the prospects of its application, while [3] gives a comprehensive introduction to early computer-aided translation. In terms of research methods, [4] puts forward computer-aided translation based on dictionaries, whereas [5] approaches the topic from the linguistic perspective. [6] analyzes English and Chinese translation through semiotics. [7] and [8] are instrumental in establishing corpora. [10] studies computer-aided translation based on the thinking process of language. All this research opens up new horizons for computer-aided translation, but current results still do not satisfy the public.

Automatic translation systems that use different translation methods have different features. In a literature translation agency, people pay close attention to the integrity and consistency of the whole document.
For assisted English language teaching, however, the accuracy of sentence translation attracts more attention. It is therefore important to consider different methods for different applications. Building on existing studies, this paper examines the main design of machine translation systems, compares the advantages and disadvantages of rule-based and corpus-based machine translation, and discusses the difficulties and the structure of automatic translation systems. It then proposes an automatic translation model based on mixed strategies and provides a framework for the special application domain of assisted English language teaching.

The analysis of the translation model

There are many methods of automatic translation, which can be grouped into two categories: rule-based machine translation and corpus-based machine translation. In terms of how translation software between Chinese and English works, the working models can be classified into the traditional translation model and the intelligent translation model.

The traditional translation model of automatic translation

Traditional translation tools are based on words. Their principle is to incorporate a dictionary into the translation software, so that the tool can offer convenient functions such as spoken pronunciation and a choice of several dictionaries. The traditional translation model is illustrated in Fig. 1, with the dictionary as the database. The retrieval process is similar to consulting a dictionary. This model has some problems: the source of the database is limited and software usability is poor. Some software upgrades the traditional e-dictionary and enlarges its database to raise word-to-word translation to the level of phrase-to-phrase translation. In the typical case illustrated in Fig. 1, the database collects collocations as well as dictionaries.

Fig. 1. The traditional translation model.
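As a rough illustration of the word-based lookup described above, the following Python sketch treats the database as a plain mapping from a Chinese word to its recorded English translations, plus a small collocation table. The entries and the names `DICTIONARY`, `COLLOCATIONS`, `look_up` and `look_up_collocation` are hypothetical and are not taken from any particular tool.

```python
# Hypothetical miniature database; the entries are illustrative only.
DICTIONARY = {
    "打": ["hit", "play", "call"],
    "银行": ["bank"],
}

# Assumed collocation table, keyed by a tuple of words.
COLLOCATIONS = {
    ("打", "电话"): "make a phone call",
}


def look_up(word):
    """Return every recorded translation of a single word, like consulting a dictionary."""
    return DICTIONARY.get(word, [])


def look_up_collocation(words):
    """Return the recorded translation of a collocation, or None if it is not recorded."""
    return COLLOCATIONS.get(tuple(words))


print(look_up("打"))
print(look_up_collocation(["打", "电话"]))
```

A lookup returns every recorded candidate and leaves the choice to the user; a word that is missing from the database simply yields an empty list.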
The intelligent translation model

The traditional translation model is based on words, and at this granularity a many-to-many mapping exists between Chinese and English words. Using such a tool is like consulting a dictionary: the user has to choose the best translation. The intelligent translation model is based either on the rules for forming sentences and phrases, so that it translates above word granularity rather than word by word, or on translation frequency, which lets the user pick a well-attested rendering. A translation model based on statistics over analyzed literature shows how frequently each translation occurs, so that the translator can choose the widely accepted one.

As for rule-based machine translation, many commercial systems rest on rule sets. First, the source language is analyzed into a source internal representation identified by the rule sets. Next, this source internal representation is converted into the internal representation of the target language. Finally, it is transformed into the target language and shown to the user. Other methods use a single intermediate third language: for every language pair, the source material is first translated into the third language and then from the third language into the target language. This method extends well and makes translation among multiple languages easy, but the third language itself is difficult to design.

Fig. 2 shows the working process of the statistical translation model. It analyzes the titles, keywords and abstracts of scientific papers and then stores the analysis results in its statistical database. This method does not give any judgment on a translation. It only shows the statistical frequency of each translation.
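The frequency listing described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the bilingual titles, keywords and abstracts have already been reduced to (Chinese term, English rendering) pairs; the sample pairs and the names `build_statistics` and `ranked_translations` are hypothetical.

```python
from collections import Counter

# Hypothetical term pairs, as if harvested from bilingual titles, keywords and abstracts.
OBSERVED_PAIRS = [
    ("机器翻译", "machine translation"),
    ("机器翻译", "machine translation"),
    ("机器翻译", "automatic translation"),
    ("机器翻译", "machine translation"),
    ("知识库", "knowledge base"),
]


def build_statistics(pairs):
    """Count how often each English rendering was observed for each Chinese term."""
    stats = {}
    for source, target in pairs:
        stats.setdefault(source, Counter())[target] += 1
    return stats


def ranked_translations(stats, term):
    """List (rendering, relative frequency) pairs, most frequent first, without judging them."""
    counts = stats.get(term)
    if not counts:
        return []
    total = sum(counts.values())
    return [(target, count / total) for target, count in counts.most_common()]


stats = build_statistics(OBSERVED_PAIRS)
for rendering, share in ranked_translations(stats, "机器翻译"):
    print(f"{rendering}: {share:.2f}")
```

The sketch only reports how often each rendering occurs; as in the model it illustrates, deciding which rendering is correct is left to the user.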
The statistical translation tool presents all of the statistical translation results to users so that they can choose the one they prefer, which is a great help in academic translation. But it does not make any judgment on the translation itself, so users sometimes feel confused. Besides, it cannot exert its full power when some terms have few English translations in the database.

Fig. 2. The statistical translation model.

Granularity of automatic translation

If we take a complete statement as the unit of study, the statement model can be expressed as:

$$P_{\text{Sentence}} = F[V, S, C, T] \qquad (1)$$

In Eq. 1, V represents the vocabulary, S represents the construction model, C represents the context, and T represents the emotional elements of the statement. Each part reflects a different aspect of the statement, and the four parts together constitute a complete statement. Based on Eq. 1, the process of translation between Chinese and English can be expressed as:

$$F_{\text{Chinese}}[V, S, C, T] \rightarrow F_{\text{English}}[V, S, C, T] \qquad (2)$$

Eq. 2 shows that the translation process is a mapping and replacement between the parts V, S, C, T of the Chinese sentence and the corresponding parts of the English sentence.

Firstly, we analyze the vocabulary. Different automatic translation systems work at different granularities: single words, phrases and short sentences can all be viewed as vocabulary. Words usually have a many-to-many mapping relationship: one Chinese word has many related English synonyms, and vice versa. But these synonyms actually differ from one another in ways that cannot be identified at the granularity of words.

$$V[w_{C1}, w_{C2}, \ldots, w_{Cn}] \rightarrow V[\{w_{E1}^{1}, w_{E1}^{2}, \ldots, w_{E1}^{p_1}\}, \{w_{E2}^{1}, w_{E2}^{2}, \ldots, w_{E2}^{p_2}\}, \ldots, \{w_{En}^{1}, w_{En}^{2}, \ldots, w_{En}^{p_n}\}] \qquad (3)$$

In Eq. 3, a Chinese sentence has n basic words, and each word has many related English synonyms: $w_{C1}$ has $p_1$ translation correspondents, and $w_{Cn}$ has $p_n$ translation correspondents. Such a many-to-many mapping makes translation difficult at the granularity of words.
Now, if we take the word group as the translation unit, the number of items in the mapping domain is significantly reduced. If we take phrases, combinations of words or short sentences as the translation unit, Eq. 3 is transformed into Eq. 4:

$$V[w_{C1}, w_{C2}, \ldots, w_{Ck}, \ldots, w_{Ct}] \rightarrow V[\{w_{E1}\}, \ldots, \{w_{Ek}\}, \ldots, \{w_{Et}^{1}, w_{Et}^{2}, \ldots, w_{Et}^{r_t}\}] \qquad (4)$$

So if we take larger parts as translation units and record the one-to-one mapping pairs in the knowledge database, we can improve the automatic translation system.

Next we study the construction model. A sentence is composed of different sentence elements. Each language has a series of basic sentence elements, and these elements have similar representations in different languages.

$$S[D_1, D_2, \ldots, D_i, L] \rightarrow S[D'_1, D'_2, \ldots, D'_j, L'] \qquad (5)$$

In Eq. 5, $D_1$ to $D_i$ and $D'_1$ to $D'_j$ represent the sentence elements in Chinese and English, and $L$ and $L'$ denote the conjunction models. Collecting and recording structure translation material in the knowledge database can greatly improve automatic translation. As for the last two items, C and T, the context relationship and the emotion of a statement depend on the translation of the whole document, where difficulties still remain.
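To make the effect of granularity in Eqs. 3 and 4 concrete, the sketch below counts the candidate English renderings of one sentence, first word by word and then with one recorded word group treated as a single unit. The candidate lists, the assumed knowledge-base entry and the name `count_combinations` are hypothetical; the count is simply the product of the number of candidates per translation unit.

```python
from math import prod

# Hypothetical candidates per translation unit; a unit is a tuple of Chinese words.
CANDIDATES = {
    ("给",): ["give", "to", "for"],
    ("我",): ["I", "me"],
    ("打",): ["hit", "play", "call"],
    ("电话",): ["telephone", "phone", "call"],
    # Assumed knowledge-base entry: a word group recorded with a one-to-one mapping.
    ("打", "电话"): ["call"],
}


def count_combinations(units):
    """Number of candidate renderings: the product of the candidates of each unit."""
    return prod(len(CANDIDATES[unit]) for unit in units)


word_units = [("给",), ("我",), ("打",), ("电话",)]   # word granularity, as in Eq. 3
group_units = [("给",), ("我",), ("打", "电话")]      # one word group as a unit, as in Eq. 4

word_level = count_combinations(word_units)    # 3 * 2 * 3 * 3
group_level = count_combinations(group_units)  # 3 * 2 * 1
print(word_level, group_level)
```

With these illustrative entries the candidate space shrinks from 54 combinations to 6 once the word group is matched as a whole, which is the reduction the text attributes to larger translation units.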
Framework of automatic translation based on mixed strategies

From the above discussion, we can see that the essence of automatic translation is mapping and conversion in a pragmatic context. Currently, some commercial translation systems are rule-based and some are corpus-based. Both approaches have their advantages and disadvantages, so commercial systems have their limitations.

The translation model is connected with the range of its application. For assisting English language teaching, the focus is an accurate and apt choice of words, word groups and sentence construction, whereas commercial translation pays more attention to translating whole documents. At the granularity of the sentence, construction by conjunctions is very important, so the system should identify how conjunctions relate to the other words in the sentence, that is, how they connect words, phrases, clauses or sentences.

The key to automatic translation lies in phrases and in the grammatical relationship between function words and the other words in a sentence. Collecting, organizing and identifying fixed phrases and their translations is therefore crucial. Phrases in this sense are close to well-acknowledged, fixed, conventional word groups, which are useful in automatic translation. New words and new usages are endless, so it is not workable to add them to the database by hand every time; this work has to be left to the computer. Setting the processing mode and the rules for identifying word groups is therefore vital for automatic identification and analysis in translation tools. For conjunctions, as noted above, it is important to identify how they relate to the other words in the sentence.
We should record these relationships between conjunctions and other words in the knowledge database so that they can be matched during sentence translation. Based on the above discussion, and with the purpose of assisting English language teaching, the framework of the intelligent machine translation system based on mixed strategies should include two parts: an analysis module and a translation module. The process of the analysis is illustrated in Fig. 3.

Fig. 3. The scheme of auto-translation based on mixed strategy.

A. Introducing resources. The original resources include dictionaries, scientific literature, translation masterpieces, etc. Dictionaries are the necessary basis of translation. Scientific papers usually have Chinese and English titles, keywords and abstracts, which are an important source for the translation knowledge database. The more abundant the resources are, the richer the content of the translation database will be, which in turn improves computer-aided translation.

B. Analyzing the phrases. In this phase, sentences are analyzed and separated into words, phrases and word groups. Keywords in scientific literature can be collected directly, while examples in dictionaries, abstracts in scientific literature and translation masterpieces all have to be analyzed comparatively by the translation system across the two languages, English and Chinese. For every pair of corresponding Chinese and English sentences, we should find the matching relationships between the Chinese and English words and check the content words and their neighbors to see whether collocations or fixed or conventional word groups exist. Word groups should be extended from two-word combinations up to all possible combinations in order to identify all possible word groups.
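The extension of word groups from two-word combinations to longer ones can be sketched as follows. This is a minimal illustration under stated assumptions: sentences are tokenised by whitespace, only contiguous groups are considered, and a group counts as a candidate when it recurs in at least `min_count` sentences. The function names and the fixed cutoff are assumptions made for the example, not part of the framework.

```python
from collections import Counter


def word_groups(tokens, min_len=2):
    """Yield every contiguous word group, from two words up to the whole sentence."""
    n = len(tokens)
    for length in range(min_len, n + 1):
        for start in range(n - length + 1):
            yield tuple(tokens[start:start + length])


def recurring_groups(sentences, min_count=2):
    """Return the word groups that occur in at least min_count sentences."""
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.lower().split()
        # A set ensures each group is counted once per sentence.
        counts.update(set(word_groups(tokens)))
    return {group: count for group, count in counts.items() if count >= min_count}


# Hypothetical sample sentences.
sentences = [
    "we take part in the discussion",
    "students take part in class",
    "they take a break",
]
groups = recurring_groups(sentences)
for group in sorted(groups, key=len, reverse=True):
    print(" ".join(group), groups[group])
```

On the sample sentences, only "take part", "part in" and "take part in" recur, so they are the groups that would be checked against their Chinese counterparts. A real system would also have to align each group with its translation, which this sketch leaves out.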
C. Analyzing the sentence. The analysis of function words is vital in sentence translation. They should therefore be extracted, and their possible relationships with other words analyzed according to what has been recorded in the database. After the function words and content words have been analyzed against the database, the system can provide translations of the sentence.

D. Intelligent identification. The key to setting up a translation database lies in the standard used to identify fixed usages and to judge whether a sentence is correct. Predetermined thresholds are unreasonable, because in actual applications there are so many different word combinations that no unified judgment standard can be set. Here we adopt an intelligent algorithm: after the translation results have been listed statistically, the system sets and adjusts the threshold on the basis of historical records.

The working process of the translation module is as follows. Sentences are analyzed one by one. The words in a sentence are grouped into different combinations, and longer word groups take priority when matching against the database. If no match is found, the word groups are split into shorter word groups or combinations until every word group finds its translation in the database. At the same time, function words, especially conjunctions, are analyzed according to their relationship with the content words in the database. After all this, the automatic translation system can offer the translation of the sentence.

Conclusions

With the development of computer science, some influential automatic translation systems have emerged. This paper studies the working process of these systems.
We then discuss the granularity of the automatic translation unit, from single words to phrases, word groups and short sentences, and present the statement model based on vocabulary, construction model, context and the emotional elements of the statement. Through these studies, we put forward the framework of an automatic translation system based on a mixed strategy for the special application of assisting English language teaching.

References

[1] Makoto Nagao. Machine Translation, How Far Can It Go? London: Oxford University Press, 1986.
[2] W. John Hutchins et al. An Introduction to Machine Translation. US: Academic Press, 1992.
[3] W. John Hutchins. Machine Translation: Past, Present, Future. London: Ellis Horwood Limited, 1986.
[4] Shuttleworth, Mark & Cowie, Moria. Dictionary of Translation Studies. Manchester: St. Jerome Publishing.
[5] George Yule. The Study of Language. Beijing: Foreign Language Teaching & Research Press.
[6] R. Al-kahnjii, S. El-shiyab, R. Hussein. On the Use of Compensatory Strategies in Simultaneous Interpretation. Journal des traducteurs, 45(3).
[7] F. Och, H. Ney. Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, 29(1), 2003.
[8] P. Brown, S. Chen, S. DellaPietra, et al. Automatic Speech Recognition in Machine Aided Translation. Computer Speech and Language, 8(3), 1994.
[9] Esselink, Bert. Practical Guide to Localization. Amsterdam: John Benjamins Pub Co.
[10] Feng Zhiwei. Computer Processing of Natural Languages. Shanghai: Shanghai Foreign Language & Education Press.
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)
More informationA Case Study: News Classification Based on Term Frequency
A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center
More informationEmpirical research on implementation of full English teaching mode in the professional courses of the engineering doctoral students
Empirical research on implementation of full English teaching mode in the professional courses of the engineering doctoral students Yunxia Zhang & Li Li College of Electronics and Information Engineering,
More informationCross Language Information Retrieval
Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................
More informationConstructing a support system for self-learning playing the piano at the beginning stage
Alma Mater Studiorum University of Bologna, August 22-26 2006 Constructing a support system for self-learning playing the piano at the beginning stage Tamaki Kitamura Dept. of Media Informatics, Ryukoku
More informationWriting a composition
A good composition has three elements: Writing a composition an introduction: A topic sentence which contains the main idea of the paragraph. a body : Supporting sentences that develop the main idea. a
More informationLongest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. IV (Nov Dec. 2015), PP 01-07 www.iosrjournals.org Longest Common Subsequence: A Method for
More informationWelcome to the Purdue OWL. Where do I begin? General Strategies. Personalizing Proofreading
Welcome to the Purdue OWL This page is brought to you by the OWL at Purdue (http://owl.english.purdue.edu/). When printing this page, you must include the entire legal notice at bottom. Where do I begin?
More informationSpeech Emotion Recognition Using Support Vector Machine
Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationA Study of Metacognitive Awareness of Non-English Majors in L2 Listening
ISSN 1798-4769 Journal of Language Teaching and Research, Vol. 4, No. 3, pp. 504-510, May 2013 Manufactured in Finland. doi:10.4304/jltr.4.3.504-510 A Study of Metacognitive Awareness of Non-English Majors
More informationHow to Judge the Quality of an Objective Classroom Test
How to Judge the Quality of an Objective Classroom Test Technical Bulletin #6 Evaluation and Examination Service The University of Iowa (319) 335-0356 HOW TO JUDGE THE QUALITY OF AN OBJECTIVE CLASSROOM
More informationEffectiveness of Electronic Dictionary in College Students English Learning
2016 International Conference on Mechanical, Control, Electric, Mechatronics, Information and Computer (MCEMIC 2016) ISBN: 978-1-60595-352-6 Effectiveness of Electronic Dictionary in College Students English
More informationTaking into Account the Oral-Written Dichotomy of the Chinese language :
Taking into Account the Oral-Written Dichotomy of the Chinese language : The division and connections between lexical items for Oral and for Written activities Bernard ALLANIC 安雄舒长瑛 SHU Changying 1 I.
More informationArizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS
Arizona s English Language Arts Standards 11-12th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS 11 th -12 th Grade Overview Arizona s English Language Arts Standards work together
More informationAN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)
B. PALTRIDGE, DISCOURSE ANALYSIS: AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC. 2012. PP. VI, 282) Review by Glenda Shopen _ This book is a revised edition of the author s 2006 introductory
More informationModeling user preferences and norms in context-aware systems
Modeling user preferences and norms in context-aware systems Jonas Nilsson, Cecilia Lindmark Jonas Nilsson, Cecilia Lindmark VT 2016 Bachelor's thesis for Computer Science, 15 hp Supervisor: Juan Carlos
More informationAQUA: An Ontology-Driven Question Answering System
AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.
More informationA Note on Structuring Employability Skills for Accounting Students
A Note on Structuring Employability Skills for Accounting Students Jon Warwick and Anna Howard School of Business, London South Bank University Correspondence Address Jon Warwick, School of Business, London
More informationNotes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1
Notes on The Sciences of the Artificial Adapted from a shorter document written for course 17-652 (Deciding What to Design) 1 Ali Almossawi December 29, 2005 1 Introduction The Sciences of the Artificial
More informationThought and Suggestions on Teaching Material Management Job in Colleges and Universities Based on Improvement of Innovation Capacity
Thought and Suggestions on Teaching Material Management Job in Colleges and Universities Based on Improvement of Innovation Capacity Lihua Geng 1 & Bingjun Yao 1 1 Changchun University of Science and Technology,
More informationStrategy Study on Primary School English Game Teaching
6th International Conference on Electronic, Mechanical, Information and Management (EMIM 2016) Strategy Study on Primary School English Game Teaching Feng He Primary Education College, Linyi University
More informationZHANG Xiaojun, XIONG Xiaoliang School of Finance and Business English, Wuhan Yangtze Business University, P.R.China,
Studies on the Characteristic Training Mode of Foreign Business Talents of Private University Taking International Economy and Trade Major of Wuhan Yangtze Business University as an Example ZHANG Xiaojun,
More informationLISTENING STRATEGIES AWARENESS: A DIARY STUDY IN A LISTENING COMPREHENSION CLASSROOM
LISTENING STRATEGIES AWARENESS: A DIARY STUDY IN A LISTENING COMPREHENSION CLASSROOM Frances L. Sinanu Victoria Usadya Palupi Antonina Anggraini S. Gita Hastuti Faculty of Language and Literature Satya
More informationMining Association Rules in Student s Assessment Data
www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama
More informationINNOWIZ: A GUIDING FRAMEWORK FOR PROJECTS IN INDUSTRIAL DESIGN EDUCATION
INTERNATIONAL CONFERENCE ON ENGINEERING AND PRODUCT DESIGN EDUCATION 8 & 9 SEPTEMBER 2011, CITY UNIVERSITY, LONDON, UK INNOWIZ: A GUIDING FRAMEWORK FOR PROJECTS IN INDUSTRIAL DESIGN EDUCATION Pieter MICHIELS,
More informationTHE VERB ARGUMENT BROWSER
THE VERB ARGUMENT BROWSER Bálint Sass sass.balint@itk.ppke.hu Péter Pázmány Catholic University, Budapest, Hungary 11 th International Conference on Text, Speech and Dialog 8-12 September 2008, Brno PREVIEW
More informationReducing Features to Improve Bug Prediction
Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science
More informationSpeech Recognition at ICSI: Broadcast News and beyond
Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI
More informationData Fusion Models in WSNs: Comparison and Analysis
Proceedings of 2014 Zone 1 Conference of the American Society for Engineering Education (ASEE Zone 1) Data Fusion s in WSNs: Comparison and Analysis Marwah M Almasri, and Khaled M Elleithy, Senior Member,
More informationWord Segmentation of Off-line Handwritten Documents
Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department
More informationAustralian Journal of Basic and Applied Sciences
AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Feature Selection Technique Using Principal Component Analysis For Improving Fuzzy C-Mean
More informationTwitter Sentiment Classification on Sanders Data using Hybrid Approach
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders
More informationUSER ADAPTATION IN E-LEARNING ENVIRONMENTS
USER ADAPTATION IN E-LEARNING ENVIRONMENTS Paraskevi Tzouveli Image, Video and Multimedia Systems Laboratory School of Electrical and Computer Engineering National Technical University of Athens tpar@image.
More informationMastering Team Skills and Interpersonal Communication. Copyright 2012 Pearson Education, Inc. publishing as Prentice Hall.
Chapter 2 Mastering Team Skills and Interpersonal Communication Chapter 2-1 Communicating Effectively in Teams Chapter 2-2 Communicating Effectively in Teams Collaboration involves working together to
More informationDeveloping True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability
Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Shih-Bin Chen Dept. of Information and Computer Engineering, Chung-Yuan Christian University Chung-Li, Taiwan
More informationPAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))
Ohio Academic Content Standards Grade Level Indicators (Grade 11) A. ACQUISITION OF VOCABULARY Students acquire vocabulary through exposure to language-rich situations, such as reading books and other
More informationMYP Language A Course Outline Year 3
Course Description: The fundamental piece to learning, thinking, communicating, and reflecting is language. Language A seeks to further develop six key skill areas: listening, speaking, reading, writing,
More informationBANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS
Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.
More informationDerivational and Inflectional Morphemes in Pak-Pak Language