Introduction to Machine Translation

Introduction to Machine Translation
CMSC 723 / LING 723 / INST 725
Marine Carpuat
Slides & figure credits: Philipp Koehn, mt-class.org

Today's topics: Machine Translation
- Historical Background: Machine Translation is an old idea
- Machine Translation Today: use cases and method
- Machine Translation Evaluation

1947: "When I look at an article in Russian, I say to myself: This is really written in English, but it has been coded in some strange symbols. I will now proceed to decode." (Warren Weaver)

1950s-1960s
- 1954: Georgetown-IBM experiment (250 words, 6 grammar rules)
- 1966: ALPAC report
  - Skeptical about research progress
  - Led to decreased US government funding for MT

Rule-based systems
- Approach
  - Build dictionaries
  - Write transformation rules
  - Refine, refine, refine
- Meteo system for weather forecasts (1976)
- Systran (1968),

1988: More about the IBM story: 20 years of bitext workshop

Statistical Machine Translation
- 1990s: increased research
- Mid 2000s: phrase-based MT (Moses, Google Translate)
- Around 2010: commercial viability
- Since mid 2010s: neural network models

MT History: Hype vs. Reality

How Good is Machine Translation? Chinese > English

How Good is Machine Translation? French > English

The Vauquois Triangle

Learning from Data
- What is the best translation?
- Counts in parallel corpus (aka bitext)
- Here: European Parliament corpus
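The counting idea can be sketched as a relative-frequency estimate: for a source word, count how often each target word is aligned to it in the bitext and normalize. The sketch below is a minimal illustration in Python; the function name `translation_probs` and the aligned pairs are made-up assumptions for the example, not counts from the European Parliament corpus.

```python
from collections import Counter


def translation_probs(aligned_pairs, source_word):
    """Relative-frequency estimate of p(target | source) from aligned word pairs."""
    counts = Counter(tgt for src, tgt in aligned_pairs if src == source_word)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tgt: c / total for tgt, c in counts.items()}


# Toy, made-up aligned pairs (hypothetical, not real corpus counts).
pairs = (
    [("Sicherheit", "security")] * 6
    + [("Sicherheit", "safety")] * 3
    + [("Sicherheit", "certainty")] * 1
)

probs = translation_probs(pairs, "Sicherheit")
best = max(probs, key=probs.get)  # most frequent translation in the toy data
print(best, probs)
```

With real data the aligned pairs would come from a word-alignment step over the parallel corpus, which this sketch takes as given.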

Learning from Data
- What is most fluent?
- A language modeling problem!
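One way to make "most fluent" concrete is to score candidate outputs with an n-gram language model. The sketch below assumes a bigram model with add-one smoothing trained on a tiny made-up corpus; the function names and the corpus are illustrative assumptions, and real systems use far larger data and better smoothing.

```python
import math
from collections import Counter


def train_bigram_lm(sentences):
    """Collect history and bigram counts from whitespace-tokenized sentences."""
    histories, bigrams = Counter(), Counter()
    vocab = {"</s>"}
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        vocab.update(tokens[1:])
        histories.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))
    return histories, bigrams, len(vocab)


def log_prob(sentence, model):
    """Add-one smoothed bigram log-probability of a sentence."""
    histories, bigrams, vocab_size = model
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    return sum(
        math.log((bigrams[(a, b)] + 1) / (histories[a] + vocab_size))
        for a, b in zip(tokens, tokens[1:])
    )


# Toy training corpus (made up for illustration).
model = train_bigram_lm(["a small step", "one small step for man", "a little boy"])

# The candidate whose word sequence was seen more often scores higher.
print(log_prob("a small step", model), log_prob("a little step", model))
```

The model prefers "a small step" because the bigram "small step" occurs in the toy corpus while "little step" does not.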

Word Alignment

Phrase-based Models
- Input segmented in phrases
- Each phrase is translated in output language
- Phrases are reordered

Neural MT

What is MT good (enough) for?
- Assimilation: reader initiates translation, wants to know content
  - User is tolerant of inferior quality
  - Focus of majority of research
- Communication: participants in conversation don't speak same language
  - Users can ask questions when something is unclear
  - Chat room translations, hand-held devices
  - Often combined with speech recognition
- Dissemination: publisher wants to make content available in other languages
  - High quality required
  - Almost exclusively done by human translators

Applications

State of the Art (rough estimates)

Today's topics: Machine Translation
- Historical Background: Machine Translation is an old idea
- Machine Translation Today: use cases and method
- Machine Translation Evaluation

How good is a translation? Problem: no single right answer

Evaluation
- How good is a given machine translation system?
- Many different translations acceptable
- Evaluation metrics
  - Subjective judgments by human evaluators
  - Automatic evaluation metrics
  - Task-based evaluation

Adequacy and Fluency
- Human judgment
  - Given: machine translation output
  - Given: input and/or reference translation
  - Task: assess quality of MT output
- Metrics
  - Adequacy: does the output convey the meaning of the input sentence? Is part of the message lost, added, or distorted?
  - Fluency: is the output fluent? Involves both grammatical correctness and idiomatic word choices.

Fluency and Adequacy: Scales

Let's try: rate fluency & adequacy on a 1-5 scale

Challenges in MT evaluation
- No single correct answer
- Human evaluators disagree

Automatic Evaluation Metrics
- Goal: computer program that computes quality of translations
- Advantages: low cost, optimizable, consistent
- Basic strategy
  - Given: MT output
  - Given: human reference translation
  - Task: compute similarity between them

Precision and Recall of Words

Precision and Recall of Words
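Word-level precision and recall can be sketched as follows: count the words the MT output shares with the reference (ignoring order), then divide by the output length for precision and by the reference length for recall. This is a minimal sketch assuming whitespace tokenization; the function name and the example sentences are illustrative assumptions.

```python
from collections import Counter


def word_precision_recall(output, reference):
    """Bag-of-words precision, recall and F-measure of an MT output against one reference."""
    out_tokens, ref_tokens = output.split(), reference.split()
    # Multiset intersection: each reference word can be matched at most once.
    correct = sum((Counter(out_tokens) & Counter(ref_tokens)).values())
    precision = correct / len(out_tokens) if out_tokens else 0.0
    recall = correct / len(ref_tokens) if ref_tokens else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Illustrative sentence pair: 3 shared words, 6 output words, 7 reference words.
p, r, f = word_precision_recall(
    "Israeli officials responsibility of airport safety",
    "Israeli officials are responsible for airport security",
)
print(p, r, f)
```

Because word order is ignored, a scrambled output containing the right words would still score well, which is one motivation for order-sensitive metrics.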

Word Error Rate

WER example
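Word error rate is commonly defined as the word-level edit distance (substitutions, insertions, deletions) between the MT output and the reference, divided by the reference length. The sketch below assumes that definition and whitespace tokenization; the function name and the example sentences are illustrative assumptions.

```python
def word_error_rate(output, reference):
    """Word-level Levenshtein distance divided by the reference length."""
    hyp, ref = output.split(), reference.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cur.append(min(
                prev[j] + 1,                          # reference word missing from output
                cur[j - 1] + 1,                       # extra word in output
                prev[j - 1] + (ref_word != hyp_word)  # match (0) or substitution (1)
            ))
        prev = cur
    return prev[-1] / len(ref)


wer = word_error_rate(
    "Israeli officials responsibility of airport safety",
    "Israeli officials are responsible for airport security",
)
print(wer)  # 4 edits over 7 reference words
```

Unlike bag-of-words precision and recall, this score penalizes word order differences, and it can exceed 1 when the output is much longer than the reference.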

BLEU: Bilingual Evaluation Understudy

Multiple Reference Translations

BLEU examples
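A simplified sentence-level BLEU can be sketched as the geometric mean of clipped n-gram precisions (n = 1 to 4) multiplied by a brevity penalty, with n-gram counts clipped against the maximum count in any reference. The sketch below is an illustration under those assumptions, not a reference implementation: standard BLEU is computed over a whole test set, and the function names here are hypothetical.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count all n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(output, references, max_n=4):
    """Simplified sentence-level BLEU with multiple references and no smoothing."""
    hyp = output.split()
    refs = [r.split() for r in references]
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = ngram_counts(hyp, n)
        max_ref_counts = Counter()
        for ref in refs:
            max_ref_counts |= ngram_counts(ref, n)   # element-wise maximum over references
        clipped = sum((hyp_counts & max_ref_counts).values())  # element-wise minimum
        total = sum(hyp_counts.values())
        if total == 0 or clipped == 0:
            return 0.0   # without smoothing, any zero precision gives a zero score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty against the reference length closest to the output length.
    ref_len = min((len(r) for r in refs), key=lambda l: (abs(l - len(hyp)), l))
    brevity = 1.0 if len(hyp) >= ref_len else math.exp(1 - ref_len / len(hyp))
    return brevity * math.exp(sum(log_precisions) / max_n)


print(sentence_bleu("the cat sat on the mat", ["the cat sat on the mat"]))
print(sentence_bleu("the the the cat", ["the cat"], max_n=2))
```

Adding references can only raise the clipped counts, so the same output never scores lower on n-gram precision with more references. The hard zero for a missing n-gram order is why sentence-level use usually needs smoothing.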

Semantics-aware metrics: e.g., METEOR

Drawbacks of Automatic Metrics
- All words are treated as equally relevant
- Operate on local level
- Scores are meaningless (absolute value not informative)
- Human translators score low on BLEU

Yet automatic metrics such as BLEU correlate with human judgement

Caveats: bias toward statistical systems

Automatic metrics
- Essential tool for system development
- Use with caution: not suited to rank systems of different types
- Still an open area of research
  - Connects with semantic analysis

Task-Based Evaluation: Post-Editing Machine Translation

Task-Based Evaluation: Content Understanding Tests

Today's topics: Machine Translation
- Historical Background: Machine Translation is an old idea
- Machine Translation Today: use cases and method
- Machine Translation Evaluation