Foundations of Natural Language Processing Lecture 18 Wrapup, review, and exam information

Similar documents
Natural Language Processing. George Konidaris

Analysis of Probabilistic Parsing in NLP

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Applications of memory-based natural language processing

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

CS 598 Natural Language Processing

Loughton School s curriculum evening. 28 th February 2017

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

Parsing of part-of-speech tagged Assamese Texts

Linking Task: Identifying authors and book titles in verbose queries

English Language and Applied Linguistics. Module Descriptions 2017/18

West s Paralegal Today The Legal Team at Work Third Edition

Context Free Grammars. Many slides from Michael Collins

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Cross Language Information Retrieval

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Degree Qualification Profiles Intellectual Skills

Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanities

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

BYLINE [Heng Ji, Computer Science Department, New York University,

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

The taming of the data:

The Role of the Head in the Interpretation of English Deverbal Compounds

What the National Curriculum requires in reading at Y5 and Y6

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

An Introduction to the Minimalist Program

a) analyse sentences, so you know what s going on and how to use that information to help you find the answer.

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

The Smart/Empire TIPSTER IR System

Grammars & Parsing, Part 1:

Backwards Numbers: A Study of Place Value. Catherine Perez

The Strong Minimalist Thesis and Bounded Optimality

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

Inteligencia Artificial. Revista Iberoamericana de Inteligencia Artificial ISSN:

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Using Semantic Relations to Refine Coreference Decisions

(Sub)Gradient Descent

Developing a large semantically annotated corpus

MYCIN. The MYCIN Task

CS Machine Learning

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

License to Deliver FAQs: Everything DiSC Workplace Certification

The Master Question-Asker

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

Prediction of Maximal Projection for Semantic Role Labeling

AQUA: An Ontology-Driven Question Answering System

Introduction, Organization Overview of NLP, Main Issues

Using dialogue context to improve parsing performance in dialogue systems

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

Developing a TT-MCTAG for German with an RCG-based Parser

Developing Grammar in Context

Part I. Figuring out how English works

Heuristic Sample Selection to Minimize Reference Standard Training Set for a Part-Of-Speech Tagger

Lecturing in the Preclinical Curriculum A GUIDE FOR FACULTY LECTURERS

Language Model and Grammar Extraction Variation in Machine Translation

University of Alberta. Large-Scale Semi-Supervised Learning for Natural Language Processing. Shane Bergsma

Welcome to the Purdue OWL. Where do I begin? General Strategies. Personalizing Proofreading

Multi-genre Writing Assignment

MYP Language A Course Outline Year 3

Mathematics Scoring Guide for Sample Test 2005

Clickthrough-Based Translation Models for Web Search: from Word Models to Phrase Models

Grade 6: Module 2A: Unit 2: Lesson 8 Mid-Unit 3 Assessment: Analyzing Structure and Theme in Stanza 4 of If

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

Lecture 1: Machine Learning Basics

MENTORING. Tips, Techniques, and Best Practices

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

The Ups and Downs of Preposition Error Detection in ESL Writing

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Word Sense Disambiguation

Speech Recognition at ICSI: Broadcast News and beyond

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -

Interactive Corpus Annotation of Anaphor Using NLP Algorithms

Virtually Anywhere Episodes 1 and 2. Teacher s Notes

Short Text Understanding Through Lexical-Semantic Analysis

The stages of event extraction

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

Distant Supervised Relation Extraction with Wikipedia and Freebase

Control and Boundedness

Natural Language Processing: Interpretation, Reasoning and Machine Learning

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

An Interactive Intelligent Language Tutor Over The Internet

SEMAFOR: Frame Argument Resolution with Log-Linear Models

Beyond the Pipeline: Discrete Optimization in NLP

Writing the Personal Statement

Proof Theory for Syntacticians

The Discourse Anaphoric Properties of Connectives

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

Transcription:

Foundations of Natural Language Processing Lecture 18 Wrapup, review, and exam information Alex Lascarides 23 March 2018 Alex Lascarides FNLP Lecture 18 23 March 2018

WARNING: this isn t the same course it was in 2015 and before When revising for the exam, past exam papers are useful, but be careful of overfitting. Most topics in common with last year The changed a lot in 2016 Different topics; some new approaches/models Alex Lascarides FNLP Lecture 18 1

Topics in common with previous years Corpora, annotation, evaluation Ambiguity at all levels N-gram models, entropy, smoothing Noisy channel framework Spelling correction, edit distance HMMs, part-of-speech tagging Syntax, parsing algorithms, PCFGs, other grammar formalisms Lexical semantics: word senses Alex Lascarides FNLP Lecture 18 2

Eliminated from previous years You will not be expected to answer questions about these topics. corpus markup mathematical details of backoff in N-gram models details of forward-backward algorithm for HMMs feature structure grammars crowdsourcing in detail implementation details of Good-Turing smoothing pronoun resolution discourse coherence Alex Lascarides FNLP Lecture 18 3

New since 2015 So past papers are not a good guide for these! Updated discussion of evaluation High-level overview of more modern smoothing methods (K-N) More complete example of spelling correction (end-to-end system) Generalized discussion of EM (showing application in both spelling correction and HMMs) Text classification (tasks and methods) Dependency grammar and related algorithms Semantic roles and distributional semantics Machine Translation (non-examinable this year; has been on some past papers). Alex Lascarides FNLP Lecture 18 4

Format of the exam As in previous years, the exam has two parts: Part A: 8 short-answer questions, each worth 3 marks (total of 24 marks). Part B: 3 longer questions worth 13 marks each, of which you must answer two (total of 26 marks). Be clear which questions you are answering. If you (start to) answer more than two, you must clearly cross out one answer. Alex Lascarides FNLP Lecture 18 5

What counts and what doesn t Things that do matter (not necessarily a complete list): Complete answer (double check you ve answered everything that was asked!) Clear explanations/reasoning where appropriate Correct equations, all variables defined Legible Alex Lascarides FNLP Lecture 18 6

Things that do not matter: What counts and what doesn t Perfect spelling/grammar/handwriting: as long as it is clear what you mean. Do not waste time writing drafts/copying over, but clearly cross out any scratch work that should not be marked. You can lose marks for have both correct and incorrect answers unless one is crossed out. Full sentences. If a word or short phrase conveys the meaning, no need for more. Alex Lascarides FNLP Lecture 18 7

Other ways to prepare Lecture summary slides are a good place to start: they don t have all the details, but make sure you understand the details underlying the main points mentioned. Do the labs! Make sure you understand the answers you get Heed any feedback on your courseworks and talk to your classmates or post on Piazza if you still don t understand. Post questions on Piazza. We will not always answer immediately but will try to ensure questions are answered. Exception: we will not answer any questions asked less than 48 hours before the exam. Alex Lascarides FNLP Lecture 18 8

What courses follow on next year? IAML: if you haven t already taken it, do! ML underlies most of NLP, and fourth year courses assume a strong background. Natural Language Understanding: more advanced models and algorithms for processing syntax, semantics, and discourse. Topics in NLP: covers some more advanced general techniques followed by student presentations based on current research papers. Machine Translation: will be a 20 point coursework-only course focusing on implementation of models and algorithms, plus looking at where they fail (i.e. linguistic issues). Automatic Speech Recognition: builds on knowledge from this course, but focuses on speech processing. Alex Lascarides FNLP Lecture 18 9

Other related courses Other machine learning courses (MLPR, MLP, PMR): These cover modern statistical approaches and deep learning models that are increasingly popular in NLP. Extreme computing: for dealing with huge data sets. Computational Cognitive Science, Topics in Cognitive Modeling: sections on computational models of human language processing. include Alex Lascarides FNLP Lecture 18 10

That s all folks! Alex Lascarides FNLP Lecture 18 11