Natural Language Processing: Part II Overview of Natural Language Processing (L90): ACS

Similar documents
AQUA: An Ontology-Driven Question Answering System

Applications of memory-based natural language processing

Linking Task: Identifying authors and book titles in verbose queries

5 th Grade Language Arts Curriculum Map

English Language and Applied Linguistics. Module Descriptions 2017/18

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Compositional Semantics

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

ScienceDirect. Malayalam question answering system

Cross Language Information Retrieval

Parsing of part-of-speech tagged Assamese Texts

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Cambridge NATIONALS. Creative imedia Level 1/2. UNIT R081 - Pre-Production Skills DELIVERY GUIDE

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Developing Grammar in Context

BYLINE [Heng Ji, Computer Science Department, New York University,

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

Zotero: A Tool for Constructionist Learning in Critical Information Literacy

Speech Recognition at ICSI: Broadcast News and beyond

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)

Loughton School s curriculum evening. 28 th February 2017

Opportunities for Writing Title Key Stage 1 Key Stage 2 Narrative

Postprint.

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

The Smart/Empire TIPSTER IR System

Natural Language Processing. George Konidaris

MYCIN. The MYCIN Task

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

10 Tips For Using Your Ipad as An AAC Device. A practical guide for parents and professionals

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

Probabilistic Latent Semantic Analysis

Subject: Opening the American West. What are you teaching? Explorations of Lewis and Clark

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Universiteit Leiden ICT in Business

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

A Case Study: News Classification Based on Term Frequency

An Interactive Intelligent Language Tutor Over The Internet

Secondary English-Language Arts

CS 598 Natural Language Processing

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

Rule-based Expert Systems

A Comparison of Two Text Representations for Sentiment Analysis

Reading Grammar Section and Lesson Writing Chapter and Lesson Identify a purpose for reading W1-LO; W2- LO; W3- LO; W4- LO; W5-

The Werewolf Knight Drama. School Drama TM

1.2 Interpretive Communication: Students will demonstrate comprehension of content from authentic audio and visual resources.

Part I. Figuring out how English works

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

Jacqueline C. Kowtko, Patti J. Price Speech Research Program, SRI International, Menlo Park, CA 94025

SEMAFOR: Frame Argument Resolution with Log-Linear Models

Lucy Calkins Units of Study 3-5 Heinemann Books Support Document. Designed to support the implementation of the Lucy Calkins Curriculum

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing

BUSINESS OCR LEVEL 2 CAMBRIDGE TECHNICAL. Cambridge TECHNICALS BUSINESS ONLINE CERTIFICATE/DIPLOMA IN R/502/5326 LEVEL 2 UNIT 11

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Grade 11 Language Arts (2 Semester Course) CURRICULUM. Course Description ENGLISH 11 (2 Semester Course) Duration: 2 Semesters Prerequisite: None

Trend Survey on Japanese Natural Language Processing Studies over the Last Decade

Assignment 1: Predicting Amazon Review Ratings

MYP Language A Course Outline Year 3

Achievement Level Descriptors for American Literature and Composition

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

The stages of event extraction

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

An Introduction to the Minimalist Program

What the National Curriculum requires in reading at Y5 and Y6

Dickinson ISD ELAR Year at a Glance 3rd Grade- 1st Nine Weeks

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Lecture 1: Basic Concepts of Machine Learning

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Physics 270: Experimental Physics

Evaluation of Learning Management System software. Part II of LMS Evaluation

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Python Machine Learning

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Guidelines for Project I Delivery and Assessment Department of Industrial and Mechanical Engineering Lebanese American University

Multilingual Sentiment and Subjectivity Analysis

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Aspectual Classes of Verb Phrases

This course has been proposed to fulfill the Individuals, Institutions, and Cultures Level 1 pillar.

Initial English Language Training for Controllers and Pilots. Mr. John Kennedy École Nationale de L Aviation Civile (ENAC) Toulouse, France.

Linguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Prentice Hall Literature: Timeless Voices, Timeless Themes Gold 2000 Correlated to Nebraska Reading/Writing Standards, (Grade 9)

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

A Correlation of. Grade 6, Arizona s College and Career Ready Standards English Language Arts and Literacy

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Procedia - Social and Behavioral Sciences 154 ( 2014 )

A First-Pass Approach for Evaluating Machine Translation Systems

Machine Learning and Development Policy

Read&Write Gold is a software application and can be downloaded in Macintosh or PC version directly from

Transcription:

Natural Language Processing: Part II Overview of Natural Language Processing (L90): ACS Ann Copestake Computer Laboratory University of Cambridge October 2017

Outline of today s lecture Overview of the course Why NLP is hard Scope of NLP A sample application: sentiment classification More NLP applications NLP subtasks

Part II / ACS / MLSALT Part II 12 lectures, assessed by exam questions (as before). Supervisions. ACS L90 Overview of NLP: other modules go into much greater depth: L90 intended for people with no substantial background in NLP. Same 12 lectures as Part II, plus extended practical organised by Helen Yannakoudakis (with demonstrators). No supervisions, Q&A session(s) will be offered. MLSALT, Engineering: Milica Gasic

Also note: Lecture notes in batches. No notes for lecture 12: not directly examinable. Slides: on web page (in advance where possible), but possible (slight) differences to slides used in lecture. Exercises: pre-lecture and post-lecture. Glossary in lecture notes. Webpage with links to demos etc. Recommended Book: Jurafsky and Martin (2008). Linguistics background: Bender (2013).

Overview of the course NLP and linguistics NLP: the computational modelling of human language. 1. Morphology the structure of words: lecture 2. 2. Syntax the way words are used to form phrases: lectures 3, 4 and 5. 3. Semantics Compositional semantics the construction of meaning based on syntax: lecture 6. Lexical semantics the meaning of individual words: lecture 7, 8 and 9 (sort of). 4. Pragmatics meaning in context: lecture 10. 5. Language generation lecture 11. 6. Some current research lecture 12.

Why NLP is hard Querying a knowledge base User query: Has my order number 4291 been shipped yet? Database: ORDER Order number Date ordered Date shipped 4290 2/2/13 2/2/13 4291 2/2/13 2/2/13 4292 2/2/13 USER: Has my order number 4291 been shipped yet? DB QUERY: order(number=4291,date_shipped=?) RESPONSE: Order number 4291 was shipped on 2/2/13

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Why is this difficult? Similar strings mean different things, different strings mean the same thing: 1. How fast is the TZ? 2. How fast will my TZ arrive? 3. Please tell me when I can expect the TZ I ordered. Ambiguity: Do you sell Sony laptops and disk drives? Do you sell (Sony (laptops and disk drives))? Do you sell (Sony laptops) and disk drives)?

Why NLP is hard Wouldn t it be better if...? The properties which make natural language difficult to process are essential to human communication: Flexible Learnable but compact Emergent, evolving systems Synonymy and ambiguity go along with these properties. Natural language communication can be indefinitely precise: Ambiguity is mostly local (for humans) Semi-formal additions and conventions for different genres

Why NLP is hard Wouldn t it be better if...? The properties which make natural language difficult to process are essential to human communication: Flexible Learnable but compact Emergent, evolving systems Synonymy and ambiguity go along with these properties. Natural language communication can be indefinitely precise: Ambiguity is mostly local (for humans) Semi-formal additions and conventions for different genres

Scope of NLP Some NLP applications spelling and grammar checking predictive text optical character recognition (OCR) screen readers augmentative and alternative communication machine aided translation lexicographers tools information retrieval document classification document clustering information extraction sentiment classification text mining

Scope of NLP More NLP applications... question answering summarization text segmentation exam marking language teaching report generation machine translation natural language interfaces to databases email understanding dialogue systems

A sample application: sentiment classification Opinion mining: what do they think about me? Task: scan documents (webpages, tweets etc) for positive and negative opinions on people, products etc. Find all references to entity in some document collection: list as positive, negative (possibly with strength) or neutral. Construct summary report plus examples (text snippets). Fine-grained classification: e.g., for phone, opinions about: design, performance, battery life...

A sample application: sentiment classification iphone 8 review (Guardian 29/9/2017) The iphone 8 has Apple s latest and best processor. The six-core A11 Bionic has two high-performance cores and four power-efficient cores and is apparently the most powerful so far because it can use a combination of all six at once. Performance was excellent, but I struggled to see a real difference in day-to-day speed compared to the iphone 7. But what I m very pleased to be able to report is that Apple has finally improved battery life for the 4.7in iphone. We re not talking a two-day battery here, but the iphone 8 lasted just over 26 hours...

A sample application: sentiment classification Sentiment classification: the research task Full task: information retrieval, cleaning up text structure, named entity recognition, identification of relevant parts of text. Evaluation by humans. Research task: preclassified documents, topic known, opinion in text along with some straightforwardly extractable score. Movie review corpus (Pang et al 2002): strongly positive or negative reviews from IMDb, 50:50 split, with rating score.

A sample application: sentiment classification IMDb: An American Werewolf in London (1981) Rating: 9/10 Ooooo. Scary. The old adage of the simplest ideas being the best is once again demonstrated in this, one of the most entertaining films of the early 80 s, and almost certainly Jon Landis best work to date. The script is light and witty, the visuals are great and the atmosphere is top class. Plus there are some great freeze-frame moments to enjoy again and again. Not forgetting, of course, the great transformation scene which still impresses to this day. In Summary: Top banana

A sample application: sentiment classification Bag of words technique Treat the reviews as collections of individual words. Classify reviews according to positive or negative words. Could use word lists prepared by humans, but machine learning based on a portion of the corpus (training set) is preferable. Use human rankings for training and evaluation. Pang et al, 2002: Chance success is 50% (corpus artificially balanced), bag-of-words gives 80%.

A sample application: sentiment classification Some sources of errors for bag-of-words Negation: Ridley Scott has never directed a bad film. Overfitting the training data: e.g., if training set includes a lot of films from before 2005, Ridley may be a strong positive indicator, but then we test on reviews for Kingdom of Heaven? Comparisons and contrasts.

A sample application: sentiment classification Contrasts in the discourse This film should be brilliant. It sounds like a great plot, the actors are first grade, and the supporting cast is good as well, and Stallone is attempting to deliver a good performance. However, it can t hold up.

A sample application: sentiment classification More contrasts AN AMERICAN WEREWOLF IN PARIS is a failed attempt... Julie Delpy is far too good for this movie. She imbues Serafine with spirit, spunk, and humanity. This isn t necessarily a good thing, since it prevents us from relaxing and enjoying AN AMERICAN WEREWOLF IN PARIS as a completely mindless, campy entertainment experience. Delpy s injection of class into an otherwise classless production raises the specter of what this film could have been with a better script and a better cast... She was radiant, charismatic, and effective...

A sample application: sentiment classification Doing sentiment classification properly? Morphology, syntax and compositional semantics: who is talking about what, what terms are associated with what, tense... Lexical semantics: are words positive or negative in this context? Word senses (e.g., spirit)? Pragmatics and discourse structure: what is the topic of this section of text? Pronouns and definite references. Getting all this to work well on arbitrary text is very hard. Ultimately the problem is AI-complete, but can we do well enough for NLP to be useful?

More NLP applications IR, IE and QA Information retrieval: return documents in response to a user query (Internet Search is a special case) Information extraction: discover specific information from a set of documents (e.g. company joint ventures) Question answering: answer a specific user question by returning a section of a document: What is the capital of France? Paris has been the French capital for many centuries.

More NLP applications MT Earliest attempted NLP application. High quality only if the domain is restricted (or with very close languages: e.g., Swedish-Danish). Utility greatly increased in 1990s with increase in availability of electronic text. Good applications for bad MT... Spoken language translation is viable for limited domains.

More NLP applications Natural language interfaces and dialogue systems All rely on a limited domain: LUNAR: classic example of a natural language interface to a database (NLID): 1970 1975 SHRDLU: (text-based) dialogue system: 1973 Current spoken dialogue systems Limited domain allows disambiguation: e.g., in LUNAR, rock had one sense.

NLP subtasks NLP subtasks input preprocessing: speech recognizer, text preprocessor or gesture recognizer. morphological analysis (2) part of speech tagging (3) parsing: this includes syntax and compositional semantics (4, 5, 6) disambiguation, inference (6, 7, 8, 9) context processing (10) discourse structuring (11) realization (11) morphological generation (2) output processing: text-to-speech, text formatter, etc.

NLP subtasks Subtasks in natural language interface to a knowledge base KB KB/CONTEXT KB/DISCOURSE STRUCTURING PARSING REALIZATION MORPHOLOGY MORPHOLOGY GENERATION INPUT PROCESSING user input OUTPUT PROCESSING output

NLP subtasks General comments Even simple applications might need complex knowledge sources. Applications cannot be 100% perfect. Applications that are < 100% perfect can be useful. Aids to humans are easier than replacements for humans. NLP interfaces compete with non-language approaches. Typically: shallow processing on arbitrary input or deep processing on narrow domains. Limited domain systems require expensive expertise to port or large amounts of (expensive) data. External influences on NLP are very important.