Natural Language Processing. Lecture 27: Conclusion

Levels of Linguistic Knowledge (shallower to deeper)
spoken: phonetics, phonology / written: orthography
morphology
syntax
semantics
pragmatics
discourse

uygarlaştıramadıklarımızdanmışsınızcasına
"(behaving) as if you are among those whom we could not civilize"

uygar: "civilized"
+laş: "become"
+tır: "cause to"
+ama: "not able"
+dık: past participle
+lar: plural
+ımız: first person plural possessive ("our")
+dan: ablative case ("from/among")
+mış: past
+sınız: second person plural ("y'all")
+casına: finite verb → adverb ("as if")

Finite-State Automaton
Q: a finite set of states
q0 ∈ Q: a special start state
F ⊆ Q: a set of final states
Σ: a finite alphabet
Transitions: ... qi → qj, labeled with s ∈ Σ* ...
Encodes a set of strings that can be recognized by following paths from q0 to some state in F.
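The definition above can be made concrete with a small recognizer. This is a minimal sketch, not the lecture's code: it assumes every transition is labeled with a single symbol (the slide allows strings from Σ*), and the `SHEEP` automaton and the name `accepts` are hypothetical illustrations.

```python
from typing import Dict, Set, Tuple

def accepts(start: str,
            finals: Set[str],
            transitions: Dict[Tuple[str, str], Set[str]],
            string: str) -> bool:
    """Return True if some path from `start` to a state in `finals` spells `string`."""
    current = {start}  # all states reachable after the symbols read so far
    for symbol in string:
        current = {nxt
                   for state in current
                   for nxt in transitions.get((state, symbol), set())}
        if not current:  # no path can read this symbol
            return False
    return bool(current & finals)

# Hypothetical toy automaton for the strings b a a+ ! ("baa!", "baaa!", ...)
SHEEP = {
    ("q0", "b"): {"q1"},
    ("q1", "a"): {"q2"},
    ("q2", "a"): {"q3"},
    ("q3", "a"): {"q3"},
    ("q3", "!"): {"q4"},
}

print(accepts("q0", {"q4"}, SHEEP, "baaa!"))
print(accepts("q0", {"q4"}, SHEEP, "ba!"))
```

Tracking a set of current states lets the same function handle nondeterministic automata without backtracking.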

Levels of Linguistic Knowledge: ambiguity
Ambiguity arises across the levels: phonetics and phonology (spoken), orthography (written), morphology, syntax, semantics, pragmatics, discourse.

Noisy Channel
source → y (what you want) → channel → x (what you see) → decode → y

Examples:
y = NN VB RB; x = Cats meow often
y = How are you?; x = 你好吗?
y = Okay, Google; x = [figure not recovered]
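The decoding step is usually written with Bayes' rule. The slide text that survives does not show a formula, so the following is the standard textbook formulation, given here as an assumption about what "decode" stands for:

```latex
\hat{y}
  = \arg\max_{y} \; p(y \mid x)
  = \arg\max_{y} \; \frac{p(y)\, p(x \mid y)}{p(x)}
  = \arg\max_{y} \; \underbrace{p(y)}_{\text{source model}} \; \underbrace{p(x \mid y)}_{\text{channel model}}
```

The denominator p(x) is dropped because it does not depend on y.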

Starting and Stopping
Unigram model: ...
Bigram model: ...
Trigram model: ...

Language Modeling Questions
Why do we use context?
What does smoothing do, and why is it necessary?
What do we use to evaluate language models?
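One way to make these questions concrete is a tiny bigram model. The sketch below is an illustration under stated assumptions, not the course's implementation: it pads each sentence with start and stop symbols, uses add-one (Laplace) smoothing as one possible smoothing method, and evaluates with perplexity. All names (`train_bigram`, `prob`, `perplexity`) are hypothetical.

```python
import math
from collections import Counter

START, STOP = "<s>", "</s>"

def train_bigram(sentences):
    """Count bigrams over sentences padded with START and STOP."""
    contexts, bigrams = Counter(), Counter()
    vocab = {STOP}  # words the model can predict (START is never predicted)
    for sent in sentences:
        tokens = [START] + sent + [STOP]
        vocab.update(sent)
        for prev, word in zip(tokens, tokens[1:]):
            contexts[prev] += 1
            bigrams[(prev, word)] += 1
    return contexts, bigrams, vocab

def prob(word, prev, model):
    """Add-one smoothed p(word | prev); unseen bigrams get a small nonzero probability."""
    contexts, bigrams, vocab = model
    return (bigrams[(prev, word)] + 1) / (contexts[prev] + len(vocab))

def perplexity(sentences, model):
    """Perplexity = 2 ** (average negative log2 probability per predicted token)."""
    log_prob, n = 0.0, 0
    for sent in sentences:
        tokens = [START] + sent + [STOP]
        for prev, word in zip(tokens, tokens[1:]):
            log_prob += math.log2(prob(word, prev, model))
            n += 1
    return 2 ** (-log_prob / n)

model = train_bigram([["cats", "meow"], ["dogs", "bark"]])
print(prob("meow", "cats", model), prob("bark", "cats", model))
print(perplexity([["cats", "bark"]], model))
```

Without smoothing, the unseen bigram "cats bark" would have probability zero and the perplexity of the test sentence would be infinite. Words outside the training vocabulary are not handled here.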

Tagging

Broad POS categories
open classes: nouns, verbs, adjectives, adverbs
closed classes: prepositions, particles, determiners, numerals, pronouns, conjunctions, auxiliary verbs

Syntax

Parsing: CKY vs. Earley's Algorithm
Both use dynamic programming.
CKY requires a grammar in Chomsky normal form (CNF); Earley's algorithm handles general context-free forms.

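CKY Algorithm: Chart
(each row lists the cells for spans starting at that word, shortest span first; "-" marks an empty cell)

book:     Noun, Verb | - | VP, S | - | S
this:     Det | NP | - | NP
flight:   Noun | - | -
through:  Prep | PP
Houston:  PNoun, NP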

CKY Equations
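The equations on this slide did not survive extraction, so the following is not a reconstruction of them. It is a minimal sketch of a CKY recognizer (no probabilities) for the sentence in the chart above. The grammar in `LEXICON` and `BINARY` is a hypothetical toy grammar in CNF, chosen only so that the sketch fills a chart of the same shape.

```python
from collections import defaultdict

# Hypothetical toy grammar (an assumption, not the lecture's grammar).
LEXICON = {
    "book": {"Noun", "Verb"},
    "this": {"Det"},
    "flight": {"Noun"},
    "through": {"Prep"},
    "Houston": {"PNoun", "NP"},
}
BINARY = {  # (left child, right child) -> possible parents
    ("Det", "Noun"): {"NP"},
    ("NP", "PP"): {"NP"},
    ("Prep", "NP"): {"PP"},
    ("Verb", "NP"): {"VP", "S"},
    ("VP", "PP"): {"VP", "S"},
}

def cky(words):
    """Fill chart[i, j] with every nonterminal that derives words[i:j]."""
    n = len(words)
    chart = defaultdict(set)
    for i, word in enumerate(words):           # spans of width 1
        chart[i, i + 1] = set(LEXICON.get(word, ()))
    for width in range(2, n + 1):              # wider spans, bottom-up
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):          # split point
                for left in chart[i, k]:
                    for right in chart[k, j]:
                        chart[i, j] |= BINARY.get((left, right), set())
    return chart

words = "book this flight through Houston".split()
chart = cky(words)
print("S" in chart[0, len(words)])
```

Each cell is computed once from smaller cells, which is what makes the algorithm dynamic programming; the three nested span loops give cubic time in sentence length.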

Semantics

Where's the beef?
Sentences from the Brown corpus, extracted from the concordancer in The Compleat Lexical Tutor, http://www.lextutor.ca/

chicken

Synsets for dog (n)
S: (n) dog, domestic dog, Canis familiaris (a member of the genus Canis (probably descended from the common wolf) that has been domesticated by man since prehistoric times; occurs in many breeds) "the dog barked all night"
S: (n) frump, dog (a dull unattractive unpleasant girl or woman) "she got a reputation as a frump"; "she's a real dog"
S: (n) dog (informal term for a man) "you lucky dog"
S: (n) cad, bounder, blackguard, dog, hound, heel (someone who is morally reprehensible) "you dirty dog"
S: (n) frank, frankfurter, hotdog, hot dog, dog, wiener, wienerwurst, weenie (a smooth-textured sausage of minced beef or pork usually smoked; often served on a bread roll)
S: (n) pawl, detent, click, dog (a hinged catch that fits into a notch of a ratchet to move a wheel forward or prevent it from moving backward)
S: (n) andiron, firedog, dog, dog-iron (metal supports for logs in a fireplace) "the andirons were too hot to touch"

Entity Linking
Mary picked up the ball. She threw it to me.

Semantic Roles
PropBank is a set of verb-sense-specific frames with informal descriptions for their arguments. Consider the word "agree":
ARG0: agreer
ARG1: proposition
ARG2: other entity agreeing
[The group]ARG0 agreed [it wouldn't make an offer]ARG1.
Usually [John]ARG0 agrees [with Mary on everything]ARG2.

Fall (move downward) in PropBank
arg1: logical subject, patient, thing falling
arg2: extent, amount fallen
arg3: starting point
arg4: ending point
argm-loc: medium
Sales fell to $251.2 million from $278.8 million.
The average junk bond fell by 4.2%.
The meteor fell through the atmosphere, crashing into Cambridge.
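A frame like this can be held in a small data structure and used to look up what each labeled span means. The sketch below is illustrative only: the `Frame` class and `describe` function are hypothetical, and the role labels attached to the first example sentence are one reading of it (the extracted slide text does not show the bracketing).

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

@dataclass
class Frame:
    """A verb-sense-specific frame: role label -> informal description."""
    predicate: str
    roles: Dict[str, str]

FALL = Frame("fall (move downward)", {
    "ARG1": "logical subject, patient, thing falling",
    "ARG2": "extent, amount fallen",
    "ARG3": "starting point",
    "ARG4": "ending point",
    "ARGM-LOC": "medium",
})

def describe(frame: Frame, spans: List[Tuple[str, str]]):
    """Attach the frame's description to each (text span, role label) pair."""
    return [(text, label, frame.roles[label]) for text, label in spans]

# Assumed labeling of "Sales fell to $251.2 million from $278.8 million."
example = [
    ("Sales", "ARG1"),
    ("to $251.2 million", "ARG4"),
    ("from $278.8 million", "ARG3"),
]
for row in describe(FALL, example):
    print(row)
```

Because the descriptions live in the frame rather than in the label, the same label (for example ARG2) can mean different things for different verb senses, as with "agree" and "fall" above.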

MRL #1: First-Order Logic
Function: DressCode(ThePorch)
Predicates: Serves(UnionGrill, AmericanFood), Restaurant(UnionGrill)
Have(Speaker, FiveDollars) ∧ Have(Speaker, LotOfTime)
∀x Person(x) ⇒ Have(x, FiveDollars)
∃x,y Person(x) ∧ Restaurant(y) ∧ HasVisited(x,y)
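Formulas like these can be checked against a small finite model. The sketch below is an illustration, not part of the lecture: the individuals and facts in the model are invented, and the two quantified formulas are read as a universal implication and an existential conjunction, which is an assumption because the quantifier symbols are missing from the extracted text.

```python
# Hypothetical toy model (all individuals and facts are invented for illustration).
people = {"Speaker", "Alice"}
restaurants = {"UnionGrill", "ThePorch"}
domain = people | restaurants
have = {("Speaker", "FiveDollars"), ("Speaker", "LotOfTime"), ("Alice", "FiveDollars")}
has_visited = {("Alice", "UnionGrill")}

def Person(x): return x in people
def Restaurant(x): return x in restaurants
def Have(x, y): return (x, y) in have
def HasVisited(x, y): return (x, y) in has_visited

# Have(Speaker, FiveDollars) AND Have(Speaker, LotOfTime)
conjunction = Have("Speaker", "FiveDollars") and Have("Speaker", "LotOfTime")

# For all x: Person(x) implies Have(x, FiveDollars); "p implies q" is "(not p) or q"
universal = all((not Person(x)) or Have(x, "FiveDollars") for x in domain)

# There exist x, y: Person(x) AND Restaurant(y) AND HasVisited(x, y)
existential = any(Person(x) and Restaurant(y) and HasVisited(x, y)
                  for x in domain for y in domain)

print(conjunction, universal, existential)
```

Over a finite domain, universal quantification reduces to `all` and existential quantification to `any`.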

First-Order Logic: Advantages
Flexible
Well-understood
Widely used

EM
We often have unlabeled or incomplete data.
EM is an algorithm for learning without labels, e.g., classification without classes.
Pick random centroids.
Iterate the following:
E-step: use centroids to label the data.
M-step: compute centroids using the labeled data.
Keep doing this until labels don't change.
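The centroid procedure described on this slide is k-means clustering, which can be viewed as a hard-assignment form of EM. The following is a minimal sketch under that reading, not the lecture's code; the function name `kmeans`, its parameters, and the sample points are hypothetical.

```python
import random

def kmeans(points, k, init=None, seed=0, max_iters=100):
    """Cluster tuples in `points` into k groups; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Pick random centroids unless the caller supplies initial ones.
    centroids = list(init) if init is not None else rng.sample(points, k)
    labels = None
    for _ in range(max_iters):
        # E-step: label each point with its nearest centroid (squared distance).
        new_labels = [
            min(range(k),
                key=lambda c: sum((p - q) ** 2 for p, q in zip(pt, centroids[c])))
            for pt in points
        ]
        if new_labels == labels:   # labels did not change: stop
            break
        labels = new_labels
        # M-step: recompute each centroid as the mean of its labeled points.
        for c in range(k):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:            # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return labels, centroids

points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
labels, centroids = kmeans(points, 2, init=[(0.0, 0.0), (10.0, 10.0)])
print(labels)
```

General EM replaces the hard labels of the E-step with posterior probabilities over classes; the loop structure stays the same.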

NLP Uses
Answer questions using the Web
Translate documents from one language to another
Do library research; summarize
Manage messages intelligently
Help make informed decisions
Follow directions given by any user
Fix your spelling or grammar
Grade exams
Write poems or novels
Listen and give advice
Estimate public opinion
Read everything and make predictions
Interactively help people learn
Help disabled people
Help refugees/disaster victims
Document or reinvigorate indigenous languages

More NLP...
Language Technologies Minor: 4 LT courses plus LT project
5th-year Master's in Language Technologies

More NLP Courses
11-492/692 Speech Processing. Fall: Alan W Black. Practical systems for speech.
11-711 Algorithms and NLP. Fall: Yulia Tsvetkov, Taylor Berg-Kirkpatrick. Research oriented.
11-727 Computational Semantics. Spring: Ed Hovy, Teruko Mitamura.

More NLP Courses
11-747 Neural Networks for NLP. Spring: Graham Neubig.
11-830 Computational Ethics for NLP. Spring: Yulia Tsvetkov, Alan W Black.
11-777 Advanced Multimodal ML. Fall: Louis-Philippe Morency. Visual, gesture, speech.
Most neural net classes involve NLP.