CMPT-413 Computational Linguistics

Similar documents
Interactive Corpus Annotation of Anaphor Using NLP Algorithms

Constraining X-Bar: Theta Theory

CS 598 Natural Language Processing

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY. Kaitlin Rose Johnson

Master s Thesis. An Agent-Based Platform for Dialogue Management

Control and Boundedness

Unit 8 Pronoun References

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Underlying and Surface Grammatical Relations in Greek consider

L1 and L2 acquisition. Holger Diessel

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Compositional Semantics

Argument structure and theta roles

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Using Semantic Relations to Refine Coreference Decisions

ELD CELDT 5 EDGE Level C Curriculum Guide LANGUAGE DEVELOPMENT VOCABULARY COMMON WRITING PROJECT. ToolKit

a) analyse sentences, so you know what s going on and how to use that information to help you find the answer.

Context Free Grammars. Many slides from Michael Collins

Developing Grammar in Context

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Construction Grammar. University of Jena.

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

Aspectual Classes of Verb Phrases

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Lecture 9. The Semantic Typology of Indefinites

Parsing of part-of-speech tagged Assamese Texts

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Possessive have and (have) got in New Zealand English Heidi Quinn, University of Canterbury, New Zealand

Loughton School s curriculum evening. 28 th February 2017

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Part I. Figuring out how English works

Ch VI- SENTENCE PATTERNS.

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

The Smart/Empire TIPSTER IR System

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Grammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs

Course Outline for Honors Spanish II Mrs. Sharon Koller

An Interactive Intelligent Language Tutor Over The Internet

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Let's Learn English Lesson Plan

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

The Discourse Anaphoric Properties of Connectives

Proof Theory for Syntacticians

Prediction of Maximal Projection for Semantic Role Labeling

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Common Core ENGLISH GRAMMAR & Mechanics. Worksheet Generator Standard Descriptions. Grade 2

Adjectives tell you more about a noun (for example: the red dress ).

Annotating (Anaphoric) Ambiguity 1 INTRODUCTION. Paper presentend at Corpus Linguistics 2005, University of Birmingham, England

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Words come in categories

The stages of event extraction

GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017

Information Structure and Referential Givenness/Newness: How Much Belongs in the Grammar?

Common Core State Standards for English Language Arts

BYLINE [Heng Ji, Computer Science Department, New York University,

Written by: YULI AMRIA (RRA1B210085) ABSTRACT. Key words: ability, possessive pronouns, and possessive adjectives INTRODUCTION

Campus Academic Resource Program An Object of a Preposition: A Prepositional Phrase: noun adjective

Kindergarten Lessons for Unit 7: On The Move Me on the Map By Joan Sweeney

THE SOME INDEFINITES

Programma di Inglese

Focusing bound pronouns

AN EXPERIMENTAL APPROACH TO NEW AND OLD INFORMATION IN TURKISH LOCATIVES AND EXISTENTIALS

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

Today we examine the distribution of infinitival clauses, which can be

IN THIS UNIT YOU LEARN HOW TO: SPEAKING 1 Work in pairs. Discuss the questions. 2 Work with a new partner. Discuss the questions.

Analysis of Probabilistic Parsing in NLP

AQUA: An Ontology-Driven Question Answering System

The Structure of Relative Clauses in Maay Maay By Elly Zimmer

Chapter 4: Valence & Agreement CSLI Publications

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

Advanced Grammar in Use

ON THE COGNITIVE STATUS OF ANTECEDENTS IN SPANISH DISCOURSE ANAPHORA

Formulaic Language and Fluency: ESL Teaching Applications

Some Principles of Automated Natural Language Information Extraction

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Thornhill Primary School - Grammar coverage Year 1-6

Strategic discourse comprehension

CEFR Overall Illustrative English Proficiency Scales

Semantic Inference at the Lexical-Syntactic Level for Textual Entailment Recognition

Word Stress and Intonation: Introduction

Copyright and moral rights for this thesis are retained by the author

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

The College Board Redesigned SAT Grade 12

What the National Curriculum requires in reading at Y5 and Y6

Dear Teacher: Welcome to Reading Rods! Reading Rods offer many outstanding features! Read on to discover how to put Reading Rods to work today!

An Introduction to the Minimalist Program

Writing a composition

BULATS A2 WORDLIST 2

Modeling Dialogue Building Highly Responsive Conversational Agents

CHAPTER IV RESEARCH FINDING AND DISCUSSION

Integrating Meta-Level and Domain-Level Knowledge for Task-Oriented Dialogue

Transcription:

CMPT-413 Computational Linguistics Anoop Sarkar http://www.cs.sfu.ca/ anoop 1

Discourse Processing Multiple sentences, dialogs Human-human (Switchboard corpus) and human-computer interaction (ATIS corpus) New phenomena at the discourse level: 1. John went to Bill s car dealership to check out an Acura Integra. He looked at it for about an hour. 2

Discourse Structure Consider a sequence of sentences: s1, s2,.... Such a sequence is structured based on various relationships between the sentences. The discourse structure is a tree expressing these relationships: (DISCOURSE (DR1 (S1 [s1]) (DR2 (S2 [s2]) (S3 [s3])) (S4 [s4])... ) 3

Discourse Structure Each DRi is some discourse relationship, e.g.: (COMPARISON (S1 [Bill drove his old car from BC to Quebec]) (TEMPORAL-SEQUENCE (S2 [On the other hand, John bought a new car]) (S3 [Then, he drove it across the country to Quebec]))) These tree structures can be described by writing down context-free grammar rules, but in this case capturing rules of discourse structure (distinct from rules of sentence structure). 4

Reference Resolution In the previous discourse: John, Bill, Acura Integra, car dealership are all discourse entities. Anaphors like he, she, it are referring expressions, e.g. John and he corefer. A group of referring expressions that corefer is called a coreference chain. Each discourse entity can refer to one or more entities in the real world. Keeping track of discourse entities and relationships between them across multiple sentences is the job of the discourse model. 5

Types of Referring Expressions Indefinite Noun Phrases: specific vs. non-specific indefinites: I saw this great looking car today vs. Mary is going to marry a Swede Definite Noun Phrases: refers to an existing entity I saw an Acura Integra and a Mercedes today. The Integra was white. what about: I m going to take the bus today Pronouns: locality effects, occurs later in the discourse than the entity it refers to: I saw an Acura Integra and a Mercedes today. It was white. cataphora: Before he bought it, John test-drove his Acura. 6

Types of Referring Expressions Demonstratives (also called deictic pronouns) I like this better than that. One Anaphora (one of them) I saw six Acura Integras today. Now I want one. Inferrables (no explicit discourse entity to refer to) I almost bought an Acura Integra today. But a door was dented and the engine was noisy. 7

Types of Referring Expressions Discontinuous Sets (plural referring expressions): John has an Acura and Mary has a Mazda. They drive them all the time. Generics (refer to a class of objects): I saw no less than six Acura Integras today. They are the coolest cars. 8

Syntactic and Semantic Constraints on Reference Person, Number, Gender and Case agreement. John has a new Acura. It is red. Syntactic constraints: John bought himself a new Acura. [himself=john] (reflexives) John bought him a new Acura. [him John] Pleonastic It: A pronoun that has no reference: It is raining 9

Pleonastic It detection 10

Syntactic and Semantic Constraints on Reference These constraints apply in practice to rule out certain coreference possibilities: John wanted a new car. Bill bought him a new Acura. [him=john] John wanted a new car. He bought him a new Acura. [he=john,him John] Selectional restrictions: John parked his Acura in the garage. He had driven it around for hours. (not always) John bought a new Acura. It drinks gasoline like a fish. 11

Preferences in Pronoun Resolution Recency: John has an Integra. Bill has a Legend. Mary likes to drive it. Grammatical Role: subject > existential predicate NP > object > indirect object > nouns in adverbial PP An Acura Integra is parked in the lot. (subject) There is an Acura Integra parked in the lot. (existential predicate NP) John parked an Acura Integra in the lot. (object) John gave his Acura Integra a wash. (indirect object) Inside his Acura Integra, John installed a new CD player. (adv. PP) 12

Preferences in Pronoun Resolution Repeated Mention: entities referred to as pronouns are likely to continue being used as pronouns Parallelism: (cf. grammatical role) Mary went with Sue to the car dealership. Sally went with her to the market. Verb Semantics: John telephoned Bill. He had lost the pamphlet. John criticized Bill. He had lost the pamphlet. 13

Centering Theory and an Algorithm for Pronoun Resolution Centering Theory (Grosz et al., 1995) is a theory of local attention and how it changes over time in a discourse It makes the claim that a single entity is being centered at any given point in the discourse (the point of attention) First we represent the discourse within a discourse model, and then we use this representation for pronoun resolution Let U n and U n+1 represent adjacent utterances in a discourse. 14

Centering The backward looking center: C b (U n ) of utterance U n is the entity that is being focused on after U n is interpreted. The forward looking centers: C f (U n ) of utterance U n is an ordered list of entities that are possible candidates for C b (U n+1 ). The ordering can be one of the preferences given above (e.g. the grammatical role hierarchy) or a combination of preferences. C b (U n+1 ) is defined as the most highly ranked entity in the list C f (U n ) mentioned in U n+1. The C b of the first utterance is undefined. The most highly ranked entity before we see U n+1 is called C p (U n ), the preferred center. 15

Centering Centering then defines relationships between utterances as a function of the relation between the backward center and the preferred center C b (U n+1 ) = C b (U n ) C b (U n+1 ) C b (U n ) or undefined C b (U n ) C p (U n+1 ) = C b (U n+1 ) Continue Smooth-Shift C p (U n+1 ) C b (U n+1 ) Retain Rough-Shift These transitions provide a theory of text coherence 16

Centering for Pronoun Resolution The following rules are used by the algorithm (Brennan et al. ACL 1987): 1. If any element of C f (U n ) is realized by a pronoun in utterance U n+1, then C b (U n+1 ) must also be realized by a pronoun. 2. Transition states are ordered by preference: Continue > Retain > Smooth-Shift > Rough-Shift. 17

Centering for Pronoun Resolution The algorithm for pronoun resolution is defined as follows: 1. Generate possible C b C f combinations for each possible set of reference assignments. 2. Filter by constraints, e.g. if some assignments are illegal due to syntactic or semantic constraints remove them from consideration. 3. Rank by transition orderings. 18

Centering for Pronoun Resolution Consider the following discourse: John saw a beautiful Acura Integra at the dealership. (U 1 ) He showed it to Bob. (U 2 ) He bought it. (U 3 ) For sentence U 1 we get: C f (U 1 ) : C p (U 1 ) : C b (U 1 ) : {John, Integra, dealership} John undefined 19

Centering for Pronoun Resolution For sentence U 2 we have two options for it. Option 1: C f (U 2 ) : {John, Integra, Bob} C p (U 2 ) : John C b (U 2 ) : John Result: Continue C p (U 2 ) = C b (U 2 ); C b (U 1 ) undefined Option 2: C f (U 2 ) : {John, dealership, Bob} C p (U 2 ) : John C b (U 2 ) : John Result: Continue C p (U 2 ) = C b (U 2 ); C b (U 1 ) undefined 20

Centering for Pronoun Resolution For sentence U 3 we have two options for he. Option 1: C f (U 3 ) : {John, Integra} C p (U 3 ) : John C b (U 3 ) : John Result: Continue C p (U 3 ) = C b (U 3 ) = C b (U 2 ) preferred Option 2: C f (U 3 ) : {Bob, Integra} C p (U 3 ) : Bob C b (U 3 ) : Integra Result: Rough-Shift C p (U 3 ) C b (U 3 ); C b (U 3 ) C b (U 2 ) 21

Centering for Pronoun Resolution Another example: Who is Max waiting for? (U 1 ) He is waiting for Fred. (U 2 ) He invited him for dinner. (U 3 ) For sentence U 1 we get: C f (U 1 ) : C p (U 1 ) : C b (U 1 ) : {Max} Max undefined 22

For sentence U 2 by assigning he to Max (the only option) we get: C f (U 2 ) : C p (U 2 ) : C b (U 2 ) : {Max, Fred} Max Max For sentence U 3 we have two options for he and him Either he = Max and him = Fred OR he = Fred and him = Max Note that there are only two options for reference and not four due to the syntactic constraint on binding the pronouns. Ruled out: he = Max and him = Max OR he = Fred and him = Fred

Option 1: C f (U 3 ) : {Max, Fred} C p (U 3 ) : Max C b (U 3 ) : Max Result: Continue C p (U 3 ) = C b (U 3 ) = C b (U 2 ) preferred Option 2: C f (U 3 ) : {Fred, Max} C p (U 3 ) : Fred C b (U 3 ) : Max Result: Retain C p (U 3 ) C b (U 3 ); C b (U 3 ) = C b (U 2 )

Pronoun Resolution Algorithms Centering is one route towards a pronoun resolution algorithm. There are many others including the Lappin and Leass algorithm and the Hobbs Algorithm (see J&M Chp. 18). Accuracy is measured in terms of the number of co-reference chains that are recovered correctly. Annual competition on co-reference is held as part of the Message Understanding Conference (MUC) 23

Dialog Systems So far, we have looked at multiple utterance, but not at dialog Dialog is different: Turn Taking (usually handled using canned text in current dialog systems) Common Ground Conversational Implicature 24

Common Ground As conversation proceeds, the speaker and hearer share a common set of information. They also share common world knowledge. If there is a problem in reaching common ground, the dialog needs to contain some indicators like continuers or backchannels. Often repeats or reformulations are used in dialog systems to establish common ground: A: Ok. I ll take the 5ish flight on the night before on the 11th. B: On the 11th? 25

Conversational Implicature Scalar implicature: He dresses even worse than Anoop. If the dialog system hears: I want 3 stops in my itinerary. should it report on flights that have 7 stops? clearly not. why not? If the system asks: And on what day would you like to travel? and the user responds: I need to be there for a meeting from the 12th to the 15th why is the user s response taken to be relevant? 26

Conversational Implicature Common inferences in discourse (called Grice s Maxims): Quantity: Be exactly as informative as required rules out certain entailments that usually apply: 3 stops does not mean 7 stops. Quality: your contribution will be assumed to be true. Relevance: your contribution is assumed to be relevant to the current situation. Take the user response to mean the 11th. Manner: do not repeat yourself if you know something exists in the common ground. 27

Dialog Systems Dialog systems have to choose between speech acts (dialog acts) Assertives: committing the speaker to a fact. e.g. suggesting, concluding Directives: try to get the hearer to do something. e.g. asking, ordering, requesting Commissives: try to get the hearer to commit. e.g. promising, planning, opposing Expressives: express a psychological state. e.g. thanking, apologizing Declarations: change the state of the common ground. e.g. reserve a flight, name something 28

Conversation Management in Dialog Systems opening.76.23.63 suggest constraint.99.18.36.22 accept.19.77 reject.46 closing Dialog Act Planning using an HMM 29