Compositional Semantics

Similar documents
Proof Theory for Syntacticians

Natural Language Processing. George Konidaris

CS 598 Natural Language Processing

Some Principles of Automated Natural Language Information Extraction

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Parsing of part-of-speech tagged Assamese Texts

AQUA: An Ontology-Driven Question Answering System

Part III: Semantics. Notes on Natural Language Processing. Chia-Ping Chen

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Chapter 4: Valence & Agreement CSLI Publications

Foundations of Knowledge Representation in Cyc

Aspectual Classes of Verb Phrases

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

Constraining X-Bar: Theta Theory

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Controlled vocabulary

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

Program in Linguistics. Academic Year Assessment Report

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Context Free Grammars. Many slides from Michael Collins

(Sub)Gradient Descent

The College Board Redesigned SAT Grade 12

Specifying Logic Programs in Controlled Natural Language

Ch VI- SENTENCE PATTERNS.

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

The Interface between Phrasal and Functional Constraints

Probabilistic Latent Semantic Analysis

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

LFG Semantics via Constraints

Leader s Guide: Dream Big and Plan for Success

Construction Grammar. University of Jena.

Knowledge-Based - Systems

The MEANING Multilingual Central Repository

Guidelines for Writing an Internship Report

Control and Boundedness

Argument structure and theta roles

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Theoretical Syntax Winter Answers to practice problems

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

BYLINE [Heng Ji, Computer Science Department, New York University,

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

A R "! I,,, !~ii ii! A ow ' r.-ii ' i ' JA' V5, 9. MiN, ;

Radius STEM Readiness TM

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

IBM Software Group. Mastering Requirements Management with Use Cases Module 6: Define the System

CS Machine Learning

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Type-driven semantic interpretation and feature dependencies in R-LFG

THE ANTINOMY OF THE VARIABLE: A TARSKIAN RESOLUTION Bryan Pickel and Brian Rabern University of Edinburgh

AN EXPERIMENTAL APPROACH TO NEW AND OLD INFORMATION IN TURKISH LOCATIVES AND EXISTENTIALS

TU-E2090 Research Assignment in Operations Management and Services

On-Line Data Analytics

Grade 6: Module 2A Unit 2: Overview

Grammars & Parsing, Part 1:

Minimalism is the name of the predominant approach in generative linguistics today. It was first

Derivational and Inflectional Morphemes in Pak-Pak Language

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

Prediction of Maximal Projection for Semantic Role Labeling

Which verb classes and why? Research questions: Semantic Basis Hypothesis (SBH) What verb classes? Why the truth of the SBH matters

Semantic Inference at the Lexical-Syntactic Level for Textual Entailment Recognition

MYP Language A Course Outline Year 3

Pseudo-Passives as Adjectival Passives

Visit us at:

Underlying and Surface Grammatical Relations in Greek consider

Update on Soar-based language processing

Secondary English-Language Arts

"f TOPIC =T COMP COMP... OBJ

CPS122 Lecture: Identifying Responsibilities; CRC Cards. 1. To show how to use CRC cards to identify objects and find responsibilities

Heritage Korean Stage 6 Syllabus Preliminary and HSC Courses

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

The Smart/Empire TIPSTER IR System

Grade Band: High School Unit 1 Unit Target: Government Unit Topic: The Constitution and Me. What Is the Constitution? The United States Government

OFFICE SUPPORT SPECIALIST Technical Diploma

The stages of event extraction

Modeling user preferences and norms in context-aware systems

K 1 2 K 1 2. Iron Mountain Public Schools Standards (modified METS) Checklist by Grade Level Page 1 of 11

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Loughton School s curriculum evening. 28 th February 2017

Word Stress and Intonation: Introduction

THE VERB ARGUMENT BROWSER

Using Semantic Relations to Refine Coreference Decisions

Focusing bound pronouns

Pre-Processing MRSes

Language Evolution, Metasyntactically. First International Workshop on Bidirectional Transformations (BX 2012)

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

LTAG-spinal and the Treebank

cambridge occasional papers in linguistics Volume 8, Article 3: 41 55, 2015 ISSN

Lecture 1: Basic Concepts of Machine Learning

Transcription:

Compositional Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu

Words, bag of words Sequences Trees Meaning

Representing Meaning An important goal of NLP/AI: convert natural language into a representation that supports semantic inferences Why? Many applications require semantic understanding Question answering, translation, fact-checking, giving instructions to a robot, Challenge: how to bridge gap between linguistic input to non-linguistic knowledge of the world

Representing Meaning Challenges for mapping linguistic input to meaning different words/structure, same meaning She needed to make a quick decision in that situation. The scenario required her to make a split-second judgment. I saw the man. The man was seen by me.

Representing Meaning Challenges for mapping linguistic input to meaning same words, different meaning - I walked by the bank - to deposit my check. - to take a look at the river. Everyone on the island speaks two languages. Two languages are spoken by everyone on the island.

Representing Meaning create representations of linguistic inputs that capture the meanings of those inputs. In most cases, they re simultaneously descriptions of the meanings of utterances and of some potential state of affairs in some world.

Desired Properties of Meaning Representations Goal: express propositions, while abstracting away from ambiguity/vagueness of natural language Desired Properties Verifiability No ambiguity Expressiveness Inference

Natural Language Inferences Examples All blips are foos. Blop is a blip. Blop is a foo. Mozart was born in Salzburg. Mozart was born in Vienna. No, that can t be. These are different cities.

We ll cover different families of approaches Logical Semantics Shallow Representations and Lexical Semantics Textual Inference

Constrasting 2 Strategies to Semantic Analysis Logical semantics Complete analysis Create a First Order Logic representation that accounts for all the entities, roles and relations present in a sentence Information Extraction Superficial analysis Pulls out only the entities, relations and roles that are of interest to the consuming application.

Information Extraction: Entity Recognition PERSON ORGANIZATION American Airlines, a unit of AMR, immediately matched the move, spokesman Tim Wagner said.

Information Extraction: Predicting Relations PERSON Founder? Investor? Member? Employee? President? ORGANIZATION American Airlines, a unit of AMR, immediately matched the move, spokesman Tim Wagner said.

Information Extraction Relations PERSON- SOCIAL PHYSICAL GENERAL AFFILIATION PART- WHOLE Family Business Lasting Personal Located Near Citizen- Resident- Ethnicity- Religion Subsidiary Org-Location- Origin Geographical Founder Ownership Membership ORG AFFILIATION Sports-Affiliation Investor Student-Alum Employment ARTIFACT User-Owner-Inventor- Manufacturer 17 relations from 2008 Relation Extraction Task from Automated Content Extraction (ACE)

Information Extraction Relations UMLS: Unified Medical Language System 134 entity types, 54 relations Injury disrupts Physiological Function Bodily Location location-of Biologic Function Anatomical Structure part-of Organism Pharmacologic Substance causes Pathological Function Pharmacologic Substance treats Pathologic Function

Building Blocks of Logical Representations of Meaning Propositional Semantics Proposition symbols: P, Q, Boolean operators negation, conjunction, disjunction Implication, equivalence Inference rules Can be defined using Boolean connectives P => Q

Building Blocks of Logical Representations of Meaning Predicate Logic: extends our representation with Constants = elements that name entities in the model Predicates = sets of objects or, equivalently, functions from objects to truth values Functions = sets of pairs of objects, or eq. functions from one object to another

Building Blocks of Logical Representations of Meaning Predicate Logic: extends our representation further with Variables = let us refer to objects which are not locally specified Quantifiers = used to bind variables Existential Universal

A CFG specification of the syntax of First Order Logic Representations From SLP2 Section 17.3

Representing a sentence in FOL Franco likes Frasca. How can we represent the Liking predicateargument template?

Predicate-Argument Structure in Natural Language Events, actions and relationships can be captured with representations that consist of predicates and arguments to those predicates. Predicates Primarily Verbs, VPs, Sentences Sometimes Nouns and NPs Arguments Primarily Nouns, Nominals, NPs, PPs But also everything else, depends on the context

Example: representing predicateargument structure Mary gave a list to John. Giving(Mary, John, List) More precisely Gave conveys a three-argument predicate The first argument is the subject The second is the recipient, which is conveyed by the NP inside the PP The third argument is the thing given, conveyed by the direct object

Example: representing predicateargument structure Predicate-argument structures as templates We can think of the verb/vp providing a template like the following e, x, y, zgiving( e)^giver( e, x)^given( e, y)^givee( e, z) The semantics of the NPs and the PPs in the sentence plug into the slots provided in the template

A CFG specification of the syntax of First Order Logic Representations From SLP2 Section 17.3

Representing a sentence in FOL Franco likes Frasca. Liking predicate-argument template

One More Building Block of Logical Representations of Meaning Lambda forms Take a FOL formula with variables in it that are to be bound. Allow those variables to be bound by treating the lambda form as a function with formal arguments. λx.p(x) λx.p(x)(franco) P(Franco)

Lambda Reductions

Logical Semantics Representations of Natural Language Building blocks Propositional Logic Predicate Logic Lambda Forms Given a sentence, how can we construct its logical representation? One approach: compositional semantics

Compositional Analysis: use syntax to guide semantic analysis

Principle of Compositionality The meaning of a whole is derived from the meanings of the parts What parts? The constituents of the syntactic parse of the input What could it mean for a part to have a meaning?

Compositional Analysis: use syntax to guide semantic analysis

Augmented Rules We ll accomplish this by attaching semantic formation rules to our syntactic CFG rules Abstractly A 1... n { f (α1.sem,...αn.sem)} This should be read as: the semantics we attach to A can be computed from some function applied to the semantics of A s parts.

Example Easy parts NP -> PropNoun PropNoun -> Frasca PropNoun -> Franco Attachments {PropNoun.sem} {Frasca} {Franco}

Example S -> NP VP VP -> Verb NP Verb -> likes {VP.sem(NP.sem)} {Verb.sem(NP.sem)???

Which approach can we use to discover information about specific entities?

What approach can we use to summarize text?

Which approach can we use to query databases?

Which approach can we use to instruct a robot?

Recap Intro to Semantics Meaning representations motivated by semantic processing for specific applications 2 approaches to semantic processing complete FOL representation vs. shallow information extraction