Computational Linguistics

Similar documents
Developing a TT-MCTAG for German with an RCG-based Parser

LTAG-spinal and the Treebank

Parsing of part-of-speech tagged Assamese Texts

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Control and Boundedness

Efficient Normal-Form Parsing for Combinatory Categorial Grammar

CS 598 Natural Language Processing

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Proof Theory for Syntacticians

Compositional Semantics

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Context Free Grammars. Many slides from Michael Collins

"f TOPIC =T COMP COMP... OBJ

Argument structure and theta roles

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Grammars & Parsing, Part 1:

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

Natural Language Processing. George Konidaris

cambridge occasional papers in linguistics Volume 8, Article 3: 41 55, 2015 ISSN

Some Principles of Automated Natural Language Information Extraction

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

Prediction of Maximal Projection for Semantic Role Labeling

The Discourse Anaphoric Properties of Connectives

Constraining X-Bar: Theta Theory

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Chapter 4: Valence & Agreement CSLI Publications

Hyperedge Replacement and Nonprojective Dependency Structures

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

University of Edinburgh. University of Pennsylvania

Accurate Unlexicalized Parsing for Modern Hebrew

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Type-driven semantic interpretation and feature dependencies in R-LFG

Analysis of Probabilistic Parsing in NLP

Specifying Logic Programs in Controlled Natural Language

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Update on Soar-based language processing

Annotation Projection for Discourse Connectives

arxiv:cmp-lg/ v1 16 Aug 1996

Pre-Processing MRSes

Theoretical Syntax Winter Answers to practice problems

UCLA UCLA Electronic Theses and Dissertations

The Interface between Phrasal and Functional Constraints

Hindi Aspectual Verb Complexes

Hindi-Urdu Phrase Structure Annotation

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

A relational approach to translation

SOME MINIMAL NOTES ON MINIMALISM *

LFG Semantics via Constraints

Ch VI- SENTENCE PATTERNS.

Adapting Stochastic Output for Rule-Based Semantics

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

Surface Structure, Intonation, and Meaning in Spoken Language

An Introduction to the Minimalist Program

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

Pseudo-Passives as Adjectival Passives

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Interfacing Phonology with LFG

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Specifying a shallow grammatical for parsing purposes

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

Chapter 3: Semi-lexical categories. nor truly functional. As Corver and van Riemsdijk rightly point out, There is more

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

Language properties and Grammar of Parallel and Series Parallel Languages

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Som and Optimality Theory

Feature-Based Grammar

AQUA: An Ontology-Driven Question Answering System

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Which verb classes and why? Research questions: Semantic Basis Hypothesis (SBH) What verb classes? Why the truth of the SBH matters

Authors note Chapter One Why Simpler Syntax? 1.1. Different notions of simplicity

- «Crede Experto:,,,». 2 (09) ( '36

An Interactive Intelligent Language Tutor Over The Internet

Agree or Move? On Partial Control Anna Snarska, Adam Mickiewicz University

Treebank mining with GrETEL. Liesbeth Augustinus Frank Van Eynde

arxiv:cmp-lg/ v1 7 Jun 1997 Abstract

A R "! I,,, !~ii ii! A ow ' r.-ii ' i ' JA' V5, 9. MiN, ;

The Syntax of Coordinate Structure Complexes

Domain Adaptation for Parsing

The Structure of Multiple Complements to V

Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank

Multiple case assignment and the English pseudo-passive *

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Character Stream Parsing of Mixed-lingual Text

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Can Human Verb Associations help identify Salient Features for Semantic Verb Classification?

A Corpus-based Evaluation of a Domain-specific Text to Knowledge Mapping Prototype

Ensemble Technique Utilization for Indonesian Dependency Parser

Dual Content Semantics, privative adjectives, and dynamic compositionality

Structure and Intonation in Spoken Language Understanding

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Building an HPSG-based Indonesian Resource Grammar (INDRA)

Transcription:

Computational Linguistics CSC 2501 / 485 Fall 2018 9A 9A. Mildly Context-Sensitive Grammar Formalisms Gerald Penn Department of Computer Science, University of Toronto Based on slides by David Smith, Dan Klein, Stephen Clark and Eva Banik Copyright 2017 Gerald Penn. All rights reserved.

Combinatory Categorial Grammar 15

Combinatory Categorial Grammar (CCG) Categorial grammar (CG) is one of the oldest grammar formalisms Combinatory Categorial Grammar now well established and computationally well founded (Steedman, 1996, 2000) Account of syntax; semantics; prodody and information structure; automatic parsers; generation 16

Combinatory Categorial Grammar (CCG) CCG is a lexicalized grammar An elementary syntactic structure for CCG a lexical category is assigned to each word in a sentence walked: S\ give me an to my left and I return a sentence A small number of rules define how categories can combine Rules based on the combinators from Combinatory Logic 17

CCG Lexical Categories Atomic categories: S, N,, PP,... (not many more) Complex categories are built recursively from atomic categories and slashes, which indicate the directions of arguments Complex categories encode subcategorisation information intransitive verb: S \ walked transitive verb: (S \ )/ respected ditransitive verb: ((S \ )/ )/ gave Complex categories can encode modification PP nominal: ( \ )/ PP verbal: ((S \ )\(S \ ))/ 18

Simple CCG Derivation interleukin 10 inhibits production (S\)/ S S\ > < > forward application < backward application 19

Function Application Schemata Forward (>) and backward (<) application: X /Y Y X (>) Y X \Y X (<) 20

Classical Categorial Grammar Classical Categorial Grammar only has application rules Classical Categorial Grammar is context free S S\ (S\)/ interleukin-10 inhibits production 21

Classical Categorial Grammar Classical Categorial Grammar only has application rules Classical Categorial Grammar is context free S VP V interleukin-10 inhibits production 22

Extraction out of a Relative Clause The company which Microsoft bought /N N (\)/(S/) (S\)/ S/(S\) S/ \ > T type-raising > B forward composition Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 23

Extraction out of a Relative Clause The company which Microsoft bought /N N (\)/(S/) (S\)/ > T type-raising >T S/(S\) S/ \ Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 24

Extraction out of a Relative Clause The company which Microsoft bought /N N (\)/(S/) (S\)/ > T type-raising > B forward composition >T S/(S\) \ S/ >B Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 25

Extraction out of a Relative Clause The company which Microsoft bought /N N (\)/(S/) (S\)/ >T S/(S\) S/ >B > \ Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 26

Extraction out of a Relative Clause The company which Microsoft bought /N N (\)/(S/) (S\)/ > >T S/(S\) \ S/ < >B > Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 27

Forward Composition and Type-Raising Forward composition (> B ): X /Y Y/Z X /Z (> B ) Type-raising (T): X T /(T \X ) (> T ) X T \(T /X ) (< T ) Extra combinatory rules increase the weak generative power to mild context -sensitivity Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 28

Non-constituents in ccg Right Node Raising Google sells but Microsoft buys shares (S\)/ conj (S\)/ S/(S\) S/ >T >T > T type-raising S/(S\) S/ S/ S Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 29

Non-constituents in ccg Right Node Raising Google sells but Microsoft buys shares (S\)/ conj (S\)/ S/(S\) >T >T S/ > T type-raising > B forward composition S/(S\) >B >B S/ S S/ Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 30

Non-constituents in ccg Right Node Raising Google sells but Microsoft buys shares (S\)/ conj (S\)/ S/(S\) >T >T S/ S/(S\) >B >B S/ S S/ <Φ> Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 31

Non-constituents in ccg Right Node Raising Google sells but Microsoft buys shares (S\)/ conj (S\)/ S/(S\) >T >T S/ S/(S\) >B >B S/ S S/ <Φ> > Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 32

Combinatory Categorial Grammar ccg is mildly context sensitive Natural language is provably non-context free Constructions in Dutch and Swiss German (Shieber, 1985) require more than context free power for their analysis these have crossing dependencies (which ccg can handle) Type 0 languages Context sensitive languages Mildly context sensitive languages = natural languages (?) Context free languages Regular languages Stephen Clark Practical Linguistically Motivated Parsing JHU, June 2009 33

CCG Semantics Categories encode argument sequences Parallel syntactic combinator operations and lambda calculus semantic operations 34

CCG Semantics Left arg. Right arg. Operation Result X/Y : f Y : a Forward application X : f(a) Y : a X\Y : f Backward application X : f(a) X/Y : f Y/Z : g Forward composition X/Z : λx.f(g(x)) X : a Type raising T/(T\X) : λf.f(a) etc. 35

Tree Adjoining Grammar 36

TAG Building Blocks Elementary trees (of many depths) Substitution at Tree Substitution Grammar equivalent to CFG α 3 peanuts α 1 Harry α 2 S VP V likes 37

TAG Building Blocks Auxiliary trees for adjunction Adds extra power beyond CFG α 1 Harry α 2 S VP V likes α 3 peanuts β VP VP* Adv passionately 38

Derivation Tree Derived Tree α 2 Harry α 1 likes β passionately α 3 peanuts Harry S VP 1 VP 2 V Adv passionately likes peanuts Semantics Harry(x) likes(e, x, y) peanuts(y) passionately(e) 4 39