Tree-Adjoining Grammar (TAG) Overview. Introduction to TAG (1) Linguistics 614. Spring 2015

Similar documents
"f TOPIC =T COMP COMP... OBJ

Developing a TT-MCTAG for German with an RCG-based Parser

LTAG-spinal and the Treebank

Context Free Grammars. Many slides from Michael Collins

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Proof Theory for Syntacticians

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Grammars & Parsing, Part 1:

Chapter 4: Valence & Agreement CSLI Publications

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

CS 598 Natural Language Processing

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

arxiv:cmp-lg/ v1 16 Aug 1996

Control and Boundedness

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Hyperedge Replacement and Nonprojective Dependency Structures

Some Principles of Automated Natural Language Information Extraction

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

Hindi-Urdu Phrase Structure Annotation

Som and Optimality Theory

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Parsing of part-of-speech tagged Assamese Texts

Constraining X-Bar: Theta Theory

The Structure of Multiple Complements to V

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

The building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective

An Introduction to the Minimalist Program

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

On the Notion Determiner

Compositional Semantics

LING 329 : MORPHOLOGY

Natural Language Processing. George Konidaris

Structure-Preserving Extraction without Traces

Argument structure and theta roles

Dependency, licensing and the nature of grammatical relations *

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

The Interface between Phrasal and Functional Constraints

Agree or Move? On Partial Control Anna Snarska, Adam Mickiewicz University

UCLA UCLA Electronic Theses and Dissertations

The Discourse Anaphoric Properties of Connectives

Advanced Grammar in Use

Minimalism is the name of the predominant approach in generative linguistics today. It was first

A Grammar for Battle Management Language

Erkki Mäkinen State change languages as homomorphic images of Szilard languages

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Ch VI- SENTENCE PATTERNS.

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

BULATS A2 WORDLIST 2

Feature-Based Grammar

A Version Space Approach to Learning Context-free Grammars

Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Focusing bound pronouns

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Switched Control and other 'uncontrolled' cases of obligatory control

A First-Pass Approach for Evaluating Machine Translation Systems

LNGT0101 Introduction to Linguistics

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

AP Statistics Summer Assignment 17-18

The Strong Minimalist Thesis and Bounded Optimality

Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZTI INF W. Germany. (2) [S' [NP who][s does he try to find [NP e]]s IS' $=~

Abstractions and the Brain

Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form

Pseudo-Passives as Adjectival Passives

Algebra 2- Semester 2 Review

a) analyse sentences, so you know what s going on and how to use that information to help you find the answer.

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Derivations (MP) and Evaluations (OT) *

The semantics of case *

PRODUCT PLATFORM DESIGN: A GRAPH GRAMMAR APPROACH

The Structure of Relative Clauses in Maay Maay By Elly Zimmer

6.863J Natural Language Processing Lecture 12: Featured attraction. Instructor: Robert C. Berwick

A General Class of Noncontext Free Grammars Generating Context Free Languages

Character Stream Parsing of Mixed-lingual Text

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

A relational approach to translation

Hindi Aspectual Verb Complexes

Analysis of Probabilistic Parsing in NLP

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

Parsing natural language

Ohio s Learning Standards-Clear Learning Targets

CHILDREN S POSSESSIVE STRUCTURES: A CASE STUDY 1. Andrew Radford and Joseph Galasso, University of Essex

MOODLE 2.0 GLOSSARY TUTORIALS

Sight Word Assessment

The Verbmobil Semantic Database. Humboldt{Univ. zu Berlin. Computerlinguistik. Abstract

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Prediction of Maximal Projection for Semantic Role Labeling

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

Theoretical Syntax Winter Answers to practice problems

Underlying and Surface Grammatical Relations in Greek consider

University of Edinburgh. University of Pennsylvania

Guidelines for Writing an Internship Report

CAS LX 522 Syntax I. Long-distance wh-movement. Long distance wh-movement. Islands. Islands. Locality. NP Sea. NP Sea

Transcription:

Tree-Adjoining Grammar (TAG) Linguistics 614 With thanks to Detmar Meurers & Laura Kallmeyer pring 2015 Overview 1 Introduction 2 TAG for Natural Languages 3 Conclusion (1) TAG (Joshi et al. 1975, Joshi & chabes 1997) extends CFG in the following sense: In a CFG, each derivation step amounts to substituting a new tree of height 1 for a leaf. In a TAG, we allow (finite) trees that are arbitrarily large

(2) Phrase tructure Trees AD sometimes laughs 1 2 AD 3 4 5 AD sometimes 6 laughs (3) tring rewriting derivation 0 1 (rule #1) 2 (rule #4) 3 AD (rule #2) 4 sometimes (rule #5) 5 sometimes (rule #3) 6 sometimes laughs (rule #6) (4) ubstitution (Tree ubstitution Grammars (TGs) Elementary structures are trees Arrow indies where substitution takes place tree 1: tree 2: derived tree: laughs laughs

(5) Q: with TGs, how would we obtain heartily laughs? Besides substitution at leaves, we also can replace internal nodes with new trees (adjunction). In an adjunction, the new tree is an auxiliary tree and has a special leaf, the foot node. The trees that are added in substitution operations are called initial trees. Auxiliary tree modifies an XP only if root & foot nodes are both XP Using adjunction gives Tree Adjoining Grammar (TAG) (6) (1) sometimes laughs AD sometimes laughs derived tree: AD sometimes laughs (7) A Tree Adjoining Grammar (TAG) is a quadruple G = N, T, I, A such that T and N are disjoint alphabets, the terminals and nonterminals, I is a finite set of initial trees, and A is a finite set of auxiliary trees. The trees in I A are called elementary trees. nodes in elementary trees are labeled with symbols from T N {ε} all internal nodes have labels from N G is lexicalized iff each elementary tree has at least one leaf with a terminal label.

(8) TAG as defined above are more powerful than CFG but they cannot generate the copy language ({ww w {a, b} }). In order to increase the expressive power, adjunction constraints are introduced that specify for each node 1 whether adjunction is mandatory and 2 which trees can be adjoined. (9) Three types of constraints are distinguished: 1 A node is said to carry a obligatory adjunction (OA) constraint if adjunction is obligatory at that node. 2 A node is said to carry a null adjunction (NA) constraint if adjunction is not obligatory and the set of adjoinable trees is empty. 3 A node is said to carry a selective adjunction (A) constraint if adjunction is not obligatory and the set of adjoinable trees is not empty. (6) Example: TAG for the copy language ǫ a NA NA a b NA NA b

(7) Example (2) seems to sleep OA to sleep seems (8) TAG derivations are described by derivation trees: For each derivation in a TAG there is a corresponding derivation tree. This tree contains nodes for all elementary trees used in the derivation, and edges for all adjunctions and substitutions performed throughout the derivation. Whenever an elementary tree γ was attached to the node at address p in the elementary tree γ, there is an edge from γ to γ labeled with p. We use Gorn addresses: The root has address ε, and the i th daughter of the node with address p has address pi. (9) Derivation tree example The derivation tree for the derivation of (2) seems to sleep: sleep 1 2 john seems

FTAG (1) Feature-structure based TAG (FTAG) (ijay-hanker & Joshi, 1988): each node has a top and a bottom feature structure (except substitution nodes that have only a top). Nodes in the same elementary tree can share features (extended domain of locality). Intuition: The top feature structure tells us something about what the node represents within the surrounding structure, and the bottom feature structure tells us something about what the tree below the node represents. In the final derived tree, both must be the same. FTAG (2) Example: agr 1 agr pers 3 num sing 1 sings FTAG (3) Example: agr 1 agr 1 mode ind mode ger singing

FTAG (4) Unifiion during derivation: ubstitution: the top of the root of the new initial tree unifies with the top of the substitution node Adjunction: the top of the root of the new auxiliary tree unifies with the top of the adjunction site, and the bottom of the foot of the new tree unifies with the bottom of the adjunction site. In the final derived tree, top and bottom unify for all nodes. FTAG (5) Example: agr pers 3 num sing agr 1 agr pers 3 num sing 1 sings FTAG (6) Example: agr 2 mode ind agr 2 pers 3 num sing is agr 1 agr 1 mode ind mode ger mode ger singing

FTAG (7) In FTAG, there are no explicit adjunction constraints. Instead, adjunction constraints are expressed via feature unifiion requirements. Important: LTAG feature structures are restricted; there is only a finite set of possible feature structures. Therefore, the following can be shown: For each FTAG there exists a weakly equivalent TAG with adjunction constraints and vice versa. The two TAGs generate even the same sets of trees, only with different node labels. Elementary trees (1) Important features of LTAG (Lexicalized TAG): Grammar is lexicalized Recursive parts are put into separate elementary trees that can be adjoined (Factoring of recursion, FR) Elementary trees can be arbitrarily large, in particular (because of FR) they can contain elements that are far apart in the final derived tree (Extended domain of locality) LTAG game: http://www.ltaggame.com Elementary trees (2) (3) a. who i did tell am that Bill likes t i b. who i did tell am that Mary said that Bill likes t i WH i OA COMP that WH i who likes ǫ i INFL did tell Bill am

Elementary trees (3) Elementary trees are extended projections of lexical items. Recursion is factored away finite set of elementary trees. The elementary tree of a lexical predie contains slots for all arguments of the predie, for nothing more. Besides lexical predies, there are functional elements (complementizers, determiners, auxiliaries, negation) whose treatment in LTAG is less clear. They can be either in separate elementary trees (XTAG, 2001) or in the elementary tree of the lexical item they are associated with (Frank, 2002). Elementary trees (4) Example (4) gives a book to Mary PP gives P to Elementary trees (5) Example: (5) expected Mary to make a comment expected selects for a subject and an infinitival sentence: expected to make a comment The sentential object is realised as a foot node in order to allow extractions: (6) whom does expect to come?

Elementary trees (6) to make a comment: make and comment in the same elementary tree since they form a light verb construction: Det to make N comment a Elementary trees (7) Example with modifiers: (7) the good student participated in every course during the semester N AP A good N Det the N student Elementary trees (8) PP participated P in PP P during

Elementary trees (9) Constraints on larger structures (constraints on unbounded dependencies ) need not be stipulated but follow from the possibilities of adjunction in the elementary trees. Fundamental LTAG hypothesis: Every syntactic dependency is expressed locally within a single elementary tree. Non-local dependency corollary: Non-local dependencies always reduce to local ones once recursive structure is factored away. What do the elementary trees look like for the following sentence? (8) which book did Harvey say Cecile had read Elementary trees (10) Tree families In the lexicon, the trees are organized in tree families. Each family contains a base tree and trees derived from the base tree using transformations. Important: These transformations operate only on a finite set, i.e., on structures of bounded size. Tree families group together trees belonging to the same subegorization frame. Elementary trees (11) Tree family example The trees for the different forms of buy in (9) belong to one tree family. (9) a. bought a book b. What does buy? c. Who bought a book? d. A book was bought by e. The man who bought the book this morning was from Tübingen. buy in (10) has a different tree family. (10) bought Mary a book

(1) The derived tree gives the constituent structure. The derivation tree records the history of how the elementary trees are put together. the edges in the derivation tree represent predie-argument dependencies where a substitution-edge has a downward direction, an adjunction edge an upward direction; the derivation tree is close to a semantic dependency graph. compute semantics on derivation tree (2) Ditransitive verb (11) buys Bill a book Elementary trees: Derivation tree buys buys 1 22 23 Bill a book Bill Det N a book (3) entential Complement (12) Bill hopes that wins Bill wins ǫ 1 hopes 1 Bill hopes Comp that wins

(4) Raising to Object (13) expects Bill to win to win expects ǫ 1 expects Bill 1 to win (5) Object-control Equi (14) persuades Bill PRO to leave to leave ǫ persuades 1 22 Bill persuades PRO to leave (6) ubject raising (15) seems to like Bill seems to like to like 1 2 22 seems Bill

(7) Long distance phenomena (16) which book did Harvey say Cecile had read had read 1 2 21 which book did say Cecile 21 Harvey (8) The derivation tree is not always the semantic dependency structure: (17) roasted red pepper AP N roasted N AP red N N N pepper pepper ǫ red ǫ roasted proposal of alternative derivation with multiple adjunctions of modifier trees at the same node. (9) On the other hand, multiple adjunctions are not always desired: (18) seems to be likely to win the race to win 1 2 22 to be likely the race ǫ seems This is the correct dependency structure.

(10) Another problematic case: (19) claims Bill is likely to win to win 1 ǫ 2 Bill claims is likely 1 Conclusion TAG extend CFGs by introducing adjunction, in addition to substitution. TAG are only slightly more powerful that CFG. Elementary trees of lexical predies encapsulate subegorization frames: For each subegorized argument, there is a non-terminal leaf (either a substitution node or a foot node). Recursion is factored away: only slots for subegorized arguments are provided. Modifiers are added by adjunction. Extended domain of locality: yntactic dependencies are defined locally, within single elementary trees. Unbounded dependencies arise from adjunction between an argument and its lexical head. References Frank, R. (2002): Phrase tructure Copmposition and yntactic Dependencies. MIT Press, Cambridge, Mass. Gardent, C., Kallmeyer, L. (2003): emantic Construction in FTAG. Proceedings of EACL 2003, 123 130. Joshi, A.K., Levy, L.., Takahashi, M. (1975): Tree Adjunct Grammars. Journal of Computer and ystem cience 10, 136 163. Joshi, A.K., chabes, Y. (1997): Tree-Adjoning Grammars. In Rozenberg, G., alomaa, A., eds.: Handbook of Formal Languages. pringer, Berlin, 69 123. Kallmeyer, L., Romero, M. (2008): cope and ituation Binding in LTAG using emantic Unifiion. Research on Language and Computation 6(1), 3 52. Nesson, R., hieber,.m. (2006): impler TAG semantics through synchronization. In: Proceedings of the 11th Conference on Formal Grammar, Malaga, pain. hieber,.m. (1985): Evidence against the context-freeness of natural language. Linguistics and Philosophy 8, 333 343. ijay-hanker, K. and Joshi, A.K. (1988): Feature tructures Based Tree Adjoining Grammar. Proceedings of COLING, 714 719. XTAG Research Group (2001): A Lexicalized Tree Adjoining Grammar for English. Technical report, Institute for Research in Cognitive cience, Philadelphia. Available from ftp://ftp.cis.upenn.edu/pub/xtag/release-2.24.2001/tech-report.pdf.