Dependency Grammar. Dependency Grammar. Dependency Syntax. Linguistics 614. Spring 2015

Similar documents
Grammars & Parsing, Part 1:

Developing a TT-MCTAG for German with an RCG-based Parser

Dependency Annotation of Coordination for Learner Language

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

LING 329 : MORPHOLOGY

Proof Theory for Syntacticians

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Parsing of part-of-speech tagged Assamese Texts

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

LTAG-spinal and the Treebank

MODELING DEPENDENCY GRAMMAR WITH RESTRICTED CONSTRAINTS. Ingo Schröder Wolfgang Menzel Kilian Foth Michael Schulz * Résumé - Abstract

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

CS 598 Natural Language Processing

Chapter 4: Valence & Agreement CSLI Publications

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Dependency, licensing and the nature of grammatical relations *

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Feature-Based Grammar

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Constraining X-Bar: Theta Theory

Underlying and Surface Grammatical Relations in Greek consider

Accurate Unlexicalized Parsing for Modern Hebrew

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Adding syntactic structure to bilingual terminology for improved domain adaptation

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Ensemble Technique Utilization for Indonesian Dependency Parser

Pseudo-Passives as Adjectival Passives

Context Free Grammars. Many slides from Michael Collins

Some Principles of Automated Natural Language Information Extraction

Derivational and Inflectional Morphemes in Pak-Pak Language

The Interface between Phrasal and Functional Constraints

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Achim Stein: Diachronic Corpora Aston Corpus Summer School 2011

An Introduction to the Minimalist Program

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Using dialogue context to improve parsing performance in dialogue systems

Specifying a shallow grammatical for parsing purposes

Adapting Stochastic Output for Rule-Based Semantics

FinnTreeBank: Creating a research resource and service for language researchers with Constraint Grammar

Dissertation Summaries. Headedness in Word Formation and Lexical Semantics: Evidence from Italiot and Cypriot (University of Patras, 2014)*

Construction Grammar. University of Jena.

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

Prediction of Maximal Projection for Semantic Role Labeling

"f TOPIC =T COMP COMP... OBJ

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

Type Theory and Universal Grammar

AQUA: An Ontology-Driven Question Answering System

The building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective

Refining the Design of a Contracting Finite-State Dependency Parser

A relational approach to translation

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Annotation Projection for Discourse Connectives

Argument structure and theta roles

Procedia - Social and Behavioral Sciences 154 ( 2014 )

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

Depling Third International Conference on Dependency Linguistics. Proceedings of the Conference

Visual CP Representation of Knowledge

Natural Language Processing. George Konidaris

Minimalism is the name of the predominant approach in generative linguistics today. It was first

A High-Quality Web Corpus of Czech

Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

The Pennsylvania State University. The Graduate School. College of the Liberal Arts THE TEACHABILITY HYPOTHESIS AND CONCEPT-BASED INSTRUCTION

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

The Discourse Anaphoric Properties of Connectives

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Control and Boundedness

An Interactive Intelligent Language Tutor Over The Internet

LFG Semantics via Constraints

Compositional Semantics

Language acquisition: acquiring some aspects of syntax.

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

L1 and L2 acquisition. Holger Diessel

Pre-Processing MRSes

LNGT0101 Introduction to Linguistics

Noun incorporation in Sora: A case for incorporation as morphological merger TLS: 19 February Introduction.

A Framework for Customizable Generation of Hypertext Presentations

LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

On the Notion Determiner

Specifying Logic Programs in Controlled Natural Language

Hindi Aspectual Verb Complexes

Language and Computers. Writers Aids. Introduction. Non-word error detection. Dictionaries. N-gram analysis. Isolated-word error correction

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

In Udmurt (Uralic, Russia) possessors bear genitive case except in accusative DPs where they receive ablative case.

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

arxiv:cmp-lg/ v1 7 Jun 1997 Abstract

Construction Grammar. Laura A. Michaelis.

Transcription:

Dependency Grammar Linguistics 614 With thanks to Sandra Kübler and Joakim Nivre Spring 2015 Motivation and Contents Dependency Grammar Not a coherent grammatical framework: wide range of different kinds of DG just as there are wide ranges of generative syntax Different core ideas than phrase structure grammar Dependency grammar is important for those interested in CL: Increasing interest in dependency-based approaches to syntactic parsing in recent years Basic Concepts of Dependency Syntax Dependency Syntax The basic idea: Syntactic structure consists of lexical items, linked by binary asymmetric relations called dependencies. In the (translated) words of Lucien Tesnière [Tesnière(1959)]: The sentence is an organized whole, the constituent elements of which are words. [1.2] Every word that belongs to a sentence ceases by itself to be isolated as in the dictionary. Between the word and its neighbors, the mind perceives connections, the totality of which forms the structure of the sentence. [1.3] The structural connections establish dependency relations between the words. Each connection in principle unites a superior term and an inferior term. [2.1] The superior term receives the name governor. The inferior term receives the name subordinate. Thus, in the sentence Alfred parle [... ], parle is the governor and Alfred the subordinate. [2.2]

Basic Concepts of Dependency Syntax Overview: constituency (1) Small birds sing loud songs What you might be more used to seeing: S NP VP Small birds sing NP loud songs Basic Concepts of Dependency Syntax Overview: dependency A corresponding dependency tree representation [Hudson(2000)]: Small birds sing loud songs What are Dependency Relations? DG is based on dependency relations between words: A B means A governs B or B depends on A... Dependency relations can refer to syntactic properties, semantic properties, or a combination of the two Some variants of DG separate syntactic and semantic relations by representing different layers of dependency structures These relations are generally syntactic functions: subject, ect/complement, adjunct, etc. Subject/Agent: John fished. Object/Patient: Mary hit John. PSG is based on groupings, or constituents Grammatical relations are not usually seen as primitives, but as being derived from structure

Simple relation example For the sentence John loves Mary, we have the relations: loves subj John loves Mary Both John and Mary depend on loves, which makes loves the head, or root, of the sentence (i.e., there is no word that governs loves) The structure of a sentence consists of the set of pairwise relations among words. Terminology Dependency Structure p Economic news had little effect on financial markets. Terminology Terminology Superior Head Governor Regent. Inferior Dependent Modifier Subordinate.

Notational Variants Notational Variants had p news effect Economic little on. markets financial Notational Variants Notational Variants NN had VBD NN p PU JJ news JJ effect IN. Economic little on NNS JJ markets financial Notational Variants Notational Variants p Economic news had little effect on financial markets.

Notational Variants Notational Variants (the one we ll focus on) p Economic news had little effect on financial markets. Comparison to Phrase Structure Phrase Structure S NP VP PU JJ NN VBD NP. Economic news had NP PP JJ NN IN NP little effect on JJ NNS financial markets Comparison to Phrase Structure Comparison Dependency structures explicitly represent head-dependent relations (directed arcs), functional categories (arc labels), possibly some structural categories (parts-of-speech). Phrase structures explicitly represent phrases (nonterminal nodes), structural categories (nonterminal labels), possibly some functional categories (grammatical functions). Hybrid representations may combine all elements.

Comparison to Phrase Structure Relation to phrase structure What is the relation between DG and PSG? If a PS tree has heads marked, then you can derive the dependencies Likewise, a DG tree can be converted into a PS tree by grouping a word with its dependents This only works for projective trees (no crossing branches) Some constituent distinctions are not possible: e.g., binary-branching vs. flat structures for the same head No categorization into phrasal levels Theoretical Frameworks Some Theoretical Frameworks Word Grammar (WG) [Hudson(1984), Hudson(1990)] Functional Generative Description (FGD) [Sgall et al.(1986)sgall, Hajičová and Panevová] Dependency Unification Grammar (DUG) [Hellwig(1986), Hellwig(2003)] Meaning-Text Theory (MTT) [Mel čuk(1988)] (Weighted) Constraint Dependency Grammar ([W]CDG) [Maruyama(1990), Harper and Helzerman(1995), Menzel and Schröder(1998), Schröder(2002)] Functional Dependency Grammar (FDG) [Tapanainen and Järvinen(1997), Järvinen and Tapanainen(1998)] Topological/Extensible Dependency Grammar ([T/X]DG) [Duchier and Debusmann(2001), Debusmann et al.(2004)debusmann, Duchier and Kruijff] Theoretical Issues Some Theoretical Issues Dependency structure sufficient as well as necessary? Mono-stratal or multi-stratal syntactic representations? What is the nature of lexical elements (nodes)? Morphemes? Word forms? Multi-word units? What is the nature of dependency types (arc labels)? Grammatical functions? Semantic roles? What are the criteria for identifying heads and dependents? What are the formal properties of dependency structures?

Headedness Criteria for Heads and Dependents Criteria for a syntactic relation between a head H and a dependent D in a construction C [Zwicky(1985), Hudson(1990)]: 1 H determines the syntactic category of C; H can replace C. 2 H determines the semantic category of C; D specifies H. 3 H is obligatory; D may be optional. 4 H selects D and determines whether D is obligatory. 5 The form of D depends on H (agreement or government). 6 The linear position of D is specified with reference to H. Issues: Syntactic (and morphological) versus semantic criteria Exocentric versus endocentric constructions Clear Cases Some Clear Cases Construction Head Dependent Exocentric Verb Subject () Verb Object () Endocentric Verb Adverbial (vmod) Noun Attribute () vmod Economic news suddenly affected financial markets.??

?? sbar

sbar?????? sbar co cj sbar?? co cj

sbar vc co cj?? sbar vc co cj p sbar vc co cj

Dependency Graphs A dependency structure can be defined as a directed graph G, consisting of a set V of nodes, a set E of arcs (edges), a linear precedence order < on V (not in every theory) Labeled graphs: Nodes in V are labeled with word forms (and annotation). Arcs in E are labeled with dependency types. Notational conventions (i, j V ): i j (i, j) E i j i = j k : i k, k j Conditions Formal Conditions on Dependency Graphs Intuitions: Syntactic structure is complete (Connectedness). Syntactic structure is hierarchical (Acyclicity). Every word has at most one syntactic head (Single-Head). Connectedness can be enforced by adding a special root node. p pred root Economic news had little effect on financial markets. Conditions Formal Conditions on Dependency Graphs G is (weakly) connected: For every node i there is a node j such that i j or j i. G is acyclic: If i j then not j i. G obeys the single-head constraint: If i j, then not k j, for any k i. G is projective: If i j then i k, for any k such that i <k <j or j <k <i.

Projectivity Projectivity Projectivity (or, less commonly, adjacency [Hudson(1990)]) A dependency is projective provided that every word between the head A and the dependent B is a subordinate of A. subordinate (base case): dependent of A subordinate (recursive case): dependent of a subordinate of A Projectivity Projectivity Most theoretical frameworks do not assume projectivity. Non-projective structures are needed to account for long-distance dependencies, free word order. p pred root What did economic news have little effect on? Word Order Grammaticality Obtaining word order constraints Non-projectivity in principle could allow any word order This would clearly overgenerate for most languages Some DGs use projectivity constraints [Hudson(1990)]: (2) with great difficulty (3) *great with difficulty *great with difficulty is ruled out because branches would have to cross in that case In general, this is too strong of a constraint

Word Order Obtaining word order constraints More often, DGs specify word order separately from dependencies e.g., Seq(det(w), adj(w), n(w)): determiner precedes adjective which precedes noun Valency Grammaticality Valency Important concept in DG is valency: ability of a word to take arguments Possible lexicon fragment [Hajič et al.(2003)hajič, Panevová, Urešová, Bémová, Kolářová and Pajas]: Slot 1 Slot 2 Slot 3 sink 1 ACT(nom) PAT(acc) sink 2 PAT(nom) give ACT(nom) PAT(acc) ADDR(dat) To determine grammaticality: 1 Words have valency requirements that must be satisfied 2 Constraints apply to the valencies to see if a sentence is valid Valency Lexicon Example lexical entry from a different framework [Duchier(1999)]: string loves cat V agr sing 3 nom { } comps subject, ect

Valency Adjuncts & Complements Two main kinds of dependencies for A (head) B (dependent): Head-Complement: if A has slot for B, B is a complement Head-Adjunct: if B has slot for A, B is an adjunct Adjunct/complement distinction may also be captured in the type (label) of the dependency relation Layers Grammaticality Using layers of dependencies Different frameworks allow for differing layers of dependencies e.g., FGD distinguishes tectogrammatical & analytical layers Example from MTT [Mel čuk(1988)]: Mutual dependence: verb selects subject (& other arguments), but verb form depends on the subject: (4) a. The child is playing. One solution: b. The children are playing. Dependence of child/children on the verb is syntactic Dependence of the verb(form) on the subject is morphological Layers Double dependencies Likewise, clean could depend both on the verb wash & on the noun dish: (5) Wash the dish clean. One solution: Dependence of clean on wash is syntactic (cf. case) Dependence of clean on dish is semantic (cf. gender) (6) My We našli found zal the hall masc pust-ym empty masc.sg.inst

Advantages and Disadvantages of DG Advantages: Close connection to semantic representation More flexible structure for, e.g., non-constituent coordination Easier to capture some typological regularities Large body of computational work on dependency parsing Disadvantages: No constituents makes analyzing coordination difficult No distinction between modifying a constituent vs. an individual word Harder to capture things like, e.g., subject-ect asymmetries References Debusmann, Ralph, Denys Duchier and Geert-Jan M. Kruijff (2004). Extensible Dependency Grammar: A New Methodology. In Proceedings of the Workshop on Recent Advances in Dependency Grammar. pp. 78 85. Duchier, Denys (1999). Axiomatizing Dependency Parsing Using Set Constraints. In Proceedings of the Sixth Meeting on Mathematics of Language. pp. 115 126. Duchier, Denys and Ralph Debusmann (2001). Topological Dependency Trees: A Constraint-based Account of Linear Precedence. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics (ACL). pp. 180 187. Hajič, Jan, Jarmila Panevová, Zdeňka Urešová, Alevtina Bémová, Veronika Kolářová and Petr Pajas (2003). PDT-VALLEX: Creating a Large-coverage Valency Lexicon for Treebank Annotation. In Proceedings of the Second Workshop on Treebanks and Linguistic Theories (TLT 2003). Växjö, Sweden, pp. 57 68. http://w3.msi.vxu.se/~rics/tlt2003/doc/hajic_et_al.pdf. Harper, Mary P. and R. A. Helzerman (1995). Extensions to constraint dependency parsing for spoken language processing. Computer Speech and Language 9, 187 234. References Hellwig, Peter (1986). Dependency Unification Grammar. In Proceedings of the 11th International Conference on Computational Linguistics (COLING). pp. 195 198. Hellwig, Peter (2003). Dependency Unification Grammar. In Vilmos Agel, Ludwig M. Eichinger, Hans-Werner Eroms, Peter Hellwig, Hans Jürgen Heringer and Hening Lobin (eds.), Dependency and Valency, Walter de Gruyter, pp. 593 635. Hudson, Richard A. (1984). Word Grammar. Blackwell. Hudson, Richard A. (1990). English Word Grammar. Blackwell. Hudson, Richard A. (2000). Dependency Grammar Course Notes. http: //www.cs.bham.ac.uk/research/conferences/esslli/notes/hudson.\html. Järvinen, Timo and Pasi Tapanainen (1998). Towards an Implementable Dependency Grammar.

References In Sylvain Kahane and Alain Polguère (eds.), Proceedings of the Workshop on Processing of Dependency-Based Grammars. pp. 1 10. Maruyama, Hiroshi (1990). Structural Disambiguation with Constraint Propagation. In Proceedings of the 28th Meeting of the Association for Computational Linguistics (ACL). pp. 31 38. Mel čuk, Igor (1988). Dependency Syntax: Theory and Practice. State University of New York Press. Menzel, Wolfgang and Ingo Schröder (1998). Decision Procedures for Dependency Parsing Using Graded Constraints. In Sylvain Kahane and Alain Polguère (eds.), Proceedings of the Workshop on Processing of Dependency-Based Grammars. pp. 78 87. Schröder, Ingo (2002). Natural Language Parsing with Graded Constraints. Ph.D. thesis, Hamburg University. Sgall, Petr, Eva Hajičová and Jarmila Panevová (1986). The Meaning of the Sentence in Its Pragmatic Aspects. Reidel. Tapanainen, Pasi and Timo Järvinen (1997). A non-projective dependency parser. References In Proceedings of the 5th Conference on Applied Natural Language Processing. pp. 64 71. Tesnière, Lucien (1959). Éléments de syntaxe structurale. Editions Klincksieck. Zwicky, A. M. (1985). Heads. Journal of Linguistics 21, 1 29.