Dependency Grammar. Lilja Øvrelid INF5830 Fall Dependency Grammar 1(37)

Similar documents
CS 598 Natural Language Processing

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Proof Theory for Syntacticians

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Prediction of Maximal Projection for Semantic Role Labeling

LING 329 : MORPHOLOGY

Constraining X-Bar: Theta Theory

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

AQUA: An Ontology-Driven Question Answering System

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Grammars & Parsing, Part 1:

An Introduction to the Minimalist Program

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Ensemble Technique Utilization for Indonesian Dependency Parser

Control and Boundedness

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

The Role of the Head in the Interpretation of English Deverbal Compounds

Chapter 4: Valence & Agreement CSLI Publications

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Some Principles of Automated Natural Language Information Extraction

Underlying and Surface Grammatical Relations in Greek consider

On the Notion Determiner

"f TOPIC =T COMP COMP... OBJ

Achim Stein: Diachronic Corpora Aston Corpus Summer School 2011

Parsing of part-of-speech tagged Assamese Texts

Developing a TT-MCTAG for German with an RCG-based Parser

Hindi Aspectual Verb Complexes

Dependency Annotation of Coordination for Learner Language

Natural Language Processing. George Konidaris

Accurate Unlexicalized Parsing for Modern Hebrew

Minimalism is the name of the predominant approach in generative linguistics today. It was first

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Feature-Based Grammar

Words come in categories

Visual CP Representation of Knowledge

Construction Grammar. University of Jena.

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Modeling full form lexica for Arabic

LNGT0101 Introduction to Linguistics

Beyond the Pipeline: Discrete Optimization in NLP

Dissertation Summaries. Headedness in Word Formation and Lexical Semantics: Evidence from Italiot and Cypriot (University of Patras, 2014)*

Ch VI- SENTENCE PATTERNS.

Multiple case assignment and the English pseudo-passive *

Applications of memory-based natural language processing

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation

Derivational and Inflectional Morphemes in Pak-Pak Language

The Interface between Phrasal and Functional Constraints

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

The Pennsylvania State University. The Graduate School. College of the Liberal Arts THE TEACHABILITY HYPOTHESIS AND CONCEPT-BASED INSTRUCTION

Loughton School s curriculum evening. 28 th February 2017

Common Core State Standards for English Language Arts

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading

Update on Soar-based language processing

Dependency, licensing and the nature of grammatical relations *

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

The building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective

LTAG-spinal and the Treebank

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Chapter 3: Semi-lexical categories. nor truly functional. As Corver and van Riemsdijk rightly point out, There is more

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

Ontologies vs. classification systems

Survey on parsing three dependency representations for English

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

A Framework for Customizable Generation of Hypertext Presentations

Hindi-Urdu Phrase Structure Annotation

Argument structure and theta roles

THE VERB ARGUMENT BROWSER

Specifying a shallow grammatical for parsing purposes

Seminar - Organic Computing

Adapting Stochastic Output for Rule-Based Semantics

California Department of Education English Language Development Standards for Grade 8

Heads and history NIGEL VINCENT & KERSTI BÖRJARS The University of Manchester

Pseudo-Passives as Adjectival Passives

Content Language Objectives (CLOs) August 2012, H. Butts & G. De Anda

Refining the Design of a Contracting Finite-State Dependency Parser

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

A Graph Based Authorship Identification Approach

A Computational Evaluation of Case-Assignment Algorithms

Linking Task: Identifying authors and book titles in verbose queries

Using dialogue context to improve parsing performance in dialogue systems

Language acquisition: acquiring some aspects of syntax.

Theoretical Syntax Winter Answers to practice problems

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Som and Optimality Theory

Derivations (MP) and Evaluations (OT) *

LFG Semantics via Constraints

Character Stream Parsing of Mixed-lingual Text

MODELING DEPENDENCY GRAMMAR WITH RESTRICTED CONSTRAINTS. Ingo Schröder Wolfgang Menzel Kilian Foth Michael Schulz * Résumé - Abstract

Compositional Semantics

Corpus Linguistics (L615)

Transcription:

Dependency Grammar Lilja Øvrelid INF5830 Fall 2015 With thanks to Markus Dickinson, Sandra Kübler and Joakim Nivre Dependency Grammar 1(37) Overview INF5830 so far general methodology statistical, data-driven approaches words, frequencies The next four (or so) weeks theoretical background and practical experience with two NLP tasks deeper processing : syntactic and semantic analysis I data-driven dependency parsing II semantic role labeling (SRL) experimental methodology supervised machine learning classification evaluation Course overview Dependency Grammar 2(37)

Course overview Part I: Data-driven dependency parsing Dependency grammar (today) Dependency parsing (next week) Project A released next week Experimental methodology (next Thursday) Project A (written report due Oct. 23rd): training and evaluation of parsers for several languages CoNLL-X (2006, 2007) MaltParser: freely available software for data-driven dependency parsing Dependency Grammar 3(37) Part II: Semantic Role Labeling (SRL) Course overview Semantic roles, theoretical (Oct 19th) Semantic role labeling, practical/computational (Oct 26th) Project B (written report due Nov. 6th) CoNLL 2008: syntactic and semantic parsing of English solve part of this task: semantic argument classification feature engineering (using syntactic analysis) supervised machine learning evaluation Dependency Grammar 4(37)

Course overview Lectures and groups Curriculum: largely research literature Classics from linguistics, e.g. Zwicky Computational linguistics research literature, e.g. Nivre, Gildea & Jurafsky Lectures: introduction to topics, synthesis of curriculum Group teaching: focused reading groups (please prepare!!) practical sessions related to obligatory assignments Dependency Grammar 5(37) Today s lecture Introduction Very brief repetition of basic principles of syntax: form vs function constituents and phrases context-free grammars Dependency Grammar basic concepts: head, dependent comparison to constituent structure theoretical issues: head criteria, tricky constructions formal properties Dependency Grammar 6(37)

Syntax Syntax Syntax: study of the structure of sentences Who does what to whom? Wealth of theories: some differences, a lot in common Government and Binding (GB) Minimalist Program (MP) Head-driven phrase structure grammar (HPSG) Lexical Functional Grammar (LFG) Categorial Grammar Dependency Grammar... Dependency Grammar 7(37) Why bother? Syntax Theoretical syntacticians concerned with grammaticality The President nominated a new Supreme Court justice *President the new Supreme justice Court nominated Relevant for some NLP applications: text generation grammar checking But mostly want systems that are robust and can handle realistic (noisy) language Dependency Grammar 8(37)

Syntax Why bother? Parsing provides scaffolding for semantic analysis Direct, down-stream usage of syntactic information opinion mining information extraction syntax-informed statistical machine translation sentence compression etc. Dependency Grammar 9(37) Constituents Syntax The words in a sentence are organized into groupings function as a whole relate to other words as a unit The dog ate my homework linguistic tests of constituency Dependency Grammar 10(37)

Syntax Form and function Syntactic form - constituents are described using parts of speech and phrases phrases - larger constituents above word level phrases named after the head - central, obligatory member e.g. NP, VP, PP Syntactic function - constituents are described by their role in the sentence as a whole Subject (Direct and Indirect) Object Adverbial Dependency Grammar 11(37) Arguments vs. adjuncts Syntax Subconstituents which are not heads: arguments or adjuncts arguments: selected by the head and complete the meaning adjuncts: not selected by the head and refine the meaning Different PoS may take argument(s): John invited Mary to the event John s invitation of Mary to the event caused quite a stir Mary found the book under the couch Adjuncts are not obligatory and may often iterate John ran on Sunday / with Mary / in the park Dependency Grammar 12(37)

Syntax Phrase structure grammars Capture constituent status and ordering Formal model: context-free grammar 1. S NP VP 2. NP D N 3. VP V NP Syntactic structure as phrase structure trees Dependency Grammar 13(37) Now: Dependency Grammar Syntax An alternative to phrase structure representations Syntactic functions are central Claimed to be closer to semantic analysis obj sbj Small birds sing loud songs Dependency Grammar 14(37)

DG Dependency Grammar Not a coherent grammatical framework: wide range of different kinds of DG just as there are wide ranges of generative syntax Different core ideas than phrase structure grammar We will base a lot of our discussion on [Mel čuk 1988] Dependency grammar is important for those interested in CL: Increasing interest in dependency-based approaches to syntactic parsing in recent years (e.g., CoNLL-X shared task, 2006) Downstream applications: machine translation, question answering, ontology learning, sentiment analysis, etc. Dependency Grammar 15(37) Dependency Syntax DG The basic idea: Syntactic structure consists of lexical items, linked by binary asymmetric relations called dependencies. In the (translated) words of Lucien Tesnière [Tesnière 1959]: The sentence is an organized whole, the constituent elements of which are words. [1.2] Every word that belongs to a sentence ceases by itself to be isolated as in the dictionary. Between the word and its neighbors, the mind perceives connections, the totality of which forms the structure of the sentence. [1.3] The structural connections establish dependency relations between the words. Each connection in principle unites a superior term and an inferior term. [2.1] The superior term receives the name governor. The inferior term receives the name subordinate. Thus, in the sentence Alfred parle [...], parle is the governor and Alfred the subordinate. [2.2] Dependency Grammar 16(37)

DG Overview: constituency (1) Small birds sing loud songs What you might be more used to seeing: S NP VP Small birds sing NP loud songs Dependency Grammar 17(37) Overview: dependency The corresponding dependency tree representations [Hudson 2000]: obj DG sbj Small birds sing loud songs sbj sing obj birds small songs loud Dependency Grammar 18(37)

DG Constituency vs. Relations DG is based on relationships between words, i.e., dependency relations A B means A governs B or B depends on A... Dependency relations can refer to syntactic properties, semantic properties, or a combination of the two These relations are generally things like subject, object/complement, (pre-/post-)adjunct, etc. Subject/Agent: John fished. Object/Patient: Mary hit John. PSG is based on groupings, or constituents Grammatical relations are not usually seen as primitives, but as being derived from structure Dependency Grammar 19(37) Simple relation example DG For the sentence John loves Mary, we have the relations: loves subj John loves obj Mary Both John and Mary depend on loves, which makes loves the head, or root, of the sentence (i.e., there is no word that governs loves) The structure of a sentence, then, consists of the set of pairwise relations among words. Dependency Grammar 20(37)

DG Dependency Structure p obj pc sbj Economic news had little effect on financial markets. Dependency Grammar 21(37) Terminology DG Superior Head Governor Regent. Inferior Dependent Modifier Subordinate. Dependency Grammar 22(37)

DG Comparison Dependency structures explicitly represent head-dependent relations (directed arcs), functional categories (arc labels), possibly some structural categories (parts-of-speech). Phrase structures explicitly represent phrases (nonterminal nodes), structural categories (nonterminal labels), possibly some functional categories (grammatical functions). Hybrid representations may combine all elements. Dependency Grammar 23(37) Some Theoretical Issues DG What is the nature of lexical elements (nodes)? Morphemes? Word forms? Multi-word units? What is the nature of dependency types (arc labels)? Grammatical functions? Semantic roles? What are the criteria for identifying heads and dependents? What are the formal properties of dependency structures? Dependency Grammar 24(37)

Syntactic heads Syntactic heads Central concept in syntactic theory Seminal paper Heads [Zwicky 1985]: The intuition to be captured with the notion HEAD is that in certain syntactic constructs one constituent in some sense characterizes or dominates the whole 1. Det + N those penguins 2. V + NP control those penguins 3. Aux + VP must control those penguins 4. P + NP toward those penguins 5. NP + VP we control those penguins 6. Comp + S that we control those penguins semantic argument ( kind of thing ), subcategorisand (lexical property), morphosyntactic locus (bearer of inflection) Dependency Grammar 25(37) Criteria for Heads and Dependents Syntactic heads Criteria for a syntactic relation between a head H and a dependent D in a construction C [Zwicky 1985, Hudson 1990]: 1. H determines the syntactic category of C; H can replace C. 2. H determines the semantic category of C; D specifies H. 3. H is obligatory; D may be optional. 4. H selects D and determines whether D is obligatory. 5. The form of D depends on H (agreement or government). 6. The linear position of D is specified with reference to H. Issues: Syntactic (and morphological) versus semantic criteria Exocentric versus endocentric constructions Dependency Grammar 26(37)

Syntactic heads Criteria for Heads and Dependents Endocentric constructions: dependents are optional Economic news had little effect on [financial] markets Exocentric constructions: head cannot readily replace the whole Economic news had little effect on financial [markets] head-complement relations are exocentric head-modifier relations are endocentric Dependency Grammar 27(37) Some Clear Cases Syntactic heads Construction Head Dependent Exocentric Verb Subject (sbj) Verb Object (obj) Endocentric Verb Adverbial (vmod) Noun Attribute () sbj obj vmod Economic news suddenly affected financial markets. Dependency Grammar 28(37)

Syntactic heads Some Tricky Cases Complex verb groups (auxiliary main verb) Subordinate clauses (complementizer verb) Coordination (coordinator conjuncts) Prepositional phrases (preposition nominal) Punctuation sbj vg? sbar??p obj sbj vc pc? co? cj? I can see that they rely on this and that. Dependency Grammar 29(37) Dependency Graphs Formal properties A dependency structure can be defined as a directed graph G, consisting of a set V of nodes, a set E of arcs (edges), a linear precedence order < on V (not in every theory) Labeled graphs: Nodes in V are labeled with word forms (and annotation). Arcs in E are labeled with dependency types. Notational conventions (i,j V): i j (i,j) E Dependency Grammar 30(37)

Formal properties Formal Properties of Dependency Graphs antisymmetric: if A B, then B A If A governs B, B does not govern A cf. box lunch (lunch box) vs. lunch box (box lunch) antireflexive: if A B, then B A No word can govern itself. antitransitive: if A B and B C, then A C These are direct dependency relations cf. a usually reliable source: source reliable & reliable usually, but source usually labeled:, has a label (r) Dependency Grammar 31(37) Formal Conditions on Dependency Graphs Intuitions: Syntactic structure is complete (Connectedness). Syntactic structure is hierarchical (Acyclicity). Every word has at most one syntactic head (Single-Head). Formal properties Connectedness can be enforced by adding a special root node. p pred obj pc sbj root Economic news had little effect on financial markets. Dependency Grammar 32(37)

Formal properties Formal Conditions on Dependency Graphs G is (weakly) connected: For every node i there is a node j such that i j or j i. G is acyclic: If i j then not j i. G obeys the single-head constraint: If i j, then not k j, for any k i. Dependency Grammar 33(37) Projectivity Projectivity A head (A) and a dependent (B) must be adjacent: A is adjacent to B provided that every word between A and B is a subordinate of A. A projective graph: If i j then i k, for any k such that i<k<j or j<k<i (2) with great difficulty (3) *great with difficulty with difficulty difficulty great *great with difficulty is ruled out because branches would have to cross in that case Formal properties Dependency Grammar 34(37)

Formal properties Projectivity Most theoretical frameworks do not assume projectivity. Non-projective structures are needed to account for long-distance dependencies, free word order. vg pc p sbj obj What did economic news have little effect on? Dependency Grammar 35(37) Advantages and Disadvantages of DG Wrapping up Advantages: Close connection to semantic representation Easier to capture some typological regularities Vast & expanding body of computational work on dependency parsing Disadvantages: No constituents makes analyzing coordination difficult No distinction between modifying a constituent vs. an individual word Dependency Grammar 36(37)

Wrapping up Thursday: Reading group, in-class exercises Zwicky and Nivre Please prepare! Monday 5th: Dependency Parsing Dependency Grammar 37(37) Richard A. Hudson. 1990. English Word Grammar. Blackwell. References Richard A. Hudson. 2000. Dependency grammar course notes. http://www.cs.bham.ac.uk/research/conferences/esslli/notes/hudson.\html. Igor Mel čuk. 1988. Dependency Syntax: Theory and Practice. State University of New York Press. Lucien Tesnière. 1959. Éléments de syntaxe structurale. Editions Klincksieck. A. M. Zwicky. 1985. Heads. Journal of Linguistics, 21:1 29. Dependency Grammar 37(37)