Beyond Context-Free Grammar


Beyond Context-Free Grammar Weiwei Sun Institute of Computer Science and Technology Peking University October 17, 2017

Last lecture
Questions
How is meaning derived from syntax in mainstream linguistic studies?
How is syntactic analysis conducted in real research?
Why are cross-linguistic variation and dialects important in syntax?

After my lecture
It seems that syntactic trees look like this:
[Figure: four phrase-structure trees built from S, NP, VP, AdjP, AdvP, N pl, V, Adj and Adv, for "Ideas sleep furiously", "Green ideas sleep furiously", "Colorless green ideas sleep furiously" and "Colorless ideas sleep furiously".]

But last lecture
[Figure: trees from last lecture with functional projections: "John is coming" with Q, Foc and IP nodes and uPol / +Pol features, and "you must not ever address him as Sir" with IP, I, NegP, Neg, vP, Adv, a trace t and VP.]

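But last lecture
[Figure: tree from last lecture for the Mandarin example 张三 稀罕 李四 (Zhangsan xihan Lisi), with nodes QP, Q, TP, T, ConjP, Conj, ΣP, Σ (negator) and two occurrences of the vP 稀罕李四 (xihan Lisi).]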

What?


The generative revolution
Chomsky (Syntactic Structures): "By pushing a precise but inadequate formulation to an unacceptable conclusion, we can often expose the exact source of this inadequacy and, consequently, gain a deeper understanding of the linguistic data. ... Obscure and intuition-bound notions can neither lead to absurd conclusions nor provide new and correct ones, ..."
So where is the promised "precise"?

Today
More expressive grammar formalisms:
Multiple Context-Free Grammar, Tree-Adjoining Grammar
Lexical-Functional Grammar, Head-driven Phrase Structure Grammar, Combinatory Categorial Grammar
Minimalist Grammar

Outline
1 Typed feature structure
2 Phrase-structure rules with features
3 Rethink a tree
4 Go Back to Last Lecture's Example
5 Generative-enumerative vs. Model-theoretic approaches

Motivation
Weakness of CFG
CFG treats each grammatical category symbol as atomic, without internal structure. Two categories are either identical or different. There is no mechanism for saying that two categories are alike in some ways but different in others.
Cross-cutting grammatical properties

                        3rd singular subject    plural subject
direct object NP        denies                  deny
no direct object NP     disappears              disappear
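The weakness can be made concrete with a small sketch. It is only an illustration: the symbol names (`V_3sg_trans` and so on) and the feature names `NUM` and `FRAME` are invented here, not taken from the slides. A plain CFG needs one unrelated symbol per combination of properties, while a feature-based description lets us ask in which respects two words are alike.

```python
from itertools import product

# The two cross-cutting properties from the table above.
SUBJECT_NUMBER = ["3sg", "pl"]
OBJECT_FRAME = ["trans", "intrans"]


def atomic_categories():
    """One unrelated CFG symbol per combination of properties."""
    return [f"V_{num}_{frame}" for num, frame in product(SUBJECT_NUMBER, OBJECT_FRAME)]


# The same four verbs described with features (hypothetical names NUM, FRAME).
LEXICON = {
    "denies": {"NUM": "3sg", "FRAME": "trans"},
    "deny": {"NUM": "pl", "FRAME": "trans"},
    "disappears": {"NUM": "3sg", "FRAME": "intrans"},
    "disappear": {"NUM": "pl", "FRAME": "intrans"},
}


def alike_in(word_a, word_b, feature):
    """True if the two words share the value of the given feature."""
    return LEXICON[word_a][feature] == LEXICON[word_b][feature]


print(atomic_categories())
print(alike_in("denies", "deny", "FRAME"), alike_in("denies", "deny", "NUM"))
```

With atomic symbols, nothing relates `V_3sg_trans` to `V_pl_trans`; with features, the shared `FRAME` value is directly visible.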

Using features
Observation
Words and phrases in natural languages typically behave alike in certain respects, but not in others.
Key idea: Using features
The elements associated with linguistic expressions, such as words, can be broken down. Complex categories can be decomposed into features, which are the atomic units.
Linguistic feature: a property-like element that indicates the grammatical behavior of syntactic constituents.
The VP has the feature value past tense.
The verb is a past tense verb.
The noun has a case feature accusative.

Linguistic features
Example

Feature     Example                          Value
person      I go, you go, he goes            1st, 2nd, 3rd
number      he dances, they dance            singular, plural
case        he brings Bob, Bob brings him    nominative, accusative
tense       go, went, gone                   past, present, future
modality    may, can                         conditional, subjunctive

A nice summary of linguistic features: http://www.grammaticalfeatures.net

Feature structure
Description
Use a feature structure to specify grammatical information.
A feature structure is a specification of a set of features, each of which is paired with a particular value.
A feature structure can be represented by an attribute-value matrix (AVM):

[ FEATURE 1   VALUE 1
  FEATURE 2   VALUE 2
  ...
  FEATURE n   VALUE n ]

Example: dog

[ FORM      dog
  NUMBER    singular
  ANIMACY   animate ]

More on feature values
Atomic value
An unstructured value, one with only one part.

[ TENSE    past
  PERSON   2 ]

Complex value
A structured value, itself a feature structure.

[ TENSE       past
  AGREEMENT   [ PERSON   2
                NUMBER   singular ] ]
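One simple way to hold such structures in a program is as nested dictionaries. The sketch below is an assumption about representation, not something the slides prescribe; the helper names `is_atomic` and `get_path` are invented for illustration.

```python
# Feature structures from the slides, written as nested dictionaries.
dog = {"FORM": "dog", "NUMBER": "singular", "ANIMACY": "animate"}

verb = {
    "TENSE": "past",  # atomic value
    "AGREEMENT": {"PERSON": 2, "NUMBER": "singular"},  # complex value
}


def is_atomic(value):
    """A value is atomic if it is not itself a feature structure."""
    return not isinstance(value, dict)


def get_path(fs, path):
    """Follow a sequence of features down into a feature structure."""
    value = fs
    for feature in path:
        value = value[feature]
    return value


print(is_atomic(verb["TENSE"]), is_atomic(verb["AGREEMENT"]))
print(get_path(verb, ["AGREEMENT", "NUMBER"]))
```

A path such as AGREEMENT followed by NUMBER picks out a value inside a complex value, which is what makes "alike in some ways, different in others" expressible.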

Typed feature structure
Entities belonging to a particular type have their own special properties.
Each type of entity has its own constellation of features.
Some features are declared appropriate for entities of the indicated type.
Other features are sanctioned by one of the supertypes.
A type has subtypes and supertypes.
Hierarchical organization
Example

feature-structure
  expression
    word
    phrase
  pos
    noun
    verb
    det
    prep
    adj
    conj
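A type hierarchy of this kind can be sketched as a table that maps each type to its immediate supertype. The encoding and the function names below are illustrative assumptions; only types that are legible on the slide are included.

```python
# Immediate supertype of each type; the root has none.
SUPERTYPE = {
    "feature-structure": None,
    "expression": "feature-structure",
    "pos": "feature-structure",
    "word": "expression",
    "noun": "pos",
    "verb": "pos",
    "det": "pos",
    "prep": "pos",
    "adj": "pos",
    "conj": "pos",
}


def supertypes(type_name):
    """All proper supertypes of a type, nearest first."""
    chain = []
    current = SUPERTYPE[type_name]
    while current is not None:
        chain.append(current)
        current = SUPERTYPE[current]
    return chain


def is_subtype(type_name, ancestor):
    """True if type_name is the ancestor itself or lies below it."""
    return type_name == ancestor or ancestor in supertypes(type_name)


print(supertypes("noun"))
print(is_subtype("word", "expression"), is_subtype("noun", "expression"))
```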

Example: Outside the linguistic world

TYPE            FEATURES/VALUES               IMMEDIATE SUPERTYPE
entity          NAME string, TEL number
individual      BIRTHDAY date                 entity
organization    FOUNDERS list(individual)     entity
university      PRESIDENT individual          organization
department      CHAIR individual              organization

[ NAME   Weiwei Sun
  TEL    18****5 ]

[ NAME   ICST.PKU
  TEL    010-82529922 ]

Example: Outside the linguistic world
Both structures typed as entity:

[ entity
  NAME   Weiwei Sun
  TEL    18****5 ]

[ entity
  NAME   ICST.PKU
  TEL    010-82529922 ]

Example: Outside the linguistic world
The same structures with more specific types:

[ individual
  NAME   Weiwei Sun
  TEL    18****5 ]

[ department
  NAME   ICST.PKU
  TEL    010-82529922 ]

Example: Outside the linguistic world
The more specific types license additional features:

[ individual
  NAME       Weiwei Sun
  BIRTHDAY   **-**-198*
  TEL        18****5 ]

[ department
  NAME       ICST.PKU
  FOUNDERS   ⟨Xuan Wang⟩
  CHAIR      Zongming Guo
  TEL        010-82529922 ]
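The idea that a type inherits features from its supertypes can be sketched as follows. The code assumes the type table reads as entity (NAME, TEL), individual (BIRTHDAY), organization (FOUNDERS), university (PRESIDENT) and department (CHAIR); the function names are invented, and the value types (string, date and so on) are deliberately not checked.

```python
# Immediate supertypes and the features each type declares itself.
SUPERTYPE = {
    "entity": None,
    "individual": "entity",
    "organization": "entity",
    "university": "organization",
    "department": "organization",
}
DECLARED = {
    "entity": {"NAME", "TEL"},
    "individual": {"BIRTHDAY"},
    "organization": {"FOUNDERS"},
    "university": {"PRESIDENT"},
    "department": {"CHAIR"},
}


def appropriate_features(type_name):
    """Features declared on the type plus those inherited from supertypes."""
    features = set()
    current = type_name
    while current is not None:
        features |= DECLARED[current]
        current = SUPERTYPE[current]
    return features


def is_well_typed(type_name, fs):
    """True if every feature in fs is appropriate for the type."""
    return set(fs) <= appropriate_features(type_name)


dept = {"NAME": "ICST.PKU", "FOUNDERS": ["Xuan Wang"],
        "CHAIR": "Zongming Guo", "TEL": "010-82529922"}
print(is_well_typed("department", dept))
print(is_well_typed("individual", dept))
```

The department structure is accepted because NAME and TEL come from entity and FOUNDERS from organization; typing the same structure as individual fails because CHAIR and FOUNDERS are not appropriate there.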

Part-of-speech feature structure
Linguistic features

feature-structure
  expression
    word
    phrase
  pos
    agr-pos [AGR]
      noun
      verb [AUX]
      det
    prep
    adj
    conj

Abbreviations
V:   [word, HEAD verb]
N:   [word, HEAD noun]
NP:  [phrase, HEAD noun]

Linguistic features
Valence feature: VAL
Features of val-cat: COMPS, SPR
Value of VAL: val-cat
Value of COMPS: itr, str, dtr
Value of SPR: +/-

Abbreviations
IV:   [word, HEAD verb, VAL [val-cat, COMPS itr]]
TV:   [word, HEAD verb, VAL [val-cat, COMPS str]]
DTV:  ...

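Linguistic features
(1) a. We created a monster.
    b. our creation of a monster

Example
NP:   [phrase, HEAD noun, VAL [val-cat, COMPS itr, SPR +]]
NOM:  [phrase, HEAD noun, VAL [val-cat, COMPS itr, SPR -]]
S:    [phrase, HEAD verb, VAL [val-cat, COMPS itr, SPR +]]
VP:   [phrase, HEAD verb, VAL [val-cat, COMPS itr, SPR -]]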

Mini type hierarchy

feature-structure
  expression [HEAD, VAL]
    word
    phrase
  val-cat [COMPS, SPR]
  pos
    agr-pos [AGR]
      noun
      verb [AUX]
      det
    prep
    adj
    conj

Outline
2 Phrase-structure rules with features

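Reformulating the grammar rules
VP → V NP

[phrase, HEAD [1] verb, VAL [COMPS itr, SPR -]] → [word, HEAD [1], VAL [COMPS itr, SPR -]]
[phrase, HEAD [1] verb, VAL [COMPS itr, SPR -]] → [word, HEAD [1], VAL [COMPS str, SPR -]] NP
[phrase, HEAD [1] verb, VAL [COMPS itr, SPR -]] → [word, HEAD [1], VAL [COMPS dtr, SPR -]] NP NP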
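The three verb-phrase rules differ only in how many NPs follow the head word, which depends on the itr / str / dtr value. The sketch below is one possible reading, not the lecture's implementation: the feature names HEAD, COMPS and SPR are assumed (following the Sag, Wasow and Bender textbook listed under Reading), the VAL layer is flattened for brevity, and `head_complement` is a made-up name.

```python
# Number of NP complements selected by each value (assumed feature: COMPS).
N_COMPLEMENTS = {"itr": 0, "str": 1, "dtr": 2}


def is_np(sign):
    """An NP is a saturated nominal: noun head, no complements or specifier missing."""
    return sign["HEAD"] == "noun" and sign["COMPS"] == "itr" and sign["SPR"] == "+"


def head_complement(head_word, complements):
    """Return the mother phrase licensed by the rule, or None if it does not apply."""
    if head_word["TYPE"] != "word" or head_word["SPR"] != "-":
        return None
    if len(complements) != N_COMPLEMENTS[head_word["COMPS"]]:
        return None
    if not all(is_np(c) for c in complements):
        return None
    # The mother shares the head word's HEAD value and needs no more complements.
    return {"TYPE": "phrase", "HEAD": head_word["HEAD"], "COMPS": "itr", "SPR": "-"}


denies = {"TYPE": "word", "HEAD": "verb", "COMPS": "str", "SPR": "-"}
the_allegation = {"TYPE": "phrase", "HEAD": "noun", "COMPS": "itr", "SPR": "+"}

print(head_complement(denies, [the_allegation]))
print(head_complement(denies, []))
```

A strictly transitive verb combines with exactly one NP; the call with no complement returns `None`, mirroring the ungrammaticality of "*Alex denies".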

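Reformulating the grammar rules
S → NP VP
[phrase, HEAD [1] verb, VAL [COMPS itr, SPR +]] → NP [HEAD [1], VAL [SPR -]]

NP → (D) NOM
[phrase, HEAD [1] noun, VAL [COMPS itr, SPR +]] → [word, HEAD det, VAL [COMPS itr, SPR +]] [HEAD [1], VAL [COMPS itr, SPR -]]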

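NP → (D) NOM
[phrase, HEAD [1] noun, VAL [COMPS itr, SPR +]] → [word, HEAD det, VAL [COMPS itr, SPR +]] [HEAD [1], VAL [COMPS itr, SPR -]]
[phrase, HEAD [1] noun, VAL [COMPS itr, SPR +]] → [HEAD [1], VAL [SPR +]]

Common and proper nouns
⟨cat, [word, HEAD noun, VAL [SPR -]]⟩
⟨David, [word, HEAD noun, VAL [SPR +]]⟩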

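Tree

[HEAD verb, COMPS itr, SPR +]
  [HEAD noun, COMPS itr, SPR +]
    [word, HEAD noun, COMPS itr, SPR +]  Alex
  [HEAD verb, COMPS itr, SPR -]
    [word, HEAD verb, COMPS str, SPR -]  denies
    [HEAD noun, COMPS itr, SPR +]
      [word, HEAD det, COMPS itr, SPR +]  the
      [HEAD noun, COMPS itr, SPR -]
        [word, HEAD noun, COMPS itr, SPR -]  allegation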

Generalizing grammar rules
PP attachment
VP → VP PP
NOM → NOM PP
Combining them
[phrase, HEAD [1], VAL [COMPS itr, SPR -]] → [HEAD [1], VAL [SPR -]] PP
Generalization
Only one rule is needed.

Agreement
Two features of agr-cat: PER, NUM
Lexical entry & Grammar rule
⟨David, [word, HEAD [noun, AGR [agr-cat, PER 3rd, NUM sg]], VAL [SPR +]]⟩
[phrase, HEAD [1] verb, VAL [COMPS itr, SPR +]] → NP [HEAD [AGR [2]]]  [HEAD [1] [AGR [2]], VAL [SPR -]]
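Agreement amounts to requiring that the subject and the head phrase carry the same AGR value. The following sketch makes that check explicit under simplifying assumptions: AGR is stored at the top level of each structure rather than inside HEAD, the entries for the two verb phrases are invented for illustration, and `head_specifier` is a hypothetical name.

```python
def head_specifier(subject, head_phrase):
    """Combine a subject NP with a head phrase if their AGR values are identical."""
    if subject["SPR"] != "+" or head_phrase["SPR"] != "-":
        return None
    if subject["AGR"] != head_phrase["AGR"]:
        return None  # agreement clash
    return {"HEAD": head_phrase["HEAD"], "AGR": head_phrase["AGR"],
            "COMPS": "itr", "SPR": "+"}


david = {"HEAD": "noun", "AGR": {"PER": "3rd", "NUM": "sg"},
         "COMPS": "itr", "SPR": "+"}
# Hypothetical entries for "denies the allegation" / "deny the allegation".
denies_vp = {"HEAD": "verb", "AGR": {"PER": "3rd", "NUM": "sg"},
             "COMPS": "itr", "SPR": "-"}
deny_vp = {"HEAD": "verb", "AGR": {"PER": "3rd", "NUM": "pl"},
           "COMPS": "itr", "SPR": "-"}

print(head_specifier(david, denies_vp))
print(head_specifier(david, deny_vp))
```

Because PER and NUM are bundled inside one AGR value, a single identity check covers both features.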

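Head feature principle
Head Feature Principle (HFP)
In any headed phrase, the HEAD value of the mother and the HEAD value of the head daughter must be identical.
[AVMs: the grammar rules restated without HEAD identity tags. What remains in each rule is the word / phrase typing, the valence values (itr, str; SPR + / -), and the tag [2] that identifies the AGR value of the NP with that of the head daughter.]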
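Stated as a principle, head-feature sharing becomes a condition that can be checked on any tree, independently of which rule built it. A minimal sketch, assuming trees stored as dictionaries with a `HEAD-DTR` key for the head daughter and a hypothetical `NON-HEAD-DTRS` list for the others:

```python
def satisfies_hfp(sign):
    """Check the Head Feature Principle on a sign and, recursively, its daughters."""
    head_dtr = sign.get("HEAD-DTR")
    if head_dtr is None:
        return True  # a word has no head daughter, so nothing to check
    if sign["HEAD"] != head_dtr["HEAD"]:
        return False
    others = sign.get("NON-HEAD-DTRS", [])
    return all(satisfies_hfp(d) for d in [head_dtr, *others])


denies = {"HEAD": "verb"}
allegation = {"HEAD": "noun"}
the = {"HEAD": "det"}
np = {"HEAD": "noun", "HEAD-DTR": allegation, "NON-HEAD-DTRS": [the]}
vp = {"HEAD": "verb", "HEAD-DTR": denies, "NON-HEAD-DTRS": [np]}
bad_vp = {"HEAD": "noun", "HEAD-DTR": denies, "NON-HEAD-DTRS": [np]}

print(satisfies_hfp(vp), satisfies_hfp(bad_vp))
```

Once the principle is checked globally like this, individual rules no longer need to repeat the HEAD identity.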

Outline
3 Rethink a tree

[ PHON ⟨Alex, denies, the, allegation⟩
  HEAD verb, COMPS itr, SPR +
  DTRS [ head-comp-struc
         COMP-DTR [ PHON ⟨Alex⟩
                    HEAD noun, COMPS itr, SPR +
                    DTRS [ head-comp-struc
                           HEAD-DTR [ word
                                      PHON ⟨Alex⟩
                                      HEAD noun, COMPS itr, SPR + ] ] ]
         HEAD-DTR [ PHON ⟨denies, the, allegation⟩
                    HEAD verb, COMPS itr, SPR -
                    DTRS [ head-comp-struc
                           HEAD-DTR [ word
                                      PHON ⟨denies⟩
                                      HEAD verb, COMPS str, SPR - ]
                           COMP-DTR [ PHON ⟨the, allegation⟩
                                      HEAD noun, COMPS itr, SPR +
                                      DTRS ... ] ] ] ] ]

Types of phrases
Phrase structure can be represented by the various daughters attributes of phrasal signs.
Each phrase has a DTRS attribute, which has a constituent-structure value.
This DTRS value corresponds to what we view in a tree as daughters.
By distinguishing different kinds of constituent structures, we can define different kinds of constructions in a language.
Trees are used as a convenient graphic representation.

Types of phrases

constituent-struc
  head-struc [HEAD-DTR sign]
    head-comp-struc [COMP-DTRS list(sign)]
    ...
  coord-struc [CONJ-DTRS set(sign), CONJUNCTION sign]
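To see that a sign with DTRS carries the same information as a tree, the sketch below turns a nested sign into a bracketed string. It is an illustration under stated assumptions: signs are dictionaries, the attribute names `HEAD-DTR` and `COMP-DTRS` are used for the daughters, daughters are ordered by where their first word occurs in the mother's PHON (which only works when words are not repeated), and the internal structure of "the allegation" is left out, as on the slide.

```python
def daughters(sign):
    """Collect the daughters recorded in a sign's DTRS value (empty for words)."""
    dtrs = sign.get("DTRS")
    if dtrs is None:
        return []
    result = []
    if "HEAD-DTR" in dtrs:
        result.append(dtrs["HEAD-DTR"])
    result.extend(dtrs.get("COMP-DTRS", []))
    return result


def to_brackets(sign):
    """Render a sign as a bracketed tree, using PHON to order the daughters."""
    kids = daughters(sign)
    if not kids:
        return " ".join(sign["PHON"])
    kids.sort(key=lambda d: sign["PHON"].index(d["PHON"][0]))
    return "[" + " ".join(to_brackets(k) for k in kids) + "]"


alex_word = {"PHON": ["Alex"]}
alex_np = {"PHON": ["Alex"], "DTRS": {"HEAD-DTR": alex_word}}
denies = {"PHON": ["denies"]}
the_allegation = {"PHON": ["the", "allegation"]}  # daughters omitted
vp = {"PHON": ["denies", "the", "allegation"],
      "DTRS": {"HEAD-DTR": denies, "COMP-DTRS": [the_allegation]}}
s = {"PHON": ["Alex", "denies", "the", "allegation"],
     "DTRS": {"HEAD-DTR": vp, "COMP-DTRS": [alex_np]}}

print(to_brackets(s))  # [[Alex] [denies the allegation]]
```

Note that DTRS itself says nothing about linear order; the order is recovered from PHON, which is why the head daughter can be listed first even when it is pronounced last.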

Outline
4 Go Back to Last Lecture's Example

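[AVM: the tree for Zhangsan xihan Lisi from last lecture re-encoded as a sign. The top sign has PHON ⟨Zhangsan, xihan, Lisi⟩ and category Q; its DTRS value has an H-DTR (the Q head) and a C-DTR with PHON ⟨Zhangsan, xihan, Lisi⟩ and category T, tagged [1]. Inside that daughter, the C-DTR is the noun with PHON ⟨Zhangsan⟩ and the H-DTR has PHON ⟨xihan, Lisi⟩ and shares tag [1]. Further down, a conj-headed sign tagged [2] with PHON ⟨xihan, Lisi⟩ contains H-DTRs sharing tag [2] and two C-DTRs sharing tag [3], a v-headed sign with PHON ⟨xihan, Lisi⟩ whose own DTRS value is elided (...).]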

Outline
5 Generative-enumerative vs. Model-theoretic approaches

A non-derivational approach
A CFG rule: S → NP VP
Top-down: an S consists of an NP and a VP.
Bottom-up: an NP and a VP make up an S.
Constraint-based approach with feature structures
A structure is well-formed iff it satisfies all relevant constraints. Constraints are not violable. They come from:
lexical entries
phrase-structure rules (as definitions of types)
principles
Where is the derivation?

Representational or Derivational
Two categories of grammars
Derivationally oriented grammars
Representationally oriented grammars
Derivationally oriented grammar
A grammar generally includes a set of structural atoms (the basis) of the derivation. The derivational procedure constructs syntactic structures using operations of two types.
1 Structural composition: Either previously constructed syntactic representations or elements of the basis are combined to form larger representations. This is fundamental: such operations provide a way to generate the requisite infinity of possible structures.
2 Transformations: Modify an individual syntactic representation in some specified fashion.
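The derivational view can be sketched with a toy context-free grammar: the words are the basis, and each rule application composes previously built pieces into a larger one. The grammar below is invented for illustration and is deliberately non-recursive so that enumeration terminates; transformations are not modelled.

```python
from itertools import product

# A toy grammar: each nonterminal maps to its possible right-hand sides.
RULES = {
    "S": [["NP", "VP"]],
    "NP": [["Alex"], ["the", "allegation"]],
    "VP": [["V", "NP"]],
    "V": [["denies"]],
}


def derive(symbol):
    """Yield every word sequence derivable from symbol (finite grammars only)."""
    if symbol not in RULES:
        yield [symbol]  # a terminal: an element of the basis
        return
    for rhs in RULES[symbol]:
        # Structural composition: combine the derivations of the parts.
        for parts in product(*(list(derive(s)) for s in rhs)):
            yield [word for part in parts for word in part]


sentences = [" ".join(words) for words in derive("S")]
print(sentences)
```

The grammar here specifies how to build expressions step by step; the set of well-formed sentences is whatever the procedure outputs.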

Representational or Derivational
Two categories of grammars
Derivationally oriented grammars
Representationally oriented grammars
Representationally oriented grammar
A grammar determines the set of linguistic expressions using a system of well-formedness constraints.
Each constraint provides an evaluation of some part of the linguistic expression.
The well-formedness of the entire linguistic expression is determined by combining the evaluations of the individual constraints.
Representationally oriented grammars do not specify how to find well-formed linguistic expressions, but only what properties well-formed expressions must have.
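In code, the representational view looks like a set of independent predicates whose verdicts are combined. The sketch below is a hedged illustration: the three constraints and their names are invented, and they use the assumed features HEAD, COMPS and SPR to say what counts as a sentence, without saying how to build one.

```python
def head_is_verb(sign):
    return sign["HEAD"] == "verb"


def complements_saturated(sign):
    return sign["COMPS"] == "itr"


def specifier_realized(sign):
    return sign["SPR"] == "+"


# Each constraint evaluates one part of the expression.
SENTENCE_CONSTRAINTS = [head_is_verb, complements_saturated, specifier_realized]


def evaluate(sign):
    """Evaluation of every constraint, keyed by the constraint's name."""
    return {c.__name__: c(sign) for c in SENTENCE_CONSTRAINTS}


def is_well_formed(sign):
    """Well-formed iff all constraints are satisfied; none may be violated."""
    return all(evaluate(sign).values())


s = {"HEAD": "verb", "COMPS": "itr", "SPR": "+"}
vp = {"HEAD": "verb", "COMPS": "itr", "SPR": "-"}
print(is_well_formed(s), is_well_formed(vp))
print(evaluate(vp))
```

Nothing in this code searches for well-formed structures; it only decides whether a given candidate has the required properties.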

Reading
3, Syntactic Theory: A Formal Introduction
2.3, Aspects of the Theory of Syntax
* Introduction, Head-driven Phrase Structure Grammar