Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Similar documents
The building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective

Chapter 4: Valence & Agreement CSLI Publications

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

On the Notion Determiner

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

An Interactive Intelligent Language Tutor Over The Internet

BULATS A2 WORDLIST 2

Feature-Based Grammar

Underlying and Surface Grammatical Relations in Greek consider

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

CS 598 Natural Language Processing

Som and Optimality Theory

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Construction Grammar. University of Jena.

LFG Semantics via Constraints

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Parsing of part-of-speech tagged Assamese Texts

Words come in categories

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Multiple case assignment and the English pseudo-passive *

Developing a TT-MCTAG for German with an RCG-based Parser

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Heads and history NIGEL VINCENT & KERSTI BÖRJARS The University of Manchester

Control and Boundedness

Ch VI- SENTENCE PATTERNS.

Hindi Aspectual Verb Complexes

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Structure-Preserving Extraction without Traces

Phenomena of gender attraction in Polish *

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Type-driven semantic interpretation and feature dependencies in R-LFG

An Introduction to the Minimalist Program

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Proof Theory for Syntacticians

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

cmp-lg/ Jul 1995

Construction Grammar. Laura A. Michaelis.

Compositional Semantics

Pseudo-Passives as Adjectival Passives

Korean ECM Constructions and Cyclic Linearization

Advanced Topics in HPSG

The Pennsylvania State University. The Graduate School. College of the Liberal Arts THE TEACHABILITY HYPOTHESIS AND CONCEPT-BASED INSTRUCTION

Constraining X-Bar: Theta Theory

Emmaus Lutheran School English Language Arts Curriculum

Specifying Logic Programs in Controlled Natural Language

Switched Control and other 'uncontrolled' cases of obligatory control

Minimalism is the name of the predominant approach in generative linguistics today. It was first

Dependency, licensing and the nature of grammatical relations *

Argument structure and theta roles

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Constructions License Verb Frames

Controlled vocabulary

Grammars & Parsing, Part 1:

The Interface between Phrasal and Functional Constraints

ON THE SYNTAX AND SEMANTICS

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

a) analyse sentences, so you know what s going on and how to use that information to help you find the answer.

Natural Language Processing. George Konidaris

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Context Free Grammars. Many slides from Michael Collins

Word Formation is Syntactic: Raising in Nominalizations

"f TOPIC =T COMP COMP... OBJ

Indeterminacy by Underspecification Mary Dalrymple (Oxford), Tracy Holloway King (PARC) and Louisa Sadler (Essex) (9) was: ( case) = nom ( case) = acc

A Grammar for Battle Management Language

The Strong Minimalist Thesis and Bounded Optimality

LING 329 : MORPHOLOGY

Developing Grammar in Context

Procedia - Social and Behavioral Sciences 154 ( 2014 )

Today we examine the distribution of infinitival clauses, which can be

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80.

Advanced Grammar in Use

Some Principles of Automated Natural Language Information Extraction

AN EXPERIMENTAL APPROACH TO NEW AND OLD INFORMATION IN TURKISH LOCATIVES AND EXISTENTIALS

Theoretical Syntax Winter Answers to practice problems

Constructions with Lexical Integrity *

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Language acquisition: acquiring some aspects of syntax.

Agree or Move? On Partial Control Anna Snarska, Adam Mickiewicz University

MODELING DEPENDENCY GRAMMAR WITH RESTRICTED CONSTRAINTS. Ingo Schröder Wolfgang Menzel Kilian Foth Michael Schulz * Résumé - Abstract

Linguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis

UC Berkeley Berkeley Undergraduate Journal of Classics

AQUA: An Ontology-Driven Question Answering System

AN ANALYSIS OF GRAMMTICAL ERRORS MADE BY THE SECOND YEAR STUDENTS OF SMAN 5 PADANG IN WRITING PAST EXPERIENCES

Arabic language and its specification in TDL

Writing a composition

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Implementing the Syntax of Japanese Numeral Classifiers

Specifying a shallow grammatical for parsing purposes

THE VERB ARGUMENT BROWSER

SAMPLE. Chapter 1: Background. A. Basic Introduction. B. Why It s Important to Teach/Learn Grammar in the First Place

A relational approach to translation

Pre-Processing MRSes

Adapting Stochastic Output for Rule-Based Semantics

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

CHILDREN S POSSESSIVE STRUCTURES: A CASE STUDY 1. Andrew Radford and Joseph Galasso, University of Essex

Transcription:

to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about the objects (sentences) that are considered to belong to a language. does not have phrase structure rules or transformations. Instead, it is declarative, non-derivational, and constraint-based. Kordula De Kuthy April 29, 2009 Sets of constraints which hold simultaneously determine the collections of admissible linguistic structures without defining an order of the derivation or generation of signs. 1 / 31 2 / 31 The Beginnings to to Is there such a thing as one single coherent framework that was created sometime in the early days of, remained unchanged, and is employed by all linguists working in? Two different formalisms have been informally presented by Carl Pollard and Ivan Sag: One presented in their first book of 1987, Information-based Syntax and Semantics (Pollard & Sag 1987) the other one in their second book of 1994, Head-Driven Phrase Structure Grammar (Pollard & Sag 1994) began in the mid 1980s inspired by several other frameworks: Government and Binding (Chomsky 1981) ideas of the combinatorial system of from Categorial Grammar combination of feature structure and phrase structure from Lexical Functional Grammar (LFG) (Bresnan 1982) and Generalized Phrase Structure Grammar(GPSG) (Gazdar et al. 1985) 3 / 31 4 / 31

87-94 to grammars from a linguistic perspective to 87 is a typical instance of a unification-based grammar formalism. Underlying intuition is that linguist specify pieces of partial information about a language in their grammars. All pieces of partial information about a language are then combined by operations such as unification to obtain all available information about that language. 94 can be called an object-based grammar formalism or a constraint-based grammar formalism. An grammar, from a linguistic perspective, consists of a) a lexicon: licensing basic words b) lexical rules: licensing derived words c) immediate dominance (ID) schemata: licensing constituent structure d) linear precedence (LP) statements: constraining word order e) a set of grammatical principles: expressing generalizations about It envisions an architecture in which linguists use a logical language on order to specify language as a collection of total objects. 5 / 31 6 / 31 grammars from a formal perspective to The signature of an grammar to The signature An grammar formally consists of I. the signature as declaration of the domain, and II. the theory constraining the domain. defines the ontology ( declaration of what exists ): consists of which kind of objects are distinguished, and which properties of which objects are modeled. the type hierarchy (or sort hierarchy) and the appropriateness conditions, defining which type has which appropriate attributes (or features) with which appropriate values. 7 / 31 8 / 31

An example type hierarchy to to Signs sign PHON list(phonstring) SYNSEM synsem word phrase Models of What do the mathematical structures used as model for theories look like? The objects are modelled by typed feature structures, which can be notated as directed graphs. Since these models represent objects in the world (and not knowledge about objects in the world) they are total with respect to the ontology declared in the signature. Formally speaking, the feature structures are Part of speech p-o-s totally well-typed: Every type has every one of the attributes and their values which are appropriate for it. sort-resolved: Every type is maximally specific. Note that type and sort are often used synonymously, as well as attribute and feature. adj adv det noun prep 9 / 31 10 / 31 How do we express a theory? to (cont.) to A description language and its abbreviating AVM (attribute value matrix) notation is used to talk about sets of objects. consists of three building blocks: Type descriptions single out all objects of a particular type, e.g., word Attribute-value pairs describe objects that have a particular property. The attribute must be appropriate for the particular type of object, and the value can be any kind of description, e.g., [ SPOUSE [ NAME mary ]] Complex descriptions are obtained through conjunction ( ), disjunction () and negation ( ). In the AVM notation, conjunction is implicit. Tags (structure sharing) to specify token identity, e.g. 1 11 / 31 12 / 31

An example AVM - The pronoun she word PHON <she> local cat [ ] noun CAT CASE nom SUBCAT ppro ref SYNSEM LOC PER third CONT INDEX 1 NUM sing GEND fem RESTR {} context psoa CONTEXT BACKGR RELN female INST 1 to The theory of an grammar A theory is a set of description language statements, often referred to as the constraints. The theory singles out a subset of the objects declared in the signature, namely those which are grammatical. A linguistic object is admissible with respect to a theory iff it satisfies each of the descriptions in the theory and so does each of its substructures. to 13 / 31 14 / 31 Connection between, Model and Empirical Domain to Signs to phenomena linguistic objects predicts modelling specify set of descriptions constraints formal theory model feature structures are licensed word synsem LOCAL local NON-LOCAL non-local sign PHON list(phonstring) SYNSEM synsem [ ] phrase DTRS constituent-structure local CATEGORY category CONTENT content CONTEXT context category head SUBCAT list(synsem) MARKING marking 15 / 31 16 / 31

Motivating SUBCAT to Syntactic category information to (1) a. I laugh. (<NP>) b. I saw him. (<NP NP>) c. I give her the book. (<NP NP NP>) d. I said that she left. (<NP S[that]>) Cannot always be derived from semantics: (2) a. Paul ate a steak. (<NP>) b. Paul ate. (<NP NP>) (3) a. Paul devoured a steak. (<NP>) b. * Paul devoured (<NP NP>) [ ] functional SPEC synsem head marker determiner adjective VFORM vform AUX boolean INV boolean substantive PRD boolean MOD mod-synsem [ ] [ ] noun preposition... CASE case PFORM pform 17 / 31 18 / 31 Properties of particular categories to Motivating VFORM to vform (4) a. Peter will win the race. (base form) b. * Peter will won the race. c. * Peter will to win the race. finite infinitive base gerund present-part. past-part. passive-part. case pform (5) a. Peter has won the race. (past participle) b. * Peter has win the race. c. Peter has to win the race. ( different ) nominative accusative of to... (6) a. Peter seems to win the race. (to-infinitive) b. * Peter seems win the race. c. * Peter seems won the race. 19 / 31 20 / 31

Motivating CASE to Indices to (7) a. He left. (nom) b. * Him left. index PERSON person NUMBER number GENDER gender (8) a. She sees him. (acc) b. * She sees he. person referential there it number gender first second third singular plural masculine feminine neuter 21 / 31 22 / 31 Semantic representations to Auxiliary data structures to content quant [ ] laugh give LAUGHER ref GIVER ref GIVEN ref GIFT ref psoa nom-obj INDEX index RESTRICTION set(psoa) drink think DRINKER ref THINKER ref DRUNKEN ref THOUGHT psoa boolean true false list... empty-list non-empty-list FIRST REST list Alternative names for the attributes FIRST (FT) and REST (RT) of non-empty-list are (HD) and TAIL (TL). 23 / 31 24 / 31

Abbreviations for describing lists to Abbreviations for common AVMs to empty-list is abbreviated as e-list, <> non-empty-list is abbreviated as ne-list [ ] FIRST 1 REST 2... 1 FIRST 1 REST [ FIRST 2 REST 3 ] is abbreviated as is abbreviated as is abbreviated as 1 2... 1 1, 2 3 Pollard and Sag (1994) make use of the following abbreviations for describing synsem objects: Abbrev. abbreviated AVM synsem [ ] noun NP 1 CATEGORY LOCAL SUBCAT CONTENT INDEX 1 synsem [ ] S: 1 CATEGORY LOCAL SUBCAT CONTENT 1 synsem VP: 1 CATEGORY LOCAL SUBCAT synsem CONTENT 1 25 / 31 26 / 31 The Lexicon to An example lexicon to The basic lexicon is defined by the Word Principle as part of the theory. It is an implicational statement defining which of the ontologically possible words are grammatical: word lexical-entry 1 lexical-entry 2... with each of the lexical entries being descriptions, such as e.g.: word PHON <laughs> [ ] CAT VFORM fin SYNSEM LOC SUBCAT NP[nom] 1 [3rd,sing] [ ] laugh CONTENT LAUGHER 1 word PHON <drinks> [ ] CAT VFORM fin SUBCAT NP[nom] S L 1 [3rd,sing], NP[acc] 2 drink CONT DRINKER 1 DRUNKEN 2 PHON <drink> [ ] CAT VFORM fin SUBCAT NP[nom] S L 1 [plur], NP[acc] 2 drink CONT DRINKER 1 DRUNKEN 2 27 / 31 28 / 31

to to PHON <give> [ ] VFORM fin CAT NP[nom] 1 [plur], NP[acc], 2 SUBCAT PP[to] S L 3 give GIVER 1 CONT GIFT 2 GIVEN 3 PHON <to> [ ] preposition CAT PFORM to S L SUBCAT NP[acc] 1 [ ] CONT INDEX 1 PHON <think> [ ] CAT VFORM fin SUBCAT NP[nom] S L 1 [plur], S[fin]: 2 think CONT THINKER 1 THOUGHT 2 29 / 31 30 / 31 to References to PHON <poets> [ ] noun CAT SUBCAT SYNSEM LOC [ PER third CONT INDEX ] NUM plur Bresnan, J. (ed.) (1982). The Mental Representation of Grammatical Relations. Cambridge, MA: MIT Press. Chomsky, N. (1981). Lectures on Government and Binding. Dordrecht: Foris Publications. Gazdar, G., E. Klein, G. K. Pullum & I. A. Sag (1985). Generalized Phrase Structure Grammar. Cambridge, MA: Harvard University Press. Pollard, C. & I. A. Sag (1987). Information-based Syntax and Semantics, Vol. 1: Fundamentals. No. 13 in CSLI Lecture Notes. Stanford, CA: CSLI Publications. Pollard, C. & I. A. Sag (1994). Head-Driven Phrase Structure Grammar. Chicago, IL: University of Chicago Press. PHON <wine> [ ] noun CAT SUBCAT SYNSEM LOC [ PER third CONT INDEX ] NUM sing 31 / 31 31 / 31