Grammar Engineering March 29, 2004 Introduction, overview, HPSG basics

Similar documents
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Building an HPSG-based Indonesian Resource Grammar (INDRA)

Control and Boundedness

An Interactive Intelligent Language Tutor Over The Internet

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Chapter 4: Valence & Agreement CSLI Publications

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Proof Theory for Syntacticians

Constraining X-Bar: Theta Theory

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Adapting Stochastic Output for Rule-Based Semantics

CS 598 Natural Language Processing

Context Free Grammars. Many slides from Michael Collins

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Compositional Semantics

Hindi Aspectual Verb Complexes

Some Principles of Automated Natural Language Information Extraction

Argument structure and theta roles

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Prediction of Maximal Projection for Semantic Role Labeling

CS Machine Learning

AQUA: An Ontology-Driven Question Answering System

Using dialogue context to improve parsing performance in dialogue systems

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

Feature-Based Grammar

Underlying and Surface Grammatical Relations in Greek consider

Grammars & Parsing, Part 1:

"f TOPIC =T COMP COMP... OBJ

Modeling full form lexica for Arabic

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

LING 329 : MORPHOLOGY

Parsing of part-of-speech tagged Assamese Texts

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Pre-Processing MRSes

An Introduction to the Minimalist Program

Constructions with Lexical Integrity *

Natural Language Processing. George Konidaris

The Interface between Phrasal and Functional Constraints

Heads and history NIGEL VINCENT & KERSTI BÖRJARS The University of Manchester

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

Accurate Unlexicalized Parsing for Modern Hebrew

Developing a TT-MCTAG for German with an RCG-based Parser

A relational approach to translation

Cross Language Information Retrieval

Interfacing Phonology with LFG

Formulaic Language and Fluency: ESL Teaching Applications

Update on Soar-based language processing

Beyond the Pipeline: Discrete Optimization in NLP

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

A Computational Evaluation of Case-Assignment Algorithms

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Chapter 3: Semi-lexical categories. nor truly functional. As Corver and van Riemsdijk rightly point out, There is more

A First-Pass Approach for Evaluating Machine Translation Systems

A Framework for Customizable Generation of Hypertext Presentations

Multiple case assignment and the English pseudo-passive *

Unit 1: Scientific Investigation-Asking Questions

Can Human Verb Associations help identify Salient Features for Semantic Verb Classification?

Type-driven semantic interpretation and feature dependencies in R-LFG

Developing Grammar in Context

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment

Implementing the Syntax of Japanese Numeral Classifiers

Authors note Chapter One Why Simpler Syntax? 1.1. Different notions of simplicity

Construction Grammar. Laura A. Michaelis.

Chapter 9 Banked gap-filling

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

On the Notion Determiner

Sample Problems for MATH 5001, University of Georgia

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

cmp-lg/ Jul 1995

EAGLE: an Error-Annotated Corpus of Beginning Learner German

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

How long did... Who did... Where was... When did... How did... Which did...

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

Switched Control and other 'uncontrolled' cases of obligatory control

LFG Semantics via Constraints

The building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective

Language acquisition: acquiring some aspects of syntax.

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

Words come in categories

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

The Verbmobil Semantic Database. Humboldt{Univ. zu Berlin. Computerlinguistik. Abstract

Construction Grammar. University of Jena.

SEMAFOR: Frame Argument Resolution with Log-Linear Models

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

Specifying Logic Programs in Controlled Natural Language

The Smart/Empire TIPSTER IR System

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

California Department of Education English Language Development Standards for Grade 8

Specifying a shallow grammatical for parsing purposes

Transcription:

Grammar Engineering March 29, 2004 Introduction, overview, HPSG basics

Overview The BIG Picture The LinGO Grammar Matrix Course requirements/workflow Pick a language, any language HPSG basics Other approaches

The BIG Picture: Precision Grammars relate surface strings to semantic representations distinguish grammatical from ungrammatical sentences knowledge engineering approach to parsing can be used for both parsing and generation

The BIG Picture: Applications language documentation/linguistic hypothesis testing machine translation automated email response augmentative and assistive communication computer assisted language learning IR (from structured or unstructured data)...

The BIG Picture: Hybrid approaches (1/2) Naturally occurring language is noisy Typos mark-up Addresses & other non-linguistic strings False starts Hesitations... Allowing for the noise within the grammar would reduce its precision And then there s ambiguity, unknown words,...

The BIG Picture: Hybrid approaches (2/2) Combine symbolic (aka deep) and stochastic (aka shallow) approaches: Statistical parse selection (Statistical) named entity recognition and POS tagging in a preprocessing step (for unknown word handling) Tiered systems with a shallow parser as a fall back for the precision parser Coming the other direction, deep grammars can provide richer linguistic resources for training statistical systems (e.g., MT systems).

The LinGO Grammar Matrix (1/3) One of the primary impediments to deploying precision grammars is that they are expensive to build. The Grammar Matrix aims to address this by providing a starter-kit which allows for quick initial development while supporting long-term expansion. The Grammar Matrix also represents a set of hypotheses about cross-linguistic universals.

The LinGO Grammar Matrix (2/3) A sampling of hypotheses: Words and phrases combine to make larger phrases. The semantics of a phrase is determined by the words in the phrase and how they are put together (Frege). Some rules for phrases add semantics, and some don t. Most phrases have an identifiable head daughter.

The LinGO Grammar Matrix (3/3) More hypotheses: Heads determine which types of arguments they require, and how they combine semantically with those arguments. Modifiers determine which kinds of heads they modify, and how they combine semantically with those heads. No lexical or syntactic rule can remove semantic information.

Course requirements/workflow (1/2) Over 9 weekly lab exercises, each student will build a Matrix-based grammar of a different language. On Mondays, I ll announce what you need to have prepared in order to do the Wednesday lab. Class time on Wednesdays will be lab time, to start each exercise. Labs are due (submitted via E-Submit) notionally on Fridays, effectively by midnight Sunday night.

Course requirements/workflow (2/2) Make use of EPost! There are no required readings, but if you do not have a strong background in syntax, I strongly recommend Sag et al 2003. Copestake 2002 provides an extensive introduction to the LKB. http://courses.washington.edu/ling471

Pick a language, any language (1/2) Each student must pick a different language. No English. Undergrads have priority for languages they already know.

Pick a language, any language (2/2) Languages with non-latin alphabets will need to be done in translation (sorry) Languages with complex morphophonology might require some fudging (sorry again) If you aren t working on a language you already know, pick a language with a good descriptive or teaching grammar available.

HPSG Basics Context-free(-like) grammar Feature structures Multiple inheritance type hierarchy Unification Rich lexical entries Constructions

CF(-like)G S NP VP S NP VP Problems: Quickly get too many rules (try dealing with case, subcategorization, and agreement...) Unconstrained: why not write rules like D NP S? Loss of generality: what do intrans-sg-v and ditrans-pl-v have in common?

Solution: Add features Same idea of rewrite rules, but the labels on the nodes are now bundles of information, expressed as feature value pairs. Underspecification: Only specify those features that you care about. (e.g., the VP rule doesn t care about the number value of NP objects). Capture generalizations: all verbs are [HEAD verb], regardless of their agreement properties, transitivity, etc. Allow values to be feature structures (and lists of feature structures) and the rules become quite simple.

Multiple inheritance type hierarchy A type hierarchy...... states what kinds of objects we claim exist (the types).... organizes the objects hierarchically into classes with shared properties (the IST relations).... states what general properties each kind of object has (the feature and feature value declarations).

Technical note: Types v. instances The LKB distinguishes between types and instances. Instances are the maximally specific items in the hierarchy which the parser/generator can use in processing sentences. Types are used in the definition of instances. Types can have multiple parents. Instances can only have one parent.

Unification Phrase structure rules provide some information about the phrases they build. The words (or phrases) that combine as the daughters of those phrase structure rules provide more. How to combine that information? Unification, which we ll come back to below.

A Pizza Type Hierarchy pizza-thing pizza CRUST, TOPPINGS topping-set OLIVES, ONIONS, MUSHROOMS vegetarian non-vegetarian SAUSAGE, PEPPERONI, HAM

TYPE FEATURES/VALUES IST pizza-thing pizza CRUST thick, thin, stuffed TOPPINGS topping-set pizza-thing topping-set OLIVES, ONIONS, MUSHROOMS, pizza-thing vegetarian topping-set non-vegetarian SAUSAGE, PEPPERONI, BBQ CHICKEN, topping-set

pizza-thing := *top*. pizza := pizza-thing & [ CRUST crust, TOPPINGS topping-set ]. crust := *top*. thick := crust. thin := crust. stuffed := crust. topping-set := pizza-thing & [ OLIVES bool, ONIONS bool, MUSHROOMS bool ]...

Unification pizza CRUST thick TOPPINGS OLIVES HAM pizza TOPPINGS OLIVES ONIONS

Unification pizza CRUST thick OLIVES TOPPINGS ONIONS HAM

Unification pizza CRUST thick TOPPINGS OLIVES HAM pizza CRUST thin TOPPINGS OLIVES ONIONS

Unification

Unification pizza CRUST TOPPINGS thick OLIVES HAM pizza CRUST TOPPINGS thick vegetarian

Unification

Unification pizza CRUST TOPPINGS thick OLIVES HAM pizza CRUST TOPPINGS thick vegetarian

Unification

A Pizza Type Hierarchy pizza-thing pizza CRUST, TOPPINGS topping-set OLIVES, ONIONS, MUSHROOMS vegetarian non-vegetarian SAUSAGE, PEPPERONI, HAM

A New Theory of Pizzas pizza : CRUST ONE-HALF OTHER-HALF thick, thin, stuffed topping-set topping-set

pizza ONE-HALF Unification pizza ONIONS OLIVES OTHER-HALF ONIONS OLIVES

Unification pizza ONE-HALF ONIONS OLIVES OTHER-HALF ONIONS OLIVES

Identity Constraints (Tags) pizza CRUST thin ONE-HALF OLIVES ONIONS OTHER-HALF OLIVES ONIONS

pizza ONE-HALF OTHER-HALF Unification ONIONS OLIVES pizza OTHER-HALF MSHRMS OLIVES

Unification pizza ONE-HALF OTHER-HALF ONIONS OLIVES MUSHROOMS

Unification pizza ONE-HALF OTHER-HALF ONIONS OLIVES MUSHROOMS

pizza ONE-HALF OTHER-HALF Unification ONIONS OLIVES vegetarian pizza ONE-HALF SAUSAGE HAM

Unification

Rich lexical entries (1/2) In HPSG/Matrix grammars, most of the information is encoded in the lexicon. The type hierarchy serves as a means of capturing generalizations across that information. Lexical item specify their orthography, part of speech, agreement information, valence requirements, semantic contribution, and argument linking.

Rich lexical entries (2/2) Most of that information is stated on various supertypes, so that an actual lexical entry (instance) specifies only its lexical type, orthography, and key relation. Lexical rules relate base lexical entries to other lexical entries (e.g., plural nouns, passive verbs...).

Constructions (1/2) A few very general phrase structure rules do most of the work. head-specifier head-complement head-subject head-filler head-modifier

Constructions (2/2) We also find that some mildly and some extremely quirky constructions require their own special rules. relative clauses (of various sorts) just because... doesn t mean noun noun compounds appositives... The ERG currently has 105 syntactic constructions.

HPSG Basics Context-free(-like) grammar Feature structures Multiple inheritance type hierarchy Unification Rich lexical entries Constructions

Other approaches The LinGO consortium specializes in large HPSG grammars. Other broad-coverage precision grammars have been built in/by/with: LFG (ParGram: Butt et al 1999) F/XTAG (Doran et al 1994) ALE/Controll (Götz & Meurers 1997) Proprietary formalisms at Microsoft and Boeing.

Bring for next time Your choice of language A transitive verb An intransitive verb Two nouns Determiners or particles required in NPs (as appropriate) An understanding of the basics of case and agreement in your language Knowledge of how to use emacs.

Overview The BIG Picture The LinGO Grammar Matrix Course requirements/workflow Pick a language, any language HPSG basics Other approaches