Ling 566 Sept 28, 2017

Similar documents
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

CS 598 Natural Language Processing

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Proof Theory for Syntacticians

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Grammars & Parsing, Part 1:

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Parsing of part-of-speech tagged Assamese Texts

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Natural Language Processing. George Konidaris

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Organizing Comprehensive Literacy Assessment: How to Get Started

A Grammar for Battle Management Language

Context Free Grammars. Many slides from Michael Collins

Construction Grammar. University of Jena.

Chapter 4: Valence & Agreement CSLI Publications

Developing Grammar in Context

The History of Language Teaching

Part I. Figuring out how English works

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

IN THIS UNIT YOU LEARN HOW TO: SPEAKING 1 Work in pairs. Discuss the questions. 2 Work with a new partner. Discuss the questions.

Psychology and Language

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Ch VI- SENTENCE PATTERNS.

Compositional Semantics

Information for Candidates

Argument structure and theta roles

Language acquisition: acquiring some aspects of syntax.

An Introduction to the Minimalist Program

Multiple case assignment and the English pseudo-passive *

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

NAME: East Carolina University PSYC Developmental Psychology Dr. Eppler & Dr. Ironsmith

LING 329 : MORPHOLOGY

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

Loughton School s curriculum evening. 28 th February 2017

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Constraining X-Bar: Theta Theory

Content Language Objectives (CLOs) August 2012, H. Butts & G. De Anda

PRODUCT PLATFORM DESIGN: A GRAPH GRAMMAR APPROACH

Minimalism is the name of the predominant approach in generative linguistics today. It was first

English Language and Applied Linguistics. Module Descriptions 2017/18

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

"f TOPIC =T COMP COMP... OBJ

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Unit 8 Pronoun References

Some Principles of Automated Natural Language Information Extraction

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

The suffix -able means "able to be." Adding the suffix -able to verbs turns the verbs into adjectives. chewable enjoyable

Procedia - Social and Behavioral Sciences 154 ( 2014 )

Pseudo-Passives as Adjectival Passives

L1 and L2 acquisition. Holger Diessel

The Strong Minimalist Thesis and Bounded Optimality

Sight Word Assessment

BASIC ENGLISH. Book GRAMMAR

Advanced Grammar in Use

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Kindergarten Lessons for Unit 7: On The Move Me on the Map By Joan Sweeney

Lower and Upper Secondary

Participate in expanded conversations and respond appropriately to a variety of conversational prompts

Making Sales Calls. Watertown High School, Watertown, Massachusetts. 1 hour, 4 5 days per week

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

Guidelines for Writing an Internship Report

Houghton Mifflin Online Assessment System Walkthrough Guide

Derivational and Inflectional Morphemes in Pak-Pak Language

On the Notion Determiner

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

Name of Course: French 1 Middle School. Grade Level(s): 7 and 8 (half each) Unit 1

Prediction of Maximal Projection for Semantic Role Labeling

Underlying and Surface Grammatical Relations in Greek consider

The Interface between Phrasal and Functional Constraints

Virtually Anywhere Episodes 1 and 2. Teacher s Notes

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

CHAPTER IV RESEARCH FINDING AND DISCUSSION

First Grade Curriculum Highlights: In alignment with the Common Core Standards

Analysis of Probabilistic Parsing in NLP

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

Experience Corps. Mentor Toolkit

A Version Space Approach to Learning Context-free Grammars

Theoretical Syntax Winter Answers to practice problems

Facing our Fears: Reading and Writing about Characters in Literary Text

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

How to get the most out of EuroSTAR 2013

The Writing Process. The Academic Support Centre // September 2015

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

Prewriting: Drafting: Revising: Editing: Publishing:

Tutoring First-Year Writing Students at UNM

Aspectual Classes of Verb Phrases

Developing a TT-MCTAG for German with an RCG-based Parser

Chapter 4 - Fractions

An Interactive Intelligent Language Tutor Over The Internet

9.85 Cognition in Infancy and Early Childhood. Lecture 7: Number

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

PROGRAMME DE TRAVAIL INTERNE

Erkki Mäkinen State change languages as homomorphic images of Szilard languages

Transcription:

Ling 566 Sept 28, 2017 Introduction, organization, first attempts at a theory of grammar

Overview Syllabus Prescriptive/descriptive grammar; Competence/performance Some history Why study syntax? Two theories that won t work Start on CFG

But first... For ARC: Parrington Lawn https://www.washington.edu/uwem/plans-and-procedures/uw-emergencyprocedures/

UW Mass Assembly Areas (Updated December 2014) 1 4 2 3 6 5 7 1. Archery Range 2. Parrington Lawn 3. Denny Yard 4. Denny Field 5. Fisheries Lawn 6. HUB Yard 7. Rainier Vista 8. South of Stadium 8

Syllabus http://courses.washington.edu/ling566

The winning strategy Work together: make study groups Homework: Discuss as much as you want, write up your own answers Exams: No discussion Post to Canvas discussions Read the book before class (and after again, if necessary) Ask questions... early and often!

Resources Glossary at back of textbook Bender 2013 ( 100 things ) Grammar summaries and Appendix A Answers to exercises at back of book Canvas, study groups, office hours...

Two Conceptions of Grammar PRESCRIPTIVE Rules against certain usages. Few if any rules for what is allowed Proscribed forms generally in use Explicitly normative enterprise DESCRIPTIVE Rules characterizing what people do say Goal to characterize all and only what speakers find acceptable Tries to be scientific

Uses of Grammar PRESCRIPTIVE Identify speaker s socioeconomic class & education level Identify level of formality of a particular usage DESCRIPTIVE Understand how people produce & understand language Identify similarities & differences across languages Development of language technologies

Prescriptive grammar Examples of silly prescriptive rules? Examples of useful prescriptive rules? Compling applications which might need to encode prescriptive rules?

Fill in the blanks: he/his, they/their, or something else? Everyone insisted that record was unblemished. Everyone drives own car to work. Everyone was happy because passed the test. Everyone left the room, didn t? Everyone left early. seemed happy to get home.

Descriptive Grammar: an example F--- yourself! Go f--- yourself! F--- you! *Go f--- you! Who taught you this? How did you learn it?

Kinds of Things We ll Worry About Where to use reflexives (e.g. myself) vs. ordinary pronouns (I or me) Agreement (e.g. We sing vs. *We sings) Word order (e.g. *Sing we) Case (e.g. *Us sing) Coordinate conjunction (e.g. We sing and dance) How to form questions, imperatives, negatives and much more

Competence vs. Performance The Distinction Competence - knowledge of language Performance - how the knowledge is used Examples That Sandy left bothered me. That that Sandy left bothered me bothered Kim That that that Sandy left bothered me bothered Kim bothered Bo The horse raced past the barn fell

Competence v. Performance You are what you eat You are what what you eat eats, too You are what what what you eat eats eats, too

Acceptability vs. grammaticality A sentence is acceptable if native speakers say it sounds good. A sentence is grammatical (with respect to a particular grammar) if the grammar licenses it. Linguists are sometimes sloppy about the difference.

Some History Writings on grammar go back at least 3000 years Until 200 years ago, almost all of it was prescriptive Until ~60 years ago, most linguistic work concerned sound systems (phonology), word structure (morphology), and the historical relationships among languages

The Generative Revolution Noam Chomsky s work in the 1950s radically changed linguistics, making syntax central. Chomsky has been the dominant figure in linguistics ever since. The theory we will develop is in the tradition started by Chomsky, but diverges from his work in many ways.

Main Tenets of Generative Grammar Grammars should be formulated precisely and explicitly. Languages are infinite, so grammars must be tested against invented data, not just attested examples. The theory of grammar is a theory of human linguistic abilities.

Some of Chomsky s Controversial Claims The superficial diversity of human languages masks their underlying similarity. All languages are fundamentally alike because linguistic knowledge is largely innate. The central problem for linguistics is explaining how children can learn language so quickly and easily.

Family Tree of Syntactic Theories Early Transformational Grammar (1955-1964) Standard Theory TG (1964-1967) EST (1967-1977) Generative Semantics (1966-1975) REST (1977-1981) GPSG (1979-1985) Realistic TG (1978-1980) RG (1974-present) GB (1981-1993) HPSG (1986-present) LFG (1980-present) APG (1980) MP (1993-present)

Why Study Syntax? Why should linguists study syntax? Why should computational linguists study syntax? Should anyone else study syntax? Why? Why are you studying syntax?

Insufficient Theory #1 A grammar is simply a list of sentences. What s wrong with this?

Insufficient Theory #2: FSMs the noisy dogs left D A N V the noisy dogs chased the innocent cats D A N V D A N a* = {ø, a, aa, aaa, aaaa,... } a + = {a, aa, aaa, aaaa,... } (D) A* N V ((D) A* N)

A Finite State Machine D N V D N A V A V

FSMs for Grammar, cont Why are FSMs insufficient as a representation of natural language syntax? How might they be useful anyway?

Chomsky Hierarchy Type 0 Languages Context-Sensitive Languages Context-Free Languages Regular Languages

Context-Free Grammar A quadruple: <C,Σ,P,S > C: set of categories Σ: set of terminals (vocabulary) P: set of rewrite rules α β 1,β 2,...,β n S in C: start symbol For each rule α β 1, β 2,...,β n P α C; β i C Σ; 1 i n

A Toy Grammar RULES S NP VP PP NP VP (D) A* N PP* V (NP) (PP) P NP LEXICON D: the, some A: big, brown, old N: birds, fleas, dog, hunter, I V: attack, ate, watched P: for, beside, with

Structural Ambiguity I saw the astronomer with the telescope.

Structure 1: PP under VP S NP VP N V NP PP I saw D N P NP the astronomer with D N the telescope

Structure 1: PP under NP S NP VP N V NP I saw D N PP the astronomer P NP with D N the telescope

Recurrent Patterns Constituency Tests The quick brown fox with the bushy tail jumped over the lazy brown dog with one ear. Coordination The quick brown fox with the bushy tail and the lazy brown dog with one ear are friends. Sentence-initial position The election of 2000, everyone will remember for a long time. Cleft sentences It was a book about syntax they were reading.

General Types of Constituency Tests Distributional Intonational Semantic Psycholinguistic... but they don t always agree.

Central claims implicit in CFG formalism: 1. Parts of sentences (larger than single words) are linguistically significant units, i.e. phrases play a role in determining meaning, pronunciation, and/or the acceptability of sentences. 2. Phrases are contiguous portions of a sentence (no discontinuous constituents). 3. Two phrases are either disjoint or one fully contains the other (no partially overlapping constituents). 4. What a phrase can consist of depends only on what kind of a phrase it is (that is, the label on its top node), not on what appears around it.

Claims 1-3 characterize what is called phrase structure grammar Claim 4 (that the internal structure of a phrase depends only on what type of phrase it is, not on where it appears) is what makes it context-free. There is another kind of phrase structure grammar called context-sensitive grammar (CSG) that gives up 4. That is, it allows the applicability of a grammar rule to depend on what is in the neighboring environment. So rules can have the form A X, in the context of Y_Z.

Possible Counterexamples To Claim 2 (no discontinuous constituents): A technician arrived who could solve the problem. To Claim 3 (no overlapping constituents): I read what was written about me. To Claim 4 (context independence): - He arrives this morning. - *He arrive this morning. - *They arrives this morning.

A Trivial CFG S NP VP NP VP D N V NP D: the V: chased N: dog, cat

Trees and Rules C 0 C 1... C n. is a well-formed nonlexical tree if (and only if). C n,...,c n are well-formed trees, and.. C 0 C 1...Cn is a grammar rule.

Bottom-up Tree Construction D: the V: chased N: dog, cat D V N N the chased dog cat

NP D N VP V NP NP NP VP D N D N V NP the dog the cat chased D N the cat

S NP VP S NP VP D N V NP the dog chased D N the cat

Top-down Tree Construction S NP VP NP D N VP V NP S NP VP NP VP D (twice) N V NP

S NP VP D N V NP D N

D V N N the chased dog cat

S NP VP D N V NP the dog chased D N the cat

Weaknesses of CFG (atomic node labels) It doesn t tell us what constitutes a linguistically natural rule Rules get very cumbersome once we try to deal with things like agreement and transitivity. It has been argued that certain languages (notably Swiss German and Bambara) contain constructions that are provably beyond the descriptive capacity of CFG. VP PNP NP VP S

On the other hand... It s a simple formalism that can generate infinite languages and assign linguistically plausible structures to them. Linguistic constructions that are beyond the descriptive power of CFG are rare. It s computationally tractable and techniques for processing CFGs are well understood.

So... CFG has been the starting point for most types of generative grammar. The theory we develop in this course is an extension of CFG.

Overview Syllabus Prescriptive/descriptive grammar; Competence/performance Some history Why study syntax? Two theories that won t work Start on CFG