POS tagging CMSC 723 / LING 723 / INST 725. Marine Carpuat

Parts of Speech
- Equivalence class of linguistic entities
  - Categories or types of words
- Study dates back to the ancient Greeks
  - Dionysius Thrax of Alexandria (c. 100 BC)
  - 8 parts of speech: noun, verb, pronoun, preposition, adverb, conjunction, participle, article
  - Remarkably enduring list!

How can we define POS?
- By meaning?
  - Verbs are actions
  - Adjectives are properties
  - Nouns are things
- By the syntactic environment
  - What occurs nearby?
  - What does it act as?
- By what morphological processes affect it
  - What affixes does it take?
- Typically a combination of syntax + morphology

Parts of Speech
- Open class
  - Impossible to completely enumerate
  - New words continuously being invented, borrowed, etc.
- Closed class
  - Closed, fixed membership
  - Reasonably easy to enumerate
  - Generally, short function words that structure sentences

Open Class POS
- Four major open classes in English
  - Nouns
  - Verbs
  - Adjectives
  - Adverbs
- All languages have nouns and verbs... but may not have the other two

Nouns
- Open class
  - New inventions all the time: muggle, webinar, ...
- Semantics
  - Generally, words for people, places, things
  - But not always (bandwidth, energy, ...)
- Syntactic environment
  - Occurring with determiners
  - Pluralizable, possessivizable
- Other characteristics
  - Mass vs. count nouns

Verbs
- Open class
  - New inventions all the time: google, tweet, ...
- Semantics
  - Generally, denote actions, processes, etc.
- Syntactic environment
  - E.g., intransitive, transitive
- Other characteristics
  - Main vs. auxiliary verbs
  - Gerunds (verbs behaving like nouns)
  - Participles (verbs behaving like adjectives)

Adjectives and Adverbs
- Adjectives
  - Generally modify nouns, e.g., tall girl
- Adverbs
  - A semantic and formal hodge-podge
  - Sometimes modify verbs, e.g., sang beautifully
  - Sometimes modify adjectives, e.g., extremely hot

Closed Class POS
- Prepositions
  - In English, occurring before noun phrases
  - Specifying some type of relation (spatial, temporal, ...)
  - Examples: on the shelf, before noon
- Particles
  - Resemble prepositions, but are used with a verb ("phrasal verbs")
  - Examples: find out, turn over, go on

Particle vs. Prepositions
- He came by the office in a hurry (by = preposition)
- He came by his fortune honestly (by = particle)
- We ran up the phone bill (up = particle)
- We ran up the small hill (up = preposition)
- He lived down the block (down = preposition)
- He never lived down the nicknames (down = particle)

More Closed Class POS
- Determiners
  - Establish reference for a noun
  - Examples: a, an, the (articles), that, this, many, such, ...
- Pronouns
  - Refer to persons or entities: he, she, it
  - Possessive pronouns: his, her, its
  - Wh-pronouns: what, who

Closed Class POS: Conjunctions
- Coordinating conjunctions
  - Join two elements of equal status
  - Examples: cats and dogs, salad or soup
- Subordinating conjunctions
  - Join two elements of unequal status
  - Examples: We'll leave after you finish eating. While I was waiting in line, I saw my friend.
  - Complementizers are a special case: I think that you should finish your assignment

Beyond English
- Chinese
  - No verb/adjective distinction!
  - 漂亮: beautiful / to be beautiful
- Riau Indonesian/Malay
  - No articles
  - No tense marking
  - 3rd person pronouns neutral to both gender and number
  - No features distinguishing verbs from nouns
- Example: Ayam (chicken) Makan (eat) can mean any of
  - The chicken is eating
  - The chicken ate
  - The chicken will eat
  - The chicken is being eaten
  - Where the chicken is eating
  - How the chicken is eating
  - Somebody is eating the chicken
  - The chicken that is eating

POS tagging

POS Tagging: What's the task?
- Process of assigning part-of-speech tags to words
- But what tags are we going to assign?
  - Coarse grained: noun, verb, adjective, adverb, ...
  - Fine grained: {proper, common} noun
  - Even finer-grained: {proper, common} noun ± animate
- Important issues to remember
  - Choice of tags encodes certain distinctions/non-distinctions
  - Tagsets will differ across languages!
- For English, Penn Treebank is the most common tagset

Penn Treebank Tagset: 45 Tags

Penn Treebank Tagset: Choices
- Example: The/DT grand/JJ jury/NN commented/VBD on/IN a/DT number/NN of/IN other/JJ topics/NNS ./.
- Distinctions and non-distinctions
  - Prepositions and subordinating conjunctions are both tagged IN ("Although/IN I/PRP ...")
  - Except the preposition/complementizer "to", which is tagged TO
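
The word/TAG notation in the example is easy to read programmatically. The following is a minimal sketch, assuming whitespace-separated tokens and a cleaned-up copy of the slide's sentence; `parse_tagged` is a hypothetical helper name, not part of any Penn Treebank tooling.

```python
def parse_tagged(sentence):
    """Split whitespace-separated 'word/TAG' tokens into (word, tag) pairs."""
    pairs = []
    for token in sentence.split():
        # rpartition splits on the LAST slash, so a token such as './.'
        # or a word that itself contains '/' is still handled.
        word, _, tag = token.rpartition("/")
        pairs.append((word, tag))
    return pairs


example = ("The/DT grand/JJ jury/NN commented/VBD on/IN a/DT "
           "number/NN of/IN other/JJ topics/NNS ./.")
pairs = parse_tagged(example)
print(pairs[:3])
```

Splitting on the last slash rather than the first is a design choice made here so that the sentence-final period token keeps both its word and its tag.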

Why do POS tagging?
- One of the most basic NLP tasks
  - Nicely illustrates principles of statistical NLP
- Useful for higher-level analysis
  - Needed for syntactic analysis
  - Needed for semantic analysis
- Sample applications that require POS tagging
  - Machine translation
  - Information extraction
  - Lots more

Try your hand at tagging
- The back door
- On my back
- Win the voters back
- Promised to back the bill

Try your hand at tagging
- I hope that she wins
- That day was nice
- You can go that far

Why is POS tagging hard?
- Ambiguity!
- Ambiguity in English
  - 11.5% of word types ambiguous in Brown corpus
  - 40% of word tokens ambiguous in Brown corpus
  - Annotator disagreement in Penn Treebank: 3.5%
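
The gap between the type-level and token-level figures comes from the fact that ambiguous words tend to be frequent ones. The sketch below illustrates how the two rates are computed. The tiny hand-tagged corpus is hypothetical (built from the "back" examples, not taken from the Brown corpus), and `ambiguity_rates` is an illustrative name.

```python
from collections import defaultdict

# Toy hand-tagged corpus: an assumption for illustration only.
corpus = [
    ("the", "DT"), ("back", "JJ"), ("door", "NN"),
    ("on", "IN"), ("my", "PRP$"), ("back", "NN"),
    ("promised", "VBD"), ("to", "TO"), ("back", "VB"),
    ("the", "DT"), ("bill", "NN"),
]


def ambiguity_rates(tagged_tokens):
    """Return (fraction of ambiguous word types, fraction of ambiguous tokens)."""
    tags_seen = defaultdict(set)
    for word, tag in tagged_tokens:
        tags_seen[word].add(tag)
    # A word type is ambiguous if it was observed with more than one tag.
    ambiguous = {w for w, tags in tags_seen.items() if len(tags) > 1}
    type_rate = len(ambiguous) / len(tags_seen)
    n_ambiguous_tokens = sum(1 for w, _ in tagged_tokens if w in ambiguous)
    token_rate = n_ambiguous_tokens / len(tagged_tokens)
    return type_rate, token_rate


type_rate, token_rate = ambiguity_rates(corpus)
print(f"ambiguous types: {type_rate:.1%}, ambiguous tokens: {token_rate:.1%}")
```

In this toy corpus only one of eight word types is ambiguous, yet it accounts for three of eleven tokens, which mirrors the pattern reported for the Brown corpus.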

POS tagging: how to do it?
- Given Penn Treebank, how would you build a system that can POS tag new text?
- Baseline: pick most frequent tag for each word type
  - 90% accuracy if train+test sets are drawn from Penn Treebank
- Can we do better?
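
The most-frequent-tag baseline can be sketched in a few lines. This is a minimal illustration on a hypothetical toy training set, not the Penn Treebank. The fallback for unseen words (the overall most frequent tag) is an assumption, since the slide does not say how unknown words are handled.

```python
from collections import Counter, defaultdict


def train_baseline(tagged_sentences):
    """Record the most frequent tag for each word type seen in training."""
    counts = defaultdict(Counter)
    all_tags = Counter()
    for sentence in tagged_sentences:
        for word, tag in sentence:
            counts[word][tag] += 1
            all_tags[tag] += 1
    best = {w: c.most_common(1)[0][0] for w, c in counts.items()}
    # Assumption: unseen words get the globally most frequent tag.
    default_tag = all_tags.most_common(1)[0][0]
    return best, default_tag


def tag_baseline(words, best, default_tag):
    """Tag each word independently of its context."""
    return [best.get(w, default_tag) for w in words]


# Hypothetical training data for illustration.
train = [
    [("the", "DT"), ("back", "NN"), ("door", "NN")],
    [("my", "PRP$"), ("back", "NN")],
    [("back", "VB"), ("the", "DT"), ("bill", "NN")],
]
best, default_tag = train_baseline(train)
predicted = tag_baseline(["the", "back", "window"], best, default_tag)
print(predicted)
```

Because every word is tagged in isolation, this tagger assigns "back" the same tag in every context, which is exactly the weakness that sequence models try to address.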

How to POS tag automatically?

How can we POS tag automatically?
- POS tagging as multiclass classification
  - What is x? What is y?
- POS tagging as sequence labeling
  - Models sequences of predictions

Linear Models for Classification
- Feature function representation
- Weights

Multiclass perceptron
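
A multiclass perceptron over a joint feature function can be sketched as follows. The formulas from the slide did not survive transcription, so this follows the standard textbook update (add the features of the gold label, subtract those of the wrongly predicted one). The feature function `phi`, the feature strings, and the toy data are all hypothetical.

```python
def phi(x, y):
    """Hypothetical joint feature function: pair each input feature with label y."""
    return {(feat, y): 1.0 for feat in x}


def score(w, x, y):
    """Linear score: dot product of the weights with phi(x, y)."""
    return sum(w.get(f, 0.0) * v for f, v in phi(x, y).items())


def predict(w, x, labels):
    """Return the highest-scoring label (ties go to the earliest label)."""
    return max(labels, key=lambda y: score(w, x, y))


def train_perceptron(data, labels, epochs=5):
    w = {}
    for _ in range(epochs):
        for x, y in data:
            y_hat = predict(w, x, labels)
            if y_hat != y:
                # Promote the gold label's features, demote the predicted ones.
                for f, v in phi(x, y).items():
                    w[f] = w.get(f, 0.0) + v
                for f, v in phi(x, y_hat).items():
                    w[f] = w.get(f, 0.0) - v
    return w


labels = ["NN", "VB", "DT"]
data = [
    (["word=the"], "DT"),
    (["word=door", "prev=the"], "NN"),
    (["word=back", "prev=to"], "VB"),
    (["word=back", "prev=my"], "NN"),
]
w = train_perceptron(data, labels)
print([predict(w, x, labels) for x, _ in data])
```

Here each token is classified on its own, with context supplied only through input features such as the previous word. The sequence-labeling view instead scores whole tag sequences.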

POS tagging: sequence labeling with the perceptron
- Sequence labeling problem
  - Input: sequence of tokens x = [x_1, ..., x_K]
    - Variable length K
  - Output (aka label): sequence of tags y = [y_1, ..., y_K]
  - Size of output space?
- Structured perceptron
  - Perceptron algorithm can be used for sequence labeling
  - But there are challenges
    - How to compute argmax efficiently?
    - What are appropriate features?
  - Approach: leverage structure of output space
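
On the size of the output space: if there are L possible tags (a symbol introduced here, not in the slide) and K tokens, every token can take any tag, so there are L^K candidate labelings. The sketch below enumerates them for a hypothetical three-tag set to show why brute-force argmax does not scale.

```python
from itertools import product

# Hypothetical miniature tagset and sentence.
tags = ["DT", "NN", "VB"]
words = ["the", "back", "door"]

# Every assignment of one tag per token is a candidate output.
all_labelings = list(product(tags, repeat=len(words)))
count = len(all_labelings)
print(count, "labelings for", len(words), "tokens and", len(tags), "tags")

# With a 45-tag set and a 10-token sentence, enumeration is hopeless.
print(f"{45 ** 10:.1e} labelings for 10 tokens and 45 tags")
```

The count grows exponentially in the sentence length, which is why an efficient argmax has to exploit the structure of the output space.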

Feature functions for sequence labeling
- Example features?
  - Number of times "monsters" is tagged as noun
  - Number of times noun is followed by verb
  - Number of times "tasty" is tagged as verb
  - Number of times two verbs are adjacent

Feature functions for sequence labeling
- Standard features of POS tagging
  - Unary features: # times word w has been labeled with tag l, for all words w and all tags l
  - Markov features: # times tag l is adjacent to tag l' in the output, for all tags l and l'
- Size of feature representation is constant wrt input length
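
Unary and Markov features are simply counts over a (sentence, tag sequence) pair. The sketch below shows one way to compute them as a sparse counter. The function name `phi`, the feature keys, and the example sentence are illustrative assumptions.

```python
from collections import Counter


def phi(words, tags):
    """Sparse feature counts for one (sentence, tag sequence) pair."""
    feats = Counter()
    # Unary features: how often word w carries tag l.
    for w, t in zip(words, tags):
        feats[("unary", w, t)] += 1
    # Markov features: how often one tag is immediately followed by another.
    for prev, cur in zip(tags, tags[1:]):
        feats[("markov", prev, cur)] += 1
    return feats


# Hypothetical example sentence and tagging.
feats = phi(["monsters", "eat", "tasty", "bunnies"], ["NN", "VB", "JJ", "NN"])
for key, value in sorted(feats.items()):
    print(key, value)
```

The set of possible feature keys depends only on the vocabulary and the tagset (one key per word/tag pair plus one per tag pair), so the dimensionality does not change with sentence length. Only the counts do.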

Solving the argmax problem for sequences
- Efficient algorithms possible if the feature function decomposes over the input
- This holds for unary and Markov features

Solving the argmax problem for sequences
- Trellis for sequence labeling
  - Any path represents a labeling of the input sentence
  - Gold standard path in red
  - Each edge receives a weight such that adding weights along the path gives the score for the input/output configuration
  - Any max-weight path algorithm can find the argmax
    - e.g. the Viterbi algorithm, O(K L^2) for K tokens and L tags
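
The max-weight path search over the trellis can be sketched as a small Viterbi implementation. This is an illustration under stated assumptions, not the course's reference code: the weights are stored in two hypothetical dictionaries, `unary[(word, tag)]` and `markov[(prev_tag, tag)]`, with missing entries scoring 0, and the example weights are made up.

```python
def viterbi(words, tags, unary, markov):
    """Return the highest-scoring tag sequence under unary + Markov weights."""
    if not words:
        return []
    # best[i][t]: score of the best path that ends with tag t at position i.
    best = [{t: unary.get((words[0], t), 0.0) for t in tags}]
    back = []  # back[i - 1][t]: best previous tag when position i has tag t
    for i in range(1, len(words)):
        scores, pointers = {}, {}
        for t in tags:
            prev = max(tags,
                       key=lambda p: best[i - 1][p] + markov.get((p, t), 0.0))
            scores[t] = (best[i - 1][prev] + markov.get((prev, t), 0.0)
                         + unary.get((words[i], t), 0.0))
            pointers[t] = prev
        best.append(scores)
        back.append(pointers)
    # Follow back-pointers from the best final tag to recover the path.
    path = [max(tags, key=lambda t: best[-1][t])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


# Hypothetical weights for illustration.
tags = ["DT", "NN", "VB"]
unary = {("the", "DT"): 2.0, ("back", "NN"): 1.0,
         ("back", "VB"): 1.0, ("door", "NN"): 2.0}
markov = {("DT", "NN"): 1.0, ("NN", "NN"): 0.5, ("VB", "NN"): 0.2}
result = viterbi(["the", "back", "door"], tags, unary, markov)
print(result)
```

The three nested choices (position, current tag, previous tag) give a running time on the order of K times L squared for K tokens and L tags, compared with L^K for exhaustive enumeration.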
