Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments


Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments Kevin Gimpel, Nathan Schneider, Brendan O'Connor, Dipanjan Das, Daniel Mills, Jacob Eisenstein, Michael Heilman, Dani Yogatama, Jeffrey Flanigan, and Noah A. Smith

Why does this paper have so many authors?
Our goal: Build a Twitter part-of-speech tagger in one day.

Plan:
- Large team of annotators
- Simple, carefully designed annotation scheme
- Features leveraging existing resources (treebanks) and unannotated data

Outcome:
- Tag set for Twitter
- 1,827 annotated English tweets
- POS tagger with ~90% accuracy
- Didn't finish in a day, but took < 250 person-hours
- Available to download!

The Data

Tweets contain:
- non-standard spellings
- multi-word abbreviations
- hashtags
- Also: at-mentions, URLs, emoticons, symbols, typos, etc.

Tag Set

- Start with a coarse set of Penn Treebank tags
- Add Twitter-specific tags

Coarse treebank tags:
- common noun
- proper noun
- pronoun
- verb
- adjective
- adverb
- punctuation
- determiner
- preposition
- verb particle
- coordinating conjunction
- numeral
- interjection
- predeterminer / existential there

Penn Treebank tokenization is unsuitable for Twitter:
@user1 OMG ur from PA? i am too (: where abouts?
    ur = "you're": nominal+verbal
@user2 ima get me a flip phone for real
    ima = "I'm going to": nominal+verbal
Solution: Don't try to tokenize these. Instead, introduce compound tags.

Twitter-specific tags:
- hashtag
- at-mention
- URL / email address
- emoticon
- Twitter discourse marker
- other (multi-word abbreviations, symbols, garbage)

Hashtags
Twitter hashtags are sometimes used as ordinary words (35% of the time) and other times as topic markers:
Innovative, but traditional, too! Another fun one to watch on the #ipad! http://bit.ly/ @user1 #utcd2 #utpol #tcot
    #ipad: proper noun
    #utcd2 #utpol #tcot: hashtag
We only use the hashtag tag for topic markers.

Twitter Discourse Marker
Retweet construction:
RT @user1 : I never bought candy bars from those kids on my doorstep so I guess they're all in gangs now.
RT @user2 : LMBO! This man filed an EMERGENCY Motion for Continuance on account of the Rangers game tonight! Wow lmao
The retweet marker RT is tagged as a Twitter discourse marker.

Resulting tag set: 25 tags

Annotation

- 17 researchers from Carnegie Mellon
- Each spent 2-20 hours annotating
- Annotators corrected output of the Stanford tagger
- Penn Treebank consulted for difficult cases

- Two annotators corrected and standardized annotations from the original 17 annotators
- A third annotator tagged a sample of the tweets from scratch
- Inter-annotator agreement: 92.2%
- Cohen's kappa: 0.914
- One annotator made a single final pass through the data, correcting errors and improving consistency
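The agreement figures above combine raw agreement with Cohen's kappa, which discounts the agreement two annotators would reach by chance. A minimal sketch of the standard kappa computation follows; the function name and the toy tag sequences are illustrative assumptions, not the authors' data or code.

```python
from collections import Counter

def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same tokens."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(labels_a)
    # Observed agreement: fraction of tokens given the same tag.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each annotator's marginal tag frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    p_e = sum(freq_a[t] * freq_b[t] for t in freq_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both annotators used one identical tag throughout
    return (p_o - p_e) / (1 - p_e)

# Toy example with made-up tags (not the Twitter annotations).
ann1 = ["N", "V", "N", "D", "N", "V"]
ann2 = ["N", "V", "N", "D", "V", "V"]
print(round(cohen_kappa(ann1, ann2), 3))
```

With many tags and high raw agreement, kappa stays close to the raw agreement rate, which is consistent with the two numbers on the slide being near each other.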

Experiments

Experimental Setup
1,827 annotated tweets:
- 1,000 for training
- 327 for development
- 500 for testing (OOV rate: 30%)
Systems:
- Stanford tagger (retrained on our data)
- Our own baseline CRF tagger
- Our tagger augmented with Twitter-specific features
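The two quantities reported for the test set, out-of-vocabulary (OOV) rate and tagging accuracy, can be sketched as below. This assumes a token-level, case-sensitive OOV definition, which the slides do not specify; the function names and toy tokens are hypothetical.

```python
def oov_rate(train_tokens, test_tokens):
    """Fraction of test tokens never seen in training (assumed definition)."""
    vocab = set(train_tokens)
    unseen = sum(1 for tok in test_tokens if tok not in vocab)
    return unseen / len(test_tokens)

def accuracy(gold_tags, predicted_tags):
    """Fraction of tokens whose predicted tag matches the gold tag."""
    if len(gold_tags) != len(predicted_tags):
        raise ValueError("tag sequences must have equal length")
    correct = sum(g == p for g, p in zip(gold_tags, predicted_tags))
    return correct / len(gold_tags)

# Toy data, made up for illustration.
train = ["i", "am", "going", "to", "the", "game"]
test = ["ima", "go", "to", "the", "game", "lol"]
print(oov_rate(train, test))
print(accuracy(["N", "V", "N", "P"], ["N", "V", "V", "P"]))
```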

Results (bar chart, tagging accuracy in %)
- Stanford Tagger: 85.85
- Our tagger, base features: 83.38
- Our tagger, all features: 89.37
- Inter-annotator agreement: 92.2

Twitter Orthographic Features
Regular expressions to detect at-mentions, hashtags, and URLs
Accuracy (bar chart): 89.37 with these features; 1.0 points lower without them
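A minimal sketch of regular-expression features of this kind is shown below. The patterns and feature names are assumptions chosen for illustration; the slides do not give the expressions the tagger actually uses.

```python
import re

# Hypothetical patterns; real tweets may need broader coverage.
AT_MENTION = re.compile(r"^@\w+$")
HASHTAG = re.compile(r"^#\w+$")
URL = re.compile(r"^(?:https?://|www\.)\S+$", re.IGNORECASE)

def orthographic_features(token):
    """Return the names of the orthographic patterns a token matches."""
    feats = []
    if AT_MENTION.match(token):
        feats.append("is_at_mention")
    if HASHTAG.match(token):
        feats.append("is_hashtag")
    if URL.match(token):
        feats.append("is_url")
    return feats

for tok in ["@user1", "#ipad", "http://bit.ly/", "candy"]:
    print(tok, orthographic_features(tok))
```

Each feature would fire as a binary indicator on the token, letting the tagger learn that, for example, an at-mention pattern strongly predicts the at-mention tag.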

Distributional Similarity Features
Embeddings in a low-dimensional space based on neighboring words
Computed using 134k unannotated tweets
Accuracy (bar chart): 89.37 with these features; 1.06 points lower without them
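The idea behind neighbor-based representations can be sketched with raw context counts: words that share left and right neighbors get similar vectors. This sketch stops at sparse count vectors compared by cosine similarity and omits the projection to a low-dimensional space, whose method the slides do not specify. The function names and the three toy tweets are made up.

```python
import math
from collections import Counter, defaultdict

def context_vectors(sentences):
    """Count the left and right neighbor of every token occurrence."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        for i in range(1, len(padded) - 1):
            vectors[padded[i]]["L=" + padded[i - 1]] += 1
            vectors[padded[i]]["R=" + padded[i + 1]] += 1
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

# Toy unannotated "tweets" (illustrative only).
tweets = [
    "see you tomorrow".split(),
    "see u tomorrow".split(),
    "see you soon".split(),
]
vecs = context_vectors(tweets)
# "you" and "u" share neighbors, so they score higher than "you" and "see".
print(cosine(vecs["you"], vecs["u"]) > cosine(vecs["you"], vecs["see"]))
```

Because no labels are needed, vectors like these can be built from a large unannotated collection, which is how spelling variants unseen in the small annotated set can still receive informative features.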

Phonetic Normalization Features
Metaphone algorithm (Philips, 1990) maps tokens to equivalence classes based on phonetics
Accuracy (bar chart): 89.37 with these features; 0.42 points lower without them
Examples (one equivalence class per line):
- tomarrow tommorow tomorr tomorrow tomorrowwww
- hahaaha hahaha hahahah hahahahhaa hehehe hehehee
- thangs thanks thanksss thanx things thinks thnx
- knew kno know knw n nah naw new no noo nooooooo now
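To illustrate how a phonetic key groups spelling variants, the sketch below uses a deliberately crude stand-in: it collapses repeated letters and drops non-initial vowels. This is NOT the Metaphone algorithm and will not reproduce the equivalence classes on the slide; the function name and the rule are assumptions made only for illustration.

```python
import re
from collections import defaultdict

def crude_phonetic_key(token):
    """Simplified stand-in for a phonetic key (not real Metaphone)."""
    word = token.lower()
    # Collapse runs of a repeated letter: "tomorrowwww" -> "tomorow".
    word = re.sub(r"(.)\1+", r"\1", word)
    if not word:
        return ""
    # Keep the first character, then drop vowels from the rest.
    return word[0] + re.sub(r"[aeiou]", "", word[1:])

# Group a few of the slide's example tokens by the crude key.
groups = defaultdict(list)
for tok in ["tomarrow", "tommorow", "tomorrow", "tomorrowwww", "thanks", "thanksss"]:
    groups[crude_phonetic_key(tok)].append(tok)
print(dict(groups))
```

The key of each token would then serve as a feature, so that a rare misspelling shares evidence with its more frequent variants.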

Tag Dictionary Features
One feature for each tag a word occurs with in the Penn Treebank, with its frequency rank
A similar feature for Metaphone classes of Penn Treebank words
Accuracy (bar chart): 89.37 with these features; 1.06 points lower without them
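The first feature type can be sketched as follows: collect, for each word, the tags it appears with in a tagged corpus, rank them by frequency, and emit one feature per (tag, rank) pair. The feature string format, the lowercasing, and the toy corpus are assumptions; the slides do not give these details.

```python
from collections import Counter, defaultdict

def build_tag_dictionary(tagged_tokens):
    """Map each word to its observed tags, most frequent first."""
    counts = defaultdict(Counter)
    for word, tag in tagged_tokens:
        counts[word.lower()][tag] += 1  # lowercasing is an assumption
    return {w: [t for t, _ in c.most_common()] for w, c in counts.items()}

def tag_dictionary_features(word, tag_dict):
    """One feature per tag the word occurs with, carrying its rank."""
    tags = tag_dict.get(word.lower(), [])
    return [f"tagdict:{tag}:rank={rank}" for rank, tag in enumerate(tags, start=1)]

# Toy stand-in for a treebank (made up).
corpus = [("book", "N"), ("book", "N"), ("book", "V"), ("the", "D")]
tag_dict = build_tag_dictionary(corpus)
print(tag_dictionary_features("book", tag_dict))
print(tag_dictionary_features("ipad", tag_dict))
```

The Metaphone-class variant mentioned on the slide would apply the same lookup to a word's phonetic key instead of its spelling, so that unseen spellings can still hit the dictionary.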

Conclusions
- We developed a tag set, annotated data, designed features, and trained models
- Case study in rapidly porting a fundamental NLP task to a social media domain
- Data may be useful for domain adaptation or semi-supervised learning

Thanks! Tagger, tokenizer, and annotations are available (50+ downloads already!): www.ark.cs.cmu.edu/tweetnlp/