Deep Learning in Natural Language Processing. Tong Wang Advisor: Prof. Ping Chen Computer Science University of Massachusetts Boston

Similar documents
System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Applications of memory-based natural language processing

AQUA: An Ontology-Driven Question Answering System

Cross Language Information Retrieval

arxiv: v1 [cs.cl] 2 Apr 2017

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Second Exam: Natural Language Parsing with Neural Networks

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Top US Tech Talent for the Top China Tech Company

Let's Learn English Lesson Plan

Linking Task: Identifying authors and book titles in verbose queries

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Natural Language Processing. George Konidaris

Python Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

CEFR Overall Illustrative English Proficiency Scales

Distant Supervised Relation Extraction with Wikipedia and Freebase

CS 598 Natural Language Processing

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Probabilistic Latent Semantic Analysis

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

A Case Study: News Classification Based on Term Frequency

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

A Vector Space Approach for Aspect-Based Sentiment Analysis

Ch VI- SENTENCE PATTERNS.

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Parsing of part-of-speech tagged Assamese Texts

Introduction to CS 100 Overview of UK. CS September 2015

Switchboard Language Model Improvement with Conversational Data from Gigaword

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Residual Stacking of RNNs for Neural Machine Translation

An Empirical and Computational Test of Linguistic Relativity

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Context Free Grammars. Many slides from Michael Collins

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Construction Grammar. University of Jena.

THE world surrounding us involves multiple modalities

LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE

Georgetown University at TREC 2017 Dynamic Domain Track

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

The College Board Redesigned SAT Grade 12

Deep Neural Network Language Models

What the National Curriculum requires in reading at Y5 and Y6

ACCT 3400, BUSN 3400-H01, ECON 3400, FINN COURSE SYLLABUS Internship for Academic Credit Fall 2017

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Foundations of Knowledge Representation in Cyc

Speech Recognition at ICSI: Broadcast News and beyond

Guidelines for Writing an Internship Report

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

arxiv: v2 [cs.ir] 22 Aug 2016

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

Compositional Semantics

Tour. English Discoveries Online

A Comparison of Two Text Representations for Sentiment Analysis

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

21st Century Community Learning Center

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

Control and Boundedness

Constraining X-Bar: Theta Theory

BYLINE [Heng Ji, Computer Science Department, New York University,

Chapter 4: Valence & Agreement CSLI Publications

Organizing Comprehensive Literacy Assessment: How to Get Started

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanities

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

CSL465/603 - Machine Learning

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

English Language and Applied Linguistics. Module Descriptions 2017/18

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Vocabulary Usage and Intelligibility in Learner Language

Developing Grammar in Context

Learning Methods for Fuzzy Systems

A Domain Ontology Development Environment Using a MRD and Text Corpus

ANGLAIS LANGUE SECONDE

(Includes a Detailed Analysis of Responses to Overall Satisfaction and Quality of Academic Advising Items) By Steve Chatman

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

Test Blueprint. Grade 3 Reading English Standards of Learning

Assignment 1: Predicting Amazon Review Ratings

Modeling function word errors in DNN-HMM based LVCSR systems

NAME: East Carolina University PSYC Developmental Psychology Dr. Eppler & Dr. Ironsmith

Psychology and Language

Illinois WIC Program Nutrition Practice Standards (NPS) Effective Secondary Education May 2013

Procedia - Social and Behavioral Sciences 154 ( 2014 )

An Introduction to the Minimalist Program

MBA 5652, Research Methods Course Syllabus. Course Description. Course Material(s) Course Learning Outcomes. Credits.

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

Statewide Framework Document for:

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

ARNE - A tool for Namend Entity Recognition from Arabic Text

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Transcription:

Deep Learning in Natural Language Processing Tong Wang Advisor: Prof. Ping Chen Computer Science University of Massachusetts Boston

Outline Natural Language Processing Deep Learning in NLP My Research Projects My Path in Computer Science My Experience to Find Internship

What is Natural Language Processing Natural Language Processing is related to the area of human-computer interaction. Natural language understanding Natural language generation

Natural Language Processing https://d396qusza40orc.cloudfront.net/nlangp/lectures/intro.pdf http://www.cs.nyu.edu/~petrov/lecture1.pdf

Natural Language Processing http://www.slideshare.net/benjaminbengfort/introduction-to-machinelearning-with-scikitlearn

NLP Applications Information Extraction Name Entity Recognition Machine Translation Question Answering Topic Model Summarization

Information Extraction https://d396qusza40orc.cloudfront.net/nlangp/lectures/intro.pdf

Name Entity Recognition Classify elements in text into categories such as location, time, name of person, organization. Jim worked in Google corp. in 2012 (Jim)[person] worked in (Google corp.) [organization] in (2012)[time]

Machine Translation

Machine Translation Difficulties Words together are more than the sum of their parts. Can not translated word by word E.g, Fast food, Light rain Need a big dictionary with grammar rules in both languages, large start-up cost Require computer to understand

Question Answering IBM Watson won Jeopardy on 02/16/2011

Question Answering

Question Answering

Question Answering

NLP Tasks https://class.coursera.org/nlp/lecture/124

Why NLP is hard Basically text is not computer-friendly Many different ways to represent the same thing Order and context are extremely important Language is very high dimensional and sparse. Tons of rare words. B4 (before), IC (I see), cre8(create) Ambiguity

Ambiguity At last, a computer understands you like your mother It understands you as well as your mother understands you It understands (that) you like your mother It understands you as well as it understands your mother

Ambiguity at Syntactic Level https://d396qusza40orc.cloudfront.net/nlangp/lectures/intro.pdf

DEEP LEARNING IN NATURAL LANGUAGE PROCESSING

Deep Learning (Representation learning) in NLP http://www.iro.umontreal.ca/~memisevr/dlss2015/dlss2015-nlp-1.pdf

Deep Learning in NLP Word Level Application: Word Embedding, word2vec Sentence/paragraph Level Application: Neural Machine Translation, doc2vec, etc.

Word Representation The majority of rule-based and statistical NLP work regarded words as atomic symbols In vector space terms, this is a vector with one 1 and many zeros, it is called one-hot representation Condo: [0,0,0,0,1,0,0, 0] Apartment: [0,1,0,0,0,0,0, 0] These two vectors are orthogonal, no similarity

Word2vec http://www.iro.umontreal.ca/~memisevr/dlss2015/dlss2015-nlp-1.pdf

Word Embedding From word2vec Parameter Learning Explained

Word Embedding From Distributed Representations of Words and Phrases and their Compositionality

Word Embedding W( woman ) W( man ) W( queen ) W( king )

Sentence Embedding From Paragraph Vector - Stanford Computer Science

Recurrent Neural Network http://colah.github.io/posts/2015-08-understanding-lstms/

Neural Machine Translation https://devblogs.nvidia.com/parallelforall/introduction-neural-machinetranslation-gpus-part-2/

MY RESEARCH ROJECT

Text Simplification Text simplification (TS) aims to simplify the lexical, grammatical, or structural complexity of text while retaining its semantic meaning It can help various groups of people, including children, non-native speakers, and people with cognitive disabilities

Lexical Simplification Substitute long and infrequent words with shorter and more frequent words Candidate selection Semantic similarity Syntax and grammar correct The meaning of the sentence remains the same Disadvantage: On word level

Lexical Simplification Lexical Simplification webpage: http://158.121.178.171/

LS System For each word w in text: Check part of speech tagging of w Retrieve top 20 most similar words from word2vec For c in 20 candidate words: If c is the same pos with w If c is not a different form of w, e.g, past tense. If w is more difficult than c: Put c in the sentence, compute sentence similarity and n- gram Otherwise continue

TS using Neural Machine Translation Original English and simplified English can be thought of as two different languages. TS would be the process to translate English to simplified English.

Text Simplification using Neural Machine Translation AAAI 2016, Student abstract

Steps Collecting training data Pairs of sentences: original sentence and simplified sentence From English Wikipedia and Simple English Wikipedia Build RNN Encoder Decoder Model Evaluation

Use Sentence Similarity to Collect Training Data From Siamese Recurrent Architectures for Learning Sentence Similarity

Other projects Extended topic model for word dependency Opinion mining for chemical spill in West Virginia http://158.121.178.175/ Compression and data mining

My Path in Computer Science Huazhong Agricultural University, Information and Computing Science, BS, China, 2006 2010 Bioinformatics lab, Huazhong Agricultural University, 2010-2010 Northeastern University, Computer Systems Engineering, MS, 2011-2013 IoMosaic, Software Engineer, 2013-2013 University of Massachusetts Boston, Computer Science, PhD, 2014 - present

Keep Healthy Play badminton almost every day from Monday to Friday Run 5 miles in weekend

Keys to find internship Good resume Did a lot of projects Networking (Very important!) Go to conference Ask for job reference from professors, friends, alumni, strangers from Linkedin

Prepare interview Know that company Behavior questions Technical questions You must start to practice programming in your favorite language at least 1 month before the interview. (Leetcode)

Thank you!