Natural Language Processing: Introduction. Matthias Gallé, Naver Labs Europe. 8th January 2018. © 2017 NAVER LABS. All rights reserved.


Natural Language Processing: Introduction
Matthias Gallé (@mgalle), Naver Labs Europe
8th January 2018
© 2017 NAVER LABS. All rights reserved.

Natural Language Processing: Definition
"Natural language processing (NLP) is a collective term referring to automatic computational processing of human languages."
(Yoav Goldberg, Neural Network Methods for NLP)

vs. Computational Linguistics
- "Scientific study of language from a computational perspective" [aclweb.com]
- Like any "computational X": use computers to understand X
- In general: more focused on understanding or supporting a theory than on solving a problem

Natural Language Processing
Natural vs. Formal
[Comic: https://xkcd.com/1090. Caption: [audience looks around] 'What just happened?' 'There must be some context we're missing.']

Noam Chomsky
- Key to turning linguistics into a science
- Focused on the structure of human language
- Created the link with mathematical models and learnability
- His results are used every time you use formal grammars
- (Also my academic great-great-great-grandfather)

Why natural language is hard
"It's difficult to extract sense from strings, but they're the only communication coin we can count on."
Alan Perlis (first Turing Award recipient)
"Within a computer, natural language is unnatural."
We all know NLP is not "solved" [comic relief]

Why text is hard
- Ambiguous: "Alice saw Bob using the telescope"
- Exceptions: try to understand the gender of ships
- Noisy: "OMG did U c how teen wrte dez days? LOL"
- Contextualized: "pizza coke.. It was cold!"
- Coreference/synonyms: "The US president greeted the German prime minister. She critiqued his stand on."
- Common sense: "The trophy would not fit in the brown suitcase because it was too big [small]"
- Evolves: 1000 new words are added to the Oxford Dictionary each year [https://blog.oxforddictionaries.com/august-2013-update/]
[Comic: https://xkcd.com/1576/]

ML methods in NLP
1. Rule-based models (inference rules, planning), 60-90: hard to adapt (noise or new environment)
2. Data-driven (probabilistic linear models), 90-2014: feature engineering
3. Learn the features, 2014-: non-convex models
To know more: https://video.ias.edu/machinelearning/2017/1115-christophermanning
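The difference between the first two families above can be sketched on a toy sentiment task. This is only an illustration under stated assumptions: the lexicon, the four training sentences, and the function names are all invented here, and multinomial Naive Bayes with add-one smoothing stands in for the "probabilistic linear models" of the slide (it is linear in log space).

```python
import math
from collections import Counter

# Approach 1: a hand-written rule. It fails on wording its author did not foresee.
POSITIVE_WORDS = {"good", "great"}  # hypothetical hand-picked lexicon

def rule_based(text):
    words = text.lower().split()
    return "pos" if any(w in POSITIVE_WORDS for w in words) else "neg"

# Approach 2: parameters estimated from labelled data (toy, invented examples).
TRAIN = [
    ("great movie loved it", "pos"),
    ("awesome plot and good acting", "pos"),
    ("boring movie hated it", "neg"),
    ("awful plot and bad acting", "neg"),
]

def train_nb(examples):
    """Count words per label; these counts are the whole model."""
    word_counts = {"pos": Counter(), "neg": Counter()}
    doc_counts = Counter()
    for text, label in examples:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, doc_counts, vocab

def predict_nb(model, text):
    """Pick the label with the highest log prior plus summed log likelihoods."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        total = sum(counts.values())
        score = math.log(doc_counts[label] / total_docs)
        for w in text.lower().split():
            if w in vocab:  # words never seen in training are ignored
                score += math.log((counts[w] + 1) / (total + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

model = train_nb(TRAIN)
print(rule_based("awesome acting"))         # the rule has no entry for "awesome"
print(predict_nb(model, "awesome acting"))  # the counts do
```

Adapting the rule-based version to new data means editing the lexicon by hand, while the data-driven version only needs more labelled examples; the price is that someone still has to decide which features (here, single words) to count.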

Neural Networks in NLP: early successes
- Yoshua Bengio, et al. "A neural probabilistic language model." JMLR 2003.
- James Henderson. "Inducing history representations for broad coverage statistical parsing." NAACL 2003.
- Ronan Collobert and Jason Weston. "A unified architecture for natural language processing: Deep neural networks with multitask learning." ICML 2008.

Impact of DL in NLP
... at least not so clearly (yet)
- Smart feature engineering often (still) outperforms DL
  Chen, Danqi, Jason Bolton, and Christopher D. Manning. "A thorough examination of the CNN/Daily Mail reading comprehension task." ACL 2016.
- Some parts of traditional ML are still used
  Huang, Zhiheng, Wei Xu, and Kai Yu. "Bidirectional LSTM-CRF models for sequence tagging." arXiv 2015.
  Lample, Guillaume, et al. "Neural Architectures for Named Entity Recognition." NAACL 2016.
https://qz.com/1034972/the-data-that-changed-the-direction-of-ai-research-and-possibly-the-world/

Impact of DL in NLP
1. Wide adoption of continuous representations
   Mikolov, Chen, Corrado, and Dean. "Efficient estimation of word representations in vector space." ICLR Workshop, 2013.
   Pennington, Jeffrey, Richard Socher, and Christopher Manning. "GloVe: Global vectors for word representation." EMNLP 2014.
2. Encoder/decoder framework (sequence models without the Markovian assumption) + differentiable extensions
   Sutskever, Ilya, Oriol Vinyals, and Quoc V. Le. "Sequence to sequence learning with neural networks." Advances in Neural Information Processing Systems, 2014.
3. Removal of feature engineering (replaced by architecture engineering)
[Figure labels: Text; tokenization, lemmatization, POS-tagging, NER, classification; Neural Network; Higher-level representation; Prediction]
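To make "continuous representation" concrete: each word is mapped to a dense vector, and closeness between vectors is used as a proxy for relatedness between words. The sketch below uses three hand-made 3-dimensional vectors that are purely hypothetical (they are not taken from word2vec or GloVe); the helper names are likewise assumptions for illustration.

```python
import math

# Hypothetical toy vectors, invented for illustration only.
VECTORS = {
    "king":  [0.9, 0.8, 0.1],
    "queen": [0.9, 0.7, 0.2],
    "pizza": [0.1, 0.2, 0.9],
}

def cosine(u, v):
    """Cosine similarity: dot product divided by the product of the norms."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def most_similar(word):
    """Return the other vocabulary word whose vector is closest to `word`."""
    others = (w for w in VECTORS if w != word)
    return max(others, key=lambda w: cosine(VECTORS[word], VECTORS[w]))

print(most_similar("king"))
```

With a one-hot (discrete) encoding every pair of distinct words would be equally dissimilar; the point of the continuous encoding is that "king" can be nearer to "queen" than to "pizza".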

This course: Boundaries
Assuming you know about:
- Basics of optimization (SGD) and linear algebra (SVD)
- Basics of supervised learning (logistic regression, regularization)
- Python
Limits:
- No spoken language (neither ASR nor TTS)
- No multi-modality
- No OCR
- Only an introduction!
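As a quick self-check on the assumed background, the three prerequisites SGD, logistic regression and regularization fit in a few lines. This is a minimal sketch, not course material: the toy data, the hyperparameters (`epochs`, `lr`, `l2`) and the function names are assumptions chosen for illustration.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_logreg(data, epochs=200, lr=0.5, l2=0.01, seed=0):
    """Logistic regression trained by SGD with an L2 penalty on the weights."""
    rng = random.Random(seed)
    data = list(data)
    w = [0.0] * len(data[0][0])
    b = 0.0
    for _ in range(epochs):
        rng.shuffle(data)  # visit the examples in a new random order each epoch
        for x, y in data:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
            err = p - y  # gradient of the log loss w.r.t. the pre-activation
            # One-example gradient step; the l2 * wi term shrinks the weights.
            w = [wi - lr * (err * xi + l2 * wi) for wi, xi in zip(w, x)]
            b -= lr * err  # the bias is left unregularized
    return w, b

def predict(w, b, x):
    return 1 if sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b) >= 0.5 else 0

# Toy, linearly separable data: label 1 when the first feature dominates.
DATA = [([1.0, 0.0], 1), ([0.9, 0.2], 1), ([0.0, 1.0], 0), ([0.2, 0.8], 0)]
w, b = train_logreg(DATA)
print([predict(w, b, x) for x, _ in DATA])
```

If each line of the inner loop looks familiar, the optimization and supervised-learning prerequisites listed above are probably covered.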

This course: Approach
Problem-specific:
- Language modelling
- Representation of words and documents
- Part-of-Speech (POS) tagging
- Named Entity Recognition
- Parsing
- Social Media Analysis
- Machine Translation
- Question Answering
- Dialogue
Methods will be introduced as needed, trying to strike a balance between historical importance and modern approaches.

This course: Lecturers
- Matthias Gallé: 8, 15, 22/01
- Salah Ait-Mokhtar: 29/01
- Caroline Brun: 5/02
- Marc Dymetman: 26/02
- Julien Perez: 5, 12/03
Contact: <firstname>.<lastname>@naverlabs.com

This course: Evaluation
3 programming exercises:
- word embeddings: mid-January
- information extraction: beginning of February
- seq2seq: beginning of March
Recommended language: Python

This course: Bibliography
- Jurafsky & Martin. Speech and Language Processing. 2017 (3rd edition, ongoing). https://web.stanford.edu/~jurafsky/slp3/ [MAIN]
- Y. Goldberg. Neural Network Methods in Natural Language Processing. 2017.

© 2017 NAVER Corp.


NAVER LABS Europe
- Artificial Intelligence
- Computer Vision
- Machine Learning & Optimization
- Natural Language Processing
- Knowledge and Processes
- Geospatial Data Science
- UX and Ethnography
Internships: http://www.europe.naverlabs.com/naver-labs-europe/internships
Jobs: http://www.europe.naverlabs.com/naver-labs-europe/jobs