Models of Dialog and Conversation

Similar documents
Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Speech Recognition at ICSI: Broadcast News and beyond

Linking Task: Identifying authors and book titles in verbose queries

A Case Study: News Classification Based on Term Frequency

arxiv: v1 [cs.cl] 2 Apr 2017

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

THE world surrounding us involves multiple modalities

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Residual Stacking of RNNs for Neural Machine Translation

The Smart/Empire TIPSTER IR System

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

A Vector Space Approach for Aspect-Based Sentiment Analysis

Second Exam: Natural Language Parsing with Neural Networks

CS 598 Natural Language Processing

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Word Segmentation of Off-line Handwritten Documents

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Assignment 1: Predicting Amazon Review Ratings

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

Python Machine Learning

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation

CS Machine Learning

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Lip Reading in Profile

Applications of memory-based natural language processing

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

Australian Journal of Basic and Applied Sciences

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

AQUA: An Ontology-Driven Question Answering System

Probabilistic Latent Semantic Analysis

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Modeling function word errors in DNN-HMM based LVCSR systems

Dialog-based Language Learning

The taming of the data:

Evaluation of a Simultaneous Interpretation System and Analysis of Speech Log for User Experience Assessment

Welcome to. ECML/PKDD 2004 Community meeting

Ensemble Technique Utilization for Indonesian Dependency Parser

Corpus Linguistics (L615)

Language Model and Grammar Extraction Variation in Machine Translation

NEURAL DIALOG STATE TRACKER FOR LARGE ONTOLOGIES BY ATTENTION MECHANISM. Youngsoo Jang*, Jiyeon Ham*, Byung-Jun Lee, Youngjae Chang, Kee-Eung Kim

Pre-Algebra A. Syllabus. Course Overview. Course Goals. General Skills. Credit Value

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

Radius STEM Readiness TM

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

Georgetown University at TREC 2017 Dynamic Domain Track

Yoshida Honmachi, Sakyo-ku, Kyoto, Japan 1 Although the label set contains verb phrases, they

Generative models and adversarial training

Language Independent Passage Retrieval for Question Answering

Overview of the 3rd Workshop on Asian Translation

Mastering Team Skills and Interpersonal Communication. Copyright 2012 Pearson Education, Inc. publishing as Prentice Hall.

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

On the Formation of Phoneme Categories in DNN Acoustic Models

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Natural Language Processing. George Konidaris

The stages of event extraction

BYLINE [Heng Ji, Computer Science Department, New York University,

CS 446: Machine Learning

A process by any other name

Noisy SMS Machine Translation in Low-Density Languages

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

arxiv: v3 [cs.cl] 7 Feb 2017

ScienceDirect. Malayalam question answering system

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

SEMAFOR: Frame Argument Resolution with Log-Linear Models

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Lessons from a Massive Open Online Course (MOOC) on Natural Language Processing for Digital Humanities

Calibration of Confidence Measures in Speech Recognition

Metadiscourse in Knowledge Building: A question about written or verbal metadiscourse

Compositional Semantics

English Language and Applied Linguistics. Module Descriptions 2017/18

Adaptive Generation in Dialogue Systems Using Dynamic User Modeling

Learning Methods for Fuzzy Systems

Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

An Introduction to the Minimalist Program

(Sub)Gradient Descent

Cross Language Information Retrieval

Learning Methods in Multilingual Speech Recognition

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

Developing a TT-MCTAG for German with an RCG-based Parser

Using Semantic Relations to Refine Coreference Decisions

ON THE USE OF WORD EMBEDDINGS ALONE TO

arxiv: v1 [cs.cv] 10 May 2017

Model Ensemble for Click Prediction in Bing Search Ads

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Modeling function word errors in DNN-HMM based LVCSR systems

arxiv: v5 [cs.ai] 18 Aug 2015

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Transcription:

CS11-747 Neural Networks for NLP Models of Dialog and Conversation Graham Neubig Site https://phontron.com/class/nn4nlp2017/

Types of Dialog Who is talking? Human-human Human-computer Why are they talking? Task driven Chat

Models of Chat

Two Paradigms Generation-based models Take input, generate output Good if you want to be creative Retrieval-based models Take input, find most appropriate output Good if you want to be safe

Generation-based Models (Ritter et al. 2011) Train phrase-based machine translation system to perform translation from utterance to response Lots of filtering, etc., to make sure that the extracted translation rules are reliable

Neural Models for Dialog Response Generation (Sordoni et al. 2015, Sheng et al. 2015, Vinyals and Le 2015) Like other translation tasks, dialog response generation can be done with encoder-decoders Sheng et al. (2015) present simplest model, translating from previous utterance

Problem 1: Dialog More Dependent on Global Coherence Considering only a single previous utterance will lead to locally coherent but globally incoherent output Necessary to consider more context! (Sordoni et al. 2015) Contrast to MT, where context sometimes is (Matsuzaki et al. 2015) and sometimes isn t (Jean et al. 2015) helpful

One Solution: Use Standard Architecture w/ More Context Sordoni et al. (2015) consider one additional previous context utterance concatenated together Vinyals et al. (2015) just concatenate together all previous utterances and hope an RNN an learn

Hierarchical Encoderdecoder Model (Serban et al. 2016) Also have utterance-level RNN track overall dialog state

Discourse-level VAE Model (Zhao et al. 2017) Encode entire previous dialog context as latent variable in VAE Also meta-information such as dialog acts Also, bag-of-words loss

Problem 2: Dialog allows Much More Varied Responses For translation, there is lexical variation but content remains the same For dialog, content will also be different! (e.g. Li et al. 2016)

Diversity Promoting Objective for Conversation (Li et al. 2016) Basic idea: we want responses that are likely given the context, unlikely otherwise Method: subtract weighted unconditioned log probability from conditioned probability (calculated only on first few words)

Diversity is a Problem for Evaluation! Translation uses BLEU score; while imperfect, not horrible In dialog, BLEU shows very little correlation (Liu et al. 2016)

Using Multiple References with Human Evaluation Scores (Galley et al. 2015) Retrieve good-looking responses, perform human evaluation, up-weight good ones, down-weight bad ones

Learning to Evaluate Use context, true response, and actual response to learn a regressor that predicts goodness (Lowe et al. 2017) Important: similar to model, but has access to reference! Adversarial evaluation: try to determine whether response is true or fake (Li et al. 2017) One caveat from MT: learnable metrics tend to overfit

Problem 3: Dialog Agents should have Personality If we train on all of our data, our agent will be a mish-mash of personalities (e.g. Li et al. 2016) We would like our agents to be consistent!

Personality Infused Dialog (Mairesse et al. 2007) Train a generation system with controllable knobs based on personality traits e.g. Extraversion: Non-neural, but well done and perhaps applicable

Persona-based Neural Dialog Model (Li et al. 2017) Model each speaker in embedding space Also model who the speaker is speaking to in speaker-addressee model

Retrieval-based Models

Dialog Response Retrieval Idea: many things can be answered with template Simply find most relevant response out of existing ones in corpus Template responses Image Credit: Google

Retrieval-based Chat (Lee et al. 2009) Basic idea: given an utterance, find the most similar in the database and return it Similarity based on exact word match, plus extracted features regarding discourse

Neural Response Retrieval (Nio et al. 2014) Idea: use neural models to soften the connection between input and output and do more flexible matching Model uses Socher et al. (2011) recursive autoencoder + dynamic pooling

Smart Reply for Email Retrieval (Kannan et al. 2016) Implemented in GMail smart reply Similar response model with LSTM seq2seq scoring, but many improvements Beam search over response space for scalability Canonicalization of syntactic variants and clustering of similar responses Human curation of responses Enforcement of diversity through omission of redundant responses and enforcing positive/negative

Task-driven Dialog

Chat vs. Task Completion Chat is basically to keep the user entertained What if we want to do an actual task? Book a flight Access information from a database

Traditional Task-completion Dialog Framework In semantic frame based dialog: Natural language understanding to fill the slots in the frame based on the user utterance Dialog state tracking to keep track of the overall dialog state over multiple turns Dialog control to decide the next action based on state Natural language generation to generate utterances based on current state

NLU (for Slot Filling) w/ Neural Nets (Mesnil et al. 2015) Slot filing expressed as BIO scheme RNN-CRF based model for tags

Dialog State Tracking Track the belief about our current frame-filling state (Williams et al. 2013) Henderson et al. (2014) present RNN model that encodes multiple ASR hypotheses and generalizes by abstracting details

Language Generation from Dialog State w/ Neural Nets (Wen et al. 2015) Condition LSTM units based on the dialog input, output English

End-to-end Dialog Control (Williams et al. 2017) Train an LSTM that takes in text and entities and directly chooses an action to take (reply or API call) Trained using combination of supervised and reinforcement learning

Questions?