Lisa Amini Director, IBM Research Cambridge, Acting Director, MIT-IBM Watson AI Lab. MIT 6.S191 Intro to Deep Learning

Similar documents
Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

A Vector Space Approach for Aspect-Based Sentiment Analysis

Dialog-based Language Learning

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Probabilistic Latent Semantic Analysis

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Knowledge-Based - Systems

Rule Learning With Negation: Issues Regarding Effectiveness

Skillsoft Acquires SumTotal: Frequently Asked Questions. October 2014

Top US Tech Talent for the Top China Tech Company

Lecture 1: Basic Concepts of Machine Learning

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Axiom 2013 Team Description Paper

Natural Language Processing. George Konidaris

Linking Task: Identifying authors and book titles in verbose queries

arxiv: v1 [cs.cl] 20 Jul 2015

MYCIN. The MYCIN Task

Rule Learning with Negation: Issues Regarding Effectiveness

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Georgetown University at TREC 2017 Dynamic Domain Track

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Beyond the Blend: Optimizing the Use of your Learning Technologies. Bryan Chapman, Chapman Alliance

Python Machine Learning

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

Calibration of Confidence Measures in Speech Recognition

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

CS Machine Learning

Human Emotion Recognition From Speech

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

arxiv: v1 [cs.cl] 2 Apr 2017

On the Combined Behavior of Autonomous Resource Management Agents

Semantic and Context-aware Linguistic Model for Bias Detection

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Detecting English-French Cognates Using Orthographic Edit Distance

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Semi-Supervised Face Detection

A Case Study: News Classification Based on Term Frequency

An OO Framework for building Intelligence and Learning properties in Software Agents

Generative models and adversarial training

EXAMINING THE DEVELOPMENT OF FIFTH AND SIXTH GRADE STUDENTS EPISTEMIC CONSIDERATIONS OVER TIME THROUGH AN AUTOMATED ANALYSIS OF EMBEDDED ASSESSMENTS

Circuit Simulators: A Revolutionary E-Learning Platform

Conversation Starters: Using Spatial Context to Initiate Dialogue in First Person Perspective Games

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Emergent Narrative As A Novel Framework For Massively Collaborative Authoring

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

AQUA: An Ontology-Driven Question Answering System

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Artificial Neural Networks written examination

Unit 7 Data analysis and design

(Sub)Gradient Descent

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Operational Knowledge Management: a way to manage competence

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Machine Learning and Development Policy

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Using Web Searches on Important Words to Create Background Sets for LSI Classification

B.S/M.A in Mathematics

Market Economy Lesson Plan

Forget catastrophic forgetting: AI that learns after deployment

Abstractions and the Brain

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

Community Power Simulation

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Reinforcement Learning by Comparing Immediate Reward

Unpacking a Standard: Making Dinner with Student Differences in Mind

The Good Judgment Project: A large scale test of different methods of combining expert predictions

A student diagnosing and evaluation system for laboratory-based academic exercises

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Computerized Adaptive Psychological Testing A Personalisation Perspective

Speech Recognition at ICSI: Broadcast News and beyond

Mining Association Rules in Student s Assessment Data

EGRHS Course Fair. Science & Math AP & IB Courses

Switchboard Language Model Improvement with Conversational Data from Gigaword

CNS 18 21th Communications and Networking Simulation Symposium

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Corporate learning: Blurring boundaries and breaking barriers

VIA ACTION. A Primer for I/O Psychologists. Robert B. Kaiser

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Some Principles of Automated Natural Language Information Extraction

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Transcription:

Beyond Deep Learning : Learning+Reasoning Lisa Amini Director, IBM Research Cambridge, Acting Director, MIT-IBM Watson AI Lab MIT 6.S191 Intro to Deep Learning

2011, IBM Watson computer wins human champions at Jeopardy! 2017, World s first 50 qubit quantum computer 2017, IBM demonstrates 95% scaling efficiency on Caffe deep learning framework 2017, quantum algo efficiently computes lowest energy state of small molecules. Leading corporate institution for high-quality science IBM Research 3000 creative, scientific and technical minds worldwide 6 Nobel Laureates 10 National Medals of Technology 5 National Medals of Science 6 Turing Awards

The MIT-IBM Watson AI Lab $240M 10 year commitment to jointly create the future of artificial intelligence Fundamental advances in AI algorithms Physics of AI AI Transforming Industries: Healthcare, Life Sciences & Cybersecurity Advancing Shared prosperity through AI http://mitibmwatsonailab.mit.edu/ 3

Moments in Time Landmark 1 Million video dataset to transform AI Vision Pushing Carrying MIT-IBM Team Three seconds events Open access http://moments.csail.mit.edu/ Goal: Recognizing and understanding actions in video

Recent successes in Deep Learning are awe-inspiring, but epic breakthroughs are still needed for Machine Intelligence Humans learn without a lot of labeled data per task Why can t machines? People learn continuously throughout their lives, remembering what they ve learned and leveraging it for new tasks Current algorithms suffer from catastrophic forgetting and are unable to recognize and generalize to analogous situations or tasks To interact sensibly with humans, machines must be able to remember, reason, explain, and seek to fill knowledge gaps Learning+reasoning 5

Making Language Computational Word Embeddings Represent words as a real-valued vector in some abstract space Goal: representations that capture multiple degrees of similarity Skip-gram model Maximize the average log probability of predicting surrounding words Distributed representations of words and phrases and their compositionality, Mikolov, et al, 2013 Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors, Baroni, Dinu, Kruszewski, 2014 FastText Open library for unsupervised learning of word embeddings. http://fasttext.cc 6

Embeddings Impact on Automated Knowledgebase Construction (AKBC) Formal knowledge representation Reasoning with Neural Tensor Networks for Knowledge Base Completion, Socher, et al, 2013 Distant Supervision for Relation Extraction with an Incomplete KB, Min, Grishman, Wan, Wang, Gondek, 2013 Compositional Vector Space Models for Knowledge Base Completion, Neelakantan, Roth, McCallum, 2015 7

Embeddings Impact on Automated Knowledgebase Construction (AKBC) Predicting Drug-Drug Interactions Through Large-Scale Similarity-based Link Prediction, Fokoue, et al 2016 Relation prediction with confidence, leveraging disparate structured and unstructured data Socrates: Deep Relational Knowledge Induction, Glass, et al, 2017!st place winner: ISWC Semantic Web Challenge on AKBC 8

How to create differentiable machines to reason leveraging learned external knowledge bases? 9

Example Task: Question Answering with Long-term Memories Towards AI-Complete Question Answering: A set of prerequisite toy tasks, Weston, et al, 2015 10

Question Answering with External Memories Supervision (direct or reward-based) Output Example Inputs Memory Module m m read addressing read addressing q Controller module Jointly trained with Inputs (I èx è m), Questions (Qèq), Answer (u è o) Memory vectors Input Internal state Vector (initially: query) Memory Networks, Weston, et al, 2015 Memory Networks for Language Understanding, ICML Tutorial, Weston, et al, 2016 11

Question Answering with External Memories Supervision (direct or reward-based) Output Memory Module m m read addressing read addressing q Controller module Memory vectors Input Internal state Vector (initially: query) Memory Networks, Weston, et al, 2015 End-to-End Memory Networks, Sukhbaatar, et al, 2015 Memory Networks for Language Understanding, ICML Tutorial, Weston, et al, 2016 12

Want to Learn More? Improved detection of key relations KBQA Simulator to generate challenge questions from ambiguous texts Bringing commonsense knowledge into vector space Learning to represent and execute programs Learning representations to induce logical rules and perform multi-hop reasoning Improved Neural Relation Detection for Knowledge Base Question Answering, Yu, et al 2016 Learning to Query, Reason, and Answer Questions on Ambiguous Texts, Guo et al, 2017 Lifted Rule Injection for Relation Embeddings, Demeester, Rocktaschel, Riedel, 2016 Neural Program Interpreters, Reed, et al, 2015 End-to-end Differentiable Proving, Rocktaschel, Riedel, 2017 13

Want to do more? Watson Developer Cloud Language Data Insights Speech Language Natural Language Classifier Language Translator Data Insights Personality Insights Tone Analyzer Natural Language Understanding Conversation build chatbots and virtual agents across any channel and domain Discovery aggregate and organize massive amounts of enterprise data, and answer questions in context Vision Speech Speech to Text transcribe audio and take action Text to Speech verbalize written text into understandable audio Vision Visual Recognition help people understand and take action from visual data For the latest view of Watson API s available, go to: www/ibm.com/watsondevelopercloud IBM 14

Thank you! 15