How Machines Learn (Without Being Taught)


Michael I. Shamos, Ph.D., J.D.
School of Computer Science
Carnegie Mellon University

Machine Learning
The computer is incredibly fast, accurate and stupid. Man is unbelievably slow, inaccurate and brilliant. The marriage of the two is a challenge and opportunity beyond imagination.
-- Stuart G. Walesh, author and consultant

Background
Ph.D., Yale University (computer science, 1978)
J.D., Duquesne University (law, 1981)
Carnegie Mellon computer science faculty since 1975
Visiting Professor, University of Hong Kong (2001- ), Electronic Payment Systems
Director, Master's Program in eBusiness Technology, roughly equivalent to HKU Ecom/Icomp
Incoming Director, MS in Artificial Intelligence and Entrepreneurship

Carnegie Mellon School of Computer Science
Computer Science Department (CSD)
Language Technologies Institute (LTI)
Human-Computer Interaction Institute (HCII)
Institute for Software Research (ISR)
Robotics Institute (RI)
Machine Learning Department (ML)
Comp Bio Department
MS in AI and Entrepreneurship
eBusiness Technology
Entertainment Technology Center (ETC)

Machine Learning
A computer program learns from experience if its performance on a task improves based on that experience.
-- paraphrased from Carnegie Mellon Professor Tom Mitchell

Machine Learning Examples

Types of Machine Learning
No learning: Static computer program. Always performs the same way. Changes made by humans.
Supervised learning: The program is given examples of inputs and desired outputs. Trains itself to perform well.
Unsupervised learning: The program is given only inputs and must discover patterns in the data.
Reinforcement learning: The program is given only inputs, but gets rewards for good outputs. Objective: maximize reward.

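A Computational System
Inputs: $x = (x_1, x_2, \ldots, x_N)$
Internal variables: $h = (h_1, h_2, \ldots, h_K)$
Outputs: $y = (y_1, y_2, \ldots, y_L)$
[Diagram: the inputs $x_1, \ldots, x_N$ enter a system with internal variables $h_1, \ldots, h_K$, which produces the outputs $y_1, \ldots, y_L$.]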

Predicting Apartment Prices
Inputs: area in m², # of rooms, purchase price, purchase year, which floor?, ...
System: internal variables $h_1, h_2, \ldots, h_K$
Output: predicted price today
Classic non-learning approach: construct a model of apartment prices and write a computer program.
No learning. If the model is inaccurate, we need a new model and a new program.

Learning to Predict Apartment Prices
Input $X_i$: area in m², # of rooms, purchase price, purchase year, which floor?, ...
System: internal variables $h_1, h_2, \ldots, h_K$
Output: predicted price $p_i$
Error $= p_i - a_i$
Supervised learning approach: use a large number $M$ of actual price examples ($X_i$, actual price $a_i$).
Compare the predicted price $p$ to the actual price $a$, and modify the program to reduce the error $e = p - a$.
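The compare-and-modify loop on this slide can be sketched in a few lines. This is only an illustration under stated assumptions: the "System" is taken to be a simple linear model, the internal variables are its weights and bias, the four training examples are invented numbers, and the function names (`predict`, `train`, `mean_squared_error`) are hypothetical.

```python
# Hypothetical training examples: (area in m^2, number of rooms) -> actual price a_i.
# All numbers are invented for illustration.
examples = [
    ((50.0, 2.0), 250.0),
    ((80.0, 3.0), 390.0),
    ((120.0, 4.0), 560.0),
    ((65.0, 2.0), 310.0),
]

def predict(weights, bias, features):
    """Predicted price p = w . x + b (a linear model standing in for the System)."""
    return sum(w * x for w, x in zip(weights, features)) + bias

def mean_squared_error(weights, bias):
    """Average of e^2 = (p - a)^2 over all examples."""
    return sum((predict(weights, bias, x) - a) ** 2 for x, a in examples) / len(examples)

def train(steps=2000, learning_rate=1e-5):
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(steps):
        for features, actual in examples:
            error = predict(weights, bias, features) - actual  # e = p - a
            # Nudge each internal variable in the direction that reduces e^2.
            weights = [w - learning_rate * error * x for w, x in zip(weights, features)]
            bias -= learning_rate * error
    return weights, bias

initial_error = mean_squared_error([0.0, 0.0], 0.0)
weights, bias = train()
final_error = mean_squared_error(weights, bias)
print(f"mean squared error: {initial_error:.1f} -> {final_error:.1f}")
```

The program itself is never rewritten by a human here; only the numbers inside it change in response to the error.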

A Neuron
[Figure: a neuron with its inputs and weights.]
The neuron computes a function of the sum of the weighted inputs and outputs the value as Y.
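As a rough sketch of that computation: the slide does not say which function the neuron applies, so the logistic sigmoid used below is an assumption, as are the optional `bias` term and the example numbers.

```python
import math

def neuron(inputs, weights, bias=0.0):
    """Return Y = f(sum of w_i * x_i, plus bias), with f assumed to be the sigmoid."""
    weighted_sum = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1.0 / (1.0 + math.exp(-weighted_sum))

# Weighted sum is 0.5 * 1.0 + (-0.3) * 0.0 + 0.8 * 1.0 = 1.3
y = neuron(inputs=[1.0, 0.0, 1.0], weights=[0.5, -0.3, 0.8])
print(y)
```

Any other squashing function could be substituted for the sigmoid without changing the structure: weight the inputs, sum them, apply a function, output Y.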

Biological Basis of Neurons SOURCE: QUORA.COM

A Neural Network
[Figure: a neural network with inputs $x_1, x_2, \ldots, x_N$ and its outputs.]

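A Neural Network is a Computational System
[Figure: the computational system diagram (inputs $x_1, \ldots, x_N$, internal variables $h_1, \ldots, h_K$, outputs $y_1, \ldots, y_L$) shown next to a neural network with the same inputs.]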

Neural Network for Price Estimation Output

Neural Networks Can Learn
A learning algorithm:
Given an input and a known desired output, run the neural network to see the actual output
Error = desired output - actual output
Use the error to modify the weights in the network
This is called training the network
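The learning algorithm above might look like this for a single sigmoid neuron. It is a minimal sketch, not the deck's own method: the toy task (logical OR), the learning rate, and the simple update "weight += rate x error x input" are all assumptions chosen for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Toy task (an assumption): learn logical OR from four labelled examples.
training_set = [((0.0, 0.0), 0.0), ((0.0, 1.0), 1.0), ((1.0, 0.0), 1.0), ((1.0, 1.0), 1.0)]

def run(weights, bias, inputs):
    """Run the one-neuron network and return its actual output."""
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)) + bias)

def train(epochs=2000, learning_rate=0.5):
    weights, bias = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for inputs, desired in training_set:
            actual = run(weights, bias, inputs)
            error = desired - actual  # error = desired output - actual output
            # Use the error to modify the weights (and the bias).
            weights = [w + learning_rate * error * x for w, x in zip(weights, inputs)]
            bias += learning_rate * error
    return weights, bias

weights, bias = train()
outputs = [round(run(weights, bias, x)) for x, _ in training_set]
print(outputs)
```

After training, rounding the neuron's output reproduces the desired output for each of the four examples.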

Neural Networks Can Learn: Backpropagation
[Figure: the error in output 1 is used to adjust the red weights; the error in output 2 is used to adjust the green weights.]
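A small sketch of backpropagation for a network with one hidden layer and two outputs, in the spirit of the figure. The layer sizes, the sigmoid units, the squared-error measure, the two training examples and all function names are assumptions made for this illustration.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(w_hidden, w_out, x):
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in w_hidden]
    output = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w_out]
    return hidden, output

def backprop_step(w_hidden, w_out, x, desired, learning_rate=0.5):
    hidden, output = forward(w_hidden, w_out, x)
    # Output layer: each output's own error drives the weights feeding that output.
    delta_out = [(d - o) * o * (1.0 - o) for d, o in zip(desired, output)]
    # Hidden layer: the output errors flow backwards through the output weights.
    delta_hidden = [
        h * (1.0 - h) * sum(delta_out[k] * w_out[k][j] for k in range(len(w_out)))
        for j, h in enumerate(hidden)
    ]
    for k, row in enumerate(w_out):
        for j in range(len(row)):
            row[j] += learning_rate * delta_out[k] * hidden[j]
    for j, row in enumerate(w_hidden):
        for i in range(len(row)):
            row[i] += learning_rate * delta_hidden[j] * x[i]

def total_error(w_hidden, w_out, data):
    return sum(
        (d - o) ** 2
        for x, desired in data
        for d, o in zip(desired, forward(w_hidden, w_out, x)[1])
    )

random.seed(0)
# The third input is a constant 1.0 so that its weight acts as a bias term.
data = [((0.0, 1.0, 1.0), (1.0, 0.0)), ((1.0, 0.0, 1.0), (0.0, 1.0))]
w_hidden = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(3)]
w_out = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(2)]

error_before = total_error(w_hidden, w_out, data)
for _ in range(2000):
    for x, desired in data:
        backprop_step(w_hidden, w_out, x, desired)
error_after = total_error(w_hidden, w_out, data)
print(f"total squared error: {error_before:.3f} -> {error_after:.3f}")
```

Row `k` of `w_out` plays the role of the coloured weights in the figure: it is adjusted only by the error in output `k`, while the hidden-layer weights receive a blend of both errors.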

What an Image Looks Like to a Machine
A sequence of red-green-blue (RGB) color intensity values
(0, 0, 0) = black
(255, 255, 255) = white
(255, 98, 89) = a shade of pink
$2^{24}$ = about 16.8 million possible values for each pixel
For a 1000 x 1000 pixel image (1 million pixels), about 16.8 trillion pixel-value combinations; the number of distinct possible images, $(2^{24})^{1{,}000{,}000}$, is vastly larger
SOURCE: DMYTRO FISHMAN
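The arithmetic on this slide is easy to check directly. The tiny 2 x 2 image below is a made-up example; the variable names are illustrative only.

```python
import math

# A tiny 2 x 2 "image": each pixel is a (red, green, blue) triple of 0-255 intensities.
image = [
    [(0, 0, 0), (255, 255, 255)],    # black, white
    [(255, 98, 89), (255, 98, 89)],  # the slide's third example colour, twice
]

# What the machine actually receives: one flat sequence of numbers.
flat = [channel for row in image for pixel in row for channel in pixel]
print(flat)

values_per_pixel = 256 ** 3  # 2**24 combinations of R, G and B
pixels = 1000 * 1000         # a 1000 x 1000 image
print(values_per_pixel)           # 16777216
print(values_per_pixel * pixels)  # 16777216000000

# Number of decimal digits in (2**24)**pixels, the count of distinct images.
digits = math.floor(24 * pixels * math.log10(2)) + 1
print(digits)
```

The last figure is the length of the number, not the number itself, which is why no table of "all possible images" could ever be written down.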

Variations of Cat SOURCE:POO KUAN HOONG

Supervised Learning SOURCE: DMYTRO FISHMAN

Supervised Learning Application SOURCE: E. ALPAYDIN

Supervised Learning Application Predicting Airline Ticket Prices

Unsupervised Learning
No labelled training data: the network is given inputs only, with no desired outputs
Network must detect similarities or patterns in the inputs
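To make "detect similarities in the inputs" concrete, here is a sketch using k-means clustering. Note the substitution: k-means is a classic non-neural clustering method, chosen here only because it is short; the points and starting centers are invented.

```python
def k_means(points, centers, iterations=10):
    """Plain k-means: assign each point to its nearest center, then move the centers."""
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for p in points:
            distances = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centers]
            clusters[distances.index(min(distances))].append(p)
        # Each center moves to the mean of its cluster (or stays put if the cluster is empty).
        centers = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster)) if cluster else center
            for cluster, center in zip(clusters, centers)
        ]
    return centers, clusters

# Unlabelled inputs: nobody tells the program that there are two groups here.
points = [(1.0, 1.1), (0.9, 1.0), (1.2, 0.8), (5.0, 5.2), (5.1, 4.9), (4.8, 5.0)]
centers, clusters = k_means(points, centers=[points[0], points[3]])
print(clusters)
```

The program is never told which group a point belongs to; the grouping emerges from distances between the inputs alone.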

Example: Clustering News Stories

Unsupervised News Clustering SOURCE: HINTON & SALAKHUTDINOV

Unsupervised Image Recognition

Autoencoders Idea: compress patterns to represent them with fewer features in a code. Train the net to reproduce the original patterns just from the code. Gives a much more robust recognizer.
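A deliberately tiny sketch of the autoencoder idea: two features are compressed into a one-number code and then reconstructed from that code alone. Everything here is an assumption for illustration, including the linear encoder and decoder (no hidden nonlinearity), the invented patterns, and the learning rate.

```python
# Hypothetical 2-D patterns that all lie on the line y = 2x,
# so a single number is enough to describe each one.
patterns = [(-1.0, -2.0), (-0.5, -1.0), (0.5, 1.0), (1.0, 2.0)]

def encode(enc, p):
    return enc[0] * p[0] + enc[1] * p[1]   # 2 features -> 1-number code

def decode(dec, code):
    return (dec[0] * code, dec[1] * code)  # code -> 2 reconstructed features

def reconstruction_error(enc, dec):
    total = 0.0
    for p in patterns:
        r = decode(dec, encode(enc, p))
        total += (r[0] - p[0]) ** 2 + (r[1] - p[1]) ** 2
    return total

def train(epochs=1000, learning_rate=0.02):
    enc, dec = [0.5, 0.5], [0.5, 0.5]
    for _ in range(epochs):
        for p in patterns:
            code = encode(enc, p)
            r = decode(dec, code)
            err = (r[0] - p[0], r[1] - p[1])
            # Gradient of the squared reconstruction error with respect to the code.
            grad_code = 2 * (err[0] * dec[0] + err[1] * dec[1])
            dec = [dec[0] - learning_rate * 2 * err[0] * code,
                   dec[1] - learning_rate * 2 * err[1] * code]
            enc = [enc[0] - learning_rate * grad_code * p[0],
                   enc[1] - learning_rate * grad_code * p[1]]
    return enc, dec

error_before = reconstruction_error([0.5, 0.5], [0.5, 0.5])
enc, dec = train()
error_after = reconstruction_error(enc, dec)
print(f"reconstruction error: {error_before:.3f} -> {error_after:.6f}")
```

The training target is the input itself, so no labels are needed: the net is rewarded only for reproducing each pattern from its compressed code.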

Autoencoders Better: Add noise! SOURCE: ARDEN DERTAT

A Deep Neural Network

Deep Neural Network SOURCE: AMAX.COM

Deep Image Recognition SOURCE: DATASKEPTIC

Application: Diagnosing Skin Cancer
Stanford researchers collected 130,000 images of skin lesions representing over 2,000 different diseases
Used the data as a training set on a deep neural network using only pixels and disease labels as inputs
Performs as well as expert dermatologists, better than non-experts
Projection: 6.3 billion smartphones by the year 2021
Can provide low-cost universal access to diagnostic procedures

Application: Diagnosing Skin Cancer SOURCE: EXTREMETECH.COM

SOURCE: EXTREMETECH.COM

Deep Learning Applications
Uber estimates arrival time by training a neural network on millions of previous trips
UberEATS estimates food preparation time to allow prediction of final delivery time
Recommendation engines: Amazon, Netflix (estimated value: 20B HKD)
Google Maps: analysis of 80 billion street view images to recognize house numbers and street signs
Facebook DeepFace facial recognition

Facebook DeepFace

Reinforcement Learning

Reinforcement Learning Example: Atari Breakout
Reward function is the score: the number of targets removed
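Learning from reward alone can be sketched with tabular Q-learning on a toy problem. This is not the deep network used for Atari games; the five-state corridor, the single reward at the right-hand end, and all parameter values are assumptions chosen to keep the example small.

```python
import random

N_STATES, GOAL = 5, 4  # states 0..4; reaching state 4 yields the only reward
ACTIONS = (-1, +1)     # index 0 = move left, index 1 = move right

def step(state, action):
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == GOAL else 0.0
    return next_state, reward

def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        while state != GOAL:
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)                       # explore (or break a tie)
            else:
                a = 0 if q[state][0] > q[state][1] else 1  # exploit the best estimate
            next_state, reward = step(state, ACTIONS[a])
            # Move the estimate toward reward + discounted best future value.
            target = reward + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q

q = q_learning()
policy = ["left" if left > right else "right" for left, right in q[:GOAL]]
print(policy)
```

The program is never shown a correct move. It only observes the reward, yet the learned values end up favouring the action that leads toward it.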

Google DeepMind Video (play to 2:03)

Reinforcement Learning in Go
In 2017, AlphaGo Master defeated the world Go champion, Ke Jie. He called it God.
A later version, AlphaGo Zero, can now beat AlphaGo Master.

AlphaGo Zero
No knowledge of Go except rules for legal moves
Reward function: number of stones remaining at end of game
AlphaGo Zero played a huge number of games against itself to maximize its reward
1.6 million games per day
Outputs were used to train a neural network
Hardware cost: 25 million USD

AlphaGo Zero Progress SOURCE: DEEPMIND.COM

AlphaGo Progress SOURCE: DEEPMIND.COM

SOURCE: NORMSHIELD.COM

Q & A