What is Machine Learning? Machine Learning Fall 2018

Similar documents
Python Machine Learning

CSL465/603 - Machine Learning

CS 446: Machine Learning

Lecture 1: Machine Learning Basics

Lecture 1: Basic Concepts of Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

(Sub)Gradient Descent

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

A survey of multi-view machine learning

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Active Learning. Yingyu Liang Computer Sciences 760 Fall

A Case Study: News Classification Based on Term Frequency

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Learning From the Past with Experiment Databases

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Rule Learning With Negation: Issues Regarding Effectiveness

Assignment 1: Predicting Amazon Review Ratings

CS Machine Learning

Time series prediction

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Probabilistic Latent Semantic Analysis

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Dinesh K. Sharma, Ph.D. Department of Management School of Business and Economics Fayetteville State University

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Reducing Features to Improve Bug Prediction

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Semi-Supervised Face Detection

Rule Learning with Negation: Issues Regarding Effectiveness

Universidade do Minho Escola de Engenharia

Knowledge-Based - Systems

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Artificial Neural Networks written examination

Learning Methods for Fuzzy Systems

Handling Concept Drifts Using Dynamic Selection of Classifiers

A Neural Network GUI Tested on Text-To-Phoneme Mapping

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Applications of data mining algorithms to analysis of medical data

Australian Journal of Basic and Applied Sciences

An OO Framework for building Intelligence and Learning properties in Software Agents

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Welcome to. ECML/PKDD 2004 Community meeting

The Boosting Approach to Machine Learning An Overview

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Reinforcement Learning by Comparing Immediate Reward

Improving Simple Bayes. Abstract. The simple Bayesian classier (SBC), sometimes called

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Axiom 2013 Team Description Paper

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Softprop: Softmax Neural Network Backpropagation Learning

Indian Institute of Technology, Kanpur

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

arxiv: v2 [cs.cv] 30 Mar 2017

Online Updating of Word Representations for Part-of-Speech Tagging

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

arxiv: v1 [cs.lg] 15 Jun 2015

Multi-label classification via multi-target regression on data streams

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

An investigation of imitation learning algorithms for structured prediction

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Learning Methods in Multilingual Speech Recognition

arxiv: v1 [cs.cv] 10 May 2017

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Switchboard Language Model Improvement with Conversational Data from Gigaword

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Multi-label Classification via Multi-target Regression on Data Streams

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Self Study Report Computer Science

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Artificial Neural Networks

Learning Distributed Linguistic Classes

Introduction to CS 100 Overview of UK. CS September 2015

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

SELF-STUDY QUESTIONNAIRE FOR REVIEW of the COMPUTER SCIENCE PROGRAM

Model Ensemble for Click Prediction in Bing Search Ads

A Comparison of Two Text Representations for Sentiment Analysis

Strategies for Solving Fraction Tasks and Their Link to Algebraic Thinking

Human Emotion Recognition From Speech

Study and Analysis of MYCIN expert system

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

An Introduction to Simio for Beginners

Frequently Asked Questions about Music Education at IU

CAFE Collaboration Aimed at Finding Experts

SARDNET: A Self-Organizing Feature Map for Sequences

Feature Selection based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification using Naïve Bayes

Multilingual Sentiment and Subjectivity Analysis

Preference Learning in Recommender Systems

Transcription:

What is Machine Learning? Machine Learning Fall 2018 1

Our goal today And through the semester What is (machine) learning? 2

Let s play a game 3

The badges game Attendees of the 1994 conference on Computational Learning Theory received conference badges labeled + or Only one person (Haym Hirsh) knew the function that generated the labels Depended only on the attendee s name The task for the attendees: Look at as many examples as you want in the conference and find the unknown function 4

Let s play Name Label Claire Cardie - Peter Bartlett + Eric Baum? Haym Hirsh? Shai Ben-David? Michael I. Jordan? 5

Let s play Name Label Claire Cardie - Peter Bartlett + Eric Baum? Haym Hirsh? Shai Ben-David? Michael I. Jordan? How were the labels generated? What is the label for my name? Yours? 6

Let s play Name Label Claire Cardie - Peter Bartlett + Eric Baum + Haym Hirsh + Shai Ben-David + Michael I. Jordan - How were the labels generated? What is the label for my name? Yours? (Full data on the class website, you can stare at it longer if you want) 7

What is machine learning? 8

Machine learning is everywhere! And you are probably already using it 9

Machine learning is everywhere! And you are probably already using it Is an email spam? Find all the people in this photo If I like these three movies, what should I watch next? Based on your purchase history, you might be interested in Will a stock price go up or down tomorrow? By how much? Handwriting recognition What are the best ads to place on this website? I would like to read that Dutch website in English Ok Google, Drive this car for me. And, fly this helicopter for me. Does this genetic marker correspond to Alzheimer s disease? 10

But what is learning? Let s try to define (machine) learning 11

What is machine learning? Field of study that gives computers the ability to learn without being explicitly programmed Arthur Samuel (1950s) From 1959! 12

Learning as generalization Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task (or tasks drawn from the same population) more effectively the next time. Herbert Simon (1983) Economist, psychologist, political scientist, computer scientist, sociologist, Nobel Prize (1978), Turing Award (1975) 13

Learning as generalization A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E. Tom Mitchell (1999) 14

Learning = generalization 15

Learning = generalization 16

Machine learning is the future Gives a system the ability to perform a task in a situation which has never been encountered before New way to think about programming Programs that can acquire new capabilities! Learning allows programs to interact more robustly with messy data Starting to make inroads into end-user facing applications already 17

How many people in this picture? 18

How many people in this picture? Three heads Three hands Four legs 19

How many people in this picture? Three heads Three hands Four legs And yet five people! 20

How many people in this picture? Three heads Three hands Four legs And yet five people! Classifiers are not used in isolation, but used in conjunction with each other And in the context of a larger application 21

Related fields All very active research areas! The artificial intelligence dream: Computers that are as intelligent as humans Machine learning closely tied to AI Theoretical CS and mathematics Formalizing and understanding learning mathematically Uses ideas from probability and statistics, linear algebra, theory of computation Philosophy, cognitive psychology, neuroscience, linguistics, robotics, Many, many application areas AI, medicine, engineering, other areas of CS like compilers, psychology, marketing 22

Overview of this course 23

The main question through the semester What is learning? Different formal answers to this problem will give us: Various families of learning algorithms Techniques for developing new learning algorithms 24

We will see 1. Different kinds of models 2. Different learning protocols 3. Learning algorithms 4. Computational learning theory 5. Representing data 25

We will see different models Or: functions that a learner learns Decision trees Linear classifiers, linear regressors Non-linear classifiers, kernels, neural networks Ensembles of classifiers 26

Different learning protocols Supervised learning A teacher supplies a collection of examples with labels The learner has to learn to label new examples using this data Unsupervised learning No teacher, learner has only unlabeled examples Data mining Semi-supervised learning Learner has access to both labeled and unlabeled examples Active learning Learner and teacher interact with each other Learner can ask questions Reinforcement learning Learner learns by interacting with the environment 27

Different learning protocols Supervised learning A teacher supplies a collection of examples with labels The learner has to learn to label new examples using this data Unsupervised learning No teacher, learner has only unlabeled examples Data mining Semi-supervised learning Learner has access to both labeled and unlabeled examples Who has seen supervised learning before? Active learning Learner and teacher interact with each other Learner can ask questions Reinforcement learning Learner learns by interacting with the environment 28

Learning algorithms Online algorithms: Learner can access only one labeled at a time Perceptron, Winnow Batch algorithms: Learner can access to the entire dataset Naïve Bayes Support vector machines, logistic regression, neural networks Decision trees and nearest neighbors Boosting Unsupervised/semi-supervised algorithms Expectation maximization K-Means 29

Learning algorithms Online algorithms: Learner can access only one labeled at a time Perceptron, Winnow Batch algorithms: Learner can access to the entire dataset Naïve Bayes Support vector machines, logistic regression, neural networks Decision trees and nearest neighbors Boosting Unsupervised/semi-supervised algorithms Expectation maximization K-Means Who has used any of these algorithms before 30

Representing data What is the best way to represent data for a particular task? The importance of the right features Dimensionality reduction (if time permits) 31

The theory of machine learning What does it mean to learn? Online learning Learner sees examples in a stream and stop making mistakes as we go along (or minimize regret in our decisions). Probably Approximately Correct (PAC) Learning After seeing a collection of examples, the learner will (with high probability) produce a function that makes small error. Bayesian learning Based on our observations, what is the probability distribution over possible functions that produced the data? 32

This course Focuses on the underlying concepts and algorithmic ideas in the field of machine learning This course is not about Using a specific machine learning tool Any single learning paradigm 33

What will you learn? 1. A broad theoretical and practical understanding of machine learning paradigms and algorithms 2. Ability to implement learning algorithms 3. Identify where machine learning can be applied and make the most appropriate decisions (about algorithms, models, supervision, etc) 34

How will you learn? or: Course information 35