INTRODUCTION TO MACHINE LEARNING

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Laboratorio di Intelligenza Artificiale e Robotica

CSL465/603 - Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Python Machine Learning

Welcome to. ECML/PKDD 2004 Community meeting

Intelligent Agents. Chapter 2. Chapter 2 1

Lecture 1: Basic Concepts of Machine Learning

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Reinforcement Learning by Comparing Immediate Reward

(Sub)Gradient Descent

LEGO MINDSTORMS Education EV3 Coding Activities

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Rule Learning With Negation: Issues Regarding Effectiveness

CS Machine Learning

A Case Study: News Classification Based on Term Frequency

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Axiom 2013 Team Description Paper

TD(λ) and Q-Learning Based Ludo Players

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

CS 446: Machine Learning

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

Merry-Go-Round. Science and Technology Grade 4: Understanding Structures and Mechanisms Pulleys and Gears. Language Grades 4-5: Oral Communication

Probability and Statistics Curriculum Pacing Guide

Linking Task: Identifying authors and book titles in verbose queries

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Fourth Grade. Reporting Student Progress. Libertyville School District 70. Fourth Grade

Lecture 1: Machine Learning Basics

TUESDAYS/THURSDAYS, NOV. 11, 2014-FEB. 12, 2015 x COURSE NUMBER 6520 (1)

Evolutive Neural Net Fuzzy Filtering: Basic Description

The Evolution of Random Phenomena

Computerized Adaptive Psychological Testing A Personalisation Perspective

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

The taming of the data:

Learning From the Past with Experiment Databases

Rule Learning with Negation: Issues Regarding Effectiveness

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Lecture 10: Reinforcement Learning

Human Emotion Recognition From Speech

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

AQUA: An Ontology-Driven Question Answering System

Probabilistic Latent Semantic Analysis

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Mathematics process categories

Designing a Computer to Play Nim: A Mini-Capstone Project in Digital Design I

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Artificial Neural Networks written examination

Firms and Markets Saturdays Summer I 2014

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

A Comparison of Standard and Interval Association Rules

Applications of memory-based natural language processing

Reducing Features to Improve Bug Prediction

12- A whirlwind tour of statistics

Learning Disability Functional Capacity Evaluation. Dear Doctor,

Spinners at the School Carnival (Unequal Sections)

Generative models and adversarial training

Contents. Foreword... 5

P-4: Differentiate your plans to fit your students

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Seminar - Organic Computing

Leader s Guide: Dream Big and Plan for Success

CLASS EXODUS. The alumni giving rate has dropped 50 percent over the last 20 years. How can you rethink your value to graduates?

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Rule-based Expert Systems

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Introduction to CS 100 Overview of UK. CS September 2015

Activities for School

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

A Reinforcement Learning Variant for Control Scheduling

Word Segmentation of Off-line Handwritten Documents

An investigation of imitation learning algorithms for structured prediction

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Assignment 1: Predicting Amazon Review Ratings

Evolution of Symbolisation in Chimpanzees and Neural Nets

MYCIN. The MYCIN Task

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

When Student Confidence Clicks

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Speeding Up Reinforcement Learning with Behavior Transfer

LEARNER VARIABILITY AND UNIVERSAL DESIGN FOR LEARNING

Mining Student Evolution Using Associative Classification and Clustering

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Hentai High School A Game Guide

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language

Top US Tech Talent for the Top China Tech Company

CS177 Python Programming

Common Core Exemplar for English Language Arts and Social Studies: GRADE 1

Learning Methods for Fuzzy Systems

Transcription:

https://xkcd.com/894/ INTRODUCTION TO MACHINE LEARNING David Kauchak CS 158 Fall 2016 Why are you here? Machine Learning is What is Machine Learning? Machine learning is a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. Why are you taking this course? What topics would you like to see covered? 1

Machine Learning is Machine Learning is Machine learning is programming computers to optimize a performance criterion using example data or past experience. -- Ethem Alpaydin The goal of machine learning is to develop methods that can automatically detect patterns in data, and then to use the uncovered patterns to predict future data or other outcomes of interest. -- Kevin P. Murphy Machine learning is about predicting the future based on the past. -- Hal Daume III The field of pattern recognition is concerned with the automatic discovery of regularities in data through the use of computer algorithms and with the use of these regularities to take actions. -- Christopher M. Bishop Machine Learning is Machine learning is about predicting the future based on the past. -- Hal Daume III Machine Learning, aka data mining: data analysis, not prediction, though often involves some shared techniques inference and/or estimation in statistics past Training learn model/ predictor future Testing model/ predictor predict pattern recognition in engineering signal processing in electrical engineering induction optimization 2

Goals of the course: learn about Goals of the course Different machine learning problems Common techniques/tools used! theoretical understanding! practical implementation Proper experimentation and evaluation Dealing with large (huge) data sets! Parallelization frameworks! Programming tools Be able to laugh at these signs (or at least know why one might ) Administrative Course page:! http://www.cs.pomona.edu/~dkauchak/classes/cs158/ Assignments! Weekly! Mostly programming (Java, mostly)! Some written/write-up! Generally due Sunday evenings Course expectations Plan to stay busy! Applied class, so lots of programming Machine learning involves math Two midterm exams and one final Late Policy Collaboration 3

Other things to note Videos before class Lots of class participation! Machine learning problems What high-level machine learning problems have you seen or heard of before? Read the book (it s good) 4

Supervised learning Supervised learning 1 3 ed 1 3 model/ predictor 4 4 5 5 Supervised learning: given ed Supervised learning: given ed 5

Supervised learning Supervised learning: classification model/ predictor predicted Classification: a finite set of s Supervised learning: learn to predict new example Supervised learning: given ed Classification Example Classification Applications Face recognition Differentiate between low-risk and high-risk customers from their income and savings Character recognition Spam detection Medical diagnosis: From symptoms to illnesses Biometrics: Recognition/authentication using physical and/ or behavioral characteristics: Face, iris, signature, etc... 6

Supervised learning: regression Regression Example -4.5 Price of a used car 10.1 3.2 Regression: is real-valued x : car attributes (e.g. mileage) y : price y = wx+w 0 4.3 Supervised learning: given ed 26 Regression Applications Supervised learning: ranking Economics/Finance: predict the value of a stock Epidemiology Car/plane navigation: angle of the steering wheel, acceleration, Temporal trends: weather over time 1 4 2 3 Ranking: is a ranking Supervised learning: given ed 7

Ranking example Given a query and a set of web pages, rank them according to relevance Ranking Applications User preference, e.g. Netflix My List -- movie queue ranking itunes flight search (search in general) reranking N-best output lists Unsupervised learning Unsupervised learning applications learn clusters/groups without any customer segmentation (i.e. grouping) image compression bioinformatics: learn motifs Unupervised learning: given data, i.e., but no s 8

Reinforcement learning Reinforcement learning example left, right, straight, left, left, left, straight GOOD Backgammon left, straight, straight, left, right, straight, straight BAD WIN! left, right, straight, left, left, left, straight left, straight, straight, left, right, straight, straight 18.5 Given a sequence of /states and a reward after completing that sequence, learn to predict the action to take in for an individual example/state -3 LOSE! Given sequences of moves and whether or not the player won at the end, learn to make good moves Reinforcement learning example Other learning variations What data is available:! Supervised, unsupervised, reinforcement learning! semi-supervised, active learning, How are we getting the data:! online vs. offline learning http://www.youtube.com/watch?v=vcdxqn0fcne Type of model:! generative vs. discriminative! parametric vs. non-parametric 9

Representing Features features What is an example? How is it represented? f 1, f 2, f 3,, f n f 1, f 2, f 3,, f n f 1, f 2, f 3,, f n How our algorithms actually view the data Features are the questions we can ask about the f 1, f 2, f 3,, f n Features Classification revisited features red, round, leaf, 3oz, green, round, no leaf, 4oz, yellow, curved, no leaf, 8oz, How our algorithms actually view the data Features are the questions we can ask about the red, round, leaf, 3oz, green, round, no leaf, 4oz, yellow, curved, no leaf, 8oz, green, curved, no leaf, 7oz, learn model/ classifier green, curved, no leaf, 7oz, During learning/training/induction, learn a model of what distinguishes s and s based on the features 10

Classification revisited Classification revisited red, round, no leaf, 4oz, model/ classifier predict Apple or? red, round, no leaf, 4oz, model/ classifier predict Apple Why? The model can then classify a new example based on the features The model can then classify a new example based on the features Classification revisited Classification revisited red, round, leaf, 3oz, red, round, leaf, 3oz, green, round, no leaf, 4oz, red, round, no leaf, 4oz,? green, round, no leaf, 4oz, red, round, no leaf, 4oz,? yellow, curved, no leaf, 4oz, green, curved, no leaf, 5oz, yellow, curved, no leaf, 4oz, green, curved, no leaf, 5oz, Learning is about generalizing from the training data What does this assume about the training and test set? 11

Past predicts future Past predicts future Not always the case, but we ll often assume it is! Past predicts future More technically We are going to use the probabilistic model of learning There is some probability distribution over example/ pairs called the data generating distribution Both the training data and the test set are generated based on this distribution What is a probability distribution? Not always the case, but we ll often assume it is! 12

Probability distribution Describes how likely (i.e. probable) certain events are Probability distribution High probability round s curved s s with leaves Low probability curved s red s yellow s data generating distribution data generating distribution data generating distribution data generating distribution 13

data generating distribution data generating distribution 14