INTRODUCTION TO MACHINE LEARNING. Machine Learning: What s The Challenge?

Similar documents
Python Machine Learning

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

CS Machine Learning

Lecture 1: Machine Learning Basics

(Sub)Gradient Descent

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

J j W w. Write. Name. Max Takes the Train. Handwriting Letters Jj, Ww: Words with j, w 321

Algebra 2- Semester 2 Review

Using Web Searches on Important Words to Create Background Sets for LSI Classification

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Probability and Statistics Curriculum Pacing Guide

Lecture 1: Basic Concepts of Machine Learning

CSL465/603 - Machine Learning

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Spinners at the School Carnival (Unequal Sections)

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Generative models and adversarial training

Assignment 1: Predicting Amazon Review Ratings

Laboratorio di Intelligenza Artificiale e Robotica

Functional Maths Skills Check E3/L x

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

12- A whirlwind tour of statistics

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Word learning as Bayesian inference

STA 225: Introductory Statistics (CT)

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Using focal point learning to improve human machine tacit coordination

The Evolution of Random Phenomena

AQUA: An Ontology-Driven Question Answering System

Ohio s Learning Standards-Clear Learning Targets

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

End-of-Module Assessment Task

Word Segmentation of Off-line Handwritten Documents

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Unit: Human Impact Differentiated (Tiered) Task How Does Human Activity Impact Soil Erosion?

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Mining Student Evolution Using Associative Classification and Clustering

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Rule Learning With Negation: Issues Regarding Effectiveness

AP Statistics Summer Assignment 17-18

The following shows how place value and money are related. ones tenths hundredths thousandths

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

Online Updating of Word Representations for Part-of-Speech Tagging

University of Alberta. Large-Scale Semi-Supervised Learning for Natural Language Processing. Shane Bergsma

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Medical Complexity: A Pragmatic Theory

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Answer Key For The California Mathematics Standards Grade 1

arxiv: v2 [cs.cv] 30 Mar 2017

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

Hardhatting in a Geo-World

Evaluation of Teach For America:

Australian Journal of Basic and Applied Sciences

MADERA SCIENCE FAIR 2013 Grades 4 th 6 th Project due date: Tuesday, April 9, 8:15 am Parent Night: Tuesday, April 16, 6:00 8:00 pm

Extending Place Value with Whole Numbers to 1,000,000

ACTIVITY: Comparing Combination Locks

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Function Tables With The Magic Function Machine

CS 446: Machine Learning

1. READING ENGAGEMENT 2. ORAL READING FLUENCY

Laboratorio di Intelligenza Artificiale e Robotica

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Introduction to Forensics: Preventing Fires in the First Place. A Distance Learning Program Presented by the FASNY Museum of Firefighting

Build on students informal understanding of sharing and proportionality to develop initial fraction concepts.

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Work Stations 101: Grades K-5 NCTM Regional Conference &

Applications of data mining algorithms to analysis of medical data

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Unit 3: Lesson 1 Decimals as Equal Divisions

Teaching a Laboratory Section

A survey of multi-view machine learning

Characteristics of the Text Genre Informational Text Text Structure

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Dear Teacher: Welcome to Reading Rods! Reading Rods offer many outstanding features! Read on to discover how to put Reading Rods to work today!

THE UNIVERSITY OF TEXAS RIO GRANDE VALLEY GRAPHIC IDENTITY GUIDELINES

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

Physical Features of Humans

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Rule Learning with Negation: Issues Regarding Effectiveness

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Name: Class: Date: ID: A

Guide to Teaching Computer Science

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Foundations of Knowledge Representation in Cyc

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Linking the Ohio State Assessments to NWEA MAP Growth Tests *

Probabilistic Latent Semantic Analysis

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

Individual Differences & Item Effects: How to test them, & how to test them well

Experience College- and Career-Ready Assessment User Guide

Student s Edition. Grade 6 Unit 6. Statistics. Eureka Math. Eureka Math

Transcription:

INTRODUCTION TO MACHINE LEARNING Machine Learning: What s The Challenge?

Goals of the course Identify a machine learning problem Use basic machine learning techniques Think about your data/results

What is Machine Learning? Construct/use algorithms that learn from data More information Higher performance Previous solutions Experience

Example Label squares: size and edge color Earlier observations (labeled by humans): Task for computer = label unseen square:? Result: right or wrong!

Input Knowledge Features Label In example: pre-labeled squares size edge color small dotted green Observations big striped yellow In R - use data.frame() medium normal green > squares <- data.frame( size = c("small", "big", "medium"), edge = c("dotted", "striped", "normal"), color = c("green", "yellow", "green"))

Data Frame Functions > dim(squares) #Observations, #Features > str(squares) Structured Overview > summary(squares) Distribution Measures

Formulation INPUT FUNCTION OUTPUT ESTIMATED FUNCTION COLOR

ML: What It Is Not Determining most occurring color Calculating average size } NOT Machine Learning Goal: Building models for prediction!

Regression Regression INPUT: Weight OUTPUT: Height Estimated function: Weight Height

More Applications! Shopping basket analysis Movie recommendation systems Decision making for self-driving cars and many more!

INTRODUCTION TO MACHINE LEARNING Let s practice!

INTRODUCTION TO MACHINE LEARNING Classification Regression Clustering

Common ML Problems Classification Regression Clustering

Classification Problem Goal: predict category of new observation Estimate Earlier Observations CLASSIFIER CLASSIFIER Unseen Data Class

Classification Applications Medical Diagnosis Sick and Not Sick Animal Recognition Dog, Cat and Horse Important: Qualitative Output Predefined Classes

Regression PREDICTORS REGRESSION FUNCTION RESPONSE Relationship: Height - Weight? Linear? Predict: Weight Height

Regression Model Fitting a linear function Predictor: Response: Coefficients: Estimate on previous input-output > lm(response ~ predictor)

Regression Applications Payments Credit Scores Time Subscriptions Grades Landing a Job Quantitative Output Previous input-output observations

Clustering Clustering: grouping objects in clusters Similar within cluster Dissimilar between clusters Example: Grouping similar animal photos No labels No right or wrong Plenty possible clusterings

k-means Cluster data in k clusters! y 5 0 5 y 5 0 5 0 5 10 x 0 5 10 x

INTRODUCTION TO MACHINE LEARNING Let s Practice

INTRODUCTION TO MACHINE LEARNING Supervised vs. Unsupervised

Machine Learning Tasks Classification Regression quite similar Clustering

Supervised Learning Find: function f which can be used to assign a class or value to unseen observations. Given: a set of labeled observations Supervised Learning

Unsupervised Learning Labeling can be tedious, often done by humans Some techniques don t require labeled data Unsupervised Learning Clustering: find groups observation that are similar Does not require labeled observations

Performance of the model Supervised Learning Compare real labels with predicted labels Predictions should be similar to real labels Unsupervised Learning No real labels to compare Techniques will be explained in this course

Semi-Supervised Learning A lot of unlabeled observations A few labeled Group similar observations using clustering Use clustering information and classes of labeled observations to assign a class to unlabelled observations More labeled observations for supervised learning

INTRODUCTION TO MACHINE LEARNING Let s practice!