Lecture 1: What is Machine Learning? STAT161/261 Introduction to Pattern Recognition and Machine Learning Spring 2018 Prof.

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

CSL465/603 - Machine Learning

Lecture 1: Basic Concepts of Machine Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Laboratorio di Intelligenza Artificiale e Robotica

Python Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Lecture 1: Machine Learning Basics

Probabilistic Latent Semantic Analysis

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

(Sub)Gradient Descent

Word Segmentation of Off-line Handwritten Documents

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Rule Learning With Negation: Issues Regarding Effectiveness

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Human Emotion Recognition From Speech

Welcome to. ECML/PKDD 2004 Community meeting

CS Machine Learning

Top US Tech Talent for the Top China Tech Company

Axiom 2013 Team Description Paper

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Rule Learning with Negation: Issues Regarding Effectiveness

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Learning Methods for Fuzzy Systems

Semi-Supervised Face Detection

Artificial Neural Networks written examination

Assignment 1: Predicting Amazon Review Ratings

Reducing Features to Improve Bug Prediction

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Time series prediction

Applications of data mining algorithms to analysis of medical data

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Mining Association Rules in Student s Assessment Data

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Evolutive Neural Net Fuzzy Filtering: Basic Description

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Lecture 10: Reinforcement Learning

Seminar - Organic Computing

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Computerized Adaptive Psychological Testing A Personalisation Perspective

TD(λ) and Q-Learning Based Ludo Players

A study of speaker adaptation for DNN-based speech synthesis

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Australian Journal of Basic and Applied Sciences

WHEN THERE IS A mismatch between the acoustic

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Lecture 6: Applications

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Reinforcement Learning by Comparing Immediate Reward

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Forget catastrophic forgetting: AI that learns after deployment

Mining Student Evolution Using Associative Classification and Clustering

Linking Task: Identifying authors and book titles in verbose queries

Probability and Statistics Curriculum Pacing Guide

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

A Case Study: News Classification Based on Term Frequency

CS 446: Machine Learning

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

arxiv: v1 [cs.lg] 15 Jun 2015

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Speech Recognition at ICSI: Broadcast News and beyond

Machine Learning and Development Policy

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Ericsson Wallet Platform (EWP) 3.0 Training Programs. Catalog of Course Descriptions

Learning to Schedule Straight-Line Code

Using dialogue context to improve parsing performance in dialogue systems

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Ryerson University Sociology SOC 483: Advanced Research and Statistics

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

A student diagnosing and evaluation system for laboratory-based academic exercises

Speech Emotion Recognition Using Support Vector Machine

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Geospatial Visual Analytics Tutorial. Gennady Andrienko & Natalia Andrienko

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Applications of memory-based natural language processing

Handling Concept Drifts Using Dynamic Selection of Classifiers

THE enormous growth of unstructured data, including

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics

MYCIN. The MYCIN Task

Knowledge-Based - Systems

An Effective Framework for Fast Expert Mining in Collaboration Networks: A Group-Oriented and Cost-Based Method

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

Transcription:

Lecture 1: What is Machine Learning? STAT161/261 Introduction to Pattern Recognition and Machine Learning Spring 2018 Prof. Allie Fletcher

Lecture 1 Outline Course information and details What and why machine learning? Supervised Learning Examples Classification Regression Unsupervised Learning Reinforcement Learning Why now?

Course Info (see web; most significant bits here) We will be using CCLE, after enrollment settles down Instructor: Allie Fletcher Required Books: Introduction to Machine Learning by Ethem Alpaydin and Pattern Recognition and Machine Learning by Christopher Bishop The majority of what is important will be covered in lectured. However, you will be required to know readings, website handouts, and lecture--not just lecture Lecture notes may be slides and handwritten--union of both important :)

What is Machine Learning? Learn to improve algorithms from data Optimize a performance criterion using example data or past experience Role of Statistics: Inference from a sample Role of Computer science: Efficient algorithms to Solve the optimization problem Representing and evaluating the model for inference

Why "Learn? Machine learning is programming computers to optimize a performance criterion using example data or past experience There is no need to learn to calculate payroll Learning is used when: Human expertise does not exist (navigating on Mars) Humans are unable to explain their expertise (speech recognition) Solution changes in time (routing on a computer network) Solution needs to be adapted to particular cases (user biometrics)

What We Talk About When We Talk About Learning Learning general models from a data of particular examples Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce Example in retail: Customer transactions to consumer behavior: People who bought Blink also bought Outliers (www.amazon.com) Build a model that is a good and useful approximation to the data

Lecture 1 Outline Course information and details What is machine learning? Why do machine learning? Supervised Learning Examples Classification Regression Unsupervised Learning Reinforcement Learning Why now?

Example 1: Digit Recognition Recognize a digit from the image Learn a function ff xx {0,1,, 9}, xx is a 28 x 28 matrix Expert systems do not work well: You can recognize the digits, but difficult to program a function ff xx that works well Try it!

Supervised Learning on Handwritten Digits Supervised: Start with training data, labelled data Ex: 6000 examples of each digit Learn for example classifier ff(xx) that matches label well on training data Given new data xx use function to guess digit Current systems get <0.21% errors (as of 1/20/2018) http://rodrigob.github.io/are_we_there_yet/build/classification_dat asets_results.html#4d4e495354 First commercial application: Used by USPS for recognizing zip codes on letters Training examples Each sample must be labeled by hand who knows truth

Example 2: Credit Score and Classification Example: Credit score Determine/classify if customer is high-risk or low-risk Select some features: Example: income & savings Represent as a vector xx = (xx 1, xx 2 ) Learn a function from features to target Use past training data Need to get this data The function on the right is an example of a decision tree. If savings are above a line, and then if income is above a line, then the candidate is low-risk.

Example 3: Spam Detection Classification problem: Is email junk or not junk? For ML, must represent email numerically Common model: bag of words Enumerate all words, ii = 1,, NN Represent email via word count xx ii = num instances of word ii Challenge: Very high-dimensional vector System must continue to adapt (keep up with spammers)

Example 4: Face Detection Also a supervised learning problem For each image region, determine if Face or non-face

Training Data Typical early face recognition datasets: 5000 faces All near frontal Vary age, race, gender, lighting 10^8 non faces Faces are normalized (scale, translation) functions that work well may be very complex Many more datasets are available now: See http://www.face-rec.org/databases/ You can use this for your project! Rowley, Baluja and Kanade, 1998

Example 5: Stock Price Prediction Can you predict the price of a stock? What variables would you use? What is a non-machine learning approach?

Supervised Learning in General Prediction of future cases: Use the rule to predict the output for future inputs Knowledge extraction: The rule is easy to understand Compression:The rule is simpler than the data it explains Outlier detection: Exceptions that are not covered by the rule, e.g., fraud

Classification and SL: Many Applications Aka Pattern recognition Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair style Character recognition: Different handwriting styles. Speech recognition: Temporal dependency. Medical diagnosis: From symptoms to illnesses Biometrics: Recognition/authentication using physical and/or behavioral characteristics: Face, iris, signature, etc...

Target variable yy is continuous-valued Example: Predict yy = price of car From xx = mileage, size, horsepower,.. Can use multiple predictors Assume some form of the mapping Ex. Linear: yy = ββ 0 + ββ 1 xx Find parameters ββ 0, ββ 1 from data Note: predictors need not be cnts Regression

Regression Example Predict blood glucose level Many possible predictors: Recent past levels Insulin dose Time of last meal Check out data in: https://archive.ics.uci.edu/ml/datasets/d iabetes

Lecture 1 Outline Course information and details What is machine learning? Why do machine learning? Supervised Learning Examples Classification Regression Unsupervised Learning Reinforcement Learning Why now?

Unsupervised Learning Learning what normally happens No output Clustering: Grouping similar instances Example applications Customer segmentation Image compression: Color quantization Bioinformatics: Learning motifs Example: Document classification http://www.ibm.com/support/knowledgecenter /SSBRAM_8.7.0/com.ibm.classify.ccenter.doc/ c_wbg_taxonomy_proposer.htm

Reinforcement Learning Learning a policy: A sequence of outputs No supervised output but delayed reward Credit assignment problem Game playing Robot in a maze Multiple agents, partial observability,...

What ML is Doing Today? Autonomous driving Jeopardy Very difficult games: Alpha Go Machine translation Many, many others

Why Now? Machine learning is an old field Much of the pioneering statistical work dates to the 1950s So what is new now? Big Data: Massive storage. Large data centers Massive connectivity Sources of data from Internet and elsewhere Computational advances Distributed machines, clusters GPUs and hardware Google Tensor Processing Unit (TPU) 23

Resources: Journals Journal of Machine Learning Research www.jmlr.org Machine Learning Neural Computation Neural Networks IEEE Trans on Neural Networks and Learning Systems IEEE Trans on Pattern Analysis and Machine Intelligence Journals on Statistics/Data Mining/Signal Processing/Natural Language Processing/Bioinformatics/... 24

Resources: Conferences International Conference on Machine Learning (ICML) European Conference on Machine Learning (ECML) Neural Information Processing Systems (NIPS) Uncertainty in Artificial Intelligence (UAI) Computational Learning Theory (COLT) International Conference on Artificial Neural Networks (ICANN) International Conference on AI & Statistics (AISTATS) Knowledge Discovery and Data Mining (KDD) International Conference on ComputerVision and Pattern Recognition (CVPR) International Conference on ComputerVision (ICCV) European Conference on ComputerVision (ECCV) 25

Machine Learning in Almost All Fields Retail: Market basket analysis, Customer relationship management (CRM) Finance: Credit scoring, fraud detection Manufacturing: Control, robotics, troubleshooting Medicine: Medical diagnosis Telecommunications: Spam filters, intrusion detection Bioinformatics: Motifs, alignment Web mining: Search engines...

Objectives Provide examples of machine learning used today Given a new problem, qualitatively describe how machine learning can be used Formulate a potential machine learning task Identify the data needed for the task Identify objectives Classify a machine learning task: Supervised vs. unsupervised, regression vs. classification For supervised learning, identify the predictors and target variables Determine the role of expert knowledge in the task vs. data-driven learning