CS540 Machine learning Lecture 1 Introduction

Similar documents
(Sub)Gradient Descent

CS Machine Learning

Python Machine Learning

Lecture 1: Machine Learning Basics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

CSL465/603 - Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

A Case Study: News Classification Based on Term Frequency

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Probabilistic Latent Semantic Analysis

AP Statistics Summer Assignment 17-18

Generative models and adversarial training

School of Innovative Technologies and Engineering

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Reinforcement Learning by Comparing Immediate Reward

Laboratorio di Intelligenza Artificiale e Robotica

Radius STEM Readiness TM

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

EGRHS Course Fair. Science & Math AP & IB Courses

Evolutive Neural Net Fuzzy Filtering: Basic Description

EDIT 576 (2 credits) Mobile Learning and Applications Fall Semester 2015 August 31 October 18, 2015 Fully Online Course

Course Development Using OCW Resources: Applying the Inverted Classroom Model in an Electrical Engineering Course

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

ME 4495 Computational Heat Transfer and Fluid Flow M,W 4:00 5:15 (Eng 177)

EDIT 576 DL1 (2 credits) Mobile Learning and Applications Fall Semester 2014 August 25 October 12, 2014 Fully Online Course

Artificial Neural Networks written examination

Lecture 1: Basic Concepts of Machine Learning

Interactive Whiteboard

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

College Pricing. Ben Johnson. April 30, Abstract. Colleges in the United States price discriminate based on student characteristics

Self Study Report Computer Science

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Comparison of network inference packages and methods for multiple networks inference

B.S/M.A in Mathematics

WHEN THERE IS A mismatch between the acoustic

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

LEGO MINDSTORMS Education EV3 Coding Activities

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Business 712 Managerial Negotiations Fall 2011 Course Outline. Human Resources and Management Area DeGroote School of Business McMaster University

Statistics and Data Analytics Minor

MTH 141 Calculus 1 Syllabus Spring 2017

Axiom 2013 Team Description Paper

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

ECE-492 SENIOR ADVANCED DESIGN PROJECT

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Unit 3: Lesson 1 Decimals as Equal Divisions

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

arxiv: v2 [cs.cv] 30 Mar 2017

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Lecture Videos to Supplement Electromagnetic Classes at Cal Poly San Luis Obispo

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

MATH 108 Intermediate Algebra (online) 4 Credits Fall 2008

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

MGT/MGP/MGB 261: Investment Analysis

Learning Methods for Fuzzy Systems

Data Structures and Algorithms

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Notetaking Directions

Probability and Statistics Curriculum Pacing Guide

Frequently Asked Questions about Music Education at IU

Please read this entire syllabus, keep it as reference and is subject to change by the instructor.

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

Diploma in Library and Information Science (Part-Time) - SH220

Ansys Tutorial Random Vibration

ECO 3101: Intermediate Microeconomics

HOLMER GREEN SENIOR SCHOOL CURRICULUM INFORMATION

IS FINANCIAL LITERACY IMPROVED BY PARTICIPATING IN A STOCK MARKET GAME?

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Course Content Concepts

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

Integrating simulation into the engineering curriculum: a case study

INTERMEDIATE ALGEBRA PRODUCT GUIDE

Navigating the PhD Options in CMS

4. Long title: Emerging Technologies for Gaming, Animation, and Simulation

BADM 641 (sec. 7D1) (on-line) Decision Analysis August 16 October 6, 2017 CRN: 83777

Students Understanding of Graphical Vector Addition in One and Two Dimensions

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

Transcription:

CS540 Machine learning Lecture 1 Introduction

Administrivia Overview Supervised learning Unsupervised learning Other kinds of learning Outline

Administrivia Class web page www.cs.ubc.ca/~murphyk/teaching/cs540-fall08 Join groups.google.com/group/cs540-fall08 Office hours: Fri 10.30-11.30am Midterm: Tue Oct 14 Final project due Fri Dec 5 th weekly homeworks Grading Midterm (open-book):30% Final project: 50% Weekly Assignments: 20%

Homeworks Weekly homeworks, out on Tue, due back on Tue Collaboration policy: You can collaborate on homeworks if you write the name of your collaborators on what you hand in; however, you must understand everything you write, and be able to do it on your own (eg. in the exam!) Sickness policy: If you cannot do an assignment or an exam, you must come see me in person; a doctor's note (or equivalent) will be required.

Workload This class will be quite time consuming. Attending lectures: 3h. Weekly homeworks: about 6h. Weekly reading: about 6h. Total: 15h/week. If this is too time consuming, and/or you don t have the pre-reqs, why not take CS340, the ugrad ML class, this Fall? (Can still get grad credit!)

You should know Pre-requisites Basic multivariate calculus e.g., Basic linear algebra e.g., Basic probability/ statistics e.g. Basic data structures and algorithms (e.g., trees, lists, sorting, dynamic programming, etc)

Textbook Machine learning: a probabilistic approach Draft copies available from Copiesmart in the UBC Village (next to Macdonald s) for about $35 pdf online for color pictures/ easy searching please do not distribute by email! See whiteboard for secret password Extra credit (up to 5% of your grade) for finding errors (5 points) or typos (1 point) consult list of typos on book webpage before sending me your list (one email per chapter). Please bring your book to every class.

Other good books If you want a book that is already debugged, see one of these

Matlab Matlab is a mathematical scripting language widely used for machine learning (and engineering and numerical computation in general). Everyone should have access to Matlab via their CS account. If not, ask for a CS guest account. You can buy a student version for $170 from the UBC bookstore. Please make sure it has the Stats toolbox. Matt Dunham has written an excellent Matlab tutorial which is on the class web site please study it carefully!

BLT Bayesian Learning Toolkit (BLT) is a Matlab package I am currently developing to go along with my book. It uses the latest object oriented features of Matlab 2008a and will not run on older versions.

Learning objectives By the end of this class, you should be able to Understand basic principles and techniques of machine learning and its connection to other fields Create suitable statistical models for any given problem Derive the algorithm (equations etc) needed to learn and apply the model Implement the algorithm in reasonably efficient Matlab Demonstrate your skills by doing a reasonably challenging project

Ask questions early and often!

Administrivia Overview Supervised learning Unsupervised learning Other kinds of learning Outline

What is machine learning? Electrical engineering CS Statistics ML Psychology Philosophy Neuroscience

What is machine learning? ``Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task or tasks drawn from the same population more efficiently and more effectively the next time.'' -- Herbert Simon Closely related to Statistics (fitting models to data and testing them) Data mining/ exploratory data analysis (discovering patterns in data) Adaptive control theory (learning models online and using them to achieve goals) AI (building intelligent machines by hand)

Types of machine learning Supervised Learning Predict output from input Unsupervised Learning Find patterns in data Reinforcement Learning Learn how to behave in novel environments (eg robot navigation) not covered in this class see e.g., CS422

Why Learn? Machine learning is programming computers to optimize a performance criterion using example data or past experience. There is no need to learn to calculate payroll Learning is used when: Humans not in loop (navigating on Mars) Humans are unable to explain their expertise (speech recognition) Solution changes in time (routing on a computer network) Solution needs to be adapted to particular cases (user biometrics)

Administrivia Overview Supervised learning Unsupervised learning Other kinds of learning Outline

Supervised learning Learning a mapping f from input x to output y: If y 2 {1,,C}, this is called classification If y 2 R, this is called regression

Binary classification Training data Testing data X y

Classifying gene microarray data

Handwritten digit recognition x 2 R 16 16, y 2 {0,...,9}

Face Recognition Training examples of a person Possibly no negative examples Test images

http://demo.pittpatt.com Face detection

Jordan Reynolds, UBC, 2004 Car detection

Probabilistic output Training data Testing data P=0 P=0.5 P=0.5? X y

Structured output classification Predict multiple output labels, which may be correlated Here we use a conditional random field (CRF)

Regression Line denotes posterior mode arg max y p(y x) Error bars denote 95% credible interval

Regression Interaction term

Regression for control http://www-clmc.usc.edu/research/humanoidrobotics

Administrivia Overview Supervised learning Unsupervised learning Other kinds of learning Outline

Clustering K-means after 2 iterations

Clustering genes 310x7

PCA Principal components analysis

Learning graph structures Protein phosphorylation data DAG model See Stat521A Spring 2009

Assessing unsupervised learning 2 clusters or 3?

Assessing unsupervised learning 2 dimensions or more? Linear subspace or something else?

Density estimation Can formalize unsupervised learning as learning a model of p(y) instead of p(y x) Model should assign high probability to future data If we generate from the model, it should look like the observed data If we have too many clusters, it will overfit (see next lecture) If we have too few clusters, it will underfit (see next lecture) Choosing K is an example of model selection

Data compression In the information theory chapter, we show that finding a good data compression scheme relies on building an accurate probabilistic model of the data. Frequent data vectors get assigned short codewords (fewer bits required). Infrequent data vectors can be given long codewords. See Mackay s book

Vector quantization Replace each x i 2 R 2 with a codeword z i in {1,..,K} This is an index into the codebook m 1, m 2,, m K in R 2

K-means minimizes the distortion Original K=2 K=4

K-means minimizes the distortion Original K=2 K=4

K-means minimizes the distortion Original K=2 K=4

Administrivia Overview Supervised learning Unsupervised learning Other kinds of learning Outline

www.netflixprize.com $1M USD Collaborative filtering

Semi-supervised learning 2 labeled, 1000s unlabeled Propagate y labels to similar x s

Reinforcement learning Search over actions to maximize expected utility: - Predict effects of actions using probabilistic model - Use utility theory to decide which outcome is best -RL tries to learn a controller that simulates the above behavior See CS322 and CS502