CS340 Machine learning Lecture 1 Introduction

Similar documents
CSL465/603 - Machine Learning

(Sub)Gradient Descent

Lecture 1: Machine Learning Basics

Lecture 1: Basic Concepts of Machine Learning

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

CS Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Cal s Dinner Card Deals

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

School of Innovative Technologies and Engineering

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

MATH 108 Intermediate Algebra (online) 4 Credits Fall 2008

Python Machine Learning

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Laboratorio di Intelligenza Artificiale e Robotica

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

CS Course Missive

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CS 446: Machine Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Syllabus ENGR 190 Introductory Calculus (QR)

Math 22. Fall 2016 TROUT

Mathematics process categories

EDIT 576 (2 credits) Mobile Learning and Applications Fall Semester 2015 August 31 October 18, 2015 Fully Online Course

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

Syllabus Foundations of Finance Summer 2014 FINC-UB

Navigating the PhD Options in CMS

Axiom 2013 Team Description Paper

Human Emotion Recognition From Speech

Introduction to CS 100 Overview of UK. CS September 2015

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology

B.S/M.A in Mathematics

Assignment 1: Predicting Amazon Review Ratings

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

Rule Learning With Negation: Issues Regarding Effectiveness

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

LEGO MINDSTORMS Education EV3 Coding Activities

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Business 712 Managerial Negotiations Fall 2011 Course Outline. Human Resources and Management Area DeGroote School of Business McMaster University

B. How to write a research paper

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Spring 2015 Natural Science I: Quarks to Cosmos CORE-UA 209. SYLLABUS and COURSE INFORMATION.

Self Study Report Computer Science

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

CS 100: Principles of Computing

ECON 442: Economic Development Course Syllabus Second Semester 2009/2010

Mathematics. Mathematics

ANCIENT GREEK HISTORY MWF 8:30-9:20 Main 326. Frances B. Titchener Main 310 (435)

A study of speaker adaptation for DNN-based speech synthesis

Frequently Asked Questions about Music Education at IU

INTERMEDIATE ALGEBRA Course Syllabus


INPE São José dos Campos

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Characteristics of Functions

*In Ancient Greek: *In English: micro = small macro = large economia = management of the household or family

CIS 2 Computers and the Internet in Society -

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

MYCIN. The MYCIN Task

Probabilistic Latent Semantic Analysis

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

EDIT 576 DL1 (2 credits) Mobile Learning and Applications Fall Semester 2014 August 25 October 12, 2014 Fully Online Course

Course Content Concepts

MTH 141 Calculus 1 Syllabus Spring 2017

Top US Tech Talent for the Top China Tech Company

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

Interactive Whiteboard

Data Structures and Algorithms

A Case Study: News Classification Based on Term Frequency

AQUA: An Ontology-Driven Question Answering System

CS/SE 3341 Spring 2012

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Control Tutorials for MATLAB and Simulink

CS 3516: Computer Networks

Linking Task: Identifying authors and book titles in verbose queries

Reinforcement Learning by Comparing Immediate Reward

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Bluetooth mlearning Applications for the Classroom of the Future

Instructor: Matthew Wickes Kilgore Office: ES 310

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

MTH 215: Introduction to Linear Algebra

Evaluation of a College Freshman Diversity Research Program

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Introduction, Organization Overview of NLP, Main Issues

Rule Learning with Negation: Issues Regarding Effectiveness

Course Development Using OCW Resources: Applying the Inverted Classroom Model in an Electrical Engineering Course

Transcription:

CS340 Machine learning Lecture 1 Introduction

Administrivia Class web page (check regularly!): www.cs.ubc.ca/~murphyk/teaching/cs340-fall07 TAs: Hoyt Koepke Erik Zawadzki hoytak@cs.ubc.ca epz@cs.ubc.ca Tutorials: T1A Thur 3.30-4.30 (Hoyt), Frank Forward Building (behind barn), room 317 T1B Wed 4-5 (Erik), MacLeod 214 Office hours: By appointment Midterm: Wed 10 October

Grading Midterm: 25% Final: 50% Grading Weekly Assignments: 25% Collaboration policy: You can collaborate on homeworks if you write the name of your collaborators on what you hand in; however, you must understand everything you write, and be able to do it on your own (eg. in the exam!) Sickness policy: If you cannot do an assignment or an exam, you must come see me in person; a doctor's note (or equivalent) will be required.

Pre-requisites You should know (or be prepared to learn) Basic multivariate calculus e.g., Basic linear algebra e.g., Basic probability/ statistics e.g. Basic data structures and algorithms (e.g., trees, lists, sorting, dynamic programming, etc)

Textbook None required I will give handouts and slides. However, the following are recommended. Bishop HTF DHS (Duda, Hart, Stork)

More recommended books

Matlab Matlab is a mathematical scripting language widely used for machine learning (and engineering and numerical computation in general). Everyone should have access to Matlab via their CS account. If not, you can ask the TAs for a CS guest account. You can buy a student version for $170 from the UBC bookstore, but you will also need the stats toolbox (and sometimes also the optimization toolbox). Hoyt will give a brief introduction to Matlab in class on Mon 10th. Prof Mitchell will give a brief Matlab tutorial on Wed Sept 12, 5pm - 7pm, DMP 110 The first homework (due on Mon 17th) consists of some simple Matlab programming. Check you have Matlab today!

Learning objectives By the end of this class, you should be able to Understand basic principles of machine learning and its connection to other fields Derive, in a precise and concise fashion, the relevant mathematical equations needed for familiar and novel models/ algorithms Implement, in reasonably efficient Matlab, various familiar and novel ML model/ algorithms Choose an appropriate method and apply it to various kinds of data/ problem domains

Ask questions early and often!

End of administrivia

What is machine learning? Electrical engineering CS Statistics ML Psychology Philosophy Neuroscience

What is machine learning? ``Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task or tasks drawn from the same population more efficiently and more effectively the next time.'' -- Herbert Simon Closely related to Statistics (fitting models to data and testing them) Data mining/ exploratory data analysis (discovering patterns in data) Adaptive control theory (learning models online and using them to achieve goals) AI (building intelligent machines by hand)

Types of machine learning Supervised Learning Predict output from input Unsupervised Learning Find patterns in data Reinforcement Learning Learn how to behave in novel environments (eg robot navigation) not covered in this class see e.g., CS422

Why Learn? Machine learning is programming computers to optimize a performance criterion using example data or past experience. There is no need to learn to calculate payroll Learning is used when: Humans not in loop (navigating on Mars) Humans are unable to explain their expertise (speech recognition) Solution changes in time (routing on a computer network) Solution needs to be adapted to particular cases (user biometrics)

Binary classification - credit card scoring

Supervised learning as function fitting Given parametric function f in hypothesis class H f H : X Θ Y ŷ = f(x,θ) And labeled training data D={x 1,y 1,...,x N,y N } Estimate parameters θ given D so that predictions on test set are as accurate as possible

Example function X =R 2,Y={hi,lo} f(x,θ)= IF income > θ 1 AND savings > θ 2 THEN low-risk ELSE high-risk

Decision trees

H = {Axis-parallel hyper-planes}

Multi-class Decision trees

What's the right hypothesis class H?

Linearly separable data Linearly separable means if f is a linear function of x, we can perfectly fit the training data f(x,θ) = sgn(θ T x) = sgn(θ 0 +θ 1 x 1 +θ 2 x 2 ) { 1,+1}

Not linearly separable

Quadratically separable f(x,θ)=sgn(θ 0 +θ 1 x 1 +θ 2 x 2 +θ 3 x 2 1+θ 4 x 2 2+θ 5 x 1 x 2 )

Noisy/ mislabeled data

Overfitting An overly flexible function memorizes irrelevant details of training set

Overfitted functions do not predict test data Predict label of green points

Overfitted functions do not predict test data Test points are mis-predicted

Tradeoff simplicity for model fit

Tradeoff simplicity for model fit

Occam s razor If two models fit the data equally well, pick the simpler one In general, since our goal is to predict the test data, we may choose to incur errors on the training set if it results in a simpler function

Function fitting 1. Choose right hypothesis class H given D linear quadratic Depth-2 decision tree 2. Fit parameters of function θ given H and D f(x,θ)=sgn(θ T x)=sgn(θ 0 +θ 1 x 1 +θ 2 x 2 )

Hypothesis class depends on amount of data More complex function is ok if we have more data, because we have more evidence for it

Hypothesis class depends on type of data Decision regions may be discontinuous

Hypothesis class depends on type of data Input features may be discrete (X not Euclidean space) yes blue? yes oval? no big? no no yes

Classifying gene microarray data What s the right hypothesis class now?

Handwritten digit recognition x t R 16 16, y t {0,...,9} What s the right hypothesis class now?

Face Recognition Training examples of a person Possibly no negative examples Test images What s the right hypothesis class now?

Face detection in images

Face detection in images

Face detection in images

Face detection in images

Face detection in images

Face detection in images

Jordan Reynolds, UBC, 2004 Car detection

Classifying image patches Texture classification using SVMs foliage, building, sky, water Image Retrieval Source: Mike Cora, UBC, 2005

Place recognition for a wearable computer

Place recognition for a mobile robot

Natural language processing (NLP) We do not yet know good ways to represent the "meaning of a sentence (this is called the knowledge representation problem in AI) Current approaches to statistical NLP involve shallow parsing, where the meaning of a sentence can be represented by fields in a database eg "Microsoft acquired AOL for $1M yesterday" "Yahoo failed to avoid a hostile takeover from Google" Buyer Buyee When Price MS AOL Yesterday $1M Google Yahoo??

Learning how to talk: nettalk Mary had a little lamb, its fleece Source: Sejnowski & Rosenberg, 1987