EECS 349 Machine Learning

Similar documents
CSL465/603 - Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

(Sub)Gradient Descent

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Machine Learning Basics

Axiom 2013 Team Description Paper

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Laboratorio di Intelligenza Artificiale e Robotica

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Laboratorio di Intelligenza Artificiale e Robotica

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Rule Learning With Negation: Issues Regarding Effectiveness

Probability and Game Theory Course Syllabus

Probabilistic Latent Semantic Analysis

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

CS 3516: Computer Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Math 96: Intermediate Algebra in Context

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

CS 100: Principles of Computing

Statistics and Data Analytics Minor

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Reducing Features to Improve Bug Prediction

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

Data Structures and Algorithms

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Rule Learning with Negation: Issues Regarding Effectiveness

Welcome to. ECML/PKDD 2004 Community meeting

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Model Ensemble for Click Prediction in Bing Search Ads

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Human Emotion Recognition From Speech

ANT 3520 (Online) Skeleton Keys: Introduction to Forensic Anthropology Spring 2015

Self Study Report Computer Science

Learning Methods for Fuzzy Systems

Australian Journal of Basic and Applied Sciences

Foothill College Summer 2016

Instructor Dr. Kimberly D. Schurmeier

AGN 331 Soil Science. Lecture & Laboratory. Face to Face Version, Spring, Syllabus

EGRHS Course Fair. Science & Math AP & IB Courses

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

AGN 331 Soil Science Lecture & Laboratory Face to Face Version, Spring, 2012 Syllabus

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Office Hours: Mon & Fri 10:00-12:00. Course Description

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Introduction to Forensic Drug Chemistry

CS Course Missive

Introduction. Chem 110: Chemical Principles 1 Sections 40-52

Active Learning. Yingyu Liang Computer Sciences 760 Fall

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Learning From the Past with Experiment Databases

Artificial Neural Networks written examination

CS Machine Learning

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Assignment 1: Predicting Amazon Review Ratings

ITSC 1301 Introduction to Computers Course Syllabus

Introduction to Personality Daily 11:00 11:50am

COURSE WEBSITE:

Switchboard Language Model Improvement with Conversational Data from Gigaword

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

Dinesh K. Sharma, Ph.D. Department of Management School of Business and Economics Fayetteville State University

FINN FINANCIAL MANAGEMENT Spring 2014

General Physics I Class Syllabus

Time series prediction

BUS Computer Concepts and Applications for Business Fall 2012

MGT/MGP/MGB 261: Investment Analysis

Physics XL 6B Reg# # Units: 5. Office Hour: Tuesday 5 pm to 7:30 pm; Wednesday 5 pm to 6:15 pm

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Psychology 102- Understanding Human Behavior Fall 2011 MWF am 105 Chambliss

Math 181, Calculus I

Computer Science 1015F ~ 2016 ~ Notes to Students

arxiv: v1 [cs.lg] 15 Jun 2015

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Semi-Supervised Face Detection

MATH 1A: Calculus I Sec 01 Winter 2017 Room E31 MTWThF 8:30-9:20AM

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Counseling 150. EOPS Student Readiness and Success

University of Florida ADV 3502, Section 1B21 Advertising Sales Fall 2017

A survey of multi-view machine learning

Math Techniques of Calculus I Penn State University Summer Session 2017

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Transcription:

EECS 349 Machine Learning Instructor: Doug Downey (some slides from Pedro Domingos, University of Washington) 1

Logistics Instructor: Doug Downey Email: ddowney@eecs.northwestern.edu Office hours: Mondays 3:30-4:30 (or by appt), Ford 3-345 TAs: Mohammed Alam (Rony), Chen Liang, Nishant Subramani, Hosung Kwon, Jake Samson, Shengxin Zha Web: (linked from prof. homepage) http://www.cs.northwestern.edu/~downey/courses/349_spring 2016/ Also, Canvas and Piazza 2

Grading and Assignments (1 of 2) Assignment Due Date Points Homework 1 12-Apr-16 10 Homework 2 29-Apr-16 15 Project Proposal 7-Apr-15 5+5 Project Status Report 11-May-16 5+5 Homework 3 16-May-16 10 Homework 4 31-May-16 10 Project Website 8-Jun-15 25+5 Quizzes Every Friday (Wk2-Wk9) 8 TOTAL POINTS 103 A A- B+ B B- C+ C C- Etc 93+ 92-90 89-87 86-83 82-80 79-77 76-73 72-70 69 3

Grading and Assignments (2 of 2) Four homeworks (45 pts) Submitted via e-mail according to hmwk instructions Late penalty 10% per day must be within 1 week of original deadline Significant programming, some exercises Quizzes (8 pts) Each Friday weeks 2-9 Bring a device to access Canvas. Practice quiz this week Project (35 pts + 15 peer review) Teams of k Define a task, create/acquire data for the task, train ML algorithm(s), evaluate & report 4

Prerequisites Significant Programming Experience EECS 214, 325 or the equivalent Example: implement decision trees (covered starting Wednesday) Python is the language we ll use But you ll have skeleton code to help you through (also, I don t really know Python.) Basics of probability E.g. independence Basics of logic E.g. DeMorgan s laws 5

Source Materials E. Alpaydin, Introduction to Machine Learning, MIT Press ( required ) Papers & Web pages Reading for this week: Alpaydin, Ch 1, Ch 2 (skip 2.2, 2.3), Ch 9 Optional: When to Hold Out for a Lower Airfare Thinking Big about the Industrial Internet of Things 6

Think/Pair/Share Why study Machine Learning? Think Start End 7

Think/Pair/Share Why study Machine Learning? Think Start End 8

Think/Pair/Share Why study Machine Learning? Pair Start End 9

Think/Pair/Share Why study Machine Learning? Share 10

What is Machine Learning? The study of computer programs that improve automatically with experience T. Mitchell Machine Learning Automating automation Getting computers to program themselves Writing software is the bottleneck Let the data do the work instead! 11

Traditional Programming Input Program Computer Output Machine Learning Input Output Computer Program 12

Magic? No, more like gardening Seeds = Algorithms Nutrients = Data Gardener = You Plants = Programs 13

Case Study: Farecast 14

Sample Applications Web search Computational biology Finance E-commerce Space exploration Robotics Information extraction Social networks Finance Debugging [Your favorite area] Input Output Computer Program 15

Relationship of Machine Learning to Statistics Analytics / Data Science Data Mining Artificial Intelligence 16

Why study Machine Learning? (1 of 5) A breakthrough in machine learning would be worth ten Microsofts (Bill Gates, Chairman, Microsoft) Machine learning is the next Internet (Tony Tether, former Director, DARPA) These Machine quotes learning are is the ~10 hot new years thing old (John Hennessy, President, Stanford) (e.g. Gates is from the NYT, 2004) Web rankings today are mostly a matter of machine learning (Prabhakar Raghavan, Dir. Research, Yahoo) More Machine recent: learning is going to result in a real revolution Artificial (Greg intelligence Papadopoulos, is one CTO, of the Sun) great opportunities for Machine learning is today s discontinuity deep (Jerry learning Yang, research CEO, Yahoo) center) improving the world today, (Reid Hoffman, co-founder of $1B 17

Why study Machine Learning? (2 of 5) http://www.emc.com/leadership/digital-universe/2014iview/executive-summary.htm 18

Why study Machine Learning? (3 of 5) One example, proportion of physicians using EMRs 2001: 18% 2011: 57% 2013: 78% what will be able to learn from these? 19

Why study Machine Learning? (4 of 5) Gartner: 6.4B connected things in 2016 21B in 2020 Intel: 200B connected things by 2020! http://www.techprincess.it/tech-news/infografica-internet-of-things-cose-e-come-funziona/ http://www.gartner.com/newsroom/id/3165317 20

http://www.economist.com/technology-quarterly/2016-03-12/after-moores-law

ML in Practice Understanding domain, prior knowledge, and goals Data integration, selection, cleaning, pre-processing, etc. Learning models Interpreting results Consolidating and deploying discovered knowledge Loop 22

What You ll Learn in this Class How do ML algorithms work? Learn by implementing, using For a real problem, how do I: Express my problem as an ML task Choose the right ML algorithm Evaluate the results 23

ML in a Nutshell Tens of thousands of machine learning algorithms Hundreds new every year Every machine learning algorithm has three components: Representation Evaluation Optimization 24

Representation How do we represent the function from input to output? Decision trees Sets of rules / Logic programs Instances Graphical models (Bayes/Markov nets) Neural networks Support vector machines Model ensembles Etc. 25

Evaluation Given some data, how can we tell if a function is good? Accuracy Precision and recall Squared error Likelihood Posterior probability Cost / Utility Margin Entropy K-L divergence Etc. 26

Optimization Given some data, how do we find the best function? Combinatorial optimization E.g.: Greedy search Convex optimization E.g.: Gradient descent Constrained optimization E.g.: Linear programming 27

Types of Learning Supervised (inductive) learning Training data includes desired outputs Unsupervised learning Training data does not include desired outputs Semi-supervised learning Training data includes a few desired outputs Reinforcement learning Rewards from sequence of actions 28

Inductive Learning Given examples of a function (x, f(x)) Predict function f(x) for new instances x Discrete f(x): Classification Continuous f(x): Regression f(x) = Probability(x): Probability estimation Example: x = <Flight=United 102, FlightDate=May 26, Today=May 7> f(x) = +1 if flight price will increase in the next week, or -1 otherwise 29

What We ll Cover Inductive learning Decision tree induction Instance-based learning Linear Regression and Classification Neural networks Genetic Algorithms Support vector machines Bayesian Learning Learning theory Reinforcement Learning Unsupervised learning Clustering Dimensionality reduction 30

Parting Notes Bring a device to access Canvas for quiz on Friday 31