Lecture 1: Course outline and logistics What is Machine Learning. Aykut Erdem February 2016 Hacettepe University

Similar documents
Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Lecture 1: Basic Concepts of Machine Learning

Python Machine Learning

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS 446: Machine Learning

CSL465/603 - Machine Learning

Learning From the Past with Experiment Databases

(Sub)Gradient Descent

Universidade do Minho Escola de Engenharia

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CS Machine Learning

Reducing Features to Improve Bug Prediction

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Human Emotion Recognition From Speech

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

A Case Study: News Classification Based on Term Frequency

Switchboard Language Model Improvement with Conversational Data from Gigaword

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Laboratorio di Intelligenza Artificiale e Robotica

Rule Learning With Negation: Issues Regarding Effectiveness

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Time series prediction

Laboratorio di Intelligenza Artificiale e Robotica

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

Learning Lesson Study Course

Welcome to. ECML/PKDD 2004 Community meeting

Speech Emotion Recognition Using Support Vector Machine

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Generative models and adversarial training

Australian Journal of Basic and Applied Sciences

Discriminative Learning of Beam-Search Heuristics for Planning

Firms and Markets Saturdays Summer I 2014

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

An OO Framework for building Intelligence and Learning properties in Software Agents

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Probabilistic Latent Semantic Analysis


Rule Learning with Negation: Issues Regarding Effectiveness

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

95723 Managing Disruptive Technologies

CS177 Python Programming

Axiom 2013 Team Description Paper

Indian Institute of Technology, Kanpur

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

CS 100: Principles of Computing

Word Segmentation of Off-line Handwritten Documents

Knowledge-Based - Systems

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

MYCIN. The MYCIN Task

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Data Structures and Algorithms

arxiv: v1 [cs.lg] 15 Jun 2015

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Applications of data mining algorithms to analysis of medical data

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Multivariate k-nearest Neighbor Regression for Time Series data -

Model Ensemble for Click Prediction in Bing Search Ads

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Evolution of Symbolisation in Chimpanzees and Neural Nets

Introduction to Psychology

Assignment 1: Predicting Amazon Review Ratings

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Using EEG to Improve Massive Open Online Courses Feedback Interaction

Predicting Students Performance with SimStudent: Learning Cognitive Skills from Observation

MTH 141 Calculus 1 Syllabus Spring 2017

T Seminar on Internetworking

Issues in the Mining of Heart Failure Datasets

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Evolutive Neural Net Fuzzy Filtering: Basic Description

Multi-Lingual Text Leveling

TU-E2090 Research Assignment in Operations Management and Services

ICRSA James D. Lynch William J. Padgett Edsel A. Peña. June 2, 2003

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Statistics and Data Analytics Minor

BIODIVERSITY: CAUSES, CONSEQUENCES, AND CONSERVATION

Computerized Adaptive Psychological Testing A Personalisation Perspective

Linking Task: Identifying authors and book titles in verbose queries

Guide to Teaching Computer Science

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

Learning Methods for Fuzzy Systems

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Humboldt-Universität zu Berlin

Transcription:

Lecture 1: Course outline and logistics What is Machine Learning Aykut Erdem February 2016 Hacettepe University

Today s Schedule Course outline and logistics An overview of Machine Learning 2

Course outline and logistics

Logistics Instructor: Aykut ERDEM (aykut@cs.hacettepe.edu.tr) Teaching Assistant: Aysun Kocak (aysunkocak@cs.hacettepe.edu.tr) Burcak Asal (basal@cs.hacettepe.edu.tr) Lectures: Tue 10:00-10:50_D10 Thu 09:00-10:50_D9 Tutorials: Fri 09:00-10:50_D8 4

About this course This is a undergraduate-level introductory course in machine learning (ML) A broad overview of many concepts and algorithms in ML. Requirements Basic algorithms, data structures. Basic probability and statistics. Basic linear algebra and calculus Good programming skills common distributions, Bayes rule, mean/median/model vector/matrix manipulations, partial derivatives BBM 409 Introduction to Machine Learning Practicum (New) Students will gain skills to apply the concepts to real world problems. 5

Communication The course webpage will be updated regularly throughout the semester with lecture notes, programming and reading assignments and important deadlines. http://web.cs.hacettepe.edu.tr/~aykut/classes/ spring2016/bbm406/ We will be using Piazza for course related discussions and announcements. Please enroll the class on Piazza by following the link http://piazza.com/class#spring2016/bbm406 6

Reference Books Artificial Intelligence: A Modern Approach (3rd Edition), Russell and Norvig. Prentice Hall, 2009 Bayesian Reasoning and Machine Learning, Barber, Cambridge University Press, 2012. (online version available) Introduction to Machine Learning (2nd Edition), Alpaydin, MIT Press, 2010 Pattern Recognition and Machine Learning, Bishop, Springer, 2006 Machine Learning: A Probabilistic Perspective, Murphy, MIT Press, 2012 7

Grading Policy Grading for BBM 406 will be based on a course project (done in pairs) (25%), a midterm exam (30%), a final exam (40%), and class participation (5%) In BBM 409, the grading will be based on a set of quizzes (20%), and 3 assignments (done individually) 8

Assignments 3 assignments, first one worth 20%, last two worth 30% each Theoretical: Pencil-and-paper derivations Programming: Implementing Python code to solve a given real-world problem A quick Python tutorial in this week s tutorial session. 9

10

Course Project Done individually, or in teams of two students. Choose your own topic and explore ways to solve the problem Proposal: 1 page (Mar 8) (10%) Progress Report: 4-5 pages (Apr 19) (25%) Poster Presentation: (last week of classes) (20%) Final Report: (due at the beginning of poster session) (45%) 11

Collaboration Policy All work on assignments have to be done individually. The course project, however, can be done in pairs. You are encouraged to discuss with your classmates about the given assignments, but these discussions should be carried out in an abstract way. In short, turning in someone else s work, in whole or in part, as your own will be considered as a violation of academic integrity. Please note that the former condition also holds for the material found on the web as everything on the web has been written by someone else. http://www.plagiarism.org/plagiarism-101/prevention/ 12

Course Outline Week1 Overview of Machine Learning, Nearest Neighbor Classifier Week2 Linear Regression, Least Squares Week3 Machine Learning Methodology Assg1 out Week4 Statistical Estimation: MLE, MAP, Naïve Bayes Classifier Week5 Linear Classification Models: Logistic Regression, Linear Discriminant Functions, Perceptron Week6 Neural Networks Week7 Midterm Exam Assg1 due, Assg2 out Course project proposal due Assg2 due Assg3 out 13

Course Outline (cont d.) Week8 Deep Learning Week9 Support Vector Machines (SVMs) Week10 Multi-class SVM Assg3 due Week11 Decision Tree Learning Week12 Ensemble Methods: Bagging, Random Forests, Boosting Week13 Clustering Project progress report due Week14 Principle Component Analysis, Autoencoders 14

Machine Learning: An Overview

Quotes If you were a current computer science student what area would you start studying heavily? Answer: Machine Learning. The ultimate is computers that learn Bill Gates, Reddit AMA Machine learning is the next Internet Tony Tether, Director, DARPA Machine learning is today s discontinuity slide by David Sontag Jerry Yang, CEO, Yahoo 16

Google Trends Machine learning Deep learning 17

2015 Edition

2016 Edition

Learning slide by Bernhard Schölkopf Richard Feynman 20

Two definitions of learning (1) Learning is the acquisition of knowledge about the world. Kupfermann (1985) (2) Learning is an adaptive change in behavior caused by experience. slide by Bernhard Schölkopf (1988) Shepherd 21

Empirical Inference Drawing conclusions from empirical data (observations, measurements) Example1: Scientific inference y = Σ i a i k(x,x i ) + b y x x x x y = a * x slide by Bernhard Schölkopf x x x Leibniz, Weyl, Chaitin x x x 8 Bernhard Schölkopf 22

Empirical Inference Example2: Perception slide by Bernhard Schölkopf 23

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

slide by Bernhard Schölkopf

Empirical Inference Example2: Perception "The brain is nothing but a sta0s0cal decision organ" H. Barlow slide by Bernhard Schölkopf 44

slide by Bernhard Schölkopf X

slide by Bernhard Schölkopf X

slide by Bernhard Schölkopf

What is machine learning?

Example: Netflix Challenge Goal: Predict how a viewer will rate a movie 10% improvement = 1 million dollars slide by Yaser Abu-Mostapha 50

Example: Netflix Challenge Goal: Predict how a viewer will rate a movie 10% improvement = 1 million dollars Essence of Machine Learning: A pattern exists We cannot pin it down mathematically We have data on it slide by Yaser Abu-Mostapha 51

Watch out AlphaGo vs. Lee Sedol in March! 52

Comparison Traditional Programming Data Program Computer Output slide by Pedro Domingos, Tom Mitchel, Tom Dietterich Machine Learning Data Computer Output Program 53

What is Machine Learning? [Arthur Samuel, 1959] Field of study that gives computers the ability to learn without being explicitly programmed [Kevin Murphy] algorithms that automatically detect patterns in data use the uncovered patterns to predict future data or other outcomes of interest [Tom Mitchell] algorithms that improve their performance (P) slide by Dhruv Batra at some task (T) with experience (E) 54

What is Machine Learning? If you are a Scientist Data Machine Learning Understanding slide by Dhruv Batra If you are an Engineer / Entrepreneur Get lots of data Machine Learning??? Profit! 55

Why Study Machine Learning? Engineering Better Computing Systems Develop systems too difficult/expensive to construct manually because they require specific detailed skills/knowledge knowledge engineering bottleneck Develop systems that adapt and customize themselves to individual users. Personalized news or mail filter Personalized tutoring Discover new knowledge from large databases slide by Dhruv Batra Medical text mining (e.g. migraines to calcium channel blockers to magnesium) data mining 56

Why Study Machine Learning? Cognitive Science Computational studies of learning may help us understand learning in humans and other biological organisms. Hebbian neural learning Neurons that fire together, wire together. slide by Dhruv Batra 57

Why Study Machine Learning? The Time is Ripe Algorithms Many basic effective and efficient algorithms available. Data Large amounts of on-line data available. Computing Large amounts of computational resources available. slide by Ray Mooney 58

Where does ML fit in? slide by Fei Sha 59

A Brief History of AI slide by Dhruv Batra 60

adopted from Dhruv Batra 61

AI Predictions: Experts slide by Dhruv Batra Image Credit: http://intelligence.org/files/predictingai.pdf 62

AI Predictions: Non-Experts slide by Dhruv Batra Image Credit: http://intelligence.org/files/predictingai.pdf 63

AI Predictions: Failed slide by Dhruv Batra Image Credit: http://intelligence.org/files/predictingai.pdf 64

Why is AI hard? slide by Dhruv Batra Image Credit: http://karpathy.github.io/2012/10/22/state-of-computer-vision/ 65

What humans see slide by Larry Zitnick 66

What computers see 243 239 240 225 206 185 188 218 211 206 216 225 242 239 218 110 67 31 34 152 213 206 208 221 243 242 123 58 94 82 132 77 108 208 208 215 235 217 115 212 243 236 247 139 91 209 208 211 233 208 131 222 219 226 196 114 74 208 213 214 232 217 131 116 77 150 69 56 52 201 228 223 232 232 182 186 184 179 159 123 93 232 235 235 232 236 201 154 216 133 129 81 175 252 241 240 235 238 230 128 172 138 65 63 234 249 241 245 237 236 247 143 59 78 10 94 255 248 247 251 234 237 245 193 55 33 115 144 213 255 253 251 248 245 161 128 149 109 138 65 47 156 239 255 190 107 39 102 94 73 114 58 17 7 51 137 23 32 33 148 168 203 179 43 27 17 12 8 17 26 12 160 255 255 109 22 26 19 35 24 slide by Larry Zitnick 67

I saw her duck slide by Liang Huang Image Credit: Liang Huang 68

I saw her duck slide by Liang Huang Image Credit: Liang Huang 69

I saw her duck slide by Liang Huang Image Credit: Liang Huang 70

We ve come a long way IBM Watson What is Jeopardy? http://youtu.be/xqb66bdsqlw?t=53s Challenge: http://youtu.be/_429uizn1jm Watson Demo: http://youtu.be/wfr3lom_xhe?t=22s Explanation http://youtu.be/d_yxv22o6n4?t=4s IBM Watson wins on Jeopardy (February 2011) Watson provides cancer treatment options to doctors in seconds (February 2013) slide by Liang Huang Future: Automated operator, doctor assistant, finance 71

Why are things working today? More compute power Better More data Better algorithms/ models Accuracy slide by Dhruv Batra Amount of Training Data Figure Credit: Banko & Brill, 2011 72