Introduction to Machine Learning

Similar documents
(Sub)Gradient Descent

Lecture 1: Machine Learning Basics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

CSL465/603 - Machine Learning

CS Machine Learning

A Case Study: News Classification Based on Term Frequency

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Lecture 1: Basic Concepts of Machine Learning

CS 446: Machine Learning

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Learning From the Past with Experiment Databases

MTH 215: Introduction to Linear Algebra

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Compositional Semantics

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

PHY2048 Syllabus - Physics with Calculus 1 Fall 2014

Data Structures and Algorithms

Laboratorio di Intelligenza Artificiale e Robotica

Exploration. CS : Deep Reinforcement Learning Sergey Levine

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

Course Syllabus for Math

Axiom 2013 Team Description Paper

Softprop: Softmax Neural Network Backpropagation Learning

Lecture 10: Reinforcement Learning

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Foothill College Summer 2016

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Artificial Neural Networks written examination

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Word learning as Bayesian inference

Intensive English Program Southwest College

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Mastering Biology Test Answers

EGRHS Course Fair. Science & Math AP & IB Courses

MAR Environmental Problems & Solutions. Stony Brook University School of Marine & Atmospheric Sciences (SoMAS)

School of Innovative Technologies and Engineering

Probability and Game Theory Course Syllabus

CS Course Missive

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Pre-AP Geometry Course Syllabus Page 1

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Test How To. Creating a New Test

B. How to write a research paper

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

CS 101 Computer Science I Fall Instructor Muller. Syllabus

ANT 2000: Intro to Anthropology Room #RDB 1100 (Law Bldg) Mon. & Wed. 2:00 4:45 p.m. Summer B 2012 (June 25 Aug. 8)

Self Study Report Computer Science

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Word Segmentation of Off-line Handwritten Documents

Natural Language Processing: Interpretation, Reasoning and Machine Learning

INTERMEDIATE ALGEBRA Course Syllabus

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT

Exploring Derivative Functions using HP Prime

A survey of multi-view machine learning

Students Understanding of Graphical Vector Addition in One and Two Dimensions

ECO 3101: Intermediate Microeconomics

Universidade do Minho Escola de Engenharia

Switchboard Language Model Improvement with Conversational Data from Gigaword

Laboratorio di Intelligenza Artificiale e Robotica

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

Fall 2016 ARA 4400/ 7152

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Data Fusion Through Statistical Matching

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

arxiv: v1 [cs.lg] 15 Jun 2015

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Model Ensemble for Click Prediction in Bing Search Ads

arxiv: v2 [cs.cv] 30 Mar 2017

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Reducing Features to Improve Bug Prediction

Syllabus: PHI 2010, Introduction to Philosophy

MGT/MGP/MGB 261: Investment Analysis

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011

An OO Framework for building Intelligence and Learning properties in Software Agents

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

San José State University

Navigating the PhD Options in CMS

SYLLABUS: RURAL SOCIOLOGY 1500 INTRODUCTION TO RURAL SOCIOLOGY SPRING 2017

UNIT ONE Tools of Algebra

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Indian Institute of Technology, Kanpur

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Generative models and adversarial training

The stages of event extraction

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

STA 225: Introductory Statistics (CT)

Transcription:

Introduction to Machine Learning CMSC 422 MARINE CARPUAT marine@cs.umd.edu

What is this course about? Machine learning studies algorithms for learning to do stuff By finding (and exploiting) patterns in data

What can we do with machine learning? Analyze text & speech Teach robots how to cook from youtube videos Recognize objects in images Analyze genomics data

Sometimes machines even perform better than humans! Question Answering system beats Jeopardy champion Ken Jennings at Quiz bowl!

Machine Learning Paradigm: Programming by example Replace ``human writing code'' with ``human supplying data'' Most central issue: generalization How to abstract from ``training'' examples to ``test'' examples?

A growing and fast moving field Broad applicability Finance, robotics, vision, machine translation, medicine, etc. Close connection between theory and practice Open field, lots of room for new work!

Course Goals By the end of the semester, you should be able to Look at a problem Identify if ML is an appropriate solution If so, identify what types of algorithms might be applicable Apply those algorithms This course is not A survey of ML algorithms A tutorial on ML toolkits such as Weka, TensorFlow,

Topics Foundations of Supervised Learning Decision trees and inductive bias Geometry and nearest neighbors Perceptron Practical concerns: feature design, evaluation, debugging Beyond binary classification Advanced Supervised Learning Linear models and gradient descent Support Vector Machines Naive Bayes models and probabilistic modeling Neural networks Kernels Ensemble learning Unsupervised learning K-means PCA Expectation maximization

What you can expect from the instructors Teaching Assistant: Xing Niu We are here to help you learn by Introducing concepts from multiple perspectives Theory and practice Readings and class time Providing opportunities to practice, and feedback to help you stay on track Homeworks Programming assignments

What I expect from you Work hard (this is a 3-credit class!) Do a lot of math (calculus, linear algebra, probability) Do a fair amount of programming Come to class prepared Do the required readings!

Highlights from course logistics Grading Homeworks (20%), ~10, almost weekly Programming projects (30%), 3 of them, in teams of two or three students Midterm exam (20%), in class Final exam (30%), cumulative, in class. HW01 is due Thu 10:59am No late homeworks Read syllabus here: http://www.cs.umd.edu/ class/spring2017/cmsc4 22//syllabus/

Where to find the readings: A Course in Machine Learning view and submit assignments: Canvas check your grades: Canvas ask and answer questions, participate in discussions and surveys, contact the instructors, and everything else: Piazza Please use piazza instead of email

Today s topics What does it mean to learn by example? Classification tasks Inductive bias Formalizing learning

Classification tasks How would you write a program to distinguish a picture of me from a picture of someone else? Provide examples pictures of me and pictures of other people and let a classifier learn to distinguish the two.

Classification tasks How would you write a program to distinguish a sentence is grammatical or not? Provide examples of grammatical and ungrammatical sentences and let a classifier learn to distinguish the two.

Classification tasks How would you write a program to distinguish cancerous cells from normal cells? Provide examples of cancerous and normal cells and let a classifier learn to distinguish the two.

Classification tasks How would you write a program to distinguish cancerous cells from normal cells? Provide examples of cancerous and normal cells and let a classifier learn to distinguish the two.

Let s try it out Your task: learn a classifier to distinguish class A from class B from examples

Examples of class A:

Examples of class B

Let s try it out learn a classifier from examples Now: predict class on new examples using what you ve learned

What if I told you

Key ingredients needed for learning Training vs. test examples Memorizing the training examples is not enough! Need to generalize to make good predictions on test examples Inductive bias Many classifier hypotheses are plausible Need assumptions about the nature of the relation between examples and classes

Machine Learning as Function Approximation Problem setting Set of possible instances X Unknown target function f: X Y Set of function hypotheses H = h h: X Y} Input Training examples { x 1, y 1, x N, y N } of unknown target function f Output Hypothesis h H that best approximates target function f

Formalizing induction: Loss Function l(y, f(x)) where y is the truth and f x is the system s prediction e.g. l y, f(x) = 0 if y = f(x) 1 otherwise Captures our notion of what is important to learn

Formalizing induction: Data generating distribution Where does the data come from? Data generating distribution A probability distribution D over (x, y) pairs We don t know what D is! We only get a random sample from it: our training data

Formalizing induction: Expected loss f should make good predictions as measured by loss l on future examples that are also drawn from D Formally ε, the expected loss of f over D with respect to l should be small ε E x,y ~D l(y, f(x)) = D x, y l(y, f(x)) (x,y)

Formalizing induction: Training error We can t compute expected loss because we don t know what D is We only have a sample of D training examples { x 1, y 1, x N, y N } All we can compute is the training error ε N 1 n=1 N l(y n, f(x n ))

Formalizing Induction Given a loss function l a sample from some unknown data distribution D Our task is to compute a function f that has low expected error over D with respect to l. E x,y ~D l(y, f(x)) = D x, y l(y, f(x)) (x,y)

Recap: introducing machine learning What does learning by example mean? Classification tasks Learning requires examples + inductive bias Generalization vs. memorization Formalizing the learning problem Function approximation Learning as minimizing expected loss

Your tasks before next class Check out course webpage, Canvas, Piazza Start reading Get started on HW01 Let me know dates of religious holidays you observe this semester Let me know if you will need DSS arrangements