Machine Learning (CSE 446): Introduction

Similar documents
(Sub)Gradient Descent

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

Lecture 1: Machine Learning Basics

CSL465/603 - Machine Learning

Lecture 1: Basic Concepts of Machine Learning

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Axiom 2013 Team Description Paper

CS Machine Learning

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Learning From the Past with Experiment Databases

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Navigating the PhD Options in CMS

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Rule Learning With Negation: Issues Regarding Effectiveness

Discriminative Learning of Beam-Search Heuristics for Planning

CS 100: Principles of Computing

Active Learning. Yingyu Liang Computer Sciences 760 Fall

B. How to write a research paper

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

EGRHS Course Fair. Science & Math AP & IB Courses

Artificial Neural Networks written examination

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Getting Started with Deliberate Practice

Rule Learning with Negation: Issues Regarding Effectiveness

Welcome to. ECML/PKDD 2004 Community meeting

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Office Hours: Mon & Fri 10:00-12:00. Course Description

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Time series prediction

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

INTERMEDIATE ALGEBRA Course Syllabus

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

CSC200: Lecture 4. Allan Borodin

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

arxiv: v1 [cs.lg] 15 Jun 2015

ALL-IN-ONE MEETING GUIDE THE ECONOMICS OF WELL-BEING

Airplane Rescue: Social Studies. LEGO, the LEGO logo, and WEDO are trademarks of the LEGO Group The LEGO Group.

A Case Study: News Classification Based on Term Frequency

Softprop: Softmax Neural Network Backpropagation Learning

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Statistics and Data Analytics Minor

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Mathematics Success Level E

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

CS 446: Machine Learning

School of Innovative Technologies and Engineering

Assignment 1: Predicting Amazon Review Ratings

Multivariate k-nearest Neighbor Regression for Time Series data -

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

Reducing Features to Improve Bug Prediction

MATH 205: Mathematics for K 8 Teachers: Number and Operations Western Kentucky University Spring 2017

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Speech Recognition at ICSI: Broadcast News and beyond

Lecture 10: Reinforcement Learning

Extracting Opinion Expressions and Their Polarities Exploration of Pipelines and Joint Models

MTH 215: Introduction to Linear Algebra

Probabilistic Latent Semantic Analysis

WHEN THERE IS A mismatch between the acoustic

CS177 Python Programming

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Calibration of Confidence Measures in Speech Recognition

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

arxiv: v1 [cs.cv] 10 May 2017

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Model Ensemble for Click Prediction in Bing Search Ads

The Value of Visualization

Word learning as Bayesian inference

Speech Emotion Recognition Using Support Vector Machine

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

Semantic and Context-aware Linguistic Model for Bias Detection

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Modeling function word errors in DNN-HMM based LVCSR systems

Social Media Journalism J336F Unique Spring 2016

Course Content Concepts

Syllabus Foundations of Finance Summer 2014 FINC-UB

AST Introduction to Solar Systems Astronomy

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Foothill College Summer 2016

LEGO MINDSTORMS Education EV3 Coding Activities

Human Emotion Recognition From Speech

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Indian Institute of Technology, Kanpur

Modeling function word errors in DNN-HMM based LVCSR systems

Transcription:

Machine Learning (CSE 446): Introduction Sham M Kakade c 2018 University of Washington cse446-staff@cs.washington.edu Jan 3, 2018 1 / 18

Learning and Machine Learning? Broadly, what is learning? Wikipedia, Learning is the process of acquiring new or modifying existing knowledge, behaviors, skils, values, or preferences. Evidences that learning has occurred may be seen in changes in behavior from simle to complex. What is machine learning? An AI centric viewpoint: ML is about getting computers to do the types of things people are good at. How is it... different from statistics? different from AI? (When people say AI they almost always mean ML. ) 2 / 18

What is ML about? Easy for a computer: (42384 3421.82) 1/3 Easy for a child: speech recognition object recognition question/answering ( what color is the sky? ) Computers are designed to execute mathematically precise computational primitives (and they have become much faster!). This class: The algorithmic and statistical thinking (and techniques) for how we train computers to get better at these more easy-for-human tasks. 3 / 18

ML is starting to work... I No longer just an academic pursuit... I Almost overnight impacts to society: (threshold) improvements in performance translate into societal impact 4 / 18

Today, ML is begin used for: Video and image processing Speech and language processing Search engines Robot control Medical and health analysis not just AI-ish problems: sensor networks, traffic navigation, medical imaging, computational biology, finance 5 / 18

Is it Magic? sort of, yes : why is the future (and never-before-seen instances) predictable from the past? inductive bias is critical for learning. in practice, no : we will examine the algorithmic tools and statistical methods appropriately. responsibly, NO : there are consequences and limitations. 6 / 18

Course logistics 6 / 18

Your Instructors Sham Kakade (instructor) Research interests: theory: rigorous algorithmic and statistical analysis of these methods practice: understanding how to advance the state of the art (robotics, music +comp. vision, NLP) TAs: Kousuke Ariga, Benjamin Evans, Xingfan Huang, Sean Jaffe, Vardhman Mehta, Patrick Spieker, Jeannette Yu, Kaiyu Zheng. 7 / 18

Info Course website: https://courses.cs.washington.edu/courses/cse446/18wi/ Contact: cse446-staff@cs.washington.edu Please only use this email for course related questions (unless privacy is needed). Canvas: https://canvas.uw.edu/courses/1124156/discussion_topics Office hours: TBA. 8 / 18

Textbooks A Course in Machine Learning, Hal Daume. Machine Learning: A Probabilistic Perspective, Kevin Murphy. 9 / 18

Outline of CSE 446 Problem formulations: classification, regression Techniques: decision trees, nearest neighbors, perceptron, linear models, probabilistic models, neural networks, kernel methods, clustering Meta-techniques : ensembles, expectation-maximization Understanding ML: limits of learning, practical issues, bias & fairness Recurring themes: (stochastic) gradient descent, the scope of ML, overfitting 10 / 18

Grading Assignments (40%) 5 in total both mathematics pencil and paper, mostly programming Graded based on attempt and correctness Late policy: 33% off for (up to) one day late; 66% off for (up to) two days late;... Midterm (20%) Final exam (40%) Caveat: Your grade may go up or down in extreme cases. (down) Failure to hand in all the HW, (up) very strong exam scores You MUST make the exam dates (unless you have an exception based on UW policies). Do not enroll in the course otherwise. 11 / 18

Can I Take The Class? Short answer: if you are qualified and can register, yes Math prerequisites: probability, statistics, algorithms, and linear algebra background. Programming prereqs: strong programmer (e.g. comfortable in python) We will move fast; lectures will focus on concepts and mathematics work hard, do the readings, etc... 12 / 18

To-Do List Quiz section meetings start tomorrow. Bring your laptop! Python review Readings (do them, before the class) Academic integrity statement: on the course web page. ultimately, it is up to you to carry yourself with integrity. Gender and diversity statement (an acknowledgement): please try to act appropriately, knowing that. 13 / 18

Integrity Academic integrity policy: on the course web page. ultimately, it is up to you to carry yourself with integrity. Gender and diversity statement: (an acknowledgement) the current state is not balanced in any reasonable way; please try to act appropriately. people can surprise you... 14 / 18

The Standard Learning Framework 14 / 18

Inductive Supervised Machine Learning Training: a learning algorithm takes a set of example input-output pairs, {(x 1, y 1 ),... (x N, y N )}, and returns a function f (the hypothesis ); the goal is for f(x) to recover the true label y, for each example, and on future examples Testing: we check how well f predicts on a set of test examples, {(x 1, y 1 ),... (x M, y M )}, by measuring how well f(x ) matches y. training data (x, y) (x, y) (x, i, y) y i ) learning algorithm x f f(x) y 15 / 18

Inputs and Output x can be pretty much anything we can represent To start, we ll think of x as a vector (really, a tuple ) of features, where each feature φ(x) maps the instance into some set. Sometimes Φ(x) denotes the tuple (the vector of all the features). y can be a real value (regression) a label (classification) an ordering (ranking) a vector (multivariate regression) a sequence/tree/graph (structured prediction)... 16 / 18

Classification Examples Predict an object in image: (structured prediction) Predict words from an audio signal: (structured prediction) predict a sentence from a sentence: 17 / 18

More Examples: Regression: Predict the depth of an object (e.g. a pedestrian) in an image. Ranking: What order of ads should be displayed? 18 / 18