Introduction to Machine Learning

Introduction to Machine Learning
  CSCI 1950-F
  Instructors: Erik Sudderth & Mark Johnson
  Graduate TA: Deqing Sun
  Undergraduate TAs: Max Barrows & Evan Donahue

Visual Object Recognition
  Image labels: sky, skyscraper, dome, buildings, trees, temple, bell

Spam Filtering
  Binary classification problem: is this e-mail useful or spam?
  Noisy training data: messages previously marked as spam
  Wrinkle: spammers evolve to counter filter innovations
  Spam Filter Express http://www.spam-filter-express.com/

Collaborative Filtering

Social Network Analysis
  Unsupervised discovery and visualization of relationships among people, companies, etc.
  Example: infer relationships among named entities directly from Wikipedia entries
  Chang, Boyd-Graber, & Blei, KDD 2009

Climate Modeling
  Satellites measure sea-surface temperature at sparse locations
    Partial coverage of ocean surface
    Sometimes obscured by clouds, weather
  Would like to infer a dense temperature field, and track its evolution
  NASA Seasonal to Interannual Prediction Project http://ct.gsfc.nasa.gov/annual.reports/ess98/nsipp.html

Speech Recognition
  Given an audio waveform, robustly extract & recognize any spoken words
  Statistical models can be used to
    Provide greater robustness to noise
    Adapt to accent of different speakers
    Learn from training
  S. Roweis, 2004

Target Tracking
  Radar-based tracking of multiple targets
  Visual tracking of articulated objects (L. Sigal et al., 2006)
  Estimate motion of targets in 3D world from indirect, potentially noisy measurements

Robot Navigation: SLAM
  Simultaneous Localization and Mapping
  Landmark SLAM (E. Nebot, Victoria Park)
  CAD Map (S. Thrun, San Jose Tech Museum)
  Estimated Map
  As robot moves, estimate its pose & world geometry

Human Tumor Microarray Data

Financial Forecasting
  http://www.steadfastinvestor.com/
  Predict future market behavior from historical data, news reports, expert ...

Administrative Details
  Prerequisites: comfort with basic programming, calculus, linear algebra, and probability
  Grading: undergraduate versus graduate
  Syllabus: subject to revision!

What is machine learning?
  Given a collection of examples (training data), predict something about novel examples
  The novel examples are usually incomplete
  Example: sorting fish
    Fish come off a conveyor belt in a fish factory
    Your job: figure out what kind each fish is

Automatically sorting fish

Sorting fish as a machine-learning problem
  Training data D = ((x_1, y_1), ..., (x_n, y_n))
    A vector of measurements (features) x_i (e.g., weight, length, color) of each fish
    A label y_i for each fish
  At run-time: given a novel feature vector x, predict the corresponding label y
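One way to make the notation concrete is sketched below. It stores D as a list of (feature vector, label) pairs and predicts the label of a novel fish by copying the label of the closest training example (1-nearest neighbor). The species names, the choice of features, the numbers, and the nearest-neighbor rule are all assumptions made for illustration; the slide does not prescribe a particular prediction method.

```python
from math import dist

# Hypothetical training data D = ((x_1, y_1), ..., (x_n, y_n)).
# Each x_i is (weight in kg, length in cm); each y_i is a species label.
# All values are invented for illustration.
training_data = [
    ((1.2, 30.0), "salmon"),
    ((1.5, 34.0), "salmon"),
    ((3.8, 62.0), "sea bass"),
    ((4.1, 66.0), "sea bass"),
]

def predict(x, data):
    """Return the label of the training example closest to x (1-nearest neighbor)."""
    _, nearest_label = min(data, key=lambda pair: dist(pair[0], x))
    return nearest_label

# Run-time: a novel feature vector x arrives without a label.
novel_fish = (3.5, 60.0)
print(predict(novel_fish, training_data))
```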

Length as a feature for classifying fish
  Need to pick a decision boundary
  Minimize expected loss
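A minimal sketch of picking a decision boundary on a single feature follows. It assumes 0-1 loss and approximates the expected loss by the average loss on a small, invented training set; the species names, the lengths, and the rule "longer than the threshold means sea bass" are hypothetical.

```python
# Hypothetical (length in cm, label) pairs; invented for illustration.
# The classes overlap, so no threshold classifies every fish correctly.
fish = [
    (28, "salmon"), (31, "salmon"), (35, "salmon"), (40, "sea bass"),
    (44, "salmon"), (52, "sea bass"), (58, "sea bass"), (63, "sea bass"),
]

def average_loss(threshold, data):
    """Fraction misclassified by the rule: length > threshold -> sea bass (0-1 loss)."""
    errors = 0
    for length, label in data:
        guess = "sea bass" if length > threshold else "salmon"
        errors += guess != label
    return errors / len(data)

def best_threshold(data):
    """Try midpoints between consecutive sorted lengths; keep the lowest-loss one."""
    lengths = sorted(length for length, _ in data)
    candidates = [(a + b) / 2 for a, b in zip(lengths, lengths[1:])]
    return min(candidates, key=lambda t: average_loss(t, data))

threshold = best_threshold(fish)
print(threshold, average_loss(threshold, fish))
```

With unequal costs (say, mislabeling one species is worse than the other), the same search applies after replacing the 0-1 count with a weighted sum.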

Lightness as a feature for classifying fish

Length and lightness together as features
  Not unusual to have millions of features
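With two features, the simplest decision boundary is a straight line. The sketch below fits one with the perceptron algorithm, which is a swapped-in technique chosen for brevity and is not named on the slide. The data, the rescaling to [0, 1], and the label encoding are assumptions for illustration.

```python
# Hypothetical fish described by (length, lightness), both rescaled to [0, 1].
# Labels: -1 = salmon, +1 = sea bass. All values are invented for illustration.
data = [
    ((0.20, 0.30), -1), ((0.30, 0.20), -1), ((0.25, 0.40), -1), ((0.35, 0.35), -1),
    ((0.70, 0.80), +1), ((0.80, 0.60), +1), ((0.60, 0.75), +1), ((0.75, 0.90), +1),
]

def score(x, w, b):
    """Signed score of x relative to the line w[0]*x[0] + w[1]*x[1] + b = 0."""
    return w[0] * x[0] + w[1] * x[1] + b

def classify(x, w, b):
    """Return +1 on the positive side of the line, otherwise -1."""
    return 1 if score(x, w, b) > 0 else -1

def train_perceptron(examples, max_epochs=100):
    """Nudge the line toward every misclassified example until none remain."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, y in examples:
            if y * score(x, w, b) <= 0:  # wrong side of (or exactly on) the line
                w[0] += y * x[0]
                w[1] += y * x[1]
                b += y
                mistakes += 1
        if mistakes == 0:  # every training example is classified correctly
            break
    return w, b

w, b = train_perceptron(data)
print(w, b)
```

The same code extends to more features by looping over the coordinates of x instead of writing out two terms.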

More complex decision boundaries

Training set error versus test set error
  Occam's razor
  Bias-variance dilemma
  More data!
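The gap between training and test error can be illustrated with a small sketch. It compares a flexible rule (1-nearest neighbor, which memorizes the training set) with a simple threshold rule on invented one-dimensional data that contains two noisy labels. The data, the species names, and the fixed threshold of 5.0 are assumptions for illustration.

```python
# Hypothetical (length, label) pairs; the fish at 3.0 and 8.0 carry noisy labels.
train = [
    (1.0, "salmon"), (2.0, "salmon"), (3.0, "sea bass"), (4.0, "salmon"),
    (6.0, "sea bass"), (7.0, "sea bass"), (8.0, "salmon"), (9.0, "sea bass"),
]
# Hypothetical held-out examples with clean labels.
test = [
    (1.5, "salmon"), (2.9, "salmon"), (3.2, "salmon"),
    (5.5, "sea bass"), (7.9, "sea bass"), (8.4, "sea bass"),
]

def nearest_neighbor(x):
    """Complex boundary: copy the label of the closest training example."""
    return min(train, key=lambda pair: abs(pair[0] - x))[1]

def threshold_rule(x):
    """Simple boundary: a single cut at an assumed length of 5.0."""
    return "sea bass" if x > 5.0 else "salmon"

def error_rate(classifier, examples):
    """Fraction of examples the classifier labels incorrectly."""
    return sum(classifier(x) != y for x, y in examples) / len(examples)

for name, clf in [("1-NN", nearest_neighbor), ("threshold", threshold_rule)]:
    print(name, error_rate(clf, train), error_rate(clf, test))
```

On this toy data the memorizing rule has zero training error but the higher test error, while the simpler rule makes training mistakes yet generalizes better, which is the pattern Occam's razor and the bias-variance dilemma describe.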

Recap: designing a fish classifier
  1. Choose the features (usually the most important step!)
  2. Collect training data
  3. Choose the model (e.g., shape of decision boundary)
  4. Estimate the model from training data
  5. Use the model to classify new examples
  Machine learning is about the last 3 steps
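The five steps can be sketched end to end. The example below assumes a nearest-centroid model (one mean feature vector per class), which is only one of many possible model choices; the features, species names, and numbers are invented for illustration.

```python
from math import dist

# Step 1: the chosen features are (length in cm, lightness on a 0-10 scale).
# Step 2: hypothetical training data; every value is invented for illustration.
training_data = [
    ((30.0, 2.0), "salmon"),
    ((35.0, 3.0), "salmon"),
    ((40.0, 2.5), "salmon"),
    ((55.0, 6.0), "sea bass"),
    ((60.0, 7.0), "sea bass"),
    ((65.0, 6.5), "sea bass"),
]

# Step 3: the assumed model is one centroid (mean feature vector) per class.
def estimate_model(data):
    """Step 4: estimate each class centroid from the training data."""
    grouped = {}
    for features, label in data:
        grouped.setdefault(label, []).append(features)
    return {
        label: tuple(sum(column) / len(points) for column in zip(*points))
        for label, points in grouped.items()
    }

def classify(features, centroids):
    """Step 5: assign the label whose centroid is closest to the new example."""
    return min(centroids, key=lambda label: dist(centroids[label], features))

centroids = estimate_model(training_data)
print(centroids)
print(classify((38.0, 3.0), centroids))
```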

Supervised versus unsupervised learning
  Supervised learning
    Training data includes the labels we have to predict
    i.e., labels are visible variables in training data
  Unsupervised learning
    Training data does not include labels
    i.e., labels are hidden variables in training data
  For classification problems, unsupervised learning is usually a kind of clustering

Unsupervised learning for classifying fish
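Clustering unlabeled fish measurements might look like the sketch below, which uses k-means as one common clustering method (the slides do not name a specific algorithm). The points are invented, and initializing the centers from the first k points is a simplification assumed here to keep the example deterministic.

```python
from math import dist

# Hypothetical unlabeled measurements (two features per fish); invented values.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.5), (8.0, 8.0), (8.5, 9.0), (9.0, 8.5)]

def mean(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(column) / len(cluster) for column in zip(*cluster))

def k_means(data, k, max_iterations=100):
    """Alternate between assigning points to centers and recomputing centers."""
    centers = list(data[:k])  # simplistic initialization: the first k points
    labels = None
    for _ in range(max_iterations):
        new_labels = [
            min(range(k), key=lambda j: dist(p, centers[j])) for p in data
        ]
        if new_labels == labels:  # assignments stopped changing: converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(data, labels) if lab == j]
            if members:  # keep the old center if a cluster becomes empty
                centers[j] = mean(members)
    return labels, centers

labels, centers = k_means(points, k=2)
print(labels)
print(centers)
```

The cluster indices play the role of the hidden labels: the algorithm never sees species names, only groups of similar measurements.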

Machine Learning Problems
              Supervised Learning                 Unsupervised Learning
  Discrete    classification or categorization    clustering
  Continuous  regression                          dimensionality reduction
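For the continuous, supervised case (regression), a minimal sketch is an ordinary least-squares line fit. The example assumes the task of predicting a fish's weight from its length; the task and the numbers are hypothetical.

```python
# Hypothetical (length, weight) measurements; invented for illustration.
lengths = [1.0, 2.0, 3.0, 4.0]
weights = [2.1, 3.9, 6.2, 7.8]

def fit_line(xs, ys):
    """Closed-form least squares for the model y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return slope, intercept

slope, intercept = fit_line(lengths, weights)

def predict_weight(length):
    """Continuous prediction for a novel example."""
    return slope * length + intercept

print(slope, intercept, predict_weight(5.0))
```

Unlike the classifiers in the discrete row of the table, the output here is a real number, not a class label.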

Machine Learning Buzzwords
  Bayesian and frequentist estimation
  Model selection, cross-validation, overfitting
  Kernel methods: support vector machines (SVMs), Gaussian processes
  Graphical models: hidden Markov models, Markov random fields, belief propagation
  Expectation-Maximization (EM) algorithm
  Markov chain Monte Carlo (MCMC) methods