Lecture Slides for. ETHEM ALPAYDIN The MIT Press, 2010

Similar documents
Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

CSL465/603 - Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Welcome to. ECML/PKDD 2004 Community meeting

Rule Learning With Negation: Issues Regarding Effectiveness

Lecture 1: Machine Learning Basics

Axiom 2013 Team Description Paper

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Laboratorio di Intelligenza Artificiale e Robotica

Rule Learning with Negation: Issues Regarding Effectiveness

Laboratorio di Intelligenza Artificiale e Robotica

Mining Association Rules in Student s Assessment Data

Python Machine Learning

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

A Comparison of Standard and Interval Association Rules

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Reinforcement Learning by Comparing Immediate Reward

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Top US Tech Talent for the Top China Tech Company

Computerized Adaptive Psychological Testing A Personalisation Perspective

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Neural Network Model of the Backpropagation Algorithm

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Lecture 10: Reinforcement Learning

Georgetown University at TREC 2017 Dynamic Domain Track

Applications of data mining algorithms to analysis of medical data

Shockwheat. Statistics 1, Activity 1

Lecture 1: Basic Concepts of Machine Learning

TUESDAYS/THURSDAYS, NOV. 11, 2014-FEB. 12, 2015 x COURSE NUMBER 6520 (1)

arxiv: v1 [cs.lg] 15 Jun 2015

TD(λ) and Q-Learning Based Ludo Players

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Handling Concept Drifts Using Dynamic Selection of Classifiers

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Consultation skills teaching in primary care TEACHING CONSULTING SKILLS * * * * INTRODUCTION

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Softprop: Softmax Neural Network Backpropagation Learning

Learning Methods for Fuzzy Systems

Probabilistic Latent Semantic Analysis

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Intelligent Agents. Chapter 2. Chapter 2 1

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Exploration. CS : Deep Reinforcement Learning Sergey Levine

. Town of birth. Nationality. address)

Instructional Approach(s): The teacher should introduce the essential question and the standard that aligns to the essential question

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

A Case-Based Approach To Imitation Learning in Robotic Agents

Semi-Supervised Face Detection

Learning From the Past with Experiment Databases

Reducing Features to Improve Bug Prediction

Evolutive Neural Net Fuzzy Filtering: Basic Description

A Reinforcement Learning Variant for Control Scheduling

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Geospatial Visual Analytics Tutorial. Gennady Andrienko & Natalia Andrienko

Pp. 176{182 in Proceedings of The Second International Conference on Knowledge Discovery and Data Mining. Predictive Data Mining with Finite Mixtures

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

CS Machine Learning

COBRA: A Fast and Simple Method for Active Clustering with Pairwise Constraints

Time series prediction

Australian Journal of Basic and Applied Sciences

Measures of the Location of the Data

An Effective Framework for Fast Expert Mining in Collaboration Networks: A Group-Oriented and Cost-Based Method

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Word Segmentation of Off-line Handwritten Documents

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Artificial Neural Networks written examination

Lecture 6: Applications

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Food Products Marketing

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Soft Computing based Learning for Cognitive Radio

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Constraining X-Bar: Theta Theory

Mining Student Evolution Using Associative Classification and Clustering

Computed Expert System of Support Technology Tests in the Process of Investment Casting Elements of Aircraft Engines

FF+FPG: Guiding a Policy-Gradient Planner

DOCTOR OF PHILOSOPHY HANDBOOK

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Transfer Learning with Applications

Assignment 1: Predicting Amazon Review Ratings

City University of Hong Kong Course Syllabus. offered by Department of Architecture and Civil Engineering with effect from Semester A 2017/18

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Journal title ISSN Full text from

Causal Link Semantics for Narrative Planning Using Numeric Fluents

ONE YEAR IN BARCELONA, PART I+II

arxiv: v2 [cs.ir] 22 Aug 2016

ACCOUNTING FOR MANAGERS BU-5190-AU7 Syllabus

Transcription:

Lecture Slides for ETHEM ALPAYDIN The MIT Press, 2010

Why Learn? Machine learning is programming computers to opimize a performance criterion using example data or past experience. There is no need to learn to calculate payroll Learning is used when: Human experise does not exist (navigaing on Mars), Humans are unable to explain their experise (speech recogniion) SoluIon changes in Ime (rouing on a computer network) SoluIon needs to be adapted to paricular cases (user biometrics) 3

What We Talk About When We Talk About Learning Learning general models from a data of paricular examples Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce. Example in retail: Customer transacions to consumer behavior: People who bought Blink also bought Outliers (www.amazon.com) Build a model that is a good and useful approxima@on to the data. 4

Data Mining Retail: Market basket analysis, Customer relaionship management (CRM) Finance: Credit scoring, fraud detecion Manufacturing: Control, roboics, troubleshooing Medicine: Medical diagnosis TelecommunicaIons: Spam filters, intrusion detecion BioinformaIcs: MoIfs, alignment Web mining: Search engines... 5

What is Machine Learning? OpImize a performance criterion using example data or past experience. Role of StaIsIcs: Inference from a sample Role of Computer science: Efficient algorithms to Solve the opimizaion problem RepresenIng and evaluaing the model for inference 6

ApplicaIons AssociaIon Supervised Learning ClassificaIon Regression Unsupervised Learning Reinforcement Learning 7

Learning AssociaIons Basket analysis: P (Y X ) probability that somebody who buys X also buys Y where X and Y are products/services. Example: P ( chips beer ) = 0.7 8

ClassificaIon Example: Credit scoring DifferenIaIng between low- risk and high- risk customers from their income and savings Discriminant: IF income > θ 1 AND savings > θ 2 THEN low- risk ELSE high- risk 9

ClassificaIon: ApplicaIons Aka Pakern recogniion Face recogniion: Pose, lighing, occlusion (glasses, beard), make- up, hair style Character recogniion: Different handwriing styles. Speech recogniion: Temporal dependency. Medical diagnosis: From symptoms to illnesses Biometrics: RecogniIon/authenIcaIon using physical and/or behavioral characterisics: Face, iris, signature, etc... 10

Face RecogniIon Training examples of a person Test images ORL dataset, AT&T Laboratories, Cambridge UK 11

Regression Example: Price of a used car x : car akributes y : price y = g (x θ ) g ( ) model, θ parameters y = wx+w 0 12

Regression ApplicaIons NavigaIng a car: Angle of the steering KinemaIcs of a robot arm (x,y) α 1 = g 1 (x,y) α 2 = g 2 (x,y) α 2 α 1 n Response surface design 13

Supervised Learning: Uses PredicIon of future cases: Use the rule to predict the output for future inputs Knowledge extracion: The rule is easy to understand Compression: The rule is simpler than the data it explains Outlier detecion: ExcepIons that are not covered by the rule, e.g., fraud 14

Unsupervised Learning Learning what normally happens No output Clustering: Grouping similar instances Example applicaions Customer segmentaion in CRM Image compression: Color quanizaion BioinformaIcs: Learning moifs 15

Reinforcement Learning Learning a policy: A sequence of outputs No supervised output but delayed reward Credit assignment problem Game playing Robot in a maze MulIple agents, parial observability,... 16

Resources: Datasets UCI Repository: hkp://www.ics.uci.edu/~mlearn/mlrepository.html UCI KDD Archive: hkp://kdd.ics.uci.edu/summary.data.applicaion.html Statlib: hkp://lib.stat.cmu.edu/ Delve: hkp://www.cs.utoronto.ca/~delve/ 17

Resources: Journals Journal of Machine Learning Research www.jmlr.org Machine Learning Neural ComputaIon Neural Networks IEEE TransacIons on Neural Networks IEEE TransacIons on Pakern Analysis and Machine Intelligence Annals of StaIsIcs Journal of the American StaIsIcal AssociaIon... 18

Resources: Conferences InternaIonal Conference on Machine Learning (ICML) European Conference on Machine Learning (ECML) Neural InformaIon Processing Systems (NIPS) Uncertainty in ArIficial Intelligence (UAI) ComputaIonal Learning Theory (COLT) InternaIonal Conference on ArIficial Neural Networks (ICANN) InternaIonal Conference on AI & StaIsIcs (AISTATS) InternaIonal Conference on Pakern RecogniIon (ICPR)... 19