Machine Learning CSL 302 ARTIFICIAL INTELLIGENCE SPRING 2014

Similar documents
CSL465/603 - Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Basic Concepts of Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Rule Learning With Negation: Issues Regarding Effectiveness

Welcome to. ECML/PKDD 2004 Community meeting

Lecture 1: Machine Learning Basics

Rule Learning with Negation: Issues Regarding Effectiveness

CS Machine Learning

(Sub)Gradient Descent

Python Machine Learning

Top US Tech Talent for the Top China Tech Company

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Probabilistic Latent Semantic Analysis

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Learning Methods for Fuzzy Systems

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Australian Journal of Basic and Applied Sciences

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Team Formation for Generalized Tasks in Expertise Social Networks

Intelligent Agents. Chapter 2. Chapter 2 1

MYCIN. The MYCIN Task

A Case-Based Approach To Imitation Learning in Robotic Agents

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

Switchboard Language Model Improvement with Conversational Data from Gigaword

Exposé for a Master s Thesis

Human Emotion Recognition From Speech

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

A survey of multi-view machine learning

Linking Task: Identifying authors and book titles in verbose queries

Discriminative Learning of Beam-Search Heuristics for Planning

Axiom 2013 Team Description Paper

Assignment 1: Predicting Amazon Review Ratings

Probability and Statistics Curriculum Pacing Guide

Mining Student Evolution Using Associative Classification and Clustering

A Neural Network GUI Tested on Text-To-Phoneme Mapping

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Applications of data mining algorithms to analysis of medical data

Constructive Induction-based Learning Agents: An Architecture and Preliminary Experiments

Reinforcement Learning by Comparing Immediate Reward

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Word Segmentation of Off-line Handwritten Documents

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

What is a Mental Model?

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

How do adults reason about their opponent? Typologies of players in a turn-taking game

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

AQUA: An Ontology-Driven Question Answering System

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

Mining Association Rules in Student s Assessment Data

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Time series prediction

Characteristics of Collaborative Network Models. ed. by Line Gry Knudsen

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

This scope and sequence assumes 160 days for instruction, divided among 15 units.

Learning and Transferring Relational Instance-Based Policies

Evolutive Neural Net Fuzzy Filtering: Basic Description

Content-free collaborative learning modeling using data mining

Speech Recognition at ICSI: Broadcast News and beyond

Computerized Adaptive Psychological Testing A Personalisation Perspective

An Empirical and Computational Test of Linguistic Relativity

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Issues in the Mining of Heart Failure Datasets

DOCTOR OF PHILOSOPHY HANDBOOK

Learning From the Past with Experiment Databases

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Science with Kids, Science by Kids By Sally Bowers, Dane County 4-H Youth Development Educator and Tom Zinnen, Biotechnology Specialist

Disciplinary Literacy in Science

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Customized Question Handling in Data Removal Using CPHC

Cooperative evolutive concept learning: an empirical study

Self-Supervised Acquisition of Vowels in American English

Unit 3: Lesson 1 Decimals as Equal Divisions

Active Ingredients of Instructional Coaching Results from a qualitative strand embedded in a randomized control trial

Reduce the Failure Rate of the Screwing Process with Six Sigma Approach

KLI: Infer KCs from repeated assessment events. Do you know what you know? Ken Koedinger HCI & Psychology CMU Director of LearnLab

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

An Effective Framework for Fast Expert Mining in Collaboration Networks: A Group-Oriented and Cost-Based Method

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

TextGraphs: Graph-based algorithms for Natural Language Processing

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Handling Concept Drifts Using Dynamic Selection of Classifiers

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Transcription:

Machine Learning CSL 302 ARTIFICIAL INTELLIGENCE SPRING 2014

Reference Material CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 2

What is Learning? In the Human context? CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 3

What is Learning? Webster ogain knowledge or understanding of or skill in by study, instruction or experience omemorize Sleep learning? (www.links999.net) Synonym: Discovery oobtain knowledge of for the first time omay imply acquiring knowledge with little effort or conscious intention (as by simply being told) or it may imply study and practice Knowledge oknowing something with familiarity gained through experience or association ofacts or ideas acquired by study, investigation, observation, or experience CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 4

CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 5

What is Machine Learning? Herbert Simon (1970) oany process by which a system improves its performance. Tom Mitchell (1990) oa computer program that improves its performance at some task through experience. Ethem Alpaydin (2010) oprogramming computers to optimize a performance criterion using example data or past experience. CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 6

Why Do Machine Learning? Automated knowledge engineering oexpertise is scarce ocodification of expertise is difficult oexpertise frequently consists of a set of test cases Only one computer has to learn, then Ctrl+C Ctrl+P Discover new knowledge Understand human learning CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 7

Applications Medical diagnosis Autonomous driving Speech recognition Recommendation Systems Prediction (financial, climate, health, energy) Fraud/intrusion detection Activity recognition Computer Vision And the list goes on and on CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 8

Related Disciplines Statistics Pattern recognition Signal processing Artificial intelligence Data mining Neuroscience Cognitive science Psychology CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 9

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 10

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 11

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 12

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 13

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 14

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 15

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 16

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 17

Approaches Supervised Learning o Classification o Regression Unsupervised Learning o Clustering o Rule Mining Semi-supervised Learning Reinforcement Learning Transfer Learning Active Learning Online Learning Dimensionality Reduction CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 18

Other ML Issues Evaluation owhich learning approach is better Theoretical bounds owhat is and is not learnable Scalability olearning from massive datasets CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 19

Resources Software oweka (www.cs.waikato.ac.nz/~ml/weka) omachine learning open-source software (mloss.org) Data ouci ML Repository (archive.ics.uci.edu/ml) ouci KDD Repository (kdd.ics.uci.edu) ochallenges: KDD-Cup, Netflix, CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 20

Resources Conferences ointernational Conference on Machine Learning (ICML) oknowledge Discovery and Data Mining (KDD) oieee Conference on Data Mining (ICDM) osiam Data Mining Conference (SDM) oassociation for the Advancement of AI (AAAI) conference ointernational Joint Conference on AI (IJCAI) ointernational Conference on Machine Learning Applications omany more CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 21

Resources Journals omachine Learning Journal ojournal of Machine Learning Research odata Mining and Knowledge Discovery omany more WWW owww.kdnuggets.com (subscribe!) CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 22

Supervised Learning Learning task olearn to classify cars into one of two classes: family car or other oeach car is represented by two features (attributes): engine power and price ogiven several training examples of already-classified cars ooutput classifier that accurately classifies cars CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 23

Example: Family Car CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 24

Definitions Feature (attribute): x i o A property of the object to be classified o Discrete or continuous o e.g., engine power, price Instance: x = [x 1, x 2,, x d ] o The feature values for a specific object o e.g., engine power = 100, price = high Instance space: I o Space of all possible instances Class: C o Categorical feature of an object o Set of instances of objects in this category o e.g., family car CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 25

Example: Family Car Class CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 26

Definitions Example: (x, r) o Instance along with its class membership r o Positive example: member of class (r = 1) o Negative example: not a member of class (r = 0) Training set: X = {x t, r t }, 1 t N o Set of N examples Target concept (C) o Correct expression of class o E.g., (e 1 engine power e 2 ) and (p 1 price p 2 ) Concept class o Space of all possible target concepts o E.g., axis-aligned rectangles in instance space o E.g., power set of instance space CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 27

Definitions Hypothesis: h(x) {0,1} oapproximation to target concept Hypothesis class: H ospace of all possible hypotheses oe.g., axis-aligned rectangles oe.g., axis-aligned ellipses Learning goal ofind hypothesis h H that closely approximates target concept C oh is the output classifier otarget concept may not be in H CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 28

Example: Hypothesis Error Note: h is consistent with the training set, but not the target concept C. CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 29

Definitions Empirical error ohow well h classifies training set X N 1 t t E( h X ) 1( h( x ) r ) N t 1 Generalization error ohow well h classifies instances not in X True error ohow well h classifies entire instance space 1 E( h) 1( h( x) C( x)) I x I 1(expr) = 1 if expr is true, else 0 CSL 302 ARTIFICIAL INTELLIGENCE, INDIAN INSTITUTE OF TECHNOLOGY ROPAR 30