TDT4173 Machine Learning and Case-Based Reasoning

Similar documents
Lecture 1: Basic Concepts of Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

(Sub)Gradient Descent

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Python Machine Learning

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Laboratorio di Intelligenza Artificiale e Robotica

CS Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Artificial Neural Networks written examination

Exploration. CS : Deep Reinforcement Learning Sergey Levine

MYCIN. The MYCIN Task

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Learning From the Past with Experiment Databases

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Learning Methods for Fuzzy Systems

Chapter 2 Rule Learning in a Nutshell

A Case Study: News Classification Based on Term Frequency

Rule Learning With Negation: Issues Regarding Effectiveness

The Strong Minimalist Thesis and Bounded Optimality

Speech Recognition at ICSI: Broadcast News and beyond

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

CSL465/603 - Machine Learning

Math 1313 Section 2.1 Example 2: Given the following Linear Program, Determine the vertices of the feasible set. Subject to:

Diagnostic Test. Middle School Mathematics

Lecture 10: Reinforcement Learning

Proof Theory for Syntacticians

A Version Space Approach to Learning Context-free Grammars

Axiom 2013 Team Description Paper

Assignment 1: Predicting Amazon Review Ratings

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

STA 225: Introductory Statistics (CT)

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Scientific Method Investigation of Plant Seed Germination

ALL-IN-ONE MEETING GUIDE THE ECONOMICS OF WELL-BEING

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Person Centered Positive Behavior Support Plan (PC PBS) Report Scoring Criteria & Checklist (Rev ) P. 1 of 8

Book Review: Build Lean: Transforming construction using Lean Thinking by Adrian Terry & Stuart Smith

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Medical Complexity: A Pragmatic Theory

Rule-based Expert Systems

Evolutive Neural Net Fuzzy Filtering: Basic Description

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

TU-E2090 Research Assignment in Operations Management and Services

The Good Judgment Project: A large scale test of different methods of combining expert predictions

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

ECE-492 SENIOR ADVANCED DESIGN PROJECT

Rule Learning with Negation: Issues Regarding Effectiveness

Cooperative evolutive concept learning: an empirical study

Getting Started with Deliberate Practice

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Human Emotion Recognition From Speech

Using focal point learning to improve human machine tacit coordination

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

A Reinforcement Learning Variant for Control Scheduling


Word Segmentation of Off-line Handwritten Documents

Innovative Methods for Teaching Engineering Courses

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Ruggiero, V. R. (2015). The art of thinking: A guide to critical and creative thought (11th ed.). New York, NY: Longman.

SARDNET: A Self-Organizing Feature Map for Sequences

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning

EXAMINING THE DEVELOPMENT OF FIFTH AND SIXTH GRADE STUDENTS EPISTEMIC CONSIDERATIONS OVER TIME THROUGH AN AUTOMATED ANALYSIS OF EMBEDDED ASSESSMENTS

LEGO MINDSTORMS Education EV3 Coding Activities

Science Fair Project Handbook

Carnegie Mellon University Department of Computer Science /615 - Database Applications C. Faloutsos & A. Pavlo, Spring 2014.

A study of speaker adaptation for DNN-based speech synthesis

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Probability and Statistics Curriculum Pacing Guide

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Getting Started with TI-Nspire High School Science

Foothill College Summer 2016

Calibration of Confidence Measures in Speech Recognition

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

TCC Jim Bolen Math Competition Rules and Facts. Rules:

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

ACCOUNTING FOR MANAGERS BU-5190-OL Syllabus

Introduction to Questionnaire Design

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Detailed course syllabus

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Outreach Connect User Manual

Classify: by elimination Road signs

CORRELATION FLORIDA DEPARTMENT OF EDUCATION INSTRUCTIONAL MATERIALS CORRELATION COURSE STANDARDS / BENCHMARKS. 1 of 16

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

ACCOUNTING FOR MANAGERS BU-5190-AU7 Syllabus

Science with Kids, Science by Kids By Sally Bowers, Dane County 4-H Youth Development Educator and Tom Zinnen, Biotechnology Specialist

Probability estimates in a scenario tree

Assessment System for M.S. in Health Professions Education (rev. 4/2011)

Airplane Rescue: Social Studies. LEGO, the LEGO logo, and WEDO are trademarks of the LEGO Group The LEGO Group.

Softprop: Softmax Neural Network Backpropagation Learning

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

FIGURE IT OUT! MIDDLE SCHOOL TASKS. Texas Performance Standards Project

Transcription:

TDT4173 Machine Learning and Case-Based Reasoning Lecture 1 Introduction Norwegian University of Science and Technology Helge Langseth and Anders Kofod-Petersen 1 TDT4173 Machine Learning and Case-Based Reasoning

Outline 1 Introduction to Machine learning Machine learning overview Examples The Learning Problem 2 Practical information About TDT4173 The other stuff 3 Checkers 4 Learning From Examples EnjoySport - example The Inductive Learning Hypothesis Find-S Version Spaces Learning Bias 5 Summary 2 TDT4173 Machine Learning and Case-Based Reasoning

The grand vision Introduction to Machine learning Machine learning overview An autonomous self-moving machine that acts, reasons, and learnslike a human We are still very far from achieving this... 3 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning Why Machine Learning Machine learning overview Recent progress in algorithms and theory Growing flood of online data Computational power is available Budding industry Three niches for machine learning: Data mining: using historical data to improve decisions - medical records medical knowledge Software applications we can t program by hand - autonomous driving - speech recognition Self customizing programs - Recommendation systems 4 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning Examples Typical Datamining Task Data: Given: 9714 patient records, each describing a pregnancy and birth Each patient record contains 215 features Learn to predict: Classes of future patients at high risk for Emergency Cesarean Section 5 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning Examples Problems Too Difficult to Program by Hand ALVINN [Pomerleau] drives 70 mph on highways Sharp Left Straight Ahead Sharp Right 30 Output Units 4 Hidden Units 30x32 Sensor Input Retina 6 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning Examples Software that Customizes to User http://www.last.fm 7 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning Where Is this Headed? Examples Today: tip of the iceberg: First-generation algorithms: neural nets, decision trees, regression... Applied to well-formatted database Budding industry Opportunity for tomorrow: enormous impact: Learn across full mixed-media data Learn by active experimentation Cumulative, lifelong learning Programming languages with learning embedded?... etc. (Only your imagination limits this list!) 8 TDT4173 Machine Learning and Case-Based Reasoning

Introduction to Machine learning What is Machine Learning? The Learning Problem Learning = Improving with experience at some task Improve over task T, with respect to performance measure P, based on experience E. Definition Machine Learning Methods and techniques that makes computer systems able to update its knowledge and problem-solving ability. Task Experience Task Program Program Program Performance Performance 9 TDT4173 Machine Learning and Case-Based Reasoning

TDT4173 Practical information About TDT4173 Goals of the course: The course will give a basic insight into principles and methods for how computer systems can learn from its own experience. Syllabus: The text-book Machine Learning by Tom Mitchell. A number of papers top be decided and made available How to get it... Book available at Tapir. Papers will be made available for downloaded from our webpage 10 TDT4173 Machine Learning and Case-Based Reasoning

Exercises Practical information About TDT4173 Designed to give hands-on experience with the different machine learning methods we talk about Will contain both coding tasks as well as requirements towards discussions Typically given with a one-and-a-half week deadline. Time of delivery Thursday at 20:00. 1 big and 4 small assignments. NB! Counts towards final grade All exercises count towards the final grade: If you fail one assignment, you will automatically take off 3.3% (6.7% for the big one) of the total available score. 11 TDT4173 Machine Learning and Case-Based Reasoning

Paper presentation Practical information About TDT4173 A number of classic texts Papers to be presented by students Will make up the last part of the semester course NB! Counts towards final grade Each student must participate in presenting at least one paper. Otherwise, 3.3% of the total available score will be taken off. 12 TDT4173 Machine Learning and Case-Based Reasoning

Examination Practical information The other stuff December 6th Written, 4 hours Counts for 80% of the final grade; the other 20% are determined by no. assignments handed in + participation in paper presentation. 13 TDT4173 Machine Learning and Case-Based Reasoning

Getting information Practical information The other stuff Sources for information: Check the web-page http://www.idi.ntnu.no/emner/tdt4173/ It s Learning Ask (Contact info on web-page.) 14 TDT4173 Machine Learning and Case-Based Reasoning

Reference group Practical information The other stuff If you want, we can have a reference group. Not much work (if all goes well): Evaluation meeting(s) Evaluation report Students spokesman if there is something I should take into account It is not required, but if two students to volunteer to be in the ref.grp., it may smooth of any problems we meet as we move along. 15 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Learning to Play Checkers T: Play checkers P: Percent of games won in world tournament E: opportunity to play against self 16 TDT4173 Machine Learning and Case-Based Reasoning

Design Choices Checkers What experience can we learn from? What exactly should be learned? How shall it be represented? Target function: collection of rules? neural network? polynomial function of board features?... What specific algorithm can we use to learn it? 17 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Type of knowledge learned We wish to learn a function that for any given board position B chooses the best move M, ChooseMove:B M. 18 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Type of knowledge learned We wish to learn a function that for any given board position B chooses the best move M, ChooseMove:B M. Direct training: Examples of individual checkers board states and the correct move for each. Indirect training: Examples of sequences of moves and final outcomes of the various games played. Indirect training makes ChooseMove impractical to learn: If we end up winning, is the first move then optimal? 18 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Approximation The start of the learning work Instead of ChooseMove, we establish a Value function V : V : B R that maps legal board states B into some real value. Playing rule:for any board position, choose the move that maximizes the value of the resulting board position. 19 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Approximation The start of the learning work Instead of ChooseMove, we establish a Value function V : V : B R that maps legal board states B into some real value. Playing rule:for any board position, choose the move that maximizes the value of the resulting board position. 1 if b is a final board state that is won, then V(b) = 100 2 if b is a final board state that is lost, then V(b) = 100 3 if b is a final board state that is drawn, then V(b) = 0 4 if b is a not a final state in the game, then V(b) =?? 19 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Approximation The start of the learning work Instead of ChooseMove, we establish a Value function V : V : B R that maps legal board states B into some real value. Playing rule:for any board position, choose the move that maximizes the value of the resulting board position. 1 if b is a final board state that is won, then V(b) = 100 2 if b is a final board state that is lost, then V(b) = 100 3 if b is a final board state that is drawn, then V(b) = 0 4 if b is a not a final state in the game, then V(b) = V(b ), b is the best final board state that can be achieved starting from b and playing optimally until the end of the game. It is still not trivial... 19 TDT4173 Machine Learning and Case-Based Reasoning

Checkers What is of importance? x 1 : # black pieces x 3 : # black kings x 5 : # white pieces threatened x 2 : # white pieces x 4 : # white kings x 6 : # black pieces threatened 20 TDT4173 Machine Learning and Case-Based Reasoning

Checkers What is of importance? x 1 : # black pieces x 3 : # black kings x 5 : # white pieces threatened x 2 : # white pieces x 4 : # white kings x 6 : # black pieces threatened Approximation: ˆV(b) = w 0 +w 1 x 1 +w 2 x 2 +w 3 x 3 +w 4 x 4 +w 5 x 5 +w 6 x 6, where w i is the weight assigned to x i. Learning task: Determine the weights w 0, w 1, w 2, w 3, w 4, w 5, and w 6. 20 TDT4173 Machine Learning and Case-Based Reasoning

How to learn Checkers In order to learn ˆV, we require a set of training examples,each describing a board state b and a training value V train for b: Current weights: w 0, w 1, w 2, w 3, w 4, w 5, and w 6 yield ˆV. New game: b 1,b 2,...,b end What value should V train (b i ) should we attach to position b i? 21 TDT4173 Machine Learning and Case-Based Reasoning

How to learn Checkers In order to learn ˆV, we require a set of training examples,each describing a board state b and a training value V train for b: Current weights: w 0, w 1, w 2, w 3, w 4, w 5, and w 6 yield ˆV. New game: b 1,b 2,...,b end What value should V train (b i ) should we attach to position b i? Idea: When faced with a situation b k, both players do the best they can resulting in b end. 21 TDT4173 Machine Learning and Case-Based Reasoning

How to learn Checkers In order to learn ˆV, we require a set of training examples,each describing a board state b and a training value V train for b: Current weights: w 0, w 1, w 2, w 3, w 4, w 5, and w 6 yield ˆV. New game: b 1,b 2,...,b end What value should V train (b i ) should we attach to position b i? Idea: When faced with a situation b k, both players do the best they can resulting in b end. In general: V train (b i ) ˆV(b i+1 ) This makes sense if ˆV is more accurate for board states closer to the end of the game. 21 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Ehhh... And what does this mean? Current state: b i Next state: b i+1 The (system believes that) situation b i b i+1 Therefore V train (b i ) = V(b i+1 ) V(b i+1 ) is unknown, but assuming the system is very good, we have ˆV(b i+1 ) V(b i+1 ). Thus, we decide that V train (b i ) ˆV(b i+1 ). 22 TDT4173 Machine Learning and Case-Based Reasoning

And will it work? Checkers Can we get reasonable training data? We know what V(b end ) is for any state b end Using the previous setup, we should therefore be able to value situations that are onestep away from being finished!...and using the same setup again, we should next be able to value situations that are twosteps away from being finished...and so on This should work, but we need to be able to use the training data... 23 TDT4173 Machine Learning and Case-Based Reasoning

Checkers How to learn the weights Current weights:w 0, w 1, w 2, w 3, w 4, w 5, and w 6 yield ˆV. New game:< b 1,V train (b 1 ) >,...,< b end,v train (b end ) >. 24 TDT4173 Machine Learning and Case-Based Reasoning

Checkers How to learn the weights Current weights:w 0, w 1, w 2, w 3, w 4, w 5, and w 6 yield ˆV. New game:< b 1,V train (b 1 ) >,...,< b end,v train (b end ) >. Idea:Introduce error function E, and change weights such that the total error over all training examples is minimal. E = <b,v train (b)> training examples ( V train (b) ˆV(b) ) 2 Note: E is a function of the weights, E = E(w 0,w 1,w 2,w 3,w 4,w 5,w 6 ), and we will change the weights to make E obtain its minimal value. 24 TDT4173 Machine Learning and Case-Based Reasoning

Checkers LMS Weight update rule For each training set < b,v train (b) > do: Use the current weights to calculate ˆV(b). For each weight w i do: w i w i +µ x i where µ is the learning rate. ( V train (b) ˆV(b) ) 25 TDT4173 Machine Learning and Case-Based Reasoning

Checkers Implementation: A 5th year Student Project Results 26 TDT4173 Machine Learning and Case-Based Reasoning

Design Choices Checkers Determine Type of Training Experience Games against experts Games against self Table of correct moves... Determine Target Function Board move Board value... Determine Representation of Learned Function Polynomial Linear function of six features Artificial neural network... Determine Learning Algorithm Completed Design Gradient descent Linear programming... 27 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Training Examples for EnjoySport EnjoySport - example Index Sky Temp Humid Wind Water Forecast EnjoySport 1 Sunny Warm Normal Strong Warm Same Yes 2 Sunny Warm High Strong Warm Same Yes 3 Rainy Cold High Strong Warm Change No 4 Sunny Warm High Strong Cool Change Yes What is the general concept? 28 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Training Examples for EnjoySport EnjoySport - example Index Sky Temp Humid Wind Water Forecast EnjoySport 1 Sunny Warm Normal Strong Warm Same Yes 2 Sunny Warm High Strong Warm Same Yes 3 Rainy Cold High Strong Warm Change No 4 Sunny Warm High Strong Cool Change Yes Sky = Sunny? What is the general concept? Sky = Sunny AND Temp = Warm? Forecast = Same OR Water = Cool? When Index is written in binary digits it requires one1? 28 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Representing Hypotheses EnjoySport - example Many possible representations Here, h is conjunction of constraints on attributes Each constraint can be a specific value (e.g., Water = Warm ) don t care (e.g., Water =? ) no value allowed (e.g., Water = ) For example, Sky AirTemp Humid Wind Water Forecast Sunny?? Strong? Same 29 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Prototypical Concept Learning Task The Inductive Learning Hypothesis Given: Instances X: Possible days, each described by the attributes Sky, AirTemp, Humidity, Wind, Water, Forecast Target function c: EnjoySport: X {0, 1} Hypotheses H: Conjunctions of literals, e.g.,?,cold,high,?,?,?. Training examples D: Positive and negative examples of the target function x 1,c(x 1 ),... x m,c(x m ) Determinea hypothesis h Hsuch that x D : h(x) = c(x). 30 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Prototypical Concept Learning Task The Inductive Learning Hypothesis Given: Instances X: Possible days, each described by the attributes Sky, AirTemp, Humidity, Wind, Water, Forecast Target function c: EnjoySport: X {0, 1} Hypotheses H: Conjunctions of literals, e.g.,?,cold,high,?,?,?. Training examples D: Positive and negative examples of the target function x 1,c(x 1 ),... x m,c(x m ) Determinea hypothesis h Hsuch that x D : h(x) = c(x). The inductive learning hypothesis:any hypothesis found to approximate the target function well over a sufficiently large set of training examples will also approximate the target function well over other unobserved examples. 30 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples The Inductive Learning Hypothesis Instance, Hypotheses, and More-General-Than Instances X Hypotheses H Specific x 1 x 2 h 1 h 2 h 3 General x 1 = <Sunny, Warm, High, Strong, Cool, Same> x = <Sunny, Warm, High, Light, Warm, Same> 2 h 1 = <Sunny,?,?, Strong,?,?> h = <Sunny,?,?,?,?,?> 2 h = <Sunny,?,?,?, Cool,?> 3 31 TDT4173 Machine Learning and Case-Based Reasoning

Find-S Algorithm Learning From Examples Find-S 1 Initialize h to the most specific hypothesis in H 2 For each positive training instance x For each attribute constraint a i in h If a i in h is satisfied by x Then do nothing Else replace a i in h by the next more general constraint that is satisfied by x 3 Output hypothesis h 32 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Find-S Hypothesis Space Search by Find-S Instances X Hypotheses H - x 3 h 0 h 1 Specific x + 1 x+ 2 h 2,3 x+ 4 h 4 General x = <Sunny Warm Normal Strong Warm Same>, + 1 x 2 = <Sunny Warm High Strong Warm Same>, + x 3 = <Rainy Cold High Strong Warm Change>, - x = <Sunny Warm High Strong Cool Change>, + 4 h = <,,,,, > 0 h 1 = <Sunny Warm Normal Strong Warm Same> h 2 = <Sunny Warm? Strong Warm Same> h = <Sunny Warm? Strong Warm Same> 3 h = <Sunny Warm? Strong?? > 4 33 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Complaints about Find-S Find-S Can t tell whether it has learned concept Can t tell when training data inconsistent Picks a maximally specific h (why?) Depending on H, there might be several! 34 TDT4173 Machine Learning and Case-Based Reasoning

Version Spaces Learning From Examples Version Spaces A hypothesis h is consistent with a set of training examples D of target concept c if and only if h(x) = c(x) for each training example x,c(x) in D. Consistent(h,D) ( x,c(x) D) h(x) = c(x) The version space, VS H,D, with respect to hypothesis space H and training examples D, is the subset of hypotheses from H consistent with all training examples in D. VS H,D {h H Consistent(h,D)} 35 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Version Spaces The List-Then-Eliminate Algorithm 1 VersionSpace a list containing every hypothesis in H 2 For each training example, x,c(x) remove from VersionSpace any hypothesis h for which h(x) c(x) 3 Output the list of hypotheses in VersionSpace 36 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Example Version Space Version Spaces S: { <Sunny, Warm,?, Strong,?,?> } <Sunny,?,?, Strong,?,?> <Sunny, Warm,?,?,?,?> <?, Warm,?, Strong,?,?> G: { <Sunny,?,?,?,?,?>, <?, Warm,?,?,?,?> } 37 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Representing Version Spaces Version Spaces The General boundary, G, of version space VS H,D is the set of its maximally general members The Specific boundary, S, of version space VS H,D is the set of its maximally specific members Every member of the version space lies between these boundaries VS H,D = {h H ( s S)( g G)(g h s)} where x y means x is more general or equal to y 38 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Candidate Elimination Algorithm Version Spaces G maximally general hypotheses in H S maximally specific hypotheses in H For each training example d, do If d is a positive example Remove from G any hypothesis inconsistent with d For each hypothesis s in S that is not consistent with d Remove s from S Add to S all minimal generalizations h of s such that - h is consistent with d, and - some member of G is more general than h Remove from S any hypothesis that is more general than another hypothesis in S [CONT d] 39 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Candidate Elimination Algorithm Version Spaces [...FROM PREVIOUS SLIDE] If d is a negative example Remove from S any hypothesis inconsistent with d For each hypothesis g in G that is not consistent with d Remove g from G Add to G all minimal specializations h of g such that - h is consistent with d, and - some member of S is more specific than h Remove from G any hypothesis that is less general than another hypothesis in G 40 TDT4173 Machine Learning and Case-Based Reasoning

Example Trace Learning From Examples Version Spaces S 0 : { <,,,,, > } S 1 : { <Sunny, Warm, Normal, Strong, Warm, Same> } S 2 : { <Sunny, Warm,?, Strong, Warm, Same> } G 0, G 1, G 2 : { <?,?,?,?,?,?>} Training examples: 1. <Sunny, Warm, Normal, Strong, Warm, Same>, Enjoy Sport = Yes 2. <Sunny, Warm, High, Strong, Warm, Same>, Enjoy Sport = Yes 41 TDT4173 Machine Learning and Case-Based Reasoning

Example Trace Learning From Examples Version Spaces S 2, S 3 : { <Sunny, Warm,?, Strong, Warm, Same> } G 3 : { <Sunny,?,?,?,?,?> <?, Warm,?,?,?,?> <?,?,?,?,?, Same> } G 2: { <?,?,?,?,?,?> } Training Example: 3. <Rainy, Cold, High, Strong, Warm, Change>, EnjoySport=No 41 TDT4173 Machine Learning and Case-Based Reasoning

Example Trace Learning From Examples Version Spaces S 3 : { <Sunny, Warm,?, Strong, Warm, Same> } S 4: { <Sunny, Warm,?, Strong,?,?>} G 4: { <Sunny,?,?,?,?,?> <?, Warm,?,?,?,?>} G 3 : { <Sunny,?,?,?,?,?> <?, Warm,?,?,?,?> <?,?,?,?,?, Same> } Training Example: 4.<Sunny, Warm, High, Strong, Cool, Change>, EnjoySport = Yes 41 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Version Spaces How Should These Be Classified?? S: { <Sunny, Warm,?, Strong,?,?> } <Sunny,?,?, Strong,?,?> <Sunny, Warm,?,?,?,?> <?, Warm,?, Strong,?,?> G: { <Sunny,?,?,?,?,?>, <?, Warm,?,?,?,?> } Sunny Warm Normal Strong Cool Change Rainy Cool Normal Light Warm Same Sunny Warm Normal Light Warm Same 42 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Learning Bias What Justifies this Inductive Leap? + Sunny Warm Normal Strong Cool Change + Sunny Warm Normal Light Warm Same S : Sunny Warm Normal??? Why believe we can classify the unseen Sunny Warm Normal Strong Warm Same 43 TDT4173 Machine Learning and Case-Based Reasoning

Inductive Bias Learning From Examples Learning Bias Consider concept learning algorithm L instances X, target concept c training examples D c = { x,c(x) } let L(x i,d c ) denote the classification assigned to the instance x i by L after training on data D c. Definition: The inductive bias of L is any minimal set of assertions B such that for any target concept c and corresponding training examples D c ( x i X)[(B D c x i ) L(x i,d c )] where A B means A logically entails B. 44 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Learning Bias Inductive and Equivalent Deductive Systems Training examples New instance Inductive system Candidate Elimination Algorithm Using Hypothesis Space H Classification of new instance, or "don t know" Training examples New instance Equivalent deductive system Theorem Prover Classification of new instance, or "don t know" Assertion " H contains the target concept" Inductive bias made explicit 45 TDT4173 Machine Learning and Case-Based Reasoning

Learning From Examples Learning Bias Three Learners with Different Biases 1 Rote learner: Store examples, Classify x iff it matches previously observed example. 2 Version space candidate elimination algorithm 3 Find-S 46 TDT4173 Machine Learning and Case-Based Reasoning

Summary Points Summary 1 Concept learning as search through H 2 General-to-specific ordering over H 3 Version space candidate elimination algorithm 4 S and G boundaries characterize learner s uncertainty 5 Learner can generate useful queries 6 Inductive leaps possible only if learner is biased 7 Inductive learners can be modelled by equivalent deductive systems 47 TDT4173 Machine Learning and Case-Based Reasoning