Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Similar documents
Chapter 2 Rule Learning in a Nutshell

A Version Space Approach to Learning Context-free Grammars

Rule-based Expert Systems

Proof Theory for Syntacticians

Self Study Report Computer Science

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Basic Concepts of Machine Learning

Statewide Framework Document for:

Rule Learning With Negation: Issues Regarding Effectiveness

Disambiguation of Thai Personal Name from Online News Articles

Lecture 1: Machine Learning Basics

Rule Learning with Negation: Issues Regarding Effectiveness

LEGO MINDSTORMS Education EV3 Coding Activities

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Softprop: Softmax Neural Network Backpropagation Learning

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

B.S/M.A in Mathematics

Cal s Dinner Card Deals

On-Line Data Analytics

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

(Sub)Gradient Descent

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Mathematics. Mathematics

Some Principles of Automated Natural Language Information Extraction

Discriminative Learning of Beam-Search Heuristics for Planning

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Language Evolution, Metasyntactically. First International Workshop on Bidirectional Transformations (BX 2012)

Evolution of Collective Commitment during Teamwork

Extending Place Value with Whole Numbers to 1,000,000

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Compositional Semantics

This scope and sequence assumes 160 days for instruction, divided among 15 units.

Using focal point learning to improve human machine tacit coordination

1 Copyright Texas Education Agency, All rights reserved.

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT

arxiv: v1 [math.at] 10 Jan 2016

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

Generation of Referring Expressions: Managing Structural Ambiguities

Page 1 of 11. Curriculum Map: Grade 4 Math Course: Math 4 Sub-topic: General. Grade(s): None specified

Learning Methods in Multilingual Speech Recognition

Piaget s Cognitive Development

Learning to Rank with Selection Bias in Personal Search

Type-driven semantic interpretation and feature dependencies in R-LFG

Evolutive Neural Net Fuzzy Filtering: Basic Description

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Learning goal-oriented strategies in problem solving

A Characterization of Calculus I Final Exams in U.S. Colleges and Universities

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Grade 6: Correlated to AGS Basic Math Skills

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Learning to Schedule Straight-Line Code

TEKS Resource System. Effective Planning from the IFD & Assessment. Presented by: Kristin Arterbury, ESC Region 12

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Abstractions and the Brain

School of Innovative Technologies and Engineering

The New York City Department of Education. Grade 5 Mathematics Benchmark Assessment. Teacher Guide Spring 2013

Cooperative evolutive concept learning: an empirical study

Radius STEM Readiness TM

Speech Recognition at ICSI: Broadcast News and beyond

Compositionality in Rational Analysis: Grammar-based Induction for Concept Learning

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 (click on Math My Way tab) Math My Way Instructors:

Science Olympiad Competition Model This! Event Guidelines

WSU Five-Year Program Review Self-Study Cover Page

Learning and Transferring Relational Instance-Based Policies

Constructive Induction-based Learning Agents: An Architecture and Preliminary Experiments

Focused on Understanding and Fluency

Digital Media Literacy

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Artificial Neural Networks written examination

University of Michigan - Flint POLICY ON FACULTY CONFLICTS OF INTEREST AND CONFLICTS OF COMMITMENT

Technical Manual Supplement

Disciplinary Literacy in Science

Pearson Baccalaureate Higher Level Mathematics Worked Solutions

Unit: Human Impact Differentiated (Tiered) Task How Does Human Activity Impact Soil Erosion?

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

Exemplar 6 th Grade Math Unit: Prime Factorization, Greatest Common Factor, and Least Common Multiple

How do adults reason about their opponent? Typologies of players in a turn-taking game

A cognitive perspective on pair programming

Word learning as Bayesian inference

Generative models and adversarial training

The Good Judgment Project: A large scale test of different methods of combining expert predictions

This Performance Standards include four major components. They are

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

TU-E2090 Research Assignment in Operations Management and Services

Tabular and Textual Methods for Selecting Objects from a Group

A Polynomial Approach to the Constructive Induction of Structural Knowledge

A Case Study: News Classification Based on Term Frequency

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011

Biological Sciences, BS and BA

Florida Mathematics Standards for Geometry Honors (CPalms # )

The College Board Redesigned SAT Grade 12

STA2023 Introduction to Statistics (Hybrid) Spring 2013

TOPICS LEARNING OUTCOMES ACTIVITES ASSESSMENT Numbers and the number system

Transcription:

Version Space Javier Béjar cbea LSI - FIB Term 2012/2013 Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 1 / 18

Outline 1 Learning logical formulas 2 Version space Introduction Search strategy Algorithm Applications Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 2 / 18

Learning logical formulas 1 Learning logical formulas 2 Version space Introduction Search strategy Algorithm Applications Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 3 / 18

Learning logical formulas Symbolic learning We are used to learn (and teach) symbolically and from our perspective it seems the natural way From all the possible space hypothesis, logical formulas are the best space for this task Restricted to propositional logic, examples are represented by expressions that denote their properties and values This representation is not different from the attribute-value pairs representation that we have defined Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 4 / 18

Learning logical formulas Symbolic learning As we mentioned before the size of this hypothesis space is O(2 2n ) The main advantage of this is that we can define a partial order among the hypothesis Logical formulas form a lattice with a partial order defined by the generalization relation A B = A > B That order can help in the search process allowing to prune unwanted candidates Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 5 / 18

Learning logical formulas Hierarchy of logical formulas............ Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 6 / 18

1 Learning logical formulas 2 Version space Introduction Search strategy Algorithm Applications Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 7 / 18

Introduction The Version Space algorithm General supervised inductive learning algorithm Examples are represented as value attribute pairs (propositional formulas) Explores the hypothesis space using the partial order (general/specific) The algorithm will have no preference criteria (bias) = All hypothesis are possible (In practice we are going to reduce the hypothesis space to pure conjunctive formulas) Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 8 / 18

Introduction Assumptions Learning is obtained by searching in the hierarchy for the concept that best fits the examples Two kind of examples will be used, the positives (examples of the concept to learn) and the negatives (counterexamples of the concept) (Binary classification) We define the version space as the set of all hypothesis consistent with the examples that have been presented so far The goal is to reduce the hypothesis set to a single concept Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 9 / 18

Search strategy Search Strategy A breadth first bidirectional search is used The more general concepts consistent with the examples are stored in (G) and the more specific ones in (S) Positive examples are used to prune the more specific hypothesis Negative examples will be used to prune the more general hypothesis If the set of learning examples is correct the search will converge Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 10 / 18

Search strategy Search strategy Set G={Most general hypothesis consistent with the examples} Set S={Most specific hypothesis consistent with the examples} Adequate generalization and specialization operators for the concept representation language must be chosen Positive example allow to generalize the most specific hypothesis (for instance, deleting conditions) Negative examples allow to specialize the most general hypothesis Also must hold that S G Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 11 / 18

Search strategy Searching in the hypothesis space G + S + S G + S G + + + + + + + + G=S Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 12 / 18

Algorithm Candidate elimination algorithm (I) Initialize G to the most general concept Initialize S to the fist positive example while there are examples if it is a positive example (p) * Delete from G any hypothesis inconsistent with p (Concepts from G that do not include p) * for each concept from S inconsistent with p (s) - Delete s - Add to S all minimal generalizations of s that are consistent with p and an element from G is more general that them * Delete from S all concepts more general that any from S Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 13 / 18

Algorithm Candidate elimination algorithm (II) if it is a negative example (n) * Delete from S any hypothesis inconsistent with n (Concepts from S that include n) * For each concept from G inconsistent with n (g) - Delete g - Add to G all minimal specializations of g that are consistent with n and an element from S is more specific that them * Delete from G all concepts less general than any from G end while if G=S and both have only one element this is the goal concept Otherwise the set of examples is inconsistent Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 14 / 18

Algorithm Shortcomings of the algorithm The exhaustive search is too costly Improvements: To use simpler hypothesis space (some concepts can not be learned) To use heuristics to prune concepts from G and S (give a preference criteria over the hypothesis space, a bias) It is not tolerant to misclassified examples (noise) Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 15 / 18

Applications LEX: An application to symbolic integration LEX is a symbolic integrator that learns from experience The hypothesis space of LEX is all the algebraic expressions Concepts: What integration operators are more adequate for different kinds of indefinite integrals OP1 : rf (x)dx r f (x)dx OP2 : udv uv vdu OP3 : f 1 (x) + f 2 (x)dx f 1 (x) + f 2 (x) Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 16 / 18

Applications LEX: An application to symbolic integration The system is able to generate problems and label each operator depending on its success in solving a specific kind of integral as positive or negative example of application Each operator appears has one or more version spaces (disjunction) The version spaces are modified with the new positive or negative examples of application of operators If an expression is inside a version space of an operator this means that could be applicable to solve the integration of the expression Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 17 / 18

LEX: Example EV OP2 Version space Applications f1(x) f2(x) dx pol(x)f2(x) dx f1(x) trig(x) dx pol(x) sen(x) dx 3x trig(x) dx 3x sen(x) dx Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 18 / 18