Optimizing Similarity Assessment in Case-Based Reasoning

Similar documents
Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Learning Methods for Fuzzy Systems

A Case Study: News Classification Based on Term Frequency

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Cooperative evolutive concept learning: an empirical study

Probabilistic Latent Semantic Analysis

Rule Learning With Negation: Issues Regarding Effectiveness

Lecture 1: Machine Learning Basics

Linking Task: Identifying authors and book titles in verbose queries

Knowledge-Based - Systems

A Case-Based Approach To Imitation Learning in Robotic Agents

Evolution of Symbolisation in Chimpanzees and Neural Nets

A SURVEY OF FUZZY COGNITIVE MAP LEARNING METHODS

Automating the E-learning Personalization

Rule Learning with Negation: Issues Regarding Effectiveness

Axiom 2013 Team Description Paper

Learning From the Past with Experiment Databases

On-Line Data Analytics

Softprop: Softmax Neural Network Backpropagation Learning

Artificial Neural Networks written examination

Using dialogue context to improve parsing performance in dialogue systems

Seminar - Organic Computing

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Ordered Incremental Training with Genetic Algorithms

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Lecture 1: Basic Concepts of Machine Learning

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Discriminative Learning of Beam-Search Heuristics for Planning

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Generating Test Cases From Use Cases

Causal Link Semantics for Narrative Planning Using Numeric Fluents

The Smart/Empire TIPSTER IR System

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

A Reinforcement Learning Variant for Control Scheduling

Universidade do Minho Escola de Engenharia

Learning to Schedule Straight-Line Code

CS Machine Learning

SARDNET: A Self-Organizing Feature Map for Sequences

What is a Mental Model?

METHODS FOR EXTRACTING AND CLASSIFYING PAIRS OF COGNATES AND FALSE FRIENDS

INPE São José dos Campos

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

Learning Cases to Resolve Conflicts and Improve Group Behavior

Managing Experience for Process Improvement in Manufacturing

Soft Computing based Learning for Cognitive Radio

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Speech Recognition at ICSI: Broadcast News and beyond

Emergency Management Games and Test Case Utility:

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Welcome to. ECML/PKDD 2004 Community meeting

arxiv: v1 [cs.cv] 10 May 2017

CSL465/603 - Machine Learning

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Improving Result Adaptation through 2-step Retrieval

CWIS 23,3. Nikolaos Avouris Human Computer Interaction Group, University of Patras, Patras, Greece

Calibration of Confidence Measures in Speech Recognition

AQUA: An Ontology-Driven Question Answering System

Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

A Genetic Irrational Belief System

Generative models and adversarial training

Evolutive Neural Net Fuzzy Filtering: Basic Description

CONCEPT MAPS AS A DEVICE FOR LEARNING DATABASE CONCEPTS

I-COMPETERE: Using Applied Intelligence in search of competency gaps in software project managers.

Assignment 1: Predicting Amazon Review Ratings

Computer Software Evaluation Form

Conversational Framework for Web Search and Recommendations

Classification Using ANN: A Review

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Reinforcement Learning by Comparing Immediate Reward

Using Genetic Algorithms and Decision Trees for a posteriori Analysis and Evaluation of Tutoring Practices based on Student Failure Models

Word Sense Disambiguation

TOWARDS PROVISION OF KNOWLEDGE-INTENSIVE PRODUCTS AND SERVICES OVER THE WEB

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

Python Machine Learning

(Sub)Gradient Descent

Preference Learning in Recommender Systems

Australian Journal of Basic and Applied Sciences

Learning Methods in Multilingual Speech Recognition

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

16.1 Lesson: Putting it into practice - isikhnas

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

AUTHOR COPY. Techniques for cold-starting context-aware mobile recommender systems for tourism

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

What to Do When Conflict Happens

Using focal point learning to improve human machine tacit coordination

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Chapter 2 Rule Learning in a Nutshell

Transfer Learning Action Models by Measuring the Similarity of Different Domains

CS 446: Machine Learning

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

Institutionen för datavetenskap. Hardware test equipment utilization measurement

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Transcription:

AAAI-06 Nectar Track July, 18th 2006 Optimizing Similarity Assessment in Case-Based Reasoning Image Understanding and Pattern Recognition Group German Research Center for Artificial Intelligence (DFKI) Kaiserslautern, Germany Institute of Cognitive Science Universtity of Osnabrück, Germany

Similarity Measures in CBR Semantics: Heuristic for selecting useful Cases New Problem? Unknown Solution Similarity Old Problem Known Solution Case Traditional Approaches similarity is based on geometric distance mainly estimate syntactical differences only e.g. Hamming Distance, Euclidean Distance,... Utility is influenced by characteristics of the domain, preferences of users, functionality of the CBR system,...

Knowledge-Intensive Similarity Measures kism encode specific knowledge about the application domain kism allow a much more accurate estimation of the cases' utility typical structure: Sim( Q, C) n i 1 w i sim examples (product recommendation system): i ( q, c ) w price = 0.5 w CPU-clock = 0.4 w CD-Drive = 0.1 i i price Sim c i -q i A lower price does not decrease the utility Sim c i -q i CPU-clock A higher clock rate does not decrease the utility q i ROM RW DVD c i ROM RW DVD 1.0 1.0 0.0 1.0 0.0 0.3 CD-Drive 0.9 0.3 1.0 The measure encodes knowledge about functionality of CD-Drives

Knowledge Acquisition Problems of kism modelling kism manually is costly required domain knowledge is often only partially available contradicts with the original idea of CBR Alternative: Applying Machine Learning Approaches statistical analysis of case base optimization by performing Leave-One-Out test Existing Approaches e.g. [Hastie & Tibshirani, 1996; Wettschereck & Aha, 1995] rely on labeled data which provides absolute utility information only applicable for classification tasks allow optimization of attribute weights only not suited for many CBR applications (e.g. recommender systems)

Learning from Relative Case Utility Feedback [Stahl, ICCBR 2001] Teacher Utility (User / Expert / Evaluation-Function) feedback Case 5 Case 7 Case 1 Case 3 Case 2 Training Example Query Retrieval Error E Similarity Measure Case Base CBR-System determines Case 3 Case 8 Case 1 Similarity Case 5 Case 7 Case 2 Case 6 Case 4 Retrieval Result Goal: Finding a similarity measure that minimises E

Applying Evolutionary Algorithms [Stahl & Gabel, ICCBR 2003] Idea: encode attribute weights and local similarity measures as individuals to be optimised be a GA define corresponding mutation/crossover operators Representation Crossover and Mutation-Operators 1.0 1.0 1.0 1.0 0.4 0.1 0.0 similarity function as vector of sampling points Example: Similarity Functions

Experimental Evaluation [Stahl, Ph.D. Thesis 2004] Product Recommendation Scenario generation of RCUF by simulating user preferences (with noise) quality measures on test set: percentage of retrievals where 1-in-1: the optimal product is the most similar product 1-in-10: the optimal product is in the retrieval set (10 most similar) % 100 90 80 70 60 50 40 30 20 10 0 Learning Learning of Weights of Weights and Local only Measures 0 50 100 250 500 1000 # Training Examples 1-in-10 (0% Noise) 1-in-10 (10% Noise) 1-in-10 (30% Noise) 1-in-1 0% (Noise) 1-in-1 (10% Noise) 1-in-1 (30% Noise)

Drawbacks of Brute-Force Learning [Stahl, ECCBR 2002] Learning kism from Utility Feedback only may be critical: underlying hypothesis space is huge given only few training data, learning tends to overfitting some certain low-level knowledge is often easily available trying to learn this knowledge is needless and counterproductive similarity measures have typical properties, e.g. monotony learning algorithms should ensure compliance with these properties Idea: model partially known knowledge manually learn remaining knowledge from relative case utility feedback Goal: Restricting the Search Space and biasing the Learner by exploiting available Background Knowledge

Incorporating Background Knowledge [Gabel & Stahl, ECCBR 2004; Gabel, GWCBR 2005 ] Definition of Knowledge-Based Optimization Filters m-filters: Similarity-Meta Knowledge e.g. monotony property e-filters: Expert Knowledge e.g. predefined similarity values, constraints Modification of Offspring Generation during GA terminate yes stop no SELECTION Current Population Chosen Parents filtering Chosen Operators advice BREEDING Knowledge Filter Layer expert values heuristics statistics EVALUATION Offspring (new, filtered individuals)

Relative Classification/Regression Error Experimental Evaluation 6 Domains of the UCI Repository Comparison: Average Accuracies achieved with default similarity measures (knowledge-poor, Euclidean Distance) learnt similarity measures (without using background knowledge) similarity measures learnt with help of knowledge filters 100 90 80 70 60 50 40 30 20 10 0 15 25 50 100 200 # Training Examples default no-filter m-filter e-filter me-filter

Conclusions Knowledge-Intensive Similarity Measures in CBR manual definition is difficult and costly existing learning approaches are not suited for many CBR applications Novel Approach: acquisition of relative case utility feedback [Stahl, ICCBR 2001] allows learning in non-classification domains optimization with Genetic Algorithms [Stahl & Gabel, ICCBR 2003] allows optimization of weights and local similarity measures incorporation of background knowledge [Stahl, ECCBR 2002; Gabel & Stahl, ECCBR 2004; Gabel, GWCBR 2005] avoids overfitting for small training data sets Current Work combination with case-based learning [Stahl, ECCBR 2006]

Questions? Thank You!