Machine Learning and Expert Systems

Similar documents
Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Rule Learning With Negation: Issues Regarding Effectiveness

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Knowledge-Based - Systems

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Lecture 1: Basic Concepts of Machine Learning

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Rule Learning with Negation: Issues Regarding Effectiveness

SARDNET: A Self-Organizing Feature Map for Sequences

Learning and Transferring Relational Instance-Based Policies

A Version Space Approach to Learning Context-free Grammars

Constructive Induction-based Learning Agents: An Architecture and Preliminary Experiments

Computerized Adaptive Psychological Testing A Personalisation Perspective

Evolutive Neural Net Fuzzy Filtering: Basic Description

Cooperative evolutive concept learning: an empirical study

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Lecture 10: Reinforcement Learning

A cognitive perspective on pair programming

Modeling user preferences and norms in context-aware systems

Word Segmentation of Off-line Handwritten Documents

Learning Methods for Fuzzy Systems

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Learning Methods in Multilingual Speech Recognition

MYCIN. The MYCIN Task

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Content-free collaborative learning modeling using data mining

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Chapter 2 Rule Learning in a Nutshell

Classification Using ANN: A Review

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Python Machine Learning

CS Machine Learning

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

A Case-Based Approach To Imitation Learning in Robotic Agents

Action Models and their Induction

Reducing Features to Improve Bug Prediction

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Rule-based Expert Systems

AQUA: An Ontology-Driven Question Answering System

ABSTRACT. A major goal of human genetics is the discovery and validation of genetic polymorphisms

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Study and Analysis of MYCIN expert system

CSL465/603 - Machine Learning

Axiom 2013 Team Description Paper

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Welcome to. ECML/PKDD 2004 Community meeting

Lecture 1: Machine Learning Basics

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Australian Journal of Basic and Applied Sciences

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

Learning From the Past with Experiment Databases

Multi-label Classification via Multi-target Regression on Data Streams

INTRODUCTION TO PSYCHOLOGY

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

Parsing of part-of-speech tagged Assamese Texts

Using dialogue context to improve parsing performance in dialogue systems

Radius STEM Readiness TM

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

Customized Question Handling in Data Removal Using CPHC

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Seminar - Organic Computing

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Abstractions and the Brain

An Investigation into Team-Based Planning

A Genetic Irrational Belief System

The Paradox of Structure: What is the Appropriate Amount of Structure for Course Assignments with Regard to Students Problem-Solving Styles?

Preference Learning in Recommender Systems

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

GACE Computer Science Assessment Test at a Glance

TD(λ) and Q-Learning Based Ludo Players

Linking Task: Identifying authors and book titles in verbose queries

Course Law Enforcement II. Unit I Careers in Law Enforcement

Visual CP Representation of Knowledge

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Artificial Neural Networks written examination

Hi I m Ryan O Donnell, I m with Florida Tech s Orlando Campus, and today I am going to review a book titled Standard Celeration Charting 2002 by

Candidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.

ZACHARY J. OSTER CURRICULUM VITAE

Cognitive Prior-Knowledge Testing Method for Core Development of Higher Education of Computing in Academia

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Discriminative Learning of Beam-Search Heuristics for Planning

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

stateorvalue to each variable in a given set. We use p(x = xjy = y) (or p(xjy) as a shorthand) to denote the probability that X = x given Y = y. We al

Self Study Report Computer Science

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

Top US Tech Talent for the Top China Tech Company

SOFTWARE EVALUATION TOOL

Reinforcement Learning by Comparing Immediate Reward

Transcription:

Machine Learning and Expert Systems By Alexander Bailey Introduction Why use machine learning? Strategies Usefulness and success Collaborative Systems 1

Why Use Machine Learning? Knowledge acquisition bottleneck ML helps in the knowledge elicitation process. Problems with KE Difficult to find experts Experts disagree Inconsistent knowledge Paradox of expertise There must be an expert! 2

What to do? Machine learning techniques can be used to attempt to tackle these problems. Shows lots of promise. Caveats: Knowledge is dynamic. Must be updated as experience is gained. There are limits to how ML can be applied! 3

Ideas Machine Learning is important for the long-term success of advanced AI systems. Hybrid systems should be built. 4

Primary Application of ML The primary application of machine learning in expert systems is to attempt to solve the knowledge acquisition bottleneck. Idea: Build knowledge base from a set of examples. Agreeable format, humans and machines can read. Many algorithms exist to do such a thing. ID3 and successors, GA. 5

Focus: ID3 An inductive inference algorithm Given a set of data, will try and build a decision tree which can be converted into rules. A top-down approach The idea is to select the next node that yields the highest information gain recurse. Terminate when growing tree makes no statistically significant difference in solution quality. ID3 is fast, simple, and produces small trees. 6

Focus: ID3 Problems Not very robust when noise exists in the data Guided locally by information gain. Prone to finding local optimums Equivalent to hill-climbing Hard to obtain different rules with near-by accuracy. There are strategies that overcome this. 7

Focus: Genetic Algorithms Genetic Algorithms can be used to build decision trees, or even rules expressed in predicate or first order logic. Genetic operators can be performed on decision trees. Subtree crossover. Fitness Function is the error. Speculation: Suffers from many of the same problems as GP. Decision tree bloat. 8

EFOPREL EFOPREL system by Vladamir Estivill-Castro. Inductive supervised learning system. Uses GAs to build rules in first order predicate logic. Knowledge is explicit for human understanding. 9

EFOPREL BNF of logic rules used by EFOPREL <Logic_Rule> <Consequent> <Antecedent> <Bool_Expr> ::= IF <Antecedent> Then <Consequent> ::= <Class_Label> ::= <Boolean_Expression> ::= <Predicate> <Bool_Expr> AND <Bool_Expr> <Bool_Expr> OR <Bool_Expr> NOT <Bool_Expr> Can also operate on non-discrete values Very_*High_Ai 10

EFOPREL Rule Encoding example: Fisher - Iris dataset If normal petal width(x) OR high petal width(x) OR normal petal length(x) Then versicolor 11

EFOPREL Rule Simplification is employed to reduce the size of the trees generated. Performs very well in comparison with ID3 and similar algorithms. Problems: Much slower than single-pass algorithms such as ID3. Because it is a GA it may not always produce the same tree with the same data. 12

What to do? Idea: Learning and evolution are not quite the same thing. GA is an evolutionary system while ID3 is a learning system. Evolution and learning can work together. [Hinton and Nowlan, 1987] Learning eases the pressure on evolution. 13

Focus: Hybrid Hybrid System for Pattern Classification [Bala and DeJong, 1995] 14

Focus: Hyrbid Performs very well! 15

ML's Dirty Little Secret How do ML techniques compare to traditional KE approaches? Recall the intro slide, there are limits to what ML can do for Knowledge acquisition. Paper by D Grzymala-Brusse and J Grzymala- Brusse. Any machine learning approach that uses learning by example produces only a subset of all potential rules. An exact covering by example is difficult and may be impossible. 16

ML's Dirty Little Secret There are few comparative studies! Grzymala-Brusse study is contrived, but illustrates the point. 17

ML's Dirty Little Secret Comparison 18

Collaborative Systems EVOPROL and EFOPREL Have a collaborative aspect! Idea: Allow human interaction with the evolutionary process. Rules are in a human-readable form. Human expert can add, modify or create partial rules as the GA runs. Better results than non-collaborative systems! 19

Conclusion Machine Learning can greatly speed up the process of knowledge acquisition. Evolutionary methods provide a powerful approach. Machine Learning can not, at present state, replace the knowledge engineer. SkyNet is a long way off! 20

Citations Bala J., Huang J., Vafaie H., Hybrid Learning Using Genetic Algorithms and Decision Trees For Pattern Classification, IJCAI Conference, Montreal, August, 1995. Grzymala-Brusse D., Grzymala-Brusse J., On The Usefulness Of Machine Learning Approach To Knowledge Acquisition, Computational Intelligence 11, 1995. Vladimir Estivill-Castro, Integration of Machine Learning and Knowledge Acquisition with a Genetic Algorithm, 1997. Vladimir Estivill-Castro, "Collaborative Knowledge Acquisition with a Genetic Algorithm," Tools with Artificial Intelligence, IEEE International Conference on, pp. 0270, 9th International Conference on Tools with Artificial Intelligence (ICTAI '97), 1997. Fogel, D., Hanson, J. C., Kick, R., Malki, H. A., Sigwart, C., Stinson, M., and Turban, E. 1993. The impact of machine learning on expert systems. In Proceedings of the 1993 ACM Conference on Computer Science (Indianapolis, Indiana, United States, February 16-18, 1993). CSC '93. ACM, New York, NY, 522-527. DOI= http://doi.acm.org/10.1145/170791.171158 21