Monographs in Computer Science

Similar documents
Perspectives of Information Systems

International Series in Operations Research & Management Science

Guide to Teaching Computer Science

PRODUCT PLATFORM AND PRODUCT FAMILY DESIGN

MARE Publication Series

Communication and Cybernetics 17

Lecture Notes on Mathematical Olympiad Courses

Rule-based Expert Systems

COMMUNICATION-BASED SYSTEMS

Physics 270: Experimental Physics

Conducting the Reference Interview:

Knowledge-Based - Systems

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

AUTONOMY. in the Law

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Principles of Public Speaking

Welcome to. ECML/PKDD 2004 Community meeting

Mathematics subject curriculum

InTraServ. Dissemination Plan INFORMATION SOCIETY TECHNOLOGIES (IST) PROGRAMME. Intelligent Training Service for Management Training in SMEs

Using Virtual Manipulatives to Support Teaching and Learning Mathematics

BENG Simulation Modeling of Biological Systems. BENG 5613 Syllabus: Page 1 of 9. SPECIAL NOTE No. 1:

Mathematics Program Assessment Plan

Lecture 1: Machine Learning Basics

Lecture Notes in Artificial Intelligence 4343

South Carolina English Language Arts

Graphical Data Displays and Database Queries: Helping Users Select the Right Display for the Task

A R "! I,,, !~ii ii! A ow ' r.-ii ' i ' JA' V5, 9. MiN, ;

Agent-Based Software Engineering

Instrumentation, Control & Automation Staffing. Maintenance Benchmarking Study

Henry Tirri* Petri Myllymgki

Lecture 1: Basic Concepts of Machine Learning

WHY SOLVE PROBLEMS? INTERVIEWING COLLEGE FACULTY ABOUT THE LEARNING AND TEACHING OF PROBLEM SOLVING

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

THE PROMOTION OF SOCIAL AWARENESS

CURRICULUM VITAE OF MARIE-LOUISE VIERØ

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

NATO ASI Series Advanced Science Institutes Series

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Education for an Information Age

To link to this article: PLEASE SCROLL DOWN FOR ARTICLE

Automating the E-learning Personalization

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Python Machine Learning

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Timeline. Recommendations

Physics Experimental Physics II: Electricity and Magnetism Prof. Eno Spring 2017

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

THE INFLUENCE OF COOPERATIVE WRITING TECHNIQUE TO TEACH WRITING SKILL VIEWED FROM STUDENTS CREATIVITY

Introduction and Motivation

This Performance Standards include four major components. They are

Extending Place Value with Whole Numbers to 1,000,000

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

Radius STEM Readiness TM

EGRHS Course Fair. Science & Math AP & IB Courses

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

STA 225: Introductory Statistics (CT)

Field Experience and Internship Handbook Master of Education in Educational Leadership Program

Dublin City Schools Mathematics Graded Course of Study GRADE 4

KUTZTOWN UNIVERSITY KUTZTOWN, PENNSYLVANIA COE COURSE SYLLABUS TEMPLATE

A THESIS. By: IRENE BRAINNITA OKTARIN S

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Theory of Probability

Grade Band: High School Unit 1 Unit Target: Government Unit Topic: The Constitution and Me. What Is the Constitution? The United States Government

CS/SE 3341 Spring 2012

EDUCATION IN THE INDUSTRIALISED COUNTRIES

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Submission of a Doctoral Thesis as a Series of Publications

Mathematics Success Grade 7

Pre-vocational Education in Germany and China

THEORETICAL CONSIDERATIONS

NORMAL AND ABNORMAL DEVELOPMENT OF BRAIN AND BEHAVIOUR

Developing a concrete-pictorial-abstract model for negative number arithmetic

Evolutive Neural Net Fuzzy Filtering: Basic Description

ME 4495 Computational Heat Transfer and Fluid Flow M,W 4:00 5:15 (Eng 177)

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Preprint.

Self Study Report Computer Science

Shockwheat. Statistics 1, Activity 1

AQUA: An Ontology-Driven Question Answering System

HDR Presentation of Thesis Procedures pro-030 Version: 2.01

Answers To Hawkes Learning Systems Intermediate Algebra

On-Line Data Analytics

Course Name: Elementary Calculus Course Number: Math 2103 Semester: Fall Phone:

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Rotary Club of Portsmouth

Learning Cases to Resolve Conflicts and Improve Group Behavior

HIGHER EDUCATION IN POLAND

understand a concept, master it through many problem-solving tasks, and apply it in different situations. One may have sufficient knowledge about a do

Excel Formulas & Functions

An Interactive Intelligent Language Tutor Over The Internet

Northeastern University Online Course Syllabus

What is PDE? Research Report. Paul Nichols

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

TEXAS CHRISTIAN UNIVERSITY M. J. NEELEY SCHOOL OF BUSINESS CRITERIA FOR PROMOTION & TENURE AND FACULTY EVALUATION GUIDELINES 9/16/85*

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Data Fusion Models in WSNs: Comparison and Analysis

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

content First Introductory book to cover CAPM First to differentiate expected and required returns First to discuss the intrinsic value of stocks

Transcription:

Monographs in Computer Science Editors David Gries Fred B. Schneider Springer New York Berlin Heidelberg Barcelona Budapest Hong Kong London Milan Paris Santa Clara Singapore Tokyo

Monographs in Computer Science Abadi and Cardelli, A Theory of Objects Brzozowski and Seger, Asynchronous Circuits Selig, Geometrical Methods in Robotics Nielson [editor], ML with Concurrency Castillo, Gutierrez, and Hadi, Expert Systems and Probabilistic Network Models

Enrique Castillo Jose Manuel Gutierrez Ali S. Hadi Expert Systems and Probabilistic Network Models With 250 Figures, Springer

Enrique Castillo Cantabria University 39005 Santander, Spain E-mail: castie@ccaix3.unican.es Jose Manuel Gutierrez Cantabria University 39005 Santander, Spain E-mail: gutierjm@ccaix3.unican.es Ali S. Hadi Cornell University 3581ves Hall Ithaca, NY 14853-3901 USA Series Editors: David Gries Department of Computer Science Cornell University Upson Hall Ithaca, NY 14853-7501 USA Fred B. Schneider Department of Computer Science Cornell University Upson Hall Ithaca, NY 14853-7501 USA Library of Congress Catologing-in-Publication Data Castillo. Enrique. Expert systems and probabilistic network models I Enrique Castillo, Jose Manuel Gutierrez, Ali S. Hadi. p. cm. - (Monographs in computer science) Includes bibliographical references and index. ISBN 13:978-1-4612-7481-0 e-isbn-13:978 1-4612 2270-5 DOl: 10.1007/978 1-4612 2270-5 1. Expert systems (Computer science) 2. Probabilities. I. Gutierrez, Jose Manuel. II. Hadi, Ali S. III. Title. IV. Series. QA76.76.E95C378 1997 006.3'3-dc20 96-33161 Printed on acid-free paper. 1997 Springer-Verlag New York, Inc. Softcover reprint of the hardcover 1st edition 1997 All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use of general descriptive names, trade names, trademarks, etc., in this publication, even if the former are not especially identified, is not to be taken as a sign that such names, as understood by the Trade Marks and Merchandise Marks Act, may accordingly be used freely by anyone. Production managed by Lesley Poliner; manufacturing supervised by Johanna Tschebull. Photocomposed using the authors' LATEX files. 987654321 ISBN-13:978-1-4612-7481-0 Springer-Verlag New York Berlin Heidelberg SPIN 10524048

To all the people of the former Yugoslavia with the hope that they will live together in peace and be friends, as are the authors, despite the differences in our religions, languages, and national origins.

Preface The artificial intelligence area in general and the expert systems and probabilistic network models in particular have seen a great surge of research activity during the last decade. Because of the multidisciplinary nature of the field, the research has been scattered in professional journals in many fields such as computer science, engineering, mathematics, probability, and statistics. This book collects, organizes, and summarizes these research works in what we hope to be a clear presentation. Every effort has been made to keep the treatment of the subject as up-to-date as possible. Actually, some of the material presented in the book is yet to be published in the literature. See, for example, the material in Chapter 12 and some of the material in Chapters 7 and II. The book is intended for students and research workers from many fields such as computer science; engineering and manufacturing; medical and pharmaceutical sciences; mathematical, statistical, and decision sciences; business and management; economics and social sciences; etc. For this reason, we assumed no previous background in the subject matter of the book. The reader, however, is assumed to have some background in probability and statistics and to be familiar with some matrix notation (see, e.g., Hadi (1996)). In a few instances, we give some programs in Mathematica to perform the calculations. For a full understanding of these programs some knowledge of Mathematica is needed. The book can be used as a reference or consulting book and as a textbook in upper-division undergraduate courses or in graduate-level courses. The book contains numerous illustrative examples and end-of-chapter exercises. We have also developed some computer programs to implement the various algorithms and methodologies presented in this book. The current version of these programs, together with a brief User's Guide, can be obtained from the World Wide Web site http://ccaix3.unican.esr AIGroup. We have used these programs to do the examples and we encourage the reader to use them to solve some of the exercises. The computer programs can also help research workers and professionals apply the methodology to their own

viii Preface fields of study. Actually, we have used these programs to analyze some reallife applications (case studies) in Chapter 12. We therefore encourage the reader to use and explore the capabilities of these programs. It is suggested that the reader repeat the computations in the examples and solve the exercises at the end of the chapters using these programs. We hope that making such programs available will facilitate the learning of the material presented in this book. Finally, the extensive bibliography included at the end of the book can also serve as a basis for additional research. Although some theory is present in the book, the emphasis is on applications rather than on theory. For this reason, the proofs of many theorems are left out, numerous examples are used to illustrate the concepts and theory, and the mathematical level is kept to a minimum. The book is organized as follows. Chapter 1 is an introductory chapter, which among other things, gives some motivating examples, describes the components and development of an expert system, and surveys other related areas of artificial intelligence. Chapters 2 and 3 describe the main two types of expert systems: rule-based and probabilistic expert systems. Although the two types of expert systems are introduced separately, rulebased expert system can be thought of as a special case of the more powerful probabilistic expert system. It is argued in Chapters 1-3 that two of the most important and complex components of expert systems are the coherence control and the inference engine. These are perhaps the two weakest links in almost all current expert systems, the former because it has appeared relatively recently and many of the existing expert systems do not have it, and the latter because of its complexity. In Chapters 1-3 we show how these subsystems can be implemented in rule-based and probability-based expert systems and how the probability assignment must be done in order to avoid inconsistencies. For example, the automatic updating of knowledge and the automatic elimination of object values are important for maintaining the coherence of the system. Chapters 5-10 are mainly devoted to the details of such implementations. The materials in Chapter 5 and beyond require some concepts of graph theory. Since we expect that some of the readers may not be familiar with these concepts, Chapter 4 presents these concepts. This chapter is an essential prerequisite for understanding the topics covered in the remaining chapters. Building probabilistic models, which are needed for the knowledge base of a probabilistic expert system, is presented in Chapters 5-7. In particular, the independence and conditional independence concepts, which are useful for defining the internal structure of probabilistic network models and for knowing whether or not some variables or sets of variables have information about other variables, are discussed in Chapter 5. As mentioned in Chapter 4, graphs are essential tools for building probabilistic and other models used in expert systems. Chapter 6 presents the Markov and Bayesian network models as two of the most widely used graphical

Preface ix network models. Chapter 7 extends graphically specified models to more powerful models such as models specified by multiple graphs, models specified by input lists, multifactorized probabilistic models, and conditionally specified probabilistic models. Chapters 8 and 9 present the most commonly used exact and approximate methods for the propagation of evidence, respectively. Chapter 10 introduces symbolic propagation, which is perhaps one of the most recent advances in evidence propagation. Chapter 11 deals with the problem of learning Bayesian network models from data. Finally, Chapter 12 includes several examples of applications (case studies). Many of our colleagues and students have read earlier versions of this manuscript and have provided us with valuable comments and suggestions. Their contributions have given rise to the current substantially improved version. In particular, we acknowledge the help of the following (in alphabetical order): Noha Adly, Remco Bouckaert, Federico Ceballos, Jong Wang Chow, Javier Dfez, Dan Geiger, Joseph Halpern, Judea Pearl, Julius Reiner, Milan Studeny, and Jana Zv8xova. Enrique Castillo Jose Manuel Gutierrez Ali S. Hadi

Contents Preface 1 Introduction 1.1 Introduction... 1.2 What Is an Expert System? 1.3 Motivating Examples.. 1.4 Why Expert Systems?... 1.5 Types of Expert System.. 1.6 Components of an Expert System. 1.7 Developing an Expert System 1.8 Other Areas of AI 1.9 Concluding Remarks.... 2 Rule-Based Expert Systems 2.1 Introduction.... 2.2 The Knowledge Base. 2.3 The Inference Engine. 2.4 Coherence Control.. 2.5 Explaining Conclusions 2.6 Some Applications... 2.7 Introducing Uncertainty Exercises.... 3 Probabilistic Expert Systems 3.1 Introduction.... 3.2 Some Concepts in Probability Theory 3.3 Generalized Rules............ 3.4 Introducing Probabilistic Expert Systems 3.5 The Knowledge Base. 3.6 The Inference Engine. 3.7 Coherence Control.. vii 1 1 2 3 7 8 10 14 16 20 21 21 22 28 48 52 53 65 65 69 69 71 85 86 91 102 104

xii Contents 3.8 Comparing Rule-Based and Probabilistic Expert Systems 106 Exercises... 108 4 Some Concepts of Graphs 4.1 Introduction.... 4.2 Basic Concepts and Definitions.... 4.3 Characteristics of Undirected Graphs. 4.4 Characteristics of Directed Graphs 4.5 Triangulated Graphs... 4.6 Cluster Graphs......... 4.7 Representation of Graphs... 4.8 Some Useful Graph Algorithms Exercises.... 113 113 114 118 122 129 139 144 158 172 5 Building Probabilistic Models 5.1 Introduction....... 5.2 Graph Separation................ 5.3 Some Properties of Conditional Independence 5.4 Special Types of Input Lists. 5.5 Factorizations of the JPD 175 175 177 184 192 195 5.6 Constructing the JPD. 200 Appendix to Chapter 5. 204 Exercises... 206 6 Graphically Specified Models 211 6.1 Introduction... 211 6.2 Some Definitions and Questions..... 213 6.3 Undirected Graph Dependency Models. 218 6.4 Directed Graph Dependency Models.. 237 6.5 Independence Equivalent Graphical Models 252 6.6 Expressiveness of Graphical Models. 259 Exercises... 262 7 Extending Graphically Specified Models 267 7.1 Introduction... 267 7.2 Models Specified by Multiple Graphs. 269 7.3 Models Specified by Input Lists... 275 7.4 Multifactorized Probabilistic Models 279 7.5 Multifactorized Multinomial Models 279 7.6 Multifactorized Normal Models... 292 7.7 Conditionally Specified Probabilistic Models. 298 Exercises... 311 8 Exact Propagation in Probabilistic Network Models 317 8.1 Introduction... 317

Contents xiii 8.2 Propagation of Evidence............ 8.3 Propagation in Poly trees............ 8.4 Propagation in Multiply-Connected Networks 8.5 Conditioning Method..... 8.6 Clustering Methods.... 8.7 Propagation Using Join Trees 8.8 Goal-Oriented Propagation. 8.9 Exact Propagation in Gaussian Networks Exercises.... 9 Approximate Propagation Methods 9.1 Introduction.... 9.2 Intuitive Basis of Simulation Methods. 9.3 General Frame for Simulation Methods. 9.4 Acceptance-Rejection Sampling Method 9.5 Uniform Sampling Method.... 9.6 The Likelihood Weighing Sampling Method 9.7 Backward-Forward Sampling Method. 9.8 Markov Sampling Method.... 9.9 Systematic Sampling Method.... 9.10 Maximum Probability Search Method 9.11 Complexity Analysis Exercises.... 10 Symbolic Propagation of Evidence 10.1 Introduction.... 10.2 Notation and Basic Framework.. 10.3 Automatic Generation of Symbolic Code. 10.4 Algebraic Structure of Probabilities.... 10.5 Symbolic Propagation Through Numeric Computations 10.6 Goal-Oriented Symbolic Propagation... 10.7 Symbolic Treatment of Random Evidence....... 10.8 Sensitivity Analysis.................... 10.9 Symbolic Propagation in Gaussian Bayesian Networks Exercises.... 11 Learning Bayesian Networks 11.1 Introduction......... 11.2 Measuring the Quality of a Bayesian Network Model 11.3 Bayesian Quality Measures.... 11.4 Bayesian Measures for Multinomial Networks 11.5 Bayesian Measures for Multinormal Networks 11.6 Minimum Description Length Measures 11. 7 Information Measures.... 11.8 Further Analyses of Quality Measures. 318 321 342 342 351 366 377 382 387 393 393 394 400 406 409 411 413 415 419 429 439 440 443 443 445 447 454 455 464 470 472 474 478 481 481 484 486 490 499 506 509 509

xiv Contents 11.9 Bayesian Network Search Algorithms. l1.lothe Case of Incomplete Data..... Appendix to Chapter 11: Bayesian Statistics. Exercises 511 513 515 525 12 Case Studies 529 12.1 Introduction......... 529 12.2 Pressure Tank System... 530 12.3 Power Distribution System 542 12.4 Damage of Concrete Structures 550 12.5 Damage of Concrete Structures: The Gaussian Model 562 Exercises....................... 567 List of Notation 573 References 581 Index 597