Automated Reasoning. Christoph Kreitz. 1. What is Automated Reasoning? 2. Why Automated Reasoning? 3. Achievements

Similar documents
Logical Aspects of Digital Mathematics Libraries (extended abstract)

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Extending Place Value with Whole Numbers to 1,000,000

Lecture 1: Machine Learning Basics

B. How to write a research paper

Axiom 2013 Team Description Paper

Evolution of Collective Commitment during Teamwork

Introduction to Simulation

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

A Version Space Approach to Learning Context-free Grammars

This Performance Standards include four major components. They are

Lecture 1: Basic Concepts of Machine Learning

Radius STEM Readiness TM

EGRHS Course Fair. Science & Math AP & IB Courses

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Seminar - Organic Computing

Mathematics Program Assessment Plan

Proof Theory for Syntacticians

Some Principles of Automated Natural Language Information Extraction

Software Maintenance

Classroom Connections Examining the Intersection of the Standards for Mathematical Content and the Standards for Mathematical Practice

Page 1 of 11. Curriculum Map: Grade 4 Math Course: Math 4 Sub-topic: General. Grade(s): None specified

Mathematics. Mathematics

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Cal s Dinner Card Deals

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Circuit Simulators: A Revolutionary E-Learning Platform

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Update on Standards and Educator Evaluation

THINKING TOOLS: Differentiating the Content. Nanci Cole, Michelle Wikle, and Sacha Bennett - TOSAs Sandi Ishii, Supervisor of Gifted Education

Compositional Semantics

A BOOK IN A SLIDESHOW. The Dragonfly Effect JENNIFER AAKER & ANDY SMITH

MTH 141 Calculus 1 Syllabus Spring 2017

GACE Computer Science Assessment Test at a Glance

Natural Language Processing. George Konidaris

Probability and Game Theory Course Syllabus

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

Major Milestones, Team Activities, and Individual Deliverables

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Language properties and Grammar of Parallel and Series Parallel Languages

Grade 6: Correlated to AGS Basic Math Skills

Probabilistic Latent Semantic Analysis

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

Generating Test Cases From Use Cases

B.S/M.A in Mathematics

Given a directed graph G =(N A), where N is a set of m nodes and A. destination node, implying a direction for ow to follow. Arcs have limitations

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Self Study Report Computer Science

1.1 Examining beliefs and assumptions Begin a conversation to clarify beliefs and assumptions about professional learning and change.

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

Reinforcement Learning by Comparing Immediate Reward

Action Models and their Induction

FIGURE IT OUT! MIDDLE SCHOOL TASKS. Texas Performance Standards Project

AQUA: An Ontology-Driven Question Answering System

Learning Disability Functional Capacity Evaluation. Dear Doctor,

(Sub)Gradient Descent

CS 100: Principles of Computing

What is Thinking (Cognition)?

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Rule-based Expert Systems

MYCIN. The MYCIN Task

Missouri Mathematics Grade-Level Expectations

Foundations of Knowledge Representation in Cyc

Getting Started with Deliberate Practice

Diagnostic Test. Middle School Mathematics

An Introduction to Simio for Beginners

How to read a Paper ISMLL. Dr. Josif Grabocka, Carlotta Schatten

Focused on Understanding and Fluency

TEACHING AND EXAMINATION REGULATIONS PART B: programme-specific section MASTER S PROGRAMME IN LOGIC

Lecture 2: Quantifiers and Approximation

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Introduction and Motivation

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Methodology

Computerized Adaptive Psychological Testing A Personalisation Perspective

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Machine Learning and Development Policy

TabletClass Math Geometry Course Guidebook

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

School of Innovative Technologies and Engineering

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Research Area

Alpha provides an overall measure of the internal reliability of the test. The Coefficient Alphas for the STEP are:

An Automated Data Fusion Process for an Air Defense Scenario

Discriminative Learning of Beam-Search Heuristics for Planning

Education: Professional Experience: Personnel leadership and management

Statewide Framework Document for:

Playing It By Ear The First Year of SCHEMaTC: South Carolina High Energy Mathematics Teachers Circle

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Visual CP Representation of Knowledge

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

Abstractions and the Brain

content First Introductory book to cover CAPM First to differentiate expected and required returns First to discuss the intrinsic value of stocks

Lecture 10: Reinforcement Learning

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

ICTCM 28th International Conference on Technology in Collegiate Mathematics

The Strong Minimalist Thesis and Bounded Optimality

Florida Mathematics Standards for Geometry Honors (CPalms # )

Timeline. Recommendations

Transcription:

Automated Reasoning Christoph Kreitz 1. What is Automated Reasoning? 2. Why Automated Reasoning? 3. Achievements

Logical Reasoning the Center of Intelligence Mathematics Programming Problem Solving Philosophy Physics Planning Logical Reasoning Cognitive Modelling Law... Engineering Communication Biology Chemistry Cognitive Studies Proseminar (Cogst 773) 1 Automated Reasoning

Automated Reasoning: Leibniz s dream fulfilled? 1700: Make logical reasoning precise } A universal & accurate scientific language Rules for evaluating scientific arguments 1890: Formal logics Calculemus Formal language + Inference rules Laws of though expressed by mechanical manipulation of text 1950: Computers tools for symbolic manipulations Error-free application of rules Ability to search for solutions by exploring millions of possibilites Simulate logical reasoning on a computer Cognitive Studies Proseminar (Cogst 773) 2 Automated Reasoning

Major Application Areas Prove mathematical theorems Detect and correct errors in proofs (Proof Checking) Find new proofs automatically (Theorem Proving) Support development of reliable software Find bugs / prove correctness (Verification) Improve performance (Optimization) Generate from specifications (Synthesis) Inference engine for AI-Systems General problem solver, robot planning,... Cognitive Studies Proseminar (Cogst 773) 3 Automated Reasoning

Proof Calculi: Foundation of Automated Reasoning Formal language Syntax: expressions built from parameters and logical symbols Semantics: logical symbols have fixed meaning Proof calculus Inference rules for symbolic manipulation of expressions Must be proven correct and complete wrt. semantics A A B B Many logics for different purposes Classical logic (standard mathemantics) Modal logics (knowledge and belief) Constructive / intuitionistic logic (methods and programming) Linear logic (resources, actions, planning,... ) Nonmonotonic logics, default logic, probabilistic logic,.... Type Theory: higher-order logic + programming language + data types Cognitive Studies Proseminar (Cogst 773) 4 Automated Reasoning

Theoretical Limitations There are no general algorithms to decide whether a given logical formula is valid whether a given program terminates whether a given program is correct whether two programs have the same functionality We can only search for positive results Infinite search tree no answer in negative case Search techniques from AI do not apply to Theorem Proving, Software Verification, or Program Synthesis Intelligent proof techniques required Cognitive Studies Proseminar (Cogst 773) 5 Automated Reasoning

Machine Support for Formal Reasoning Interactive Proof Editors User constructs proofs interactively by applying rules Machine executes rules and returns unsolved subproblems Basic mechanism: pattern matching + term rewriting Automated Proof Procedures Tactics: programmed application of individual reasoning steps Decision Procedures for restricted domains Proof Search strategies, complete for small logics Resolution, Matrix Methods, Model Checking,... Knowledge-Based Reasoning: search guided by domain knowledge Cognitive Studies Proseminar (Cogst 773) 6 Automated Reasoning

Systems for Automated Reasoning Proof Development Systems Build Formal Knowledge NuPRL, PVS, HOL, Coq, KIV, Ωmega,... Human user guides proof system Proof editors enhanced by tactics and/or decision procedures Additional support: libraries, definitions, program evaluation,... Automated Theorem Provers Find Proofs Otter/EQP, Setheo, Gandalf, Spass,... No interaction Search + Unification, Paramodulation, AC-Matching, Rewriting... Search parameters can be modified, lemmas can be provided... Special Purpose Systems KIDS, SpecWare, VSE,... System guides human user through choice points Search strategies taylored for application domain Synthesize Software Cognitive Studies Proseminar (Cogst 773) 7 Automated Reasoning

Why Automated Reasoning? Too many errors in informal reasoning 40 50% of the results published in journals are wrong Formalization Software controls major aspects of our life Air traffic, Banking, Government, Utilities, Schools,... Errors can be annoying (Reboot, loss of data,... ) Errors can be very expensive (Pentium bug, failed rocket launches) Errors can cost lives (Airbus crashes in the early 1990 s) Software Development unreliable Tested programs still contain errors Correctness proofs are tedious and error-prone Automatization Cognitive Studies Proseminar (Cogst 773) 8 Automated Reasoning

An example from Mathematics: the Stamps Problem 3c 5c Can you represent any postage of 8 cents or higher with only stamps of 5c and 3c? 8c = 5c 3c, 9c = 3c 3c 3c, 10c = 5c 5c, 11c = 8c + 3c... YES Can you prove it? Precise formulation: n. 8 x < 8+3n. i,j. x = 5i+3j Inductive proof: we can represent x+1 if we can represent x-2 Base cases: 8, 9, 10 Can you do the same with other pairs of stamps? Obviously 1c and any other stamp, 2c and any odd number Can you prove that there are no others? Cognitive Studies Proseminar (Cogst 773) 9 Automated Reasoning

Proof of the Stamps Problem c S.B., Marktoberdorf 7/95 If x a+b. i,j. x = i a+j b then a=1 or a=2, b odd or a=3, b=5 (a<b) Assume 1<a<b and do some number theory (if a=1 we re done) i,j. a+b+1 = i a+j b a (b+1) or b (a+1) (1.) i,j. a+b+2 = i a+j b a=2 or a (b+2) or b (a+2) (2.) i,j. a+b+3 = i a+j b a=3 or a (b+3) or b (a+3) (3.) Case analysis a=2: by (1.) b must be odd a>2: then b>3. Split into subcases according to (1.) a (b+1): then a (b+2) and by (2.) b (a+2) thus b=a+2 Split into subcases according to (3.) b (a+3) is impossible since b=a+2 a (b+3) is impossible since a (b+1) and a>2 Thus a=3 and b=5 b (a+1): then b=a+1 by (2.) a (a+3) or a+1 (a+2) both of which are impossible Cognitive Studies Proseminar (Cogst 773) 10 Automated Reasoning

Problems of Informal Reasoning If we re not forced to look at details we won t We usually trust the one who presents the proof We care about the method, not the details Our mental model may not capture all aspects We jump to conclusions Proving only the complicated cases carefully Inappropriate analogies Hidden assumptions that are invalid in special cases Ad håc solutions appear better than the are Formal mathematical reasoning leads to deeper understanding and better solutions Cognitive Studies Proseminar (Cogst 773) 11 Automated Reasoning

Achievements: Four-Color Problem Can every two-dimensional map be colored with only 4 colors? Unsolved for several centuries Computerized proof in the late 70 s Mathematicians reduced infinite number of situations to a few thousand cases Computer verified that all cases can be colored with 4 colors Special-purpose software, not really Automated Reasoning Correctness of proof depends on (unverified) software Cognitive Studies Proseminar (Cogst 773) 12 Automated Reasoning

Achievements: Automated Theorem Proving 1995: Proof of unsolved quasi group theorem 1996: Robbins Algorithm Conjecture NY Times Open mathematical problem for more than 60 years Reduced to two sufficient conditions in 1980 (hand proof) EQP prover verified both conditions as sufficient in 1996 EQP proved condition to be true in October 1996 Proof found automatically with general-purpose prover Depth-limit 70 8 days, 49,548 equations generated and checked Resulting proof is sequence of 15 equations 1995: Pentium Bug found by Model Checking Division algorithm mapped incorrectly onto hardware tables Missing case detected as countermodel No new hardware design without control by model checkers Cognitive Studies Proseminar (Cogst 773) 13 Automated Reasoning

Achievements: Program Synthesis (KIDS) Synthesis of correct-by-construction algorithms 1990: Efficient Costas-Arrays Algorithm 1993: US-Army Transport Scheduling Algorithm General Program Synthesis Tool User chooses efficient algorithm structure + a posteriori optimizations Created within a few hours, correct-by-construction Generated LISP algorithm 2000 times faster than existing ADA program Commercial production of scheduling algorithms with KIDS Cognitive Studies Proseminar (Cogst 773) 14 Automated Reasoning

Achievements: Proof Development Systems (NuPRL) Mathematics Proof of Girard s paradox (1987) Formal mathematical textbook Hardware Design: Verification of logic synthesis tool (1993) Verification of SCI Cache Coherency Protocol (1996) Software Verification & Optimization: Code of Ensemble group communication system imported into NuPRL Verification of total order protocol uncovered subtle bug (1998) Fast-path optimization improved performance by factor 3 (1999) Cognitive Studies Proseminar (Cogst 773) 15 Automated Reasoning

Automated Reasoning vs. Human Intelligence Formal inferences are logical Automated Reasoning shows intelligent behavior Proof tactics correspond to human reasoning strategies Systems are personal reasoning assistants for experienced users Systems can teach methodology to newcomers Automated Reasoning is more accurate Automated Reasoning finds better solutions Can all brilliant ideas be found by search? Will fully automated systems dominate the future of mathematics and programming or will we always need a human component? Cognitive Studies Proseminar (Cogst 773) 16 Automated Reasoning