Variations on a Theory of Problem Solving

Similar documents
Integrating Meta-Level and Domain-Level Knowledge for Task-Oriented Dialogue

MYCIN. The MYCIN Task

Rule-based Expert Systems

Software Maintenance

A Version Space Approach to Learning Context-free Grammars

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Learning goal-oriented strategies in problem solving

UC Merced Proceedings of the Annual Meeting of the Cognitive Science Society

MYCIN. The embodiment of all the clichés of what expert systems are. (Newell)

Integrating derivational analogy into a general problem solving architecture

Cognitive Modeling. Tower of Hanoi: Description. Tower of Hanoi: The Task. Lecture 5: Models of Problem Solving. Frank Keller.

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Proof Theory for Syntacticians

Lecture 1: Machine Learning Basics

CREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

ECE-492 SENIOR ADVANCED DESIGN PROJECT

M55205-Mastering Microsoft Project 2016

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

An Investigation into Team-Based Planning

THE ROLE OF TOOL AND TEACHER MEDIATIONS IN THE CONSTRUCTION OF MEANINGS FOR REFLECTION

Learning Methods for Fuzzy Systems

Developing an Assessment Plan to Learn About Student Learning

Maths Games Resource Kit - Sample Teaching Problem Solving

Certified Six Sigma - Black Belt VS-1104

An Introduction to the Minimalist Program

Concept Acquisition Without Representation William Dylan Sabo

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Seminar - Organic Computing

EVOLVING POLICIES TO SOLVE THE RUBIK S CUBE: EXPERIMENTS WITH IDEAL AND APPROXIMATE PERFORMANCE FUNCTIONS

Language properties and Grammar of Parallel and Series Parallel Languages

Introduction to Simulation

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Highlighting and Annotation Tips Foundation Lesson

Activities, Exercises, Assignments Copyright 2009 Cem Kaner 1

ADAPTIVE PLANNING. 1 Powered by POeT Solvers Limited

CS 101 Computer Science I Fall Instructor Muller. Syllabus

The CTQ Flowdown as a Conceptual Model of Project Objectives

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

Major Milestones, Team Activities, and Individual Deliverables

Teaching Algorithm Development Skills

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Word learning as Bayesian inference

Community-oriented Course Authoring to Support Topic-based Student Modeling

Classroom Connections Examining the Intersection of the Standards for Mathematical Content and the Standards for Mathematical Practice

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Rover Races Grades: 3-5 Prep Time: ~45 Minutes Lesson Time: ~105 minutes

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

A Metacognitive Approach to Support Heuristic Solution of Mathematical Problems

What is PDE? Research Report. Paul Nichols

Generating Test Cases From Use Cases

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

An extended dual search space model of scientific discovery learning

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

A Context-Driven Use Case Creation Process for Specifying Automotive Driver Assistance Systems

Discriminative Learning of Beam-Search Heuristics for Planning

Group A Lecture 1. Future suite of learning resources. How will these be created?

South Carolina College- and Career-Ready Standards for Mathematics. Standards Unpacking Documents Grade 5

Deploying Agile Practices in Organizations: A Case Study

21 st Century Skills and New Models of Assessment for a Global Workplace

Learning and Transferring Relational Instance-Based Policies

Planning with External Events

Visual CP Representation of Knowledge

Maintaining Resilience in Teaching: Navigating Common Core and More Site-based Participant Syllabus

GACE Computer Science Assessment Test at a Glance

Introduction. 1. Evidence-informed teaching Prelude

Introduction to Modeling and Simulation. Conceptual Modeling. OSMAN BALCI Professor

Axiom 2013 Team Description Paper

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

A Study of Metacognitive Awareness of Non-English Majors in L2 Listening

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

An Introduction to Simio for Beginners

Ricochet Robots - A Case Study for Human Complex Problem Solving

MBA6941, Managing Project Teams Course Syllabus. Course Description. Prerequisites. Course Textbook. Course Learning Objectives.

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Backwards Numbers: A Study of Place Value. Catherine Perez

1.11 I Know What Do You Know?

Tap vs. Bottled Water

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Soaring With Strengths

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Grounding Language for Interactive Task Learning

On-Line Data Analytics

Short vs. Extended Answer Questions in Computer Science Exams

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

Module Title: Managing and Leading Change. Lesson 4 THE SIX SIGMA

Learning From the Past with Experiment Databases

Exploring Creativity in the Design Process:

SARDNET: A Self-Organizing Feature Map for Sequences

Action Models and their Induction

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Abstractions and the Brain

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT

(Sub)Gradient Descent

Department of Geography Bachelor of Arts in Geography Plan for Assessment of Student Learning Outcomes The University of New Mexico

Transcription:

Variations on a Theory of Problem Solving Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California, USA Chris Pearce, Yu Bai, Charlotte Worsfold, Mike Barley Department of Computer Science University of Auckland, Auckland, NZ This research was supported by Grant N00014-15-1-2517. We thank Miranda Emery, John Laird, Rao Kambhampati, Chris MacLellan, and Manuela Veloso for useful input.

Aims of the Research The ability to solve novel problems is a distinctive feature of human cognition, but current accounts are incomplete. We desire a computational theory of problem solving that: Retains as many of the core assumptions from the standard framework as possible; Accounts for the great variety of strategies observed in humans and machines; Explain how domain expertise can reduce search and make problem solving more effective. In this talk, we present such a revised theory and describe HPS, an implemented architecture that incorporates its tenets. 2

The Standard Theory The standard theory of problem solving (Newell & Simon, 1961) makes a number of claims: Problem solving involves the mental representation, interpretation, and manipulation of symbol structures. This process involves search through a space of candidates that it encodes as such symbol structures. Search is not exhaustive but rather is guided by heuristics that make it selective and tractable. Problem solvers use means-ends analysis to decompose complex problems into simpler ones. Expert behavior calls on knowledge about a domain to reduce and sometimes even eliminate search. Evidence from repeated empirical studies has been consistent with most aspects of this theory. 3

Means-Ends Analysis As embodied in Newell and Simon s General Problem Solver, means-ends analysis relies on four basic ideas: Problem solving interleaves transforming the current state into a desired one with applying an operator to do the transformation. Operators are considered only if they reduce differences between the current and desired state. Problem solving involves search through a space of alternative problem decompositions. The selected operator determines the structure of the resulting problem decomposition. Only the second assumption is limiting and problematic. The other three tenets may well be worth retaining. 4

A Revised Theory of Problem Solving We propose a revised theory of problem solving that replaces means-ends analysis with three postulates: Problem solving involves recursively dividing problems into subproblems, with solutions stated as decomposition trees. Search details - operator generation / evaluation, node selection, success / failure criteria - are controlled by strategic parameters. Domain expertise is often encoded as generalized decompositions for breaking a problem into subproblems (as in HTNs). This revision retains the key ideas of means-ends analysis without committing to chaining off goals. We have embedded these assumptions in HPS, an architecture for hierarchical problem solving. 5

Encoding Problem Decompositions The HPS architecture encodes problem solutions - both partial and complete - as decomposition trees. Each element in such an AND tree has two components: A problem, which includes a state and a goal description; A decomposition, which specifies an operator instance that breaks the problem into: A down subproblem, which must be solved before applying the operator instance; A right subproblem, which must be solved afterward to achieve the parent s goals. Each operator instance has application conditions, expected results, and constraints on shared variables. 6

Encoding Problem Decompositions State descriptions S1: (on B C) (ontable A D) (ontable C D) S2: (holding B) (ontable A D) (ontable C D) S3: (ontable B D) (ontable A D) (ontable C D) S4: (holding C) (ontable B D) (ontable A D) S5: (on C A) (ontable B D) (ontable A D) Goal descriptions G1: (on C A) G2: (not (on?any A)) (holding C) G3: (ontable C?y) (not (on?any C)) (not (holding?other)) G4: (on B C) (not (holding?any)) G5: (holding B) 7

Organizing Nodes into Search Trees Here is a search tree HPS generates during its problem solving, with nodes along a successful path shaded. solution Each node (partial plan) elaborates its parent by adding a new subproblem decomposition. Numbers reflect generation order. 8

The HPS Problem-Solving Cycle HPS operates in cycles that take one of five meta-level actions: Option 1. If there are enough acceptable plans or no further choices, then halt and return these solutions. Option 2. If the plan for the current node is acceptable, then store it and select another node in the search tree. Option 3. If the current plan is unacceptable (e.g., involves a loop), then mark it as failed and select another node in the search tree. Option 4. If the current plan has no operator for focus subproblem F, then generate operators for F and calculate their evaluation scores. Option 5. If the current plan has untried operators but subproblem F has none, then select one and elaborate the plan by decomposing F. The focus problem is an open task in a hierarchical plan that HPS does not yet consider solved. 9

Modulating the Search Process HPS incorporates ten parameters that determine its choices during problem solving: Option 1. When checking whether to halt, HPS calls on a parameter that sees if it has found enough acceptable plans. Option 2. A parameter tests for plan acceptability (e.g., if all goals satisfied); if so, another picks the next node to visit (e.g., the parent). Option 3. A fifth parameter tests the plan for failure (e.g., if a loop has occurred); if so, another one chooses the next node to visit. Option 4. A parameter generates operator instances (e.g., forward or backward chaining); another computes a score for each candidate. Option 5. Final parameters select an operator to decompose the focus problem, score the resulting elaboration, and select the next node. These settings are intrinsically composable, so that different combinations can reproduce many distinct strategies. 10

Encoding and Using Domain Expertise Strategic parameters support different varieties of problem solving, but not expert behavior; to this end, HPS can: Store domain-specific knowledge on how to decompose classes of problems into subproblems, with each method stating: An associated task name, conditions, and optional effects; A down, main, and right subtask that must be carried out. Use this knowledge during operator generation and evaluation; On entering a hierarchical method, considers only candidates with relevant task names and matched conditions; On completing a method, returns to knowledge-lean search. HPS can solve problems stated only as tasks, only as state-goal pairs, or as their combination, much like UPS (To et al., 2015). 11

Basic Problem-Solving Ability We have demonstrated HPS's ability to solve novel problems on two familiar domains: Blocks World - finding plans that produce desired configurations: Solutions to problems ranged from four to 12 steps. Depth-first forward chaining solved 30 out of 30 tasks. Depth-first means ends solved only 23 of these problems. Kinship inference - deducing complex relations from simple ones: Problems required from one to eight inference steps. Forward chaining could not solve more complex tasks. Goal-driven means-ends analysis had little difficulty. These basic results show that strategy effectiveness interacts with domain characteristics. 12

Forward Chaining vs. Means-Ends Means-ends analysis / log(nodes) 0.0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Means-ends does not solve Each point denotes a pair of runs on a distinct problem from the Blocks World 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Forward chaining / log(nodes) HPS parameters support empirical comparison of different strategies. This study compares the nodes visited by forward chaining and means-ends analysis when combined with depth-first search. 13

Depth-First Search vs. Iterative Sampling Iterative sampling 0.0 0.9 1.8 2.7 3.6 4.5 Each point denotes a pair of runs on a distinct problem from the Blocks World 0 0.9 1.8 2.7 3.6 4.5 Depth-first search HPS parameters support empirical comparison of different strategies. This study compares the nodes visited by depthfirst search and iterative sampling when combined with forward chaining. 14

Benefits of Domain Knowledge Our final study examined the ability to use domain-specific decompositions to reduce search. We showed that HPS solves Blocks World problems efficiently using hierarchical methods, including recursive ones: When stated in terms an initial state and a task to accomplish; When specified using an initial state, goals, and a task; When stated using an initial state and goals but no task. The first two conditions required little search; the third took more but still less than without domain knowledge. The second and third settings exceed the abilities of traditional HTN planning systems. 15

Relations to Earlier Research How do our theoretical claims, and their embodiment in HPS, relate to previous research? Problem solutions are represented by decomposition trees. Widely adopted in both theorem proving and HTN planning. Strategic content underlies variation in problem solving. Soar uses meta-level control to implement different strategies. HPS is closer to Prodigy, but has substantially greater coverage. Domain expertise takes the form of generalized decompositions. In HTN planning, but seldom joined with primitive operators. Other forms of knowledge, like rejection rules, are possible. We have drawn on these earlier ideas but also combined them to produce an extended theory of problem solving. 16

But What About Soar? Soar (Laird, 2012) can reproduce the same strategies as HPS, but they offer different theories of problem solving: HPS makes stronger assumptions about structures and processes; Just as Soar makes stronger assumptions than EPIC or even Lisp. The ability to generate the same behaviors is not equivalent to providing the same theoretical account. Soar does not offer architecture-level support for storing multiple problem decompositions; One could write Soar programs that do so, but this extra layer of interpretation would slow processing. The tradeoff is that HPS must store a search tree of different decompositions, which may have other processing costs. 17

Plans for Future Work Our revised theory, and the HPS system, offer an improved account of problem solving, but future work should: Extend the framework to reproduce regression planning and abstraction. Support more adaptive problem solving by: Collecting meta-level data (e.g., relative branching factors) Making parameter settings conditional on these statistics Learn decompositions from successful problem solving by: Recording traces of solutions for particular problems; Storing generalized versions of these traces for future use. The result will be an even more flexible and inclusive account of problem solving in cognitive systems. 18

End of Presentation! 19