A Beginner s Introduction to Heuristic Search Planning

Similar documents
Discriminative Learning of Beam-Search Heuristics for Planning

Learning and Transferring Relational Instance-Based Policies

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Domain Knowledge in Planning: Representation and Use

FF+FPG: Guiding a Policy-Gradient Planner

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

BMBF Project ROBUKOM: Robust Communication Networks

Radius STEM Readiness TM

Visual CP Representation of Knowledge

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Rule-based Expert Systems

AN EXAMPLE OF THE GOMORY CUTTING PLANE ALGORITHM. max z = 3x 1 + 4x 2. 3x 1 x x x x N 2

Lecture 10: Reinforcement Learning

An Investigation into Team-Based Planning

Planning in Intelligent Systems: Model-based Approach to Autonomous Behavior

Knowledge-Based - Systems

Towards Team Formation via Automated Planning

Action Models and their Induction

Evolution of Collective Commitment during Teamwork

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

Major Milestones, Team Activities, and Individual Deliverables

Introduction to Simulation

Intelligent Agents. Chapter 2. Chapter 2 1

Artificial Neural Networks written examination

New Project Learning Environment Integrates Company Based R&D-work and Studying

9.85 Cognition in Infancy and Early Childhood. Lecture 7: Number

Probability and Statistics Curriculum Pacing Guide

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Lecture 1: Basic Concepts of Machine Learning

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics

Integrating derivational analogy into a general problem solving architecture

Planning with External Events

EGRHS Course Fair. Science & Math AP & IB Courses

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Lecture 2: Quantifiers and Approximation

Compositional Semantics

GACE Computer Science Assessment Test at a Glance

KIS MYP Humanities Research Journal

Navigating the PhD Options in CMS

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Axiom 2013 Team Description Paper

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT

An Introduction to Simio for Beginners

Reinforcement Learning by Comparing Immediate Reward

Go fishing! Responsibility judgments when cooperation breaks down

EXECUTIVE SUMMARY. Online courses for credit recovery in high schools: Effectiveness and promising practices. April 2017

Bachelor Class

Lecture 1: Machine Learning Basics

AQUA: An Ontology-Driven Question Answering System

A theoretic and practical framework for scheduling in a stochastic environment

The Good Judgment Project: A large scale test of different methods of combining expert predictions

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Methodology

Getting the Story Right: Making Computer-Generated Stories More Entertaining

Learning to Schedule Straight-Line Code

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Using focal point learning to improve human machine tacit coordination

Learning Cases to Resolve Conflicts and Improve Group Behavior

While you are waiting... socrative.com, room number SIMLANG2016

(Sub)Gradient Descent

Two-Valued Logic is Not Sufficient to Model Human Reasoning, but Three-Valued Logic is: A Formal Analysis

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Research Area

NUMBERS AND OPERATIONS

Abstractions and the Brain

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

Rule Learning With Negation: Issues Regarding Effectiveness

Grades. From Your Friends at The MAILBOX

Kelli Allen. Vicki Nieter. Jeanna Scheve. Foreword by Gregory J. Kaiser

CS Machine Learning

Multimedia Application Effective Support of Education

Ricochet Robots - A Case Study for Human Complex Problem Solving

Clouds = Heavy Sidewalk = Wet. davinci V2.1 alpha3

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

South Carolina English Language Arts

M55205-Mastering Microsoft Project 2016

Mathematics subject curriculum

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

Regret-based Reward Elicitation for Markov Decision Processes

Laboratorio di Intelligenza Artificiale e Robotica

TD(λ) and Q-Learning Based Ludo Players

Top US Tech Talent for the Top China Tech Company

A Comparison of Annealing Techniques for Academic Course Scheduling

Rule Learning with Negation: Issues Regarding Effectiveness

Course Development Using OCW Resources: Applying the Inverted Classroom Model in an Electrical Engineering Course

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Missouri Mathematics Grade-Level Expectations

An Introduction to Simulation Optimization

UC Merced Proceedings of the Annual Meeting of the Cognitive Science Society

Probability and Game Theory Course Syllabus

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

A simulated annealing and hill-climbing algorithm for the traveling tournament problem

1 Use complex features of a word processing application to a given brief. 2 Create a complex document. 3 Collaborate on a complex document.

Get with the Channel Partner Program

Detecting English-French Cognates Using Orthographic Edit Distance

Transcription:

A Beginner s Introduction to Heuristic Search Planning 7. Going Further Malte Helmert Gabriele Röger AAAI 2015 Tutorial January 25, 2015

What else happens...... in heuristic planning?

Heuristics Other current topics on heuristics for classical planning: making delete relaxations less relaxed semi-relaxed planning (Keyder et al., ICAPS 2012) red-black planning (e.g., Katz et al., AAAI 2013) heuristics based on linear programming (e.g., Pommerening et al., ICAPS 2014; AAAI 2015) heuristics vs. search vs. inference (e.g., Lipovetzky & Geffner, ECAI 2012)

Search Algorithms Search algorithms mostly interesting for satisficing planning: escaping local minima and heuristic plateaus: using multiple heuristics (e.g. Röger & Helmert, ICAPS 2010) combining local and systematic search (e.g. Hoffmann & Nebel, JAIR 2001; Xie et al., AAAI 2014) randomized systematic search (e.g. Valenzano et al., AAAI 2014) random-walk algorithms (e.g. Nakhost & Müller, IJCAI 2013) search algorithms tailored to specific objectives: anytime strategies for satisficing planning (Richter & Westphal, JAIR 2010) many papers from Wheeler Ruml s research group (e.g., Thayer et al., ICAPS 2012)

Search Space Pruning Search space pruning (mainly for optimal search): with symmetries (e.g., Pochter et al., IJCAI 2011) with redundant paths ( partial-order reduction ): (e.g. Wehrle et al., ICAPS 2013)

Invariant Synthesis invariant: property of all reachable states (e.g., Rintanen, ECAI 2008) example: mutually exclusive atoms set of atoms of which at most one is true in each state basis of translation from PDDL to SAS + (Helmert, AIJ 2009) useful for strengthening heuristics, e.g., constrained PDBs (Haslum et al., AAAI 2005))

Preferred Operators heuristics often can identify promising actions for example helpful action: first action in a relaxed plan (Hoffmann & Nebel, JAIR 2001) try these actions first during search often highly positive impact on overall performance (Richter & Helmert, ICAPS 2009)

Portfolios Idea: solve tasks by running multiple (more or less) independent planning systems different strategies: fixed schedule select planners after task analysis results of IPC 2014 classic tracks http://helios.hud.ac.uk/scommv/ipc-14/ results of IPC 2014 learning track http://www.cs.colostate.edu/~ipc2014/

What else happens...... in classical planning?

Symbolic Planning Symbolic search algorithms: search processes sets of states at a time operators, state sets, heuristics etc. represented by binary decision diagrams (BDDs) (or related structures) planning systems: Gamer (Edelkamp & Kissmann, IPC 2008) SymBA (Torralba et al., IPC 2014)... comprehensive treatment: Álvaro Torralba s PhD thesis (Torralba, 2015)

SAT Planning SAT planning: create propositional formula that is satisfiable iff there is a plan with at most k steps use off-the-shelf or specially tailored SAT solver to find plans e.g., Madagascar planner (Rintanen, IPC 2011 & 2014) many papers by Jussi Rintanen (e.g., Rintanen; AIJ 2012) related: property-directed reachability (Suda, JAIR 2014)

Other Approaches Many other approaches: partial-order causal link planning (POCL) compilation to (mixed) integer programming (IP/MIP) compilation to answer-set programming (ASP) compilation to quantified boolean formulae (QBF)...... but heuristic search, symbolic search and SAT are currently working best

What else happens...... in planning?

More general planning topics More general kinds of planning include: offline: online planning; planning and execution discrete: continuous planning (e.g., real-time/hybrid systems) deterministic: FOND planning; probabilistic planning single-agent: multi-agent planning; general game playing; game-theoretic planning fully observable: POND planning; conformant planning sequential: e.g., temporal planning Domain-dependent planning problems in AI include: pathfinding, including grid-based and multi-agent (MAPF) continuous motion planning

Resources

Resources ICAPS conferences (International Conference for Automated Planning and Scheduling): http://www.icaps-conference.org/ http://icaps15.icaps-conference.org/ Planing Domain Definition Language (PDDL): http://ipc.informatik.uni-freiburg.de/pddlresources International Planning Competitions (IPC): http://ipc.icaps-conference.org/ Fast Downward planning system: http://www.fast-downward.org/ VAL plan validator: https://github.com/kcl-planning/val

The End Thank you for your attention!