Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Similar documents
Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Intelligent Agents. Chapter 2. Chapter 2 1

LEGO MINDSTORMS Education EV3 Coding Activities

Lecture 10: Reinforcement Learning

MAE Flight Simulation for Aircraft Safety

Reinforcement Learning by Comparing Immediate Reward

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Seminar - Organic Computing

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Axiom 2013 Team Description Paper

SSE - Supervision of Electrical Systems

Introduction to Simulation

Modeling user preferences and norms in context-aware systems

MAKING YOUR OWN ALEXA SKILL SHRIMAI PRABHUMOYE, ALAN W BLACK

B.S/M.A in Mathematics

Laboratorio di Intelligenza Artificiale e Robotica

A Math Adventure Game Pi and the The Lost Function Episode 1 - Pre-Algebra/Algebra

Computers Change the World

Aviation English Solutions

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

Speeding Up Reinforcement Learning with Behavior Transfer

Action Models and their Induction

AMULTIAGENT system [1] can be defined as a group of

An Introduction to Simulation Optimization

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

Firms and Markets Saturdays Summer I 2014

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

A Practical Approach to Embedded Systems Engineering Workforce Development

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

CSL465/603 - Machine Learning

Robot Shaping: Developing Autonomous Agents through Learning*

Justification Paper: Exploring Poetry Online. Jennifer Jones. Michigan State University CEP 820

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

COMMUNITY RESOURCES, INC.

Telekooperation Seminar

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

Procedia - Social and Behavioral Sciences 237 ( 2017 )

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots

Emergency Management Games and Test Case Utility:

Fundraising 101 Introduction to Autism Speaks. An Orientation for New Hires

Laboratorio di Intelligenza Artificiale e Robotica

Can Money Buy Happiness? EPISODE # 605

DOCTOR OF PHILOSOPHY HANDBOOK

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Solution Focused Methods RAYYA GHUL 2017

Self Study Report Computer Science

TOKEN-BASED APPROACH FOR SCALABLE TEAM COORDINATION. by Yang Xu PhD of Information Sciences

Functional Skills Mathematics Level 2 assessment

High-level Reinforcement Learning in Strategy Games

Software Development: Programming Paradigms (SCQF level 8)

Division Strategies: Partial Quotients. Fold-Up & Practice Resource for. Students, Parents. and Teachers

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Robot manipulations and development of spatial imagery

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

Surprise-Based Learning for Autonomous Systems

Stopping rules for sequential trials in high-dimensional data

The Good Judgment Project: A large scale test of different methods of combining expert predictions

BMBF Project ROBUKOM: Robust Communication Networks

Rule Learning With Negation: Issues Regarding Effectiveness

Navigating the PhD Options in CMS

UNIT ONE Tools of Algebra

IBM Software Group. Mastering Requirements Management with Use Cases Module 6: Define the System

FIGURE 8.2. Job Shadow Workplace Supervisor Feedback Form.

MYCIN. The MYCIN Task

INTERMEDIATE ALGEBRA PRODUCT GUIDE

Python Machine Learning

Thesis-Proposal Outline/Template

The Flaws, Fallacies and Foolishness of Benchmark Testing

Measuring physical factors in the environment

While you are waiting... socrative.com, room number SIMLANG2016

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Interaction Design Considerations for an Aircraft Carrier Deck Agent-based Simulation

Australian Journal of Basic and Applied Sciences

Faculty of Engineering

Time series prediction

The Foundations of Interpersonal Communication

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Hayward Unified School District Community Meeting #2 at

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Group A Lecture 1. Future suite of learning resources. How will these be created?

What is a Mental Model?

Computer Science 141: Computing Hardware Course Information Fall 2012

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES

Ministry of Education, Republic of Palau Executive Summary

Cooperative Systems Modeling, Example of a Cooperative e-maintenance System

The Political Engagement Activity Student Guide

Infrared Paper Dryer Control Scheme

The NH Parent Partner Program

Planning with External Events

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

A Reinforcement Learning Variant for Control Scheduling

INNOWIZ: A GUIDING FRAMEWORK FOR PROJECTS IN INDUSTRIAL DESIGN EDUCATION

PUBLIC SPEAKING, DISTRIBUTION OF LITERATURE, COMMERCIAL SOLICITATION AND DEMONSTRATIONS IN PUBLIC AREAS

Measurement & Analysis in the Real World

Transcription:

Intelligent Agents Chapter 2 1 Outline Agents and environments Rationality PEAS (Performance measure, Environment, Actuators, Sensors) Agent types 2

Agents and environments sensors environment percepts actions? agent actuators Agents include humans, robots, softbots, thermostats, etc. The agent function maps from percept histories to actions: f : P A The agent program runs on the physical architecture to produce f 3 Vacuum-cleaner world A B Percepts: location and contents, e.g., [A, Dirty] Actions: Left, Right, Suck, NoOp 4

A vacuum-cleaner agent Percept sequence [A, Clean] [A, Dirty] [B,Clean] [B,Dirty] [A, Clean], [A, Clean] [A, Clean], [A, Dirty]. Action Right Suck Left Suck Right Suck. 5 function REFLEX-VACUUM-AGENT([location,status]) returns an action if status = Dirty then return Suck else if location = A then return Right else if location = B then return Left What is the right function? Can it be implemented in a small agent program? 6

Rationality Fixed performance measure evaluates the environment sequence one point per square cleaned up in time T? one point per clean square per time step, minus one per move? penalize for >kdirty squares? A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date Rational omniscient Rational clairvoyant Rational successful Rational exploration, learning, autonomy 7 PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure?? Environment?? Actuators?? Sensors?? 8

PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure?? safety, destination, profits, legality, comfort,... Environment?? US streets/freeways, traffic, pedestrians, weather,... Actuators?? steering, accelerator, brake, horn, speaker/display,... Sensors?? video, accelerometers, gauges, engine sensors, keyboard, GPS,... 9 Internet shopping agent Performance measure?? Environment?? Actuators?? Sensors?? 10

Observable?? Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? 11 Observable?? Yes Yes No No Deterministic?? Episodic?? Static?? Discrete?? Single-agent?? 12

Observable?? Yes Yes No No Deterministic?? Yes No Partly No Episodic?? Static?? Discrete?? Single-agent?? 13 Observable?? Yes Yes No No Deterministic?? Yes No Partly No Episodic?? No No No No Static?? Discrete?? Single-agent?? 14

Observable?? Yes Yes No No Deterministic?? Yes No Partly No Episodic?? No No No No Static?? Yes Semi Semi No Discrete?? Single-agent?? 15 Observable?? Yes Yes No No Deterministic?? Yes No Partly No Episodic?? No No No No Static?? Yes Semi Semi No Discrete?? Yes Yes Yes No Single-agent?? 16

Observable?? Yes Yes No No Deterministic?? Yes No Partly No Episodic?? No No No No Static?? Yes Semi Semi No Discrete?? Yes Yes Yes No Single-agent?? Yes No Yes (no auctions) No The environment type largely determines the agent design The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent 17 Agent types Four basic types in order of increasing generality: simple reflex agents reflex agents with state goal-based agents utility-based agents All these can be turned into learning agents 18

Simple reflex agents Agent Sensors Condition action rules What the world is like now What action I should do now Environment Actuators 19 Reflex agents with state State How the world evolves What my actions do Condition action rules Sensors What the world is like now What action I should do now Environment Agent Actuators 20

Goal-based agents State How the world evolves What my actions do Goals Sensors What the world is like now What it will be like if I do action A What action I should do now Environment Agent Effectors 21 Utility-based agents State How the world evolves What my actions do Utility Sensors What the world is like now What it will be like if I do action A How happy I will be in such a state What action I should do now Environment Agent Effectors 22

Performance standard Learning agents Critic Sensors feedback learning goals Learning element changes knowledge Performance element Environment Problem generator Agent Actuators 23 AIMA code The code for each topic is divided into four directories: agents: code defining agent types and programs algorithms: code for the methods used by the agent programs environments: code defining environment types, simulations domains: problem types and instances for input to algorithms (Often run algorithms on domains rather than agents in environments.) (setq joe (make-agent :name joe :body (make-agent-body) :program (make-dumb-agent-program))) (defun make-dumb-agent-program () (let ((memory nil)) # (lambda (percept) (push percept memory) no-op))) 24