Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Similar documents
Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Intelligent Agents. Chapter 2. Chapter 2 1

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 10: Reinforcement Learning

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

LEGO MINDSTORMS Education EV3 Coding Activities

MAE Flight Simulation for Aircraft Safety

Mathematics Success Level E

Strategic Management (MBA 800-AE) Fall 2010

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Reinforcement Learning by Comparing Immediate Reward

SSE - Supervision of Electrical Systems

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

CSL465/603 - Machine Learning

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Firms and Markets Saturdays Summer I 2014

Measurement. When Smaller Is Better. Activity:

Modeling user preferences and norms in context-aware systems

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Functional Skills Mathematics Level 2 assessment

Aviation English Solutions

A Math Adventure Game Pi and the The Lost Function Episode 1 - Pre-Algebra/Algebra

Guidelines for drafting the participant observation report

PUBLIC SPEAKING, DISTRIBUTION OF LITERATURE, COMMERCIAL SOLICITATION AND DEMONSTRATIONS IN PUBLIC AREAS

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

Fundraising 101 Introduction to Autism Speaks. An Orientation for New Hires

Pronunciation: Student self-assessment: Based on the Standards, Topics and Key Concepts and Structures listed here, students should ask themselves...

21st Century Community Learning Center

Robot manipulations and development of spatial imagery

Course Content Concepts

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

EdX Learner s Guide. Release

MYCIN. The MYCIN Task

Get with the Channel Partner Program

Innovative Methods for Teaching Engineering Courses

Parent Information Welcome to the San Diego State University Community Reading Clinic

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Phys4051: Methods of Experimental Physics I

Justification Paper: Exploring Poetry Online. Jennifer Jones. Michigan State University CEP 820

Laboratorio di Intelligenza Artificiale e Robotica

PHY2048 Syllabus - Physics with Calculus 1 Fall 2014

HOLY CROSS PREPARATORY SCHOOL TRAVEL PLAN School Travel Plan Holy Cross Preparatory School 1

MAKING YOUR OWN ALEXA SKILL SHRIMAI PRABHUMOYE, ALAN W BLACK

Axiom 2013 Team Description Paper

An Introduction to Simulation Optimization

B.S/M.A in Mathematics

Implementing a tool to Support KAOS-Beta Process Model Using EPF

Seminar - Organic Computing

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

ME 4495 Computational Heat Transfer and Fluid Flow M,W 4:00 5:15 (Eng 177)

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Section 7, Unit 4: Sample Student Book Activities for Teaching Listening

Colorado State University Department of Construction Management. Assessment Results and Action Plans

AMULTIAGENT system [1] can be defined as a group of

By Merrill Harmin, Ph.D.

Multimedia Courseware of Road Safety Education for Secondary School Students

Space Travel: Lesson 2: Researching your Destination

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Introduction to Sociology SOCI 1101 (CRN 30025) Spring 2015

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Class of 2018 Junior Proposal for Senior Project. Make the Most of Your Journey

Thinking Maps for Organizing Thinking

A process by any other name

Software Maintenance

Can Money Buy Happiness? EPISODE # 605

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Office Hours: Mon & Fri 10:00-12:00. Course Description

PLANT SCIENCE/SOIL SCIENCE 2100 INTRODUCTION TO SOIL SCIENCE

The Foundations of Interpersonal Communication

Computers Change the World

Synthesis Essay: The 7 Habits of a Highly Effective Teacher: What Graduate School Has Taught Me By: Kamille Samborski

Foothill College Summer 2016

A. Permission. All students must have the permission of their parent or guardian to participate in any field trip.

Computational Data Analysis Techniques In Economics And Finance

Lecture 1: Machine Learning Basics

CS 100: Principles of Computing

White Paper. The Art of Learning

Speeding Up Reinforcement Learning with Behavior Transfer

Keith Weigelt. University of Pennsylvania The Wharton School Management Department 2022 Steinberg-Dietrich Hall Philadelphia, PA (215)

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

GCSE. Mathematics A. Mark Scheme for January General Certificate of Secondary Education Unit A503/01: Mathematics C (Foundation Tier)

Washington Homeschool Organization

Culinary Arts and Foodservice Management

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS

PowerTeacher Gradebook User Guide PowerSchool Student Information System

Specification of the Verity Learning Companion and Self-Assessment Tool

Telekooperation Seminar

STANDARDS AND RUBRICS FOR SCHOOL IMPROVEMENT 2005 REVISED EDITION

Generating Test Cases From Use Cases

MAT 122 Intermediate Algebra Syllabus Summer 2016

SOFTWARE EVALUATION TOOL

IT4305: Rapid Software Development Part 2: Structured Question Paper

GEOG 473/573: Intermediate Geographic Information Systems Department of Geography Minnesota State University, Mankato

Automating the E-learning Personalization

CS 101 Computer Science I Fall Instructor Muller. Syllabus

INTERMEDIATE ALGEBRA PRODUCT GUIDE

"Be who you are and say what you feel, because those who mind don't matter and

Transcription:

s and environments Percepts Intelligent s? Chapter 2 Actions s include humans, robots, softbots, thermostats, etc. The agent function maps from percept histories to actions: f : P A The agent program runs on the physical architecture to produce f Chapter 2 1 Chapter 2 4 Read Dr. Luke Lisp Tutorial Reminders First Homework due next Monday A Vacuum-cleaner world B Percepts: location and contents, e.g., [A, Dirty] Actions: Left, Right,, NoOp Chapter 2 2 Chapter 2 5 Outline s and environments Rationality PEAS (Performance measure,,, ) types types A vacuum-cleaner agent Percept sequence Action [A, Clean] Right [A, Dirty] [B, Clean] Left [B, Dirty] [A, Clean], [A, Clean] Right [A, Clean], [A, Dirty].. function Reflex-Vacuum-([location,status]) returns an action if status = Dirty then return else if location = A then return Right else if location = B then return Left What is the right function? Can it be implemented in a small agent program? Chapter 2 3 Chapter 2 6

Rationality Fixed performance measure evaluates the environment sequence one point per square cleaned up in time T? one point per clean square per time step, minus one per move? penalize for > k dirty squares? A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date Rational omniscient percepts may not supply all relevant information Rational clairvoyant action outcomes may not be as expected Hence, rational successful Rational exploration, learning, autonomy Internet shopping agent Performance measure???????? Chapter 2 7 Chapter 2 10 PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure???????? Internet shopping agent Performance measure?? price, quality, appropriateness, efficiency?? current and future WWW sites, vendors, shippers?? display to user, follow URL, fill in form?? HTML pages (text, graphics, scripts) Chapter 2 8 Chapter 2 11 PEAS To design a rational agent, we must specify the task environment Consider, e.g., the task of designing an automated taxi: Performance measure?? safety, destination, profits, legality, comfort,...?? US streets/freeways, traffic, pedestrians, weather,...?? steering, accelerator, brake, horn, speaker/display,...?? video, accelerometers, gauges, engine sensors, keyboard, GPS,... Observable?? Deterministic?? types Chapter 2 9 Chapter 2 12

types Deterministic?? types Yes Semi Semi No Chapter 2 13 Chapter 2 16 types types Yes Semi Semi No Yes Yes Yes No Chapter 2 14 Chapter 2 17 types types Yes Semi Semi No Yes Yes Yes No Yes No Yes (except auctions) No The environment type largely determines the agent design The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent Chapter 2 15 Chapter 2 18

types Goal-based agents Four basic types in order of increasing generality: simple reflex agents reflex agents with state goal-based agents utility-based agents All these can be turned into learning agents Goals What it will be like if I do action A Chapter 2 19 Chapter 2 22 Simple reflex agents Utility-based agents Condition-action rules Utility What it will be like if I do action A How happy I will be in such a state Chapter 2 20 Chapter 2 23 Reflex agents with state Learning agents Performance standard Critic Condition-action rules feedback learning goals Learning element changes knowledge Performance element Problem generator Chapter 2 21 Chapter 2 24

Summary s interact with environments through actuators and sensors The agent function describes what the agent does in all circumstances The performance measure evaluates the environment sequence A perfectly rational agent maximizes expected performance programs implement (some) agent functions PEAS descriptions define task environments s are categorized along several dimensions: observable? deterministic? episodic? static? discrete? single-agent? Several basic agent architectures exist: reflex, reflex with state, goal-based, utility-based Chapter 2 25