Introduction to Multi-Agent Systems

Similar documents
Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Intelligent Agents. Chapter 2. Chapter 2 1

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Agent-Based Software Engineering

Seminar - Organic Computing

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

Action Models and their Induction

TOKEN-BASED APPROACH FOR SCALABLE TEAM COORDINATION. by Yang Xu PhD of Information Sciences

An OO Framework for building Intelligence and Learning properties in Software Agents

Reinforcement Learning by Comparing Immediate Reward

Evolution of Collective Commitment during Teamwork

LEGO MINDSTORMS Education EV3 Coding Activities

AMULTIAGENT system [1] can be defined as a group of

Axiom 2013 Team Description Paper

Modeling user preferences and norms in context-aware systems

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS

Lecture 10: Reinforcement Learning

GACE Computer Science Assessment Test at a Glance

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

November 17, 2017 ARIZONA STATE UNIVERSITY. ADDENDUM 3 RFP Digital Integrated Enrollment Support for Students

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

High-level Reinforcement Learning in Strategy Games

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Laboratorio di Intelligenza Artificiale e Robotica

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

An Investigation into Team-Based Planning

CREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Java Programming. Specialized Certificate

Lecture 1: Basic Concepts of Machine Learning

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

Expert Reference Series of White Papers. Mastering Problem Management

Davidson College Library Strategic Plan

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

PATROL OFFICER CQB. A u n i q u e C Q B c o u r s e f o r P o l i c e p e r s o n a l o n l y.

Laboratorio di Intelligenza Artificiale e Robotica

Firms and Markets Saturdays Summer I 2014

On the Combined Behavior of Autonomous Resource Management Agents

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

Emergency Management Games and Test Case Utility:

Software Development: Programming Paradigms (SCQF level 8)

An Introduction to the Minimalist Program

M55205-Mastering Microsoft Project 2016

Telekooperation Seminar

Higher education is becoming a major driver of economic competitiveness

ProFusion2 Sensor Data Fusion for Multiple Active Safety Applications

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Cooperative Game Theoretic Models for Decision-Making in Contexts of Library Cooperation 1

MYCIN. The MYCIN Task

The open source development model has unique characteristics that make it in some

Introduction to Simulation

Innovating Toward a Vibrant Learning Ecosystem:

The Success Principles How to Get from Where You Are to Where You Want to Be

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Software Maintenance

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Ontologies vs. classification systems

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Probabilistic Latent Semantic Analysis

SSE - Supervision of Electrical Systems

ASSESSMENT OF STUDENT LEARNING OUTCOMES WITHIN ACADEMIC PROGRAMS AT WEST CHESTER UNIVERSITY

Rover Races Grades: 3-5 Prep Time: ~45 Minutes Lesson Time: ~105 minutes

Lecture 1: Machine Learning Basics

HARPER ADAMS UNIVERSITY Programme Specification

MASTER S COURSES FASHION START-UP

Statewide Strategic Plan for e-learning in California s Child Welfare Training System

Moderator: Gary Weckman Ohio University USA

College Pricing. Ben Johnson. April 30, Abstract. Colleges in the United States price discriminate based on student characteristics

Learning Methods for Fuzzy Systems

Surprise-Based Learning for Autonomous Systems

Towards Team Formation via Automated Planning

A. Permission. All students must have the permission of their parent or guardian to participate in any field trip.

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Speeding Up Reinforcement Learning with Behavior Transfer

Embedded Real-Time Systems

Study in Berlin at the HTW. Study in Berlin at the HTW

A theoretic and practical framework for scheduling in a stochastic environment

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Thoughtful Commitment: How the Greece Teachers Association (GTA) Advances Social Justice, Student Centered Advocacy and Collaboration ADV400

BMBF Project ROBUKOM: Robust Communication Networks

MSW Field Placement Manual Foundation and Advanced

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Title II of WIOA- Adult Education and Family Literacy Activities 463 Guidance

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

The Future of Consortia among Indian Libraries - FORSA Consortium as Forerunner?

European Cooperation in the field of Scientific and Technical Research - COST - Brussels, 24 May 2013 COST 024/13

Knowledge-Based - Systems

THE DEPARTMENT OF DEFENSE HIGH LEVEL ARCHITECTURE. Richard M. Fujimoto

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

Multiagent Simulation of Learning Environments

Geospatial Visual Analytics Tutorial. Gennady Andrienko & Natalia Andrienko

Using focal point learning to improve human machine tacit coordination

Infrared Paper Dryer Control Scheme

Intelligent Agent Technology in Command and Control Environment

Transcription:

Introduction to Multi-Agent Systems Michal Jakob, Milan Rollo Agent Technology Center, Dept. of Computer Science and Engineering, FEE, Czech Technical University Selected illustrations taken from Russel and Norvig Artificial Intelligence: Modern Approach

General Information Recommended reading: Russel and Norvig: Artificial Intelligence: Modern Approach J. M. Vidal: Multiagent Systems: with NetLogo Examples (available on-line) Y. Shoham and K. Leyton-Brown: Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations (available on-line) M. Wooldridge: An Introduction to MultiAgent Systems V. Marik, O. Stepankova, J. Lazansky a kol.: Umela inteligence (3)

Introduction to Multiagent systems Introduction 3

Trends in Computing Ubiquity: Cost of processing power decreases dramatically (e.g. Moore s Law), computers used everywhere Interconnection: Formerly only user-computer interaction, nowadays distributed/networked machine-to-machine interactions (e.g. Web APIs) Complexity: Elaboration of tasks carried out by computers has grown Delegation: Giving control to computers even in safety-critical tasks (e.g. aircraft or nuclear plant control) Human-orientation: Increasing use of metaphors that better reflect human intuition from everyday life (e.g. GUIs, speech recognition, object orientation) 4

New Challenges for Computer Systems Traditional design problem: How can I build a system that produces the correct output given some input? Each system is more or less isolated, built from scratch Modern-day design problem: How can I build a system that can operate independently on my behalf in a networked, distributed, large-scale environment in which it will need to interact with different other components pertaining to other users? Each system is built into an existing, persistent but constantly evolving computing ecosystem it should be robust with respect to changes No single owner and/or central authority 5

Multiagent Systems (MAS) Multiagent system is a collection of multiple autonomous (intelligent) agents, each acting towards its objectives while all interacting in a shared environment, being able to communicate and possibly coordinating their actions. 6

Human teams and companies Markets and economies Transportation networks Distributed software systems Communication networks Robotic teams 7

Multi-Agent System Engineering Novel paradigm for building robust, scalable and extensible control, planning and decision-making systems socially-inspired computing self-organized teamwork collective (artificial) intelligence MAS become increasingly relevant as the connectivity and intelligence of devices grows! Systems of the future will need to be good at teamwork 8

Application Areas (at ATG) Air Traffic Management Tactical Operations Autonomous Aerial Vehicles Physical/ Critical Infrastructure Security Cybersecurity and Steganography Intelligent Transport Systems 9

user group 1 user group 2 avoid collisions execute monitoring maneuvers land for recharging choose service providers and price form teams anticipating and strategically maximizing detection coordinating activities within teams We Formal want models, the whole data system structures to run and fully decide which automatically. team algorithms join The for only automating human intervention such is in specifying high-level processes tasks for UAVs to complete. decide which services to purchase user group 3 11

Agent-control architecture and programming languages Goal: Developing robust controllers capable of executing complex activities in a dynamic, non-deterministic environment E.g. Avoiding collisions, executing monitoring maneuvers, land for recharging Challenges Modularizing the agent into modules Describing the control logic in a compact form Handling concurrency, interruptions, complex plans, communications, 12

Coalition Formation 1 2 Goal: Forming and incentivizing teams that have highest value E.g. Determining which assets should form a team and how they should split payment for executing a task Challenges determining right coalitions (centralized vs. decentralized) defining payments within coalitions 13

Distributed Coordination Goal: Coordinating assignment of tasks / resource so that constraints are met and an objective function maximized E.g. choosing which areas / targets should be tracked by whom so that coverage / tracking duration is maximized Challenges: primarily algorithmic: efficient scalable algorithms that can handle many constraints distributed algorithms (due to communication limitations or privacy issues) 14

Auctions Goal: Allocate a scarce resource and determine payments so that profit is maximized E.g.: matching UAV teams with task issuers which team should execute which task and for how much Challenges representations: single vs. multi-attribute, single vs. multi-unit, single vs. multi-item protocols: bidding rules, market clearing rules, information dissemination rules bidding strategies centralized vs. distributed 15

Social Choice / Negotiation u(a)=9, u(b)=7, u(c)=11 Goal: Agree on a single choice between multiple agents with different preferences E.g.: choosing between monitoring crop quality or looking for forest fires Challenges define what s best: egalitarian, utilitarian, Nash bargaining solution, pareto efficiency, independence of irrelevant alternatives, non-dictatorship protocols to find the best: the number of iterations / deadlines, stopping rules with or without trusted third party monotonic concession protocol C > B u(a)=11 A > B >C 16

Non-cooperative Game Theory Goal: Acting strategically in the presence of other rational agents E.g. deciding where to check for intruders assuming the intruders know they are going to be checked Challenges defining good strategies: Nash equilibrium, minimax, finding a good / best strategy various extensions: partial observation, sequential interactions, uncertainty about the objectives of the opponent, 17

Introduction to Multi-Agent Systems Defining Agency 21

What is Agent? Definition (Russell & Norvig) An agent is anything that can perceive its environment (through its sensors) and act upon that environment (through its effectors) Focus on situatedness in the environment (embodiment) The agent can only influence the environment but not fully control it (sensor/effector failure, non-determinism)

What is Agent? (2) Definition (Wooldridge & Jennings) An agent is a computer system that is situated in some environment, and that is capable of autonomous action in this environment in order to meet its design objectives/delegated goals Adds a second dimension to agent definition: the relationship between agent and designer/user Agent is capable of independent action Agent action is purposeful Autonomy is a central, distinguishing property of agents 23

Autonomous Agent Properties autonomous the agent is self goal-directed and acts without requiring user initiation and guidance; it can choose its own goal and the way to achieve it; its behavior is determined by its experience; we have no direct control over it reactive the agent maintains an ongoing interaction with its environment, and responds to changes that occur in it proactive the agent generates and attempts to achieve goals; it is not driven solely by events but takes the initiative 24

Autonomous Agent Properties sociable the agent interacts with other agents (and possibly humans) via cooperation, coordination, and negotiation; it is aware and able to reason about other agents and how they can help it achieve its own goals coordination is managing the interdependencies between actions of multiple agents (not necessarily cooperative) cooperation is working together as a team to achieve a shared goal negotiation is the ability to reach agreements on matters of common interest Systems of the future will need to be good at teamwork 25

Agents vs. Objects An agent has unpredictable behaviour as observed from the outside unless its simple reflexive agent An agent is situated in the environment Agent communication model is asynchronous Objects do it for free; agents do it because they want to 26

Agents vs. Expert Systems Expert systems are disembodied from the environment Expert systems are not capable of reactive and proactive behaviour Expert systems are not equipped with the social ability 27

Types of Agent Systems single-agent multi-agent cooperative competitive single shared utility multiple different utilities 28

Micro vs. Macro MAS Engineering 1. The agent design problem (micro perspective): How should agents act to carry out their tasks? 2. The society design problem (macro perspective): How should agents interact to carry out their tasks? 29

Typology of Interaction 30

Introduction to Multiagent Systems Specifying Agents 31

Agent Behavior Agent s behavior is described by the agent function that maps percept sequences to actions The agent program runs on a physical architecture to produce f Key questions: What is the right function? Can it be implemented in a small agent program?

Example: Vacuum Cleaner World Percepts: location and contents, e.g. [A, Dirty] Actions: Left, Right, Suck, NoOp 33

Vacuum Cleaner Agent Percept sequence [A,Clean] [A, Dirty] [B,Clean] [B, Dirty] [A,Clean], [A,Clean] [A,Clean], [A, Dirty] [A,Clean], [A,Clean], [A,Clean] [A,Clean], [A,Clean], [A, Dirty] Action Right Suck Left Suck Right Suck Right Suck 34

Rational Behavior Definition (Russell & Norvig) Rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date and whatever bulit-in knowledge the agent has. Rationality is relative and depends on four aspects: 1. performance measure which defines the degree of success 2. percept sequence (complete perceptual history) 3. agent s knowledge about the environment 4. actions available to the agent Rational omniscient, rational clairvoyant => rational successful 35

Specifying Task Environments To design a rational agent, we must specify the task environment (PEAS) 1. Performance measure 2. Environment 3. Actuators 4. Sensors Task environments define problems to which rational agents are the solutions 36

PEAS Examples 37

Properties of Environments Fully observable vs. partially observable can agents obtain complete and correct information about the state of the world? Deterministic vs. stochastic Do actions have guaranteed and uniquely defined effects? Episodic vs. sequential Can agents decisions be made for different, independent episodes? Static vs. dynamic Does the environment change by processes beyond agent control? Discrete vs. continuous Is the number of actions and percepts fixed and finite? Single-agent vs. multi-agent Does the behavior of one agent depends on the behavior of other agents? 38

Example Environments Solitaire Backgammon Internet shopping Observable No Yes No No Deterministic Yes No Partly No Episodic No No No No Static Yes Semi Semi No Discrete Yes Yes Yes No Single-agent Yes No Yes (except auctions) Taxi No 39

Introduction to Agents Agent Architectures 40

Implementing the Agent How should one implement the agent function? So that the resulting behavior is (near) rational. So that its calculation is computationally tractable. Agent Sensors? Percepts Environment Actuators Actions

Hierarchy of Agents The key challenge for AI is to find out how to write programs that produce rational behavior from a small amount of code rather than from a large number of table entries. Four basic types of agent in the order of increasing capability: 1. simple reflex agents 2. model-based agents with state 3. goal-based agents 4. utility-based agents 42

Simple Reflex Agents Simple reflex agent chooses the next action on the basis of the current percept Condition-action rules provide a way to present common regularities appearing in input/output associations Ex.: if car-in-front-is-braking then initializebraking 43

Adding State / Model Reflex agents are simple but of limited intelligence Only work if environment is fully observable and the decision can be made based solely on the current percept If not the case => suboptimal action choices, infinite loops => It can be advantageous to store information about the world in the agent 44

Model-based Reflex Agent Keeps track of the world by extracting relevant information from percepts and storing it in its memory models: (1) how the world evolves, (2) how agent s actions affect the world 45

Telling the Agent What to Do Previous types of agents have the behavior hard-coded in their rules there is no way to tell them what to do Fundamental aspect of autonomy: we want to tell agent what to do but not how to do it! We can specify: action to perform not interesting (set of) goal state(s) to be reached goal-based agents a performance measure to be maximized utility-based agents 46

Goal-based Agents Problem: goals are not necessarily achievable by a single action: search and planning are subfields of AI devoted to finding actions sequences that achieve the agent s goals 47

Towards Utility-based Agents Goals only a very crude (binary) distinction between happy and unhappy states We introduce the concept of utility: utility is a function that maps a state onto a real number; it captures quality of a state if an agent prefers one world state to another state then the former state has higher utility for the agent. Utility can be used for: 1. choosing the best plan 2. resolving conflicts among goals 3. estimating the successfulness of an agent if the outcomes of actions are uncertain 48

Utility-based Agents Utility-based agent use the utility function to choose the most desirable action/course of actions to take 49

Summary Multiagent systems approach ever more important in the increasingly interconnected world where systems are required to cooperate flexibly socially-inspired computing Intelligent agent is autonomous, proactive, reactive and sociable Agents can be cooperative or competitive (or combination thereof) There are different agent architectures with different capabilities and complexity Related reading: Russel and Norvig: Artificial Intelligence: A Modern Approach Chapter 2 Wooldrige: An Introduction to Multiagent Systems Chapters 1 and 2 50