Laboratorio di Intelligenza Artificiale e Robotica

Similar documents
Laboratorio di Intelligenza Artificiale e Robotica

TD(λ) and Q-Learning Based Ludo Players

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Reinforcement Learning by Comparing Immediate Reward

Axiom 2013 Team Description Paper

Knowledge-Based - Systems

Artificial Neural Networks written examination

Lecture 1: Basic Concepts of Machine Learning

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Lecture 10: Reinforcement Learning

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Lecture 1: Machine Learning Basics

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

An OO Framework for building Intelligence and Learning properties in Software Agents

Robot Shaping: Developing Autonomous Agents through Learning*

Seminar - Organic Computing

Cooperative evolutive concept learning: an empirical study

(Sub)Gradient Descent

Python Machine Learning

Ordered Incremental Training with Genetic Algorithms

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

High-level Reinforcement Learning in Strategy Games

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL

CSL465/603 - Machine Learning

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

XXII BrainStorming Day

Learning Methods for Fuzzy Systems

CS Machine Learning

Guru: A Computer Tutor that Models Expert Human Tutors

The dilemma of Saussurean communication

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Learning and Transferring Relational Instance-Based Policies

CS 446: Machine Learning

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

AMULTIAGENT system [1] can be defined as a group of

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Improving Fairness in Memory Scheduling

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

A SURVEY OF FUZZY COGNITIVE MAP LEARNING METHODS

Top US Tech Talent for the Top China Tech Company

MYCIN. The MYCIN Task

AQUA: An Ontology-Driven Question Answering System

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

A Genetic Irrational Belief System

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Comprehensive Program Review (CPR)

DOCTOR OF PHILOSOPHY HANDBOOK

Education for an Information Age

Evolution of Symbolisation in Chimpanzees and Neural Nets

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Task Completion Transfer Learning for Reward Inference

Computerized Adaptive Psychological Testing A Personalisation Perspective

Using Genetic Algorithms and Decision Trees for a posteriori Analysis and Evaluation of Tutoring Practices based on Student Failure Models

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

STANDARDS AND RUBRICS FOR SCHOOL IMPROVEMENT 2005 REVISED EDITION

Multimedia Courseware of Road Safety Education for Secondary School Students

Welcome to. ECML/PKDD 2004 Community meeting

Unit purpose and aim. Level: 3 Sub-level: Unit 315 Credit value: 6 Guided learning hours: 50

Probabilistic Latent Semantic Analysis

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

InTraServ. Dissemination Plan INFORMATION SOCIETY TECHNOLOGIES (IST) PROGRAMME. Intelligent Training Service for Management Training in SMEs

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

A Study of Pre-Algebra Learning in the Context of a Computer Game-Making Course

Task Completion Transfer Learning for Reward Inference

Australian Journal of Basic and Applied Sciences

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

EVOLVING POLICIES TO SOLVE THE RUBIK S CUBE: EXPERIMENTS WITH IDEAL AND APPROXIMATE PERFORMANCE FUNCTIONS

FF+FPG: Guiding a Policy-Gradient Planner

DIANA: A computer-supported heterogeneous grouping system for teachers to conduct successful small learning groups

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

Evolutive Neural Net Fuzzy Filtering: Basic Description

Emergency Management Games and Test Case Utility:

The Enterprise Knowledge Portal: The Concept

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

SARDNET: A Self-Organizing Feature Map for Sequences

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY

Interaction Design Considerations for an Aircraft Carrier Deck Agent-based Simulation

Reducing Features to Improve Bug Prediction

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

SELECCIÓN DE CURSOS CAMPUS CIUDAD DE MÉXICO. Instructions for Course Selection

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Speeding Up Reinforcement Learning with Behavior Transfer

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

A Reinforcement Learning Variant for Control Scheduling

Using focal point learning to improve human machine tacit coordination

A simulated annealing and hill-climbing algorithm for the traveling tournament problem

Examity - Adding Examity to your Moodle Course

Medical Complexity: A Pragmatic Theory

Transcription:

Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009

Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning Applications Computational Intelligence and Games Machine Learning for Embedded Systems Design

What is Machine Learning?

What is Machine Learning? 4 The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with experience. Tom Mitchell (1997) A program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E. A well-defined learning task is defined by P, T, and E.

Example: checkers 5 Task T: playing checkers Artificial Intelligence Design and implement a computer-based system that exhibit intelligent action Machine Learning Write a program that can learn how to play It can learn from examples of previous games, by playing against another opponent, by playing against itself

Unsupervised Learning

7

Supervised Learning

What is an apple? 9

What is an apple? 10

Reinforcement Learning

RL framework 12 reward...delay state Agent action Environment

Optimal value function 13 The problem becomes learning how to maximize the amount of reward collected (payoff) Usually RL algorithms try to learn the action-value function Q(s,a) and derive from it the optimal policy

Genetic Algorithms

The population 15

Ranking by fitness 16

Mate Selection 17 Selection probability is proportional to fitness

Crossover 18 Exploit goodness of parents

Mutation 19 Explore unknown

Best solution 20

The GA recipe 21

Genetic Algorithms + Machine Learning = Genetic Based Machine Learning

Examples of GBML 23 Neuroevolution Classifiers structure and parameters optimization with GAs Feature selection with GAs Rule-based Evolutionary Systems GA-based approach for training classifiers

GBML projects 24 Goal Implementation of a GBML approach, or extension of an available implementation, or comparison of different approaches Experimental analysis Technical report Skills Basic Knowledge of EC and ML Good analysis capabilities Good programming skills (C++) Students: max 2

Computational Intelligence and Games

Why games? 26 Modern computer games are ideal benchmarks for computational intelligence techniques Inexpensive Challenging Computational intelligence could be useful for modern games Speedup AI design Improvement of AI Adaptive AI Customization of game content Neverwinter Nights Civilization II Unreal Tournament Urban Combat

Issues in CIG 27 CIG is an attractive field but involves several technical issues: Interface Experiments design Simulation speed Customization Documentation Typical benchmarks are specifically designed for doing research Modern computer games are very complex and with different goals

Projects available on TORCS 28 TORCS is an open source car racing simulator Well suited for CIG research Software API available Examples available Competitions Available projects Automated generation of tracks Visual input for TORCS C++ Learning API Learn your bot!

SmarTrack 29 Goal Generation, evaluation and classification of tracks for TORCS Skills Good programming skills (C++) Good analysis capabilities Basic knowledge of EC and/or ML Students: max 3 Reference http://cig.dei.polimi.it http://torcs.sourceforge.net/

EyeBot 30 Goal Develop a TORCS controller based on visual inputs Skills Some knowledge/experience with computer vision is preferred Good programming skills (C++) Students: max 2 Reference http://cig.dei.polimi.it http://torcs.sourceforge.net/

C++ Learning API 31 Goal Developing a learning framework for TORCS (partially as porting of an existing JAVA API) Skills Basic knowledge of EC and/or ML Good programming skills (C++) Students: max 2 Reference http://cig.dei.polimi.it http://torcs.sourceforge.net/

Learn your bot! 32 Goal Apply any evolutionary computation or machine learning techniques to develop a controller for TORCS Provide the software, the controller and a presentation Skills Knowledge of EC and/or ML Good analysis capabilities Basic programming skills (C++) Students: max 1 Reference http://cig.dei.polimi.it http://torcs.sourceforge.net/

Racing Games for CIG research 33 Goal Analysis of racing games for research purposes Comparison of different games Prototype and tutorial for a specific game Skills Good programming skill Good analysis capabilities Reference http://www.rfactor.net/ http://vdrift.net/