Genetic Algorithms in Robotics

Similar documents
Laboratorio di Intelligenza Artificiale e Robotica

Seminar - Organic Computing

Laboratorio di Intelligenza Artificiale e Robotica

Artificial Neural Networks written examination

SARDNET: A Self-Organizing Feature Map for Sequences

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

Knowledge-Based - Systems

The dilemma of Saussurean communication

Axiom 2013 Team Description Paper

Evolutive Neural Net Fuzzy Filtering: Basic Description

Reinforcement Learning by Comparing Immediate Reward

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Classification Using ANN: A Review

Evolution of Symbolisation in Chimpanzees and Neural Nets

Ordered Incremental Training with Genetic Algorithms

Cooperative evolutive concept learning: an empirical study

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Speeding Up Reinforcement Learning with Behavior Transfer

Implementation of Genetic Algorithm to Solve Travelling Salesman Problem with Time Window (TSP-TW) for Scheduling Tourist Destinations in Malang City

Lecture 1: Machine Learning Basics

Learning Methods for Fuzzy Systems

While you are waiting... socrative.com, room number SIMLANG2016

Lecture 10: Reinforcement Learning

(Sub)Gradient Descent

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

TD(λ) and Q-Learning Based Ludo Players

DIANA: A computer-supported heterogeneous grouping system for teachers to conduct successful small learning groups

Python Machine Learning

ABSTRACT. A major goal of human genetics is the discovery and validation of genetic polymorphisms

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Introduction to Simulation

A SURVEY OF FUZZY COGNITIVE MAP LEARNING METHODS

BIOS 104 Biology for Non-Science Majors Spring 2016 CRN Course Syllabus

How do adults reason about their opponent? Typologies of players in a turn-taking game

BMBF Project ROBUKOM: Robust Communication Networks

A Case-Based Approach To Imitation Learning in Robotic Agents

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Softprop: Softmax Neural Network Backpropagation Learning

Research Article Hybrid Multistarting GA-Tabu Search Method for the Placement of BtB Converters for Korean Metropolitan Ring Grid

Learning and Transferring Relational Instance-Based Policies

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Exploration. CS : Deep Reinforcement Learning Sergey Levine

2017 Florence, Italty Conference Abstract

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Radius STEM Readiness TM

CS Machine Learning

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

CSL465/603 - Machine Learning

INPE São José dos Campos

EVOLVING POLICIES TO SOLVE THE RUBIK S CUBE: EXPERIMENTS WITH IDEAL AND APPROXIMATE PERFORMANCE FUNCTIONS

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Generative models and adversarial training

arxiv: v1 [cs.lg] 15 Jun 2015

Robot Shaping: Developing Autonomous Agents through Learning*

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

Vodcasts and Captures: Using Multimedia to Improve Student Learning in Introductory Biology

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

AMULTIAGENT system [1] can be defined as a group of

Lecture 1: Basic Concepts of Machine Learning

A Genetic Irrational Belief System

Evolution in Paradise

Solving Combinatorial Optimization Problems Using Genetic Algorithms and Ant Colony Optimization

To link to this article: PLEASE SCROLL DOWN FOR ARTICLE

On the Combined Behavior of Autonomous Resource Management Agents

Rule Learning With Negation: Issues Regarding Effectiveness

FF+FPG: Guiding a Policy-Gradient Planner

Planning with External Events

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

STA 225: Introductory Statistics (CT)

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Artificial Neural Networks

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

I-COMPETERE: Using Applied Intelligence in search of competency gaps in software project managers.

The Strong Minimalist Thesis and Bounded Optimality

Biological Sciences (BS): Ecology, Evolution, & Conservation Biology (17BIOSCBS-17BIOSCEEC)

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Computational Data Analysis Techniques In Economics And Finance

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Discriminative Learning of Beam-Search Heuristics for Planning

Prerequisite: General Biology 107 (UE) and 107L (UE) with a grade of C- or better. Chemistry 118 (UE) and 118L (UE) or permission of instructor.

Test Effort Estimation Using Neural Network

Linking Task: Identifying authors and book titles in verbose queries

DEVELOPING AN INTERACTIVE METHOD TO MAP THE STUDENT PERSPECTIVES ON EVOLUTION

Using focal point learning to improve human machine tacit coordination

Learning to Schedule Straight-Line Code

Probability and Game Theory Course Syllabus

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

Multiagent Simulation of Learning Environments

Biological Sciences, BS and BA

Knowledge Transfer in Deep Convolutional Neural Nets

Department of Anatomy and Cell Biology Curriculum

By Laurence Capron and Will Mitchell, Boston, MA: Harvard Business Review Press, 2012.

Transcription:

Genetic Algorithms in Robotics Julius Mayer Universität Hamburg Fakultät für Mathematik, Informatik und Naturwissenschaften Technische Aspekte Multimodaler Systeme October 31, 2016 J. Mayer 1

Outline 1. Introduction Motivation Classification 2. Algorithm Overview Phases 3. Application Neuroevolution 4. Discussion Evaluation Conclusion J. Mayer 2

Introduction - Motivation Motivation Problems that are hard to solve with classical optimization because of too many parameters (intractability), large search space, non-differentiable objective functions, varying numbers of variables within the optimization process, a lacking mathematical function specification. [1] J. Mayer 3

Introduction - Classification Classification Biologic: Theory of evolution Adaptation of population to the environment Gradual, hereditary change in the individuals of a species Information storage and transfer through genomes Optimizing population fitness by reproduction [3] J. Mayer 4

Introduction - Classification Classification Biologic: Theory of evolution Adaptation of population to the environment Gradual, hereditary change in the individuals of a species Information storage and transfer through genomes Optimizing population fitness by reproduction [5] J. Mayer 5

Introduction - Classification Classification Algorithmic: Stochastic optimization Can solve NP-hard problems Solution approximation Computational complexity as prohibiting factor e.g. ant colony & particle swarm optimization [7] J. Mayer 6

Algorithm - Overview Algorithm 1. Initialization 2. Evaluation 3. Selection 4. Mutation 5. Terminate J. Mayer 7

Algorithm - Phases Phase 1. Initialization Initialize random population Choose appropriate representation Individuals as encoded problem solutions e.g. real values or binary strings 1,67 2,34 1,04 5,83 2,10 101010011011010010101 [12] J. Mayer 8

Algorithm - Phases Phase 2. Evaluation Choose objective function Decode & evaluate chromosome Find minima in objective function Survival of the fittest [9] J. Mayer 9

Algorithm - Phases Phase 3. Selection Probabilistic selection Roulette wheel selection Tournament Selection Balance exploration & exploitation dilemma J. Mayer 10

Algorithm - Phases Phase 4. Mutation Assign mutation rate Mutate Bitflips Normal distribution mutation Recombination Crossover Permutation Mutation drives change [12] J. Mayer 11

Algorithm - Phases Phase 4. Mutation Assign mutation rate Mutate Bitflips Normal distribution mutation Recombination Crossover Permutation Mutation drives change [12] J. Mayer 12

Algorithm - Phases Phase 5. Termination If solution is sufficiently close Performance is satisfying Change is stagnant J. Mayer 13

Application - Evolutionary robotics Path planing Multivariate parameter optimization Evolving artificial network architectures & optimizing connection weights (learning) [6] J. Mayer 14

Application - Neuroevolution Neuroevolution of augmenting topologies (NEAT) Uses crossover Uses speciation to protect structures Increment growth from minimal structures [10] J. Mayer 15

Application - Neuroevolution NEAT: Mutation Connection weights Network structure Add new connection (random weight) Add new node (split connection) [10] J. Mayer 16

Application - Neuroevolution NEAT: Crossover Genes are lined up according to innovation number Matching Genes are randomly chosen Disjoint / excess are taken from fitter parent [10] J. Mayer 17

Application - Neuroevolution NEAT for RoboCup Soccer Keepaway (robot soccer subtask) Large state space (unable to explore exhaustively) Only partial state information for each agent Continuous action space Multiple teammates need to learn simultaneously [4] J. Mayer 18

Application - Neuroevolution NEAT vs. Temporal Difference Methods [11] J. Mayer 19

Application - Neuroevolution NEAT vs. Temporal Difference Methods [11] J. Mayer 20

Discussion - Evaluation Evaluation Challenges Get GA s parameters right Solution encoding Exploration vs. exploitation Premature convergence (local minima) slow convergence big search space Find representative fitness function Drawbacks Approximation instead of exact solution Computational complexity as prohibiting factor No guaranteed convergence to global optimum Non-exhaustive coverage of the complete solution space J. Mayer 21

Discussion - Conclusion Conclusion Inspired by biological evolution Stochastic optimization Population containing encoded problem solutions (parameters) Different applications in robotics e.g. evolving ANN (NEAT) for RoboCup Soccer No guarantee for convergence to global optimum [2] J. Mayer 22

Discussion - Conclusion References [1] http://rednuht.org/genetic walkers/, 2016. [2] https://cdn.meme.am/instances/41036988.jpg. 2016. [3] https://i.ytimg.com/vi/3zjnyscv8to/maxresdefault.jpg, 2016. [4] https://i.ytimg.com/vi/hhln0tdglle/maxresdefault.jpg. 2016. [5] http://www.hanskottke.de/wordpress/wp-content/uploads/2012/09/evolutions hip hop.jpg. 2016. [6] http://www.orocos.org. 2016. [7] http://www.turingfinance.com/wp-content/uploads/2014/10/self-organizing-feature-map-3.png. 2016. [8] John H Holland. Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. U Michigan Press, 1975. [9] Melanie Mitchell. Complexity: A Guided Tour. In Oxford, volume 1, chapter Cellular A, pages 145 159. Oxford University Press, Inc, New York, 2009. J. Mayer 23

Discussion - Conclusion References (cont.) [10] Kenneth O Stanley and Risto Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary computation, 10(2):99 127, 2002. [11] Matthew E Taylor. Comparing Evolutionary and Temporal Difference Methods in a Reinforcement Learning Domain. pages 1321 1328. [12] Mattias Wahde. Biologically Inspired Optimization Methods: An introduction. WIT Press, Boston, MA, 2008. J. Mayer 24