Perception for robot autonomy. Perception for robot autonomy

Similar documents
Lecture 1: Machine Learning Basics

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

Lecture 10: Reinforcement Learning

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Introduction to Simulation

Probability and Statistics Curriculum Pacing Guide

Seminar - Organic Computing

Lecture 2: Quantifiers and Approximation

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

BMBF Project ROBUKOM: Robust Communication Networks

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Learning Methods in Multilingual Speech Recognition

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Reinforcement Learning by Comparing Immediate Reward

Software Maintenance

PM tutor. Estimate Activity Durations Part 2. Presented by Dipo Tepede, PMP, SSBB, MBA. Empowering Excellence. Powered by POeT Solvers Limited

Rule-based Expert Systems

An Introduction to Simio for Beginners

Rover Races Grades: 3-5 Prep Time: ~45 Minutes Lesson Time: ~105 minutes

A Case-Based Approach To Imitation Learning in Robotic Agents

Axiom 2013 Team Description Paper

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Acquiring Competence from Performance Data

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Discriminative Learning of Beam-Search Heuristics for Planning

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Module Title: Managing and Leading Change. Lesson 4 THE SIX SIGMA

Speech Recognition at ICSI: Broadcast News and beyond

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

Intelligent Agents. Chapter 2. Chapter 2 1

Radius STEM Readiness TM

Interaction Design Considerations for an Aircraft Carrier Deck Agent-based Simulation

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Chapter 2 Rule Learning in a Nutshell

SOFTWARE EVALUATION TOOL

The Good Judgment Project: A large scale test of different methods of combining expert predictions

LEGO MINDSTORMS Education EV3 Coding Activities

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

B. How to write a research paper

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Regret-based Reward Elicitation for Markov Decision Processes

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

Mathematics Success Grade 7

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

(Sub)Gradient Descent

The IDN Variant Issues Project: A Study of Issues Related to the Delegation of IDN Variant TLDs. 20 April 2011

Extracting Opinion Expressions and Their Polarities Exploration of Pipelines and Joint Models

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Abstractions and the Brain

Surprise-Based Learning for Autonomous Systems

SPRING GROVE AREA SCHOOL DISTRICT

M55205-Mastering Microsoft Project 2016

On the Combined Behavior of Autonomous Resource Management Agents

Commanding Officer Decision Superiority: The Role of Technology and the Decision Maker

MYCIN. The MYCIN Task

SARDNET: A Self-Organizing Feature Map for Sequences

Finding Your Friends and Following Them to Where You Are

Modeling user preferences and norms in context-aware systems

Beyond the Pipeline: Discrete Optimization in NLP

BADM 641 (sec. 7D1) (on-line) Decision Analysis August 16 October 6, 2017 CRN: 83777

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Lecture 1: Basic Concepts of Machine Learning

CS Machine Learning

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Evolutive Neural Net Fuzzy Filtering: Basic Description

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

CSC200: Lecture 4. Allan Borodin

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Device Independence and Extensibility in Gesture Recognition

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Generating Test Cases From Use Cases

A Data Fusion Model for Location Estimation in Construction

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Probability estimates in a scenario tree

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

An investigation of imitation learning algorithms for structured prediction

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Probability and Game Theory Course Syllabus

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

Given a directed graph G =(N A), where N is a set of m nodes and A. destination node, implying a direction for ow to follow. Arcs have limitations

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Planning with External Events

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

An Effective Framework for Fast Expert Mining in Collaboration Networks: A Group-Oriented and Cost-Based Method

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

Syllabus for CHEM 4660 Introduction to Computational Chemistry Spring 2010

Forget catastrophic forgetting: AI that learns after deployment

Rule Learning With Negation: Issues Regarding Effectiveness

Cross Language Information Retrieval

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

GACE Computer Science Assessment Test at a Glance

Learning Methods for Fuzzy Systems

The History of Language Teaching

Build on students informal understanding of sharing and proportionality to develop initial fraction concepts.

Visual CP Representation of Knowledge

Executive Guide to Simulation for Health

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

Transcription:

Perception for robot autonomy Martial Hebert Drew Bagnell Tony Stentz Perception for robot autonomy Sensor input Perception algorithms Interpretation Execution 1

Perception for robot autonomy From sensor features to plans and paths DARPA UPI From sensor features to semantic interpretation DARPA MARS One perspective on the perception+autonomy challenges From sensor features to plans and paths Learning and optimal control Generating multiple options Sequential processes From sensor features to semantic interpretation The tractability challenge Generating multiple hypothesis Incorporating external knowledge sources and data Limited computation Deciding when to generate an output Semi-supervised modeling 2

From sensor features to plans and paths: The pothole example 3

Local features: color, texture, + Context features How to map the features to control/decision policies? From sensor features to plans and paths Even if we know an exact control model for the system: Noisy data + not the right information Hard to model mapping from sensor data to actions and decisions Mapping depends on objective mission Learn mapping from data 4

Learning and optimal control Learning to optimally map features to plans and paths X (Sensor (Input) Data) Learning Y Y (Output) (Path to goal) 5

Optimal Control Solution Cost Map Learning 2-D Planner Y (Path to goal) Mode 1: Training example 6

Mode 1: Training example Mode 1: Learned behavior 7

Mode 1: Learned behavior Mode 1: Learned cost map 8

Mode 2: Training example Mode 2: Training example 9

Mode 2: Learned behavior Mode 2: Learned behavior 10

Mode 2: Learned cost map There is no single right mapping between sensor features and actions Mapping depends on subjective operator objectives Learning the mapping: Assume that behavior we wish to imitate can be recovered by a planning/optimal control algorithm Generalization to handle noisy, imperfect behavior Extension to strategic and multi-agent cases 11

Generating and evaluating multiple alternatives (semi-)optimal planning/decision given learned cost from sensor features may not be feasible In fact in general it is not possible Alternative: Evaluate a set of alternatives with respect to the learned costs 12

Research questions: Optimize content and order of list Find optimal evaluation strategy for: Relevance: Early items are highly likely to succeed Diversity: Enough variation across early items such that redundancy is minimized Online operation Sequential processes Decisions are temporally dependent Cannot learn from individual samples Correlated errors leading to compounding effect 13

The dangers of optimistic models Training data sees (almost) only good examples Optimistic modeling, overfitting, catastrophic failure [D. Pomerleau circa 1987] Example: Learning to drive Worst-case quadratic growth in likelihood of errors over time 14

Example Results explore the fundamental role of interaction between a teacher and a student learning to mimic a task Formal results show decrease in the error as function of the number of training iterations 15

Semantic interpretation Mapping directly from sensor data to plans and paths may not be sufficient Describing the environment in order to reason and make decisions 16

The tractability challenge tree building object road Input Joint Inference Output Off-line training: Model the distribution of labels vs. features Run-time inference: Find most likely labels given model 17

Multiple hypothesis generation Explaining decisions Explaining mistakes Perfect perception for robotics? Visual input Perception algorithms Interpretation Execution 18

Reason about multiple hypothesis Visual input Perception algorithms Execution Predict and refine Representing uncertainty in perception Visual input Perception algorithms Interpretation Application 19

Representing ambiguity Predicted P(Class 1) P(Class 2) 20

Incorporating external knowledge sources Bottom-up processing from sensor data is not sufficient Example DoT Univ. Transportation Center 21

Using prior (approximate) maps How to combine uncertain labels from the map with perception output? Does it improve accuracy? How to represent the uncertainty? Example: Uncertainty modeling P image label = x external information = x = 22

Limited computation Anytime processing Distributed processing Computation challenge Onboard computation resources may be insufficient for processing sensor data at high enough rate The situation may call for a (possibly partial) answer sooner than a full computation cycle Essential in robotics/autonomy applications Computational performance is a dominant concern Anytime processing Distributed processing 23

Distributed perception processing DARPA/ARL/SEI Graceful degradation based on availability of computationl nodes Automatic selection of onboard data and computation Fast switching One perspective on the perception+autonomy challenges From sensor features to plans and paths Learning and optimal control Generating multiple options Sequential processes From sensor features to semantic interpretation The tractability challenge Generating multiple hypothesis Incorporating external knowledge sources and data Limited computation Deciding when to generate an output Semi-supervised modeling 24