Bootstrap Learning for Visual Perception on Mobile Robots

Bootstrap Learning for Visual Perception on Mobile Robots
ICRA-11 Workshop
Mohan Sridharan
Stochastic Estimation and Autonomous Robotics (SEAR) Lab, Department of Computer Science, Texas Tech University
May 9, 2011

Collaborators
Mohan Sridharan, Texas Tech University.
Xiang Li, Shiqi Zhang, Mamatha Aerolla (graduate students), Texas Tech University.
Peter Stone, The University of Texas at Austin.
Ian Fasel, The University of Arizona.
Jeremy Wyatt, Richard Dearden, University of Birmingham (UK).

Desiderata + Challenges
Focus: integrated systems, visual inputs.
Desiderata:
  Real-world robot systems require high reliability.
  Dynamic response requires real-time operation.
  Learn from limited feedback and operate autonomously.
Challenges:
  Partial observability: varying levels of uncertainty.
  Constrained processing: large amounts of raw data.
  Limited human attention: consider high-level feedback.

Research Thrusts
Learn models of the world and revise learned models over time (bootstrap learning).
Tailor learning and processing to the task at hand (probabilistic planning).
Enable human-robot interaction with high-level input (human-robot interaction).

Robot Platforms and Generalization
Evaluation on robot platforms and in simulated domains.
Social engagement in elderly care homes.

Talk Outline
Unsupervised learning of object models: local, global and temporal visual cues to learn probabilistic layered object models.
Hierarchical planning for visual learning and collaboration: constrained convolutional policies and belief propagation in hierarchical POMDPs.

Learning object models autonomously: novel objects can be introduced; existing objects can move.
Observations:
  Moving objects are interesting!
  Objects have considerable structure.
Approach:
  Analyze image regions corresponding to moving objects.
  Extract visual features to learn probabilistic object models.
  Revise models over time to account for changes.

Tracking Gradient Features
Track and cluster gradient features based on velocity.
Model the spatial coherence of gradient features.

Learning Color Features
Use a perceptually motivated color space.
Learn color distribution statistics.
Learn second-order distribution statistics:
  $JS(a,b) = \frac{1}{2}\{KL(a,m) + KL(b,m)\}$, $KL(a,m) = \sum_i a_i \ln\frac{a_i}{m_i}$, $m = \frac{1}{2}(a+b)$
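The Jensen-Shannon measure on the color-features slide can be sketched in a few lines of Python. This is only an illustration of the formula as written; the histogram values and function names are hypothetical and do not come from the talk.

```python
import math

def kl_divergence(a, m):
    """KL(a, m) = sum_i a_i * ln(a_i / m_i); bins with a_i == 0 contribute 0."""
    return sum(ai * math.log(ai / mi) for ai, mi in zip(a, m) if ai > 0)

def js_divergence(a, b):
    """JS(a, b) = 0.5 * (KL(a, m) + KL(b, m)) with m = 0.5 * (a + b)."""
    m = [0.5 * (ai + bi) for ai, bi in zip(a, b)]
    return 0.5 * (kl_divergence(a, m) + kl_divergence(b, m))

# Hypothetical normalized color histograms (illustrative values only).
hist_a = [0.5, 0.3, 0.2]
hist_b = [0.1, 0.4, 0.5]
print(js_divergence(hist_a, hist_b))
```

Unlike KL, this measure is symmetric in its arguments and bounded above by ln 2, which makes it convenient for comparing a test histogram against a learned one.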

Parts-based Models
Graph-based segmentation of input images.
Gaussian models for individual parts.
Gamma distribution for inter-part dissimilarity and intra-part similarity.

Layered Object Model
Model overview (figure).

Layered Object Model
Bayesian belief propagation (figure).

Recognition
Handles stationary and moving objects: motion is required only to learn object models.
Extract features and compare with learned models.
Find region of relevance based on gradient features.

Recognition - Gradients
Find probabilistic match using spatial similarity measure.
  $SSM(scv_i, scv_{test}) = \frac{N^{i,test}_{x,correct} + N^{i,test}_{y,correct}}{2(N-1)}$, $SSM \in [0, 1]$

Recognition - Color Distributions (figure).

Recognition - Parts-based Models
Dynamic programming to match learned models over the relevant region.
Similarity within a part, dissimilarity between parts.
  $p_{i,arr_j} = f(sim)\, f(diff)$, $p_{i,arr} = \sum_j w_j^{l_i}\, p_{i,arr_j}$

Recognition - Overall
Combine evidence from individual visual features.
Bayesian update for belief propagation.
Recognize objects or identify novel objects.
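A minimal sketch of how evidence from several visual features could be folded into a belief with repeated Bayesian updates. It assumes the features are conditionally independent given the object class, which the slides do not state; the class names, likelihood values, and the function `bayes_update` are hypothetical.

```python
def bayes_update(belief, likelihood):
    """Return the normalized posterior, proportional to belief[c] * likelihood[c]."""
    unnormalized = {c: belief[c] * likelihood[c] for c in belief}
    total = sum(unnormalized.values())
    if total == 0:
        raise ValueError("evidence has zero probability under every class")
    return {c: p / total for c, p in unnormalized.items()}

# Uniform prior over three hypothetical object classes.
belief = {"box": 1 / 3, "human": 1 / 3, "robot": 1 / 3}

# Hypothetical per-feature match likelihoods (assumed, not from the talk).
feature_likelihoods = [
    {"box": 0.8, "human": 0.1, "robot": 0.1},    # gradient features
    {"box": 0.6, "human": 0.2, "robot": 0.2},    # color distributions
    {"box": 0.5, "human": 0.25, "robot": 0.25},  # parts-based model
]

for likelihood in feature_likelihoods:
    belief = bayes_update(belief, likelihood)

print(belief)
```

Because each update renormalizes, the order in which the features are applied does not change the final belief under the independence assumption.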

Good classification and recognition performance.

p(o|A)   Box     Human   Robot   Car     Other
Box      0.913   0.013   0.02    0       0.054
Human    0.027   0.74    0.007   0.013   0.213
Robot    0.033   0.007   0.893   0       0.067
Car      0       0.02    0       0.833   0.147
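The per-class figures in the table above can be summarized with a short script. The sketch below only reads off the diagonal entries and checks that each row sums to one. The unweighted mean it prints equals overall accuracy only if the four classes were equally represented, which the slide does not state.

```python
classes = ["Box", "Human", "Robot", "Car"]
columns = classes + ["Other"]

# Rows copied from the table; each row is a distribution over the columns.
table = {
    "Box":   [0.913, 0.013, 0.02, 0.0, 0.054],
    "Human": [0.027, 0.74, 0.007, 0.013, 0.213],
    "Robot": [0.033, 0.007, 0.893, 0.0, 0.067],
    "Car":   [0.0, 0.02, 0.0, 0.833, 0.147],
}

# Diagonal entry of each row: probability mass assigned to the matching class.
diagonal = {c: table[c][columns.index(c)] for c in classes}
row_sums = {c: sum(table[c]) for c in classes}
mean_diagonal = sum(diagonal.values()) / len(diagonal)

print(diagonal)
print(mean_diagonal)
```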

Talk Outline
Unsupervised learning of object models: local, global and temporal visual cues to learn models.
Hierarchical planning for visual learning and collaboration: constrained convolutional policies and belief propagation in POMDPs.

Large amount of data, many processing algorithms.
Cannot learn all models comprising all possible features!
Sensing and processing can vary with task and environment:
  Where do I look?
  What do I look for?
  How to process the data?
Approach: tailor sensing and processing to the task.
Partially Observable Markov Decision Processes (POMDPs).

POMDP Overview
Tuple: $\langle S, A, Z, T, O, R \rangle$
Belief distribution $B_t$ over states.
Actions $A$.
Observations $Z$: action outcomes.
Transition function $T : S \times A \times S \to [0, 1]$
Observation function $O : S \times A \times Z \to [0, 1]$
Reward specification $R : S \times A \to \mathbb{R}$
Policy $\pi : B_t \to a_{t+1}$
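To make the role of the belief distribution concrete, here is a sketch of the standard POMDP belief update, b'(s') ∝ O(s', a, z) Σ_s T(s, a, s') b(s). The two-state "is the target in this region" model, its probabilities, and all names are hypothetical and are not taken from the talk.

```python
def belief_update(belief, action, observation, T, O):
    """Standard POMDP belief update after taking `action` and seeing `observation`.

    T[(s, a, s_next)] is the transition probability and
    O[(s_next, a, z)] is the probability of observing z in s_next after a.
    """
    states = list(belief)
    new_belief = {}
    for s_next in states:
        predicted = sum(T[(s, action, s_next)] * belief[s] for s in states)
        new_belief[s_next] = O[(s_next, action, observation)] * predicted
    total = sum(new_belief.values())
    if total == 0:
        raise ValueError("observation has zero probability under the model")
    return {s: p / total for s, p in new_belief.items()}

# Hypothetical model: looking at a region does not change where the target is.
states = ["present", "absent"]
T = {(s, "look", s2): 1.0 if s == s2 else 0.0 for s in states for s2 in states}
O = {
    ("present", "look", "detected"): 0.9,
    ("present", "look", "not_detected"): 0.1,
    ("absent", "look", "detected"): 0.2,
    ("absent", "look", "not_detected"): 0.8,
}

prior = {"present": 0.5, "absent": 0.5}
posterior = belief_update(prior, "look", "detected", T, O)
print(posterior)
```

A policy then maps the updated belief to the next action, so a sensing action like this one changes what the robot does next only through its effect on the belief.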

Challenges
State space increases exponentially.
Policy generation methods are exponential (worst-case) in the state space dimensions.
Model definition may not be known and may change.
Intractable for real-world applications!
Observations:
  Only a subset of scenes and inputs are relevant to any task.
  Visual sensing and processing can be organized hierarchically.

Hierarchical Visual Planning
Constrained convolutional policies.
Automatic belief propagation.

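HL Search: Convolutional Policies
Rotation and shift invariance of local visual search.
  $K(s) = (\pi^H \otimes C^K_m)(s) = \int \pi^H(\tilde{s})\, C^K_m(s - \tilde{s})\, d\tilde{s}$, $\bar{K} = \left(\sum_{a_i} K\right) / W$
  $\pi^H_C(s) = (\bar{K} \otimes C^E_m)(s) = \int \bar{K}(\tilde{s})\, C^E_m(s - \tilde{s})\, d\tilde{s}$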

Accurate and efficient visual search.
Reliable (93% vs 87%) and autonomous processing.

Multirobot Collaboration
Extension to multirobot collaboration (96% vs. 88%).

Talk Outline
Unsupervised learning of object models: local, global and temporal visual cues to learn models.
Hierarchical planning for visual learning and collaboration: constrained convolutional policies and belief propagation in POMDPs.

Robot autonomously acquires models for different object categories.
Detects and tracks objects in subsequent images with high (approximately 90%) accuracy.
Hierarchical planning enables a team of robots to share beliefs and collaborate robustly in dynamic domains.
Learning and hierarchical planning inform and guide each other, resulting in autonomous (and real-time) operation of mobile robots in complex environments.

Additional Challenges
Learn correlations between visual cues to learn better object models.
Assess quality of (information in) object models.
Infer lack of information and the presence of novel objects.
Reason with non-visual inputs by incorporating hierarchical decompositions that match corresponding cognitive requirements.

Recent Papers I
Xiang Li, Mohan Sridharan and Shiqi Zhang. Autonomous Learning of Vision-based Layered Object Models on Mobile Robots. To appear in the International Conference on Robotics and Automation (ICRA 2011), Shanghai, China, May 9-13, 2011.
Shiqi Zhang, Mohan Sridharan and Xiang Li. To Look or Not to Look: A Hierarchical Representation for Visual Planning on Mobile Robots. To appear in the International Conference on Robotics and Automation (ICRA 2011), Shanghai, China, May 9-13, 2011.

Recent Papers II
Xiang Li and Mohan Sridharan. Safe Navigation on a Mobile Robot using Local and Temporal Visual Cues. In the International Conference on Intelligent Autonomous Systems (IAS 2010), Ottawa, Canada, August 30-September 1, 2010.
Mohan Sridharan, Jeremy Wyatt and Richard Dearden. Planning to See: A Hierarchical Approach to Planning Visual Actions on a Robot using POMDPs. Artificial Intelligence Journal, Volume 174, Issue 11, pages 704-725, July 2010.
All papers available for download: www.cs.ttu.edu/~smohan/publications.html

We are done!
Questions? Comments?