CSCI598A: Robot Intelligence. Apr. 23, 2015

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Probabilistic Latent Semantic Analysis

Lecture 1: Machine Learning Basics

Generative models and adversarial training

Laboratorio di Intelligenza Artificiale e Robotica

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Lecture 1: Basic Concepts of Machine Learning

Speech Emotion Recognition Using Support Vector Machine

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS APPLIED MECHANICS MET 2025

Word Segmentation of Off-line Handwritten Documents

Laboratorio di Intelligenza Artificiale e Robotica

CSL465/603 - Machine Learning

WHEN THERE IS A mismatch between the acoustic

Action Recognition and Video

arxiv: v2 [cs.ro] 3 Mar 2017

Artificial Neural Networks written examination

Human Emotion Recognition From Speech

Python Machine Learning

Assignment 1: Predicting Amazon Review Ratings

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition by Indexing and Sequencing

Switchboard Language Model Improvement with Conversational Data from Gigaword

Australian Journal of Basic and Applied Sciences

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Interactive Whiteboard

Corpus Linguistics (L615)

Robot manipulations and development of spatial imagery

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

(Sub)Gradient Descent

Welcome to. ECML/PKDD 2004 Community meeting

Modeling function word errors in DNN-HMM based LVCSR systems

TD(λ) and Q-Learning Based Ludo Players

Axiom 2013 Team Description Paper

Corrective Feedback and Persistent Learning for Information Extraction

A Reinforcement Learning Variant for Control Scheduling

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS APPLIED STATICS MET 1040

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Automatic Speaker Recognition: Modelling, Feature Extraction and Effects of Clinical Environment

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Making the ELPS-TELPAS Connection Grades K 12 Overview

The NICT/ATR speech synthesis system for the Blizzard Challenge 2008

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics

Bug triage in open source systems: a review

A student diagnosing and evaluation system for laboratory-based academic exercises

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language

Seminar - Organic Computing

Affective Classification of Generic Audio Clips using Regression Models

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Uncertainty concepts, types, sources

Instruction: The Differences That Make A Difference. Mario Campanaro

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

Semi-Supervised Face Detection

Universidade do Minho Escola de Engenharia

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Quantitative Evaluation of an Intuitive Teaching Method for Industrial Robot Using a Force / Moment Direction Sensor

Activity Recognition from Accelerometer Data

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

Reinforcement Learning by Comparing Immediate Reward

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Go fishing! Responsibility judgments when cooperation breaks down

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

End-of-Module Assessment Task

Learning Methods for Fuzzy Systems

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

BODY LANGUAGE ANIMATION SYNTHESIS FROM PROSODY AN HONORS THESIS SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE OF STANFORD UNIVERSITY

A study of speaker adaptation for DNN-based speech synthesis

Lesson plan for Maze Game 1: Using vector representations to move through a maze Time for activity: homework for 20 minutes

Constructing Parallel Corpus from Movie Subtitles

A Vector Space Approach for Aspect-Based Sentiment Analysis

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

SACS Reaffirmation of Accreditation: Process and Reports

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

Completing the Pre-Assessment Activity for TSI Testing (designed by Maria Martinez- CARE Coordinator)

Probability and Statistics Curriculum Pacing Guide

Truth Inference in Crowdsourcing: Is the Problem Solved?

Modeling function word errors in DNN-HMM based LVCSR systems

Time series prediction

CS 446: Machine Learning

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

THE world surrounding us involves multiple modalities

arxiv: v1 [cs.cv] 2 Jun 2017

THE EFFECTS OF TEACHING THE 7 KEYS OF COMPREHENSION ON COMPREHENSION DEBRA HENGGELER. Submitted to. The Educational Leadership Faculty

Mathematics Success Grade 7

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Vision for Science Education A Framework for K-12 Science Education: Practices, Crosscutting Concepts, and Core Ideas

Why Did My Detector Do That?!

Transcription:

CSCI598A: Robot Intelligence Apr. 23, 2015

Reasoning Over Time Object recognition (static problem) We consider spatial relations with uncertainty We don t care about time Motion planning (dynamic problem) It is a dynamic problem with uncertainty Variable values change over time Locations, velocity, acceleration of joints Time must be modeled to estimate present status and probably predict future states High-Level Task Abstraction (static problem) We consider temporal relations of subtasks 2

Issues in Time Reasoning Temporal segmentation of streaming/times series data Alignment of time series data Reasoning high-level task abstraction

Temporal Segmentation A naïve, uniform segmentation

Temporal Segmentation A naïve, uniform segmentation Right: the standard deviation of the scores and its mean computed on a sliding window. The local minima of the standard deviation function are break points.

Temporal Segmentation A naïve, uniform segmentation After normalization: Blue dots are the break points computed that indicate the end of a segmentation and the beginning of a new one.

Temporal Segmentation Fuzzy segmentation Ground truth The uniform segmentation and many others assume break points and segments can be distinctly separated.

Temporal Segmentation Fuzzy segmentation Ground truth Frames Key Concept: Gradual Transition Write on Board Gradual Transition Answer Phone

Temporal Segmentation Fuzzy segmentation Ground truth Proposed approach Frames The fuzzy approach models each segment/event as a fuzzy set with fuzzy boundaries.

Alignment of Time Series Dynamic Time Warping (DTW)

Reasoning Task Abstraction Goal: reason about chronological order of subtasks

Reasoning Task Abstraction Goal: reason about chronological order of subtasks

Reasoning Task Abstraction

Reasoning Task Abstraction

Reasoning Task Abstraction

Reasoning Task Abstraction

Reasoning Task Abstraction Inference tasks:

Reasoning Task Abstraction Hidden Markov Model:

Reasoning Task Abstraction Hidden Conditional Random Field (HCRF): Learn a mapping from temporal data to a label Use latent variable to model underlying temporal structures y h 1 h 2 h 3 h 4 h 5 x 1 x 2 x 3 x 4 x 5 Frame 1 Frame 2 Frame 3 Frame 4 Frame 5

Reasoning Task Abstraction Example of HCRF y = tennis-serve h {Toss, Swing, Hit} Assuming identical h x HCRFs h: T T T S S S S S H H H H

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class A working definition of robot: Physical machine that generates intelligent connection between perception and action Robot intelligence: Robot intelligence includes recognizing patterns, comprehending ideas, plan, making decisions, and communicating

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Robot perception Face examples Classification Result Off-line training Classifier Feature Extraction Representation Non-face examples Search for faces at different resolutions and locations 25

Summary of the class Bag of word models 1. Feature detection 2. Feature description 3. Dictionary learning 4. Bag-of-features representation

Summary of the class 3D object recognition

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Learning from Demonstration Learning by watching: correspondence problem Learning by acting Gaussian mixture models and regressions

Summary of the class Learning from Demonstration Learning by watching: correspondence problem Learning by acting Gaussian mixture models and regressions Key issues in Learning from Demonstration Parameter learning: Expectation-Maximization Gaussian component estimation: Bayesian Information Criteria (BIC) Trajectory alignment: Dynamic Time Warping (DTW) Dimension reduction: Principal Component Analysis (PCA)

Summary of the class Reinforcement learning A learning approach that can adapt through interaction with the environment

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Learning from Data Supervised learning Unsupervised learning K-means

Summary of the class Time modeling Temporal segmentation Sequence alignment Reasoning time orders of subtasks

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Deep learning

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Summary of the class Amazon Picking Challenge using the Baxter robot (named Zuko, the firelord)

Summary of the class Definition of robot and its intelligence Robot perception (perception) Sensing technologies Object recognition from 2D and 3D Learning from demonstration (action) Reinforcement learning Data and time modeling (reasoning) Tutorial of ROS, PCL, and deep learning A focus on the Amazon Picking Challenge

Examples of LfD and RL Work from Dr. Aude Billard 40

Examples of LfD and RL Work from Dr. Aude Billard 41

Examples of LfD and RL Work from Dr. Aude Billard 42

Additional Examples Work from Dr. Aude Billard 43