Advanced Artificial Intelligence CS 687. Jana Kosecka, 4444 Research II

Similar documents
Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Intelligent Agents. Chapter 2. Chapter 2 1

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Lecture 10: Reinforcement Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Lecture 1: Machine Learning Basics

Laboratorio di Intelligenza Artificiale e Robotica

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Lecture 1: Basic Concepts of Machine Learning

Knowledge-Based - Systems

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Evolutive Neural Net Fuzzy Filtering: Basic Description

Python Machine Learning

Learning Methods for Fuzzy Systems

CSL465/603 - Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Seminar - Organic Computing

MYCIN. The MYCIN Task

EGRHS Course Fair. Science & Math AP & IB Courses

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

MAE Flight Simulation for Aircraft Safety

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Axiom 2013 Team Description Paper

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Firms and Markets Saturdays Summer I 2014

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

A Bayesian Model of Imitation in Infants and Robots

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Reinforcement Learning by Comparing Immediate Reward

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Date : Controller of Examinations Principal Wednesday Saturday Wednesday

Human Emotion Recognition From Speech

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Robot Shaping: Developing Autonomous Agents through Learning*

Learning Prospective Robot Behavior

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

(Sub)Gradient Descent

MGT/MGP/MGB 261: Investment Analysis

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Methodology

Integrating E-learning Environments with Computational Intelligence Assessment Agents

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Research Area

Software Maintenance

Artificial Neural Networks

OFFICE SUPPORT SPECIALIST Technical Diploma

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Action Models and their Induction

Rule-based Expert Systems

CS Machine Learning

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

B.S/M.A in Mathematics

Speeding Up Reinforcement Learning with Behavior Transfer

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Learning Methods in Multilingual Speech Recognition

Lecture 6: Applications

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Time series prediction

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

The Value of Visualization

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Self Study Report Computer Science

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

K5 Math Practice. Free Pilot Proposal Jan -Jun Boost Confidence Increase Scores Get Ahead. Studypad, Inc.

What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data

Rule Learning With Negation: Issues Regarding Effectiveness

Welcome to. ECML/PKDD 2004 Community meeting

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Speech Recognition at ICSI: Broadcast News and beyond

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

TOKEN-BASED APPROACH FOR SCALABLE TEAM COORDINATION. by Yang Xu PhD of Information Sciences

Computational Data Analysis Techniques In Economics And Finance

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Universidade do Minho Escola de Engenharia

TD(λ) and Q-Learning Based Ludo Players

A Reinforcement Learning Variant for Control Scheduling

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

A Case-Based Approach To Imitation Learning in Robotic Agents

Mathematics subject curriculum

Agent-Based Software Engineering

Natural Language Processing. George Konidaris

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Speech Emotion Recognition Using Support Vector Machine

Transcription:

Advanced Artificial Intelligence CS 687 Jana Kosecka, 4444 Research II kosecka@gmu.edu, 3-1876

Logistics Grading: Homeworks 35% Midterm: 35% Final project: 30% Prerequisites: basic statistical concepts, geometry, linear algebra, calculus, CS 580 Course web page cs.gmu.edu/~kosecka/cs687/ Course newsgroup Homeworks every 2 weeks, Midterm, Final Project Choose among offered projects/propose your own Detailed project proposal due mid March Write a report and prepare the final presentation Late policy: budget of 3 late days

Required Text S. Russell and P. Norvig: Artificial Intelligence: A Modern Approach (at least second edition) R. Sutton and A. G. Barto: Introduction to Reinforcement Learning (on-line materials see course www) Course goal gain breadth in AI Required Software MATLAB, for homework Student version of MATLAB available in bookstore Octave http://www.gnu.org/software/octave/ For project language of your choice Project apply techniques studied in the class to the problem of your choice, investigate some details of covered algorithms

Relation to other courses CS 685 Intelligent Robotic Systems CS 682 Computer Vision CS 688 Pattern Recognition CS 780 Data Mining CS 782 Machine Learning CS 659 Theory and Applications of Data Mining SYS/STAT 664 Bayesian Inference and Decision Theory Breath course can be followed More in depth coverage of Probabilistic Graphical Models, Reinforcement Learning, Natural Language Processing, Planning continuation of CS580

Today s outline History of AI AI approaches AI applications to intelligent agent design, robotics, computer vision, game playing, medical diagnosis Outline of course topics Advanced AI in 10 slides Part I Supervised Learning Regression and Classification problems

Intelligent Agents Agents humans, robots, termostats, web applications Agent programs map percept histories to actions We focus on the design of rational agents, which will try to maximize the expected value of the performance measure given the percepts up to now Performance measure, environment, actuators, sensors Automated taxi Internet Shopping agent Environment types Observable, deterministic, episodic, static, discrete What are the environment types for different agents? Environment type determines the type of agent

Applications of AI Intelligent Agents AI agents in finance AI agents in games AI agents in medicine AI agent on www AI in robotics All these can be learning agents

Intelligent Agents Agent types Simple reflex agent Reflex agent with state Goal-based agent Utility based agent All these can be learning agents

Reflex agent

Reflex agent with state

Goal-oriented agent

Utility-based agent

Robotics and AI Knowledge representation - how to represent objects, humans, environments - symbol grounding problem Computer Vision - study of perception - recognition, vision and motion, segmentation and grouping representation Natural Language Processing - provides better interfaces, symbol grounding problem Planning and Decision Making How to make optimal decision, actions give the current knowledge of the state, currently available actions Flakey robot video

Robotic Navigation Stanford Stanley Grand Challenge Outdoors unstructured env., single vehicle Urban Challenge Outdoors structured env., mixed traffic, traffic rules

Sensors Robot Components (Stanley) Actuators-Effectors Locomotion System Computer system Architectures (the brain) Lasers, camera, radar, GPS, compass, antenna, IMU, Steer by wire system Rack of PC s with Ethernet for processing information from sensors

Stanley Software System

Terrain mapping using lasers Determining obstacle course

Example 6: Classification

Rhino First Museum Tour giving robot University of Bonn ( 96)

Computer Vision and AI

Computer Vision Visual Sensing Images I(x,y) brightness patterns - image apperance depends on structure of the scene - material and reflectance properties of the objects - position and strength of light sources

Recovery of the properties of the environment from single or multiple views Vision problems Semantic Segmentation Recognition Reconstruction Vision Based Control - Action Visual Cues Stereo, motion, shading, texture, contour, brightness

Segmentation partition image into separate objects Clustering and search algorithms in the space of visual cues Supervised and unsupervised learning strategies Object and Scene recognition/categorization

So what does object recognition involve?

Object categorization mountain tree banner building street lamp people vendor

Consumer application: iphoto 2009 http://www.apple.com/ilife/iphoto/

Consumer application: iphoto 2009 Can be trained to recognize pets! http://www.maclife.com/article/news/iphotos_faces_recognizes_cats

Consumer application: iphoto 2009 Things iphoto thinks are faces

The Brain (analogy) 100 Billion neurons On average, connected to 1 K others Neurons are slow. Firing rates < 100 Hz. Can be classified into Sensory vision, somatic, audition, chemical Motor locomotion, manipulation, speech Central reasoning and problem solving

Trends in biological and machine evolution Hans Moravec: Robot 1 neuron = 1000 instructions/sec 1 synapse = 1 byte of information Human brain then processes 10^14 IPS and has 10^14 bytes of storage In 2000, we have 10^9 IPS and 10^9 bytes on a desktop machine In 25 years, assuming Moore s law we obtain human level computing power

Various AI areas and projects Coctail party demo Alvinn demo Boids Medical Diagnosis Face detection Machine learning Robotics Agent design Computer Vision http://www.research.ibm.com/deepblue/ Games Intelligent rooms Human-Computer Interaction Shopping assistant Web applications AI current and past projects ICA algoritm in Matlab [W, s,v] = svd((repmat(sum(x.*x,1), size(x,1),1).*x)*x );

Overview of the topics Supervised learning Representation of uncertainty Bayesian Networks Inference and Learning in Bayesian networks Hidden Markov Models Bayes filters, Kalman filters Visual Perception Robot Perception and Control Reinforcement learning With applications to intelligent agent design, robotics, computer vision, game playing, medical diagnosis

Supervised/Unsupervised learning Design of agents which learn from observations and improve performance on future tasks Regression and classification problems Regression - E.g. prediction of house prices Classification disease/no disease Artificial neural networks Unsupervised learning Finding structure in the available data

Representation of uncertainty Needs of agents to handle uncertainty due to nondeterminism or partial observability How to represent uncertain knowledge Basis of probabilistic reasoning E.g. Bayes rule

Bayes nets - Probabilistic Graphical Models Graphical models offer several useful properties: 1. Models are descriptions of how parts of the world work 2. May not account for every variable 3. May not account for every interaction 4. Enable us to reason about unknown variables given some evidence - explanation (diagnostic reasoning) - prediction (causal reasoning)

Probabilistic Graphical Models Graphical models offer several useful properties: 1. They provide a simple way to visualize the structure of a probabilistic model and can be used to design and motivate new models. 2. Insights into the properties of the model, including conditional independence properties, can be obtained by inspection of the graph. 3. Complex computations, required to perform inference and learning in sophisticated models, can be expressed in terms of graphical manipulations, in which underlying mathematical expressions are carried along implicitly.

Example X 4 X 2 X 6 X 1 X 3 X 5 Joint Probability: p(x 1, x 2 x 3, x 4,x 5,x 6 ) = p(x 1 ) p(x 2 x 1 ) p(x 3 x 1 ) p(x 4 x 2 ) p(x 5 x 3 ) p(x 6 x 2, x 5 )

x 2 0 1 x 1 0 1 Example X 4 X 2 x 4 0 1 x 2 0 1 X 6 1 0 0 x 1 0 1 X 1 X 3 X 5 1 0 1 x 1 0 1 0 0 x 3 x 1 5 1 x 3 0 1

Applications Implementations in real life : It is used in the Microsoft products (Microsoft Office) Medical applications and Biostatistics (BUGS) In NASA Autoclass project for data analysis Collaborative filtering (Microsoft MSBN) Fraud Detection (ATT) Speech recognition (UC, Berkeley ) 44

Bayesian Networks Graphical models, efficient representation of joint probability distribution Credit card companies - Fradulent transaction detection

Probabilistic Reasoning in Time Tracking Robotic localization Propagating beliefs Includes models of dynamics of the worlds Hidden Markov Model Natural Language Processing, Speech Analysis

Markov Localization 1. Start No knowledge at start, thus we have an uniform probability distribution. 2. Robot perceives first pillar Seeing only one pillar, the probability being at pillar 1, 2 or 3 is equal. 3. Robot moves Action model enables to estimate the new probability distribution based on the previous one and the motion. 4. Robot perceives second pillar Base on all prior knowledge the probability being at pillar 2 Becomes dominant

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

Probabilistic models More general Bayesian Programs Bayesian Networks Courtesy of Julien Diard S: State O: Observation A: Action S t S t-1 DBNs S t S t-1 A t Markov Chains Bayesian Filters Markov Loc MDPs Particle Filters discrete HMMs semi-cont. HMMs continuous HMMs MCML POMDPs More specific S t S t-1 O t Kalman Filters S t S t-1 O t A t R. Siegwart, I. Nourbakh

Reinforcement Learning How to improve performance over time from our own/ systems experience Goal directed learning from interaction How to map situations to action to maximize reward http://www.youtube.com/user/stanfordhelicopter state(t) Agent reward(t+1) action(t) Environment state(t+1)

Blackboard Notes Supervised Learning