What is Machine Learning? Computer Science 6100/4100: Machine Learning. Where Does This Fit in AI? Rational Behavior

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 10: Reinforcement Learning

Laboratorio di Intelligenza Artificiale e Robotica

Axiom 2013 Team Description Paper

Lecture 1: Machine Learning Basics

CSL465/603 - Machine Learning

Reinforcement Learning by Comparing Immediate Reward

Lecture 1: Basic Concepts of Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Python Machine Learning

A Reinforcement Learning Variant for Control Scheduling

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

(Sub)Gradient Descent

Speeding Up Reinforcement Learning with Behavior Transfer

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

TD(λ) and Q-Learning Based Ludo Players

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

MYCIN. The MYCIN Task

An OO Framework for building Intelligence and Learning properties in Software Agents

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Probabilistic Latent Semantic Analysis

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Action Models and their Induction

Firms and Markets Saturdays Summer I 2014

Artificial Neural Networks written examination

Using focal point learning to improve human machine tacit coordination

Seminar - Organic Computing

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Learning Methods for Fuzzy Systems

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Natural Language Processing. George Konidaris

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

MGT/MGP/MGB 261: Investment Analysis

CS Machine Learning

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

An investigation of imitation learning algorithms for structured prediction

Learning Prospective Robot Behavior

Word Segmentation of Off-line Handwritten Documents

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

Content-free collaborative learning modeling using data mining

AMULTIAGENT system [1] can be defined as a group of

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

SSE - Supervision of Electrical Systems

Rule Learning With Negation: Issues Regarding Effectiveness

Robot Shaping: Developing Autonomous Agents through Learning*

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Teachable Robots: Understanding Human Teaching Behavior to Build More Effective Robot Learners

Mathematics. Mathematics

The Strong Minimalist Thesis and Bounded Optimality

Word learning as Bayesian inference

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

How do adults reason about their opponent? Typologies of players in a turn-taking game

Learning and Transferring Relational Instance-Based Policies

Lecture 6: Applications

Human Emotion Recognition From Speech

Self Study Report Computer Science

Intelligent Agents. Chapter 2. Chapter 2 1

Applications of data mining algorithms to analysis of medical data

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Speech Recognition at ICSI: Broadcast News and beyond

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

arxiv: v1 [cs.cv] 10 May 2017

Rule Learning with Negation: Issues Regarding Effectiveness

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

NATIONAL CENTER FOR EDUCATION STATISTICS RESPONSE TO RECOMMENDATIONS OF THE NATIONAL ASSESSMENT GOVERNING BOARD AD HOC COMMITTEE ON.

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Introduction to Simulation

Improving Fairness in Memory Scheduling

A Case Study: News Classification Based on Term Frequency

Probability and Game Theory Course Syllabus

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Shockwheat. Statistics 1, Activity 1

A NEW ALGORITHM FOR GENERATION OF DECISION TREES

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Surprise-Based Learning for Autonomous Systems

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Real Estate Agents Authority Guide to Continuing Education. June 2016

Evolution of Symbolisation in Chimpanzees and Neural Nets

Constructive Induction-based Learning Agents: An Architecture and Preliminary Experiments

High-level Reinforcement Learning in Strategy Games

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Georgetown University at TREC 2017 Dynamic Domain Track

Softprop: Softmax Neural Network Backpropagation Learning

Radius STEM Readiness TM

Corrective Feedback and Persistent Learning for Information Extraction

Agent-Based Software Engineering

Transcription:

Computer Science 6100/4100: Machine Learning RPI, Fall 2008 Instructor: Sanmay Das What is Machine Learning? Enabling computers to learn from data Supervised learning: generalizing from seen data to unseen Unsupervised learning: Finding patterns in input data Reinforcement learning: learning how to act Where Does This Fit in AI? What is the goal of AI? Two dichotomies: thinking/acting and humans/rationality (Russell & Norvig [RN]) Turing test: acting like humans (restricted domain) Alternative goal: acting rationally? Rational Behavior [RN]: Maximize goal attainment Computational limitations Definition of optimality Easier when we specify utility functions Perfect rationality is problematic:

Intelligent Agents [RN] environment percepts actions actuators sensors agent Agents include humans and artificial agents Agent function maps percept histories to actions f : P A Agent program runs on the physical architecture to produce f Human Beings and Humanoid Robots Environment:? Sensors/Percepts:? Actuators/Actions:? Performance measure:? A Trading Agent Environment:? Sensors:? Percepts:? Actuators:? Actions:? Performance measure:? Supervised Learning Induction Simplest form: true function f : X Y You are given pairs generated from f (x 1, y 1 ), (x 2, y 2 ),... (x n, y n ) Learn h close to f

Learning a Hypothesis How do we learn h? Algorithms from machine learning and statistics What can we prove about h? Statistical/ computational learning theory Different learning algorithms have different inductive biases Example...good loans Hypothesis Spaces Examples: decision trees, linear classifiers Finding a hypothesis can be thought of as search in the hypothesis space Hypothesis space is part of the inductive bias Unsupervised Learning No explicit outputs Build a model of the inputs in some way Probabilistic model of feature distribution Clustering Reinforcement Learning Agent interacts with the world Receives feedback in the form of rewards (or costs) Must choose which actions to take Major issues: Delayed reward/credit assignment Exploration/Exploitation

Elements of RL Sutton & Barto [SB]: Policy Reward function Value function Model? Markov Decision Processes (Problems) State space: Initial state: S S 0 Action space: A Transition model: T : S A S [0, 1] Reward function: R : S R Utility Theory Going from the real-world to sensible reward functions Parallelism: in economics, utility theory is useful in abstracting over preferences Thought question: which of these two options would you prefer? $1000 with 50% prob., or $400 for sure? Basis for the insurance industry! Learning in Economics Agents that participate in markets are assumed to be rational This means they solve interesting learning and decision-making problems Change the focus to understanding how the interaction of rational players leads to system-wide dynamics... Two restaurants Kyle s model

Syllabus and Course Policies Come to class Participate 3 projects (can be done in pairs) 2 in-class exams Check the website regularly! A Note on Math Calculus Ability to play with matrices Probability! Uniform, Gaussian distributions, Bayes rule Let s do a quick problem... Problem 1 MBC Instruments has designed a new test for Horrible Disease. This test is correct with 99% accuracy. 1 in 10000 people in the general population has the disease. SBC took the test and it came out positive. Is it more likely that SBC has the disease or doesn t? Calculate the probability and turn in your work Statistics Difference between standard deviation and standard error?