Artificial Intelligence Recap. Mausam

Similar documents
Lecture 10: Reinforcement Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

Knowledge-Based - Systems

Rule-based Expert Systems

Laboratorio di Intelligenza Artificiale e Robotica

FF+FPG: Guiding a Policy-Gradient Planner

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

Intelligent Agents. Chapter 2. Chapter 2 1

Seminar - Organic Computing

Transfer Learning Action Models by Measuring the Similarity of Different Domains

MYCIN. The MYCIN Task

Axiom 2013 Team Description Paper

Learning and Transferring Relational Instance-Based Policies

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Introduction to Simulation

Laboratorio di Intelligenza Artificiale e Robotica

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

AMULTIAGENT system [1] can be defined as a group of

Probabilistic Latent Semantic Analysis

Lecture 1: Basic Concepts of Machine Learning

Planning with External Events

TD(λ) and Q-Learning Based Ludo Players

Discriminative Learning of Beam-Search Heuristics for Planning

Natural Language Processing. George Konidaris

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Planning in Intelligent Systems: Model-based Approach to Autonomous Behavior

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Evolutive Neural Net Fuzzy Filtering: Basic Description

An OO Framework for building Intelligence and Learning properties in Software Agents

Regret-based Reward Elicitation for Markov Decision Processes

Artificial Neural Networks written examination

Probability and Game Theory Course Syllabus

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Modeling user preferences and norms in context-aware systems

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

(Sub)Gradient Descent

Python Machine Learning

Visual CP Representation of Knowledge

An Investigation into Team-Based Planning

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

A Case-Based Approach To Imitation Learning in Robotic Agents

The Evolution of Random Phenomena

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Learning Methods for Fuzzy Systems

Reinforcement Learning by Comparing Immediate Reward

CS Machine Learning

Georgetown University at TREC 2017 Dynamic Domain Track

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

Toward Probabilistic Natural Logic for Syllogistic Reasoning

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Action Models and their Induction

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

Chapter 2 Rule Learning in a Nutshell

A Genetic Irrational Belief System

AQUA: An Ontology-Driven Question Answering System

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

A simulated annealing and hill-climbing algorithm for the traveling tournament problem

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

An Introduction to Simio for Beginners

CSL465/603 - Machine Learning

UDL AND LANGUAGE ARTS LESSON OVERVIEW

GACE Computer Science Assessment Test at a Glance

Speech Recognition at ICSI: Broadcast News and beyond

High-level Reinforcement Learning in Strategy Games

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

CS 598 Natural Language Processing

Proof Theory for Syntacticians

Applications of memory-based natural language processing

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

How do adults reason about their opponent? Typologies of players in a turn-taking game

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

While you are waiting... socrative.com, room number SIMLANG2016

MYCIN. The embodiment of all the clichés of what expert systems are. (Newell)

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Learning to Schedule Straight-Line Code

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Pod Assignment Guide

Truth Inference in Crowdsourcing: Is the Problem Solved?

A Context-Driven Use Case Creation Process for Specifying Automotive Driver Assistance Systems

Radius STEM Readiness TM

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Learning Methods in Multilingual Speech Recognition

Universidade do Minho Escola de Engenharia

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Passport to Your Identity

Transcription:

Artificial Intelligence Recap Mausam

What is intelligence? (bounded) Rationality We have a performance measure to optimize Given our state of knowledge Choose optimal action Given limited computational resources Human-like intelligence/behavior

Simple reflex agents AGENT Sensors Condition/Action rules what world is like now what action should I do now? ENVIRONMENT Effectors

Reflex agent with internal state What world was like How world evolves Condition/Action rules Sensors what world is like now what action should I do now? ENVIRONMENT AGENT Effectors

Goal-based agents What world was like How world evolves Sensors what world is like now What my actions do Goals what it ll be like if I do actions A1-An what action should I do now? ENVIRONMENT AGENT Effectors

Utility-based agents What world was like How world evolves What my actions do Utility function Sensors what world is like now what it ll be like if I do acts A1-An How happy would I be? what action should I do now? ENVIRONMENT AGENT Effectors

Learning agents What world was like Learn how world evolves Learn what my actions do Learn utility function Feedback Sensors what world is like now what it ll be like if I do acts A1-An How happy would I be? what action should I do now? ENVIRONMENT AGENT Effectors

Search in Discrete State Spaces This is different from Web Search Every discrete problem can be cast as a search problem. (states, actions, transitions, cost, goal-test) Types uninformed systematic: often slow DFS, BFS, uniform-cost, iterative deepening Heuristic-guided: better Greedy best first, A* relaxation leads to heuristics Local: fast, fewer guarantees; often local optimal Hill climbing and variations Simulated Annealing: global optimal Genetic algorithms: somewhat non-local due to crossing over (Local) Beam Search

Search Example: Game Playing Game Playing AND/OR search space (max, min) minimax objective function minimax algorithm (~dfs) alpha-beta pruning Utility function for partial search Learning utility functions by playing with itself Openings/Endgame databases Secondary search/quiescence search

Knowledge Representation and Reasoning Representing: what I know Reasoning: what I can infer Logic PDDL Bayes Nets

KR&R Example: Propositional Logic Representation: Propositional Logic Formula CNF, Horn Clause, Reasoning: Deduction Forward Chaining Resolution Model Finding Enumeration SAT Solving

Search+KR&R Example: SAT Solving Representation: CNF Formula Reasoning pure literals; unit clauses; unit propagation Search DPLL (~ backtracking search) MOM s heuristic Local: GSAT, WalkSAT Advances Clause Learning: learning from mistakes Restarts in systematic search Portfolio of SAT solvers; Parameter tuning c b a c b Phase Transitions in SAT problems

Search+KR&R Example: Planning Representation: STRIPS Reasoning: Planning Graph Polynomial data structure reasons about constraints on plans (mutual exclusion) Search Forward: state space search planning graph based heuristic Backward: subgoal space search Local: FF (enforced hill climbing) Planning as SAT: SATPlan C A B

KR&R: Probability Representation: Bayesian Networks encode probability distributions compactly by exploiting conditional independences Earthquake Burglary Reasoning Exact inference: var elimination Approx inference: sampling based methods Alarm JohnCalls MaryCalls rejection sampling, likelihood weighting, Gibbs sampling

KR&R: One-step Decision Theory Representation actions, probabilistic outcomes, rewards Reasoning expected value/regret of action Expected value of perfect information Non-deterministic uncertainty Maximax, maximin, eq likelihood, minimax regret.. Utility theory: value of money

KR&R: Markov Decision Process Representation states, actions, probabilistic outcomes, rewards ~AND/OR Graph (sum, max) max Reasoning: V*(s) Value Iteration: search thru value space Policy Iteration: search thru policy space State space search LAO* (AND/OR version of A*) a 1 V 1 = 6.5 ( ~1) s 0 5 a 2 a 3 s 1 s 2 s 3

Learning: BNs/NB ML estimation. max P(D θ) counting; smoothing MAP estimation max P(θ D).. Hidden data Expectation Maximization (EM) {local search} Structure learning (BN) Local search thru structure space Trade off structure complexity and data likelihood

Learning: Reinforcement Learning Learn model while taking actions What to learn T and R: model based Policy: Model free Which actions to take Exploration - Exploitation

Popular Themes Weak AI vs. Strong AI Syntax vs. Semantics Logic vs. Probability

Weak AI vs. Strong AI Weak general methods primarily for problem solving A*, CSP, Bayes Nets, MDPs Strong -- knowledge intensive more knowledge less computation achieve better performance in specific tasks POS tagging, Chess, Jeopardy Daniel S. Weld 26

Syntax vs. Semantics Syntax: what can I say Sentence in English Logic formula in Prop logic CPT in BN Semantics: what does it mean meaning that we understand A ^ B: both A and B are true Conditional independence

Logic vs. Probability Discrete Continuous Hill climbing Gradient ascent SAT solving BN inference Tree structured CSP Polytree Bayes nets Cutset Cutset Classical Planning Factored MDP Bellman Ford Value Iteration A* LAO*

Advanced Ideas in AI Factoring state/actions Hierarchical decomposition Hierarchy of actions Sampling based approaches Sampling in systematic search Markov Chain Monte Carlo UCT algorithm: game playing Particle filters: belief tracking in robotics Context sensitive independence Cutsets Backbones in logic Combining probability and logic Markov Logic Networks, Probabilistic Relational Models

AI we didn t cover Temporal models: HMMs, Kalman filters Ontologies Robotics Vision Mechanism design Multi-agent systems Sensor Networks Computational Neuroscience

AI is about problems. It is an application-driven field Happy to beg, borrow, steal ideas from anywhere Traditionally discrete more and more cont. Traditionally logic almost all probability Recent close connections with EE/Stat due to ML HUGE field

Applications of AI Mars rover: planning Jeopardy: NLP, info retrieval, machine learning Puzzles: search, CSP, logic Chess: search Web search: IR Text categorization: machine learning Self-driving cars: robotics, prob. reasoning, ML

Ethics of Artificial Intelligence Robots Robot Rights Three Laws of Robotics AI replacing people jobs Any different from industrial revolution? Ethical use of technology Dynamite vs. Speech understanding Privacy concerns Humans/Machines reading freely available data on Web Gmail reading our news AI for developing countries/improving humanity

AI-Centric World Graphics Algorithms Theory Databases Operations Research Linguistics AI Robot Design Psychology Neurosc. Statistics