Review. Philipp Koehn. 3 December 2015

Similar documents
Knowledge-Based - Systems

Lecture 10: Reinforcement Learning

Evolutive Neural Net Fuzzy Filtering: Basic Description

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Introduction to Simulation

Planning with External Events

TD(λ) and Q-Learning Based Ludo Players

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

FF+FPG: Guiding a Policy-Gradient Planner

Learning and Transferring Relational Instance-Based Policies

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Proof Theory for Syntacticians

Reinforcement Learning by Comparing Immediate Reward

Probability and Game Theory Course Syllabus

Learning Methods for Fuzzy Systems

Axiom 2013 Team Description Paper

Probabilistic Latent Semantic Analysis

Rule-based Expert Systems

An OO Framework for building Intelligence and Learning properties in Software Agents

(Sub)Gradient Descent

Discriminative Learning of Beam-Search Heuristics for Planning

Self Study Report Computer Science

An Investigation into Team-Based Planning

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

POLA: a student modeling framework for Probabilistic On-Line Assessment of problem solving performance

Clouds = Heavy Sidewalk = Wet. davinci V2.1 alpha3

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Corrective Feedback and Persistent Learning for Information Extraction

University of Groningen. Systemen, planning, netwerken Bosman, Aart

AMULTIAGENT system [1] can be defined as a group of

Learning to Schedule Straight-Line Code

MYCIN. The MYCIN Task

A Comparison of Annealing Techniques for Academic Course Scheduling

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Radius STEM Readiness TM

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

stateorvalue to each variable in a given set. We use p(x = xjy = y) (or p(xjy) as a shorthand) to denote the probability that X = x given Y = y. We al

AQUA: An Ontology-Driven Question Answering System

Laboratorio di Intelligenza Artificiale e Robotica

Action Models and their Induction

CSL465/603 - Machine Learning

Georgetown University at TREC 2017 Dynamic Domain Track

The Conversational User Interface

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Generative models and adversarial training

A Genetic Irrational Belief System

Task Completion Transfer Learning for Reward Inference

Artificial Neural Networks written examination

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Modeling user preferences and norms in context-aware systems

CS 598 Natural Language Processing

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

An Interactive Intelligent Language Tutor Over The Internet

Lecture 1: Machine Learning Basics

Designing A Computer Opponent for Wargames: Integrating Planning, Knowledge Acquisition and Learning in WARGLES

Causal Link Semantics for Narrative Planning Using Numeric Fluents

Task Completion Transfer Learning for Reward Inference

A Reinforcement Learning Variant for Control Scheduling

Speeding Up Reinforcement Learning with Behavior Transfer

Neuro-Symbolic Approaches for Knowledge Representation in Expert Systems

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Transfer Learning Action Models by Measuring the Similarity of Different Domains

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Uncertainty concepts, types, sources

Visual CP Representation of Knowledge

A Neural Network GUI Tested on Text-To-Phoneme Mapping

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

Segmented Discourse Representation Theory. Dynamic Semantics with Discourse Structure

EVOLVING POLICIES TO SOLVE THE RUBIK S CUBE: EXPERIMENTS WITH IDEAL AND APPROXIMATE PERFORMANCE FUNCTIONS

Universidade do Minho Escola de Engenharia

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Softprop: Softmax Neural Network Backpropagation Learning

Chapter 2 Rule Learning in a Nutshell

Intelligent Agents. Chapter 2. Chapter 2 1

The Good Judgment Project: A large scale test of different methods of combining expert predictions

The Evolution of Random Phenomena

TEACHING AND EXAMINATION REGULATIONS PART B: programme-specific section MASTER S PROGRAMME IN LOGIC

An Experimental Comparison of Knowledge Representation Schemes

Seminar - Organic Computing

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Regret-based Reward Elicitation for Markov Decision Processes

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

TOKEN-BASED APPROACH FOR SCALABLE TEAM COORDINATION. by Yang Xu PhD of Information Sciences

1. Answer the questions below on the Lesson Planning Response Document.

Laboratorio di Intelligenza Artificiale e Robotica

Foundations of Knowledge Representation in Cyc

Henry Tirri* Petri Myllymgki

Applications of memory-based natural language processing

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

INTRODUCTION TO DECISION ANALYSIS (Economics ) Prof. Klaus Nehring Spring Syllabus

Truth Inference in Crowdsourcing: Is the Problem Solved?

Transcription:

Review Philipp Koehn 3 December 2015

Exam 1 Date: Tuesday, December 15, 9am 12pm Location: Hodson 213 (here) Format closed book 2 pages of notes allowed Grading: homework is 60%, exam is 40%

Lectures Covered By Exam 2 Artificial Intelligence in Context not covered Intelligent Agents, Heuristic Search, and Game Playing Intelligent Agents Basic Search Informed Search Game Playing Constraint Satisfaction Logic and Knowledge Representation Logical Agents First Order Logic Inference in First-Order Logic Knowledge Representation Planning Uncertainty Probabilistic Reasoning Bayesian Networks Markov Decision Processes Decision Theory Machine Learning Statistical Learning not covered Neural Networks not covered Reinforcement Learning Natural Language not covered

3 intelligent agents

Intelligent Agents 4 Types of environments: (in)accessible, (non-)deterministic, (non)-episodic Types of agents: reflex, with memory, with goals, with learning, utility-based

Basic Search 5 Problem solving agents Analysis completeness time complexity space complexity optimality Basic search algorithms tree search breadth / depth-first search iterative deepening

Informed Search 6 Best-first search A search Heuristic algorithms hill-climbing simulated annealing

Game Playing 7 Types of games deterministic / probabilistic (im)perfect information Search over game tree minimax algorithm α-β pruning evaluation functions Solvable games, but typically resource limits Probabilistic games: pruning with bounds

Constraint Satisfaction 8 Variables, domains, constraints Backtracking search Constraint propagation forward checking arc consistency Problems structure Iterative algorithms

9 logic

Logical Agents 10 Knowledge-based agents internal representations incorporate new percepts deduce hidden properties of the world Logic formal language (syntax) truth in real world (semantics) entailment and inference Algorithms forward chaining backward chaining resolution

First Order Logic 11 Adding variables relations functions quanitifiers Modeling natural language Dynamic world: states and fluents Situation calculus

Inference in First-Order Logic 12 Reducing first-order inference to propositional inference Unification Generalized modus ponens Forward and backward chaining Logic programming (Prolog) Resolution

Knowledge Representation 13 Representation systems Categories and objects ontologies Frames Events and scripts Practical examples Cyc Semantic web

Planning 14 Search vs. planning STRIPS operators Partial-order planning The real world incomplete information incorrect information quantification problem Conditional planning Monitoring and replanning

15 uncertainty

Probabilistic Reasoning 16 Uncertainty Probability conditional probability independence Bayes rule Inference Independence and Bayes Rule

Bayesian Networks 17 Bayesian Networks Parameterized distributions Exact inference inference by enumeration variable elimination Approximate inference rejection sampling likelihood weighting Markov chain Monte Carlo

Markov Decision Processes 18 Temporal processes Hidden Markov models Inference filtering smoothing best sequence Kalman filters Dynamic Bayesian nets Example: speech recognition

Decision Theory 19 Rational preferences Utilities Decision networks Value of information Markov decision processes Inference algorithms value iteration policy iteration Partially observable Markov decision processes (POMDP)

Reinforcement Learning 20 Rewards, often delayed Passive reinforcement learning (compute utility of policy) adaptive dynamic programming temporal difference learning Active Reinforcement Learning greedy agent extended adaptive dynamic programming Q-learning Generalizations over the state space Policy search

21 exam questions

Sample Exams 22 Exam will assess understanding of core concepts understanding of algorithms ability to carry them out by hand Exam will be similar to http://www.cs.berkeley.edu/ russell/classes/cs188/f05/#oldexams http://pages.cs.wisc.edu/ shavlik/cs540.html#previous-exams

Sample Question 23 Logical knowledge representation Which of the following are semantically and syntactically correct translations of Everyone s zipcode within a state has the same first digit? 1. x, s, z1 [State(s) LivesIn(x, s) Zip(x) = z1] [ y, z2 LivesIn(y, s) Zip(y) = z2 Digit(1, z1) = Digit(1, z2)]. 2. x, s [State(s) LivesIn(x, s) z1 Zip(x) = z1] [ y, z2 LivesIn(y, s) Zip(y) = z2 Digit(1, z1) = Digit(1, z2)]. 3. x, y, s State(s) LivesIn(x, s) LivesIn(y, s) Digit(1, Zip(x) =Zip(y)). 4. x, y, s State(s) LivesIn(x, s) LivesIn(y, s) Digit(1, Zip(x)) = Digit(1, Zip(y)).

Sample Question 24 Game playing Consider the game of 2 2 tictactoe. 1. Draw the full game tree down to depth 2. You need not show nodes that are rotations or reflections of siblings already shown. 2. Circle any node that would not be evaluated by alpha beta during a left-to-right exploration of your tree.

25 questions?