Foundations of Intelligent Systems CSCI (Fall 2015)

Similar documents
Python Machine Learning

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

(Sub)Gradient Descent

Artificial Neural Networks written examination

Evolutive Neural Net Fuzzy Filtering: Basic Description

Rule Learning With Negation: Issues Regarding Effectiveness

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Knowledge-Based - Systems

Softprop: Softmax Neural Network Backpropagation Learning

Lecture 1: Machine Learning Basics

Axiom 2013 Team Description Paper

CS Machine Learning

Rule Learning with Negation: Issues Regarding Effectiveness

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Learning Methods for Fuzzy Systems

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

AQUA: An Ontology-Driven Question Answering System

CSL465/603 - Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

MYCIN. The MYCIN Task

Assignment 1: Predicting Amazon Review Ratings

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Seminar - Organic Computing

An OO Framework for building Intelligence and Learning properties in Software Agents

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

An Introduction to Simio for Beginners

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

CS 446: Machine Learning

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Laboratorio di Intelligenza Artificiale e Robotica

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Universidade do Minho Escola de Engenharia

INPE São José dos Campos

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Introduction to Simulation

Radius STEM Readiness TM

Discriminative Learning of Beam-Search Heuristics for Planning

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Learning From the Past with Experiment Databases

arxiv: v1 [cs.lg] 15 Jun 2015

Test Effort Estimation Using Neural Network

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Word Segmentation of Off-line Handwritten Documents

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

About How Good is Estimation? Assessment Materials Page 1 of 12

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

University of Groningen. Systemen, planning, netwerken Bosman, Aart

*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE. Proceedings of the 9th Symposium on Legal Data Processing in Europe

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Classification Using ANN: A Review

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Grade 6: Correlated to AGS Basic Math Skills

Name: Class: Date: ID: A

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Lecture 10: Reinforcement Learning

Lecture 1: Basic Concepts of Machine Learning

Mathematics process categories

Speaker Identification by Comparison of Smart Methods. Abstract

I-COMPETERE: Using Applied Intelligence in search of competency gaps in software project managers.

Probability and Statistics Curriculum Pacing Guide

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Emotional Variation in Speech-Based Natural Language Generation

Model Ensemble for Click Prediction in Bing Search Ads

Firms and Markets Saturdays Summer I 2014

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Time series prediction

A Genetic Irrational Belief System

Evolution of Symbolisation in Chimpanzees and Neural Nets

LEGO MINDSTORMS Education EV3 Coding Activities

Using focal point learning to improve human machine tacit coordination

Computerized Adaptive Psychological Testing A Personalisation Perspective

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Human Emotion Recognition From Speech

Switchboard Language Model Improvement with Conversational Data from Gigaword

Cooperative evolutive concept learning: an empirical study

Algebra 2- Semester 2 Review

An empirical study of learning speed in backpropagation

Reducing Features to Improve Bug Prediction

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

Using dialogue context to improve parsing performance in dialogue systems

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

Proof Theory for Syntacticians

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Neuro-Symbolic Approaches for Knowledge Representation in Expert Systems

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Diagnostic Test. Middle School Mathematics

SARDNET: A Self-Organizing Feature Map for Sequences

Unit 3: Lesson 1 Decimals as Equal Divisions

Self Study Report Computer Science

Rule-based Expert Systems

Transcription:

Foundations of Intelligent Systems CSCI-630-01 (Fall 2015) Final Examination, Fri. Dec 18, 2015 Instructor: Richard Zanibbi, Duration: 120 Minutes Name: Instructions The exam questions are worth a total of 100 points. After the exam has started, once a student leaves the exam room, they may not return to the exam room until the exam has finished. Remain in the exam room if you finish during the final five minutes of the exam. Close the door behind you quietly if you leave before the end of the examination. The exam is closed book and notes. Place any coats or bags at the front of the exam room. If you require clarification of a question, please raise your hand You may use pencil or pen, and write on the backs of pages in the booklets. Additional pages are provided at the back of the exam - clearly indicate where answers to each question may be found. 1

Questions 1. True/False (10 points) (a) (b) (c) (d) (e) (f) (g) (h) (i) (j) ( T / F ) Shannon s entropy metric suggests that the information content of a prediction increases as the probability distribution for possible outcomes becomes more random. ( T / F ) Training algorithms for multi-layer perceptrons, logistic regression, and the (classical) perceptron algorithm minimize the sum of squared error for output values. ( T / F ) Newell won a Nobel Prize for his work in economics related to the concept of satisficing, the idea that agents (e.g. people) will seek goals until an acceptable rather than optimal outcome is obtained. ( T / F ) Nearly all optimization algorithms that we studied in the course, including genetic algorithms, min-conflicts (used to solve the 8-queen s problem), simulated annealing, decision tree learning, and backpropagation are variations of hill climbing. ( T / F )...(continued from previous question)...and a common variation is to use randomness in some way to increase the amount of exploration during the search for optimal parameters, in the hope of avoiding local maxima or minima. (T / F) Predicate logic is semi-decidable. ( T / F ) Bayes rule is defined by: P (A, B) = P (A B)P (B). ( T / F ) Turing created his imitation game as a way to determine when human intelligence had been acquired by a machine, i.e. a true Artificial Intelligence. (T / F) In practice, for problems with small search spaces and little or no noise in variable values, a brute-force solution may be preferable to an intelligent solution. ( T / F ) A logical statement α is satisfiable when every combination of truth values that makes a knowledge base true also makes α true. 2. Miscellaneous Topics (18 points) (a) (4) Define what it means for an agent to act rationally. (b) (2) Name the four parts of an incremental search problem definition. 2

(c) (4) Identify how the parts you identified in part (b) change for each of the following. Game search Local search (d) (2) Which type of attributes is a decision tree better suited for than a neural network? (e) (3) Who coined the term Artificial Intelligence? (f) (3) When is it necessary to use the expectiminimax algorithm in a game playing program? 3. Logic (26 points) (a) (6) Convert each of the following statements into 1) a sentence in propositional logic, and 2) a sentence in predicate (first-order) logic. i. Gary has a nice cell phone. ii. All people that attend RIT are fun. iii. If Gary goes, Mary goes, and vice-versa. 3

(b) (6) Define and provide an example for each of the following. i. A complete inference algorithm. ii. A constant in predicate logic. iii. A contradiction. 4

(c) (14) The knowledge base below represents a Canadian legal matter. 1. x, y, z ally(x, y) ally(y, z) (x = z) ally(x, z) 2. x, y sold(x, Beer, y) canadian(x) ally(y, Belgium) criminal(x) 3. x has(x, Beer) ally(x, Belgium) sold(colonellabatt, Beer, x) 4. ally(spain, India) 5. ally(india, Belgium) 6. has(spain, Beer) 7. canadian(colonellabatt) 8. (Spain = Belgium) i. (6) Use forward chaining to prove that Colonel Labatt is a criminal (criminal(colonellabatt)). Make sure to show the unification of variables at each step of your proof. 5

ii. (8) Convert the rules from the knowlege base to Conjunctive Normal Form (CNF), and then prove criminal(colonellabatt) using resolution. Again, make sure to show the unification of variables at each step of your proof. 6

4. Decision Trees and AdaBoost (18 points) (a) (3) Identify the three stopping conditions (base cases) for the decision tree learning algorithm. These cases are where the algorithm decides not to split the training samples at a node. (b) (4) Chi-squared pruning can be used to prevent over-fitting, by reducing the size of a decision tree after its construction. The Chi-square metric computes a difference between two probability distributions. At a node whose children are being considered for pruning, specifically which two distributions are being compared? After this comparison is made, when will pruning be applied? (c) (4) Using the entropy formula, show how to compute the entropy for a distribution of 10 people: 8 have a sandwhich (+), and 2 do not have a sandwhich (-). You do not need to compute the final value. (d) (2) Why does the decision tree learning algorithm split samples using the attribute that reduces entropy the most? 7

(e) AdaBoost creates an ensemble of classifiers that work together to make classification decisions, with member classifiers being trained one-at-a-time. i. (3) What is different about how AdaBoost handles training samples versus other machine learning algorithms such as regular decision trees or the backpropagation algorithm? ii. (2) What determines the weight of each classifier in the vote to select the final classification of an AdaBoost classifier? 5. Linear Regression and Neural Networks (28 points) (a) EZRide prices its cars based on interior size ($100/cubic foot) and top speed in miles per hour of the car ($50/mile per hour). The base price of a car before considering the size of the interior and top speed is $1500. i. (2) Provide a linear model for the cost of an EZRide car. ii. (4) Over a ten year period, EZRide changes their pricing. We have obtained a long list of (cubic feet, top speed, actual car price) samples in a spreadsheet. Assuming that we want to minimize the sum of squared differences between our model s predictions and actual car prices, how can we accurately estimate the new prices without machine learning? 8

(b) (6) Draw a linear regressor neuron for our EZRide model. Identify the inputs, the outputs, and where the prices in the model are represented in the neuron. (c) (4) What would you need to change in order for the linear regressor in the last question to become a logistic regressor? Also, what would the range of output values be after making this change? And what is the derivative of the output? (d) (4) Define the problem of fitting the weights in a neural network as a modified local search problem. For a multi-layer perceptron (MLP), what part of the search problem definition does backpropagation address? 9

(e) (8) When training an MLP to classify three different types of cat image, how are correct outputs represented in the training data? Also, How likely is the MLP to produce this target outputs in the training data exactly? Why is it necessary to use backpropagation to compute nodes in layers other than the output layer? (Bonus: +2) In what order should you read the sections of a research paper? 10

Additional Space 11

Additional Space 12

Additional Space 13

Additional Space 14