CS434 Machine Learning and Data Mining. Fall 2008

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Basic Concepts of Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Laboratorio di Intelligenza Artificiale e Robotica

Firms and Markets Saturdays Summer I 2014

CSL465/603 - Machine Learning

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Navigating the PhD Options in CMS

Exploration. CS : Deep Reinforcement Learning Sergey Levine

A Case Study: News Classification Based on Term Frequency

CS177 Python Programming

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

CS Machine Learning

Course Content Concepts

Course Syllabus for Math

CWSEI Teaching Practices Inventory

Lecture 10: Reinforcement Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Word Segmentation of Off-line Handwritten Documents

FINN FINANCIAL MANAGEMENT Spring 2014

Foothill College Summer 2016

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

CLASSROOM PROCEDURES FOR MRS.

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

Financial Accounting Concepts and Research

Math 121 Fundamentals of Mathematics I

Artificial Neural Networks written examination

Active Learning. Yingyu Liang Computer Sciences 760 Fall

General Physics I Class Syllabus

Bachelor Class

Axiom 2013 Team Description Paper

Coming in. Coming in. Coming in

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

DOCTOR OF PHILOSOPHY HANDBOOK

Physics Experimental Physics II: Electricity and Magnetism Prof. Eno Spring 2017

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Humboldt-Universität zu Berlin

Syllabus Fall 2014 Earth Science 130: Introduction to Oceanography

How can climate change be considered in Vulnerability and Capacity Assessments? - A summary for practitioners April 2011

GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017

Networks and the Diffusion of Cutting-Edge Teaching and Learning Knowledge in Sociology

Design and Creation of Games GAME

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology


ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

EXPO MILANO CALL Best Sustainable Development Practices for Food Security

Valcik, N. A., & Tracy, P. E. (2013). Case studies in disaster response and emergency management. Boca Raton, FL: CRC Press.

(Sub)Gradient Descent

Language Arts: ( ) Instructional Syllabus. Teachers: T. Beard address

SPM 5309: SPORT MARKETING Fall 2017 (SEC. 8695; 3 credits)

Lecture 6: Applications

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

Office: Gallagher Hall 3406

Python Machine Learning

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

1. Welcome and introduction from the Director of Undergraduate Studies

GIS 5049: GIS for Non Majors Department of Environmental Science, Policy and Geography University of South Florida St. Petersburg Spring 2011

Evaluating Statements About Probability

CS 446: Machine Learning

Welcome to. ECML/PKDD 2004 Community meeting

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Introduction to Questionnaire Design

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS APPLIED STATICS MET 1040

Academic Integrity RN to BSN Option Student Tutorial

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Constraining X-Bar: Theta Theory

GEOG 473/573: Intermediate Geographic Information Systems Department of Geography Minnesota State University, Mankato

GEOCODING LOCATIONS OF HISTORIC RECLAMATION RESEARCH SITES USING GOOGLE EARTH

preassessment was administered)

MinE 382 Mine Power Systems Fall Semester, 2014

Introduce yourself. Change the name out and put your information here.

Everton Library, Liverpool: Market assessment and project viability study 1

The Heart of Philosophy, Jacob Needleman, ISBN#: LTCC Bookstore:

A Teacher Toolbox. Let the Great World Spin. for. by Colum McCann ~~~~ The KCC Reads Selection. for the. Academic Year ~~~~

Outreach Connect User Manual

INTERMEDIATE ALGEBRA Course Syllabus

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

CS 100: Principles of Computing

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

ECE-492 SENIOR ADVANCED DESIGN PROJECT

Tips for Academic Scholarship Success. Handouts from today s presentation are available online: studentaffairs.pitt.edu/fye/academicscholarships/

CLASS EXODUS. The alumni giving rate has dropped 50 percent over the last 20 years. How can you rethink your value to graduates?

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Human Emotion Recognition From Speech

Syllabus Foundations of Finance Summer 2014 FINC-UB

INTERMEDIATE ALGEBRA PRODUCT GUIDE

Syllabus Education Department Lincoln University EDU 311 Social Studies Methods

White Paper. The Art of Learning

Reinforcement Learning by Comparing Immediate Reward

Lesson 1 Taking chances with the Sun

Syllabus: CS 377 Communication and Ethical Issues in Computing 3 Credit Hours Prerequisite: CS 251, Data Structures Fall 2015

B. How to write a research paper

Transcription:

CS434 Machine Learning and Data Mining Fall 2008 1

Administrative Trivia Instructor: Dr. Xiaoli Fern (Back on Wednesday) web.engr.oregonstate.edu/~xfern Office hour: 1 hour before class, or by appointment Course webpage web.engr.oregonstate.edu/~xfern/classes/cs434 Please check course webpage frequently Learning objectives Syllabus Course policy Course announcements 2

Briefly Grading: Homeworks and projects 55% Midterm 20% Final exam 25% Homeworks due at the beginning of the class (first 5 minutes of the class) Late submission will be accepted if it s no more than 24 hours late, but only gets 80% Collaborations policy (for solo assignments) Verbal discussion about general approaches and strategies allowed Can talk about examples not in the assignments Anything you turn in has be created by you and you alone For team assignments, the above policies apply between teams. 3

Course materials No text book required, slides and reading materials will be provided on course webpage There are a number of recommended books that are good references Machine learning by Tom Mitchell (TM) Pattern recognition and machine learning by Chris Bishop (Bishop) 4

What is learning? Generally speaking any change in a system that allows it to perform better the second time on repetition of the same task or on another task drawn from the same distribution --- Herbert Simon 5

Machine learning Task T Performance P Learning Algorithm Experience E Learning = Improving with experience at some task Improve over task T with respect to P based on experience E

When do we need computer to learn? What is not learning? A program that does tax return A program that looks up phone numbers in phone directory 7

When do we need learning? Sometimes there is no human expert knowledge Predict whether a new compound will be effective for treating some disease Sometimes humans can do it but can t describe how they do it Recognize hand written digits Sometimes the things we need to learn change frequently Stock market, weather forecasting, computer network routing Sometimes the thing we need to learn needs customization Spam filters 8

Fields of Interest Supervised learning learn to predict Unsupervised learning learn to understand and describe the data Reinforcement learning learn to act Data mining A highly overlapping concept, but focuses on large volume of data: To obtain useful knowledge from large volume of data 9

Supervised Learning: example Learn to predict output from input E.g. predict the risk level of a loan applicant based on income and savings MANY interesting applications! Spam filters, Collaborative filtering (predicting if a customer will be interested in an advertisement), Ecological (predicting if a species is absent/present in a certain environment), Medical 10

Unsupervised learning Find patterns and structure in data Clustering art 11

Example Applications Market Segmentation: divide a market into distinct subsets of customers Collect different attributes of customers based on their geographical and lifestyle Find clusters of similar customers, where each cluster may conceivably be selected as a market target to be reached with a distinct marketing strategy Document clustering For organizing search results etc. 12

Reinforcement learning 13

Example Applications Robot controls Elevator scheduling Games such as backgammon and chess 14

Learning objectives Students are able to apply supervised learning algorithms to prediction problems and evaluate the results. Students are able to apply unsupervised learning algorithms to data analysis problems and evaluate results. Students are able to apply reinforcement learning algorithms to control problem and evaluate results. Students are able to take a description of a new problem and decide what kind of problem (supervised, unsupervised, or reinforcement) it is. 15

Example: Learning to play T: play checkers checkers P: percent of games won in world tournament What experience? What should we exactly learn? How should we represent it? What specific algorithm to learn it? 16

Type of training experience Direct For each board state, we obtain a best move for that position Observe many states and many moves Try to learn what is the best move for an unseen state Indirect Just observe a sequence of plays and the end result More difficult, because which of the moves are the bad (good) ones for a bad (good) game? This is the credit assignment problem, very difficult to solve 17

Choose the Target Function (what should we learn) Choosemove: board state -> move? V: Board state -> Reward (value of the state)? 18

Possible definition for target function V If b is a final board state that won, V(b)=100 If b is a final board state that is lost, V(b)= -100 If b is a final board state that is drawn, the V(b)=0 If b is not a final board state, then V(b)=V(b ), where b is the best possible final state reachable from b. This gives correct values, but is not operational 19

Choose representation for target function Collection of rules Neural network? Polynomial functions of board features? 20

A representation for learned function w0 + w1 f1( b) + w2 f2( b) + L+ wn fn( b) f1, f2,, fn are features describing a board state For example, f1 can be the number of black pieces on board f2 can be the number of red pieces on board, etc. 21

A diagram of design choices In this class, you will become familiar with many of these choices, and even try them in practice. We would like to prepare you so that you can make good design choices when facing a new learning problem! 22