Machine Learning: CS 6375 Introduction. Instructor: Vibhav Gogate The University of Texas at Dallas

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

(Sub)Gradient Descent

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Machine Learning Basics

CSL465/603 - Machine Learning

Python Machine Learning

Data Structures and Algorithms

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Axiom 2013 Team Description Paper

Welcome to. ECML/PKDD 2004 Community meeting

CS Machine Learning

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Laboratorio di Intelligenza Artificiale e Robotica

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Lecture 10: Reinforcement Learning

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Foothill College Summer 2016

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

*In Ancient Greek: *In English: micro = small macro = large economia = management of the household or family

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Artificial Neural Networks written examination

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

DOCTOR OF PHILOSOPHY HANDBOOK

Time series prediction

Speech Emotion Recognition Using Support Vector Machine

Firms and Markets Saturdays Summer I 2014

Students Understanding of Graphical Vector Addition in One and Two Dimensions

Navigating the PhD Options in CMS

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

A Case Study: News Classification Based on Term Frequency

Laboratorio di Intelligenza Artificiale e Robotica

Probabilistic Latent Semantic Analysis

Syllabus Foundations of Finance Summer 2014 FINC-UB

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

Evaluation of Teach For America:

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

TUESDAYS/THURSDAYS, NOV. 11, 2014-FEB. 12, 2015 x COURSE NUMBER 6520 (1)

Math 96: Intermediate Algebra in Context

MGT/MGP/MGB 261: Investment Analysis

Model Ensemble for Click Prediction in Bing Search Ads

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Exploration. CS : Deep Reinforcement Learning Sergey Levine

CS 446: Machine Learning

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Course Content Concepts

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

Syllabus: CS 377 Communication and Ethical Issues in Computing 3 Credit Hours Prerequisite: CS 251, Data Structures Fall 2015

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 (click on Math My Way tab) Math My Way Instructors:

Intelligent Agents. Chapter 2. Chapter 2 1

Rule Learning With Negation: Issues Regarding Effectiveness

Human Emotion Recognition From Speech

Georgetown University at TREC 2017 Dynamic Domain Track

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE


Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

MTH 141 Calculus 1 Syllabus Spring 2017

CS 101 Computer Science I Fall Instructor Muller. Syllabus

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Course Syllabus p. 1. Introduction to Web Design AVT 217 Spring 2017 TTh 10:30-1:10, 1:30-4:10 Instructor: Shanshan Cui

LEGO MINDSTORMS Education EV3 Coding Activities

CS Course Missive

A Neural Network GUI Tested on Text-To-Phoneme Mapping

GEOG 473/573: Intermediate Geographic Information Systems Department of Geography Minnesota State University, Mankato

EGRHS Course Fair. Science & Math AP & IB Courses

FINN FINANCIAL MANAGEMENT Spring 2014

Australian Journal of Basic and Applied Sciences

WHEN THERE IS A mismatch between the acoustic

An Introductory Blackboard (elearn) Guide For Parents

A Context-Driven Use Case Creation Process for Specifying Automotive Driver Assistance Systems

CS/SE 3341 Spring 2012

A Reinforcement Learning Variant for Control Scheduling

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

CS 100: Principles of Computing

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

B. How to write a research paper

HIST 3300 HISTORIOGRAPHY & METHODS Kristine Wirts

Dinesh K. Sharma, Ph.D. Department of Management School of Business and Economics Fayetteville State University

Detailed course syllabus

Learning From the Past with Experiment Databases

Generative models and adversarial training

International Business BADM 455, Section 2 Spring 2008

Evolutive Neural Net Fuzzy Filtering: Basic Description

MYCIN. The MYCIN Task

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Nutrition 10 Contemporary Nutrition WINTER 2016

Transcription:

Machine Learning: CS 6375 Introduction Instructor: Vibhav Gogate The University of Texas at Dallas

Logistics Instructor: Vibhav Gogate Email: vgogate@hlt.utdallas.edu Office: ECSS 3.406 Office hours: M/W 12:45 p.m. to 1:45 p.m. TA: TBA Web: http://www.hlt.utdallas.edu/~vgogate/ml/2014f/index.ht ml Discussion Board Discussion board on Piazza. This will be the main on-line forum for discussing assignments and course material, and interacting with other students, TA and me. We will also post course-wide announcements on Piazza.

Evaluation Five homeworks (40%) 8% each Due two weeks later Some programming, some exercises Assigned via ELearning. One Project (10%) Your favorite problem (apply machine learning) One Midterm (20%), One Final (30%) Exams are closed book. You will be allowed a cheat sheet, a double-sided 8.5 x 11 page. Attendance is mandatory Grade reduced by a letter grade for lack of attendance (e.g., A- becomes B-; B- becomes a C-; etc)

Source Materials T. Mitchell, Machine Learning, McGraw-Hill C. Bishop, Pattern Recognition and Machine Learning, Springer Kevin Murphy, Machine Learning: A probabilistic perspective Class Notes/Slides

Why Study Machine Learning: A Few Quotes A breakthrough in machine learning would be worth ten Microsofts (Bill Gates, Microsoft) Machine learning is the next Internet (Tony Tether, Former Director, DARPA) Machine learning is the hot new thing (John Hennessy, President, Stanford) Web rankings today are mostly a matter of machine learning (Prabhakar Raghavan, Former Dir. Research, Yahoo) Machine learning is going to result in a real revolution (Greg Papadopoulos, CTO, Sun)

Traditional Programming Getting computers to program themselves Writing software is the bottleneck, let data do the work Requirements Data Human Program Computer Output Requirements and data change often Machine Learning Input Requirements Data Machine Learning Program Computer Output Input

Training Data Training Example Two Classes: {Yes,No}

Magic? No, more like gardening Seeds = Algorithms Nutrients = Data Gardener = You Plants = Programs

Definition: Machine Learning! T. Mitchell: Well posed machine learning Improving performance via experience Formally, A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, it its performance at tasks in T as measured by P, improves with experience. H. Simon Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task or tasks drawn from the same population more efficiently and more effectively the next time. The ability to perform a task in a situation which has never been encountered before (Learning = Generalization)

Example 1: A Chess learning problem Task T: playing chess Performance measure P: percent of games won against opponents Training Experience E: playing practice games against itself

Example 2: Autonomous Vehicle Problem Task T: driving on a public highway/roads using vision sensors Performance Measure P: percentage of time the vehicle is involved in an accident Training Experience E: a sequence of images and steering commands recorded while observing a human driver

When to use Machine Learning? Human expertise is absent Example: navigating on mars Humans are unable to explain their expertise Example: vision, speech, language Requirements and data change over time Example: Tracking, Biometrics, Personalized fingerprint recognition The problem or the data size is just too large Example: Web Search

Types of Learning Supervised (inductive) learning Training data includes desired outputs Unsupervised learning Training data does not include desired outputs Find hidden/interesting structure in data Semi-supervised learning Training data includes a few desired outputs Reinforcement learning the learner interacts with the world via actions and tries to find an optimal policy of behavior with respect to rewards it receives from the environment

Examples/Types of Machine Learning Tasks Forecasting or Prediction Stock price of Google tomorrow? Classification and Regression Is Ana credit-worthy? What is Ana s credit score? Ranking How to rank images that contain An awesome machine learning model? Outlier/Anomaly/Fraud detection Is it Ana using the credit card in Mexico or is it someone else? Finding patterns Almost 60% of shoppers buy Diapers and Milk together!

Machine Learning: Applications Examples of what you will study in class in action!

Classification Example: Spam Filtering Classify as Spam or Not Spam

Classification Example: Weather Prediction

Regression example: Predicting Gold/Stock prices Good ML can make you rich (but there is still some risk involved). Given historical data on Gold prices, predict tomorrow s price!

Similarity Determination

Collaborative Filtering The problem of collaborative filtering is to predict how well a user will like an item that he has not rated given a set of historical preference judgments for a community of users.

Collaborative Filtering

Collaborative Filtering

Clustering: Discover Structure in data

Machine learning has grown in leaps and bounds The main approach for Speech Recognition Robotics Natural Language Processing Computational Biology Sensor networks Computer Vision Web And so on Alice/Bob says: I know machine learning very well! Potential Employer: You are hired!!!

What We ll Cover Supervised learning: Decision tree induction, Rule induction, Instance-based learning, Bayesian learning, Neural networks, Support vector machines, Linear Regression, Model ensembles, Graphical models, Learning theory, etc. Unsupervised learning: Clustering, Dimensionality reduction Reinforcement learning: Markov Decision Processes, Q- learning, etc. General machine learning concepts and techniques: Feature selection, cross-validation, maximum likelihood estimation, gradient descent, expectation-maximization Your responsibility: Brush up on some important background Linear algebra, Statistics 101, Vectors, Probability theory