Machine Learning. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1393

Similar documents
CSL465/603 - Machine Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Basic Concepts of Machine Learning

Welcome to. ECML/PKDD 2004 Community meeting

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Axiom 2013 Team Description Paper

Python Machine Learning

Generative models and adversarial training

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Rule Learning With Negation: Issues Regarding Effectiveness

A Reinforcement Learning Variant for Control Scheduling

Laboratorio di Intelligenza Artificiale e Robotica

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Probabilistic Latent Semantic Analysis

CS Machine Learning

Reinforcement Learning by Comparing Immediate Reward

Laboratorio di Intelligenza Artificiale e Robotica

Lecture 1: Machine Learning Basics

Rule Learning with Negation: Issues Regarding Effectiveness

Evolutive Neural Net Fuzzy Filtering: Basic Description

Learning Methods for Fuzzy Systems

TD(λ) and Q-Learning Based Ludo Players

Time series prediction

Learning From the Past with Experiment Databases

An investigation of imitation learning algorithms for structured prediction

Reducing Features to Improve Bug Prediction

Word Segmentation of Off-line Handwritten Documents

Handling Concept Drifts Using Dynamic Selection of Classifiers

Semi-Supervised Face Detection

Australian Journal of Basic and Applied Sciences

A study of speaker adaptation for DNN-based speech synthesis

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Mining Association Rules in Student s Assessment Data

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

A survey of multi-view machine learning

Human Emotion Recognition From Speech

Georgetown University at TREC 2017 Dynamic Domain Track

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Lecture 10: Reinforcement Learning

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

A Case Study: News Classification Based on Term Frequency

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

A Comparison of Two Text Representations for Sentiment Analysis

Switchboard Language Model Improvement with Conversational Data from Gigaword

Discriminative Learning of Beam-Search Heuristics for Planning

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Interactive Whiteboard

Speeding Up Reinforcement Learning with Behavior Transfer

A Case-Based Approach To Imitation Learning in Robotic Agents

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Seminar - Organic Computing

Assignment 1: Predicting Amazon Review Ratings

Exploration. CS : Deep Reinforcement Learning Sergey Levine

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

FF+FPG: Guiding a Policy-Gradient Planner

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

stateorvalue to each variable in a given set. We use p(x = xjy = y) (or p(xjy) as a shorthand) to denote the probability that X = x given Y = y. We al

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Case Acquisition Strategies for Case-Based Reasoning in Real-Time Strategy Games

Intelligent Agent Technology in Command and Control Environment

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Top US Tech Talent for the Top China Tech Company

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

An OO Framework for building Intelligence and Learning properties in Software Agents

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

Disambiguation of Thai Personal Name from Online News Articles

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

Pp. 176{182 in Proceedings of The Second International Conference on Knowledge Discovery and Data Mining. Predictive Data Mining with Finite Mixtures

Problems of the Arabic OCR: New Attitudes

How do adults reason about their opponent? Typologies of players in a turn-taking game

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

MGT/MGP/MGB 261: Investment Analysis

Indian Institute of Technology, Kanpur

(Sub)Gradient Descent

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Transcription:

Machine Learning Introduction Hamid Beigy Sharif University of Technology Fall 1393 Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 1 / 15

Table of contents 1 What is machine learning? 2 Types of machine learning 3 Outline of course Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 2 / 15

What is machine learning? The field of machine learning is concerned with the question of how to construct computer programs that automatically improve with the experience. Definition A computer program is said to learn from training experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E. Example (Checkers learning problem) Class of task T : playing checkers. Performance measure P: percent of games won against opponents. Training experience E: playing practice game against itself. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 3 / 15

What is machine learning? (Cont.) Example (Handwriting recognition learning problem) Class of task T : recognizing and classifying handwritten words within images. Performance measure P: percent of words correctly classified. Training experience E: a database of handwritten words with given classifications. Example (Robot driving learning problem) Class of task T : driving a robot on the public highways using vision sensors. Performance measure P: average distance travelled before an error. Training experience E: a sequence of images and steering command recorded. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 4 / 15

Why we need machine learning? We need machine learning because 1 Tasks are too complex to program Tasks performed by animals/humans such as driving, speech recognition, image understanding, and etc. Tasks beyond human capabilities such as weather prediction, analysis of genomic data, web search engines, and etc. 2 Some tasks need adaptivity. When a program has been written down, it stays unchanged. In some tasks such as optical character recognition and speech recognition, we need the behavior to be adapted when new data arrives. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 5 / 15

Types of machine learning Machine learning algorithms based on the information provided to the learner can be classified into three main groups. 1 Supervised/predictive learning: The goal is to learn a mapping from inputs x to outputs y given the labeled set S = {(x 1, t 1 ), (x 2, t 2 ),..., (x N, t N )}. x k is called feature vector. When t i {1, 2,..., C}, the learning problem is called classification. When t i R, the problem is called regression. 2 Unsupervised/descriptive learning: The goal is to find interesting pattern in data S = {x 1, x 2,..., x N }. Unsupervised learning is arguably more typical of human and animal learning. 3 Reinforcement learning: Reinforcement learning is learning by interacting with an environment. A reinforcement learning agent learns from the consequences of its actions. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 6 / 15

Applications of machine learning 1 Supervised learning: Classification: Document classification and spam filtering. Image classification and handwritten recognition. Face detection and recognition. Regression: Predict stock market price. Predict temperature of a location. Predict the amount of PSA. 2 Unsupervised/descriptive learning: Discovering clusters. Discovering latent factors. Discovering graph structures (correlation of variables). Matrix completion (filling missing values). Collaborative filtering. Market-basket analysis (frequent item-set mining). 3 Reinforcement learning: Game playing. robot navigation. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 7 / 15

The need for probability theory A key concept in machine learning is uncertainty. Data comes from a process that is not completely known. This lack of knowledge is indicated by modeling the process as a random process. The process actually may be deterministic, but we don t have access to complete knowledge about it, we model it as random and we use the probability theory to analyze it. Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 8 / 15

Outline of course 1 Introduction to machine learning & probability theory 2 Supervised learning: Linear models for regression Classifiers based on Bayes decision theory Linear & Nonlinear models for classification Combining classifiers Evaluating classifiers Computational learning theory 3 Unsupervised/descriptive learning: Feature selection & Feature extraction/dimensionality reduction Clustering & clustering evaluation 4 Reinforcement learning: Reinforcement model & model-based learning Monte-carlo & Temporal difference methods 5 Advanced topics: Statistical learning theory Graphical models Deep & semi-supervised & Active & online learning Large scale machine learning Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 9 / 15

References Kevin P. Murphy, Machine Learning: A Probabilistic Perspective, MIT Press, 2012. Christopher M. Bishop, Pattern Recognition and Machine Learning, Springer, 2007. Tom M. Mitchell, Machine Learning, McGraw-Hill, 1997. Ethem Alpaydin, Introduction to Machine Learning, Second Edition, MIT Press, 2012. Trevor Hastie, Robert Tibshirani, and Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition, Springer, 2013. Csaba Szepesvari, Algorithms for Reinforcement Learning, Morgan & Claypool, 2010. Richard S. Sutton and Andrew G. Barto, Reinforcement Learning: An Introduction, MIT Press, 1998 Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 10 / 15

Relevant journals 1 IEEE Trans on Pattern Analysis and Machine Intelligence 2 Journal of Machine Learning Research 3 Pattern Recognition 4 Machine Learning 5 Neural Networks 6 Neural Computation 7 Neurocomputing 8 IEEE Trans. on Neural Networks and Learning Systems 9 Annuals of Statistics 10 Journal of the American Statistical Association 11 Pattern Recognition Letters 12 Artificial Intelligence 13 Data Mining and Knowledge Discovery 14 IEEE Transaction on Cybernetics (SMC-B) 15 IEEE Transaction on Knowledge and Data Engineering 16 Knowledge and Information Systems Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 11 / 15

Relevant conferences 1 Neural Information Processing Systems (NIPS) 2 International Conference on Machine Learning (ICML) 3 European Conference on Machine Learning (ECML) 4 Asian Conference on Machine Learning (ACML2013) 5 Conference on Learning Theory (COLT) 6 Algorithmic Learning Theory (ALT) 7 Conference on Uncertainty in Artificial Intelligence (UAI) 8 Practice of Knowledge Discovery in Databases (PKDD) 9 International Joint Conference on Artificial Intelligence (IJCAI) 10 IEEE International Conference on Data Mining series (ICDM) Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 12 / 15

Relevant packages and datasets 1 Packages: R http://www.r-project.org/ Weka http://www.cs.waikato.ac.nz/ml/weka/ RapidMiner http://rapidminer.com/ MOA http://moa.cs.waikato.ac.nz/ 2 Datasets: UCI Machine Learning Repository http://archive.ics.uci.edu/ml/ StatLib http://lib.stat.cmu.edu/datasets/ Delve http://www.cs.toronto.edu/~delve/data/datasets.html Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 13 / 15

Course evaluation Evaluation: Mid-term exam 6 1393/8/8 Final exam 6 Mid term + Final 8 for passing Quiz 1 Presentation 1 1393/8/8 Homework 2 Programming projects 2 Term project 1 Scribe notes 1 optional Course page: http://ce.sharif.edu/courses/93-94/1/ce717-1/ TAs : Aslan Mehrabi Sahba Ezami Mohammadali Bashiri Amir Keramatian Afrooz Vazifedan aslan.mehrabi@gmail.com sahba.ezami@gmail.com m.a.bashiri69@gmail.com amir.keramatian91@gmail.com af.vazifedan@gmail.com Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 14 / 15

Papers for seminars Hamid Beigy (Sharif University of Technology) Machine Learning Fall 1393 15 / 15