CS798: Selected topics in Machine Learning

Similar documents
CSL465/603 - Machine Learning

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

Lecture 1: Machine Learning Basics

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Lecture 1: Basic Concepts of Machine Learning

Welcome to. ECML/PKDD 2004 Community meeting

Probabilistic Latent Semantic Analysis

Semi-Supervised Face Detection

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Generative models and adversarial training

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Assignment 1: Predicting Amazon Review Ratings

Rule Learning With Negation: Issues Regarding Effectiveness

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

CS 446: Machine Learning

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

A study of speaker adaptation for DNN-based speech synthesis

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

A survey of multi-view machine learning

Rule Learning with Negation: Issues Regarding Effectiveness

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Learning From the Past with Experiment Databases

Human Emotion Recognition From Speech

arxiv: v1 [cs.lg] 15 Jun 2015

Speech Emotion Recognition Using Support Vector Machine

Knowledge Transfer in Deep Convolutional Neural Nets

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Handling Concept Drifts Using Dynamic Selection of Classifiers

Axiom 2013 Team Description Paper

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Learning Methods for Fuzzy Systems

An investigation of imitation learning algorithms for structured prediction

Word Segmentation of Off-line Handwritten Documents

Laboratorio di Intelligenza Artificiale e Robotica

(Sub)Gradient Descent

Softprop: Softmax Neural Network Backpropagation Learning

Laboratorio di Intelligenza Artificiale e Robotica

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Speech Recognition at ICSI: Broadcast News and beyond

Time series prediction

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

MGT/MGP/MGB 261: Investment Analysis

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Evolutive Neural Net Fuzzy Filtering: Basic Description

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Evolution of Symbolisation in Chimpanzees and Neural Nets

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Australian Journal of Basic and Applied Sciences

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Reinforcement Learning by Comparing Immediate Reward

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Modeling function word errors in DNN-HMM based LVCSR systems

A Survey on Unsupervised Machine Learning Algorithms for Automation, Classification and Maintenance

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

A Case Study: News Classification Based on Term Frequency

Calibration of Confidence Measures in Speech Recognition

Artificial Neural Networks written examination

Firms and Markets Saturdays Summer I 2014

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Pp. 176{182 in Proceedings of The Second International Conference on Knowledge Discovery and Data Mining. Predictive Data Mining with Finite Mixtures

Mining Association Rules in Student s Assessment Data

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

CS Machine Learning

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Switchboard Language Model Improvement with Conversational Data from Gigaword

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

Reducing Features to Improve Bug Prediction

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Transfer Learning with Applications

COBRA: A Fast and Simple Method for Active Clustering with Pairwise Constraints

Using Web Searches on Important Words to Create Background Sets for LSI Classification

PROVIDENCE UNIVERSITY COLLEGE

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

A Reinforcement Learning Variant for Control Scheduling

A Case-Based Approach To Imitation Learning in Robotic Agents

COMPUTER-AIDED DESIGN TOOLS THAT ADAPT

A student diagnosing and evaluation system for laboratory-based academic exercises

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Computerized Adaptive Psychological Testing A Personalisation Perspective

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Room: Office Hours: T 9:00-12:00. Seminar: Comparative Qualitative and Mixed Methods

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Georgetown University at TREC 2017 Dynamic Domain Track

School of Innovative Technologies and Engineering

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Model Ensemble for Click Prediction in Bing Search Ads

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Transcription:

CS798: Selected topics in Machine Learning Introduction Jakramate Bootkrajang Department of Computer Science Chiang Mai University Jakramate Bootkrajang CS798: Selected topics in Machine Learning 1 / 22

About the course CS789: Machine learning (and optimisation) Lecturer: Jakramate Bootkrajang Email: jakramateb@cmuacth Office hour: I am almost always at my desk, just walk in Grading: 40% homework, 30% midterm, 30% final Programming language: MATLAB, Scilab, Julia Background: Linear algebra, calculus, basic probability Jakramate Bootkrajang CS798: Selected topics in Machine Learning 2 / 22

Course outline Supervised learning methods Bayes classifier Normal discriminant analysis Logistic regression Support Vector Machine + kernel method Combining multiple classifiers: boosting, bagging Classifier evaluation Regularisation Unsupervised learning method Mixture model and EM algorithm Clustering Introduction to learning theory Learnability, PAC learning Hypothesis space, Bias, Variance Jakramate Bootkrajang CS798: Selected topics in Machine Learning 3 / 22

Human learning What is learning? In general, learning is the act of acquiring new knowledge, or modifying and reinforcing, existing knowledge and may involve synthesizing different types of information (Wikipedia) Why do we learn? Short term: Be able to do things Longer term: Money Fame Happiness How did we do? How well did we do those things How much money do we have? How happy are we? Jakramate Bootkrajang CS798: Selected topics in Machine Learning 4 / 22

Machine learning What is [machine] learning? Mathematical modelling of nature and adjusting the parameters of the model using example data or past experience (Function estimation) In some sense, it is the estimation of a function f(x) where x is the representation of each data point Why does machine learn? Short term: Be able to do some specific tasks, for example, classification or clustering Longer term: too difficult This is called learning objective How well does it perform? The machine is expected to perform well according to performance criterion For classification: right/wrong predictions For clustering: compactness of the clusters found Jakramate Bootkrajang CS798: Selected topics in Machine Learning 5 / 22

When to learn Learning is used when: Humans are unable to explain their expertise (speech recognition) Human expertise does not exist (Fraud detection) Solution changes in time (online learning, objective function changes) Otherwise, learning might not be necessary (but still possible) For example, to convert kilometre to mile Jakramate Bootkrajang CS798: Selected topics in Machine Learning 6 / 22

Steps in machine learning 1 Know what you want to do Know your data (Objective) 2 Build a model that is a good and useful approximation to the data (Modelling) 3 Devise an algorithm to learn the model: how to adjust model s parameters (Learning) 4 Test your model using existing data or new unseen data (Performance measure) 5 Theoretically show that your model will work on any new data of the same kind (Performance measure, in general) Jakramate Bootkrajang CS798: Selected topics in Machine Learning 7 / 22

Machine learning and related fields Machine learning: Focuses on modelling, learning and performance measure in general Pattern recognition: Sub-field of machine learning focuses on classification tasks Data mining: Focuses on objective and performance measure Optimisation: Focuses on learning Supporting fields: Mathematics, Statistics Jakramate Bootkrajang CS798: Selected topics in Machine Learning 8 / 22

Definition and Terminology Example : item, instance of the data used Features : attributes associated to an item, often represented as a vector (eg, word counts) Labels : category (for classification) or real value (regression) associated to an item Data : training data (labelled) test data (labelled but hidden from the learning machine) validation data (labelled, for tuning parameters) Jakramate Bootkrajang CS798: Selected topics in Machine Learning 9 / 22

Paradigms of machine learning Supervised learning: the task of inferring a function from labelled data (classification and regression) Unsupervised learning: the task of inferring a function from unlabelled data to describe hidden structure (clustering) Semi-supervised learning: inferring a function from labelled and unlabelled data Reinforcement learning: the task of inferring a function from interaction with the world based on awards and penalty Correct input/output pairs are never presented (Robot control) Jakramate Bootkrajang CS798: Selected topics in Machine Learning 10 / 22

Classification Credit scoring Differentiating beween low-risk and high-risk customers from their income and savings Discriminant IF income > θ 1 AND savings > θ 2 THEN low-risk ELSE high risk Jakramate Bootkrajang CS798: Selected topics in Machine Learning 11 / 22

Classification: applications Also known as: pattern recognition Face recognition: Pose, lighting, occlusion (glasses, beard), make-up, hair style Character recognition: Different handwriting styles Speech recognition: Temporal dependency Medical diagnosis: From symptoms to illnesses Jakramate Bootkrajang CS798: Selected topics in Machine Learning 12 / 22

Face recognition Jakramate Bootkrajang CS798: Selected topics in Machine Learning 13 / 22

Regression Price of a used car y = g(x θ), x: car attributes, y: price g() is the model and θ is model s parameter For navigating a car: Angle of the steering wheel as output (CMU NavLab) Jakramate Bootkrajang CS798: Selected topics in Machine Learning 14 / 22

Unsupervised learning Learning what normally happens Clustering: Grouping similar instances Example applications Customer segmentation in CRM Image compression: Color quantization Jakramate Bootkrajang CS798: Selected topics in Machine Learning 15 / 22

Semi-supervised learning Learning from small set of labelled data + large amount of unlabelled data Focus on how to make use of unlabelled data to improve the performance Possible applications are very similar to supervised learning Example: Lanna OCR Jakramate Bootkrajang CS798: Selected topics in Machine Learning 16 / 22

Reinforcement learning Learning a policy: A sequence of outputs (actions) Reward good behaviour Punish bad behaviour Many control related applications Game playing Robot in a maze Some demo Jakramate Bootkrajang CS798: Selected topics in Machine Learning 17 / 22

Resources: Books Foundation of machine learning, Mehryar Mohri, Afshin Rostamizadeh and Ameet Talwalkar, The MIT Press Pattern classification, Richard Duda, Peter Hart, David Storck Wiley-Interscience Understanding machine learning, Shai Shalev-Shwartz, Shai Ben-David, Cambridge University Press Machine learning, Tom Mitchell, McGraw-Hill Education Pattern recognition and machine learning, Christopher Bishop, Springer Jakramate Bootkrajang CS798: Selected topics in Machine Learning 18 / 22

Resources: Datasets UCI Repository: http://archiveicsuciedu/ml/ UCI KDD Archive: https://kddicsuciedu/ Delve: http://wwwcstorontoedu/ delve/data/datasetshtml Pascal Large Scale Learning Challenge: http://largescalemltu-berlinde/about/ ImageNet: http://image-netorg/index Jakramate Bootkrajang CS798: Selected topics in Machine Learning 19 / 22

Prominant figures Michael I Jordan (UC Berkeley) (Bayesian inference) Bernhard Schölkopf (TU Berlin) (Kernel methods) Chris Williams (University of Edinburgh) (Gaussian process) John Shawe-Taylor (UCL) (Theory) Andrew Ng (Stanford) Geoffrey Hinton (University of Toronto) (Deep learning) Carl Edward Rasmussen, Christopher M Bishop, Zoubin Ghahramani, Terrence Sejnowski Jakramate Bootkrajang CS798: Selected topics in Machine Learning 20 / 22

Journals Journal of Machine Learning Research (JMLR) Machine Learning Neural Computation Neural Networks IEEE Transactions on Neural Networks IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) Pattern Recognition Neurocomputing Jakramate Bootkrajang CS798: Selected topics in Machine Learning 21 / 22

Conferences International Conference on Machine Learning (ICML) European Conference on Machine Learning (ECML) Neural Information Processing Systems (NIPS) Uncertainty in Artificial Intelligence (UAI) Computational Learning Theory (COLT) International Joint Conference on Artificial Intelligence (IJCAI) International Conference on Neural Networks (Europe) Jakramate Bootkrajang CS798: Selected topics in Machine Learning 22 / 22