Welcome to CMPS 142: Machine Learning. Administrivia. Lecture Slides for. Instructor: David Helmbold,

Similar documents
Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

CSL465/603 - Machine Learning

Python Machine Learning

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

(Sub)Gradient Descent

Rule Learning With Negation: Issues Regarding Effectiveness

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Active Learning. Yingyu Liang Computer Sciences 760 Fall

CS Machine Learning

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Rule Learning with Negation: Issues Regarding Effectiveness

Generative models and adversarial training

Human Emotion Recognition From Speech

Lecture 1: Basic Concepts of Machine Learning

Semi-Supervised Face Detection

Welcome to. ECML/PKDD 2004 Community meeting

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Learning From the Past with Experiment Databases

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Reducing Features to Improve Bug Prediction

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Top US Tech Talent for the Top China Tech Company

Evolutive Neural Net Fuzzy Filtering: Basic Description

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Applications of data mining algorithms to analysis of medical data

Computerized Adaptive Psychological Testing A Personalisation Perspective

Artificial Neural Networks written examination

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Handling Concept Drifts Using Dynamic Selection of Classifiers

CS 446: Machine Learning

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Word Segmentation of Off-line Handwritten Documents

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Mining Association Rules in Student s Assessment Data

Assignment 1: Predicting Amazon Review Ratings

Probabilistic Latent Semantic Analysis

A study of speaker adaptation for DNN-based speech synthesis

Time series prediction

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

arxiv: v2 [cs.cv] 30 Mar 2017

Softprop: Softmax Neural Network Backpropagation Learning

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Speech Emotion Recognition Using Support Vector Machine

Multi-label Classification via Multi-target Regression on Data Streams

arxiv: v1 [cs.lg] 15 Jun 2015

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

Australian Journal of Basic and Applied Sciences

Learning Methods in Multilingual Speech Recognition

A survey of multi-view machine learning

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Axiom 2013 Team Description Paper

Learning Methods for Fuzzy Systems

Exposé for a Master s Thesis

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Multi-label classification via multi-target regression on data streams

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

A Neural Network GUI Tested on Text-To-Phoneme Mapping

An investigation of imitation learning algorithms for structured prediction

Abstractions and the Brain

Calibration of Confidence Measures in Speech Recognition

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Switchboard Language Model Improvement with Conversational Data from Gigaword

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Transfer Learning with Applications

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Universidade do Minho Escola de Engenharia

COBRA: A Fast and Simple Method for Active Clustering with Pairwise Constraints

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Mining Student Evolution Using Associative Classification and Clustering

Discriminative Learning of Beam-Search Heuristics for Planning

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

Linking Task: Identifying authors and book titles in verbose queries

Multivariate k-nearest Neighbor Regression for Time Series data -

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

TD(λ) and Q-Learning Based Ludo Players

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Learning Rules from Incomplete Examples via Implicit Mention Models

The Boosting Approach to Machine Learning An Overview

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Using Web Searches on Important Words to Create Background Sets for LSI Classification

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

A Reinforcement Learning Variant for Control Scheduling

Issues in the Mining of Heart Failure Datasets

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Probability and Statistics Curriculum Pacing Guide

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Transcription:

Welcome to CMPS 142: Machine Learning Instructor: David Helmbold, dph@soe.ucsc.edu Web page: www.soe.ucsc.edu/classes/cmps142/winter07/ Text: Introduction to Machine Learning, Alpaydin Administrivia Sign up sheet (enrollment) Evaluation: Homework 10-25% Late midterm 50% Projects (group?) 25-40% Pictures Expectations/Style Reading assignments Attendance/participation My hearing/writing Academic honesty Topics: Introduction (ch1 and 2) Feature selection/measuring accuracy (ch 6?) Bayesian learning and parameter estimation (ch 3, 4, 5) Instance based learning (nearest neighbor) (ch 8) Decision Trees (ch 9) Linear Discrimination (ch 10), SVMs, and Perceptron algorithm Neural networks (ch 11) Boosting (AdaBoost) (ch 15) Clustering, EM Algorithm and K- means (ch 7) On-line prediction (Blum survey) 1 2 Lecture Slides for INTRODUCTION TO Machine Learning CHAPTER 1: Introduction ETHEM ALPAYDIN The MIT Press, 2004 (modified by DPH, fall 2006) alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml 1

Why Learn? Machine learning is programming computers to optimize a performance criterion using example data or past experience (inference in statistics) There is no need to learn to calculate payroll Learning is used when: Human expertise does not exist (navigating on Mars), Humans are unable to explain their expertise (speech recognition) Solution changes in time (routing on a computer network) Solution needs to be adapted or customized to particular cases (or users) What We Talk About When We Talk About Learning Learning general models from a set of particular examples Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce. Example in retail: Customer transactions to consumer behavior: People who bought Da Vinci Code also bought The Five People You Meet in Heaven (www.amazon.com) Build a model that is a good and useful approximation to the data. 5 6 What is Machine Learning? Stat. Machine learning is not: Optimize a performance criterion using example data or past experience. Role of Statistics: Inference from a sample Role of Computer science: Efficient algorithms to Solve the optimization problem Representing and evaluating the model for inference Cognitive science (how people think/learn) Teaching computers to think But is related to: Statistics Data Mining - KDD Control theory part of AI, but not traditional AI 7 8 2

Data Mining Supervised Batch Learning Retail: Market basket analysis, Customer relationship management (clustering) Finance: Credit scoring, fraud detection Manufacturing: Optimization, troubleshooting Medicine: Medical diagnosis Telecommunications: Quality of service optimization Bioinformatics: Motifs, alignment, protein structure Web mining: Search engines... Assume distribution over things Get instances by drawing things from distribution and recording observations. Teacher labels instances making examples Or (x, y) (x,r) Set of labeled examples is the training set or sample Create hypothesis (rule) from sample hypothesis predicts on new random instances, scored by loss function 9 10 Learning Framework Supervised Learning (cont.) P(x,r) Training set Learning algorithm Test point hypothesis Goal: find hypothesis with small loss (x,r) x Prediction ŷ r Loss function Classification: labels are nominal (unordered set, e.g. {ham, spam} {democrat, republican, indep.}) Binary Classification Regression: labels are numeric (e.g. price of used car) Sometimes labels are probabilities r, y, ŷ? L(ŷ, r) 11 12 3

Examples Face Recognition Thing Observations Prediction Training examples of a person Written Digit Pixel array Which digit? Email message Words, Subject, sender Ham or Spam? Test images Customer Recent purchase interest level in a new product Used car Year, make, mpg, options Price or value 13 AT&T Laboratories, Cambridge UK http://www.uk.research.att.com/facedatabase.html 14 Regression Supervised Learning: Uses Example: Price of a used car x : car attributes y : price y = g (x θ ) g ( ) model (e.g. linear) θ parameters (w, w 0 ) y = wx+w 0 Prediction of future cases: Use the rule to predict the output for future inputs Knowledge extraction: The rule is easy to understand Compression: The rule is simpler than the data it explains Outlier detection: Exceptions that are not covered by the rule, e.g., fraud and data entry errors 15 16 4

Other kinds of supervised learning Unsupervised Learning Reinforcement learning - learning a policy for influencing or reacting to environment No supervised output, but delayed rewards Credit assignment problem Game playing/robot in a maze, etc. On-line learning: predict on each instance in turn Semi-supervised learning uses both labeled and unlabeled data Learning what normally happens No labels Clustering: Grouping similar instances Example applications Segmentation in customer relationship mgmt Image compression: Color quantization Bioinformatics: Learning motifs Identifying unusual Airplane landings 17 18 Resources: Datasets Resources: Journals UCI Repository: http://www.ics.uci.edu/~mlearn/mlrepository.html UCI KDD Archive: http://kdd.ics.uci.edu/summary.data.application.html Statlib: http://lib.stat.cmu.edu/ Delve: http://www.cs.utoronto.ca/~delve/ 19 Journal of Machine Learning Research www.jmlr.org Machine Learning Neural Computation Neural Networks IEEE Transactions on Neural Networks IEEE Transactions on Pattern Analysis and Machine Intelligence Annals of Statistics Journal of the American Statistical Association... 20 5

Resources: Conferences International Conference on Machine Learning (ICML) ICML05: http://icml.ais.fraunhofer.de/ European Conference on Machine Learning (ECML) ECML05: http://ecmlpkdd05.liacc.up.pt/ Neural Information Processing Systems (NIPS) NIPS05: http://nips.cc/ Uncertainty in Artificial Intelligence (UAI) UAI05: http://www.cs.toronto.edu/uai2005/ Computational Learning Theory (COLT) COLT05: http://learningtheory.org/colt2005/ International Joint Conference on Artificial Intelligence (IJCAI) IJCAI05: http://ijcai05.csd.abdn.ac.uk/ International Conference on Neural Networks (Europe) ICANN05: http://www.ibspan.waw.pl/icann-2005/... 21 6