Computational Machine Learning

Similar documents
Python Machine Learning

(Sub)Gradient Descent

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

CSL465/603 - Machine Learning

Lecture 1: Machine Learning Basics

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Probabilistic Latent Semantic Analysis

Assignment 1: Predicting Amazon Review Ratings

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS Machine Learning

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Artificial Neural Networks written examination

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Lecture 1: Basic Concepts of Machine Learning

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

WHEN THERE IS A mismatch between the acoustic

Axiom 2013 Team Description Paper

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Learning Methods for Fuzzy Systems

Statistics and Data Analytics Minor

Stochastic Calculus for Finance I (46-944) Spring 2008 Syllabus

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

arxiv: v1 [cs.lg] 15 Jun 2015

arxiv: v2 [cs.cv] 30 Mar 2017

THE world surrounding us involves multiple modalities

Comment-based Multi-View Clustering of Web 2.0 Items

Speech Recognition at ICSI: Broadcast News and beyond

Knowledge Transfer in Deep Convolutional Neural Nets

Human Emotion Recognition From Speech

Math 96: Intermediate Algebra in Context

Welcome to. ECML/PKDD 2004 Community meeting

Learning From the Past with Experiment Databases

Australian Journal of Basic and Applied Sciences

Generative models and adversarial training

A Review: Speech Recognition with Deep Learning Methods

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

TRANSFER LEARNING IN MIR: SHARING LEARNED LATENT REPRESENTATIONS FOR MUSIC AUDIO CLASSIFICATION AND SIMILARITY

Modeling function word errors in DNN-HMM based LVCSR systems

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

Modeling function word errors in DNN-HMM based LVCSR systems

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Time series prediction

Word Segmentation of Off-line Handwritten Documents

A survey of multi-view machine learning

Automatic Speaker Recognition: Modelling, Feature Extraction and Effects of Clinical Environment

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

arxiv: v2 [cs.ir] 22 Aug 2016

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Calibration of Confidence Measures in Speech Recognition

Laboratorio di Intelligenza Artificiale e Robotica

MGT/MGP/MGB 261: Investment Analysis

UK Institutional Research Brief: Results of the 2012 National Survey of Student Engagement: A Comparison with Carnegie Peer Institutions

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Reducing Features to Improve Bug Prediction

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Methodology

Mathematics. Mathematics

Issues in the Mining of Heart Failure Datasets

Data Structures and Algorithms

Truth Inference in Crowdsourcing: Is the Problem Solved?

TD(λ) and Q-Learning Based Ludo Players

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

Math 181, Calculus I

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

A study of speaker adaptation for DNN-based speech synthesis

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

ECO 3101: Intermediate Microeconomics

Speech Emotion Recognition Using Support Vector Machine

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

COMPUTER SCIENCE GRADUATE STUDIES Course Descriptions by Research Area

Math Pathways Task Force Recommendations February Background

TextGraphs: Graph-based algorithms for Natural Language Processing

Indian Institute of Technology, Kanpur

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Rule Learning With Negation: Issues Regarding Effectiveness

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

BYLINE [Heng Ji, Computer Science Department, New York University,

Scottsdale Community College Spring 2016 CIS190 Intro to LANs CIS105 or permission of Instructor

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

Transcription:

Computational Machine Learning Zaid Harchaoui NYU Fall 2015 Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 1 / 40

Outline 1 Overview of machine learning Unsupervised learning Supervised learning Learning Machines Feature representation 2 Course logistics Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 2 / 40

Outline 1 Overview of machine learning Unsupervised learning Supervised learning Learning Machines Feature representation 2 Course logistics Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 3 / 40

Statistical learning : a tentative big picture Unsupervised learning (learning without a teacher) Find structure of x X, given observations x i, i = 1,..., n Supervised learning (learning with a teacher) Predict y Y from x X, given observations (x i, y i ), i = 1,..., n Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 4 / 40

Statistical learning : a tentative big picture Applications in many fields Computer vision Bioinformatics Audio/speech processing Text mining Computational astronomy etc. Interplays interplay between computer science and data science, towards ai interplay between theory, algorithms, and real-world applications Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 5 / 40

A blend of scientific disciplines A blend of scientific disciplines computer science : algorithmics, complexity, information theory data science : statistics, experimental design applied mathematics : matrix analysis, convex optimization, probability, Application in other scientific fields bioinformatics computer vision speech/audio processing neuroscience finance Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 6 / 40

Unsupervised learning Unsupervised learning Dimension reduction face images Zambian President Levy Mwanawasa has won a second term in office in an election his challenger Michael Sata accused him of rigging, official results showed on Monday. documents According to media reports, a pair of hackers said on Saturday that the Firefox Web browser, commonly perceived as the safer and more customizable alternative to market leader Internet Explorer, is critically flawed. A presentation on the flaw was shown during the ToorCon hacker conference in San Diego. gene expression data MEG readings Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 7 / 40

Unsupervised learning Unsupervised learning Dimension reduction Computational efficiency : space and time savings Statistical performance : fewer dimensions regularization Visualization : discover underlying structure of the data PCA and KPCA Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 8 / 40

Unsupervised learning Unsupervised learning Feature extraction z ϕ x (x) ϕ y (y) x: y: A view from Idyllwild, California, with pine trees and snow capped Marion Mountain under a blue sky. Learn kernelized projections that relate both spaces Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 9 / 40

Unsupervised learning Unsupervised learning Feature extraction Multimodality : leverage the correlation between the modalities Statistical performance : take advantage of both views of the data Putting in relation : discover underlying relations between the modalities CCA and KCCA Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 10 / 40

Unsupervised learning Unsupervised learning Clustering Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 11 / 40

Unsupervised learning Unsupervised learning Clustering Semantics : grouping datapoints in meaningful clusters Statistical performance : intrinsic degrees of freedom of the data Visualization : discover groupings between datapoints spectral clustering, temporal segmentation, and regularized clustering (DIFFRAC) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 12 / 40

Unsupervised learning Unsupervised learning Detection problems Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 13 / 40

Unsupervised learning Unsupervised learning Detection problems Balance risks : control detection rate with a guaranteed false alarm probability Power : detect differences not only in mean or covariance homogeneity testing, change detection Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 14 / 40

Supervised learning Supervised learning Human action recognition Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 15 / 40

Supervised learning Supervised learning Image classification and scene understanding Variance : high intra-class variability Structure : spatial and temporal structure Unknowns : unknown localization of the object of interest Kernel ridge regression, Kernel logistic regression, Support vector machine Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 16 / 40

Supervised learning Image categorization/classification Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 17 / 40

Supervised learning Image categorization/classification Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 18 / 40

Learning Machines Artificial Learning Machines (1940-1950s) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 19 / 40

Learning Machines Perceptron (Rosenblatt, 1957) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 20 / 40

Learning Machines Perceptron (Rosenblatt, 1957) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 21 / 40

Learning Machines Convolutional Neural Nets : origins Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 22 / 40

Learning Machines Neocognitron (Fukushima, 1980) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 23 / 40

Learning Machines Convolutional Neural Net (LeCun, 1988) Figure : Picture from (LeCun, 1998) CNNs perform simple operations such as convolutions, point-wise non-linearities and subsampling. for most successful applications of CNNs, training is supervised. Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 24 / 40

Feature representation Visual recognition : traditional approach Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 25 / 40

Feature representation Visual recognition : traditional features Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 26 / 40

Feature representation Parallel approaches in vision, speech, and NLP Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 27 / 40

Feature representation Challenges in vision, speech, and NLP Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 28 / 40

Feature representation Recent popular strategy : learning all the way through Recent popular strategy : learning all the way through 1 Low-level feature representation is learnt from raw data 2 Mid-level feature representation is learnt from data 3 High-level feature representation should be learned from data 4 Downstream learning classifier is learnt from data Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 29 / 40

Course logistics Outline 1 Overview of machine learning Unsupervised learning Supervised learning Learning Machines Feature representation 2 Course logistics Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 30 / 40

Course logistics Computational Machine Learning, CSCI-GA-3033-012 Class webpage : http://cs.nyu.edu/courses/fall15/csci-ga.3033-012/ Syllabus on the website Piazza : https://piazza.com/nyu/fall2015/csciga3033012/ Ask questions here Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 31 / 40

Course logistics Evaluation About 4 to 5 homeworks Midterm Exam Project Extra Credit Opportunities Machine learning flashcards Optional problems or competitions on the homework Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 32 / 40

Course logistics Homework First assignment out after the 3rd course Due Fri. Sep. 25th noon Late homework : Accepted with a 10% penalty per hour late. Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 33 / 40

Course logistics Homework First assignment out after the 3rd course Due Fri. Sep. 25th noon Late homework : Accepted with a 10% penalty per hour late. Collaboration is fine, but Write up solutions and code on your own List names of who you talked to about each problem Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 33 / 40

Course logistics Homework First assignment out after the 3rd course Due Fri. Sep. 25th noon Late homework : Accepted with a 10% penalty per hour late. Collaboration is fine, but Write up solutions and code on your own List names of who you talked to about each problem Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 33 / 40

Course logistics Midterm In class during lecture Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 34 / 40

Course logistics Projects Find some new data or new approach to old data Project philosophy the same as in these courses : http://cs.nyu.edu/~dsontag/courses/ml14/assignments/ projects.html http://web.stanford.edu/class/cs221/project.html Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 35 / 40

Course logistics Projects Find some new data or new approach to old data Project philosophy the same as in these courses : http://cs.nyu.edu/~dsontag/courses/ml14/assignments/ projects.html http://web.stanford.edu/class/cs221/project.html Logistics : 2 students per group or Start meeting asap with instructor to discuss potential ideas for project Project proposal due on Week 6 Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 35 / 40

Course logistics Prerequisites Introduction to Data Science (DS-GA 1001) or equivalent data science-ish course Math Multivariate Calculus Linear Algebra Probability Theory Statistics Python programming (numpy) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 36 / 40

Course logistics General Philosophy Mastery vs Performance (understanding vs getting the grade ) Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 37 / 40

Course logistics General Philosophy Mastery vs Performance (understanding vs getting the grade ) Don t confuse kind of understanding with actual understanding Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 37 / 40

Course logistics General Philosophy Mastery vs Performance (understanding vs getting the grade ) Don t confuse kind of understanding with actual understanding Can you explain this picture? Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 37 / 40

Course logistics Course Topics Week 1 : Overview of Machine Learning Week 2 : Perceptron, Neocognitron, Performance metrics, Capacity, Regularization Week 3 : Linear Predictors, Ridge Regression, Logistic Regression, Linear Support Vector Machines, Boosting Week 4 : Convex Learning Problems, Stochastic Gradient Descent Week 5 : Faster Stochastic Gradient Descent, Model Selection, Validation Week 6 : Kernel-based Methods, Boosting Week 7 : Decision Trees, Random Forests, Ensemble Methods Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 38 / 40

Course logistics Course Topics Week 8 : Deep Neural Networks, I Week 9 : Quantization, Clustering, Compression Week 10 : Dimensionality Reduction Week 11 : Latent Variable Models Week 12 : Feature Selection. Feature Design Week 13 : Deep Neural Networks, II Week 14 : The Art of Machine Learning Modelling Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 39 / 40

Course logistics Questions? What are you looking to get out of the course? Questions for me? Zaid Harchaoui (NYU) CSCI-GA-3033-012 Fall 2015 40 / 40