Big Data. Making sense of signals (RGB-D video): Hand Tracking from MSR Cambridge

Similar documents
Lecture 1: Machine Learning Basics

(Sub)Gradient Descent

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

CSL465/603 - Machine Learning

Probabilistic Latent Semantic Analysis

Generative models and adversarial training

CS Machine Learning

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Speech Emotion Recognition Using Support Vector Machine

The Strong Minimalist Thesis and Bounded Optimality

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica

Axiom 2013 Team Description Paper

Lecture 1: Basic Concepts of Machine Learning

STA 225: Introductory Statistics (CT)

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Seminar - Organic Computing

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Senior Project Information

BMBF Project ROBUKOM: Robust Communication Networks

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Speech Recognition at ICSI: Broadcast News and beyond

Exposé for a Master s Thesis

EGRHS Course Fair. Science & Math AP & IB Courses

Semi-Supervised Face Detection

INPE São José dos Campos

Artificial Neural Networks written examination

Xinyu Tang. Education. Research Interests. Honors and Awards. Professional Experience

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

A Case Study: News Classification Based on Term Frequency

Word Segmentation of Off-line Handwritten Documents

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

A study of speaker adaptation for DNN-based speech synthesis

Lecture 10: Reinforcement Learning

Cal s Dinner Card Deals

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Biology 10 - Introduction to the Principles of Biology Spring 2017

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Ecole Polytechnique Fédérale de Lausanne EPFL School of Computer and Communication Sciences IC. School of Computer and Communication Sciences

Analysis of Enzyme Kinetic Data

A Case-Based Approach To Imitation Learning in Robotic Agents

Evaluation of Teach For America:

Multivariate k-nearest Neighbor Regression for Time Series data -

Active Learning. Yingyu Liang Computer Sciences 760 Fall

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Exploration. CS : Deep Reinforcement Learning Sergey Levine

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Learning From the Past with Experiment Databases

Self Study Report Computer Science

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

SARDNET: A Self-Organizing Feature Map for Sequences

Knowledge-Based - Systems

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Analysis: Evaluation: Knowledge: Comprehension: Synthesis: Application:

arxiv: v2 [cs.cv] 30 Mar 2017

Time series prediction

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

Pp. 176{182 in Proceedings of The Second International Conference on Knowledge Discovery and Data Mining. Predictive Data Mining with Finite Mixtures

Decision Analysis. Decision-Making Problem. Decision Analysis. Part 1 Decision Analysis and Decision Tables. Decision Analysis, Part 1

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

ECE-492 SENIOR ADVANCED DESIGN PROJECT

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Evolutive Neural Net Fuzzy Filtering: Basic Description

Unsupervised Acoustic Model Training for Simultaneous Lecture Translation in Incremental and Batch Mode

Issues in the Mining of Heart Failure Datasets

Commanding Officer Decision Superiority: The Role of Technology and the Decision Maker

Human Emotion Recognition From Speech

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Welcome to. ECML/PKDD 2004 Community meeting

Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition

WHEN THERE IS A mismatch between the acoustic

Action Recognition and Video

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Statewide Framework Document for:

Probability and Statistics Curriculum Pacing Guide

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

CSC200: Lecture 4. Allan Borodin

Using Web Searches on Important Words to Create Background Sets for LSI Classification

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

Honors Mathematics. Introduction and Definition of Honors Mathematics

Reducing Features to Improve Bug Prediction

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Interactive Whiteboard

Model Ensemble for Click Prediction in Bing Search Ads

A Model of Knower-Level Behavior in Number Concept Development

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

Corrective Feedback and Persistent Learning for Information Extraction

EXPO MILANO CALL Best Sustainable Development Practices for Food Security

Transcription:

Big Data DD2434 Machine Learning, Advanced Course Lecture 1: Introduction Hedvig Kjellström hedvig@kth.se https://www.kth.se/social/course/dd2434/ Making sense of signals (RGB-D video): Hand Tracking from MSR Cambridge Predicting future events knowing the history: Botten Ada from Linköping U https://www.youtube.com/watch?v=a-xxrmpohyc http://bottenada.se

Learning to see suddle patterns in huge amounts of data: Cancer Therapy based on DNA Sequencing from IBM https://www.youtube.com/watch?v=0m1dmdc1mq0 Today Check the homepage at least 2 times / week! Or set it to send you emails! Course preliminaries All info at https://www.kth.se/social/course/dd2434/ Ask questions through the News forum! Buy the book by Kevin Murphy: The three teachers Jens Lagergren Carl Henrik Hedvig Kjellström Introduction to Machine Learning Murphy Chapter 1 Learning outcomes Course Preliminaries Upon completion of the course, the student should be able to 1. explain, derive, and implement a number of models for supervised, unsupervised learning, 2. explain how various models and algorithms relate to one another, 3. describe the strengths and weaknesses of various models and algorithms, 4. select an appropriate model or approach for a new machine learning task.

Course organization Assignments, detailed schedule with reading, etc, on the homepage 1: Graphical models Jens Lagergren Nov 25 The Three Teachers Dec 3 2: Representations Carl Henrik Dec 15 3: Applied ML Hedvig Kjellström Dec 18 Project Jens Lagergren Professor of Computer Science at / Science for Life Laboratory Research area: Bio-informatics Responsible for Lectures 2-5 Practicals 1-2 Assignment 1 Royal Institute of Technology Computational Biology Machine Learning a main tool Jens Lagergren

Rates Sequences δ,µ, Probabilistic model Sequence evolution ACTA GA : : : : AGTA GT SPECIES TREE WITH TIME! Gene tree REALIZATION DLRS (Delirious) DISCOVERING GRAPH STRUCTURE Lecture 6-8, Assignment 2, Practical 3-4 Carl Henrik {chek}@csc.kth.se Royal Institute of Technology November 3, 2014 DIRECTED GRAPHICAL MODELS

My Research Representation Learning I multi-view representations I correspondence/alignment learning Non-parametric methods I Gaussian Processes Structural representations Applications I Animal welfare I Motion modelling I Computational Biology Lectures Lectures Theme How can I incorporate my knowledge/belief with observations such that data reduces my uncertainty? 6 Basic modelling I Likelihood, Prior & Posterior I Kernels 7 Non-parametric modelling I function uncertainty 8 Representation Learning I Pattern discovery 9 Hierarchical modelling I layered structures

Assignment Assignment Regression Regression f : Y! X f : Y! X Three parts 1. Building models 2. Learning models 3. Evaluating models Three parts 1. Building models 2. Learning models 3. Evaluating models MSc Thesis Work Practicals Start January/February Related to my research Associated with CVAP More on lecture 9 My best friend the Gaussian Example of Conjugate priors Multiplication Marginalisation Derivatives of Matrices

Hedvig Kjellström Practicals Associate Professor of Computer Science at CSC / CVAP Research area: Robotics and Computer Vision Learning: Tools of the trade How to fit models to data Beyond ML & MAP Variational Approximation Responsible for Entire course Lectures 1, 10-12 Practical 5 Assignment 3 Hedvig Kjellström: My research Hedvig Kjellström: My part of the course Machine learning applied to Robotics and Computer Vision: Automatic perception of human activity in video Course block 3: Applied Machine Learning Object affordances, object-action complexes automatic understanding of how objects are used in human activities what happens to them during the activity Multi-modality and context in activity recognition using several modalities vision, sound, touch etc to better understand human activity Human non-verbal communication automatic understanding and modeling of non-verbal signals face expressions, body motion both conscious and unconscious See my webpage for master project proposals! Topic models Just one out of many methods, but important Chosen to complement the methods covered in DD2427 Image Based Recognition and Classification, DD2431 Machine Learning, EN2202 Pattern Recognition Practical Machine Learning What happens to the performance when data is noisy and incomplete?

Introduction to Machine Learning Uncertainty Basic philosophy: Data (observations) noisy and incomplete i.e. uncertain Decision making (prediction, classification, detection, estimation) under uncertainty Uncertainty is best modeled with probability theory Common division: Supervised / Unsupervised Supervised/Predictive Learning Data (training set): Task: Learn mapping D = {(x i,y i )} N i=1 features/attributes x response variable 5 min: Discuss with your neighbor Give at least three examples of supervised learning problems What is x and y in each problem? What does the mapping look like (linear/non-linear, one-toone/many-to-many, smooth/noisy)? y Supervised/Predictive Learning Functional approximation: Use D Classification: to learn an approximative function is discrete and finite Probabilistic formulation: Model,, etc Best most probable : ŷ = ˆf(x) = arg y = f(x) y 2 {1,...,C} p(y =1 x, D) p(y =2 x, D) y unknown true function ŷ = ˆf(x) y max C p(y = c x, D) c=1

Unsupervised/Descriptive Learning Unsupervised/Descriptive Learning Data (training set): Task: discover patterns in Under-specified problem what patterns? How measure error? 5 min: Discuss with your neighbor Give at least three examples of unsupervised learning problems x What is in each problem? What kind of patterns are found? What is the purpose? D = {x i } N i=1 D Probabilistic formulation: Density estimation Models of the form D xi p(x i ) Use to maximize the probability of seeing each given the model New obstacles: Multivariate distributions p(x i ) Unsupervised learning is more similar to how humans and animals learn! Practical advantage: No labeling of data required! Basic concept: Parametric vs Non-Parametric Basic concept: Curse of Dimensionality Models p(x) and p(y x) Parametric: Number of parameters constant with more data E.g., linear classifier Non-parametric: Number of parameters grows with more data E.g., knn classifier 2D 3D 8D = = 4 2 2 2 3 = 4 3 2 8 24 cube/sphere cube/sphere cube/sphere Adressed by using parametric models (fewer parameters more robust)

Basic concept: Overfitting Basic concept: Model Selection Model fits training data perfectly but not novel data Reasons: Too little data, to high dimension, too flexible model 5 min: Discuss with your neighbor How can you test if your classifier is overfitting the training data? Overfitting and underfitting More complex model always have lower training data error Solution from last slide: Divide data into training set and validation set Evaluate each model, each parameter setting with the validation set Basic concept: No Free Lunch Theorem What is next? Check the homepage at least 2 times / week! Or set it to send you emails! Do not believe the preachers There is no universally best model! All models contain assumptions that work well in one domain but not in another. We use the homepage a lot: links to video lectures, readings for lectures, lecture slides, questions answered through the News forum https://www.kth.se/social/course/dd2434/ Next on the schedule Wed 5 Nov 13:15-15:00 V32 Lecture 2: Graphical Models Jens Lagergren Readings: Murphy Chapter 10 (except 10.2.4, 10.2.5, 10.4) Assignment 1 published today, deadline November 25