Machine Learning for Predictive Modelling Rory Adams

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Python Machine Learning

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Data Fusion Models in WSNs: Comparison and Analysis

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning

CSL465/603 - Machine Learning

Learning Methods for Fuzzy Systems

Australian Journal of Basic and Applied Sciences

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Circuit Simulators: A Revolutionary E-Learning Platform

A Case Study: News Classification Based on Term Frequency

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Mining Association Rules in Student s Assessment Data

(Sub)Gradient Descent

Human Emotion Recognition From Speech

Lecture 1: Machine Learning Basics

LEGO MINDSTORMS Education EV3 Coding Activities

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Measurement and statistical modeling of the urban heat island of the city of Utrecht (the Netherlands)

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language

INTERMEDIATE ALGEBRA PRODUCT GUIDE

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Lecture 1: Basic Concepts of Machine Learning

Word Segmentation of Off-line Handwritten Documents

Evolutive Neural Net Fuzzy Filtering: Basic Description

Reducing Features to Improve Bug Prediction

Speech Emotion Recognition Using Support Vector Machine

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

CS Machine Learning

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Rule Learning With Negation: Issues Regarding Effectiveness

Chapter 2. Intelligent Agents. Outline. Agents and environments. Rationality. PEAS (Performance measure, Environment, Actuators, Sensors)

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Switchboard Language Model Improvement with Conversational Data from Gigaword

Introduction on Lean, six sigma and Lean game. Remco Paulussen, Statistics Netherlands Anne S. Trolie, Statistics Norway

Probabilistic Latent Semantic Analysis

Intelligent Agents. Chapter 2. Chapter 2 1

EdX Learner s Guide. Release

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

CNS 18 21th Communications and Networking Simulation Symposium

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Geospatial Visual Analytics Tutorial. Gennady Andrienko & Natalia Andrienko

arxiv: v2 [cs.ro] 3 Mar 2017

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

AC : DESIGNING AN UNDERGRADUATE ROBOTICS ENGINEERING CURRICULUM: UNIFIED ROBOTICS I AND II

A student diagnosing and evaluation system for laboratory-based academic exercises

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Activity Recognition from Accelerometer Data

Rule Learning with Negation: Issues Regarding Effectiveness

School of Innovative Technologies and Engineering

Mathematics subject curriculum

Test Effort Estimation Using Neural Network

Meeting Agenda for 9/6

Time series prediction

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES

Radius STEM Readiness TM

Assignment 1: Predicting Amazon Review Ratings

Leveraging MOOCs to bring entrepreneurship and innovation to everyone on campus

One Way Draw a quick picture.

A Reinforcement Learning Variant for Control Scheduling

arxiv: v1 [cs.cv] 10 May 2017

Airplane Rescue: Social Studies. LEGO, the LEGO logo, and WEDO are trademarks of the LEGO Group The LEGO Group.

Mathematics process categories

Laboratorio di Intelligenza Artificiale e Robotica

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Senior Research Fellow, Intelligent Mobility Design Centre

Generative models and adversarial training

ACHIEVING SUSTAINABILITY THROUGH GREEN OFFICES PRACTICES

Using dialogue context to improve parsing performance in dialogue systems

Software Maintenance

SOCIAL STUDIES GRADE 1. Clear Learning Targets Office of Teaching and Learning Curriculum Division FAMILIES NOW AND LONG AGO, NEAR AND FAR

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

LIBRARY AND RECORDS AND ARCHIVES SERVICES STRATEGIC PLAN 2016 to 2020

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

A Biological Signal-Based Stress Monitoring Framework for Children Using Wearable Devices

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Multivariate k-nearest Neighbor Regression for Time Series data -

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

CS177 Python Programming

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Strategy and Design of ICT Services

WHEN THERE IS A mismatch between the acoustic

Mining Student Evolution Using Associative Classification and Clustering

Applications of data mining algorithms to analysis of medical data

Lecture 10: Reinforcement Learning

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Answer Key For The California Mathematics Standards Grade 1

Transcription:

Machine Learning for Predictive Modelling Rory Adams 2015 The MathWorks, Inc. 1

Agenda Machine Learning What is Machine Learning and why do we need it? Common challenges in Machine Learning Example: Human activity learning using mobile phone data Example: Real-time object identification using images Example: Load forecasting using weather data Summary & Key Takeaways 2

Machine Learning is Everywhere Image Recognition Speech Recognition Stock Prediction Medical Diagnosis Data Analytics Robotics and more [TBD] 3

Machine Learning Machine learning uses data and produces a model to perform a task Task: Human Activity Detection Standard Approach Machine Learning Approach Computer Program Machine Learning Hand Written Program If X_acc > 0.5 then SITTING If Y_acc < 4 and Z_acc > 5 then STANDING Formula or Equation Y activity = β 1 X acc + β 2 Y acc + β 3 Z acc + model: Predictors Inputs Outputs Response model = < Machine Learning >(sensor_data, activity) Algorithm 4

Different Types of Learning Machine Learning Supervised Learning Unsupervised Learning Discover a good internal representation Learn a low dimensional representation Classification Response is a choice between classes (True, False) (Red, Blue, Green) Regression Response is a continuous number (temperature, stock prices). 5

Example: Human Activity Learning Using Mobile Phone Data Machine Learning Data: 3-axial Accelerometer data 3-axial Gyroscope data 6

essentially, all models are wrong, but some are useful George Box 7

Challenges in Machine Learning Hard to get started Steps Access, explore and analyze data Preprocess data Train models Assess model performance Challenge Data diversity Numeric, Images, Signals, Text not always tabular Lack of domain tools Filtering and feature extraction Feature selection and transformation Time consuming Train many models to find the best Avoid pitfalls Over Fitting Speed-Accuracy-Complexity tradeoffs Iterate 8

Machine Learning Workflow Train: Iterate till you find the best model LOAD PREPROCESS SUPERVISED LEARNING MODEL FILTERS PCA CLASSIFICATION SUMMARY STATISTICS CLUSTER ANALYSIS REGRESSION Predict: Integrate trained models into applications NEW PREPROCESS MODEL PREDICTION FILTERS PCA SUMMARY STATISTICS CLUSTER ANALYSIS 9

Agenda Machine Learning What is Machine Learning and why do we need it? Common challenges in Machine Learning Example: Human activity learning using mobile phone data Example: Real-time object identification using images Example: Load forecasting using weather data Summary & Key Takeaways 10

Example 1: Human Activity Learning Using Mobile Phone Data Objective: Train a classifier to classify human activity from sensor data Data: Predictors Response 3-axial Accelerometer and Gyroscope data Activity: (Classification) Approach: Extract features from raw sensor signals Train and compare classifiers Test results on new sensor data 11

Machine Learning Workflow for Example 1 Train: Iterate till you find the best model LOAD PREPROCESS SUPERVISED LEARNING MODEL 1. Mean 2. FILTERS Standard PCA deviation 3. PCA SUMMARY STATISTICS CLUSTER ANALYSIS CLASSIFICATION Classification Learner REGRESSION Predict: Integrate trained models into applications TEST PREPROCESS MODEL PREDICTION 1. Mean 2. FILTERS Standard deviation 3. PCA SUMMARY STATISTICS PCA CLUSTER ANALYSIS 12

Agenda Machine Learning What is Machine Learning and why do we need it? Common challenges in Machine Learning Example: Human activity learning using mobile phone data Example: Real-time object identification using images Example: Load forecasting using weather data Summary & Key Takeaways 13

Example 2: Real-time Toy Identification Using Images Objective: Train a classifier to identify toy type from a webcam video Data: Predictors Several images of cars: Response CAR, HELICOPTER, PLANE, BIKE (Classification) Approach: Extract features using Bag-of-words Train and compare classifiers Classify streaming video from a webcam 14

Machine Learning Workflow for Example Train: Iterate till you find the best model LOAD PREPROCESS SUPERVISED LEARNING MODEL 1. Build Bag-offeatures PCA FILTERS 2. Encode images as new features SUMMARY STATISTICS CLUSTER ANALYSIS CLASSIFICATION Classification Learner REGRESSION Predict: Integrate trained models into applications WEBCAM PREPROCESS MODEL PREDICTION Encode FILTERS images PCAas new features SUMMARY STATISTICS CLUSTER ANALYSIS 15

Agenda Machine Learning What is Machine Learning and why do we need it? Common challenges in Machine Learning Example: Human activity learning using mobile phone data Example: Real-time object identification using images Example: Load forecasting using weather data Summary & Key Takeaways 16

Example 3: Day-Ahead System Load Forecasting Objective: Train a neural network to predict the required system load for a zone Data: Predictors Response Temperature, Dew point, Month, Day of week, Prior day load, Prior week load LOAD (Regression) Approach: Extract additional features Train neural network Predict load 17

Machine Learning Workflow for Example 1 Train: Iterate till you find the best model LOAD PREPROCESS SUPERVISED LEARNING MODEL Temp, Dew point Day FILTERS of week SUMMARY STATISTICS PCA Prior day load Prior week load CLUSTER ANALYSIS CLASSIFICATION Neural Network REGRESSION Predict: Integrate trained models into applications TEST PREPROCESS MODEL PREDICTION Temp, Dew point Day FILTERS of week SUMMARY STATISTICS PCA Prior day load Prior week load CLUSTER ANALYSIS 18

Agenda Machine Learning What is Machine Learning and why do we need it? Common challenges in Machine Learning Example: Human activity learning using mobile phone data Example: Real-time object identification using images Example: Load forecasting using weather data Summary & Key Takeaways 19

Challenges in Machine Learning Steps Accessing, exploring and analyzing data Preprocess data Challenge Data diversity Lack of domain tools Train models Assess model performance Time consuming Avoid pitfalls Over Fitting, Speed-Accuracy-Complexity Iterate 20

MATLAB Strengths for Machine Learning Challenge Data diversity Lack of domain tools Time consuming Avoid pitfalls Over Fitting, Speed-Accuracy-Complexity Solution Extensive data support Import and work with signal, images, financial, Textual, geospatial, and several others formats High-quality libraries Industry-standard algorithms for Finance, Statistics, Signal, Image processing & more Interactive, app-driven workflows Focus on machine learning, not programing Integrated best practices Model validation tools built into app Rich documentation with step by step guidance Flexible architecture for customized workflows Complete machine learning platform 21

Key Takeaways Consider Machine Learning when: Hand written rules and equations are too complex Face recognition, speech recognition, recognizing patterns Rules of a task are constantly changing Fraud detection from transactions, anomaly in sensor data Nature of the data changes and the program needs to adapt Automated trading, energy demand forecasting, predicting shopping trends MATLAB for Machine Learning 22

Additional Resources Documentation: Machine Learning with MATLAB: 23

Q & A Topic of interest Working with IoT data Accessing, analysing and visualising data Working with big data sets Deploying machine learning algorithms Machine learning with computer vision Session / Demo Station Session: MATLAB and the Internet of Things (IoT): Collecting and Analysing IoT Data Session: Analysis of Experimental and Test Data Session: Tackling Big Data with MATLAB Demo: Building MATLAB Apps to Visualise Complex Data Demo: Identification of Objects in Real-Time Video 24