Simplifying Image Processing and Computer Vision Application Development

Similar documents
A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Forget catastrophic forgetting: AI that learns after deployment

Lip Reading in Profile

arxiv: v1 [cs.lg] 15 Jun 2015

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

What is a Mental Model?

Python Machine Learning

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Speech Emotion Recognition Using Support Vector Machine

Circuit Simulators: A Revolutionary E-Learning Platform

Speech Recognition at ICSI: Broadcast News and beyond

Modeling function word errors in DNN-HMM based LVCSR systems

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Modeling function word errors in DNN-HMM based LVCSR systems

Using Deep Convolutional Neural Networks in Monte Carlo Tree Search

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Lecture 1: Machine Learning Basics

Guide to Teaching Computer Science

Knowledge Transfer in Deep Convolutional Neural Nets

Linking Task: Identifying authors and book titles in verbose queries

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

M55205-Mastering Microsoft Project 2016

MOTION PICTURE ANALYSIS FIRST READING (VIEWING)

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Word Segmentation of Off-line Handwritten Documents

The Evolution of Random Phenomena

arxiv: v2 [cs.cv] 30 Mar 2017

A student diagnosing and evaluation system for laboratory-based academic exercises

COMMUNITY ENGAGEMENT

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

arxiv: v1 [cs.cv] 10 May 2017

CS Machine Learning

arxiv: v1 [cs.cl] 27 Apr 2016

Generative models and adversarial training

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

arxiv: v2 [cs.cv] 3 Aug 2017

School of Innovative Technologies and Engineering

Learning Methods for Fuzzy Systems

LEGO MINDSTORMS Education EV3 Coding Activities

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Taxonomy-Regularized Semantic Deep Convolutional Neural Networks

OFFICE SUPPORT SPECIALIST Technical Diploma

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Human Emotion Recognition From Speech

Beyond the Blend: Optimizing the Use of your Learning Technologies. Bryan Chapman, Chapman Alliance

A Review: Speech Recognition with Deep Learning Methods

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Probability and Statistics Curriculum Pacing Guide

An Introduction to Simio for Beginners

Mathematics. Mathematics

INPE São José dos Campos

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

Control Tutorials for MATLAB and Simulink

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

arxiv: v1 [cs.lg] 7 Apr 2015

THE enormous growth of unstructured data, including

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors

Using EEG to Improve Massive Open Online Courses Feedback Interaction

Cultivating DNN Diversity for Large Scale Video Labelling

MULTIMEDIA Motion Graphics for Multimedia

Hardhatting in a Geo-World

arxiv: v2 [cs.ro] 3 Mar 2017

Gifted/Challenge Program Descriptions Summer 2016

Deep Facial Action Unit Recognition from Partially Labeled Data

THE world surrounding us involves multiple modalities

On the Formation of Phoneme Categories in DNN Acoustic Models

Course Content Concepts

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Patterns for Adaptive Web-based Educational Systems

Dublin City Schools Mathematics Graded Course of Study GRADE 4

MYCIN. The MYCIN Task

Disciplinary Literacy in Science

Taste And Sight Anatomy Study Guide

Introduction to Forensics: Preventing Fires in the First Place. A Distance Learning Program Presented by the FASNY Museum of Firefighting

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

arxiv: v1 [cs.cl] 2 Apr 2017

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Course Law Enforcement II. Unit I Careers in Law Enforcement

Diploma of Building and Construction (Building)

Education: Professional Experience: Personnel leadership and management

Merry-Go-Round. Science and Technology Grade 4: Understanding Structures and Mechanisms Pulleys and Gears. Language Grades 4-5: Oral Communication

On-Line Data Analytics

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Diagnostic Test. Middle School Mathematics

Offline Writer Identification Using Convolutional Neural Network Activation Features

Applications of memory-based natural language processing

Transcription:

Simplifying Image Processing and Computer Vision Application Development Elza John 2015 The MathWorks, Inc. 1

Agenda Deep learning for Computer Vision Image processing on 3D data sets 2

Deep Learning for Computer Vision 3

New MATLAB framework makes deep learning easy and accessible 4

Deep Learning is a Subset of Machine Learning Machine Learning Deep Learning 5

What is Deep Learning? Deep learning is a type of machine learning that performs end-to-end learning by learning tasks directly from images, text, and sound. Deep Learning DATA TASK 6

Why is Deep Learning So Popular Now? 7

Deep Learning Enablers Acceleration with GPU s Massive sets of labeled data Availability of state of the art models from experts 8

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Accelerate deep learning with GPU s Visualize and debug deep neural networks Access and use models from experts 9

Image classification using pre-trained network Transfer learning to classify new objects Locate & classify objects in images and video 10

Image classification using pre-trained network Transfer learning to classify new objects Locate & classify objects in images and video 11

Convolutional Neural Networks car truck van bicycle Input Convolution + ReLU Pooling Convolution + ReLU Pooling Flatten Fully Connected Softmax Feature Learning Classification 12

Image Classification Using Pre-trained Network (Video) 13

Agenda Image classification using pre-trained network Transfer learning to classify new objects Locate & classify objects in images and video 14

Why should I train my own network? o Models from research do not work on your data o Pre-trained model not available for your data type o Improve results by creating a model specific to your problem 15

Two Approaches for Deep Learning 1. Train a Deep Neural Network from Scratch 2. Fine-tune a pre-trained model (transfer learning) 16

Example: Classify Vehicles With Transfer Learning AlexNet Pretrained Model 1000 classes Trained on millions of images Transfer learning use AlexNet as starting point Vehicle Classifier ( 5 Class) Car SUV Van Truck Large Truck New Data 17

Why Perform Transfer Learning Requires less data and training time Reference models (like AlexNet, VGG-16, VGG-19) have learned rich feature representations for a wide range of images. Leverage best network types from top researchers 18

Transfer Learning to Classify New Objects 19

Transfer Learning to Classify New Objects 20

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images imageds = imagedatastore(dir) Easily manage large sets of images Accelerate deep learning with GPU s Visualize and debug deep neural networks Access and use models from experts 21

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Accelerate deep learning with GPU s Training modes supported: Auto Select (CPU) GPU Multi GPU (local) Multi GPU (cluster) Visualize and debug deep neural networks Access and use models from experts Acceleration with Multiple GPUs 22

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Training Accuracy Plot Accelerate deep learning with GPU s Visualize and debug deep neural networks Deep Dream Network Activations Access and use models from experts Feature Visualization 23

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Accelerate deep learning with GPU s Visualize and debug deep neural networks Access and use models from experts Curated Set of Pretrained Models Access Models with 1-line of MATLAB Code Net1 = alexnet Net2 = vgg16 Net3 = vgg19 24

Regression Support for Deep Learning Classification vs. Regression Classification outputs categories/labels Regression outputs numbers Supported by new regression layer: routputlayer = regressionlayer('name','routput') Example predict facial key-points: 25

Image classification using pre-trained network Transfer learning to classify new objects Locate & classify objects in images and video 26

Is Object Recognition/Classification Enough? Car Label for entire image Car? SUV? Truck? 27

Object Detection Locate and Classify Object TRUCK SUV CAR 28

Goal: Create Object Detector to Locate Vehicles Step 1: Label / Crop data Step 2: Train detector Step 3: Use detector 29

Label Images with MATLAB 30

Labeling Videos with MATLAB 31

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Accelerate deep learning with GPU s Visualize and debug deep neural networks Image Labeler Access and use models from experts Video Labeler 32

New MATLAB framework makes deep learning easy and accessible and MATLAB can be used by experts for real deep learning problems 33

Deep Learning Object Detection Frameworks in MATLAB Deep Learning R-CNN Fast R-CNN Faster R-CNN Single Line of Code to Train Each Detector E.g. trainfasterrcnnobjectdetector 34

Object Detection Frameworks in MATLAB Machine Learning 1. Cascade Object Detector 2. Aggregate Channel Features (ACF) Deep Learning 1. R-CNN 2. Fast R-CNN 3. Faster R-CNN Same labels, train any detector. 35

MATLAB makes Deep Learning Easy and Accessible Learn about new MATLAB capabilities to Handle and label large sets of images Accelerate deep learning with GPU s Visualize and debug deep neural networks Access and use models from experts 36

3D Image Processing 37

3-D Image Processing Over 40 functions support 3-D volumetric image processing Capabilities Includes: Image arithmetic Morphology Segmentation Geometric transforms Enhancement Volume Viewer App for exploration 38

3-D Image Processing 39

Flexible delivery options: Public training available worldwide Onsite training with standard or customized courses Web-based training with live, interactive instructor-led courses Self-paced interactive online training More than 30 course offerings: Introductory and intermediate training on MATLAB, Simulink, Stateflow, code generation, and Polyspace products Specialized courses in control design, signal processing, parallel computing, code generation, communications, financial analysis, and other areas Email: training@mathworks.in 40

Image Processing with MATLAB This two-day course provides hands-on experience with performing image analysis. Examples and exercises demonstrate the use of appropriate MATLAB and Image Processing Toolbox functionality throughout the analysis process. Topics include: Importing and exporting images Analyzing images interactively Removing noise Aligning images and creating a panoramic scene Detecting edges, lines, and circles in an image Segmenting objects based on their color and texture Modifying objects' shape using morphological operations Measuring shape properties Performing batch analysis over sets of images 41

Computer Vision with MATLAB This one-day course provides hands-on experience with performing computer vision tasks. Examples and exercises demonstrate the use of appropriate MATLAB and Computer Vision System Toolbox functionality Topics include: Importing, displaying and annotating images and videos Detecting, extracting and matching object features Automatically aligning images using geometric transformations Detecting objects in images and videos Tracking objects and estimating their motion in a video Removing lens distortion from images Measuring planar objects 42

Accelerating and Parallelizing MATLAB Code This two-day course covers a variety of techniques for making your MATLAB code run faster. If you are working with long-running simulations, you will benefit from the hands-on demonstrations and exercises in the course Topics include: Improving performance within core MATLAB Generating MEX-files Parallelizing computations Offloading execution Working with clusters GPU computing 43

MathWorks Training Guaranteed to run Upcoming Public Trainings Dates Location Image Processing with MATLAB May 24 25 Bangalore Computer Vision with MATLAB May 26 Bangalore Machine Learning with MATLAB July 10-11 Hyderabad Machine Learning with MATLAB Sept 26-27 Pune Email: training@mathworks.in URL: http://www.mathworks.in/services/training Phone: 080-6632-6000 44

Speaker Details Email: Elza.John@mathworks.in Contact MathWorks India Products/Training Enquiry Booth Call: 080-6632-6000 Email: info@mathworks.in Your feedback is valued. Please complete the feedback form provided to you. 45

Thank You 46