TCC Deep Learning with MATLAB

Similar documents
Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Python Machine Learning

arxiv: v1 [cs.lg] 15 Jun 2015

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Forget catastrophic forgetting: AI that learns after deployment

Generative models and adversarial training

arxiv: v1 [cs.cv] 10 May 2017

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Lip Reading in Profile

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

arxiv: v2 [cs.cv] 30 Mar 2017

Word Segmentation of Off-line Handwritten Documents

THE enormous growth of unstructured data, including

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Cultivating DNN Diversity for Large Scale Video Labelling

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

A Review: Speech Recognition with Deep Learning Methods

A Deep Bag-of-Features Model for Music Auto-Tagging

Modeling function word errors in DNN-HMM based LVCSR systems

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors

THE world surrounding us involves multiple modalities

Knowledge Transfer in Deep Convolutional Neural Nets

Modeling function word errors in DNN-HMM based LVCSR systems

Human Emotion Recognition From Speech

arxiv:submit/ [cs.cv] 2 Aug 2017

Circuit Simulators: A Revolutionary E-Learning Platform

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Offline Writer Identification Using Convolutional Neural Network Activation Features

Assignment 1: Predicting Amazon Review Ratings

Lecture 1: Machine Learning Basics

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

arxiv: v1 [cs.lg] 7 Apr 2015

Learning Methods for Fuzzy Systems

Taxonomy-Regularized Semantic Deep Convolutional Neural Networks

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Lecture 1: Basic Concepts of Machine Learning

Model Ensemble for Click Prediction in Bing Search Ads

arxiv: v1 [cs.cl] 27 Apr 2016

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

INPE São José dos Campos

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CS Machine Learning

(Sub)Gradient Descent

Speech Recognition at ICSI: Broadcast News and beyond

CSL465/603 - Machine Learning

Evolutive Neural Net Fuzzy Filtering: Basic Description

On the Formation of Phoneme Categories in DNN Acoustic Models

Time series prediction

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Deep Facial Action Unit Recognition from Partially Labeled Data

Evolution of Symbolisation in Chimpanzees and Neural Nets

DNN ACOUSTIC MODELING WITH MODULAR MULTI-LINGUAL FEATURE EXTRACTION NETWORKS

Probabilistic Latent Semantic Analysis

Image based Static Facial Expression Recognition with Multiple Deep Network Learning

XXII BrainStorming Day

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Rule Learning With Negation: Issues Regarding Effectiveness

Autoencoder and selectional preference Aki-Juhani Kyröläinen, Juhani Luotolahti, Filip Ginter

Dropout improves Recurrent Neural Networks for Handwriting Recognition

Attributed Social Network Embedding

Linking Task: Identifying authors and book titles in verbose queries

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

Using focal point learning to improve human machine tacit coordination

Speech Emotion Recognition Using Support Vector Machine

SORT: Second-Order Response Transform for Visual Recognition

Learning to Schedule Straight-Line Code

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

A study of speaker adaptation for DNN-based speech synthesis

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

Diverse Concept-Level Features for Multi-Object Classification

Driving Author Engagement through IEEE Collabratec

arxiv: v2 [cs.ro] 3 Mar 2017

Webly Supervised Learning of Convolutional Networks

Laboratorio di Intelligenza Artificiale e Robotica

arxiv: v4 [cs.cl] 28 Mar 2016

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Deep Neural Network Language Models

Patterns for Adaptive Web-based Educational Systems

LEGO MINDSTORMS Education EV3 Coding Activities

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT

Mining Association Rules in Student s Assessment Data

Using Deep Convolutional Neural Networks in Monte Carlo Tree Search

A Neural Network GUI Tested on Text-To-Phoneme Mapping

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Transcription:

8.9.2016 Brno TCC 2016 Deep Learning with MATLAB Jan Studnička studnicka@humusoft.cz www.humusoft.cz info@humusoft.cz www.mathworks.com

Computer Vision Applications Computer Vision Pedestrian and traffic sign detection Landmark identification Scene recognition Medical diagnosis and drug discovery Public Safety / Surveillance Automotive Robotics and many more

What is Deep Learning? Deep learning performs end-end learning by learning features, representations and tasks directly from images, text and sound Traditional Machine Learning Manual Feature Extraction Classification Machine Learning Car Truck Bicycle Deep Learning approach Convolutional Neural Network (CNN) Learned features End-to-end learning Feature learning + Classification 95% 3% 2% Car Truck Bicycle

Deep Learning with MATLAB for Computer Vision Autoencoders Example: Classify digits in images Convolutional Neural Networks (CNN) Trained on massive sets of data High accuracy 4 A visualization of learned weights of the first layer of a CNN.

Neural Network Single neuron Layer of Neurons 5

Autoencoders Unsupervised Learning Hidden layer Encoder Pretrain Deep Neural Network Hidden layers Encoders of pretrained Autoencoders 6

Digit Classification Classify digits in images Data: 28 x 28 pixels 10 digit classes 5000 samples 7 Solution: 2 hidden layers autoencoders Classification Softmax layer Stack the Encoders with the Softmax layer to form a Deep Network Fine-tune the entire Deep Network Classification

Convolutional Neural Networks: Live Object Recognition with Webcam

Why is Deep Learning so Popular? Results: Achieved substantially better results on ImageNet large scale recognition challenge 95% + accuracy on ImageNet 1000 class challenge Year Pre-2012 (traditional computer vision and machine learning techniques) Error Rate > 25% 2012 (Deep Learning) ~ 15% 2015 (Deep Learning) <5 % Computing Power: GPU s and advances to processor technologies have enabled us to train networks on massive sets of data. Data: Availability of storage and access to large sets of labeled data E.g. ImageNet, PASCAL VoC, Kaggle

Two Approaches for Deep Learning 1. Train a Deep Neural Network from Scratch Lots of data Convolutional Neural Network (CNN) Learned features 95% 3% 2% Car Truck Bicycle 2. Fine-tune a pre-trained model (transfer learning) Pre-trained CNN Fine-tune network weights New Task Car Truck Medium amounts of data

Two Deep Learning Approaches Approach 1: Train a Deep Neural Network from Scratch Convolutional Neural Network (CNN) Learned features 95% 3% 2% Car Truck Bicycle Recommended only when: Training data 1000s to millions of labeled images Computation Compute intensive (requires GPU) Training Time Days to Weeks for real problems Model accuracy High (can overfit to small datasets)

Two Deep Learning Approaches Approach 2: Fine-tune a pre-trained model (transfer learning) CNN trained on massive sets of data Learned robust representations of images from larger data set Can be fine-tuned for use with new data or task with small medium size datasets Pre-trained CNN Fine-tune network weights New Task Car Truck New Data Recommended when: Training data Computation Training Time Model accuracy 100s to 1000s of labeled images (small) Moderate computation (GPU optional) Seconds to minutes Good, depends on the pre-trained CNN model

Convolutional Neural Networks Train deep neural networks on structured data (e.g. images, signals, text) Implements Feature Learning: Eliminates need for hand crafted features Trained using GPUs for performance car truck van bicycle Input Convolution + ReLu Pooling Convolution + ReLu Pooling Flatten Fully Connected Softmax Feature Learning Classification

Convolutional Neural Networks

Demo Fine-tune a pre-trained model (transfer learning) Pre-trained CNN (AlexNet 1000 Classes) Car SUV New Data New Task 2 Class Classification

Demo Fine-tune a pre-trained model (transfer learning)

Addressing Challenges in Deep Learning for CV Challenge Managing large sets of labeled images Resizing, Data augmentation Background in neural networks (deep learning) Computation intensive task (requires GPU) Solution imageset or imagedatastore to handle large sets of images imresize, imcrop, imadjust, imageinputlayer, etc. Intuitive interfaces, well-documented architectures and examples Training supported on GPUs No GPU expertise is required Automate. Offload computations to a cluster and test multiple architectures

International reseller of MathWorks for the Czech Republic and Slovakia Pobřežní 20 www.facebook.com/humusoft 186 00 Praha 8 www.youtube.com/humusoft Česká republika www.twitter.com/humusoft Email: info@humusoft.cz www.humusoft.cz Tel.: +420 284 011 720