J.D. Gallego-Posada D.A. Montoya-Zapata D.E. Sierra-Sosa O.L. Quintero-Montoya

Similar documents
HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Python Machine Learning

arxiv: v1 [cs.lg] 15 Jun 2015

A Deep Bag-of-Features Model for Music Auto-Tagging

Cultivating DNN Diversity for Large Scale Video Labelling

THE enormous growth of unstructured data, including

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Taxonomy-Regularized Semantic Deep Convolutional Neural Networks

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

A study of speaker adaptation for DNN-based speech synthesis

arxiv:submit/ [cs.cv] 2 Aug 2017

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors

Lip Reading in Profile

SORT: Second-Order Response Transform for Visual Recognition

Image based Static Facial Expression Recognition with Multiple Deep Network Learning

arxiv: v2 [cs.cl] 26 Mar 2015

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

arxiv: v4 [cs.cv] 13 Aug 2017

Lecture 1: Machine Learning Basics

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Diverse Concept-Level Features for Multi-Object Classification

Offline Writer Identification Using Convolutional Neural Network Activation Features

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Speech Emotion Recognition Using Support Vector Machine

Evolutive Neural Net Fuzzy Filtering: Basic Description

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Generative models and adversarial training

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

INPE São José dos Campos

A Review: Speech Recognition with Deep Learning Methods

Word Segmentation of Off-line Handwritten Documents

Lecture 1: Basic Concepts of Machine Learning

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Deep Neural Network Language Models

arxiv: v1 [cs.lg] 7 Apr 2015

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL XXX, NO. XXX,

Dropout improves Recurrent Neural Networks for Handwriting Recognition

Knowledge Transfer in Deep Convolutional Neural Nets

DNN ACOUSTIC MODELING WITH MODULAR MULTI-LINGUAL FEATURE EXTRACTION NETWORKS

WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web

WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Second Exam: Natural Language Parsing with Neural Networks

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Learning Methods for Fuzzy Systems

Webly Supervised Learning of Convolutional Networks

Human Emotion Recognition From Speech

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

SEMI-SUPERVISED ENSEMBLE DNN ACOUSTIC MODEL TRAINING

CSL465/603 - Machine Learning

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Time series prediction

Deep Facial Action Unit Recognition from Partially Labeled Data

Assignment 1: Predicting Amazon Review Ratings

arxiv: v1 [cs.cl] 27 Apr 2016

Modeling function word errors in DNN-HMM based LVCSR systems

Softprop: Softmax Neural Network Backpropagation Learning

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

arxiv: v2 [cs.cv] 30 Mar 2017

Knowledge-Based - Systems

DIRECT ADAPTATION OF HYBRID DNN/HMM MODEL FOR FAST SPEAKER ADAPTATION IN LVCSR BASED ON SPEAKER CODE

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Modeling function word errors in DNN-HMM based LVCSR systems

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

arxiv: v2 [cs.lg] 8 Aug 2017

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Australian Journal of Basic and Applied Sciences

(Sub)Gradient Descent

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

A deep architecture for non-projective dependency parsing

On the Formation of Phoneme Categories in DNN Acoustic Models

Axiom 2013 Team Description Paper

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

arxiv: v1 [cs.lg] 3 May 2013

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

arxiv: v2 [cs.ir] 22 Aug 2016

arxiv: v2 [cs.cv] 3 Aug 2017

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

arxiv: v1 [cs.cv] 2 Jun 2017

Distributed Learning of Multilingual DNN Feature Extractors using GPUs

arxiv: v2 [stat.ml] 30 Apr 2016 ABSTRACT

Test Effort Estimation Using Neural Network

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Using Deep Convolutional Neural Networks in Monte Carlo Tree Search

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

The Action Similarity Labeling Challenge

Forget catastrophic forgetting: AI that learns after deployment

arxiv: v2 [cs.cv] 4 Mar 2016

Reducing Features to Improve Bug Prediction

Transcription:

APPLICATION OF DEEP LEARNING ALGORITHMS TO IMAGE CLASSIFICATION PROPOSAL PRESENTATION J.D. Gallego-Posada D.A. Montoya-Zapata D.E. Sierra-Sosa O.L. Quintero-Montoya { jgalle29, dmonto39, dsierras, oquinte1} (at)eafit(dot)edu(dot)co Research Group on Mathematical Modeling School of Sciences Universidad EAFIT 19/02/2016

INTRODUCTION

What is Deep Learning?

Introduction How can we teach computers to locate faces in an image? 1Image retrieved on 17/02/2016 from http://www.ukprogressive.co.uk/wpcontent/uploads/2015/02/face-algorithm.png

Introduction How can we teach computers to understand our voices? 2Image retrieved on 17/02/2016 from http://www.psfk.com/2014/12/voice-recog nition-software-translates-words-from-those-with-speech-disorders.html

Introduction How can we teach computers to recognize characters? 3Image retrieved on 18/02/2016 from http://teaching.paganstudio.com/digital foundations/wp-content/uploads/2013/09/lpr_software_1.jpg

Introduction How can we teach computers to identify healthy and unhealthy patients?

Inspiration 4Image retrieved on 17/02/2016 from http://cosmonio.com/research/deep-lear ning/files/small_1420.png

Brain as a System Inputs Learning Mechanism Outputs

Brain as a System - Single-Layer Perceptron Inputs Learning Mechanism Outputs Input 1 Input layer Output layer Input 2 Output Input 3

The XOR Problem What about non linear-separable groups? 5Image retrieved on 18/02/2016 from http://lab.fs.uni-lj.si/lasin/wp/ IMIT_files/neural/nn06_rbfn_xor/html/nn06_rbfn_xor_3_newpnn_01.png

Neural Network - Multilayer Perceptron Input 1 Input 2 Input layer Hidden layer Output layer Input 3 Output Input 4 Input 5

Deep Learning Definition Deep Learning is a subfield of Machine Learning which uses computational models, with hierarchical architectures composed by multiple processing layers, to learn representations of complex data such as images, sound and text [1].

PRECEDING RESEARCH

Preceding Research 2004 Methods based on BoW for image classification problems [9] 2006 Incorporating spatial geometry to BoW models [11] 2006... 2010 Sparse coding for the image classification problem [10] 2011 Extracting high-order statistics - Fisher kernel [8] 2012 CNN for image classification problems [13] 2014 Development of a new visualization strategy [6] 2015 Successful use of deeper architectures [5], [12] 2015 Strategies for avoiding overfitting and underfitting [7] 2016 Representation learning for Deep Neural Networks [14] Not only improving performance, but also gaining a better understanding of DL and DNN.

Back-propagation Input layer First hidden layer Second hidden layer Output layer Error

Preceding Research 2004 Methods based on BoW for image classification problems [9] 2006 Incorporating spatial geometry to BoW models [11] 2006 Hinton [15], LeCun [16], Bengio [17] 2010 Sparse coding for the image classification problem [10] 2011 Extracting high-order statistics - Fisher kernel [8] 2012 CNN for image classification problems [13] 2014 Development of a new visualization strategy [6] 2015 Successful use of deeper architectures [5], [12] 2015 Strategies for avoiding overfitting and underfitting [7] 2016 Representation learning for Deep Neural Networks [14] Not only improving performance, but also gaining a better understanding of DL and DNN.

PROBLEM STATEMENT

Problem Statement Inputs Set of input images: X {X 1,..., X n } Matrix of lables: Y [ y 1 y n] where yi B k y ij 1 j s.t. i 1, 2,..., n

Problem Statement Output Matrix of predicted labels: Ŷ R nxk s.t. Y Ŷ < ɛ for a given tolerance level ɛ and a norm

OBJECTIVES AND METHODOLOGY

Objectives General Objective To assess the performance of Deep Learning techniques applied to the detection of specific structures in medical images.

Objectives Specific Objectives To perform a review on the state-of-the-art in Deep Learning. To synthesize the theoretical foundations for the Deep Learning techniques to be used. To implement a Deep Learning algorithm and benchmark it against analogue implementations of the same algorithm.

Methodology O1: State-of-the-art Review Database search and extraction of relevant aspects from the found sources. Order chronologically the information and write the state-of-the-art.

Methodology O2: Theoretical Foundations Search, select and read additional papers containing the mathematical structure needed to define Deep Learning theoretically.

Methodology O3: Implementation of Algorithm Write pseudocode and code a preliminary version. Calibrate the parameters of the computational model. Benchmark our implementation against a previous implementation of the same algorithm.

SCOPE

Scope Scope GRIMMAT research areas require Deep Learning tools. Implement a Deep Learning algorithm. Application to medical images classification. Gain understanding in Deep Learning techniques. Attend Cornell University s Program for Research Experience.

INTELLECTUAL PROPERTY

Intellectual Property Results Ownership According to the internal regulations on intellectual property within Universidad EAFIT, the results of this practice are product of the coautorship between Prof. Dr. Olga Lucia Quintero-Montoya, Prof. Dr. Daniel Esteban Sierra-Sosa, and students Jose Daniel Gallego- Posada and Diego Alejandro Montoya-Zapata.

REFERENCES

References I Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, Nature, vol. 521, no. 7553, pp. 436 444, 2015. L. Deng, A tutorial survey of architectures, algorithms, and applications for deep learning, APSIPA Transactions on Signal and Information Processing, vol. 3, no. January, p. e2, 2014. Y. Guo, Y. Liu, A. Oerlemans, S. Lao, S. Wu, and M. S. Lew, Deep learning for visual understanding: A review, Neurocomputing, 2015. D. Novotny, Large Scale Object Detection, Ph.D. dissertation, Czech Technical University, 2014. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, Going deeper with convolutions, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015, pp. 1 9. M. D. Zeiler and R. Fergus, Visualizing and understanding convolutional networks, in Proceedings of the ECCV International Workshop on Statistical Learning in Computer Vision. Springer, 2014, pp. 818 833.

References II R. Wu, S. Yan, Y. Shan, Q. Dang, and G. Sun, Deep Image: Scaling up Image Recognition, Arxiv, p. 12, 2015. F. Perronnin, Y. Liu, J. S anchez, and H. Poirier, Large-scale image retrieval with compressed fisher vectors, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2010, pp. 3384 3391. G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray, Visual categorization with bags of keypoints, Proceedings of the ECCV International Workshop on Statistical Learning in Computer Vision, pp. 59 74, 2004. Y. Lin, F. Lv, S. Zhu, M. Yang, T. Cour, K. Yu, L. Cao, and T. Huang, Large-scale image classification: Fast feature extraction and SVM training, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2011, pp. 1689 1696. S. Lazebnik, C. Schmid, and J. Ponce, Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories, in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, 2006, pp. 2169 2178.

References III K. Simonyan and A. Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition, Proceedings of the ICLR, pp. 1 14, 2015. A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet Classification with Deep Convolutional Neural Networks, Advances In Neural Information Processing Systems, pp. 1 9, 2012. Y. Li, J. Yosinski, J. Clune, H. Lipson, and J. Hopcroft, Convergent Learning: Do different neural networks learn the same representations? in ICLR, 2016, pp. 1 21. G, Hinton, S. Osindero, and Y. Teh, A Fast Learning Algorithm for Deep Belief Nets, Neural Computation, vol. 18, no. 7, pp. 1527 54, 2006. Y, Bengio, and Y. LeCun, Scaling Learning Algorithms towards AI, Large Scale Kernel Machines, no. 1, pp. 321-360, 2007. Y, Bengio, P. Lamblin, D. Popovici, and H. Larochelle, Greedy Layer-Wise Training of Deep Networks, Advances in Neural Information Processing Systems, vol 19., no. 1, pp. 153, 2007.

QUESTIONS