Deep learning. Introduction. Hamid Beigy. September 16, Sharif university of technology. Deep learning

Similar documents
Python Machine Learning

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

CSL465/603 - Machine Learning

Learning Methods for Fuzzy Systems

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

Welcome to. ECML/PKDD 2004 Community meeting

Rule Learning With Negation: Issues Regarding Effectiveness

Word Segmentation of Off-line Handwritten Documents

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

A Reinforcement Learning Variant for Control Scheduling

Knowledge Transfer in Deep Convolutional Neural Nets

arxiv: v1 [cs.lg] 15 Jun 2015

A Review: Speech Recognition with Deep Learning Methods

INPE São José dos Campos

THE world surrounding us involves multiple modalities

Artificial Neural Networks written examination

Lecture 1: Basic Concepts of Machine Learning

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

arxiv: v1 [cs.cl] 27 Apr 2016

Human Emotion Recognition From Speech

arxiv: v1 [cs.cv] 10 May 2017

A study of speaker adaptation for DNN-based speech synthesis

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Distributed Learning of Multilingual DNN Feature Extractors using GPUs

Time series prediction

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

arxiv: v4 [cs.cl] 28 Mar 2016

THE enormous growth of unstructured data, including

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Rule Learning with Negation: Issues Regarding Effectiveness

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

DIRECT ADAPTATION OF HYBRID DNN/HMM MODEL FOR FAST SPEAKER ADAPTATION IN LVCSR BASED ON SPEAKER CODE

Generative models and adversarial training

On the Formation of Phoneme Categories in DNN Acoustic Models

arxiv: v1 [cs.lg] 7 Apr 2015

Georgetown University at TREC 2017 Dynamic Domain Track

Mining Association Rules in Student s Assessment Data

Axiom 2013 Team Description Paper

Deep Neural Network Language Models

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

arxiv: v2 [cs.ir] 22 Aug 2016

Learning From the Past with Experiment Databases

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Dropout improves Recurrent Neural Networks for Handwriting Recognition

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Learning Methods in Multilingual Speech Recognition

Top US Tech Talent for the Top China Tech Company

arxiv:submit/ [cs.cv] 2 Aug 2017

Evolution of Symbolisation in Chimpanzees and Neural Nets

Artificial Neural Networks

Calibration of Confidence Measures in Speech Recognition

Exploration. CS : Deep Reinforcement Learning Sergey Levine

A Vector Space Approach for Aspect-Based Sentiment Analysis

Laboratorio di Intelligenza Artificiale e Robotica

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Second Exam: Natural Language Parsing with Neural Networks

DNN ACOUSTIC MODELING WITH MODULAR MULTI-LINGUAL FEATURE EXTRACTION NETWORKS

Soft Computing based Learning for Cognitive Radio

Lecture 1: Machine Learning Basics

Speech Emotion Recognition Using Support Vector Machine

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

Evolutive Neural Net Fuzzy Filtering: Basic Description

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

arxiv: v1 [cs.cl] 20 Jul 2015

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Laboratorio di Intelligenza Artificiale e Robotica

A NOVEL SCHEME FOR SPEAKER RECOGNITION USING A PHONETICALLY-AWARE DEEP NEURAL NETWORK. Yun Lei Nicolas Scheffer Luciana Ferrer Mitchell McLaren

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL XXX, NO. XXX,

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

arxiv: v2 [stat.ml] 30 Apr 2016 ABSTRACT

A Deep Bag-of-Features Model for Music Auto-Tagging

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Handling Concept Drifts Using Dynamic Selection of Classifiers

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Probabilistic Latent Semantic Analysis

On-Line Data Analytics

Automating the E-learning Personalization

An OO Framework for building Intelligence and Learning properties in Software Agents

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

TD(λ) and Q-Learning Based Ludo Players

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

ON THE USE OF WORD EMBEDDINGS ALONE TO

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Transcription:

Deep learning Deep learning Introduction Hamid Beigy Sharif university of technology September 16, 2018 Hamid Beigy Sharif university of technology September 16, 2018 1 / 21

Deep learning Table of contents 1 Course Information 2 Introduction 3 Success stories 4 Outline of course Hamid Beigy Sharif university of technology September 16, 2018 2 / 21

Deep learning Course Information Table of contents 1 Course Information 2 Introduction 3 Success stories 4 Outline of course Hamid Beigy Sharif university of technology September 16, 2018 2 / 21

Deep learning Course Information Course Information 1 Course name : Deep learning 2 The objective of deep learning is moving Machine Learning closer to one of its original goals: Artificial Intelligence. 3 Instructor : Hamid Beigy Email : beigy@sharif.edu 4 Course Website: http://ce.sharif.edu/courses/97-98/1/ce959-1/ 5 Lectures: Sat-Mon (10:30-12:00) 6 TAs : Fariba Lotfi Sara Rastegar Email: flotfi@ce.sharif.edu Email: s_rastegar@ce.sharif.edu Hamid Beigy Sharif university of technology September 16, 2018 3 / 21

Deep learning Course Information Course evaluation Evaluation: Mid-term exam 30% 1397/8/12 Final exam 30% Practical Assignments 30% Quiz 10% Hamid Beigy Sharif university of technology September 16, 2018 4 / 21

Deep learning Course Information Main reference Methods and Applications Li Deng and Dong Yu Deep Learning: Methods and Applications provides an overview of general deep learning methodology and its applications to a variety of signal and information processing tasks. The application areas are chosen with the following three criteria in mind: (1) expertise or knowledge of the authors; (2) the application areas that have already been transformed by the successful use of deep learning technology, such as speech recognition and computer vision; and (3) the application areas that have the potential to be impacted significantly by deep learning and that have been benefitting from recent research efforts, including natural language and text processing, information retrieval, and multimodal information processing empowered by multitask deep learning. This book provides an overview of a sweeping range of up-to-date deep learning methodologies and their application to a variety of signal and information processing tasks, including not only automatic speech recognition (ASR), but also computer vision, language modeling, text processing, multimodal learning, and information retrieval. This is the first and the most valuable book for deep and wide learning of deep learning, not to be missed by anyone who wants to know the breathtaking impact of deep learning on many facets of information processing, especially ASR, all of vital importance to our modern technological society. Sadaoki Furui, President of Toyota Technological Institute at Chicago, and Professor at the Tokyo Institute of Technology Foundations and Trends in Signal Processing 7:3-4 Deep Learning Methods and Applications Li Deng and Dong Yu Li Deng and Dong Yu Deep Learning: Methods and Applications is a timely and important book for researchers and students with an interest in deep learning methodology and its applications in signal and information processing. FnT SIG 7:3-4 Deep Learning; Methods and Applications Deep Learning This book is originally published as Foundations and Trends in Signal Processing Volume 7 Issues 3-4, ISSN: 1932-8346. now now the essence of knowledge Hamid Beigy Sharif university of technology September 16, 2018 5 / 21

Deep learning Course Information References I Deng, L., and Yu, D. Deep learning: Methods and applications. Foundations and Trends in Signal Processing 7, 3 4 (2013), 197 387. Goodfellow, I., Bengio, Y., and Courville, A. Deep Learning. MIT Press, 2016. Hamid Beigy Sharif university of technology September 16, 2018 6 / 21

Deep learning Course Information Relevant journals 1 IEEE Trans on Pattern Analysis and Machine Intelligence 2 Journal of Machine Learning Research 3 Pattern Recognition 4 Machine Learning 5 Neural Networks 6 Neural Computation 7 Neurocomputing 8 IEEE Trans. on Neural Networks and Learning Systems 9 Annuals of Statistics 10 Journal of the American Statistical Association 11 Pattern Recognition Letters 12 Artificial Intelligence 13 Data Mining and Knowledge Discovery 14 IEEE Transaction on Cybernetics (SMC-B) 15 IEEE Transaction on Knowledge and Data Engineering 16 Knowledge and Information Systems Hamid Beigy Sharif university of technology September 16, 2018 7 / 21

Deep learning Course Information Relevant conferences 1 Neural Information Processing Systems (NIPS) 2 International Conference on Machine Learning (ICML) 3 European Conference on Machine Learning (ECML) 4 Asian Conference on Machine Learning (ACML) 5 Conference on Learning Theory (COLT) 6 Algorithmic Learning Theory (ALT) 7 Conference on Uncertainty in Artificial Intelligence (UAI) 8 Practice of Knowledge Discovery in Databases (PKDD) 9 International Joint Conference on Artificial Intelligence (IJCAI) 10 IEEE International Conference on Data Mining series (ICDM) Hamid Beigy Sharif university of technology September 16, 2018 8 / 21

Deep learning Course Information Relevant packages and datasets 1 Packages: Keras https://keras.io TensorFlow http://www.tensorflow.org/ Cafe http://caffe.berkeleyvision.org 2 Datasets: UCI Machine Learning Repository http://archive.ics.uci.edu/ml/ MNIST: handwritten digits http://yann.lecun.com/exdb/mnist/ 20 newsgroups http://qwone.com/~jason/20newsgroups/ Hamid Beigy Sharif university of technology September 16, 2018 9 / 21

Deep learning Introduction Table of contents 1 Course Information 2 Introduction 3 Success stories 4 Outline of course Hamid Beigy Sharif university of technology September 16, 2018 9 / 21

Deep learning Introduction Gartner Hyper-Cycle of Emerging Technologies (2016) Hamid Beigy Sharif university of technology September 16, 2018 10 / 21

Deep learning Introduction Gartner Hyper-Cycle of Emerging Technologies (2017) Hamid Beigy Sharif university of technology September 16, 2018 11 / 21

Deep learning Introduction Gartner Hyper-Cycle of Emerging Technologies (2018) Hamid Beigy Sharif university of technology September 16, 2018 12 / 21

Deep learning Introduction What is deep learning? Deep learning has various closely related definitions or high-level descriptions. Definition (Deep learning) A sub-field of machine learning that is based on learning several levels of representations, corresponding to a hierarchy of features or factors or concepts, where higher-level concepts are defined from lower-level ones, and the same lower- level concepts can help to define many higher-level concepts. Hamid Beigy Sharif university of technology September 16, 2018 13 / 21

Deep learning Introduction An Example CHAPTER 1 CAR PERSON ANIMAL Output (object identity) 3rd hidden layer (object parts) 2nd hidden layer (corners and contours) 1st hidden layer (edges) Visible layer (input pixels) Hamid Beigy Sharif university of technology September 16, 2018 14 / 21

Deep learning Introduction What is deep learning? Definition (Deep learning) Deep learning is part of a broader family of machine learning methods based on learning representations. An observation (e.g., an image) can be represented in many ways (e.g., a vector of pixels), but some representations make it easier to learn tasks of interest (e.g., is this the image of a human face?) from examples, and research in this area attempts to define what makes better representations and how to learn them. Hamid Beigy Sharif university of technology September 16, 2018 15 / 21

Deep learning Introduction An Example CHAPTER 1 Output Output Output Mapping from features Output Mapping from features Mapping from features Additional layers of more abstract features Handdesigned program Handdesigned features Features Simple features Input Input Input Input Deep Classic Rule-based learning machine Hamid Beigy Sharif university of technology systems September learning 16, 2018 Representation 16 / 21

Deep learning Introduction What is deep learning? Common among the various high-level descriptions of deep learning are two key aspects: 1 Models consisting of multiple layers/stages of nonlinear information processing 2 Methods for supervised or unsupervised learning of feature representation at successively higher, more abstract layers. Deep learning is in the intersections among the research areas of 1 Neural networks 2 Artificial intelligence 3 Graphical modeling 4 Optimization 5 Pattern recognition 6 Signal processing. Hamid Beigy Sharif university of technology September 16, 2018 17 / 21

Deep learning Success stories Table of contents 1 Course Information 2 Introduction 3 Success stories 4 Outline of course Hamid Beigy Sharif university of technology September 16, 2018 17 / 21

Deep learning Success stories Success stories 1 1 Word2vec, Mikolov, 2013. king man + woman = queen 2 Finding nearest images Success Stories 1 This slide is taken from Prof. Ghodsi s slides. Hamid Beigy Sharif university of technology September 16, 2018 18 / 21

Deep learning Success stories Success stories 1 LeNet-5 LeNet-5 is designed for handwritten and machine-printed character recognition Live demo : http://yann.lecun.com/exdb/lenet/index.html 2 Sentiment Trees Predicting the sentiment of movie reviews. Live demo : http://nlp.stanford.edu:8080/sentiment/rntndemo.html Hamid Beigy Sharif university of technology September 16, 2018 19 / 21

Deep learning Outline of course Table of contents 1 Course Information 2 Introduction 3 Success stories 4 Outline of course Hamid Beigy Sharif university of technology September 16, 2018 19 / 21

Deep learning Outline of course Outline of course 1 Introduction 2 Review of machine learning and history of deep learning 3 Multi-layer perceptrons and Backpropagation (MLP) 4 Convolutional networks (CNN) 5 Sum-Product networks (SPN) 6 Recurrent networks (RNN) 7 Deep reinforcement learning (Deep RL) 8 Unsupervised deep methods 9 Auto-encoders (AE) 10 Generative Adversarial networks (GAN) 11 Variational Autoencoders (VAE) 12 Applications Text mining and natural language processing Computer vision 13 Advanced topics Hamid Beigy Sharif university of technology September 16, 2018 20 / 21

Deep learning Outline of course Reading Please read chapter 1 of Deep Learning Book. Hamid Beigy Sharif university of technology September 16, 2018 21 / 21