Deep Learning is a new area of Machine Learning research, which has been introduced with the

Similar documents
ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Python Machine Learning

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Top US Tech Talent for the Top China Tech Company

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

arxiv: v4 [cs.cl] 28 Mar 2016

A Pipelined Approach for Iterative Software Process Model

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Forget catastrophic forgetting: AI that learns after deployment

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

arxiv: v1 [cs.cl] 2 Apr 2017

Learning Methods for Fuzzy Systems

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

A Neural Network GUI Tested on Text-To-Phoneme Mapping

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

THE world surrounding us involves multiple modalities

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Speech Recognition at ICSI: Broadcast News and beyond

Knowledge Transfer in Deep Convolutional Neural Nets

Residual Stacking of RNNs for Neural Machine Translation

arxiv: v1 [cs.lg] 15 Jun 2015

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Second Exam: Natural Language Parsing with Neural Networks

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Artificial Neural Networks written examination

arxiv: v1 [cs.cl] 27 Apr 2016

CSL465/603 - Machine Learning

arxiv: v1 [cs.cv] 10 May 2017

A study of speaker adaptation for DNN-based speech synthesis

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

The Enterprise Knowledge Portal: The Concept

Hard Drive 60 GB RAM 4 GB Graphics High powered graphics Input Power /1/50/60

Dropout improves Recurrent Neural Networks for Handwriting Recognition

Building a Vibrant Alumni Network

Testimony in front of the Assembly Committee on Jobs and the Economy Special Session Assembly Bill 1 Ray Cross, UW System President August 3, 2017

arxiv: v1 [cs.lg] 7 Apr 2015

arxiv: v3 [cs.cl] 7 Feb 2017

THE enormous growth of unstructured data, including

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak

Generative models and adversarial training

Model Ensemble for Click Prediction in Bing Search Ads

Getting Started with Deliberate Practice

Circuit Simulators: A Revolutionary E-Learning Platform

Lecture 1: Machine Learning Basics

ON THE USE OF WORD EMBEDDINGS ALONE TO

Cultivating DNN Diversity for Large Scale Video Labelling

2017 Florence, Italty Conference Abstract

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

arxiv: v1 [cs.dc] 19 May 2017

Human Emotion Recognition From Speech

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

arxiv: v1 [cs.lg] 20 Mar 2017

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

ENGINEERING FIRST YEAR GUIDE

Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures

COURSE LISTING. Courses Listed. Training for Cloud with SAP SuccessFactors in Integration. 23 November 2017 (08:13 GMT) Beginner.

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Modeling function word errors in DNN-HMM based LVCSR systems

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Preliminary AGENDA. Practical Applications of Load Resistance Factor Design for Foundation and Earth Retaining System Design and Construction

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Natural Language Processing. George Konidaris

Seminar - Organic Computing

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Axiom 2013 Team Description Paper

From Virtual University to Mobile Learning on the Digital Campus: Experiences from Implementing a Notebook-University

Statistics and Data Analytics Minor

INPE São José dos Campos

LEARN. LEAD. DISCOVER.

Modeling function word errors in DNN-HMM based LVCSR systems

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

(Sub)Gradient Descent

Opening Essay. Darrell A. Hamlin, Ph.D. Fort Hays State University

Software Maintenance

"On-board training tools for long term missions" Experiment Overview. 1. Abstract:

Eduroam Support Clinics What are they?

PeopleSoft Human Capital Management 9.2 (through Update Image 23) Hardware and Software Requirements

DNV GL Joint Industry Project: Decision Support for Dynamic Barrier Management

A deep architecture for non-projective dependency parsing

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

Environmental Science: Earth s Resources (National 3) level 3 (6 SCQF credit points)

PRODUCT COMPLEXITY: A NEW MODELLING COURSE IN THE INDUSTRIAL DESIGN PROGRAM AT THE UNIVERSITY OF TWENTE

Calibration of Confidence Measures in Speech Recognition

SEMI-SUPERVISED ENSEMBLE DNN ACOUSTIC MODEL TRAINING

Software Development: Programming Paradigms (SCQF level 8)

Transcription:

DEEP LEARNING Deep Learning Deep Learning is a new area of Machine Learning research, which has been introduced with the objective of moving Machine Learning closer to one of its original goals: Artificial Intelligence. This course is intended to give a holistic understanding on Deep Learning and its applications. It would cover: Basics of Neural Networks and Deep Learning Mathematics and Programming for Deep Learning Detailed application oriented approach towards learning Deep Learning Tensor Flow and Keras based practicals

DEEP LEARNING SUMMARY Objective Machine learning is one of the fastest-growing and most exciting fields out there, and deep learning represents its true bleeding edge. In this course, you ll develop a clear understanding of the motivation for deep learning, and design intelligent systems that learn from complex and/or large-scale datasets. We ll show you how to train and optimise basic neural networks, convolutional neural networks, and long short term memory networks. Complete learning systems in TensorFlow will be introduced with working examples. You will learn to solve new classes of problems that were once thought prohibitively challenging, and come to better appreciate the complex nature of human intelligence as you solve these same problems effortlessly using deep learning methods. Hardware and Software A laptop running 64 bit OS (Linux/OSX/Windows) Minimum 8 GB RAM There are several installers necessary, we would be sharing the installations in advance. Pre-requisites Understanding of programming concepts Programming knowledge of python is mandatory. Outline Session 1 - Deep Learning Concepts What is a neural network? Why Deep Learning? How to choose between deep neural networks? An old problem: The Vanishing Gradient Deep Belief Networks Session 2 - Deep Learning Concepts Continued Convolutional Networks Recurrent Nets Auto-encoders Recursive Neural Tensor Nets Where is Deep Learning Uses

DEEP LEARNING Session 3 - Platforms for Deep Learning What is a Deep Learning Platform? H2O.ai Dato GraphLab Session 4 - Deep Learning Software Libraries What are several Deep Learning Libraries? Tensorflow Theano Deeplearning4j Torch Caffe Session 5: Lab Introduction to tensor flow, setting up the Computation Graphs and MLPs Tensor flow Language Modeling with LSTMs One end to end use case Session 6: Deep Neural networks Theory of deep neural networks. Train a simple deep network. Effectively regularise a simple deep network. Train a competitive deep network via model exploration and hyper-parameter tuning. Session 7: Convolutional Neural Networks** Train a simple convolutional neural net. Explore the design space for convolutional nets. Session 8: Lab Setting up ANN Setting up DNN Setting up CNN Setting up RNN Session 9: Full Case Study: Deep Learning and NLP Introduction to NLP and Deep Learning Word Vector Representations: word2vec Word Window Classification and Neural Networks End-to-end models for Speech Processing

Session 10: Google Cloud Machine Learning platform Setting up Deep Learning algorithms on Google Cloud Platform Practical description of the platform Further references

COURSE STRUCTURE Day 1: Understanding of Neural Networks, Deep Learning and Libraries This day will be about underlying concepts of neural networks, deep learning and other libraries Topic Session From To Basics of Neural Networks and Deep Learning 1 9 AM 10:15 AM Deep Learning Concepts 2 10:30 AM 11:45 AM Platforms and Languages for Deep Learning 3 12:00 PM 1:15 PM Open Source Libraries for Deep Learning 4 2:15 PM 3:30 PM Lab 1: Installation, and Basics of Tensor Flow 5 3:45 PM 5:00 PM Day 2: Advanced Concepts of Deep Learning and Applications Application driven advanced concepts understanding of Deep Learning Topic Session From To Understanding ANN, CNN, RNN 1 9 AM 10:15 AM Lab2: Designing ANN using Tensor Flow 2 10:30 AM 11: 45 AM Lab2 (Continued): Designing CNN, RNN using Tensor Flow 3 12:00 PM 1:15 PM NLP using Deep Learning 4 2:15 PM 3:30 PM Deep Deployment in Google Cloud Platform 5 3:45 PM 5:00 PM

ABOUT INSTRUCTORS Vidyasagar Nallapati Vidyasagar N is a Data Scientist and Software Architect from Bangalore with close to 10 years of experience. He loves all the things mathematics, statistics, machine learning, developer tools, automation and anything that will make systems better and easier, which brought him to data science, system architecture, building large scale distributed services and designing rock-solid data pipelines. An alumni of IIT BHU and IIM Bangalore, Vidyasagar when not writing code can be found travelling, playing music and indulging in coffee. Jitendra R Jitendra, an alumnus of University of Mumbai and IIM Bangalore, is a Data Scientist with over 10 years of experience in Data Analytics and Data Engineering in industries as varied as Telecom, Energy, Pharma, Automotive, and Maritime. He is a joint secretary in the Analytics Society of India (ASI) and additionally guest faculty at IIM Bangalore. He has also presented numerous plenary sessions and papers at the International Conference on Business Analytics and Intelligence. On the academic front his work in analytics has been published and distributed by Harvard as case studies. Naveen Bhansali Naveen, an alumnus IIM Bangalore, is a Big Data Analyst with over 12 years of experience in Data Analytics and Data Engineering in industries as varied as Telecom, Energy, Automotive, and Finance. He is a guest faculty at IIM Bangalore and has also presented numerous plenary sessions and papers at the International Conference on Business Analytics and Intelligence.