Deep Learning Explained

Similar documents
Python Machine Learning

(Sub)Gradient Descent

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Generative models and adversarial training

CS Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

A Neural Network GUI Tested on Text-To-Phoneme Mapping

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Lecture 1: Machine Learning Basics

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Laboratorio di Intelligenza Artificiale e Robotica

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Forget catastrophic forgetting: AI that learns after deployment

Lecture 1: Basic Concepts of Machine Learning

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

CS 446: Machine Learning

Top US Tech Talent for the Top China Tech Company

Learning Methods for Fuzzy Systems

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

A study of speaker adaptation for DNN-based speech synthesis

Human Emotion Recognition From Speech

Artificial Neural Networks

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

Knowledge Transfer in Deep Convolutional Neural Nets

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Model Ensemble for Click Prediction in Bing Search Ads

Axiom 2013 Team Description Paper

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak

Multi-tasks Deep Learning Model for classifying MRI images of AD/MCI Patients

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Android App Development for Beginners

A virtual surveying fieldcourse for traversing

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL XXX, NO. XXX,

Second Exam: Natural Language Parsing with Neural Networks

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Radius STEM Readiness TM

Modeling function word errors in DNN-HMM based LVCSR systems

CS 3516: Computer Networks

arxiv: v1 [cs.lg] 7 Apr 2015

MYCIN. The MYCIN Task

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

Running Head: STUDENT CENTRIC INTEGRATED TECHNOLOGY

arxiv: v1 [cs.lg] 15 Jun 2015

Human-Computer Interaction CS Overview for Today. Who am I? 1/15/2012. Prof. Stephen Intille

Indian Institute of Technology, Kanpur

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Technological Educational Institute (TEI) of Crete School of Applied Sciences

Operational Knowledge Management: a way to manage competence

Software Development Plan

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Deep Neural Network Language Models

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

A Case Study: News Classification Based on Term Frequency

Emergent Narrative As A Novel Framework For Massively Collaborative Authoring

CSL465/603 - Machine Learning

Modeling function word errors in DNN-HMM based LVCSR systems

arxiv:submit/ [cs.cv] 2 Aug 2017

Evolutive Neural Net Fuzzy Filtering: Basic Description

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

CS 100: Principles of Computing

Automating the E-learning Personalization

Group Assignment: Software Evaluation Model. Team BinJack Adam Binet Aaron Jackson

Earthsoft s EQuIS Database Lower Duwamish Waterway Source Data Management

A Hands-on First-year Electrical Engineering Introduction Course

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

Dialog-based Language Learning

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Using Moodle in ESOL Writing Classes

A Review: Speech Recognition with Deep Learning Methods

BUS Computer Concepts and Applications for Business Fall 2012

THE world surrounding us involves multiple modalities

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

MINISTRY OF EDUCATION

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Knowledge-Based - Systems

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

GACE Computer Science Assessment Test at a Glance

Blackboard Communication Tools

SEMI-SUPERVISED ENSEMBLE DNN ACOUSTIC MODEL TRAINING

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Reducing Features to Improve Bug Prediction

Transcription:

Deep Learning Explained Module 1: Introduction and Overview Sayan D. Pathak, Ph.D., Principal ML Scientist, Microsoft Roland Fernandez, Senior Researcher, Microsoft

Course outline What is deep learning? Who are the audience? What to expect from this course?

What to expect Deep learning basics (6 modules): - Module 1: Introduction and high-level modeling workflow - Module 2: Logistic regression for optical character recognition - Module 3: Multi-layer perceptron - Module 4: Convolution networks - Module 5: Recurrence for time-series modeling - Module 6: Long-short term memory (LSTM) recurrence for text modeling Assignments: - 5 hands-on lab and 5 quizzes (required for certification) - Use Python Jupyter notebooks Upon completion, students will: - Have a working knowledge of deep learning concepts and algorithms - Be able to build deep neural network models - Solve practical problems in AI involving large data

Who are the audience? Data Scientists and Engineers who are new to deep learning Technical Managers who are interested and involved in development of AI based technologies Machine learning scientists looking to build deep models with agility and ease

Who are the instructors? Sayan Pathak, PhD. Industry - Principal ML Scientist @ Microsoft - Principal Investigator, National Institutes of Health funded projects in Healthcare - Allen Institute for Brain Sciences - Practical ML experience in Computer Vision, On-line Advertisement, Social Networks, Neuro and healthcare informatics Academic - Instructor at AI School @ Microsoft - Affl Faculty @ University of Washington in Bioengineering / EE (from 2001) - Affl Professor @ IIT (Indian Institute of Technology), Kharagpur in CS (from 2012) - Courses taught: - Image Computing Systems, Information Retrieval, Social Computing, Machine Learning

Who are the instructors? Roland Fernandez - Researcher and AI School instructor @ Microsoft - Research Areas: - reinforcement learning, autonomous multitask learning, symbolic representation, - information visualization, and Computer Human Interaction - Worked in the areas of Natural User Interfaces, activity based computing, advanced prototyping, programmer tools, operating systems, and databases. - Actively involved in AI education efforts within Microsoft

Programming environment Toolkit: - Microsoft Cognitive Toolkit (CNTK) Tutorials with Python Notebooks Environments: - Install locally (recommended) - Use pre-installed notebooks in Azure Notebooks (CPU only) for free - Azure Data Science Virtual Machines (sign up trial account)

References Deep Learning by I. Goodfellow, Y. Bengio & A. Courville, MIT Press, http://www.deeplearningbook.org, 2016 Stanford cs231n: http://cs231n.github.io/ Neural Networks Tutorials by G. Hinton https://www.cs.toronto.edu/~hinton/nntut.html

Acknowledgement Chris Basoglu, Ph.D., & Cha Zhang, Ph.D., @ Microsoft for several support and suggested improvement. Surya Pathak, Ph.D. Assoc. Prof. @ University of Washington for helping with pedagogical approach to teaching and content updates. Jonathan Sanito, Content Developer @ Microsoft, for assistance with graphics especially for the convolution networks. Nikos Karampatziakis, Ph.D., for help with course material preparation(especially the Text Classification tutorial).

Machine Learning (ML)?? Supervised Learning? Unsupervised Learning?? Semi - supervised Learning

Supervised Machine Learning Input Model Predicted Classification SPAM vs. Not SPAM Data: E-mail Label: Spam / not Spam Categories: Spam / Not Spam Regression? 850k Data: House features Label: Price House Price: a real number

Real-world applications Image - Autonomous Driving - Disease detection Text - Machine translation - Document comprehension Speech - Voice recognition - Speech to text https://en.wikipedia.org/wiki/autonomous_car

What is deep learning? Deep learning = Deep Neural Networks (DNN) - Mimics several layers in the brain Machine Learning DNN Deep Neural Networks - Have multiple layers - Each layer learns a higher abstraction on the input from the layer before it - Requires fitting a large number of parameters (100+ Millions) - Facilitated by (1) large amount of data and (2) computing capabilities Application domains - Image / Videos - Speech - Text - Multimodality and IOT data https://www.nimh.nih.gov/news/science-news/2013/transgenic-micelines-aid-in-brain-circuit-imaging.shtml

Solar panel Output (in W) ML recap y = m x +b Input x = Feature y Output y = observed output (labels) y*= predicted output x Average day temperature (in o F) x Model z(params) y* Model function (z) m: Slope b: Intercept are model parameters

ML Recap Input Model Predicted Data: E-mail Label: Spam / not Spam Classification Regression SPAM vs. Not SPAM Categories: Spam / Not Spam? 850k Data: House features Label: Price House Price: a real number

loss Train Workflow Data Sampler Features (x), Labels (Y) Training Data Train (learner) params update params Model z(params) Reporting Y Train more? iterations

loss Validation Workflow Data Sampler Features (x), Labels (Y) Training Data Validation Data Data Sampler Features (x), Labels (Y) Train (learner) params update params Model z(params) trained params Validate Reporting Reporting Y Train more? iterations Model final Test more? Y

Test Workflow Test Data Data Sampler Features (x), Labels (Y) Model final trained params Test Reporting Test more? Y

Where to begin with Cognitive Toolkit? On GitHub: https://github.com/microsoft/cntk/wiki Tutorials: https://www.cntk.ai/pythondocs/tutorials.html (latest release) https://github.com/microsoft/cntk/tree/master/tutorials (latest) Azure Notebooks: Try for free pre-hosted https://notebooks.azure.com/cntk/libraries/tutorials Seek help on Stack Overflow: http://stackoverflow.com/search?q=cntk (please add cntk tag) Seek help on Stack Overflow: http://stackoverflow.com/search?q=cntk (please add cntk tag)