Era of AI (Deep Learning) and harnessing its true potential


Artificial Intelligence (AI)
AI augments our brains with infallible memories and infallible calculators.
Humans and computers have become a tightly coupled cognitive unit.
AI is instrumental in disrupting various industries and is closely connected with research in areas such as medicine, environmental catastrophe, and eliminating poverty.

About Me: Gaurav Kumar
Analytics Consultant Lead - Opera Solutions
Senior Consultant - PwC
Principal Data Scientist - Infoedge
Senior Data Scientist - Experian (APAC)

Current State of AI
In 2016, companies invested $26BN - $39BN in AI.
External investment has grown 3X since 2013.
AI adoption is greatest among strong digital adopters.
AI is embedded in almost all devices we interact with: cars, mobiles, wearable devices, equipment, etc.
Amazon Kiva: $775M, click-to-ship cycle time
Netflix: recommendations impact $1B annually
Google: company strategy, Mobile first - AI first

Machine Learning - Background
Machine Learning is a type of Artificial Intelligence that provides computers with the ability to learn without being explicitly programmed.
It covers various techniques that can learn from and make predictions on data.
Diagram, training: labelled data -> machine learning algorithm -> learned model
Diagram, prediction: data -> learned model -> prediction

Machine Learning - Learning Approaches
Supervised Learning: learning with a labeled training set. Example: an email spam detector with a training set of already labeled emails.
Unsupervised Learning: discovering patterns in unlabeled data. Example: clustering similar documents based on their text content.
Reinforcement Learning: learning based on feedback or reward. Example: learning to play chess by winning or losing.

Types of Machine Learning
Problem types:
Classification (supervised, predictive)
Regression (supervised, predictive)
Clustering (unsupervised, descriptive)
Anomaly Detection (unsupervised, descriptive)
Figure: Comparison of ML Classifiers

What is Deep Learning?
A class of machine learning algorithms that use a cascade of multiple non-linear processing layers, where higher-level features are derived from lower-level features, to learn different representations of the data in each layer.
A deep neural network consists of a hierarchy of layers, whereby each layer transforms the input data into more abstract representations (e.g. edge -> nose -> face). The output layer combines those features to make predictions.
Figure labels: edges -> nose, ears, eyes -> faces
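The cascade of non-linear layers described above can be sketched in a few lines. This is only an illustration: the weights are hand-picked, hypothetical values (a real network would learn them), and tanh is an assumed choice of non-linearity that the slides do not specify.

```python
import math

def dense(inputs, weights, biases):
    """One fully connected layer followed by a tanh non-linearity."""
    return [
        math.tanh(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

# Hypothetical, hand-picked parameters; a trained network would learn these.
layer1_w = [[0.5, -0.2, 0.1], [0.3, 0.8, -0.5]]  # 3 inputs -> 2 features
layer1_b = [0.0, 0.1]
layer2_w = [[1.0, -1.0]]                          # 2 features -> 1 output
layer2_b = [0.0]

x = [1.0, 2.0, 3.0]
hidden = dense(x, layer1_w, layer1_b)        # lower-level representation
output = dense(hidden, layer2_w, layer2_b)   # derived from the layer below
print(hidden, output)
```

Each call to `dense` consumes the previous layer's output, which is the sense in which higher-level features are derived from lower-level ones.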

A new concept?
1958: Perceptron
1969: Perceptron criticized
Awkward silence (AI Winter)
1995: SVM reigns
1998: Convolutional Neural Networks for handwriting recognition
2006: Restricted Boltzmann Machine
2012: Google Brain Project on 16k cores
2012: AlexNet wins ImageNet

What Changed?
Big Data (Digitalization)
Computation (Moore's Law, GPUs)
Algorithmic Progress

Why so popular?
Traditional ML: input data -> feature engineering -> traditional learning algorithm
Deep learning: input data -> deep learning algorithm
Applications of Deep Learning: speech/audio processing, computer vision, natural language processing

Deep learning vs Humans
Chart: ImageNet, the computer vision World Cup (compared against human performance)
Chart: Deep Learning in Speech Recognition (compared against human performance)

How does it work?
Feed-Forward and Backpropagation
Optimization Algorithms - Comparison
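As a rough illustration of feed-forward and backpropagation, the sketch below trains a deliberately tiny network: one input, one tanh hidden unit, one linear output. The network shape, squared-error loss, starting weights, learning rate, and plain gradient descent are all assumptions made for this example, not details taken from the slides.

```python
import math

def forward(w1, w2, x):
    """Feed-forward pass: input -> tanh hidden unit -> linear output."""
    h = math.tanh(w1 * x)
    y_hat = w2 * h
    return h, y_hat

def loss_and_grads(w1, w2, x, y):
    """Squared-error loss and its gradients via the chain rule."""
    h, y_hat = forward(w1, w2, x)
    loss = 0.5 * (y_hat - y) ** 2
    d_y_hat = y_hat - y                 # dL/dy_hat
    d_w2 = d_y_hat * h                  # dL/dw2
    d_h = d_y_hat * w2                  # dL/dh, propagated backwards
    d_w1 = d_h * (1.0 - h * h) * x      # tanh'(z) = 1 - tanh(z)^2
    return loss, d_w1, d_w2

# Hypothetical starting weights, training example, and learning rate.
w1, w2 = 0.5, -0.3
x, y = 1.5, 0.8
learning_rate = 0.1

losses = []
for _ in range(200):
    loss, d_w1, d_w2 = loss_and_grads(w1, w2, x, y)
    losses.append(loss)
    w1 -= learning_rate * d_w1          # plain gradient descent step
    w2 -= learning_rate * d_w2

print(losses[0], losses[-1])
```

The optimization algorithms compared on the slide differ mainly in how they turn these gradients into weight updates; the update shown here is the simplest such rule.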

General Architecture - Deep Learning
A convolution layer is a feature detector that automatically learns to filter out unneeded information from an input by using a convolution kernel.
A convolutional neural network shares weights between local regions.
A recurrent neural network shares weights between time-steps. It maps an input sequence to an output sequence. The output vector's contents are influenced by the entire history of inputs.
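Both kinds of weight sharing can be shown with scalar toy versions. In this sketch the kernel, the recurrent weights, and the input sequences are hypothetical values chosen for illustration, and `conv1d` computes the sliding dot product (cross-correlation) that deep learning libraries commonly call convolution.

```python
import math

def conv1d(signal, kernel):
    """Slide one shared kernel over every local region of the signal."""
    k = len(kernel)
    return [
        sum(kernel[j] * signal[i + j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]

def rnn(inputs, w_in, w_rec, h0=0.0):
    """Apply the same two weights at every time-step, carrying state forward."""
    h = h0
    outputs = []
    for x in inputs:
        h = math.tanh(w_in * x + w_rec * h)
        outputs.append(h)
    return outputs

# A hypothetical edge-detecting kernel: responds where the signal changes.
edges = conv1d([0, 0, 1, 1, 1, 0], [-1, 1])

# The first input still influences the last output through the hidden state.
with_history = rnn([1.0, 0.0, 0.0], w_in=1.0, w_rec=0.5)
without_history = rnn([0.0, 0.0, 0.0], w_in=1.0, w_rec=0.5)
print(edges, with_history[-1], without_history[-1])
```

The same two-element kernel is reused at every position, and the same `w_in` and `w_rec` are reused at every time-step, so the number of parameters does not grow with the input length.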

Requirements
Large data set of good quality (input-output mappings)
Measurable and describable goals (define the cost)
Enough computing power (AWS GPU instance)
Deep learning excels in tasks where the basic unit (pixel, word) has very little meaning in itself, but the combination of such units has a useful meaning.

Popular Applications
Deep Q-Learning (Reinforcement Learning)
Deep Learning for Games (environments by Google DeepMind and OpenAI Gym)
AlphaGo: trained deep learning model beats the world champion in Go
Image Localization
Image Completion
Image Segmentation
Image Captioning
Image Compression
Image Transformation
Image Sharpening
Image Colorization
Image Augmentation
Syntax and Semantics
Adding Emotions
Summarization and Text Generation
Image generation via Generative Adversarial Networks
Fine Arts, Album Covers (Inceptionism)

Current State of Recommendation Engines
Collaborative Filtering
Peer information: preference information from various users
Generation of a User x Item (Utility) Matrix
Methods of performing collaborative filtering: user-user similarity, item-item similarity, SVD decomposition and similarity
Some drawbacks, but recommendations are relatively more relevant
Content-Based Recommender
Profile information: user-declared content is used to recommend
NLP applications on text (or image data)
Methods of performing content matching: creation of user features, creation of item features, cosine similarity between user and item
Time-consuming and difficult to productionize
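Item-item similarity on a User x Item utility matrix can be sketched as follows. The ratings are made-up toy data, and treating 0 as "not rated" inside the cosine computation is a simplifying assumption; production systems usually handle missing ratings more carefully.

```python
import math

# Hypothetical utility matrix: rows are users, columns are items, 0 = not rated.
utility = [
    [5, 4, 0, 1],
    [4, 5, 0, 0],
    [0, 1, 5, 4],
    [1, 0, 4, 5],
]

def item_vector(matrix, item):
    """Column of the utility matrix: every user's rating of one item."""
    return [row[item] for row in matrix]

def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def most_similar_item(matrix, item):
    """Index of the other item whose rating pattern is closest to `item`."""
    target = item_vector(matrix, item)
    scores = {
        other: cosine(target, item_vector(matrix, other))
        for other in range(len(matrix[0]))
        if other != item
    }
    return max(scores, key=scores.get)

print(most_similar_item(utility, 0), most_similar_item(utility, 2))
```

User-user similarity follows the same pattern with rows compared instead of columns.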

NLP - Application
"You shall know a word by the company it keeps" (Firth, J. R. 1957:11)
Embeddings are used to turn textual data (words, sentences, paragraphs) into high-dimensional vector representations and group them together with semantically similar data in a vector space. The computer can thereby detect similarities mathematically.
Woman - Man ≈ Aunt - Uncle
King - Male + Female ≈ Queen
Human - Animal ≈ Ethics
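The vector arithmetic behind such analogies can be illustrated with hand-made toy vectors. The three dimensions used here (royalty, maleness, family) and all values are hypothetical; learned embeddings have hundreds of dimensions with no such clean interpretation.

```python
import math

# Hypothetical 3-d toy embeddings: (royalty, maleness, family).
vectors = {
    "king":  [1.0, 1.0, 0.0],
    "queen": [1.0, -1.0, 0.0],
    "man":   [0.0, 1.0, 0.0],
    "woman": [0.0, -1.0, 0.0],
    "uncle": [0.0, 1.0, 1.0],
    "aunt":  [0.0, -1.0, 1.0],
}

def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def analogy(word, minus, plus):
    """Nearest remaining word to vector(word) - vector(minus) + vector(plus)."""
    query = [
        w - m + p
        for w, m, p in zip(vectors[word], vectors[minus], vectors[plus])
    ]
    candidates = {
        name: cosine(query, vec)
        for name, vec in vectors.items()
        if name not in (word, minus, plus)
    }
    return max(candidates, key=candidates.get)

print(analogy("king", "man", "woman"), analogy("uncle", "man", "woman"))
```

Excluding the query words from the candidates is a common convention when evaluating analogies, since the nearest vector is otherwise often one of the inputs.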

Create word embeddings or use off-the-shelf ones
Easiest approach: use learned word embeddings (GloVe, Stanford)
Google News: 3 billion word corpus
Twitter: 3 billion tweets
Wikipedia: 6 billion word corpus

Recommendation using Word/Doc Embeddings
For every user/item, based on the content:
1. Calculate Word2vec for each word in the document
2. Use a weighted average (weights from TF/IDF)
3. Use Doc2vec to find document representations
4. Use cosine similarity of User x Item for rank-ordering
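Steps 1, 2, and 4 can be sketched end to end with toy data. Everything here is an assumption for illustration: the 2-d `EMBEDDINGS` table stands in for real Word2vec vectors, the item descriptions and user profile are invented, and the smoothed IDF formula is one common variant. The TF-IDF weighted average also serves as the document representation, in place of an actual Doc2vec model (step 3).

```python
import math
from collections import Counter

# Hypothetical 2-d word vectors standing in for learned Word2vec embeddings.
EMBEDDINGS = {
    "python": [1.0, 0.0], "code": [0.9, 0.1], "data": [0.8, 0.2],
    "guitar": [0.0, 1.0], "music": [0.1, 0.9], "song": [0.2, 0.8],
}

def idf_weights(documents):
    """Smoothed inverse document frequency for every word in the corpus."""
    n_docs = len(documents)
    doc_freq = Counter(w for doc in documents for w in set(doc.split()))
    return {w: math.log((1 + n_docs) / (1 + df)) + 1.0 for w, df in doc_freq.items()}

def document_vector(text, idf):
    """TF-IDF weighted average of the word vectors found in `text`."""
    counts = Counter(w for w in text.split() if w in EMBEDDINGS)
    total = [0.0] * len(next(iter(EMBEDDINGS.values())))
    weight_sum = 0.0
    for word, tf in counts.items():
        weight = tf * idf.get(word, 1.0)
        weight_sum += weight
        for i, value in enumerate(EMBEDDINGS[word]):
            total[i] += weight * value
    return [v / weight_sum for v in total] if weight_sum else total

def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

# Invented item descriptions and user profile text.
items = {"ml_course": "python data code", "guitar_lessons": "guitar music song"}
user_profile = "python code code"

idf = idf_weights(list(items.values()) + [user_profile])
user_vec = document_vector(user_profile, idf)
ranking = sorted(
    ((name, cosine(user_vec, document_vector(text, idf))) for name, text in items.items()),
    key=lambda pair: pair[1],
    reverse=True,
)
print(ranking)
```

Words missing from the embedding table are skipped, which is one simple way to handle out-of-vocabulary terms.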

Adoption in RecSys
NLP embeddings are used in content-based recommendation engines to match relevant text metadata between users and products.
Model-based recommendation that mimics collaborative filtering, in multiple variants, has been proven and adopted in industry.
The paradigm is changing from static recommendations computed overnight to dynamic real-time recommendations that use current session information.

Thank You

Real Time Recommendations (Session Info)