L intelligence artificielle et l apprentissage profond appliqués à la maintenance prédictive

Size: px
Start display at page:

Download "L intelligence artificielle et l apprentissage profond appliqués à la maintenance prédictive"

Transcription

1 L intelligence artificielle et l apprentissage profond appliqués à la maintenance prédictive Stéphane Canu, INSA Rouen Normandy University asi.insa-rouen.fr/enseignants/~scanu Tech Hour Normandie AeroEspace 19 décembre 2017

2 Road map 1 L intelligence artificielle et l apprentissage automatique 2 L apprentissage profond (Deep learning) Du neurone aux réseaux profonds ImageNet La mode du deep 3 Quoi de neuf avec le deep learning? Big is beautiful Apprentissage et optimisation Architectures profondes 4 Conclusion

3 Airplane Health Management Safety inspection inspection visuelle tap test : Contrôle santé des structures composites Inspecter grace à ses sens voir, écouter et sentir (toucher) Thèse de Meriem GHRIB

4 Automated visual inspection Automated image processing aircobot.akka.eu/, donecle.com/about-donecle/ Automated exterior inspection of an aircraft with a pan-tilt-zoom camera mounted on a mobile robot, 2015 Apprendre à voir, écouter et sentir

5 Human vs. machine learning Child s learning capacities to walk: one year to speak: two years to think: the rest of my life

6 Apprendre à penser

7 Deux types d intelligence artificielle (AI) Artificial general intelligence (AGI), long term AI, short term AI, applied AI, Deep learning

8 Machine learning definition Machine Learning (T. Mitchell, 2006) A computer program CP learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E Key points experience E : data performance measure P : optimization tasks T : utility automatic translation play chess or go...do what humans do

9 Exemples of image processing related learning tasks T Recognize Predict Detect Read Localize Scene analysis

10 Exemples of image processing related learning tasks T Recognize Predict Detect Read Localize Scene analysis A single device can solve them all: our brain

11 Learn the features together with the prediction Zhao, Rui, et al. "Deep Learning and Its Applications to Machine Health Monitoring: A Survey." arxiv preprint (2016).

12 So far so good Machine learning is Example-based programming task data performance A lot of applications in computer vision (as human learn to see) detection recognition localization captioning... Deep learning the dream of using a single multi task device for all vision tasks feature learning is a challenge addressed by deep learning on many application deep learning give the best results

13 Road map 1 L intelligence artificielle et l apprentissage automatique 2 L apprentissage profond (Deep learning) Du neurone aux réseaux profonds ImageNet La mode du deep 3 Quoi de neuf avec le deep learning? Big is beautiful Apprentissage et optimisation Architectures profondes 4 Conclusion

14 Le Neurone biologique

15 Le Neurone formel (McCulloch & Pitts, 1943) Défini un hyperplan x 2 w t x + b = 0 x 1, w 1 x 2, w 2... x p, w p 1, b h y = ϕ(w t x + b) x 1 x entrée IR p w poids synaptiques, b biais ϕ fonction d activation y sortie IR ϕ(t) = tanh(t)

16 Le perceptron (Rosenblatt, 1957) p f (x) = H ϕ j (x)w j + w 0 j=1 Règle du Perceptron Tant que on n a pas convergé : 1 tirer un exemple (x i, y i ) 2 calculer la prédiction f (x i ) 3 adaptater les poids w w + ρ 2 (y i f (x i ))ϕ(x i ) Du point de vue optimisation n ˆR n (w) = max(0, y i w ϕ(x i )) i=1 Méthode de gradient stochastique

17 Non linearity combining linear neurons: the Xor case Alpaydın, Introduction to Machine Learning, 2010

18 Neural networks as universal approximator Running several neurons at the same time y = ϕ(w 3 h (2) ) h (2) = ϕ(w 2 h (1) ) h (1) = ϕ(w 1 x) x Multilayered neural networks in layers Use backpropagation to learn internal representation W 1, W 2, W 3 from L. Arnold PhD

19 The Asimov Institute:

20 OCR: the MNIST database (Y. LeCun, 1989) use convolution layers

21 The caltech 101 database (2004) 101 classes, 30 training images per category...and the winner is NOT a deep network dataset is too small use convolution + Recitification + Normalization + Pooling in What is the Best Multi-Stage Architecture for Object Recognition? Jarrett et al, 2009

22 Deep networks, big computers: use GPU (2009) Large-scale deep unsupervised learning using graphics processors R Raina, A Madhavan, AY Ng - Proceedings of the ACM, 2009

23 Big Data : the image net database (Deng et al., 2012) ImageNet = 15 million labeled high-resolution images of 22,000 categories. Large-Scale Visual Recognition Challenge (a subset of ImageNet) 1000 categories. 1.2 million training images, 50,000 validation images, 150,000 testing images.

24 A new fashion in image processing shallow approaches deep learning Y. LeCun StatLearn tutorial

25 karpathy s blog: karpathy.github.io/2014/09/02/what-i-learned-from-competing-against-a-convnet-on-imagenet/ ImageNet results 35 top-5 error in % 152 layers number of layers 30 human performance publication year 2012 Alex Net 2013 ZFNet 2014 VGG 2015 GoogLeNet / Inception 2016 Residual Network

26 Deep architecture and the image net (15%) The Alex Net architecture [Krizhevsky, Sutskever, Hinton, 2012] Convolution + Recitification (ReLU) + Normalization + Pooling 60 million parameters using 2 GPU 6 days regularization data augmentation dropout weight decay

27 From 15% to 7%: Inceptionism Network in a network (deep learning lecture at Udacity) Christian Szegedy et. al. Going deeper with convolutions. CVPR 2015.

28 From 7% to 3%: Residual Nets. Beating the gradient vanishing effect K. He et al, 2016

29 Learning Deep architecture min W IR d n f (x i, W ) y i 2 + λ w 2 i=1 d = n = 1, λ = f is a deep NN Y. Bengio tutorial

30 Deep learning and the industry M. Zukerberg show up at a deep learning research workshop (2013) the GAFAM they got the infrastructure (hard+software) they got the data deep learning bridges the gap between applications and ML Deep Learning As A Service a lot of available API (google, microsoft, Nvidia, Amazon... )

31 Deap learning applications image audio speech text (NLP) translation robotics playing games science (Higgs Boson) Mostly related with specialized low level perception

32 So far so good from the formal neuron to deep learning one neuron is a linear perceptron many layered neurons are non linear multilayered perceptrons deep networks is a new name for multilayered perceptrons deep learning breakthrough starts with ImageNet better than human performances on many perception tasks deep learning could transform almost any industry the AI revolution Neural networks+backpropagation exist since 1985 what s new?

33 Road map 1 L intelligence artificielle et l apprentissage automatique 2 L apprentissage profond (Deep learning) Du neurone aux réseaux profonds ImageNet La mode du deep 3 Quoi de neuf avec le deep learning? Big is beautiful Apprentissage et optimisation Architectures profondes 4 Conclusion

34 What s new with deep learning a lot of data (big data) big computing resources (hardware & software), big model (deep vs. shalow) new architectures new learning tricks from Recent advances in convolutional neural networks Gu et al. Pattern Recognition, 2017

35 Dealing with a lot of data. ImageNet: 1,200,000x256x256x3 (about 200GB) block of pixels Image understanding MS COCO for supervised learning YFCC100M for unsupervised learning Visual genome: data + knowledge Video understanding: Youtube 8M (1.71 Terabytes) Climate monitoring with HD images (about 15 TB and DL at 15PF)

36 Dans l industrie aéronautique crédit HP,

37 Andrew Ng basic recipe for machine learning Why is Deep Learning taking off? fuel = data engine = model (a deep network). Andrew Ng GTC 2015 Keynote, GPU Technology Nvidia

38 GPU needed Now 2 hours with Nvidia DGX-1, and enough Memory Yann LeCun: learning a relevant model takes 3 weeks

39 Deep learing frameworks Tensoflow (Google) is the most popular with Keras

40 How to start with deep learning? Andrej Karpathy, Deep Learning Summer School 2016

41 Success story: Updating Google Maps with Deep Learning Requirements Installed TensorFlow library 158Gb to download FSNS dataset: 16Gb of RAM (32Gb is recommended) training 60 h with GPU Titan X to train from scratch: python train.py to train a model using a pre-trained inception weights as initialization: wget tar xf inception_v3_2016_08_28.tar.gz python train.py --checkpoint_inception=inception_v3.ckpt

42 What s new with deep learning from Mastering the game of Go without human knowledge D. Silver et al. Nature, 550, 2017

43 Road map 1 L intelligence artificielle et l apprentissage automatique 2 L apprentissage profond (Deep learning) Du neurone aux réseaux profonds ImageNet La mode du deep 3 Quoi de neuf avec le deep learning? Big is beautiful Apprentissage et optimisation Architectures profondes 4 Conclusion

44 Conclusions so far so good deep learning major breakthrough better than human on low level perception tasks data needed complex algorithms the future of deep learning data: the new energy hard to compete with GAFAMs unsupervised learning (Generative Adversarial Networks) transfer learning (representation learning)

45 Pendant ce temps en Normandie, au LITIS, au GREYC 200 chercheurs en Normandie Universités et écoles d ingénieurs litislab.fr, greyc.fr NormaSTIC: fédération CNRS 20 en machine learning 20 en traitement d image Applications {x i, y i } Chemoinformatics BCI (interfaces cerveaux machines) Audio détection de piétons... Deep learning Boites à outil, CRIAN Deep in France (soutenu par l ANR) optimisation et ML, transport optimal et GAN

46 Shameless advertising: june in Normandy

47 To go further books I. Goodfellow, Y. Bengio & A. Courville, Deep Learning, MIT Press book, Gitbook leonardoaraujosantos.gitbooks.io/artificial-inteligence/ conferences NIPS, ICLR, xcml, AIStats, Journals JMLR, Machine Learning, Foundations and Trends in Machine Learning, machine learning survey lectures Deep Learning: Course by Yann LeCun at Collège de France in 2016 college-de-france.fr/site/en-yann-lecun/inaugural-lecture h00.htm Convolutional Neural Networks for Visual Recognition (Stanford) deep mind ( CS 229: Machine Learning at stanford Andrew Ng Blogs Andrej Karpathy blog (

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering

More information

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach #BaselOne7 Deep search Enhancing a search bar using machine learning Ilgün Ilgün & Cedric Reichenbach We are not researchers Outline I. Periscope: A search tool II. Goals III. Deep learning IV. Applying

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION Atul Laxman Katole 1, Krishna Prasad Yellapragada 1, Amish Kumar Bedi 1, Sehaj Singh Kalra 1 and Mynepalli Siva Chaitanya 1 1 Samsung

More information

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention Damien Teney 1, Peter Anderson 2*, David Golub 4*, Po-Sen Huang 3, Lei Zhang 3, Xiaodong He 3, Anton van den Hengel 1 1

More information

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma Adam Abdulhamid Stanford University 450 Serra Mall, Stanford, CA 94305 adama94@cs.stanford.edu Abstract With the introduction

More information

Lip Reading in Profile

Lip Reading in Profile CHUNG AND ZISSERMAN: BMVC AUTHOR GUIDELINES 1 Lip Reading in Profile Joon Son Chung http://wwwrobotsoxacuk/~joon Andrew Zisserman http://wwwrobotsoxacuk/~az Visual Geometry Group Department of Engineering

More information

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen TRANSFER LEARNING OF WEAKLY LABELLED AUDIO Aleksandr Diment, Tuomas Virtanen Tampere University of Technology Laboratory of Signal Processing Korkeakoulunkatu 1, 33720, Tampere, Finland firstname.lastname@tut.fi

More information

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation Chunpeng Wu 1, Wei Wen 1, Tariq Afzal 2, Yongmei Zhang 2, Yiran Chen 3, and Hai (Helen) Li 3 1 Electrical and

More information

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE EE-589 Introduction to Neural Assistant Prof. Dr. Turgay IBRIKCI Room # 305 (322) 338 6868 / 139 Wensdays 9:00-12:00 Course Outline The course is divided in two parts: theory and practice. 1. Theory covers

More information

THE enormous growth of unstructured data, including

THE enormous growth of unstructured data, including INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2014, VOL. 60, NO. 4, PP. 321 326 Manuscript received September 1, 2014; revised December 2014. DOI: 10.2478/eletel-2014-0042 Deep Image Features in

More information

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Read Online and Download Ebook ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Click link bellow and free register to download

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

Forget catastrophic forgetting: AI that learns after deployment

Forget catastrophic forgetting: AI that learns after deployment Forget catastrophic forgetting: AI that learns after deployment Anatoly Gorshechnikov CTO, Neurala 1 Neurala at a glance Programming neural networks on GPUs since circa 2 B.C. Founded in 2006 expecting

More information

arxiv: v1 [cs.lg] 15 Jun 2015

arxiv: v1 [cs.lg] 15 Jun 2015 Dual Memory Architectures for Fast Deep Learning of Stream Data via an Online-Incremental-Transfer Strategy arxiv:1506.04477v1 [cs.lg] 15 Jun 2015 Sang-Woo Lee Min-Oh Heo School of Computer Science and

More information

Generative models and adversarial training

Generative models and adversarial training Day 4 Lecture 1 Generative models and adversarial training Kevin McGuinness kevin.mcguinness@dcu.ie Research Fellow Insight Centre for Data Analytics Dublin City University What is a generative model?

More information

arxiv: v1 [cs.lg] 7 Apr 2015

arxiv: v1 [cs.lg] 7 Apr 2015 Transferring Knowledge from a RNN to a DNN William Chan 1, Nan Rosemary Ke 1, Ian Lane 1,2 Carnegie Mellon University 1 Electrical and Computer Engineering, 2 Language Technologies Institute Equal contribution

More information

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski Problem Statement and Background Given a collection of 8th grade science questions, possible answer

More information

Knowledge Transfer in Deep Convolutional Neural Nets

Knowledge Transfer in Deep Convolutional Neural Nets Knowledge Transfer in Deep Convolutional Neural Nets Steven Gutstein, Olac Fuentes and Eric Freudenthal Computer Science Department University of Texas at El Paso El Paso, Texas, 79968, U.S.A. Abstract

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

arxiv: v1 [cs.cl] 27 Apr 2016

arxiv: v1 [cs.cl] 27 Apr 2016 The IBM 2016 English Conversational Telephone Speech Recognition System George Saon, Tom Sercu, Steven Rennie and Hong-Kwang J. Kuo IBM T. J. Watson Research Center, Yorktown Heights, NY, 10598 gsaon@us.ibm.com

More information

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System QuickStroke: An Incremental On-line Chinese Handwriting Recognition System Nada P. Matić John C. Platt Λ Tony Wang y Synaptics, Inc. 2381 Bering Drive San Jose, CA 95131, USA Abstract This paper presents

More information

SORT: Second-Order Response Transform for Visual Recognition

SORT: Second-Order Response Transform for Visual Recognition SORT: Second-Order Response Transform for Visual Recognition Yan Wang 1, Lingxi Xie 2( ), Chenxi Liu 2, Siyuan Qiao 2 Ya Zhang 1( ), Wenjun Zhang 1, Qi Tian 3, Alan Yuille 2 1 Cooperative Medianet Innovation

More information

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008 Development of an IT Curriculum Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008 Curriculum A curriculum consists of everything that promotes learners intellectual, personal,

More information

arxiv:submit/ [cs.cv] 2 Aug 2017

arxiv:submit/ [cs.cv] 2 Aug 2017 Associative Domain Adaptation Philip Haeusser 1,2 haeusser@in.tum.de Thomas Frerix 1 Alexander Mordvintsev 2 thomas.frerix@tum.de moralex@google.com 1 Dept. of Informatics, TU Munich 2 Google, Inc. Daniel

More information

CSL465/603 - Machine Learning

CSL465/603 - Machine Learning CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am

More information

Artificial Neural Networks written examination

Artificial Neural Networks written examination 1 (8) Institutionen för informationsteknologi Olle Gällmo Universitetsadjunkt Adress: Lägerhyddsvägen 2 Box 337 751 05 Uppsala Artificial Neural Networks written examination Monday, May 15, 2006 9 00-14

More information

Second Exam: Natural Language Parsing with Neural Networks

Second Exam: Natural Language Parsing with Neural Networks Second Exam: Natural Language Parsing with Neural Networks James Cross May 21, 2015 Abstract With the advent of deep learning, there has been a recent resurgence of interest in the use of artificial neural

More information

Taxonomy-Regularized Semantic Deep Convolutional Neural Networks

Taxonomy-Regularized Semantic Deep Convolutional Neural Networks Taxonomy-Regularized Semantic Deep Convolutional Neural Networks Wonjoon Goo 1, Juyong Kim 1, Gunhee Kim 1, Sung Ju Hwang 2 1 Computer Science and Engineering, Seoul National University, Seoul, Korea 2

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

A Deep Bag-of-Features Model for Music Auto-Tagging

A Deep Bag-of-Features Model for Music Auto-Tagging 1 A Deep Bag-of-Features Model for Music Auto-Tagging Juhan Nam, Member, IEEE, Jorge Herrera, and Kyogu Lee, Senior Member, IEEE latter is often referred to as music annotation and retrieval, or simply

More information

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Basic Concepts of Machine Learning Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria FUZZY EXPERT SYSTEMS 16-18 18 February 2002 University of Damascus-Syria Dr. Kasim M. Al-Aubidy Computer Eng. Dept. Philadelphia University What is Expert Systems? ES are computer programs that emulate

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Xinying Song, Xiaodong He, Jianfeng Gao, Li Deng Microsoft Research, One Microsoft Way, Redmond, WA 98052, U.S.A.

More information

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Stephan Gouws and GJ van Rooyen MIH Medialab, Stellenbosch University SOUTH AFRICA {stephan,gvrooyen}@ml.sun.ac.za

More information

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD CS224d Deep Learning for Natural Language Processing, PhD Welcome 1. CS224d logis7cs 2. Introduc7on to NLP, deep learning and their intersec7on 2 Course Logis>cs Instructor: (Stanford PhD, 2014; now Founder/CEO

More information

arxiv: v2 [cs.cl] 26 Mar 2015

arxiv: v2 [cs.cl] 26 Mar 2015 Effective Use of Word Order for Text Categorization with Convolutional Neural Networks Rie Johnson RJ Research Consulting Tarrytown, NY, USA riejohnson@gmail.com Tong Zhang Baidu Inc., Beijing, China Rutgers

More information

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models Navdeep Jaitly 1, Vincent Vanhoucke 2, Geoffrey Hinton 1,2 1 University of Toronto 2 Google Inc. ndjaitly@cs.toronto.edu,

More information

(Sub)Gradient Descent

(Sub)Gradient Descent (Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include

More information

Cultivating DNN Diversity for Large Scale Video Labelling

Cultivating DNN Diversity for Large Scale Video Labelling Cultivating DNN Diversity for Large Scale Video Labelling Mikel Bober-Irizar mikel@mxbi.net Sameed Husain sameed.husain@surrey.ac.uk Miroslaw Bober m.bober@surrey.ac.uk Eng-Jon Ong e.ong@surrey.ac.uk Abstract

More information

Dropout improves Recurrent Neural Networks for Handwriting Recognition

Dropout improves Recurrent Neural Networks for Handwriting Recognition 2014 14th International Conference on Frontiers in Handwriting Recognition Dropout improves Recurrent Neural Networks for Handwriting Recognition Vu Pham,Théodore Bluche, Christopher Kermorvant, and Jérôme

More information

Top US Tech Talent for the Top China Tech Company

Top US Tech Talent for the Top China Tech Company THE FALL 2017 US RECRUITING TOUR Top US Tech Talent for the Top China Tech Company INTERVIEWS IN 7 CITIES Tour Schedule CITY Boston, MA New York, NY Pittsburgh, PA Urbana-Champaign, IL Ann Arbor, MI Los

More information

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Master of Science (M.S.) Major in Computer Science 1 MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Major Program The programs in computer science are designed to prepare students for doctoral research,

More information

arxiv: v4 [cs.cv] 13 Aug 2017

arxiv: v4 [cs.cv] 13 Aug 2017 Ruben Villegas 1 * Jimei Yang 2 Yuliang Zou 1 Sungryull Sohn 1 Xunyu Lin 3 Honglak Lee 1 4 arxiv:1704.05831v4 [cs.cv] 13 Aug 17 Abstract We propose a hierarchical approach for making long-term predictions

More information

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках

Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках Глубокие рекуррентные нейронные сети для аспектно-ориентированного анализа тональности отзывов пользователей на различных языках Тарасов Д. С. (dtarasov3@gmail.com) Интернет-портал reviewdot.ru, Казань,

More information

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS Sébastien GEORGE Christophe DESPRES Laboratoire d Informatique de l Université du Maine Avenue René Laennec, 72085 Le Mans Cedex 9, France

More information

arxiv: v1 [cs.cv] 10 May 2017

arxiv: v1 [cs.cv] 10 May 2017 Inferring and Executing Programs for Visual Reasoning Justin Johnson 1 Bharath Hariharan 2 Laurens van der Maaten 2 Judy Hoffman 1 Li Fei-Fei 1 C. Lawrence Zitnick 2 Ross Girshick 2 1 Stanford University

More information

arxiv: v2 [cs.cv] 4 Mar 2016

arxiv: v2 [cs.cv] 4 Mar 2016 MULTI-SCALE CONTEXT AGGREGATION BY DILATED CONVOLUTIONS Fisher Yu Princeton University Vladlen Koltun Intel Labs arxiv:1511.07122v2 [cs.cv] 4 Mar 2016 ABSTRACT State-of-the-art models for semantic segmentation

More information

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points) Information System Design and Development (Advanced Higher) Unit SCQF: level 7 (12 SCQF credit points) Unit code: H226 77 Unit outline The general aim of this Unit is for learners to develop a deep knowledge

More information

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits. DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE Sample 2-Year Academic Plan DRAFT Junior Year Summer (Bridge Quarter) Fall Winter Spring MMDP/GAME 124 GAME 310 GAME 318 GAME 330 Introduction to Maya

More information

Beginning to Flip/Enhance Your Classroom with Screencasting. Check out screencasting tools from (21 Things project)

Beginning to Flip/Enhance Your Classroom with Screencasting. Check out screencasting tools from  (21 Things project) Beginning to Flip/Enhance Your Classroom with Screencasting Check out screencasting tools from http://21things4teachers.net (21 Things project) This session Flipping out A beginning exploration of flipping

More information

1. Share the following information with your partner. Spell each name to your partner. Change roles. One object in the classroom:

1. Share the following information with your partner. Spell each name to your partner. Change roles. One object in the classroom: French 1A Final Examination Study Guide January 2015 Montgomery County Public Schools Name: Before you begin working on the study guide, organize your notes and vocabulary lists from semester A. Refer

More information

arxiv: v2 [stat.ml] 30 Apr 2016 ABSTRACT

arxiv: v2 [stat.ml] 30 Apr 2016 ABSTRACT UNSUPERVISED AND SEMI-SUPERVISED LEARNING WITH CATEGORICAL GENERATIVE ADVERSARIAL NETWORKS Jost Tobias Springenberg University of Freiburg 79110 Freiburg, Germany springj@cs.uni-freiburg.de arxiv:1511.06390v2

More information

Axiom 2013 Team Description Paper

Axiom 2013 Team Description Paper Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association

More information

Exemplar for Internal Achievement Standard French Level 1

Exemplar for Internal Achievement Standard French Level 1 Exemplar for internal assessment resource French for Achievement Standard 90882 Exemplar for Internal Achievement Standard French Level 1 This exemplar supports assessment against: Achievement Standard

More information

Image based Static Facial Expression Recognition with Multiple Deep Network Learning

Image based Static Facial Expression Recognition with Multiple Deep Network Learning Image based Static Facial Expression Recognition with Multiple Deep Network Learning ABSTRACT Zhiding Yu Carnegie Mellon University 5000 Forbes Ave Pittsburgh, PA 1521 yzhiding@andrew.cmu.edu We report

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Artificial Neural Networks

Artificial Neural Networks Artificial Neural Networks Andres Chavez Math 382/L T/Th 2:00-3:40 April 13, 2010 Chavez2 Abstract The main interest of this paper is Artificial Neural Networks (ANNs). A brief history of the development

More information

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence COURSE DESCRIPTION This course presents computing tools and concepts for all stages

More information

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES Po-Sen Huang, Kshitiz Kumar, Chaojun Liu, Yifan Gong, Li Deng Department of Electrical and Computer Engineering,

More information

Cooperative Systems Modeling, Example of a Cooperative e-maintenance System

Cooperative Systems Modeling, Example of a Cooperative e-maintenance System Cooperative Systems Modeling, Example of a Cooperative e-maintenance System David Saint-Voirin PhD Student LIFC 1 -LAB 2 saint-voirin@lifc.univ-fcomte.fr Christophe Lang Assistant Professor LIFC 1 lang@lifc.univ-fcomte.fr

More information

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology Tiancheng Zhao CMU-LTI-16-006 Language Technologies Institute School of Computer Science Carnegie Mellon

More information

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17. Semi-supervised methods of text processing, and an application to medical concept extraction Yacine Jernite Text-as-Data series September 17. 2015 What do we want from text? 1. Extract information 2. Link

More information

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com

More information

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors

Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-6) Dual-Memory Deep Learning Architectures for Lifelong Learning of Everyday Human Behaviors Sang-Woo Lee,

More information

Knowledge-Based - Systems

Knowledge-Based - Systems Knowledge-Based - Systems ; Rajendra Arvind Akerkar Chairman, Technomathematics Research Foundation and Senior Researcher, Western Norway Research institute Priti Srinivas Sajja Sardar Patel University

More information

Specification of a multilevel model for an individualized didactic planning: case of learning to read

Specification of a multilevel model for an individualized didactic planning: case of learning to read Specification of a multilevel model for an individualized didactic planning: case of learning to read Sofiane Aouag To cite this version: Sofiane Aouag. Specification of a multilevel model for an individualized

More information

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Massachusetts Institute of Technology Tel: Massachusetts Avenue  Room 32-D558 MA 02139 Hariharan Narayanan Massachusetts Institute of Technology Tel: 773.428.3115 LIDS har@mit.edu 77 Massachusetts Avenue http://www.mit.edu/~har Room 32-D558 MA 02139 EMPLOYMENT Massachusetts Institute of

More information

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation SLSP-2016 October 11-12 Natalia Tomashenko 1,2,3 natalia.tomashenko@univ-lemans.fr Yuri Khokhlov 3 khokhlov@speechpro.com Yannick

More information

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks POS tagging of Chinese Buddhist texts using Recurrent Neural Networks Longlu Qin Department of East Asian Languages and Cultures longlu@stanford.edu Abstract Chinese POS tagging, as one of the most important

More information

AI Agent for Ice Hockey Atari 2600

AI Agent for Ice Hockey Atari 2600 AI Agent for Ice Hockey Atari 2600 Emman Kabaghe (emmank@stanford.edu) Rajarshi Roy (rroy@stanford.edu) 1 Introduction In the reinforcement learning (RL) problem an agent autonomously learns a behavior

More information

A study of speaker adaptation for DNN-based speech synthesis

A study of speaker adaptation for DNN-based speech synthesis A study of speaker adaptation for DNN-based speech synthesis Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King The Centre for Speech Technology Research (CSTR) University of Edinburgh,

More information

Evolution of Symbolisation in Chimpanzees and Neural Nets

Evolution of Symbolisation in Chimpanzees and Neural Nets Evolution of Symbolisation in Chimpanzees and Neural Nets Angelo Cangelosi Centre for Neural and Adaptive Systems University of Plymouth (UK) a.cangelosi@plymouth.ac.uk Introduction Animal communication

More information

Georgetown University at TREC 2017 Dynamic Domain Track

Georgetown University at TREC 2017 Dynamic Domain Track Georgetown University at TREC 2017 Dynamic Domain Track Zhiwen Tang Georgetown University zt79@georgetown.edu Grace Hui Yang Georgetown University huiyang@cs.georgetown.edu Abstract TREC Dynamic Domain

More information

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach Tapio Heikkilä, Lars Dalgaard, Jukka Koskinen To cite this version: Tapio Heikkilä, Lars Dalgaard, Jukka Koskinen.

More information

Detecting English-French Cognates Using Orthographic Edit Distance

Detecting English-French Cognates Using Orthographic Edit Distance Detecting English-French Cognates Using Orthographic Edit Distance Qiongkai Xu 1,2, Albert Chen 1, Chang i 1 1 The Australian National University, College of Engineering and Computer Science 2 National

More information

Natural Language Processing. George Konidaris

Natural Language Processing. George Konidaris Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans

More information

arxiv: v2 [cs.ir] 22 Aug 2016

arxiv: v2 [cs.ir] 22 Aug 2016 Exploring Deep Space: Learning Personalized Ranking in a Semantic Space arxiv:1608.00276v2 [cs.ir] 22 Aug 2016 ABSTRACT Jeroen B. P. Vuurens The Hague University of Applied Science Delft University of

More information

ON THE USE OF WORD EMBEDDINGS ALONE TO

ON THE USE OF WORD EMBEDDINGS ALONE TO ON THE USE OF WORD EMBEDDINGS ALONE TO REPRESENT NATURAL LANGUAGE SEQUENCES Anonymous authors Paper under double-blind review ABSTRACT To construct representations for natural language sequences, information

More information

A Review: Speech Recognition with Deep Learning Methods

A Review: Speech Recognition with Deep Learning Methods Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 5, May 2015, pg.1017

More information

Language Acquisition French 2016

Language Acquisition French 2016 Unit title Key & Related Concepts Global context Statement of Inquiry MYP objectives ATL skills Content (topics, knowledge, skills) Unit 1 6 th grade Unit 2 Faisons Connaissance Getting to Know Each Other

More information

The Socially Structured Possibility to Pilot One s Transition by Paul Bélanger, Elaine Biron, Pierre Doray, Simon Cloutier, Olivier Meyer

The Socially Structured Possibility to Pilot One s Transition by Paul Bélanger, Elaine Biron, Pierre Doray, Simon Cloutier, Olivier Meyer The Socially Structured Possibility to Pilot One s by Paul Bélanger, Elaine Biron, Pierre Doray, Simon Cloutier, Olivier Meyer Toronto, June 2006 1 s, either professional or personal, are understood here

More information

*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE. Proceedings of the 9th Symposium on Legal Data Processing in Europe

*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE. Proceedings of the 9th Symposium on Legal Data Processing in Europe *** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE Proceedings of the 9th Symposium on Legal Data Processing in Europe Bonn, 10-12 October 1989 Systems based on artificial intelligence in the legal

More information

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA Testing a Moving Target How Do We Test Machine Learning Systems? Peter Varhol, Technology

More information

An OO Framework for building Intelligence and Learning properties in Software Agents

An OO Framework for building Intelligence and Learning properties in Software Agents An OO Framework for building Intelligence and Learning properties in Software Agents José A. R. P. Sardinha, Ruy L. Milidiú, Carlos J. P. Lucena, Patrick Paranhos Abstract Software agents are defined as

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

arxiv: v4 [cs.cl] 28 Mar 2016

arxiv: v4 [cs.cl] 28 Mar 2016 LSTM-BASED DEEP LEARNING MODELS FOR NON- FACTOID ANSWER SELECTION Ming Tan, Cicero dos Santos, Bing Xiang & Bowen Zhou IBM Watson Core Technologies Yorktown Heights, NY, USA {mingtan,cicerons,bingxia,zhou}@us.ibm.com

More information

Dialog-based Language Learning

Dialog-based Language Learning Dialog-based Language Learning Jason Weston Facebook AI Research, New York. jase@fb.com arxiv:1604.06045v4 [cs.cl] 20 May 2016 Abstract A long-term goal of machine learning research is to build an intelligent

More information

Transferring End-to-End Visuomotor Control from Simulation to Real World for a Multi-Stage Task

Transferring End-to-End Visuomotor Control from Simulation to Real World for a Multi-Stage Task Transferring End-to-End Visuomotor Control from Simulation to Real World for a Multi-Stage Task Stephen James Dyson Robotics Lab Imperial College London slj12@ic.ac.uk Andrew J. Davison Dyson Robotics

More information

WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web

WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web Hang Su Queen Mary University of London hang.su@qmul.ac.uk Shaogang Gong Queen Mary University of London s.gong@qmul.ac.uk Xiatian Zhu

More information

There are some definitions for what Word

There are some definitions for what Word Word Embeddings and Their Use In Sentence Classification Tasks Amit Mandelbaum Hebrew University of Jerusalm amit.mandelbaum@mail.huji.ac.il Adi Shalev bitan.adi@gmail.com arxiv:1610.08229v1 [cs.lg] 26

More information

Time series prediction

Time series prediction Chapter 13 Time series prediction Amaury Lendasse, Timo Honkela, Federico Pouzols, Antti Sorjamaa, Yoan Miche, Qi Yu, Eric Severin, Mark van Heeswijk, Erkki Oja, Francesco Corona, Elia Liitiäinen, Zhanxing

More information

Deep Neural Network Language Models

Deep Neural Network Language Models Deep Neural Network Language Models Ebru Arısoy, Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran IBM T.J. Watson Research Center Yorktown Heights, NY, 10598, USA {earisoy, tsainath, bedk, bhuvana}@us.ibm.com

More information

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing Ask Me Anything: Dynamic Memory Networks for Natural Language Processing Ankit Kumar*, Ozan Irsoy*, Peter Ondruska*, Mohit Iyyer*, James Bradbury, Ishaan Gulrajani*, Victor Zhong*, Romain Paulus, Richard

More information

Georgia Institute of Technology Graduate Curriculum Committee Minutes. January 20, 2011

Georgia Institute of Technology Graduate Curriculum Committee Minutes. January 20, 2011 Georgia Institute of Technology Graduate Curriculum Committee Minutes Present: Babensee (BME), Pikowsky (Registrar), Storici (BIO), Clarke (CoM), Flowers (ARCH), Mazalek (LCC), Silva (ECON), Corso (PSYC),

More information

Acquisition vs. Learning of a Second Language: English Negation

Acquisition vs. Learning of a Second Language: English Negation Interculturalia Acquisition vs. Learning of a Second Language: English Negation Oana BADEA Key-words: acquisition, learning, first/second language, English negation General Remarks on Theories of Second/

More information

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ; EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10 Instructor: Kang G. Shin, 4605 CSE, 763-0391; kgshin@umich.edu Number of credit hours: 4 Class meeting time and room: Regular classes: MW 10:30am noon

More information