APPLICATIONS OF DEEP LEARNING TO GEOINT


APPLICATIONS OF DEEP LEARNING TO GEOINT Jon Barker, Solutions Architect August 2015

Overview
- Motivation
- Introduction to Deep Learning
- GEOINT applications
- Deep Learning deployment
- Questions

Motivation
- 350 million images uploaded a day
- 100 hours of video uploaded every minute
- Tens of thousands of social and political events indexed daily
- Rapid growth in remote sensing numbers and capability
There is not enough time or expertise to write algorithms for each individual information extraction task that needs to be performed.
Deep Learning provides general algorithms that identify mission-relevant content and patterns in raw data at machine speed.

Motivation: Multi-INT analysis workflow
TODAY (big data bottleneck): numbers, images, sounds, videos, text -> metadata filters -> noisy content -> human perception (near-perfect perception) -> mission-focused analysis
VISION: numbers, images, sounds, videos, text -> automated machine perception (near-human level perception) -> semantic content based filters -> mission-relevant content -> mission-focused analysis

What is Deep Learning?
Deep Learning has become the most popular approach to developing Artificial Intelligence (AI): machines that perceive and understand the world.
The focus is currently on specific perceptual tasks, and there are many successes.
Today, some of the world's largest internet companies, as well as the foremost research institutions, are using GPUs for deep learning in research and production.
CUDA for Deep Learning

Practical Deep Learning Examples
- Image Classification, Object Detection, Localization, Action Recognition, Scene Understanding
- Speech Recognition, Speech Translation, Natural Language Processing
- Pedestrian Detection, Traffic Sign Recognition
- Breast Cancer Cell Mitosis Detection, Volumetric Brain Image Segmentation

Traditional Machine Perception: hand-crafted features
Pipeline: Raw data -> Feature extraction -> (Linear) Classifier -> Result
- Classifier, e.g. SVM
- e.g. HMM: Speaker ID, speech transcription
- e.g. LSA: Topic classification, machine translation, sentiment analysis

Deep Neural Network (DNN)
- Modern reincarnation of Artificial Neural Networks
- A very large collection of simple, trainable mathematical units
- Collectively they can learn very complex functions mapping raw data to decisions
- Loosely inspired by biological brains
Raw data -> Output decision (e.g. "dog")
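To make "a large collection of simple, trainable mathematical units" concrete, here is a minimal sketch of a forward pass in plain Python. The layer sizes, the tanh activation, the random weights and the three class names are illustrative assumptions, not details from the talk; a real network would be far larger and its weights would be learned rather than random.

```python
import math
import random

def dense_layer(inputs, weights, biases):
    """One layer of simple units: each unit is a weighted sum squashed by tanh."""
    return [math.tanh(sum(w * x for w, x in zip(unit_w, inputs)) + b)
            for unit_w, b in zip(weights, biases)]

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def init_layer(n_in, n_out, rng):
    """Random (untrained) weights and zero biases for a layer of n_out units."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out

def forward(raw, layers):
    """Map raw data to class probabilities by passing it through every layer."""
    activation = raw
    for weights, biases in layers[:-1]:
        activation = dense_layer(activation, weights, biases)
    weights, biases = layers[-1]
    scores = [sum(w * x for w, x in zip(unit_w, activation)) + b
              for unit_w, b in zip(weights, biases)]
    return softmax(scores)

rng = random.Random(0)
# Assumed toy architecture: 4 inputs -> 8 units -> 8 units -> 3 classes.
layers = [init_layer(4, 8, rng), init_layer(8, 8, rng), init_layer(8, 3, rng)]
CLASSES = ["dog", "cat", "raccoon"]  # hypothetical labels

probs = forward([0.2, -0.1, 0.7, 0.4], layers)
decision = CLASSES[probs.index(max(probs))]
print(decision, probs)
```

With untrained weights the decision is arbitrary; the point is only that the whole mapping from raw data to a decision is a composition of many identical simple units.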

Deep Learning approach
Train: the network sees labeled examples (dog, cat, raccoon, honey badger), and its classification errors are used to adjust it.
Deploy: the trained network labels a new image, e.g. "Dog".
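The train-then-deploy loop can be sketched with the smallest possible "network": a single logistic unit trained by stochastic gradient descent. Everything here is an assumption for illustration (the two made-up features, the four samples, the learning rate); the talk does not specify a training algorithm, and real deep networks apply the same error-driven update across many layers via backpropagation.

```python
import math

def predict(weights, bias, x):
    """Probability that sample x is a dog, from one sigmoid unit."""
    z = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1.0 / (1.0 + math.exp(-z))

def train(samples, labels, epochs=200, lr=0.5):
    """Adjust weights a little after every sample, in proportion to the error."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            error = predict(weights, bias, x) - y  # prediction minus truth
            weights = [w - lr * error * xi for w, xi in zip(weights, x)]
            bias -= lr * error
    return weights, bias

# Hypothetical 2-feature training data: label 1 = dog, label 0 = cat.
samples = [[0.9, 0.1], [0.8, 0.3], [0.2, 0.9], [0.1, 0.7]]
labels = [1, 1, 0, 0]

# Train: errors on labeled examples drive the weight updates.
weights, bias = train(samples, labels)

# Deploy: the frozen model labels a sample it has never seen.
new_sample = [0.85, 0.2]
label = "dog" if predict(weights, bias, new_sample) > 0.5 else "cat"
print(label)
```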

Deep Learning for Visual Perception
Application components:
- Task objective, e.g. identify face, classify age
- Training data: typically 10K to 100M samples
- Network architecture
- Learning algorithm
Biologically inspired Convolutional Neural Network (CNN): input pixels -> local receptive field -> output image class prediction
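A local receptive field means each output value depends only on a small patch of input pixels, with the same weights reused at every position. The sketch below shows that sliding-window operation in plain Python (strictly a cross-correlation, which is what CNN layers compute in practice). The 4x4 image and the hand-picked edge kernel are assumptions for illustration; in a CNN the kernel values are learned.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image; each output sees only one local patch."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]

# Toy 4x4 image: dark left half, bright right half.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Hand-picked 2x2 kernel that responds to dark-to-bright vertical edges.
kernel = [[-1, 1],
          [-1, 1]]

feature_map = conv2d(image, kernel)
print(feature_map)  # [[0, 2, 0], [0, 2, 0], [0, 2, 0]]
```

The response is nonzero only in the middle column, where the patch straddles the edge.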

Visual Perception: DL State of the Art
Imagenet Large Scale Visual Recognition Challenge [1] (example labels: person, car, helmet, motorcycle, bird, frog, dog, chair, hammer, flower pot, power drill)
- 1000 object classes
- 1.2 million training images
- Top-5 error (Google): 4.8%
- Top-5 error (Human): 5.1%
NORB dataset (2004) [2]
- 5 object classes
- Multiple views and illuminations
- 291,600 training images
- 58,320 test images
- <6% classification error on test set with cluttered backgrounds (NYU)

Deep Learning Dominates at Visual Perception
[Chart: GPU entries by year, 2010 to 2014; vertical axis 0 to 120; data labels 110, 60, 4]
1000 object classes, 1.2 million training images [1]
Top-5 error (Google): 4.8%
Top-5 error (Human): 5.1%
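The top-5 error figures count a prediction as wrong only when the true class is absent from the model's five highest-scoring classes. A minimal sketch of that metric follows; the six-class score vector and the labels are made-up values for illustration.

```python
def top_k_error(score_rows, true_labels, k=5):
    """Fraction of samples whose true class is not among the k best-scored classes."""
    misses = 0
    for scores, truth in zip(score_rows, true_labels):
        ranked = sorted(range(len(scores)), key=lambda c: scores[c], reverse=True)
        if truth not in ranked[:k]:
            misses += 1
    return misses / len(true_labels)

# Hypothetical scores over 6 classes, reused for 4 samples with different true labels.
scores = [0.10, 0.30, 0.20, 0.05, 0.25, 0.12]
score_rows = [scores] * 4
true_labels = [1, 3, 5, 2]

print(top_k_error(score_rows, true_labels, k=5))  # 0.25
print(top_k_error(score_rows, true_labels, k=1))  # 0.75
```

Only the sample whose true class received the lowest score counts as a top-5 miss, while three of the four samples are top-1 misses.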

Remote Sensing Imagery Exploitation
- Object detection and classification
- Scene segmentation
- Land usage classification
- Geologic feature classification
- Change detection
- Crop yield prediction
- Surface water estimation
- Population density estimation
- Super-resolution
- Photogrammetry
[3] Keio University, Japan, SPIE EI 2015
[4] University of Arizona

Deep Learning supports the analyst NVIDIA, 2015

Advanced Imaging Modalities
CNN architecture supports:
- MSI/HSI data cubes
- SAR imagery
- Volumetric data, e.g. LIDAR
Low-TRL research topics
[5] D. Maturana and S. Scherer. 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR. In ICRA, 2015.

Open-source Imagery Exploitation
- Object detection
- Scene labeling
- Face recognition
- Image geo-location estimation
- Text extraction from images
- Geographic property estimation
- Image de-noising
[6] Stanford University, NLP group

Deep Learning Dominates at Visual Perception NVIDIA, 2014


Deep Learning generalizes across problems
Varied data types (and multi-source), structured and unstructured: numbers, images, sounds, videos, text
Real-valued feature vector: x_1, x_2, x_3, ..., x_N
Varied tasks:
- Classification
- Regression
- Unsupervised learning
- Clustering
- Topic extraction
- Anomaly detection
- Sequence prediction
- Control policy learning
Constants: Big (high dimensional) Data + a complex function to learn

Geospatial Analytics
- 12 years of San Francisco crime reports
- Given date, time and location, DL model predicts crime
- Top-5 error: 59%
- ~4 hours work (including training) using open source tools
[10] Kaggle San Francisco Crime Classification Competition

Geospatial activity data
- Deep Neural Networks (DNNs) naturally ingest structured data
- Modern networks can learn complex predictive patterns including temporal sequences
Real-time destination prediction for taxis using DNN: Montreal Institute for Learning Algorithms (MILA), 2015

Sensor/Platform Control
Reinforcement learning: Δ(predicted future reward, actual reward)
Data sequence -> Planning + Control policy
Applications:
- Sensor tasking
- Autonomous vehicle navigation
[11] Google DeepMind in Nature
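The Δ(predicted future reward, actual reward) signal can be illustrated with a temporal-difference error, one standard way of turning that gap into a learning update. The slide does not name an algorithm, so the TD(0) rule below is an assumption, as are the three-state "sensor tasking" chain, the reward values, and the parameters `alpha` and `gamma`.

```python
def td_error(reward, value_next, value_current, gamma=0.9):
    """Gap between what actually happened plus the new estimate, and the old prediction."""
    return reward + gamma * value_next - value_current

def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.9):
    """Nudge the value estimate for `state` toward the observed outcome."""
    delta = td_error(reward, values[next_state], values[state], gamma)
    values[state] += alpha * delta
    return delta

# Hypothetical 3-state chain: search -> track -> done, with reward 1.0 for finishing.
values = {"search": 0.0, "track": 0.0, "done": 0.0}
episode = [("search", 0.0, "track"), ("track", 1.0, "done")]

for _ in range(200):  # replay the same data sequence many times
    for state, reward, next_state in episode:
        td0_update(values, state, reward, next_state)

print(values)  # "track" approaches 1.0 and "search" approaches 0.9
```

As the estimates converge the error shrinks toward zero, which is the sense in which the prediction and the actual reward come to agree.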

Why is Deep Learning hot now?
Three driving factors:
- Big Data availability
- New DL techniques
- GPU acceleration
Big Data examples: 350 million images uploaded per day; 2.5 petabytes of customer data hourly; 100 hours of video uploaded every minute

Why are GPUs good for deep learning?
Characteristics compared for neural networks and GPUs:
- Inherently parallel
- Matrix operations
- FLOPS
- Bandwidth
GPUs deliver:
- same or better prediction accuracy
- faster results
- smaller footprint
- lower power
- lower cost
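The "inherently parallel" and "matrix operations" points meet in a single observation: applying one network layer to a batch of samples is one matrix multiplication, and every entry of the result can be computed independently of the others. The sketch below shows this in plain Python with made-up numbers; it runs sequentially on a CPU and only illustrates the structure a GPU would exploit.

```python
def matmul(a, b):
    """Plain matrix product; each output entry is an independent dot product."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]

# Hypothetical batch of 3 samples with 2 features each.
batch = [[1.0, 2.0],
         [0.0, 1.0],
         [3.0, -1.0]]
# Hypothetical layer of 3 units, stored as a 2 x 3 weight matrix.
weights = [[0.5, -1.0, 2.0],
           [1.0, 0.0, -0.5]]

outputs = matmul(batch, weights)  # 3 samples x 3 units
print(outputs)  # [[2.5, -1.0, 1.0], [1.0, 0.0, -0.5], [0.5, -3.0, 6.5]]
```

None of the nine output entries depends on another, so in principle all nine could be computed at the same time.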

GPUs make deep learning accessible
"Deep learning with COTS HPC systems", A. Coates, B. Huval, T. Wang, D. Wu, A. Ng, B. Catanzaro, ICML 2013
"Now You Can Build Google's $1M Artificial Brain on the Cheap"

             GOOGLE DATACENTER    STANFORD AI LAB
Servers      1,000 CPU servers    3 GPU-accelerated servers
Processors   2,000 CPUs           12 GPUs
Cores        16,000               18,432
Power        600 kW               4 kW
Cost         $5,000,000           $33,000
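The size of the gap is easier to read as ratios. The short calculation below uses only the figures quoted on the slide; the dictionary keys are illustrative names.

```python
# Figures as quoted on the slide.
google = {"servers": 1000, "cores": 16000, "power_kw": 600, "cost_usd": 5_000_000}
stanford = {"servers": 3, "cores": 18432, "power_kw": 4, "cost_usd": 33_000}

# Ratio of the CPU cluster to the GPU cluster for each quantity.
ratios = {key: google[key] / stanford[key] for key in google}

for key, value in ratios.items():
    print(f"{key}: {value:.1f}x")
```

On these numbers the GPU system draws about 150 times less power and costs about 150 times less, on roughly 330 times fewer servers, while exposing slightly more cores.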

Deep Learning deployment options
Train: long training (hours to days), batch updates, leverage GPU acceleration
- Training data -> HPC data center or cloud -> classifier
Deploy: <100 ms response for new data sample, model interactivity
- Enterprise desktop (virtual or local)
- Stream processor
- Embedded/mobile systems

Deep Learning is a GEOINT force multiplier
Managing Big Data
- Real-time near-human level perception at web-scale
Integrates into analytical workflows
- Semantic content based filtering and search
- Drives data exploration and visualization
- Models improve based on analyst feedback
Scales across problems
- Models improve with more, varied data
- Models from one dataset can be leveraged in new problems
- Compact models can be easily shared and deployed

Summary
GPU accelerated Deep Learning is:
- Revolutionizing machine perception accuracy
- Adaptable to many varied GEOINT workflows and deployment scenarios
- Scalable: thrives on complex raw data
- Available to apply in production and R&D today

THANK YOU

Resources
Popular DL frameworks: Caffe (UC Berkeley), Theano (U Montreal), Torch, DIGITS
Examples from talk:
[1] Imagenet Large Scale Visual Recognition Challenge
[2] NORB dataset
[3] Keio University, Japan - Aerial image segmentation
[4] University of Arizona - Geographic feature detection
[5] D. Maturana and S. Scherer. 3D Convolutional Neural Networks for Landing Zone Detection from LiDAR. In ICRA, 2015.
[6], [8] Stanford NLP group - Deep Learning research
[9] Kaggle Taxi Trajectory Prediction Competition
[10] Kaggle San Francisco Crime Classification Competition
[11] Google DeepMind Nature article