SATURN 2018 Tutorial May 8 Plano. [DISTRIBUTION STATEMENT A] Approved for public release and unlimited distribution.

Similar documents
Measurement & Analysis in the Real World

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Python Machine Learning

Lecture 1: Machine Learning Basics

CS Machine Learning

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

(Sub)Gradient Descent

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

CS 446: Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Lecture 1: Basic Concepts of Machine Learning

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

CSL465/603 - Machine Learning

Teaching Architecture Metamodel-First

Learning From the Past with Experiment Databases

Major Milestones, Team Activities, and Individual Deliverables

Assignment 1: Predicting Amazon Review Ratings

Switchboard Language Model Improvement with Conversational Data from Gigaword

Computer Science PhD Program Evaluation Proposal Based on Domain and Non-Domain Characteristics

Human Emotion Recognition From Speech

Laboratorio di Intelligenza Artificiale e Robotica

Model Ensemble for Click Prediction in Bing Search Ads

GACE Computer Science Assessment Test at a Glance

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Reducing Features to Improve Bug Prediction

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Artificial Neural Networks written examination

CAMP 4:4:3. Supplemental Tools

Rule Learning With Negation: Issues Regarding Effectiveness

Learning Methods for Fuzzy Systems

Time series prediction

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Using Moodle in ESOL Writing Classes

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

SCT Banner Student Fee Assessment Training Workbook October 2005 Release 7.2

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Course Groups and Coordinator Courses MyLab and Mastering for Blackboard Learn

SCT Banner Financial Aid Needs Analysis Training Workbook January 2005 Release 7

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Using EEG to Improve Massive Open Online Courses Feedback Interaction

Word Segmentation of Off-line Handwritten Documents

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Designing a Computer to Play Nim: A Mini-Capstone Project in Digital Design I

Introduction to Simulation

Physics 270: Experimental Physics

Applications of data mining algorithms to analysis of medical data

Create Quiz Questions

Generative models and adversarial training

Ministry of Education, Republic of Palau Executive Summary

Multivariate k-nearest Neighbor Regression for Time Series data -

THE DEPARTMENT OF DEFENSE HIGH LEVEL ARCHITECTURE. Richard M. Fujimoto

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Forget catastrophic forgetting: AI that learns after deployment

Knowledge-Based - Systems

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Guide to Teaching Computer Science

Practical Research Planning and Design Paul D. Leedy Jeanne Ellis Ormrod Tenth Edition

What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data

Indian Institute of Technology, Kanpur

Control Tutorials for MATLAB and Simulink

Pragmatic Use Case Writing

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

JOURNALISM 250 Visual Communication Spring 2014

COSI Meet the Majors Fall 17. Prof. Mitch Cherniack Undergraduate Advising Head (UAH), COSI Fall '17: Instructor COSI 29a

We re Listening Results Dashboard How To Guide

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Probabilistic Latent Semantic Analysis

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Universidade do Minho Escola de Engenharia

Moodle Goes Corporate: Leveraging Open Source

Implementing a tool to Support KAOS-Beta Process Model Using EPF

PM tutor. Estimate Activity Durations Part 2. Presented by Dipo Tepede, PMP, SSBB, MBA. Empowering Excellence. Powered by POeT Solvers Limited

Mining Association Rules in Student s Assessment Data

Finding Your Friends and Following Them to Where You Are

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Rule Learning with Negation: Issues Regarding Effectiveness

DegreeWorks Advisor Reference Guide

Kronos KnowledgePass TM

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Probability estimates in a scenario tree

Predicting Students Performance with SimStudent: Learning Cognitive Skills from Observation

Discriminative Learning of Beam-Search Heuristics for Planning

CNS 18 21th Communications and Networking Simulation Symposium

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Computerized Adaptive Psychological Testing A Personalisation Perspective

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Bootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition

The Good Judgment Project: A large scale test of different methods of combining expert predictions

The Moodle and joule 2 Teacher Toolkit

Transcription:

ACHINE LEARNING ESIGN, EMYSTIFIED SATURN 2018 Tutorial May 8 Plano

Copyright 2018 Carnegie Mellon University and Serge Haziyev. All Rights Reserved. This material is based upon work funded and supported by the Department of Defense under Contract No. FA8702-15-D-0002 with Carnegie Mellon University for the operation of the Software Engineering Institute, a federally funded research and development center. The view, opinions, and/or findings contained in this material are those of the author(s) and should not be construed as an official Government position, policy, or decision, unless designated by other documentation. NO WARRANTY. THIS CARNEGIE MELLON UNIVERSITY AND SOFTWARE ENGINEERING INSTITUTE MATERIAL IS FURNISHED ON AN "AS-IS" BASIS. CARNEGIE MELLON UNIVERSITY MAKES NO WARRANTIES OF ANY KIND, EITHER EXPRESSED OR IMPLIED, AS TO ANY MATTER INCLUDING, BUT NOT LIMITED TO, WARRANTY OF FITNESS FOR PURPOSE OR MERCHANTABILITY, EXCLUSIVITY, OR RESULTS OBTAINED FROM USE OF THE MATERIAL. CARNEGIE MELLON UNIVERSITY DOES NOT MAKE ANY WARRANTY OF ANY KIND WITH RESPECT TO FREEDOM FROM PATENT, TRADEMARK, OR COPYRIGHT INFRINGEMENT. [DISTRIBUTION STATEMENT A] This material has been approved Please see Copyright notice for non-us Government use and distribution. This material may be reproduced in its entirety, without modification, and freely distributed in written or electronic form without requesting formal permission. Permission is required for any other use. Requests for permission should be directed to the Software Engineering Institute at permission@sei.cmu.edu. DM18-0886

INTRODUCTIONS Rick Kazman Professor, University of Hawaii Research Scientist, SEI Serge Haziyev Head of Intelligent Enterprise, SoftServe Iurii Milovanov Data Science Practice Leader, SoftServe

AGENDA A Bit of Background Game Prototyping Summary & QA

MOTIVATION Clouds Mobile IoT Machine Learning Big Data Blockchain

ADD 3.0 ML Design Concepts: Problem Algorithm Family Algorithm

SMART DECISIONS GAME First presented at SATURN 2015 A fun, lightweight way to introduce architectural design and ADD Available at: http://smartdecisionsgame.com/

SHORT QUIZ What s the name of this company in AI field? 10x increase over the past 2 years!

AI PROGRESS SINCE 1950s Source: NVidia

MYTHS AND FICTION ABOUT ARTIFICIAL BEINGS R.U.R. (Karel Čapek) 1921 Golem (Bible) ~1000 BC Sumerian Anunnaki creating the first man ~2300 BC

THE CURRENT STATE OF AI 5 K 1M 16M 71M 760M 22B 86B Number of neurons

GAME CHALLENGE OVERVIEW Business Use Case Which ad will the user choose? Banner A Banner B

GAME CHALLENGE OVERVIEW Marketecture Diagram Ingest Web Servers Train Online user Hundreds of servers Massive logs from multiple sources Web Logs CTR Prediction System Predict CTR and show appropriate ad

WHY DO WE NEED MACHINE LEARNING?

WHY NOT JUST CODING?! Most of the today s AI problems: Deal with an infinite problem space think about how many words are there in the English language Poorly defined we still do not know how our brain solves problems Therefore, traditional rule-based handcoding for such problems suffers a 'complexity collapse' and is not feasible

MACHINE LEARNING APPROACH Instead of writing a program by hand, we use a set of examples to train the algorithm Developer writes code Algorithm writes code

ML BUILDING BLOCKS Training Data Testing Data Machine Learning Algorithm New Data Model Predictions

TYPES OF LEARNING

SUPERVISED LEARNING Input examples and corresponding ground truth outputs are provided The goal is to learn general rules that map a new example to the predicted output

SUPERVISED LEARNING Input Data Output Data Example: Given a set of house features along with corresponding house prices, predict a price for a new house based on its features (e.g. size, location, etc.)

UNSUPERVISED LEARNING Only input examples are provided No explicit information about ground truth The algorithm tries to discover the internal structure of the data based on some prior knowledge about desired outcome

UNSUPERVISED LEARNING Input Data Example: Given a set of customer transactions discover what would be the best way to group them into clusters based on customer similarity Output Preferences

THE SUPERVISED CURRENT LEARNING STATE OF AI UNSUPERVISED LEARNING

ITERATION 1: What type of learning best fits a given use case? Select from: supervised or unsupervised

ITERATION 1: Supervised or Unsupervised Learning? Historical Dataset Banner A Banner B Train Input: Logs ML Algorithm Predict Input: User Features Output: Most preferred banner to show Logs: User X Features Banner A Click 0 User Y Features Banner B Click 0 User X Features Banner B Click 1

MACHINE LEARNING CARDS

MACHINE LEARNING CARDS ITERATION 2: PROBLEM TYPE ITERATION 3a: ALGORITHM FAMILY ITERATION 3b: ML ALGORITHM

Legend: - Problem cards - Algorithm Family cards - Algorithm cards

PROBLEM TYPES

CLASSIFICATION Key Highlights: Identifies which category an object belongs to Supervised learning problem Examples: Detect fraudulent transactions (one-class) Categorize emails by spam or not spam (binary) Categorize articles based on their topic (multi-class) Detect objects on the image (multi-label)

REGRESSION Key Highlights: Predict a continuous value associated with an object Supervised learning problem Examples: Predict stock prices from market data Score a credit application based on historical data Estimate demand for a given product

CLUSTERING Key Highlights: Group similar objects into clusters Unsupervised learning problem Examples: Discover audiences to target on social networks Group checking data based on GEO-proximity Detect common topics in corporate knowledge base

ANOMALY DETECTION Key Highlights: Identify observations that do not conform to an expected pattern Addresses both supervised and unsupervised learning Examples: Identify fraudulent transactions or abnormal customer behavior In manufacturing, detect physical parts that are likely to fail in the near future

ITERATION 2: What type of problem best fits a given use case? Select problem card from: classification, regression, clustering or anomaly detection

ITERATION 2: What type of problem? Historical Dataset Banner A Banner B Train Input: Logs ML Algorithm Predict Input: User Features Output: Most preferred banner to show Logs: User X Features Banner A Click 0 User Y Features Banner B Click 0 User X Features Banner B Click 1

FAMILIES AND ALGORITHMS

CLASSIFICATION FAMILIES

CLASSIFICATION ALGORITHMS

DECISION DRIVERS

FAMILY DRIVERS Big Data scalability and ability to leverage from new data Small Data ability to learn from a few examples Imbalanced Data ability to distinguish rare events Results Interpretation human-friendly results Online Learning ability to continuously train from new data Ease of Use number of parameters to manually tune

ALGORITHM DRIVERS Accuracy ability to solve complex problems Training Speed training runtime performance Prediction Speed production runtime performance Overfitting Resistance ability to generalize to new data Probabilistic Interpretation return results as probabilities

ITERATION 3: Select a family and an algorithm card that would best fit a given use case Family Key Drivers: Big Data, Imbalanced Data, Ease of Use Algorithm Key Drivers: Accuracy, Training and Prediction Speed

DESIGN PROCESS

DESIGN PROCESS

PROTOTYPING AND EVALUATION SESSION

PROTOTYPING FOR EVALUATION

RESULTS SUMMARY Algorithm name Training Time Prediction Time Tuning Time Initial Accuracy Random Forest 2.61 0.47 94.44 81.61% 83.05% KNeighbors 0.41 44.29 84.27 80.57% 83.05% Logistic Regression 0.12 0.05 45.94 82.93% 82.93% MLP 0.80 0.08 164.04 66.25% 82.90% SVM 177.78 54.87 973.73 82.83% 82.83% Linear SVM 5.93 0.04 82.91 82.69% 82.69% Decision Trees 0.03 0.005 52.97 73.16% 82.36% Naive Bayes 0.02 0.01 0 78.46% 78.46% Final Accuracy

KEY TAKEAWAYS Machine Learning solution design is an iterative process ADD principles help make ML design decisions in a systematic way ML Cards aim to select candidate algorithms from a wide variety of alternatives Prototyping is necessary to validate design decisions

QUESTIONS? WE VE GOT THE ANSWERS.