SAP PREDICTIVE ANALYSIS. Ethan Durda InfoSol May 9, 2013

Similar documents
Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Python Machine Learning

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Getting Started with Deliberate Practice

Contents. Foreword... 5

Mining Association Rules in Student s Assessment Data

Research computing Results

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

A Case Study: News Classification Based on Term Frequency

Minitab Tutorial (Version 17+)

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

Computerized Adaptive Psychological Testing A Personalisation Perspective

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Lecture 1: Basic Concepts of Machine Learning

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

Simple Random Sample (SRS) & Voluntary Response Sample: Examples: A Voluntary Response Sample: Examples: Systematic Sample Best Used When

South Eastern User Group Meeting St Mary s Primary School - Dandenong 7 September am pm. Attendees

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Thesis-Proposal Outline/Template

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Automating the E-learning Personalization

Applications of data mining algorithms to analysis of medical data

The UNF Digital Commons

Bellevue University Bellevue, NE

Read the passage above. What does Chief Seattle believe about owning land?

WELCOME! Of Social Competency. Using Social Thinking and. Social Thinking and. the UCLA PEERS Program 5/1/2017. My Background/ Who Am I?

Process improvement, The Agile Way! By Ben Linders Published in Methods and Tools, winter

how download how free.

B. How to write a research paper

Functional Skills Mathematics Level 2 sample assessment

IN THIS UNIT YOU LEARN HOW TO: SPEAKING 1 Work in pairs. Discuss the questions. 2 Work with a new partner. Discuss the questions.

M55205-Mastering Microsoft Project 2016

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

LEARN TO PROGRAM, SECOND EDITION (THE FACETS OF RUBY SERIES) BY CHRIS PINE

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Rule Learning With Negation: Issues Regarding Effectiveness

Challenges in Deep Reinforcement Learning. Sergey Levine UC Berkeley

AP Statistics Summer Assignment 17-18

(Sub)Gradient Descent

A process by any other name

Grade 6: Correlated to AGS Basic Math Skills

Making Sales Calls. Watertown High School, Watertown, Massachusetts. 1 hour, 4 5 days per week

Evolution of Symbolisation in Chimpanzees and Neural Nets

success. It will place emphasis on:

Mike Cohn - background

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Committee Member Responsibilities

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Learning Methods for Fuzzy Systems

CSL465/603 - Machine Learning

Australian Journal of Basic and Applied Sciences

Virtually Anywhere Episodes 1 and 2. Teacher s Notes

On-Line Data Analytics

Assignment 1: Predicting Amazon Review Ratings

Office of Planning and Budgets. Provost Market for Fiscal Year Resource Guide

Unit 7 Data analysis and design

Word Segmentation of Off-line Handwritten Documents

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

CS 446: Machine Learning

PLANNING FOR K TO 12. Don Brodeth, CFA Taft Consulting Group

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach

ATW 202. Business Research Methods

Defragmenting Textual Data by Leveraging the Syntactic Structure of the English Language

Me on the Map. Standards: Objectives: Learning Activities:

The taming of the data:

KeyTrain Level 7. For. Level 7. Published by SAI Interactive, Inc., 340 Frazier Avenue, Chattanooga, TN

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Section 3.4. Logframe Module. This module will help you understand and use the logical framework in project design and proposal writing.

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

LIBRARY AND RECORDS AND ARCHIVES SERVICES STRATEGIC PLAN 2016 to 2020

Probability and Statistics Curriculum Pacing Guide

Software Maintenance

new research in learning and working

Running head: FINAL CASE STUDY, EDCI Addressing a Training Gap. Final Case Study. Anna Siracusa. Purdue University

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

State: Original. Status: Planned July 2015-June. State: Original. Status: Planned. July 2015-June. State: Original. Status: Planned.

Division Strategies: Partial Quotients. Fold-Up & Practice Resource for. Students, Parents. and Teachers

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Integrating simulation into the engineering curriculum: a case study

COURSE SYNOPSIS COURSE OBJECTIVES. UNIVERSITI SAINS MALAYSIA School of Management

Lecture 1: Machine Learning Basics

Answers To The Energy Bus Discussion Guide

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Let s think about how to multiply and divide fractions by fractions!

INSTRUCTOR USER MANUAL/HELP SECTION

UDW+ Student Data Dictionary Version 1.7 Program Services Office & Decision Support Group

Axiom 2013 Team Description Paper

DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES

By Davis King;Bridgett Larson

Local Artists in Yuma, AZ

Transcription:

SAP PREDICTIVE ANALYSIS Ethan Durda InfoSol May 9, 2013

AGENDA Introduction Landscape Review Basic Concepts Development Status Workflow and Methodology Use Case and Demo Conclusion Questions?

INTRODUCTION WHO? Hi, I m Ethan SAP Predictive Analysis (PA) is the latest iteration of advanced analytical tools from SAP Business Objects family Replaces in the stack the Business Objects Predictive Workbench which is a wrapper of IBM SPSS Competes with tools such as: Minitab SAS SPSS Excel!

INTRODUCTION WHAT? Advanced Analytics Is: the exploration and analysis, by automatic or semiautomatic means, of large quantities of data in order to discover meaningful patterns and rules. Gordon Linoff and Michael Berry Authors of Data Mining Techniques the process of discovering meaningful new correlations, patterns and trends by sifting through large amounts of data stored in repositories, using pattern recognition technologies as well as statistical and mathematical techniques. Gartner Group

INTRODUCTION WHY? Use cases include: Associate and Cluster data: What do my customers buy together? Amazon, Google, Netflix, you name it! Develop forecasts via Regression and Time Series Modeling: What is going to happen next and what has a bigger impact on what I care about most? Create Decision Trees and Neural Networks: Complex, unknown relationship development Create Outliers Reports: Find what data is statistically different enough from the rest of your data to investigate further

LANDSCAPE REVIEW From SAP

BASIC CONCEPTS / FAQ Does not require statistical knowledge/understanding Predictive Analysis is installed on a local machine Can almost be considered a wrapper program for three separate components: Data input/cleansing R library and native modeling (3,500+ open source algorithms) Visual intelligence output and visualization Designed for single user developing models, sharing work is clunky at best, but promised to get better No SDK until 1.1

DEVELOPMENT STATUS Regular and rapid updates 1.0.4 two months ago 1.0.10 now Focused on adding more visualizations and statistical models Still very much a 1.x application Limited functionality Fairly stable coming from someone who has never used it in anger SAP has big dreams! They see this competing head to head with SAS See it as a sales tool for the H word

WORKFLOW AND METHODOLOGY Import data into Predictive Analysis Limited cleansing on import Once in it is now a separate data set, but can be refreshed manually

WORKFLOW AND METHODOLOGY Enrich Assign attributes, create hierarchies, create formulas Very limited formulas promised to grow

WORKFLOW AND METHODOLOGY Visualize at this point or go straight to predict! Choose algorithm, data manipulation and output (if you choose)

WORKFLOW AND METHODOLOGY Run and review data and statistical feedback New data comes back as either fill or new columns as you choose Don t worry about what all this means, but it tells you how good your predictions are based on the data available and the choice of algorithm.

WORKFLOW AND METHODOLOGY Visualize!

WORKFLOW AND METHODOLOGY Share via Data Sets: File Export Publish to HANA Streamworks Explorer Visualizations: E-mail Notice anything missing?

USE CASE AND DEMO There are these crickets that keep me up at night While counting them I think that there might be a correlation between their chirps and the temperature I wonder how many chirps I d have to live with if the temperature got a lot hotter or colder? Time to do some math! So really, how does this apply to my life?

USE CASE AND DEMO So really, how does this apply to my life? Correlating data from one event to another we do constantly in our heads If we can do it systematically and consistently we will get better results than when it gets cold we sell more coffee If we know the formula we can see what we can do to tweak it, change new variables and see those impacts with other noise effects hidden: Did the new marketing strategy work or did the weather just do the trick? How much of an impact did the tuition rate increase have on new students? What impact does a 1600 SAT score have on student performance vs. their age or parent s education level?

CONCLUSION Pretty solid tool all things considered Still immature Worth looking into if you have an analytics team or want to Cash cost will be significantly lower than SAS not likely the others Business costs will be significantly lower across the board Take advantage of the current content and press for your needs! Anyone want to work on this with me?

QUESTIONS? 18