Certificate in Business Analytics Course outline for Collaborative Program

Similar documents
School of Innovative Technologies and Engineering

Python Machine Learning

Probability and Statistics Curriculum Pacing Guide

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

STA 225: Introductory Statistics (CT)

Given a directed graph G =(N A), where N is a set of m nodes and A. destination node, implying a direction for ow to follow. Arcs have limitations

GACE Computer Science Assessment Test at a Glance

Radius STEM Readiness TM

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Statewide Framework Document for:

Introduction to the Practice of Statistics

UNIT ONE Tools of Algebra

AN EXAMPLE OF THE GOMORY CUTTING PLANE ALGORITHM. max z = 3x 1 + 4x 2. 3x 1 x x x x N 2

Research Design & Analysis Made Easy! Brainstorming Worksheet

Visit us at:

Android App Development for Beginners

AP Statistics Summer Assignment 17-18

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

On-Line Data Analytics

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Using SAM Central With iread

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

BADM 641 (sec. 7D1) (on-line) Decision Analysis August 16 October 6, 2017 CRN: 83777

Lecture 1: Machine Learning Basics

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

Measurement & Analysis in the Real World

BMBF Project ROBUKOM: Robust Communication Networks

M55205-Mastering Microsoft Project 2016

Assignment 1: Predicting Amazon Review Ratings

Beyond the Blend: Optimizing the Use of your Learning Technologies. Bryan Chapman, Chapman Alliance

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

The use of mathematical programming with artificial intelligence and expert systems

Moodle MyFeedback update April 2017

Interactive Whiteboard

Math 96: Intermediate Algebra in Context

Probabilistic Latent Semantic Analysis

Shockwheat. Statistics 1, Activity 1

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

(Sub)Gradient Descent

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Grade 6: Correlated to AGS Basic Math Skills

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

A Note on Structuring Employability Skills for Accounting Students

Circuit Simulators: A Revolutionary E-Learning Platform

CS Machine Learning

Outreach Connect User Manual

AQUA: An Ontology-Driven Question Answering System

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

CSL465/603 - Machine Learning

Contact: For more information on Breakthrough visit or contact Carmel Crévola at Resources:

Individual Differences & Item Effects: How to test them, & how to test them well

COMMUNITY ENGAGEMENT

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

Missouri Mathematics Grade-Level Expectations

FIGURE IT OUT! MIDDLE SCHOOL TASKS. Texas Performance Standards Project

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

We re Listening Results Dashboard How To Guide

Implementing a tool to Support KAOS-Beta Process Model Using EPF

Moodle 2 Assignments. LATTC Faculty Technology Training Tutorial

Innovative Methods for Teaching Engineering Courses

Minitab Tutorial (Version 17+)

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Generative models and adversarial training

Lesson M4. page 1 of 2

Detailed course syllabus

MOODLE 2.0 GLOSSARY TUTORIALS

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Software Maintenance

ECE-492 SENIOR ADVANCED DESIGN PROJECT

Probability and Game Theory Course Syllabus

Self Study Report Computer Science

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

Ericsson Wallet Platform (EWP) 3.0 Training Programs. Catalog of Course Descriptions

Full text of O L O W Science As Inquiry conference. Science as Inquiry

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

OFFICE SUPPORT SPECIALIST Technical Diploma

Schoology Getting Started Guide for Teachers

Lecturing Module

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

Platform for the Development of Accessible Vocational Training

The Enterprise Knowledge Portal: The Concept

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

"On-board training tools for long term missions" Experiment Overview. 1. Abstract:

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Transcription:

Certificate in Business Analytics Course outline for Collaborative Program Program Audience: Undergraduate Students, Graduate Students Course Mode : Lecture, Tutorial, Practical Course Objectives This course is designed to enable learners with the crucial step in the Data Lifecycle: using data to make decisions! Possess the modeling skills needed by companies all over the world to go beyond storing data to understanding data Learn how to use these skills to make decisions such as cancer detection, fraud detection, customer segmentation and predicting machine downtime. Get introduced to the data mining process and modeling techniques using one of the most popular software, IBM's SPSS Modeler. Learn how to build models on trained data, test the model with historical data, and use qualifying models on live data or other historical untested data. Save or earn companies millions of dollars with your decisions! All the Faculty Members participating in the program will be provided relevant teaching aids after they complete the sessions. Teaching aids consist of Instructor Guide, Case Study presentation slide, access to online IBM Business Analytics @ Campus Portal Course Pre-requisite 3.1. A familiarity with the basic concepts in statistics will be useful for Predictive Analytics sessions 3.2. Participants must have working knowledge of Windows, MS Excel etc 3.3. Concepts of diverse data set will be useful

Program Content Basics of Statistics 32 Hours This module is divided into two parts Foundation of Statistics & Mathematical Optimization for Business Problems Foundation of Statistics This is a beginner's course covering the fundamentals of statistics. Start with mean, mode, and median. Then learn about standard deviation using examples from basketball. Learn about probability with dice. Learn what it means to group data by categorical variables, and how you can transform your data into appropriate graphs and charts. This course is taught using SPSS Statistics. No prior experience necessary. Module 1 - Welcome to Statistics! Welcome to Statistics Data visualization All about data SPSS Statistics SPSS Statistics in 15 minutes Module 2 - Basic Statistics Types of data Measures of dispersion Mean, median, mode Statistics by data type Probability Module 3 - Summarizing data Statistics by groups Visualization of group statistics Pivoting Cross-tabulations Correlation Module 4- Data Visualization Visualization fundamentals Descriptive and statistical charts Scatterplots

Statistical charts Time series charts Module 5 Descriptive Statistics Weighted means, standard deviations Data wrangling Descriptive Statistics Reproducibility with syntax in SPSS Statistics Mathematical Optimization for Business Problems Mathematical Programming is a powerful technique used to model and solve optimization problems. This training provides the necessary fundamentals of mathematical programming and useful tips for good modeling practice in order to construct simple optimization models. In this training, students will explore several aspects of mathematical programing to start learning more about constructing optimization models using IBM Decision Optimization technology, including: Basic terminology: operations research, mathematical optimization, and mathematical programming Basic elements of optimization models: data, decision variables, objective functions, and constraints Different types of solution: feasible, optimal, infeasible, and unbounded Mathematical programming techniques for optimization: Linear Programming, Integer Programming, Mixed Integer Programming, and Quadratic Programming Algorithms used for solving continuous linear programming problems: simplex, dual simplex, and barrier Important mathematical programming concepts: sparsity, uncertainty, periodicity, network structure, convexity, piecewise linear and nonlinear These concepts are illustrated by concrete examples, including a production problem and different network models. Module 1 - The Big Picture What is Operations Research? What is Optimization? Optimization models Module 2 - Linear Programming

Introduction to Linear programming A production problem : Part 1 - Writing the model A production problem : Part 2 - Finding a solution A production problem : Part 3 - From feasibility to unboundedness Algorithms for solving linear programs : Part 1 - The Simplex and Dual Simplex Algorithm Algorithms for solving linear programs : Part 2 - The Simplex and Barrier methods Module 3 - Network Models Introduction to Network Models The Transportation problem The Transshipment problem The Assignment problem The shortest path problem Critical path analysis Module 4 - Beyond simple LP Nonlinearity and Convexity Piecewise linear programming Integer programming The branch and bound method Quadratic Programming Module 5 - Modelling Practice Modelling the real world The importance of Sparsity Tips for better models Foundation program in R Programming 24 Hours R is a powerful language for data analysis, data visualization, machine learning, and statistics. Originally developed for statistical programming, it is now one of the most popular languages in data science. In this course, participants will be learning about the basics of R, and will end with the confidence to start writing your own R scripts. But this isn't a typical textbook introduction to R. Not just learning about R fundamentals, they shall be using R to solve problems related to movies data. Using a concrete example makes the learning painless. Learn about the fundamentals of R syntax, including assigning variables and doing simple operations with one of R's most important data structures -- vectors! From vectors, you'll then learn about lists, matrix, arrays and data frames. Then jump into conditional statements, functions, classes and debugging. Further the module guides on how "A picture is worth a thousand words". We are all familiar with this expression. It especially applies when trying to explain the insight

obtained from the analysis of increasingly large data sets. Data visualization plays an essential role in the representation of both small and large scale data. One of the key skills of a data scientist is the ability to tell a compelling story, visualizing data and findings in an approachable and stimulating way. Learning how to leverage a software tool to visualize data will also enable you to extract information, better understand the data, and make more effective decisions. The main goal of this course is to teach you how to take data that at first glance has little meaning and present that data in a form that makes sense to people. Various techniques have been developed for presenting data visually but in this course, we will be using the open source language R. Module 1 - R basics Math, Variables, and Strings Vectors and Factors Vector operations Module 2 - Data structures in R Arrays & Matrices Lists Dataframes Module 3 - R programming fundamentals Conditions and loops Functions in R Objects and Classes Debugging Module 4 - Working with data in R Reading CSV and Excel Files Reading text files Writing and saving data objects to file in R Module 5 - Strings and Dates in R String operations in R Regular Expressions Dates in R Module 6 - Basic Visualization Tools Bar Charts Histograms Pie Charts Module 7 - Basic Visualization Tools Continued

Scatter Plots Line Plots and Regression Module 8 - Specialized Visualization Tools Word Clouds Radar Charts Waffle Charts Box Plots Module 9 - How to create Maps Creating Maps in R Digital Analytics & Regression 8 Hours Follow a case study where you define the business objective, establish the data required to address that objective, and use R, the programming language, to derive insights from the data. As with any business challenge, you will be required to articulate your findings to a business audience. Learn basic concepts in statistics with step-by-step guidance on how to conduct an analysis to solve the business problem. Data Science is like triathlon. Programming is cycling, by far the biggest investment is required in hardware and software. Running is domain expertise and communication skills and, swimming is mathematics, statistics and modelling. There are competitions in each of these disciplines, cycling, running and swimming (and there always will be), but the need for super athletes who can do all 3 is growing. An athlete who is brilliant at one discipline can learn the other two and succeed in the triathlon. Module 1 - A Case Study Approach to Analytics Understand the business context Formulate the business objective State the hypothesis Assess available data Assign data for use Module 2 - Data Scientist Workbench Using Data Scientist Workbench What is R? Loading data into R with Data Scientist Workbench Upload a CSV data file into Data Scientist Workbench and RStudio Module 3 - Google Trends Data in R Access Google Trends data in R Module 4 - Simple Linear Regression in R

Regression and Google Trends Data in R Box Plots and Histograms in R Scatter Plots & Lines of best fit in R Simple Linear Regression in R Module 5 - Presenting Data Analytics in Business Using data to answer a business question Summarizing the data analytics process Presenting data insights Cognitive analytics using Watson 16 Hours Analyzing data has no value if you cannot tell a story of insight out of it. This is where tools like Watson Analytics can take your analysis to the next level. Watson Analytics offers you the benefits of advanced analytics without the complexity. Use Excel to get you started and put you in the right analytical mindset. Learn, in a few hours, how to perform data exploration, predictive analytics and effortless dashboard and infographic creation. Get answers and new insights to make confident decisions in minutes all on your own! Module 1 - Overview of Watson Analytics Identify Watson Analytics components and the user interface (Overview) Module 2 - Data Identify data structure limitations and requirements Upload data from flat files Shape data before uploading Upload data from Twitter Module 3 - Work with data connections Create a secure connection Create a secure gateway Module 4 - Refine data Join data sets Add a calculation to the data set Add a data group to the data set Add a hierarchy Module 5 - Discover patterns, relationships, and predictive insights Change column headings Sort data Perform analysis using natural language questions Edit targets in a predictive analysis Navigate the spiral in a predictive analysis

View decision rules and the decision tree Module 6 - Provide added value to the analysis Share the analysis through email Module 7 - Assemble a display Change visualization types in a display Modify properties of a display Filter visualizations in a display Module 8 - Control access Share assets and set access permissions References - Create and Use expert Storybooks Author an expert storybook Use an expert storybook Predictive Analytics Modeling 40 Hours The Predictive Analytics Modeler will learn the essential analytics models to collect and analyze data efficiently. This will require skills in predictive analytics models, such as data mining, data collection and integration, nodes, and statistical analysis. The Predictive Analytics Modeler will use tools for market research and data mining in order to predict problems and improve outcomes. Module 1 - Introduction to Data Mining CRISP-DM Methodology Introduction to SPSS Modeler - predictive data mining workbench SPSS Modeler Interface Module 2 - The Data Mining Process Business Understanding Data Understanding Data Preparation Module 3 - Modeling Techniques Introduction to Common Modeling Techniques Cluster Analysis (Unsupervised Learning) Classification & Prediction (Supervised Learning) Classification - Training & Testing Sampling Data in Classification Predictive Modeling Algorithms in SPSS Modeler Automated Selection of Algorithms Module 4 - Model Evaluation Metrics for Performance Evaluation Accuracy as Performance Evaluation tool Overcoming Limitations of Accuracy Measure

ROC Curves Module - Predictive Analytics Modeler Integrating data (methods, options, merging, and sampling) Deriving and reclassifying fields (CLEM) Looking for relationships (matrix, distribution, means, histogram, statistics and plot) Functions (conversion, string, and statistical) Data transformation Statistical. graphical and sample nodes Automated data mining and modeling Predictive models and customer segmentation