Welcome to SQL Saturday Denmark


Microsoft Azure Machine Learning Algorithms
Tomaž KAŠTRUN
@tomaz_tsql
Tomaz.kastrun@gmail.com
http://tomaztsql.wordpress.com

Thank you to our PLATINUM sponsors

Thank you to our GOLD and SILVER sponsors

Speaker info https://tomaztsql.wordpress.com

Agenda
Focus on explaining the algorithms available for predictive analytics in the Azure Machine Learning service.
Algorithms: 1) regression algorithms, 2) two-class classification, 3) multi-class classification, 4) clustering
- Explore the algorithms
- which algorithm is used and useful for what kind of empirical problem
- which is suitable for a particular data set.

Before we start... Why this session?!

First things first

First things something
By type:
- Categorical
  - Nominal: gender (male / female), eye colour (blue, green, black, ...), marital status (S, M, D, V, ...), buyer / non-buyer (0 / 1). No inherent order; all values are equal in representation.
  - Ordinal: education (primary, secondary, high school, ...), salary (<101, 101-200, 201-300, ...). Ordering can be applied; values can be compared with =, >, < and recoded with numbers / classes.
- Numerical
  - Interval: temperature (Celsius vs. Fahrenheit).
  - Ratio: height (173 cm, 196 cm), invoice value (24.4, 42.6, ...). Ordering can be applied; values can be compared with =, >, < and expressed as ratios (2 is twice as big as 1).
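The difference between the measurement levels shows up as soon as the values have to be turned into numbers for a model. The following is a minimal sketch in plain Python, not anything taken from Azure ML; the helper names (`encode_nominal`, `encode_ordinal`, `c_to_f`) and the level lists are assumptions made up for illustration.

```python
# Hypothetical level lists, loosely following the examples on the slide.
EYE_COLOURS = ["blue", "green", "black"]                   # nominal: no order
EDUCATION_ORDER = ["primary", "secondary", "high school"]  # ordinal: order matters


def encode_nominal(value, levels):
    """One-hot encode a nominal value, so no order is implied between levels."""
    return [1 if value == level else 0 for level in levels]


def encode_ordinal(value, ordered_levels):
    """Map an ordinal value to its rank (0, 1, 2, ...), preserving the order."""
    return ordered_levels.index(value)


def c_to_f(celsius):
    """Convert degrees Celsius to degrees Fahrenheit."""
    return celsius * 9 / 5 + 32


print(encode_nominal("green", EYE_COLOURS))          # [0, 1, 0]
print(encode_ordinal("secondary", EDUCATION_ORDER))  # 1

# Ratio scale: the ratio of two heights is meaningful.
print(round(196 / 173, 2))  # 1.13

# Interval scale: differences are meaningful, ratios are not.
# 20 C vs 10 C looks like "2x", but the same pair is 68 F vs 50 F.
print(20 / 10, c_to_f(20) / c_to_f(10))  # 2.0 1.36
```

The last line is the reason Celsius vs. Fahrenheit is a useful test: the ratio changes with the unit, so "twice as warm" is not a meaningful statement on an interval scale.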

For warmup...regression

First things second

Sample Data

1 General (inference) statistics
- Apply Math Operation -> Applies a mathematical operation to column values
- Compute Elementary Statistics -> Calculates specified summary statistics for selected dataset columns
- Compute Linear Correlation -> Calculates the linear correlation between column values in a dataset
- Descriptive Statistics -> Generates a basic descriptive statistics report for the columns in a dataset
- Evaluate Probability Function -> Fits a specified probability distribution function to a dataset
- Replace Discrete Values -> Replaces discrete values from one column with numeric values based on another column
- Test Hypothesis Using t-test -> Compares means from two datasets using a t-test
Source: https://msdn.microsoft.com/en-us/library/azure/dn905867.aspx
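To make three of these modules concrete (elementary statistics, linear correlation, and the t-test), here is a rough plain-Python equivalent. It is a sketch only and does not call Azure ML. The function names and sample values are invented for illustration, and the t statistic is the Welch (unequal variance) form, which is an assumption because the slide does not say which variant is meant.

```python
import math
import statistics


def pearson(xs, ys):
    """Pearson linear correlation between two equally long numeric columns."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def welch_t(a, b):
    """Welch's t statistic for the difference between the means of a and b."""
    va, vb = statistics.variance(a), statistics.variance(b)
    return (statistics.mean(a) - statistics.mean(b)) / math.sqrt(
        va / len(a) + vb / len(b)
    )


# Made-up sample columns (not data from the session).
height = [173, 196, 168, 181, 177]
invoice = [24.4, 42.6, 20.1, 30.3, 27.9]

summary = {
    "mean": statistics.mean(height),
    "median": statistics.median(height),
    "stdev": statistics.stdev(height),
}
print(summary)
print("correlation:", round(pearson(height, invoice), 3))
print("t statistic:", round(welch_t(height[:3], height[2:]), 3))
```

The t statistic on its own is not a decision; it still has to be compared against a t distribution with the appropriate degrees of freedom to obtain a p-value.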

1 General Statistics

1 Splitting data
Splitting mode:
- Split rows
- Recommender split
- Regular expression / Relative expression
Random seed
Stratified split
Which approach to use depends on the size of the dataset:
- If it is large enough, a 66% split is a good choice (66% for training and the rest for testing).
- If it is a moderately sized dataset, 10-fold cross-validation (or leave-one-out) can be a good choice.
- If the dataset is small, bootstrapping is a good choice.
Other options: jackknife, cross-validation, n-fold cross-validation, ...
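The 66% split and the k-fold idea can be sketched in a few lines of plain Python. This is an illustration under assumptions, not the Azure ML Split Data module: `split_rows` and `k_fold_indices` are hypothetical helper names, and the fixed `seed` plays the role of the "Random seed" setting so that the split is repeatable.

```python
import random


def split_rows(rows, train_fraction=0.66, seed=42):
    """Shuffle the rows with a fixed seed and cut them into train and test parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(len(shuffled) * train_fraction))
    return shuffled[:cut], shuffled[cut:]


def k_fold_indices(n_rows, k=10, seed=42):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k roughly equal folds
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


rows = list(range(100))
train, test = split_rows(rows)
print(len(train), len(test))  # 66 34

for train_idx, test_idx in k_fold_indices(20, k=10):
    print(len(train_idx), len(test_idx))  # 18 2 on every fold
```

Setting `k` equal to the number of rows gives leave-one-out: every fold then holds exactly one test row.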

1 Splitting data

1 Initializing Model

1 Initializing Model Making list of algorithms more transparent

Algorithms in Theory
Regression: Linear and Logistic
- Linear -> Azure ML: Linear regression
- Logistic -> Azure ML: Two-class classification logistic regression; Multiclass classification logistic regression

Algorithms in Theory
Decision Tree, Decision Forests, Decision Jungles
- Decision tree -> Azure ML: Regression boosted decision tree; Two-class classification boosted decision tree
- Decision forest -> Azure ML: Regression decision forest; Two-class classification decision forest; Multiclass classification decision forest
- Decision jungle -> Azure ML: Multiclass decision jungle; Two-class classification decision jungle

Algorithms in Theory
Naive Bayes
- Azure ML: Regression Bayesian linear; Two-class classification Bayes point machine

Algorithms in Theory
Neural networks and perceptrons
- Azure ML: Regression neural network; Two-class classification neural network; Multiclass classification neural network; Two-class classification averaged perceptron

Algorithms in Theory
SVM
- Azure ML: Two-class classification SVM; Two-class classification locally deep SVM; Anomaly detection SVM

1 Regression Algorithms

1 Regression Algorithms Parameters

1 Regression Algorithms

Evaluating Regression Algorithms
- Metrics that measure how close the predictions are to the eventual outcomes, that is, the differences between predicted and actual values.
- The coefficient of determination (R^2) summarizes how well the regression model fits the data; R^2 = 1 means the model fits perfectly.
Source: http://mund-consulting.com/blog/understanding-evaluate-model-in-microsoft-azure-machine-learning/
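The transcript does not name the individual error metrics, so the sketch below assumes two common ones, mean absolute error (MAE) and root mean squared error (RMSE), alongside R^2. It is plain Python for illustration; `regression_metrics` is a hypothetical helper, not an Azure ML function, and the salary figures are made up.

```python
import math


def regression_metrics(actual, predicted):
    """Return MAE, RMSE and R^2 for paired actual / predicted values."""
    n = len(actual)
    errors = [a - p for a, p in zip(actual, predicted)]
    mae = sum(abs(e) for e in errors) / n
    ss_res = sum(e * e for e in errors)  # residual sum of squares
    rmse = math.sqrt(ss_res / n)
    mean_actual = sum(actual) / n
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)  # total sum of squares
    r2 = 1 - ss_res / ss_tot
    return {"mae": mae, "rmse": rmse, "r2": r2}


# Made-up salaries and predictions.
actual = [2100, 2500, 3200, 4000]
predicted = [2000, 2600, 3100, 4200]
print(regression_metrics(actual, predicted))

# A perfect model gives R^2 = 1; always predicting the mean gives R^2 = 0.
print(regression_metrics([1, 2, 3], [1, 2, 3])["r2"])  # 1.0
print(regression_metrics([1, 2, 3], [2, 2, 2])["r2"])  # 0.0
```

R^2 can also be negative when a model predicts worse than the mean of the actual values.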

Evaluating Regression Algorithms Predicting Salary http://mund-consulting.com/blog/understanding-evaluate-model-in-microsoft-azure-machine-learning/

Comparison of Regression Algorithms

| Regression algorithm | Accuracy | Training time | Linearity | Customization | Predicting variable | Type of independent variable(s) | Data quantity |
|---|---|---|---|---|---|---|---|
| linear | Good | Fast | Excellent | Good | Interval | Any | small to big |
| Bayesian linear | Good | Fast | Excellent | Moderate | Interval | Any | big |
| decision forest | Excellent | Moderate | Good | Good | Interval | Any | |
| boosted decision tree | Excellent | Fast | Good | Good | Interval | Any | big |
| fast forest quantile | Excellent | Moderate | Moderate | Excellent | Distribution (Interval) | Any | |
| neural network | Excellent | Slow | Moderate | Excellent | Interval | Any | smaller |
| Poisson | Good | Moderate | Excellent* (log linear) | Good | Interval (counts) | Any | small to big |
| ordinal | Good | Moderate | Excellent | None | Ordinal (order) | Any | small to big |

Scale: Excellent / Good / Moderate; training time: Fast / Moderate / Slow

2 Two-class Classification

2 Two-class Classification parameters

Regularization weight

2 Two-class Classification

2 Evaluating two-class Classification
AUC/ROC bands:
- <= 0.5
- 0.5 to 0.6
- 0.6 to 0.7
- 0.7 to 0.8
- 0.8 to 0.9
- 0.9 to 1 -> WTF?
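To see where an AUC value comes from, it can be computed directly as the probability that a randomly chosen positive case is scored higher than a randomly chosen negative case (ties count as half). The sketch below is a plain-Python illustration with made-up labels and scores; `auc` is a hypothetical helper, and the pairwise loop is fine for small examples but too slow for large datasets.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(positives) * len(negatives))


# Made-up examples.
print(auc([0, 0, 1, 1], [0.1, 0.2, 0.8, 0.9]))   # 1.0, perfect separation
print(auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # 0.75
print(auc([0, 1], [0.5, 0.5]))                   # 0.5, no better than chance
```

A value very close to 1 on real data is often a reason to check for leakage of the target into the features rather than a reason to celebrate.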

2 Evaluating two-class Classification

Comparison of Two-class Classification Algorithms

| Two-class classification | Accuracy | Training time | Linearity | Customization | Predicting variable | Type of independent variable(s) | Data quantity |
|---|---|---|---|---|---|---|---|
| logistic regression | Good | Fast | Excellent | Good | dichotomous / binary | Any | small-big |
| decision forest | Excellent | Moderate | Good | Good | dichotomous / binary | Any | small-big |
| decision jungle | Excellent | Moderate | Good | Good | dichotomous / binary | Any | big |
| boosted decision tree | Excellent | Moderate | Good | Good | dichotomous / binary | Any | big |
| neural network | Excellent | Slow | Moderate | Excellent | dichotomous / binary | Any | |
| averaged perceptron | Good | Moderate | Excellent | Moderate | dichotomous / binary | Any | |
| support vector machine | Excellent | Moderate | Excellent | Good | dichotomous / binary | Any | big |
| locally deep support vector machine | Good | Slow | Good | Excellent | dichotomous / binary | Any | big |
| Bayes point machine | Moderate | Moderate | Excellent | Moderate | dichotomous / binary | Any | |

Scale: Excellent / Good / Moderate; training time: Fast / Moderate / Slow

3 Multi-class Classification

3 Multi-class Classification parameters

3 Multi-class Classification

3 Evaluating multi-class Classification

3 Comparison of Multi-class Classification

| Multi-class classification | Accuracy | Training time | Linearity | Customization | Predicting variable | Type of independent variable(s) | Data quantity |
|---|---|---|---|---|---|---|---|
| logistic regression | Good | Fast | Excellent | Good | Nominal / ordinal (with 2+ classes) | any | small-big |
| decision forest | Excellent | Moderate | Good | Good | Nominal / ordinal (with 2+ classes) | any | big |
| decision jungle | Excellent | Moderate | Good | Good | Nominal / ordinal (with 2+ classes) | any | big |
| neural network | Excellent | Slow | Moderate | Excellent | Nominal / ordinal (with 2+ classes) | any | small |

Scale: Excellent / Good / Moderate; training time: Fast / Moderate / Slow

4 Using Sweeping and SMOTE

5 Clustering

5 Evaluating Clustering

Key takeaways
https://azure.microsoft.com/en-us/documentation/articles/machine-learning-algorithm-cheat-sheet/
https://msdn.microsoft.com/en-us/library/azure/dn906033.aspx

ENJOY YOUR NEW MACHINE LEARNING PROJECT

Please review the event and sessions
EVENT: http://speakerscore.com/zgvx
SESSION: http://speakerscore.com/xyz
17 Sept 2016, SQL Saturday 541, Copenhagen