Azure Machine Learning. Designing Iris Multi-Class Classifier

Similar documents
Python Machine Learning

Laboratorio di Intelligenza Artificiale e Robotica

CS Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Machine Learning Basics

Laboratorio di Intelligenza Artificiale e Robotica

Lecture 1: Basic Concepts of Machine Learning

Rule Learning With Negation: Issues Regarding Effectiveness

Word Segmentation of Off-line Handwritten Documents

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

(Sub)Gradient Descent

Human Emotion Recognition From Speech

16.1 Lesson: Putting it into practice - isikhnas

CSL465/603 - Machine Learning

Rule Learning with Negation: Issues Regarding Effectiveness

Storytelling Made Simple

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Learning Microsoft Office Excel

EdX Learner s Guide. Release

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

TeacherPlus Gradebook HTML5 Guide LEARN OUR SOFTWARE STEP BY STEP

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Introduction to Moodle

Linking Task: Identifying authors and book titles in verbose queries

A Case Study: News Classification Based on Term Frequency

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Education the telstra BLuEPRint

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

Houghton Mifflin Online Assessment System Walkthrough Guide

Creating an Online Test. **This document was revised for the use of Plano ISD teachers and staff.

Top US Tech Talent for the Top China Tech Company

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Assignment 1: Predicting Amazon Review Ratings

MyUni - Turnitin Assignments

Welcome to California Colleges, Platform Exploration (6.1) Goal: Students will familiarize themselves with the CaliforniaColleges.edu platform.

Time series prediction

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Millersville University Degree Works Training User Guide

Three Strategies for Open Source Deployment: Substitution, Innovation, and Knowledge Reuse

Carnegie Mellon University Department of Computer Science /615 - Database Applications C. Faloutsos & A. Pavlo, Spring 2014.

Artificial Neural Networks written examination

TotalLMS. Getting Started with SumTotal: Learner Mode

November 17, 2017 ARIZONA STATE UNIVERSITY. ADDENDUM 3 RFP Digital Integrated Enrollment Support for Students

Generative models and adversarial training

Your School and You. Guide for Administrators

Computerized Adaptive Psychological Testing A Personalisation Perspective

Excel Intermediate

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Manipulative Mathematics Using Manipulatives to Promote Understanding of Math Concepts

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Model Ensemble for Click Prediction in Bing Search Ads

Moodle 3.2 Backup and Simple Restore

Driving Author Engagement through IEEE Collabratec

Australian Journal of Basic and Applied Sciences

Appendix L: Online Testing Highlights and Script

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

SCT Banner Student Fee Assessment Training Workbook October 2005 Release 7.2

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

TA Certification Course Additional Information Sheet

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Generating Test Cases From Use Cases

Odyssey Writer Online Writing Tool for Students

Axiom 2013 Team Description Paper

Minitab Tutorial (Version 17+)

Learning Methods in Multilingual Speech Recognition

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Modeling function word errors in DNN-HMM based LVCSR systems

Reducing Features to Improve Bug Prediction

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Applications of data mining algorithms to analysis of medical data

Evolution of Symbolisation in Chimpanzees and Neural Nets

Please find below a summary of why we feel Blackboard remains the best long term solution for the Lowell campus:

PowerTeacher Gradebook User Guide PowerSchool Student Information System

Online Master of Business Administration (MBA)

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Moodle Student User Guide

Introduction to Causal Inference. Problem Set 1. Required Problems

Android App Development for Beginners

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

OFFICE SUPPORT SPECIALIST Technical Diploma

CHANCERY SMS 5.0 STUDENT SCHEDULING

Student User s Guide to the Project Integration Management Simulation. Based on the PMBOK Guide - 5 th edition

DegreeWorks Advisor Reference Guide

Modeling function word errors in DNN-HMM based LVCSR systems

Transcription:

Media Partners

Azure Machine Learning Designing Iris Multi-Class Classifier

Marcin Szeliga 20 years of experience with SQL Server Trainer & data platform architect Books & articles writer Speaker at numerous conferences SQL Microsoft Most Valuable Professional since 2006 President of PLSSUG Founder of SQLExpert http://sqlexpert.pl/ http://blog.sqlexpert.pl/ linkedin.com/in/marcinszeliga marcin@sqlexpert.pl facebook.com/marcin.szeliga.18

Session Overview Machine Learning overview Microsoft Azure overview Designing an Experiment using AzureML Iris Multi-Class Classifier Deploying a Model as a service Monetizing Your Azure ML application

Machine Learning Overview Machine learning is a discipline that emerged from the general field of artificial intelligence only quite recently To build intelligent machines researchers realized that these machines should learn from and adapt to their environment It is simply too costly and impractical to design intelligent systems by first gathering all the expert knowledge ourselves and then hard-wiring it into a machine Formal definition: A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E Tom M. Mitchell Another definition: The goal of machine learning is to program computers to use example data or past experience to solve a given problem. Introduction to Machine Learning, 2nd Edition, MIT Press

Successes and the growth of machine learning The first reason is rooted in its multidisciplinary character Incorporated ideas from fields as diverse as statistics, probability, computer science, information theory, convex optimization, control theory, cognitive science, theoretical neuroscience, physics and more More important reason is the exponential growth of both available data and computer power It leverages the enormous flood of data that is generated each year by satellites, sky observatories, particle accelerators, the human genome project, banks, the stock market, the army, seismic measurements, the internet, video, scanned text and so on http://www.internetlivestats.com/one-second/

Machine Learning Techniques Two primary techniques: Supervised Learning We are given examples of inputs and associated outputs Finding the mapping between inputs and outputs using correct values to train a model Unsupervised Learning We are given inputs, but no outputs Finding patterns in the input data Reinforcement learning (learn to select an action to maximize payoff) is difficult

Supervised Learning Used when you want to predict unknown answers from answers you already have requires data which shows the answers you can get now Data is divided into two parts: the data you will use to teach the system (data set), and the data you will use to see if the computer s algorithms are accurate (test set) After you select and clean the data, you select data points that show the right relationships in the data The answers are labels, the categories/columns/attributes are features and the values are values Then you select an algorithm to compute the outcome Often you choose more than one You run the program on the data set, and check to see if you got the right answer from the test set Once you perform the experiment, you select the best model This is the final output the model is then used against more data to get the answers you need

Unsupervised Learning Used when you want to find unknown answers mostly groupings directly from data No simple way to evaluate accuracy of what you learn Evaluates more vectors, groups into sets or classifications Start with the data Apply algorithm Evaluate groups

Machine Learning tasks Three common tasks Classification The learned attribute is categorical Regression The learned attribute is numeric Clustering Finding similiar groups (clusters)

Microsoft Azure Overview Setting up a Microsoft Azure Account Setting up an AzureML Workspace Accessing AzureML Studio

Designing an Experiment Using AzureML Loading a Data Set Creating the Test Experiment Training and Scoring the Model Saving the Trained Model Creating the Scoring Experiment Publishing the Model Using the Model

Loading a Data Set IRIS Dataset It is perhaps the best known database to be found in the pattern recognition literature The data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant One class is linearly separable from the other 2 The latter are not linearly separable from each other Available from UC Irvine Machine Learning Repository http://archive.ics.uci.edu/ ml/datasets/iris

Creating the Test Experiment Drag Iris Dataset from the «Dataset» menu item on the left and drop it in the design area Under the Machine Learning menu look for Initialize Model \ Classification \ Multiclass Neural Network and drop it on the design area Drop the Split component from the Data Transformation \ Sample and Split menu and connect the Iris Dataset to the Split Input Split Data between 70% for training and 30% for evaluating the model Such configuration can be set in the Properties pane Drop the component from the Train \ Train Model element under Machine Learning menu Select the Train Model component that has been placed on the design area before and click on the «Launch columns selector» in the options area and then select the class column

Training and Scoring the Model Add Score Model component from Machine Learning \ Score Connect Second Split output to second Score Model input Run the experiment Model will be train used 70% of the data The trained model will be used to predict the 30% of the data we already know the classification but that wasn t used in training Visualize the scored results, by right clicking on the Score Model output and select Visualize In the Visualize window, select the class column and in the «Visualization» pane, in the compare to dropdown, select «Scored Labels»

Creating the Scoring Experiment Click «Create Scoring Experiment» icon Saved Trained Model will replace Initialize ad Train Model components Web service input and output will be added Add Project Columns from Data Transformation \ Manipulation It will be used to strip the class column from the data source and to define the correct metadata when the model will be published as a Web Service Connect it with Iris Dataset and with the Score Model Make sure al but class column are selected in Project Columns properties Run the Experiment Add another Project Columns connected to the Score Model Strip out all the source columns and keep only the results Connect it with Web service output

Publishing the Model Click on the «Publish Web Service» icon Now the web service can be tested and give sepal and petal data as input, it will return the probability for each class and the most probable class as result You ll find the Web Service in the «Web Service» section of AzureML homepage Testing page and Excel workbook are also there Click Test and input new data See predicted values in Creating scoring experiment details

Using the Model Two Web Services are available: REQUEST/RESPONSE and BATCH EXECUTION Both Web Services provides examples to use them with C#, R and Python Click API help page for REQUEST/RESPONSE Web Service Select R Sample Code and past it into R Studio Replace api_key with key grabbed on Web Services homepage Input new data and run the script

Monetizing Your Azure ML Application What if you needed to Develop a handwriting recognition app Manage a large data set Use a state-of-the-art neural network Deploy on thousands of devices How long would that take? What if you could Harness the power of open source Combine that with enterprise-tested algorithms Release that to the world What could you achieve with Azure ML API Service? Check out the Machine Learning marketplace at datamarket.azure.com

The rest is up to you Sensor data analysis Buyer propensity models Social network analysis Predictive maintenance Search engine optimization Churn analysis Natural resource exploration Weather forecasting Healthcare outcomes Fraud detection Life sciences research Targeted advertising Network intrusion detection Smart meter monitoring

Media Partners