Machine Learning for Humans. My journey from ignorance to Oxford

Aim
- Why the hype?
- Overview of Machine Learning/Data Science
- Some code
- Give you an idea if it can help you in your day job
- Encourage you to try it out
- Some buzz words (for you to sound cool & knowledgeable)

About Me
- PhDs = 0
- MScs = 0
- Degrees = 0
- A levels = 0
- First programme at age 11 (ZX81)
- Coding professionally > 25 years
- Therefore: I am an Old Dog and Machine Learning is a new trick

Why the Hype?
- The volumes of data are massive
- Computer languages have machine learning libraries
- GPUs are fast and cheap
- Machine learning systems are giving insights that traditional systems either can't provide at all or can't provide cost-effectively
- They are now beating real people at games like Go

What is it?

"Machine learning is a type of artificial intelligence (AI) that provides computers with the ability to learn without being explicitly programmed. Machine learning focuses on the development of computer programs that can teach themselves to grow and change when exposed to new data."
Source: whatis.techtarget.com/definition/machine-learning

"Data science is an interdisciplinary field about processes and systems to extract knowledge or insights from data in various forms, either structured or unstructured, which is a continuation of some of the data analysis fields such as statistics, data mining, and predictive analytics."
Source: en.wikipedia.org/wiki/data_science

Assumptions
- I needed to be at MSc level at least

"A support vector machine constructs a hyperplane or set of hyperplanes in a high- or infinite-dimensional space, which can be used for classification, regression, or other tasks. Intuitively, a good separation is achieved by the hyperplane that has the largest distance to the nearest training-data point of any class (so-called functional margin), since in general the larger the margin the lower the generalization error of the classifier."
Source: en.wikipedia.org/wiki/support_vector_machine

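Some code: Linear Regression

    # Only two (lnlength, lnweight) pairs appear in this transcript.
    alligator <- data.frame(
      lnlength = c(3.87, 3.78),
      lnweight = c(4.87, 4.25)
    )

    # Fit a straight line predicting lnweight from lnlength.
    model <- lm(lnweight ~ lnlength, data = alligator)

    # Scatter plot of the data with the fitted line drawn over it.
    plot(alligator$lnweight ~ alligator$lnlength)
    abline(model)

    # Predictions for new lengths. The outputs are the values quoted on the
    # slide; the two points above do not reproduce them on their own.
    predict(model, newdata = data.frame(lnlength = 4.0))  # slide output: 5.248326
    predict(model, newdata = data.frame(lnlength = 4.2))  # slide output: 5.934545

[Figure: scatter plot of alligator$lnweight against alligator$lnlength with the fitted regression line]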
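
For anyone who would rather read Python, here is a rough sketch of what a one-feature linear regression does under the hood. It is not the code from the talk: the names `fit_line` and `predict` and the toy numbers are made up for illustration, and it uses plain ordinary least squares with no libraries.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope = covariance of x and y divided by the variance of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(model, x):
    """Apply the fitted line to a new x value."""
    intercept, slope = model
    return intercept + slope * x


# Toy values invented for this sketch (they lie exactly on y = 1 + 2x).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

model = fit_line(xs, ys)
prediction = predict(model, 5.0)
print(model, prediction)
```

The shape is the same as the R version: fit a model from known data, then ask it about a value it has not seen.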

How does it work?
First, there are two or three types:
- Supervised learning
- Unsupervised learning
- Reinforcement learning
Using mathematics, it attempts to infer a useful result from previously unseen data.

What is it doing?
- Classification (two classes or multiple classes)
- Clustering
- Anomaly Detection
- Regression

It's all about the data
Python Iris sample data
- 150 instances, 50 of each class (Iris Setosa, Versicolour, Virginica)
- 4 numeric predictive attributes (sepal length & width and petal length & width)
Code
- Great support to help you create Machine Learning models
- Testing your model with training data leaves you with great results and no confidence
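
The last point can be shown in a few lines. This is a minimal sketch, assuming a 1-nearest-neighbour classifier and a handful of invented two-feature points (not the Iris data); `nearest_label` and `accuracy` are hypothetical helper names. Scored on its own training data the model looks perfect, because every point is its own nearest neighbour; the held-back test points give the honest figure.

```python
import math


def nearest_label(train, features):
    """Return the label of the training point closest to `features`."""
    closest = min(train, key=lambda item: math.dist(item[0], features))
    return closest[1]


def accuracy(train, samples):
    """Fraction of `samples` whose predicted label matches the true label."""
    hits = sum(1 for features, label in samples
               if nearest_label(train, features) == label)
    return hits / len(samples)


# Invented measurements for illustration: ((feature 1, feature 2), class).
train = [
    ((1.0, 1.0), "a"), ((1.2, 0.9), "a"), ((2.0, 2.1), "a"),
    ((3.0, 3.1), "b"), ((3.2, 2.9), "b"),
]
test = [
    ((1.1, 1.0), "a"), ((2.9, 3.0), "b"), ((2.2, 2.0), "b"),
]

train_accuracy = accuracy(train, train)  # scored on data the model has seen
test_accuracy = accuracy(train, test)    # scored on data held back
print(train_accuracy, test_accuracy)
```

Holding back part of the data for testing is what turns "great results" into results you can trust.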

Some of the Lingo
- Feature: an attribute, e.g. petal length
- Vector: all the attributes of a single iris, e.g. [sepal length, sepal width, petal length, petal width]

What is it good for?
- Predictive Maintenance
- Marketing
- Finance
- Operational Efficiency
- Energy Forecasting
- Internet of Things
- Text and Speech Processing
- Image Processing and Computer Vision

Should you use it?
It depends:
- What problem are you trying to solve?
- What level of accuracy do you need?
- Is the system CPU or memory constrained?
- Is there enough good quality training data? (supervised)
- Can data be changed to a suitable format?

Real world Machine Learning - Silos
- Problem: find out how full the silo is without blowing it up
- Level of accuracy: Ask Sales or Engineering
- System constrained: Yes
- Good quality training data: Maybe
- Data in suitable format: Yes
martinlishman.com/barn-owl-wireless

Can you use it?
"Data Scientist (n.): Person who is better at statistics than any software engineer and better at software engineering than any statistician." @josh_wills

Can you use it?
It depends:
- Can you learn to programme in Python/R/C/a JVM language?
- Can you learn some basic Mathematics? (the more the better)
- Can you prepare data?
- Can you learn to use libraries?

Easy Start: Toy Data sets
Python dataset package
- boston house prices: regression
- iris: classification
- diabetes: regression
- digits: classification
- linnerud: multivariate regression
- plus other packages
R datasets package
- 80+ data sets

Working with data
- Gaps
- Data features of differing scales

Some more of the Lingo
- Interpolate: fill in the gaps; there are lots of ways (better Maths will help here)
- Mean, Variance and Standard deviation
- By normalising the data you can give equal weight to features
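
A small sketch of both ideas, using only the Python standard library. It assumes the simplest choices: straight-line interpolation between the nearest known neighbours, and normalising by subtracting the mean and dividing by the standard deviation. The function names and the readings are invented for the example; real projects often use other interpolation or scaling methods.

```python
from statistics import mean, pstdev


def interpolate_gaps(values):
    """Fill None gaps on a straight line between the nearest known values.

    Gaps before the first or after the last known value are left as None.
    """
    filled = list(values)
    known = [i for i, v in enumerate(filled) if v is not None]
    for left, right in zip(known, known[1:]):
        step = (filled[right] - filled[left]) / (right - left)
        for i in range(left + 1, right):
            filled[i] = filled[left] + step * (i - left)
    return filled


def standardise(values):
    """Rescale so the values have mean 0 and standard deviation 1."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / sigma for v in values]


# Invented readings with gaps (None marks a missing value).
readings = [1.0, None, 3.0, None, None, 6.0]
filled = interpolate_gaps(readings)

# After standardising, a feature measured in large units no longer
# outweighs one measured in small units.
scaled = standardise(filled)
print(filled)
print(scaled)
```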

Having knowledge to improve: Metrics
Confusion Matrix (rows are the actual class, columns are the predicted class)

                setosa  versicolor  virginica
    setosa          14           0          0
    versicolor       0          14          4
    virginica        0           1         17

Reading the virginica row: setosa = 0, versicolor = 1, virginica = 17

More Metrics
Classification report

                 precision  recall  f1-score  support
    setosa            1.00    1.00      1.00       14
    versicolor        0.93    0.78      0.85       18
    virginica         0.81    0.94      0.87       18
    avg / total       0.91    0.90      0.90       50

- Precision (virginica): 17 correct out of 21 predicted: 17/21 = 0.81
- Recall (virginica): 17 correct out of 18 actual: 17/18 = 0.94
- F1-score: harmonic mean of precision and recall
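
The per-class figures can be recomputed by hand from the confusion matrix on the previous slide. A minimal sketch in plain Python, assuming rows are the actual class and columns the predicted class (which is what the 17/21 and 17/18 arithmetic implies); `class_metrics` is a name made up for this example.

```python
labels = ["setosa", "versicolor", "virginica"]

# Confusion matrix from the slides: rows = actual, columns = predicted.
confusion = [
    [14, 0, 0],
    [0, 14, 4],
    [0, 1, 17],
]


def class_metrics(matrix, index):
    """Precision, recall and F1 for the class at position `index`."""
    correct = matrix[index][index]
    predicted = sum(row[index] for row in matrix)  # column total
    actual = sum(matrix[index])                    # row total
    precision = correct / predicted
    recall = correct / actual
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f1


report = {label: class_metrics(confusion, i) for i, label in enumerate(labels)}
for label, (precision, recall, f1) in report.items():
    print(f"{label:<12}{precision:>10.2f}{recall:>8.2f}{f1:>10.2f}")
```

For virginica this gives 17/21 for precision and 17/18 for recall, matching the report.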

Working with text
- Text != Numeric
- For machine learning: Text -> numerical feature vectors
- Each word is assigned an integer identifier
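
As a rough illustration of turning text into numbers, the sketch below gives each distinct word an integer identifier and then counts words per document. It is a bare-bones version of the idea, not any particular library's vectoriser; the sentences and the names `build_vocabulary` and `vectorise` are invented for the example.

```python
from collections import Counter


def build_vocabulary(docs):
    """Assign each distinct word an integer identifier, in order of first use."""
    vocabulary = {}
    for doc in docs:
        for word in doc.lower().split():
            vocabulary.setdefault(word, len(vocabulary))
    return vocabulary


def vectorise(doc, vocabulary):
    """Count how often each vocabulary word occurs in `doc`."""
    counts = Counter(doc.lower().split())
    vector = [0] * len(vocabulary)
    for word, count in counts.items():
        if word in vocabulary:  # words never seen before are ignored
            vector[vocabulary[word]] = count
    return vector


# Made-up documents for illustration.
documents = ["the cat sat on the mat", "the dog sat"]
vocabulary = build_vocabulary(documents)
vectors = [vectorise(doc, vocabulary) for doc in documents]
print(vocabulary)
print(vectors)
```

Each document becomes a list of numbers of the same length, which is the shape a machine learning model expects.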

Real world Machine Learning - Text
- Problem: Feature extraction from documents
- Level of accuracy: Very high
- System constrained: No
- Good quality training data: Getting there
- Data in suitable format: Yes

Working with text
Code (if there is time)
- Text processing
- Vectorisation
- Text feature extraction
- Term Frequency times Inverse Document Frequency (tf-idf)
- Stop words
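
A minimal sketch of tf-idf with stop words, in plain Python. It assumes the textbook form (term frequency = count / words in the document, inverse document frequency = log of documents / documents containing the word); libraries usually add smoothing, so their numbers will differ. The stop-word list, the documents and the function names are all invented for the example.

```python
import math
from collections import Counter

# A tiny illustrative stop-word list; real lists are much longer.
STOP_WORDS = {"the", "a", "is", "on"}


def tokenise(text):
    """Lower-case, split on whitespace and drop stop words."""
    return [word for word in text.lower().split() if word not in STOP_WORDS]


def tf_idf(docs):
    """Return one {word: score} dictionary per document."""
    tokenised = [tokenise(doc) for doc in docs]
    n_docs = len(tokenised)
    # Number of documents each word appears in.
    doc_freq = Counter(word for tokens in tokenised for word in set(tokens))
    scores = []
    for tokens in tokenised:
        counts = Counter(tokens)
        scores.append({
            word: (count / len(tokens)) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores


# Made-up documents for illustration.
docs = ["the silo is full", "the silo is empty", "the barn owl is on the silo"]
scores = tf_idf(docs)
for doc, doc_scores in zip(docs, scores):
    print(doc, doc_scores)
```

A word that appears in every document ("silo" here) scores zero, while a word that is distinctive to one document ("full", "empty") scores highest.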

What we have covered
- What is Machine Learning
- Some of the ways Machine Learning can be used
- Some code using and reviewing results
- Some buzz words (for you to sound cool & knowledgeable)

Books
- www.manning.com/books/introducing-data-science
- www.manning.com/books/r-in-action-second-edition

Questions? & Links
Information:
- www.analyticsvidhya.com/blog/2015/08/common-machine-learning-algorithms
- www.analyticsvidhya.com/blog/2015/09/full-cheatsheet-machine-learning-algorithms
Start coding:
- www.continuum.io/anaconda-overview
- www.r-project.org
- www.rstudio.com/home
Email: peter@catalystcomputing.co.uk
Web: catalystcomputing.co.uk
Blog: catalystcomputing.co.uk/peter-marriott
Twitter: @peter_marriott
GitHub: github.com/catalystcomputing