Certified Professional Machine Learning & Data Science Foundation (CP-MLDS Foundation) Certification Course

Similar documents
Python Machine Learning

(Sub)Gradient Descent

Lecture 1: Machine Learning Basics

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Assignment 1: Predicting Amazon Review Ratings

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

CSL465/603 - Machine Learning

CS Machine Learning

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

School of Innovative Technologies and Engineering

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Evaluation of a College Freshman Diversity Research Program

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

B.S/M.A in Mathematics

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Learning From the Past with Experiment Databases

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

LOUISIANA HIGH SCHOOL RALLY ASSOCIATION

ME 443/643 Design Techniques in Mechanical Engineering. Lecture 1: Introduction

EGRHS Course Fair. Science & Math AP & IB Courses

Artificial Neural Networks written examination

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Lecture 1: Basic Concepts of Machine Learning

Radius STEM Readiness TM

Mathematics. Mathematics

Extending Place Value with Whole Numbers to 1,000,000

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

Self Study Report Computer Science

Software Maintenance

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Academic Catalog Programs & Courses Manchester Community College

Pre-Algebra A. Syllabus. Course Overview. Course Goals. General Skills. Credit Value

Statistics and Data Analytics Minor

Truth Inference in Crowdsourcing: Is the Problem Solved?

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

A Note on Structuring Employability Skills for Accounting Students

Common Core Standards Alignment Chart Grade 5

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Model Ensemble for Click Prediction in Bing Search Ads

Mathematics Program Assessment Plan

Optimizing to Arbitrary NLP Metrics using Ensemble Selection

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Learning Disability Functional Capacity Evaluation. Dear Doctor,

Detailed course syllabus

Rule Learning With Negation: Issues Regarding Effectiveness

Visit us at:

Human Emotion Recognition From Speech

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Mining Association Rules in Student s Assessment Data

Axiom 2013 Team Description Paper

Bachelor of Science in Mechanical Engineering with Co-op

Education & Training Plan Civil Litigation Specialist Certificate Program with Externship

Introduction, Organization Overview of NLP, Main Issues

Australian Journal of Basic and Applied Sciences

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Computational Data Analysis Techniques In Economics And Finance

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Course Brochure 2016/17

Content Language Objectives (CLOs) August 2012, H. Butts & G. De Anda

Multivariate k-nearest Neighbor Regression for Time Series data -

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Laboratorio di Intelligenza Artificiale e Robotica

Field Experience Management 2011 Training Guides

Speech Recognition at ICSI: Broadcast News and beyond

Probabilistic Latent Semantic Analysis

The Algebra in the Arithmetic Finding analogous tasks and structures in arithmetic that can be used throughout algebra

Statewide Framework Document for:

Indian Institute of Technology, Kanpur

Article A Novel, Gradient Boosting Framework for Sentiment Analysis in Languages where NLP Resources Are Not Plentiful: A Case Study for Modern Greek

WHEN THERE IS A mismatch between the acoustic

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Navigating the PhD Options in CMS

Frequently Asked Questions about Music Education at IU

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

An empirical study of learning speed in backpropagation

ecampus Basics Overview

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

arxiv: v2 [cs.cv] 30 Mar 2017

Medical Complexity: A Pragmatic Theory

Training Pack. Kaizen Focused Improvement Teams (F.I.T.)

The Role of the Head in the Interpretation of English Deverbal Compounds

Circuit Simulators: A Revolutionary E-Learning Platform

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

Software Development Plan

Development of Multistage Tests based on Teacher Ratings

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Telekooperation Seminar

PATHWAYS IN FIRST YEAR MATHS

Honors Mathematics. Introduction and Definition of Honors Mathematics

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Youth Sector 5-YEAR ACTION PLAN ᒫᒨ ᒣᔅᑲᓈᐦᒉᑖ ᐤ. Office of the Deputy Director General

PROJECT DESCRIPTION SLAM

Transcription:

CERTIFIED PROFESSIONAL MACHINE LEARNING AND DATA SCIENCE FOUNDATION PROGRAM Certified Professional Machine Learning & Data Science Foundation (CP-MLDS Foundation) Certification Course VERSION 1.1 (RELEASE DATE 17 OCTBER 2018) http://cpmldsf.devopsppalliance.org/ http://devopsppalliance.org/cp-mldsf.html ATAMalasiya@AgileTestingAlliance.org ATAIndia@AgileTestingAlliance.org /devopspp /devopspp http://devopsppalliance.org

What is CP-MLDS Foundation course? CP-MLDS Foundation stands for Certified Professional Machine Learning and Data Science Foundation certification prepared and honored by Agile Testing Alliance. The course is applicable for all roles and knowledge, experience & certification is consciously designed for all those who want to learning practical Machine learning and Data Science. What is new in this version 1.2 of CP-MLDS As per this latest version of learning objectives, a. Module 4 for Linear Regression now includes Regularization Techniques of i. Lasso Regression and ii. Ridge Regression b. Module 5 for classification is added with Decision Tree based classification technique. c. At the same time, the learning structure has been modified to include more practical assignments post the 3 days training. CP-MLDS Learning Structure 3 days training Post training project assignments

How is CP-MLDS useful? Machine learning is based on algorithms that can learn from data without relying on rules-based programming. As ever more of the analog world gets digitized, our ability to learn from data by developing and testing algorithms will only become more important for what are now seen as traditional businesses. Google chief economist Hal Varian calls this computer kaizen. For just as mass production changed the way products were assembled and continuous improvement changed how manufacturing was done, he says, so continuous [and often automatic] experimentation will improve the way we optimize business processes in our organizations. This is where it is clear, that we are into a computer kaizen world. A world where Machine learning and Data science algorithms are driving this selflearning and continuous learning process and bringing about a massive change in almost every industry. We have machine learning and data science now being used in banking, insurance, health care, sports, manufacturing, smart cities, Iot Solutioning, Automotive, Aviation, Shipping and every other industry for that matter. Machine learning and Data science need thus has increased multifold in past few years and would keep on increasing. At the same time there is a dearth of experienced professionals who know Machine learning and Data Science. There is no dearth of machine learning and data science programs where folks would need to spend infinite amount of time and efforts to acquire this knowledge. The challenge is for working professionals to spend so much time. Most often this rigor is lost over few weeks or months. This program solves this issue addresses two basic needs a) Practical tool-based Machine learning and Data Science exposure for every working professional b) Allow working professionals to acquire this knowledge in the most agile manner

Am I Eligible? There are no pre-requisites for this certification program except having some prior knowledge of any programming language and basics of mathematics and statistics. Program is Python driven and having prior knowledge of Python would be an advantage. What is the Training and Certification Exam structure? CP-MLDS is one of the only FastTrack program specially designed to Reskill / Upskill working professionals.

The Training and Certification structure is composed of a. Lab sessions 3 full days (24 Hrs) Instructor led fully hands on lab sessions b. Project assignments and Mentoring Post the three days training program, 2 projects assignments will be shared by the mentor. The process is designed to help the participants strengthen their learnings. c. Certification Exam. The certification exam comprises of one Theory Section (40 Marks) and one Practical section (60 Marks). Theory section is of 1 hour and practical section is of 2 hours. Getting 60% in both the sections of exam is necessary to get the CP-MLDS certificate. d. The training program (Part a and b above) is not mandatory for someone to go for CP-MLDS certification. e. Thus certification exam can be taken by following either of the two routes Route 1 - by going through a formal training (step a and step b) and then going for the certification exam. Route 2 This is a direct route which allows someone to appear for the exam and get the CPMLDS (Foundation) certification after passing the exam. This route is recommended for someone already familiar with the subject. Learning Objectives and Main Topics Module 1: The World of Machine Learning (ML) A. What is Machine Learning? B. Applications of Machine Learning in various industries C. Types of Machine Learning Supervised Learning Unsupervised Learning D. Maths refresher Advanced Stats and probability Calculus and Algebra

At the end of this module students should Understand the types of Machine Learning and the names of algorithms used Be able to tell the real-world applications of Machine Learning Be able to answer questions related to Linear Algebra, Calculus and Statistics Module 2: Setting up Environment and Recap of Python for Machine Learning A. Anaconda environment setup B. Installing libraries required for Machine Learning C. Introduction to Jupyter notebook D. Recap Python Data types Operators Conditions and Loops Data structures List, Dictionary, Set and Tuples functions E. Introduction to libraries of Python useful for ML NumPy Numerical processing Pandas Data Processing Matplotlib, Seaborn Data Visualizations in Python At the end of this module students should Setup an environment in Anaconda for Machine Learning Independently install various libraries required for Machine Learning Solve small programming assignments in python using NumPy, Pandas, Matplotlib and Seaborn Libraries.

Module 3: EDA - Exploratory Data Analysis A. Types of data B. Using summary statistics to understand data C. Using boxplot to visualize data D. Pre-processing Data Handling missing data and outliers Normalization E. Visualizing correlations between features At the end of this module students should Understand data that is used for Machine Learning Understand and identify missing data, outliers Pre-processing data Data imputation, normalization Explain summary statistics for a data frame Understand correlation between columns Module 4: Simple and Multiple Linear Regression and Regularization techniques A. Understanding Regression problems B. What is Simple and Multiple Linear Regression C. Understand maths and relevant concepts behind the Linear Regression Understanding the linear equation Features Learning a model for y=wx+b equation and parameters Defining and visualizing the cost function Understanding Gradient Descent algorithm for cost optimization D. Performance Metrics for Linear Regression E. Hands on Exercises Using SciKit-learn library for Machine Learning for Simple and Multiple Linear Regression datasets F. Regularization Techniques Lasso Regression Ridge Regression

At the end of this module participants Will understand basic concepts and maths of Simple and Multiple Linear Regression. Will be able to use scikit-learn library for applying linear regression Compute Performance metrics of the Linear model Understand Regularisation techniques using Lasso Regression and Ridge Regression to overcome overfitting Module 5: Classification using Logistic Regression (LR) and Decision Trees A. Understand Classification problems and differentiate it from regression problems. B. Understanding maths and relevant concepts for Logistic regression The Sigmoid function Cost function for classification Finding derivative of sigmoid function C. Understanding Decision Tree technique for classification D. Hands on Exercises for classification problem with Logistic Regression and Decision Tree using Scikit-learn library E. Performance Measures for Classification Confusion Matrix Precision Recall At the end of this module students Understands and easily differentiate between Regression problems and classification problems Will be able to apply logistic regression and Decision Tree to different datasets using scikit-learn Would be able to determine performance metrics for the given problems

Module 6: Introduction to Clustering A. Understanding UnSupervised ML problem B. Understand Clustering technique C. Types of Clustering D. Applications of Clustering E. Understanding k-means clustering algorithm F. Learn to implement clustering using scikit-learn At the end of this module students Understands what is clustering and its application areas Implementing clustering using scikit-learn to different datasets Module 7: Introduction to NLP (Natural Language Processing) A. Understanding NLP B. Application areas of NLP G. Understand preprocessing and terminology used for NLP using NLTK library At the end of this module students Understands what is NLP wrt Text Data and its application areas Preprocessing the Text Data with NLTK library