DataScience Training

Similar documents
Python Machine Learning

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

(Sub)Gradient Descent

Probability and Statistics Curriculum Pacing Guide

Lecture 1: Machine Learning Basics

STA 225: Introductory Statistics (CT)

School of Innovative Technologies and Engineering

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Assignment 1: Predicting Amazon Review Ratings

Learning From the Past with Experiment Databases

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

CSL465/603 - Machine Learning

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

CS Machine Learning

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

12- A whirlwind tour of statistics

Lecture 1: Basic Concepts of Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Universidade do Minho Escola de Engenharia

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

Multi-Lingual Text Leveling

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Research Design & Analysis Made Easy! Brainstorming Worksheet

CS 446: Machine Learning

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Human Emotion Recognition From Speech

Multivariate k-nearest Neighbor Regression for Time Series data -

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

INTERMEDIATE ALGEBRA PRODUCT GUIDE

Optimizing to Arbitrary NLP Metrics using Ensemble Selection

Reducing Features to Improve Bug Prediction

Generative models and adversarial training

Speech Emotion Recognition Using Support Vector Machine

Computerized Adaptive Psychological Testing A Personalisation Perspective

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

arxiv: v1 [cs.lg] 15 Jun 2015

Australian Journal of Basic and Applied Sciences

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Modeling function word errors in DNN-HMM based LVCSR systems

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Geospatial Visual Analytics Tutorial. Gennady Andrienko & Natalia Andrienko

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Model Ensemble for Click Prediction in Bing Search Ads

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Modeling function word errors in DNN-HMM based LVCSR systems

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Affective Classification of Generic Audio Clips using Regression Models

Activity Recognition from Accelerometer Data

Analysis of Enzyme Kinetic Data

Medical Complexity: A Pragmatic Theory

Text-mining the Estonian National Electronic Health Record

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

WHEN THERE IS A mismatch between the acoustic

Investment in e- journals, use and research outcomes

PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia

Intelligent Agents. Chapter 2. Chapter 2 1

Top US Tech Talent for the Top China Tech Company

M55205-Mastering Microsoft Project 2016

OFFICE SUPPORT SPECIALIST Technical Diploma

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

Abstractions and the Brain

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Notetaking Directions

Statewide Framework Document for:

Linking Task: Identifying authors and book titles in verbose queries

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Calibration of Confidence Measures in Speech Recognition

Ensemble Technique Utilization for Indonesian Dependency Parser

Probabilistic Latent Semantic Analysis

Rule Learning With Negation: Issues Regarding Effectiveness

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

APPENDIX A: Process Sigma Table (I)

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Implementing a tool to Support KAOS-Beta Process Model Using EPF

The Role of the Head in the Interpretation of English Deverbal Compounds

A Vector Space Approach for Aspect-Based Sentiment Analysis

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Using EEG to Improve Massive Open Online Courses Feedback Interaction

Memory-based grammatical error correction

Individual Differences & Item Effects: How to test them, & how to test them well

Bachelor Programme Structure Max Weber Institute for Sociology, University of Heidelberg

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Postprint.

Learning Methods in Multilingual Speech Recognition

Evaluating Interactive Visualization of Multidimensional Data Projection with Feature Transformation

Transcription:

LEARN BY DOING BY-SASMITA PANIGRAHI DataScience Training Build your own predictive models in 45 days with zero prior knowledge Project-1 Sale Prediction In this project,we will build a predictive model to find out the sales of each product at a particular store Course Details Machine Learning gives computer the ability to learn without being explicitly programmed ~l.samuel In this course, you will learn how to apply Data Science through seven pragmatic steps - Frame, Acquire, Refine, Transform, Explore, Model, and Insight - to any business problem. The focus will be to learn the principles through an applied case study and by actually coding in Python to solve this. Objective Learn how to employ statistical and machine learning algorithms to solve real life problems by working on real time Projects. Develop proficiency in using python and its libraries like Pandas, numpy, Seaborn, Project-2 Predict Taxi Destination In this project, we will build a predictive framework that is able to infer the final destination of taxi rides based on their (initial) partial trajectories. The output of such a framework will be the final trip's destination employee's attributes change over time. Approach Interactive and live coding session Taught by Real time Practitioners

Module-1(Python Basics) Welcome To The Course Introduction To DataScience Real Time UseCases Of DataScience Who is a DataScientist?? Github Tutorial Skillsets needed for DataScientist 6 Steps to take in 3 Months for a complete transformation to DataScience from any other domain Machine Learning-Giving Computers The ability to learn from data Supervised vs Unsupervised DeepLearning vs Machine Learning Link to get Free Data to Practice? Some Great self Learning DataScience Resources(Books,Tutorials,Vedios,Papers) Python Fundamentals Python Fundamentals begins with acquiring an in-depth knowledge of the Python programming language. By the end of the week, students will b e e x p e c t e d t o p r o g r a m intermediate level scripts in Python Software Installation Introduction To Python Hello Python Program in IDLE Jupyter Notebook Tutorial Spyder Tutorial Introduction to Python Variable,Operators,DataTypes If Else,For and While Loops Functions Lambda Expression Filter, Map,Reduce Taking input from keyboard - INTERVIEW QUESTION DISCUSSION

Module-2(Python Advance) NumPy Create Arrays Array Item Selection and Indexing Array Mathematics Array Operation Introduction to Pandas Series Pandas Series indexing and Selection Series Operation Introduction to Pandas Data Frames Data Collection from csv,json,html,excel Data Merging,Concatenation,join Group By and Aggregate Function Order By Missing Value Treatment Outlier Detection and Removal Pandas builtin Data Visualisation INTERVIEW QUESTION DISCUSSION

Module-3( Visualisation) Visualisationmatplotlib,seaborn we ll begin curriculum focused on various data visualization techniques and how they can help us engage and learn from our data using Matplotlib, Seaborn,ggplot Line Plots Scatter Plots Pair Plots Histograms Heat Maps Bar Plots Count Plots Factor Plots Box Plots Violin Plots Swarm Plots Strip Plots Pandas Builtin Visualisation Library INTERVIEW QUESTION DISCUSSION Project-1 Prcatice, Practice and Practice!!!!!!! Implement what you have learnt so far by working in a real time Project.. Pandas Numpy Seaborn MatplotLib

Module-4 (Statistics) Descriptive vs Inferential Statistics Statistics Mean,Median,Mode,Variance,Std. dev Central Limit Theorm Co-Variance Pearson s Product Moment Correlation R - Square Adjusted R-Square Spearman s. Rank order Coefficient This session is dedicated to creating a deep understanding of mathematical concepts we ll later see in topics like machine learning and statistical analysis. Contrary to the traditional mathematics course, students will learn statistics and linear algebra in programmatic way to fit a problem s needs. Sample vs Population Standardizing Data(Z-score) Hypothesis Testing Normal Distribution Bias Variance Tradeoff Skewness P Value Z-test vs T-test The F distribution The chi-square test of Independence Type-1 and Type-2 errors Annova INTERVIEW QUESTION DISCUSSION

Module-5 (Intro to ML) Introduction to Machine Learning Introduction to Machine Leaning Machine Learning Usecases Supervised vs Unsupervised vs Semi- Supervised Machine Learning process Workflow Training a model Validating results Overfitting vs Underfitting Ordinal vs Nominal data Structured vs unstructured vs semistructured data Intro to scikitlearn

Module-6 (Supervised) Regression Regression Vs Classification Linear regression Multivariate regression Polynomial regression Multi-Colinearity, Auto correlation Heteroscedascity Hands On Classification KNN Svm Decision Tree Random Forest Performance tuning of Random Forest Naive Bayse Overfitting Vs Underfitting Hands On Model Validation Classification Report Confusion Report ROC RMSE MSE Cross validation Hands On

Module-7 (Unsupervised) Kmeans How to choose number of K in KMeans Clustering & PCA Hands on PCA Hands on Module-8 (Ensemble) Ensemble Methods What is Ensembling Types of Ensembling Bagging Boosting Stacking Random Forest Important Feature Extraction XGBoost

Module-9 (NLP) NLP Tokenizer Stop Word Removal Tf-idf Document similarity Word2vec Model t-sne visualisation Sentiment Analysis Module-10 (Deep Learning) Basic of Neural Network Deep Learning Type of NN Cost Function Tensorflow Basics Hands on Simple NN with Tensorflow Image classification using CNN