Data Analyst Training Program

Similar documents
Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Python Machine Learning

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Probability and Statistics Curriculum Pacing Guide

STA 225: Introductory Statistics (CT)

School of Innovative Technologies and Engineering

Research Design & Analysis Made Easy! Brainstorming Worksheet

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Assignment 1: Predicting Amazon Review Ratings

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Lecture 1: Machine Learning Basics

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

12- A whirlwind tour of statistics

Learning From the Past with Experiment Databases

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Best Practices in Internet Ministry Released November 7, 2008

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

EGRHS Course Fair. Science & Math AP & IB Courses

CSL465/603 - Machine Learning

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Unit 7 Data analysis and design

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

On-Line Data Analytics

Reducing Features to Improve Bug Prediction

Kristin Moser. Sherry Woosley, Ph.D. University of Northern Iowa EBI

PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018

Applications of data mining algorithms to analysis of medical data

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Research computing Results

Mining Association Rules in Student s Assessment Data

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

AC : PREPARING THE ENGINEER OF 2020: ANALYSIS OF ALUMNI DATA

Ryerson University Sociology SOC 483: Advanced Research and Statistics

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Platform for the Development of Accessible Vocational Training

Multivariate k-nearest Neighbor Regression for Time Series data -

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

CS Machine Learning

Visit us at:

Lecture 1: Basic Concepts of Machine Learning

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Welcome to. ECML/PKDD 2004 Community meeting

Practical Research. Planning and Design. Paul D. Leedy. Jeanne Ellis Ormrod. Upper Saddle River, New Jersey Columbus, Ohio

GACE Computer Science Assessment Test at a Glance

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Statistics and Data Analytics Minor

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Predicting the Performance and Success of Construction Management Graduate Students using GRE Scores

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Knowledge management styles and performance: a knowledge space model from both theoretical and empirical perspectives

Minitab Tutorial (Version 17+)

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

Multi-Lingual Text Leveling

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

School Size and the Quality of Teaching and Learning

Australian Journal of Basic and Applied Sciences

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Rule Learning With Negation: Issues Regarding Effectiveness

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

EXAMINING THE DEVELOPMENT OF FIFTH AND SIXTH GRADE STUDENTS EPISTEMIC CONSIDERATIONS OVER TIME THROUGH AN AUTOMATED ANALYSIS OF EMBEDDED ASSESSMENTS

Indian Institute of Technology, Kanpur

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

(Sub)Gradient Descent

OFFICE SUPPORT SPECIALIST Technical Diploma

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Survey and Analysis of University Clustering

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Statewide Framework Document for:

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

APPENDIX A: Process Sigma Table (I)

DATA MANAGEMENT PROCEDURES INTRODUCTION

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Data Fusion Through Statistical Matching

The College of Law Mission Statement

CHALLENGES FACING DEVELOPMENT OF STRATEGIC PLANS IN PUBLIC SECONDARY SCHOOLS IN MWINGI CENTRAL DISTRICT, KENYA

Enhancing Students Understanding Statistics with TinkerPlots: Problem-Based Learning Approach

Investment in e- journals, use and research outcomes

CS 446: Machine Learning

Bangalore Mysore Pondicherry Tirupati

Software Maintenance

November 17, 2017 ARIZONA STATE UNIVERSITY. ADDENDUM 3 RFP Digital Integrated Enrollment Support for Students

Individual Differences & Item Effects: How to test them, & how to test them well

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Rule Learning with Negation: Issues Regarding Effectiveness

arxiv: v1 [cs.lg] 15 Jun 2015

Certified Six Sigma - Black Belt VS-1104

Analysis of Enzyme Kinetic Data

Axiom 2013 Team Description Paper

ATW 202. Business Research Methods

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

EMBA 2-YEAR DEGREE PROGRAM. Department of Management Studies. Indian Institute of Technology Madras, Chennai

Transcription:

R Data Analyst Training Program In exclusive association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries [Since 2009] Training partner for

Course Highlights Who is this Course for Salient Features Programmers and Statisticians 3 Hrs/Week Live Instructor-Led Online Sessions 15 Days of Project Work Active Q/A Forum Class Labs/Home Assignment (10 hours/week Learning Time) Govt. of India (Vskills Certified Course) Placement Support Personalised Training Program Lifetime Access to Updated Content and Videos Industry and Academia Faculty Course Advisors Top Data Analytics Tools Covered Specialize in R Industry s Data Analytics Advisors Ajay Ohri Data Scientist Ajay Ohri is a Data Scientist and Blogger in an open source data science. Since 2007, he has published his blog DecisionStats.com. Manas Garg heads the Analytics for Marketing at Paypal. He takes Data Driven Decisions for Marketing Success. Manas Garg Architect Shweta Gupta Vice President, Tech. Shweta Gupta has 19+ years of Technology Leadership experience. She holds a patent and number of publications in ACM, IEEE and IBM journals like Redbook and developerworks. Vishal is a Technology Influencer and CEO of Right Relevance. (A platform used by millions for content & influencer discovery) Vishal Mishra CEO & Co-Founder

Course Instructors NITIKA MALHOTRA Nitika Malhotra is a Data Scientist at Zomato and handles data science and machine learning projects. She has worked as the Analytics Specialist at Transorg, Research Associate at IIT-Delhi and Research Intern at MOSPI (Ministry of Planning and Programme Implementation). She holds expertise in Probability, Statistics, Data Structures, PostgreSQL, R, SPSS, Pentaho, SAS, Machine Learning, and Hive. SHANTANU GARG Shantanu Garg is the Sr. Marketing Analyst at MakeMyTrip. He handles data science and web analytics projects. He has worked as the Analytics Specialist in Transorg and Research Associate for Nielsen. He is skilled in Probability, Statistics, Data Mining, PostgreSQL, R, Pentaho, Machine Learning, Adobe Analytics, Hive and Google Analytics. Course Curriculum The R for Data Analytics course is thoughtfully designed to allow learners with some programming background to make a transition into the analytics industry with correct skillsets using R language. It is designed in a way that the student starts with the introduction to R programming, and in a very hands-on learning method using R Studio, will learn the nuts and bolts of R to perform the role of data analyst. The student will progress to applied statistics and machine learning concepts & applications. Post completion of the program, learners will be prepared to device solutions for real-time problems in the industry. INTRODUCTION TO DATA ANALYTICS This will be an introduction session with a brief explaination about Data Analytics ecosystem, scope of this field and introducton to R platform. Introductory Session Briefing about Analytics domain How insides from data can help business solve day-to-day problems and find solution Various platforms which can help you in the journey of becoming Data Scientist Introduction to R as a platform

INTRODUCTION TO R PROGRAMMING This session will be an introduction to Basics of coding on R Studio platform. R Nuts and Bolts Understanding different windows of R Studio Basics of R Programming and some important rules for coding in R Installing predefined packages Entering inputs and R objects (Vector, Matrix, Dataframes and Factors) R Datatypes Using dplyr Package Text Manipulations using Strings Reading data (csv file) in R DATA MANIPULATIONS AND LOOPING IN R In-depth understanding about data manipulation using different packages and functions & conditional loopings in R. In Detail Hands on for Learning Data Manipulations Subsetting dataset Date and Time in R Loops: while & for Conditionals: if-else Functions: Defining functions, Anonymous functions Apply family of functions Sampling in R EXPLORATORY ANALYSIS IN R Exploratory Analysis will help you know more about the features of datasets, statistically. For understanding real-time data in the industry, this is the first step. Descriptive Statistical Analysis Central Tendencies Measurements of Dispersion Test of Normality Null Value Treatment Outlier Treatment Correlation Analysis Reshaping Data Merging Data

VISUALISATION Creating basic as well as interactive visualisation in R. R Studio Visualisations Interactive Dashboard Categorical Data: Barplot,Pie Chart Numeric: Boxplot, Histogram, Scatter Plot, Line Chart Using different libraries to make graph presentable (ggplot2, Rcolorbrewer) Using shiny to create interactive Graphical Dashboards INFERENTIAL ANALYSIS IN R Inferential Analysis is very useful in knowing underline information of data. It is generally used in the industry for A/B or Test/Control group comparisons. Parametric Statistical Tests Non-Parametric Statistical Test Basic theory of Inferential Statistics Hypothesis tests using Z Test T-statistics Test Two sampled Z Test and T Test ANOVA Post-hoc Test Wilcoxen Test Mann-Whitney U Test K.S. Test Runn Test Chi-Square Test DATA LOADING AND FILE FORMATS This section begins with loading and bringing data from different data sources in R. Descriptive Statistical Analysis Data loading and file formats Loading JSON files XML and HTML Web Scraping Interacting with HTML and Web APIs Interacting with databases Text Mining/Text Analytics in R

MACHINE LEARNING Introduction to machine learining and its further bifurcations. Learning most of the industry-wise used machine learning techniques. What is Machine Learning Machine Learning real-world Examples Assumptions for Linear Regression Supervised Learning Techniques Linear Regression Assumptions checks in R Building Linear Regression Model in R Stepwise method Case Study- Linear Regresssion Exploring Data Dividing data into Test and Train Model Building and R Predicting on Test Data using Model Logistic Regression Understanding Logistic Regression Classification Model Building using Logistic Model Confusion Matrix Random Forest Decision Tree Random Forest SVM and Naive Bayes SVM Naïve Bayes Unsupervised Learning Techniques Unsupervised Learning Clustering K-means Hierarchical Clustering Time Series Analysis

Capstone Project (3 Weeks) The Capstone project is the culminating assignment that will allow you to have an integrated experience of the program. The approach to this project is to think, define, design, code, test and tune your solution, in such a way that you apply all aspects of the data analytics process. The real world is filled with text data and is usually messy hence cleaning and handling text is an important step towards making smarter Machine Learning algorithms. You will be working on one such usual messy dataset which hides a lot of information under the hood which is awaiting to be discovered. Tools Duration Fee Batch Options 18 Weeks Rs. 34,900+GST Weekend Interested? Contact Us! +91-84680-02880 info@digitalvidya.com www.digitalvidya.com Attend a Free Orientation Session: http://www.digitalvidya.com/data-analytics-course