SUMMER SCHOOL. June 11 - August 3, 2018, Almaty. In partnership with

Table of Contents
About Yessenov Data Lab
Program stages
Who can apply for the program?
Apply to the Program
Week 1. Python
Week 2. Linear Models for Classification and Regression
Week 3. Working with Features (PCA, Classification)
Week 4. Neural Networks
Week 5. Deep Learning in Computer Vision and Reinforcement Learning. Solving Kaggle cases?
Week 6. Natural Language Processing (NLP)
Weeks 7-8. Project challenge

About Yessenov Data Lab
The Yessenov Data Lab is an 8-week intensive summer school that fast-tracks participants into the Data Scientist specialization. Participants solve the challenges businesses face and are equipped with the knowledge to keep growing through self-learning.

8 weeks of fast-paced learning: 6 weeks of academics and 2 weeks of business cases
School dates: June 11 - August 3, 2018
Schedule: Mon-Fri, 9:00 am - 6:00 pm
Participants: 20 people
Venue: Almaty Management University

Graduates of the summer school can expect to acquire the following skills:
1. Programming in Python for data analysis
2. Preprocessing
3. Visualization of data and finding data dependencies
4. Forecasting based on historical data
5. Understanding different training algorithms
6. Choosing the right training model
7. Fundamental understanding of neural networks

Program stages
Applications: Feb 26 - Mar 25
Round 1: Application assessment, Mar 26 - Apr 8 (up to 60 candidates)
Round 2: Logic & Statistics Exam, Apr 9-22 (up to 40 candidates)
Round 3: Interviews, Apr 30 - May 13 (20 winners)
Summer School: Jun 11 - Aug 3

Who can apply for the program?
1. Kazakhstan citizens above 18
2. Students in their last year of undergraduate, graduate and Ph.D. programs
3. Professionals

Requirements for candidates:
Strong analytical skills
Basic knowledge of statistics and linear algebra
Determination and a results-oriented mindset

The following are a plus:
1. Programming skills
2. Upper-intermediate English (IELTS 6.5 / TOEFL iBT 90) or higher
3. Certificates of successful completion of programming courses, or a bachelor's diploma in CS or other technical disciplines (math, physics, engineering)

Apply to the Program
Fill out the application and prepare the additional documents.
Visit yessenovfoundation.org.
Send them to info@yessenovfoundation.org before March 25.
Round 1 results: April 9.

Additional documents list:
1. Application form
2. Copy of ID
3. Copies of diplomas, certificates of course completion (programming, statistics, etc.) and of participation in Olympiads (math, IT or other technical disciplines)
4. Copy of transcript (all completed semesters) and a copy of the bachelor's degree diploma with transcript (for graduates and specialists)
5. Essay on "I want to learn data analysis to ..."
6. Detailed portfolio demonstrating achievements in the IT field (where possible)
7. Certificates of English language tests (where possible)

Week 1. Python
June 11-15

Day 1
09:00-10:00 Registration
What is Data Mining, Big Data? Examples
Case study: Titanic on Kaggle
Python: Introduction. Variables, lists, conditions, loops
Basics of Python

Day 2
Data structures: lists, sets, dictionaries
NumPy library: Introduction
Data structures and NumPy
16:00-18:00 Team building

Day 3
Pandas and SciPy libraries: Introduction. Data upload
Grouping of data. Filters, sorting
CSV, TXT, Quandl (2 sessions)

Day 4
Object-oriented programming
Case study: Coders Strike Back on codingame.com
codingame.com: simple tasks
codingame.com: Coders Strike Back

Day 5
Data upload. Data pre-processing
Simple visualization (2D arrays)
Pandas
Matplotlib

Kuanysh Abeshev, AlmaU
Timur Bakibayev, Professor, AlmaU

Week 2. Linear Models for Classification and Regression
June 18-22

Day 1
Optimization, gradient descent method

Day 2
Linear models for classification and regression

Day 3
Overfitting, generalization
16:00-18:00 Team building

Day 4
Cross-validation

Day 5
Quality metrics

Dmitriy Rusanov, Data Scientist, EPAM Systems

Week 3. Working with Features (PCA, Classification)
June 25-29

Day 1
Classification, decision trees and k-nearest neighbours

Day 2
Decision tree ensembles: bagging, boosting, random forest

Day 3
Unsupervised learning: PCA, clustering
16:00-18:00 Team building

Day 4
Feature selection

Day 5
Support vector machine (SVM)

Michael Lipkovich, Lead Big Data Engineer, EPAM Systems

Week 4. Neural Networks
July 2-6

Day 1
Neural networks: Introduction. Perceptron
Back-propagation
Neural network implementation (2 sessions)

Day 2
Keras library: Introduction
Keras library: Introduction. Continued

Day 3
Convolutional neural networks (CNN)
Image analysis (2 sessions)
16:00-18:00 Team building

Day 4
Recurrent neural networks (RNN)
Text analysis (3 sessions)

Day 5
Problems of overfitting. Data augmentation

Marina Gorlova, Analyst, Yandex Money

Week 5. Deep Learning in Computer Vision and Reinforcement Learning. Solving Kaggle cases?
July 9-13

Day 1
Classification of the MNIST, Fashion MNIST and LFW datasets
Work on an example (3 sessions)

Day 2
VGG, ResNet and Inception architectures. What neural networks see
Work on an example (3 sessions)

Day 3
From classification to segmentation. Kaggle Challenges review
Work on an example (2 sessions)
16:00-18:00 Team building

Day 4
Autoencoders and Variational Autoencoders. Pose estimation
Work on an example (3 sessions)

Day 5
Reinforcement learning. Limits of supervised learning
Work on an example (3 sessions)

Dmitriy Kotovenko, Computer Vision Research Assistant, AGT International

Week 6. Kaspi Lab
July 16-20

Day 1
Who is an analyst and what does he work with?
Who is an analyst and what is his purpose? (Parts 1 and 2)
Practical case «Analyst dedication?» (Parts 1 and 2)

Day 2
Where to begin?
Client analytics: what kind of «fruit» is it?
CRM + Analytics
Developing key skills of an analyst (Parts 1 and 2)

Day 3
Intellectual risks
Credit: to be or not to be, that is the question
«Measure thrice and cut once»
Behavioral analytics as one of the main lines of protection in the antifraud process (Parts 1 and 2)

Day 4
Artificial intelligence in Kaspi
Can you read between the lines? (Parts 1 and 2)
When the system knows better than the customer does (Parts 1 and 2)

Day 5
Marketing cases
What to do, what to do? Definitely buy!
Practical case: «To each customer, their own product» (Parts 1, 2 and 3)

Duman Uvatayev, Chief Data Officer
Aigerim Sagandykova, Chief Analyst, Experimental Projects Group
Ilyas Zhubanov, Head of the Data Analytics Department

Week 7. Kaspi Lab
July 23-27

Kaspi Lab in numbers
8000+ students have attended the presentation
100+ students have successfully passed the examination and completed the training
Other figures: 7, 1 500+, 420+, 40 and 16, covering the largest specialized university partners, applied problems solved, academic hours attended, students who sat the exam, students who have found a good job, and full-fledged analytical services developed

Using machine learning methods, Kaspi Lab students have learnt to:
Assess the risk profile of clients by developing the architecture of an automatic decision-making system based on credit conveyor principles
Optimize work processes by centralizing the decision-making contour and reducing the resource intensity of processes
Develop, introduce and evaluate various advisory systems on a website based on behavioral data from the website
Isolate the primary from the secondary when creating design reports, presentation content or analytical summaries
Develop computer vision solutions: detection, matching, tracking, and classification of products
Understand the business environment and implement data-driven processes in a company
Develop a fair evaluation of any marketing activities, regardless of communication channel (mass or personalized)

Week 8. Project challenge
July 30 - August 3

Kazakhstani companies that use data analysis will provide the program participants with challenges of real businesses. Successful graduates of the School will receive job offers.
