UCSB Data Science Bootcamp 2015

A two-week course, held just before the start of the academic year, meant to introduce and refresh skills in programming, software, and data. Supported by the Network Science IGERT through the National Science Foundation (Grant # DGE 1258507), and by the College of Engineering and the College of Letters and Science at the University of California, Santa Barbara.

Registration: http://goo.gl/forms/1ikpkqtsai
Location: Engineering Sciences Building, Cooper Lab

Schedule:

Week 1
  Mon 9/7: Labor Day
  Tues 9/8: Program Overview (9:30 am); Assessment (11:30 am); Computer Basics
  Weds 9/9: Everything Data; Faculty Lunch (12:00 pm); Small Data, Big Data
  Thurs 9/10: Visualization I; Visualization II
  Fri 9/11: Linear Algebra

Week 2
  Mon 9/14: Intro to Algorithms; Faculty Lunch (12:00 pm); Graph Algorithms I
  Tues 9/15: Graph Algorithms II; Dynamical Systems
  Weds 9/16: Foundations of Statistics; Faculty Lunch (12:00 pm); Intro to Machine Learning
  Thurs 9/17: Machine Learning on Graphs; Hands-on Machine Learning (4:00 pm, note time change)
  Fri 9/18: Graduate Student Panel

Course material can be found in Luca Foschini's Git repository: https://github.com/lucafoschini/ucsbdatasciencebootcamp2015

Introduction to Bootcamp
Date: 9/8/15
Time: 9:30 am to 11:30 am
Format: Panel
Instructors: Luca Foschini, John Mohr, Steven Proulx, Ambuj Singh

Panel discussion and overview of the boot camp. Each of the panelists will discuss his research and express views on the need for Network Science. The presentations will be informal and informative. There will be no need for PowerPoint or other materials. Luca Foschini will then present the bootcamp structure and provide an overview of the various modules. The Introduction will be followed by an informal assessment of the boot camp participants at 11:30 am.

Computer Basics
Date: 9/8/15
Time: 3:30 pm
Instructor: Haraldur Tómas Hallgrímsson
Format: Lab

Unix Basics
  - How to open and use the terminal
  - How to connect to Unix servers (ssh)
  - Text manipulation and command line magic
Git
  - The importance of version control
  - GitHub, reproducibility, and the scientific method
Python and Jupyter notebooks
  - Introduction to Python
  - Jupyter (née IPython) notebooks
  - Used throughout the rest of the bootcamp

Everything Data
Date: 9/9/15
Time: 12:00 pm
Format: Lab

  - What is data?
  - Data representation in a computer
  - Native data types in Python: integer, list, dict; NumPy arrays and pandas
  - From simple to complex: text, time series, networks, geometric objects
  - Discuss the complexity of manipulating these objects
  - Load and visualize different datasets in Python

Lunch with Faculty
Date: 9/9/15
Time: 12 pm
Instructor: Prof. Francesco Bullo
Format: Catered lunch

Prof. Bullo will discuss his research and views of network science over an informal lunch.

Small Data, Big Data
Date: 9/9/15
Time: 3:30 pm
Format: Lab

  - Small data, big data. Do you really have big data?
  - Storage latencies: cache, RAM, SSD, Redis, S3
  - Computation engines: single core, multi-core, memory distributed, disk distributed
  - Pandas and NumPy multi-core extensions
  - Examples: single-machine, SSD-backed operation. Caveats (sequential access needed)
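
The "sequential access needed" caveat in Small Data, Big Data can be illustrated with a one-pass computation. The following is a minimal sketch, not bootcamp material: it assumes a hypothetical input with one number per line and uses Welford's running update so that the data never has to fit in memory. The function name `streaming_mean_std` and the in-memory stand-in file are illustrative assumptions.

```python
import io
import math


def streaming_mean_std(lines):
    """One-pass mean and sample standard deviation (Welford's update).

    `lines` is any iterable of strings holding one number each, so an open
    file object works and only one line is held in memory at a time.
    """
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        x = float(line)
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)
    if count < 2:
        return count, mean, float("nan")
    return count, mean, math.sqrt(m2 / (count - 1))


# Hypothetical stand-in for a large file on disk; a real run would pass
# an open file handle instead of this in-memory buffer.
fake_file = io.StringIO("2\n4\n4\n4\n5\n5\n7\n9\n")
count, mean, std = streaming_mean_std(fake_file)
print(count, mean, std)
```

Because the loop reads the input strictly front to back, the same code works on an SSD-backed file far larger than RAM, which is the access pattern the module's caveat refers to.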

Visualization I
Date: 9/10/15
Time: 12:00 pm
Instructor: John O'Donovan
Format: Lab

  - Introduction to visualization: visual variables, design, types, etc.
  - Visualization in academia and industry
  - Visualizing live Twitter data (demo)
  - Anatomy of a research paper in visualization

Visualization II
Date: 9/10/15
Time: 3:30 pm
Instructor: John O'Donovan
Format: Lab

  - Visualizing live feeds using Python and Plot.ly (hands-on)
  - Visualizing network data (Fluo, TopicNets demos)
  - Visualizing data in R (hands-on)
  - Interactive visualization (demo)

Linear Algebra
Date: 9/11/15
Time: 11:00 am
Instructor: Victor Amelkin
Format: Lab

Review of Linear Algebra Fundamentals
  - Matrix algebra
  - Vector spaces
  - Angles, lengths, projection
  - Eigenproblem, SVD
Linear Algebra and Graphs
  - Graphs: definitions, properties, representation
  - Graph algorithms in the language of linear algebra
  - Spectral graph theory
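
As one illustration of "graph algorithms in the language of linear algebra", the sketch below counts triangles in an undirected graph from its adjacency matrix A: entry (i, j) of A^k counts walks of length k from i to j, so trace(A^3) / 6 is the number of triangles. This is a standard-library sketch on a hypothetical four-node graph, not code from the course; in practice NumPy would handle the matrix products.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    inner, cols = len(b), len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for row in a
    ]


def count_triangles(adj):
    """Count triangles in a simple undirected graph via trace(A^3) / 6.

    Each triangle contributes six closed walks of length 3:
    three starting vertices times two directions.
    """
    a3 = matmul(matmul(adj, adj), adj)
    trace = sum(a3[i][i] for i in range(len(adj)))
    return trace // 6


# Hypothetical example graph: triangle 0-1-2 plus a pendant edge 2-3.
adj = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
walks_len2 = matmul(adj, adj)  # walks_len2[i][j] = number of 2-step walks i -> j
triangles = count_triangles(adj)
print(triangles)
```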

Introduction to Algorithms
Date: 9/14/15
Time: 12:00 pm
Format: Lecture

  - Basic data structures: arrays, lists, balanced binary trees (sets), hash tables (dicts); an organized, categorical view of what each of these data structures is good for, and how they compare in cost and benefit
  - NP-hardness (TSP vs. Eulerian path)

Lunch with Faculty
Date: 9/14/15
Time: 12 pm to 1 pm
Instructor: Prof. Alberto Busetto
Format: Brown bag lunch

Prof. Busetto will discuss his research and views of network science over an informal lunch.

Graph Algorithms I
Date: 9/14/15
Time: 1:30 pm to 3:30 pm

  - Graph definitions (directed, undirected, weighted, unweighted, trees, cycles, bipartite, complete, etc.)
  - Graph representations (adjacency matrix, adjacency list); pros and cons
  - Graph generation: Erdős-Rényi model
  - Simple graph definition in Python NetworkX
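
The two graph representations and the Erdős-Rényi model listed under Graph Algorithms I can be sketched in a few lines. The version below is a standard-library illustration under stated assumptions, not the course's NetworkX code: it builds a G(n, p) graph as an adjacency list (a dict of neighbor sets) and converts it to an adjacency matrix. The helper names `erdos_renyi`, `to_matrix`, and `edge_count` are hypothetical; NetworkX offers `networkx.erdos_renyi_graph(n, p, seed=...)` for the same model.

```python
import itertools
import random


def erdos_renyi(n, p, seed=None):
    """Sample G(n, p): each of the n*(n-1)/2 possible edges appears with probability p.

    Returns an adjacency list: a dict mapping each node to a set of neighbors.
    """
    rng = random.Random(seed)
    adj = {v: set() for v in range(n)}
    for u, v in itertools.combinations(range(n), 2):
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def to_matrix(adj):
    """Convert an adjacency list to a dense 0/1 adjacency matrix."""
    n = len(adj)
    return [[1 if j in adj[i] else 0 for j in range(n)] for i in range(n)]


def edge_count(adj):
    """Each undirected edge is stored twice, once per endpoint."""
    return sum(len(neighbors) for neighbors in adj.values()) // 2


g = erdos_renyi(10, 0.3, seed=42)
m = to_matrix(g)
print(edge_count(g), "edges")
```

The adjacency list uses memory proportional to the number of edges and makes neighbor iteration cheap, while the matrix uses n^2 entries but answers "is there an edge between i and j?" with a single lookup.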

Graph Algorithms II
Date: 9/15/15
Time: 12:00 pm
Instructor: Nirman Kumar

  - Algorithms on graphs
  - Introduction to special classes of graphs
  - Demonstration of a few of the algorithms above in NetworkX
  - Examples in NetworkX on real networks (social, brain)
  - Preferential attachment
  - Small world networks
  - Hands-on
      - Generating and characterizing several graphs (both synthetic and real)
      - Counting triangles
      - Visualization using GraphViz

Dynamical Systems
Date: 9/15/15
Time: 1:30 pm to 3:30 pm
Instructor: Hari Sivakumar

  - Introduction to dynamics
  - Why is it important to study dynamics on networks?
  - First-order dynamics: flows on a line
  - Linear vs. nonlinear dynamics
  - Stability analysis: an intuitive explanation
  - Lyapunov equations: an intuitive explanation
  - Introduction to second-order dynamics
  - Eigenvalues and eigenvectors, stability
  - Introduction to bifurcations and hysteresis
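
For the "flows on a line" and stability topics in Dynamical Systems, a small numerical sketch may help. It assumes the textbook example dx/dt = x - x^3, which is an illustrative choice and not taken from the course material. A fixed point x* is stable when the slope f'(x*) is negative and unstable when it is positive, and a forward Euler integration shows trajectories settling at the stable points.

```python
def f(x):
    """Right-hand side of the assumed example flow dx/dt = x - x^3."""
    return x - x**3


def derivative(func, x, h=1e-6):
    """Central finite-difference estimate of func'(x)."""
    return (func(x + h) - func(x - h)) / (2 * h)


def classify(func, x_star):
    """Linear stability of a fixed point of a one-dimensional flow."""
    slope = derivative(func, x_star)
    if slope < 0:
        return "stable"
    if slope > 0:
        return "unstable"
    return "inconclusive"


def euler(func, x0, dt=0.01, steps=2000):
    """Integrate dx/dt = func(x) with forward Euler and return the final state."""
    x = x0
    for _ in range(steps):
        x += dt * func(x)
    return x


for x_star in (-1.0, 0.0, 1.0):
    print(x_star, classify(f, x_star))
print(euler(f, 0.5), euler(f, -0.5))
```

Here the fixed points are -1, 0, and 1; the origin repels nearby trajectories while the outer two attract them, which is the one-dimensional picture behind the eigenvalue-based stability analysis listed for higher-order systems.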

Foundations of Statistics
Date: 9/16/15
Time: 12:00 pm
Instructor: Arya Pourzanjani
Format: Lecture

  - Basic probability and combinatorics
  - Bernoulli trials. Expectation. Variance. Tail bounds
  - Significance and p-values
  - Regression, controlling, example in R

Lunch with Faculty
Date: 9/16/15
Time: 12 pm to 1 pm
Instructor: Prof. Susan Cassels
Format: Brown bag lunch

Prof. Cassels will discuss her research and views of network science over an informal lunch.

Introduction to Machine Learning
Date: 9/16/15
Time: 1:30 pm to 3:30 pm
Instructor: Bo Zong

Supervised learning
  - Decision Tree and Random Forest
  - Linear Regression and Support Vector Machine
  - Logistic Regression and Neural Network
Unsupervised learning
  - k-means, k-medoids, and hierarchical clustering
  - Mixture modeling
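
Of the unsupervised methods listed, k-means is short enough to sketch directly. The code below is a minimal standard-library version of Lloyd's algorithm on hypothetical 2-D points with hand-picked initial centers. It is an illustration under those assumptions rather than the course's implementation; a library such as scikit-learn would normally be used instead.

```python
def dist2(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def nearest(p, centers):
    """Index of the center closest to point p."""
    return min(range(len(centers)), key=lambda i: dist2(p, centers[i]))


def kmeans(points, initial_centers, iterations=20):
    """Lloyd's algorithm: alternate assignment and center-update steps."""
    centers = [tuple(c) for c in initial_centers]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            clusters[nearest(p, centers)].append(p)
        # Update step: move each center to the mean of its members.
        new_centers = []
        for center, members in zip(centers, clusters):
            if members:
                new_centers.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centers.append(center)  # keep an empty cluster's center
        if new_centers == centers:
            break  # converged: assignments can no longer change
        centers = new_centers
    labels = [nearest(p, centers) for p in points]
    return centers, labels


# Hypothetical data: two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers, labels = kmeans(points, initial_centers=[(0, 0), (10, 10)])
print(centers, labels)
```

The result depends on the initial centers; with poorly chosen starting points Lloyd's algorithm can converge to a worse local optimum, which is why library implementations typically restart from several initializations.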

Machine Learning on Graphs
Date: 9/17/15
Time: 12:00 pm
Instructor: Bo Zong

  - Classification on graphs
  - Community detection
  - Frequent patterns

Hands-on Machine Learning
Date: 9/17/15
Time: 1:30 pm to 3:30 pm

  - Fun with scikit-learn
  - Deep learning example

Graduate Student Panel
Date: 9/18/15
Instructors: Alyssa Newman, Hari Sivakumar, Herbert Cai, Bo Zong
Format: Panel

This panel will give the boot camp participants a chance to hear graduate students talk about their research and the value of a research background in network science.