IST 718: Advanced Information Analytics

Similar documents
IST 649: Human Interaction with Computers

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

COURSE WEBSITE:

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

Required Materials: The Elements of Design, Third Edition; Poppy Evans & Mark A. Thomas; ISBN GB+ flash/jump drive

Python Machine Learning

Instructor: Matthew Wickes Kilgore Office: ES 310


Foothill College Summer 2016

Pitching Accounts & Advertising Sales ADV /PR

Professors will not accept Extra Credit work nor should students ask a professor to make Extra Credit assignments.

Syllabus for CHEM 4660 Introduction to Computational Chemistry Spring 2010

AGN 331 Soil Science Lecture & Laboratory Face to Face Version, Spring, 2012 Syllabus

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

General Physics I Class Syllabus

ECO 2013: PRINCIPLES OF MACROECONOMICS Spring 2017

English Grammar and Usage (ENGL )

BUFFET THEORY AND PRODUCTION - CHEF 2332 Thursday 1:30pm 7:00pm Northeast Texas Community College - Our Place Restaurant Course Syllabus Fall 2013

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

AGN 331 Soil Science. Lecture & Laboratory. Face to Face Version, Spring, Syllabus

SOUTHERN MAINE COMMUNITY COLLEGE South Portland, Maine 04106

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

IST 440, Section 004: Technology Integration and Problem-Solving Spring 2017 Mon, Wed, & Fri 12:20-1:10pm Room IST 202

CS/SE 3341 Spring 2012

Scottsdale Community College Spring 2016 CIS190 Intro to LANs CIS105 or permission of Instructor

MTH 215: Introduction to Linear Algebra

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

Counseling 150. EOPS Student Readiness and Success

General Chemistry II, CHEM Blinn College Bryan Campus Course Syllabus Fall 2011

Syllabus Fall 2014 Earth Science 130: Introduction to Oceanography

Medical Terminology - Mdca 1313 Course Syllabus: Summer 2017

Lecture 1: Machine Learning Basics

BUS Computer Concepts and Applications for Business Fall 2012

Office Hours: Day Time Location TR 12:00pm - 2:00pm Main Campus Carl DeSantis Building 5136

Financial Accounting Concepts and Research

Psychology 101(3cr): Introduction to Psychology (Summer 2016) Monday - Thursday 4:00-5:50pm - Gruening 413

PSCH 312: Social Psychology

COURSE INFORMATION. Course Number SER 216. Course Title Software Enterprise II: Testing and Quality. Credits 3. Prerequisites SER 215

Office Hours: Mon & Fri 10:00-12:00. Course Description

STA2023 Introduction to Statistics (Hybrid) Spring 2013

Social Media Journalism J336F Unique ID CMA Fall 2012

Maintaining Resilience in Teaching: Navigating Common Core and More Online Participant Syllabus

CHMB16H3 TECHNIQUES IN ANALYTICAL CHEMISTRY

George Mason University Graduate School of Education Education Leadership Program. Course Syllabus Spring 2006

CRITICAL THINKING AND WRITING: ENG 200H-D01 - Spring 2017 TR 10:45-12:15 p.m., HH 205

ITSC 2321 Integrated Software Applications II COURSE SYLLABUS

Computer Architecture CSC

MAT 122 Intermediate Algebra Syllabus Summer 2016

MGMT 479 (Hybrid) Strategic Management

Syllabus ENGR 190 Introductory Calculus (QR)

Monday/Wednesday, 9:00 AM 10:30 AM

Math 181, Calculus I

Class Mondays & Wednesdays 11:00 am - 12:15 pm Rowe 161. Office Mondays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

ACCT 100 Introduction to Accounting Course Syllabus Course # on T Th 12:30 1:45 Spring, 2016: Debra L. Schmidt-Johnson, CPA

ECD 131 Language Arts Early Childhood Development Business and Public Service

Introduction to Forensic Drug Chemistry

Math 96: Intermediate Algebra in Context

Accounting 380K.6 Accounting and Control in Nonprofit Organizations (#02705) Spring 2013 Professors Michael H. Granof and Gretchen Charrier

International Environmental Policy Spring :374:315:01 Tuesdays, 10:55 am to 1:55 pm, Blake 131

CS 100: Principles of Computing

Pre-AP Geometry Course Syllabus Page 1

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Strategic Management (MBA 800-AE) Fall 2010

(Sub)Gradient Descent

Course Syllabus. Alternatively, a student can schedule an appointment by .

CALCULUS III MATH

Computer Science 1015F ~ 2016 ~ Notes to Students

Course Syllabus MFG Modern Manufacturing Techniques I Spring 2017

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

PHO 1110 Basic Photography for Photographers. Instructor Information: Materials:

COURSE DESCRIPTION PREREQUISITE COURSE PURPOSE

Aerospace Engineering

AST Introduction to Solar Systems Astronomy

MGMT 5303 Corporate and Business Strategy Spring 2016

Handbook for Graduate Students in TESL and Applied Linguistics Programs

PBHL HEALTH ECONOMICS I COURSE SYLLABUS Winter Quarter Fridays, 11:00 am - 1:50 pm Pearlstein 308

CHEMISTRY 104 FALL Lecture 1: TR 9:30-10:45 a.m. in Chem 1351 Lecture 2: TR 1:00-2:15 p.m. in Chem 1361

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Course Syllabus p. 1. Introduction to Web Design AVT 217 Spring 2017 TTh 10:30-1:10, 1:30-4:10 Instructor: Shanshan Cui

MGT/MGP/MGB 261: Investment Analysis

The Policymaking Process Course Syllabus

MATH 205: Mathematics for K 8 Teachers: Number and Operations Western Kentucky University Spring 2017

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

FINN FINANCIAL MANAGEMENT Spring 2014

Chromatography Syllabus and Course Information 2 Credits Fall 2016

The University of Texas at Tyler College of Business and Technology Department of Management and Marketing SPRING 2015

ASTRONOMY 2801A: Stars, Galaxies & Cosmology : Fall term

EEAS 101 BASIC WIRING AND CIRCUIT DESIGN. Electrical Principles and Practices Text 3 nd Edition, Glen Mazur & Peter Zurlis

CPMT 1347 Computer System Peripherals COURSE SYLLABUS

MAR Environmental Problems & Solutions. Stony Brook University School of Marine & Atmospheric Sciences (SoMAS)

Coding II: Server side web development, databases and analytics ACAD 276 (4 Units)

BA 130 Introduction to International Business

IDS 240 Interdisciplinary Research Methods

3D DIGITAL ANIMATION TECHNIQUES (3DAT)

MKT ADVERTISING. Fall 2016

MAE Flight Simulation for Aircraft Safety

Transcription:

IST 718: Advanced Information Analytics Course information Instructor: Daniel E. Acuna, Ph.D. deacuna@syr.edu, acuna.io, github.com/daniel-acuna Week: May 22 May 26, 2017 Class: TBA Office: 312 Hinds Hall Catalog description A broad introduction to analytical processing tools and techniques for information professionals. Students will develop a portfolio of resources, demonstrations, recipes, and examples of various analytical techniques. Detailed course description This course will prepare you to participate as a Data Scientist on big data and data analytics projects. Upon the successful completion of this course, you will be able to: Translate a business challenge into an analytics challenge; Analyze big data, create statistical models, and identify insights that can lead to actionable results; Use Python and Apache Spark to build big data analytics pipelines Learn classic and state of the art machine learning techniques Explain how advanced analytics can be leveraged to create competitive advantage; Prerequisite knowledge required Familiarity with command-line interfaces, basic quantitative skills, including statistics, as well as programming skills in Python. Textbooks: We will use 2 textbooks in this course An introduction to Statistical Learning with Applications in R (ISLR) by Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani http://www-bcf.usc.edu/~gareth/isl/islr%20sixth%20printing.pdf Deep Learning (DL) by Ian Goodfellow, Yoshua Bengio, and Aaron Courville http://www.deeplearningbook.org/ 1 OF 5

Course Topics The following is a tentative outline of topics to be covered in the course: Day Topics Reading 1 Overview of the course May 22 Linux; some math and calculus. Python Programming Jupyter notebook; Pandas; Matplotlib 2 Introduction to MapReduce, Hadoop, and Yarn May 23 Introduction to Apache Spark Resilient Distributed Datasets DataFrames Spark SQL MLLIB ML Pipelines 3 A Statistical Perspective on Machine Learning Introduction to probability Bias variance tradeoff confusion matrix model selection: training, validating, and testing In depth case 1: Sentiment Analysis on Twitter Supervised learning Regularized logistic regression Model interpretation Model comparison 4 In depth case 2: A recommendation system for courses Unsupervised learning Nearest neighbors Dimensionality reduction Clustering 5 In depth case 3: Predicting Credit Scores with Random Forest (Bagging) and Gradient Boosting Bagging Boosting Interpretation of gray models ISLR Ch1; Ch 2.1 DL Ch 1-2 ISLR Ch 2.2 May 24 ISLR Ch 6 May 25 ISLR Ch. 10, sections.10.1, 10.2 and 10.3 Mar 26 2 OF 5

Methods of evaluation: Assessment Notes Points Quizzes (4) Covers concepts; unannounced; no make-ups; only the best 5 will be considered in the final grade 20 (5 x 4) Homework (3) Based on class materials; late submission will be discounted Group Project (1) In groups of 3 or 4; you choose your teammates 40 Participation In class and Blackboard forums 10 TOTAL 100 30 (30 x 10) Grading scale Total Points Earned Registrar Grade 95-100 A 90-94 A - 85-89 B + 80-84 B 75-79 B - 70-74 C + 65-69 C 60-64 C - 50-59 D 0-49 F Quizzes Quizzes are individual effort, in-class short tests which measure your understanding of the concepts and terminology covered in class, labs and the assigned readings. Quizzes could be issued at the beginning of class. Please report to class on time. No make-ups will be given to absent or late-arriving students. Consider quizzes part of your attendance. Quizzes are unannounced. There are no quiz dates posted on the syllabus, so expect there will be a quiz for class. Quizzes will cover all material up to the point where the quiz is issued 3 OF 5

Homework While you are encouraged to discuss homework with your classmates, homework programming and writing is individual effort. It is designed to check that you are keeping pace with in class concepts and out-of-class lab activities. The intention of homework is to ensure students are keeping pace with the out of class activities. For late submissions, the grade of your homework will be multiplied by the following factor, where days (could be fractional) is the number of days submitted late (days > 0): def grade_factor(days): if days > 3: return 0 else: return 1/(1 + days)**(3/4) e.g., 1 day late 60% of grade, 2 days late 43%, 2 hours late ~95% Submissions later than 72 hours from deadline will zero grade. Group Project The group project is your chance to demonstrate what you ve learned in the course and apply it to a new scenario. It is expected that each group s project will be novel, and that all will be of the highest quality. Your group will be responsible for finding a data set, analyzing it, and producing visualizations and findings from your analysis. Participation It is expected that you participate in class and Blackboard forum discussions. Also, it is expected that you visit the professor during office hours or set up a time to meet him during the semester. Additionally, it is expected that the group visits the professor to discuss the project. General information Teaching philosophy My teaching philosophy centers around students as critical thinkers that challenge current beliefs using arguments rooted in strong evidence. There are three concepts to this We are all inherently curious about how the world works and have an unbounded set of needs We all make mistakes and all questions are valid, however we only realize these blind spots when we critically think and discuss our ideas with others Data is only a means to a goal but we are responsible for keeping our analysis, policy recommendations, and conclusions as ethical and compassionate as possible. Who does well in the course? This is a relatively heavy load course and an ideal student should follow the following items: Study consistently throughout the semester in short burst. Research has shown that pulling allnighters and studying just before the class or lab will make you forget the contents later in your career. Make an effort to study everyday at least 30 minutes for the class. 4 OF 5

Be active in class and ask questions to the professor. Challenge the materials and try to see all angles of the ideas and conclusions being presented in class. Critical thinking is as important as technical ability in a data science job and it is a highly appreciated skill Focus on learning how to program and the pieces involved in developing professional software. Although this class does not assume a large amount of programming experience, the sooner you start learning about Python and the tools taught in this class, the better you are going to do throughout the semester. Data scientists who are excellent critical thinkers AND known how to transform ideas into software are the most prized in the job market. Academic Integrity Policy Syracuse University s academic integrity policy reflects the high value that we, as a university community, place on honesty in academic work. The pilot policy in effect at the School of Information Studies defines our expectations for academic honesty and holds students accountable for the integrity of all work they submit. Students should understand that it is their responsibility to learn about coursespecific expectations, as well as about university-wide academic integrity expectations. The pilot policy governs appropriate citation and use of sources, the integrity of work submitted in exams and assignments, and the veracity of signatures on attendance sheets and other verification of participation in class activities. The pilot policy also prohibits students from submitting the same work in more than one class without receiving written authorization in advance from both instructors. Under the pilot policy, students found in violation are subject to grade sanctions determined by the course instructor and nongrade sanctions determined by the School or College where the course is offered. SU students are required to read an online summary of the university s academic integrity expectations and provide an electronic signature agreeing to abide by them twice a year during pre-term check-in on MySlice. For more information and the pilot policy, see http://academicintegrity.syr.edu Disability-Related Accommodations Syracuse University values diversity and inclusion; we are committed to a climate of mutual respect and full participation. If you believe that you need accommodations for a disability, please contact the Office of Disability Services (ODS), disabilityservices.syr.edu, located at 804 University Avenue, room 309, or call 315.443.4498 for an appointment to discuss your needs and the process for requesting accommodations. ODS is responsible for coordinating disability-related accommodations and will issue Accommodation Authorization Letters to students as appropriate. Since accommodations may require early planning and generally are not provided retroactively, please contact ODS as soon as possible. Our goal at the ischool is to create learning environments that are useable, equitable, inclusive and welcoming. If there are aspects of the instruction or design of this course that result in barriers to your inclusion or accurate assessment or achievement, please meet with me to discuss additional strategies beyond official accommodations that may be helpful to your success. Religious Observances Notification and Policy SU s religious observances policy, found at supolicies.syr.edu/emp_ben/religious_observance.htm, recognizes the diversity of faiths represented in the campus community and protects the rights of students, faculty, and staff to observe religious holy days according to their tradition. Under the policy, students should have an opportunity to make up any examination, study, or work requirements that may be missed due to a religious observance provided they notify their instructors no later than the end of the second week of classes through an online notification form in MySlice listed under Student Services/Enrollment/My Religious Observances/Add a Notification. 5 OF 5