INF 553: Foundations and Applications of Data Mining. USC Viterbi School of Engineering. Syllabus. Units: 4

Similar documents
Coding II: Server side web development, databases and analytics ACAD 276 (4 Units)

Individual Instruction Voice (MPVA 300, 301, 501) COURSE INFORMATION Course Description Learning Objectives: Course Information

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

MGMT 3362 Human Resource Management Course Syllabus Spring 2016 (Interactive Video) Business Administration 222D (Edinburg Campus)

Corporate Communication

BAEP 557: Technology Commercialization

SYLLABUS: RURAL SOCIOLOGY 1500 INTRODUCTION TO RURAL SOCIOLOGY SPRING 2017

ITAL 020x Fall 2017 Instructor: James Fortney. Italian 020x Fall 2017 Course in Reading Italian

INTERMEDIATE ALGEBRA Course Syllabus

Course Content Concepts

Applied Trumpet V VIII

CRITICAL THINKING AND WRITING: ENG 200H-D01 - Spring 2017 TR 10:45-12:15 p.m., HH 205

Foothill College Summer 2016

Course Syllabus MFG Modern Manufacturing Techniques I Spring 2017

HIST 3300 HISTORIOGRAPHY & METHODS Kristine Wirts

IST 440, Section 004: Technology Integration and Problem-Solving Spring 2017 Mon, Wed, & Fri 12:20-1:10pm Room IST 202

Cleveland State University Introduction to University Life Course Syllabus Fall ASC 101 Section:

Human Development: Life Span Spring 2017 Syllabus Psych 220 (Section 002) M/W 4:00-6:30PM, 120 MARB

FINN FINANCIAL MANAGEMENT Spring 2014

CS 100: Principles of Computing

Syllabus ENGR 190 Introductory Calculus (QR)

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

MATH 1A: Calculus I Sec 01 Winter 2017 Room E31 MTWThF 8:30-9:20AM

Class Mondays & Wednesdays 11:00 am - 12:15 pm Rowe 161. Office Mondays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

HCI 440: Introduction to User-Centered Design Winter Instructor Ugochi Acholonu, Ph.D. College of Computing & Digital Media, DePaul University

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Scottsdale Community College Spring 2016 CIS190 Intro to LANs CIS105 or permission of Instructor

FINANCE 3320 Financial Management Syllabus May-Term 2016 *

Bittinger, M. L., Ellenbogen, D. J., & Johnson, B. L. (2012). Prealgebra (6th ed.). Boston, MA: Addison-Wesley.

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Required Texts: Intermediate Accounting by Spiceland, Sepe and Nelson, 8E Course notes are available on UNM Learn.

Class Tuesdays & Thursdays 12:30-1:45 pm Friday 107. Office Tuesdays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

IDS 240 Interdisciplinary Research Methods

THE GEORGE WASHINGTON UNIVERSITY Department of Economics. ECON 1012: PRINCIPLES OF MACROECONOMICS Prof. Irene R. Foster

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

SOCIAL PSYCHOLOGY. This course meets the following university learning outcomes: 1. Demonstrate an integrative knowledge of human and natural worlds

International Business BADM 455, Section 2 Spring 2008

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

CMST 2060 Public Speaking

Speak Up 2012 Grades 9 12

COURSE DESCRIPTION PREREQUISITE COURSE PURPOSE

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

STA2023 Introduction to Statistics (Hybrid) Spring 2013

Ruggiero, V. R. (2015). The art of thinking: A guide to critical and creative thought (11th ed.). New York, NY: Longman.

MGMT 479 (Hybrid) Strategic Management

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

Interior Design 350 History of Interiors + Furniture

TA Decision Chart WHAT IS THIS RESOURCE? DESCRIPTION AND CONTACT INFORMATION POLICIES

ENGLISH 298: Intensive Writing

RM 2234 Retailing in a Digital Age SPRING 2016, 3 credits, 50% face-to-face (Wed 3pm-4:15pm)

Required Text: Oltmanns, T. & Emery, R. (2014). Abnormal Psychology (8th Edition) ISBN-13: ISBN-10:

COMM370, Social Media Advertising Fall 2017

Syllabus Foundations of Finance Summer 2014 FINC-UB

Philosophy in Literature: Italo Calvino (Phil. 331) Fall 2014, M and W 12:00-13:50 p.m.; 103 PETR. Professor Alejandro A. Vallega.

IPHY 3410 Section 1 - Introduction to Human Anatomy Lecture Syllabus (Spring, 2017)

AGN 331 Soil Science Lecture & Laboratory Face to Face Version, Spring, 2012 Syllabus

INTRODUCTION TO CULTURAL ANTHROPOLOGY ANT 2410 FALL 2015

Demography and Population Geography with GISc GEH 320/GEP 620 (H81) / PHE 718 / EES80500 Syllabus

MATH 205: Mathematics for K 8 Teachers: Number and Operations Western Kentucky University Spring 2017

Adler Graduate School

Syllabus: CS 377 Communication and Ethical Issues in Computing 3 Credit Hours Prerequisite: CS 251, Data Structures Fall 2015

Indiana University Northwest Chemistry C110 Chemistry of Life

SAINT LOUIS UNIVERSITY, Madrid Campus Fine and Performing Arts Department

INDES 350 HISTORY OF INTERIORS AND FURNITURE WINTER 2017

PBHL HEALTH ECONOMICS I COURSE SYLLABUS Winter Quarter Fridays, 11:00 am - 1:50 pm Pearlstein 308

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

MBA 5652, Research Methods Course Syllabus. Course Description. Course Material(s) Course Learning Outcomes. Credits.

Texas A&M University-Central Texas CISK Comprehensive Networking C_SK Computer Networks Monday/Wednesday 5.

ITM2500 Spreadsheet & Database Productivity. Spreadsheet & Database Productivity

Master Syllabus ENGL 1020 English Composition II

MKT ADVERTISING. Fall 2016

ARH 390 Survey of Decorative Arts & Design: The Ancient World to Present Online, Sec. 01, 03 Credit Hours Summer 2017

Class meetings: Time: Monday & Wednesday 7:00 PM to 8:20 PM Place: TCC NTAB 2222

Course outline. Code: ICT310 Title: Systems Analysis and Design

COURSE WEBSITE:

ECO 2013: PRINCIPLES OF MACROECONOMICS Spring 2017

PSCH 312: Social Psychology

Course Syllabus for Math

CHEMISTRY 104 FALL Lecture 1: TR 9:30-10:45 a.m. in Chem 1351 Lecture 2: TR 1:00-2:15 p.m. in Chem 1361

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

The University of Texas at Tyler College of Business and Technology Department of Management and Marketing SPRING 2015

Texas A&M University - Central Texas PSYK EDUCATIONAL PSYCHOLOGY INSTRUCTOR AND CONTACT INFORMATION

AGN 331 Soil Science. Lecture & Laboratory. Face to Face Version, Spring, Syllabus

Navigating the PhD Options in CMS

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

TEACHING SECOND LANGUAGE COMPOSITION LING 5331 (3 credits) Course Syllabus

Math 181, Calculus I

Course Syllabus. Alternatively, a student can schedule an appointment by .

KOREAN 305: ADVANCED KOREAN I (Fall 2017)

UNIVERSITY OF BALTIMORE SCHOOL OF LAW FALL SEMESTER 2017

MGMT3403 Leadership Second Semester

BUS Computer Concepts and Applications for Business Fall 2012

BSM 2801, Sport Marketing Course Syllabus. Course Description. Course Textbook. Course Learning Outcomes. Credits.

University of Florida SPM 6905 Leading and Coaching Athletics Online Course Summer A 2017

I275 Introduction to Human-Computer Interaction Theory

Syllabus Fall 2014 Earth Science 130: Introduction to Oceanography

Neuroscience I. BIOS/PHIL/PSCH 484 MWF 1:00-1:50 Lecture Center F6. Fall credit hours

MANA 7A97 - STRESS AND WORK. Fall 2016: 6:00-9:00pm Th. 113 Melcher Hall

Transcription:

USC Viterbi School of Engineering INF 553: Foundations and Applications of Data Mining Syllabus Units: 4 Term Day Time: Spring 2017, MW 6:00-7:50 pm Location: Online Instructor: Yao-Yi Chiang, PhD GISP Office: AHF B55C Regular Office Hours: Monday after class Contact Info: yaoyic@usc.edu, https://bluejeans.com/5067546751 (BlueJeans), 213-740- 7618 (office), yaoyichiang (Skype). Course Producer: TBD Office: TBD Office Hours: TBD Contact Info: TBD

Catalogue Course Description Data mining and machine learning algorithms for analyzing very large data sets. Emphasis on Map Reduce. Case studies. Expanded Course Description Data mining is a foundational piece of the data analytics skill set. At a high level, it allows the analyst to discover patterns in data, and transform it into a usable product. The course will teach data mining algorithms for analyzing very large data sets. It will have an applied focus, in that it is meant for preparing students to utilize topics in data mining to solve real world problems. Recommended Preparation INF 550, INF 551 and INF 552. Knowledge of probability, linear algebra, basic programming, and some machine learning. A basic understanding engineering principles is required, including basic programming skills; familiarity with the Python language is desirable. Most assignments are designed for the Unix environment; basic Unix skills will make programming assignments much easier. Students will need sufficient mathematical background, including probability, statistics, and linear algebra. Some knowledge of machine learning is helpful, but not required. Course Notes The course will be run as a lecture class with student participation strongly encouraged. There are weekly readings and students are encouraged to do the readings prior to the discussion in class. All of the course materials, including the readings, lecture slides, home works will be posted online Technological Proficiency and Hardware/Software Required Students are expected to know how to program in a language such as Python. Students are also expected to have their own laptop or desktop computer where they can install and run software to do the weekly homework assignments. Required Readings and Supplementary Materials Rajaraman, J. Leskovec and J. D. Ullman, Mining of Massive Datasets o Cambridge University Press, 2012. o Available free at: http://infolab.stanford.edu/~ullman/mmds.html In addition to the textbook, students may be given additional reading materials such as research papers. Students are responsible for all assigned reading assignments. Description and Assessment of Assignments Homework Assignments: There will be 5 homework assignments. The assignments must be done individually. Each assignment is graded on a scale of 0-100 and the specific rubric for each assignment is given in the assignment. Grading Breakdown INF 553 Syllabus, Page 2 of 6

Quizzes: There will be weekly quizzes based on the material from the week before. There is no mid-term for this class. Homework: There will be 5 homeworks based on the topics of the class each week. Comprehensive Exam: There is a final exam at the end of the semester covering all of the material covered in the class. Grading Schema: Quizzes 30% Homework 45% Comprehensive Exam: 25% Total 100% Grades will range from A through F. The following is the breakdown for grading: 94 100 = A 74 76 = C 90 93 = A- 70 73 = C- 87 89 = B+ 67 69 = D+ 84 86 = B 64 66 = D 80 83 = B- 60 63 = D- 77 79 =C+ Below 60 is an F Assignment Submission Policy Homework assignments are due at 11:59pm on the due date and should be submitted in Blackboard. You can submit homework up to one week late, but you will lose 20% of the possible points for the assignment. After one week, the assignment cannot be submitted. INF 553 Syllabus, Page 3 of 6

Schedule Topic Readings and Assignments Deliverables/Due Dates Week 1 1/9 Introduction to Data Mining, MapReduce Ch1: Data Mining and Ch2: Large-Scale File Systems and Map- Reduce Week 2 1/17* *Monday, 1/16 is a university holiday MapReduce (cont.) Ch2: Large-Scale File Systems and Map- Reduce Week 3 1/23 Frequent itemsets and Association rules Ch6: Frequent itemsets, Ch3: Finding Similar Items (section 3.5: Distance Measures) Homework 1 assigned Week 4 1/30 Frequent itemsets and Association rules Ch6: Frequent itemsets Week 5 2/6 Shingling, Minhashing, Locality Sensitive Hashing Ch3: Finding Similar Items Homework 1 due, Homework 2 assigned Week 6 2/13 Shingling, Minhashing, Locality Sensitive Hashing Ch3: Finding Similar Items Week 7 2/21* *Monday, 2/20 is university holiday Recommendation Systems: Content-based and Collaborative Filtering Ch9: Recommendation systems, additional readings Week 8 2/27 Recommendation Systems: Content-based and Collaborative Filtering Ch9: Recommendation systems Homework 2 due, Homework 3 assigned Week 9 Clustering Ch7: Clustering 3/6 INF 553 Syllabus, Page 4 of 6

3/13* Spring Recess *3/13-3/17 is Spring Recess Week 10 3/20 Analysis of Massive Graphs (Social Networks) Ch10: Analysis of Social Networks Homework 3 due, Homework 4 assigned Week 11 3/27 Analysis of Massive Graphs (Social Networks) Ch10: Analysis of Social Networks Week 12 4/3 Link Analysis: PageRank, Web spam and TrustRank, Random Walks with Restarts Ch5: Link Analysis Homework 4 due, Homework 5 assigned Week 13 4/10 Web Advertising Ch8: Advertising on the Web Week 14 4/17 Mining data streams Ch4: Mining data streams Homework 5 due Week 15 4/24* Mining data streams Comprehensive Exam Ch4: Mining data streams *Friday, 4/28 is the last day of class Statement on Academic Conduct and Support Systems Academic Conduct Plagiarism presenting someone else s ideas as your own, either verbatim or recast in your own words is a serious academic offense with serious consequences. Please familiarize yourself with the discussion of plagiarism in SCampus in Section 11, Behavior Violating University Standards https://policy.usc.edu/student/scampus/part-b/. Other forms of academic dishonesty are equally unacceptable. See additional information in SCampus and university policies on scientific misconduct, http://policy.usc.edu/scientific-misconduct. Discrimination, sexual assault, and harassment are not tolerated by the university. You are encouraged to report any incidents to the Office of Equity and Diversity http://equity.usc.edu or to the Department of Public Safety http://adminopsnet.usc.edu/department/departmentpublic-safety. This is important for the safety of the whole USC community. Another member of the university community such as a friend, classmate, advisor, or faculty member can help INF 553 Syllabus, Page 5 of 6

initiate the report, or can initiate the report on behalf of another person. The Relationship and Sexual Violence Prevention Services http://engemannshc.usc.edu/rsvp/ provides 24/7 confidential support, and the sexual assault resource center webpage http://sarc.usc.edu describes reporting options and other resources. Support Systems A number of USC s schools provide support for students who need help with scholarly writing. Check with your advisor or program staff to find out more. Students whose primary language is not English should check with the American Language Institute http://dornsife.usc.edu/ali, which sponsors courses and workshops specifically for international graduate students. The Office of Disability Services and Programs http://sait.usc.edu/academicsupport/centerprograms/dsp/home_index.html provides certification for students with disabilities and helps arrange the relevant accommodations. If an officially declared emergency makes travel to campus infeasible, USC Emergency Information http://emergency.usc.edu will provide safety and other updates, including ways in which instruction will be continued by means of blackboard, teleconferencing, and other technology. Resources for Online Students The Course Blackboard page has many resources available for students enrolled in our graduate programs. In addition, all registered students can access electronic library resources through the link https://libraries.usc.edu/. INF 553 Syllabus, Page 6 of 6