San José State University Computer Engineering Department CMPE/SE 188, Machine Learning for Big Data, Section 01, Spring 2017

Similar documents
Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

San José State University

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

CS 3516: Computer Networks

Aerospace Engineering

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

MATH 1A: Calculus I Sec 01 Winter 2017 Room E31 MTWThF 8:30-9:20AM

Class Tuesdays & Thursdays 12:30-1:45 pm Friday 107. Office Tuesdays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

COURSE NUMBER: COURSE NUMBER: SECTION: 01 SECTION: 01. Office Location: WSQ 104. (preferred contact)

COURSE WEBSITE:

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

CIS 121 INTRODUCTION TO COMPUTER INFORMATION SYSTEMS - SYLLABUS

Instructor Experience and Qualifications Professor of Business at NDNU; Over twenty-five years of experience in teaching undergraduate students.

MGT/MGP/MGB 261: Investment Analysis

CS 100: Principles of Computing

Class Mondays & Wednesdays 11:00 am - 12:15 pm Rowe 161. Office Mondays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

Beginning and Intermediate Algebra, by Elayn Martin-Gay, Second Custom Edition for Los Angeles Mission College. ISBN 13:

Foothill College Summer 2016

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

Course Policies and Syllabus BUL3130 The Legal, Ethical, and Social Aspects of Business Syllabus Spring A 2017 ONLINE

Texas A&M University-Central Texas CISK Comprehensive Networking C_SK Computer Networks Monday/Wednesday 5.

MAT 122 Intermediate Algebra Syllabus Summer 2016

HCI 440: Introduction to User-Centered Design Winter Instructor Ugochi Acholonu, Ph.D. College of Computing & Digital Media, DePaul University

(Sub)Gradient Descent

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

STANDARDIZED COURSE SYLLABUS

English Policy Statement and Syllabus Fall 2017 MW 10:00 12:00 TT 12:15 1:00 F 9:00 11:00

Course Syllabus for Math

Math 181, Calculus I

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

SAN JOSÉ STATE UNIVERSITY URBAN AND REGIONAL PLANNING DEPARTMENT URBP 236 URBAN AND REGIONAL PLANNING POLICY ANALYSIS: TOOLS AND METHODS SPRING 2016

Principles Of Macroeconomics Case Fair Oster 10e

MGMT 479 (Hybrid) Strategic Management

MTH 141 Calculus 1 Syllabus Spring 2017

Python Machine Learning

International Business BADM 455, Section 2 Spring 2008

BA 130 Introduction to International Business

Syllabus Foundations of Finance Summer 2014 FINC-UB

ACCT 100 Introduction to Accounting Course Syllabus Course # on T Th 12:30 1:45 Spring, 2016: Debra L. Schmidt-Johnson, CPA

CSL465/603 - Machine Learning

Course Syllabus MFG Modern Manufacturing Techniques I Spring 2017


PSYCHOLOGY 353: SOCIAL AND PERSONALITY DEVELOPMENT IN CHILDREN SPRING 2006

Spring 2015 Natural Science I: Quarks to Cosmos CORE-UA 209. SYLLABUS and COURSE INFORMATION.

Social Media Journalism J336F Unique ID CMA Fall 2012

Mktg 315 Marketing Research Spring 2015 Sec. 003 W 6:00-8:45 p.m. MBEB 1110

Syllabus ENGR 190 Introductory Calculus (QR)

Learning From the Past with Experiment Databases

Bittinger, M. L., Ellenbogen, D. J., & Johnson, B. L. (2012). Prealgebra (6th ed.). Boston, MA: Addison-Wesley.

STA2023 Introduction to Statistics (Hybrid) Spring 2013

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Introduction to Information System

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Computer Science 1015F ~ 2016 ~ Notes to Students

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

BUSINESS FINANCE 4265 Financial Institutions

ITSC 2321 Integrated Software Applications II COURSE SYLLABUS

Page 1 of 8 REQUIRED MATERIALS:

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

POFI 1349 Spreadsheets ONLINE COURSE SYLLABUS

ITSC 1301 Introduction to Computers Course Syllabus

Mining Association Rules in Student s Assessment Data

KOMAR UNIVERSITY OF SCIENCE AND TECHNOLOGY (KUST)

Process to Identify Minimum Passing Criteria and Objective Evidence in Support of ABET EC2000 Criteria Fulfillment

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

MATH 205: Mathematics for K 8 Teachers: Number and Operations Western Kentucky University Spring 2017

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

IST 440, Section 004: Technology Integration and Problem-Solving Spring 2017 Mon, Wed, & Fri 12:20-1:10pm Room IST 202

Academic Support Services Accelerated Learning Classes The Learning Success Center SMARTHINKING Student computer labs Adult Education

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Course Syllabus It is the responsibility of each student to carefully review the course syllabus. The content is subject to revision with notice.

*In Ancient Greek: *In English: micro = small macro = large economia = management of the household or family

SYLLABUS- ACCOUNTING 5250: Advanced Auditing (SPRING 2017)

Course Content Concepts

SOUTHERN MAINE COMMUNITY COLLEGE South Portland, Maine 04106

MGMT 5303 Corporate and Business Strategy Spring 2016

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 (click on Math My Way tab) Math My Way Instructors:

Fall 2016 ARA 4400/ 7152

Course Syllabus Chem 482: Chemistry Seminar

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Instructor. Darlene Diaz. Office SCC-SC-124. Phone (714) Course Information

Managerial Economics 12th Edition Answers

CIS Introduction to Digital Forensics 12:30pm--1:50pm, Tuesday/Thursday, SERC 206, Fall 2015

Chromatography Syllabus and Course Information 2 Credits Fall 2016

BIODIVERSITY: CAUSES, CONSEQUENCES, AND CONSERVATION

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

Mcgraw Hill Financial Accounting Connect Promo Code

GIS 5049: GIS for Non Majors Department of Environmental Science, Policy and Geography University of South Florida St. Petersburg Spring 2011

MGMT 3280: Strategic Management

Business Ethics Philosophy 305 California State University, Northridge Fall 2011

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

ASTRONOMY 2801A: Stars, Galaxies & Cosmology : Fall term

Social Media Marketing BUS COURSE OUTLINE

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

Connect Mcgraw Hill Managerial Accounting Promo Code

Office Hours: Day Time Location TR 12:00pm - 2:00pm Main Campus Carl DeSantis Building 5136

SYLLABUS: RURAL SOCIOLOGY 1500 INTRODUCTION TO RURAL SOCIOLOGY SPRING 2017

Transcription:

San José State University Computer Engineering Department CMPE/SE 188, Machine Learning for Big Data, Section 01, Spring 2017 Course and Contact Information Instructor: Office Location: Magdalini Eirinaki ENG 283F Telephone: (408) 924-3828 Email: Office Hours: Class Days/Time: magdalini.eirinaki@sjsu.edu Tuesday, 2-4 PM Tuesday & Thursday, 10:30 11:45 AM Classroom: Hugh Gillis Hall 124 Prerequisites: CMPE 126 (for BS CMPE students) / CS 146 (for BS SE students) Course Format Technology Intensive, Hybrid, and Online Courses This course requires the student to have a personal laptop that is installed with a modern operating system. The lectures will be delivered in the classroom, however the students might be asked to use their laptops or smart devices during the class, or offline in order to participate in the class assignments. Faculty Web Page and MYSJSU Messaging Course materials such as syllabus, handouts, notes, assignment instructions, etc. can be found on Canvas Leaning Management System course login website at http://sjsu.instructure.com. You are responsible for daily checking with the messaging system through Canvas and MySJSU at http://my.sjsu.edu to learn of any updates. Course Description Introduction to machine learning and pattern recognition for big data analytics; machine learning concepts, theories, approaches, algorithms, and big data analytic applications; supervised learning, unsupervised learning, and learning theory. Course Goals This course focuses on machine learning algorithms and methodologies to support large-scale data analysis. The course covers fundamental machine learning algorithms and techniques, such as regression, classification, and clustering models, as well as more contemporary ones, including collaborative filtering and social network analysis techniques. The course will also review techniques that allow for scalability and processing of large amounts of data, such as parallelization models, hashing, and dimensionality reduction techniques. Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 1 of 7

This course involves a group-based term project to provide students with the opportunity to build a simplified data or web mining application, and to enhance their professional engineering skills including practical application of state-of-the-art big data and machine learning tools and frameworks, teamwork, technical leadership, and effective communication skills (both written and verbal). The course also includes a set of individual assignments and survey projects to enable students to deepen their knowledge on the material. Course Learning Outcomes (CLO) (Required) Upon successful completion of this course, students will be able to: 1. CLO 1- Describe the fundamental concepts of several machine learning algorithms and techniques. 2. CLO 2 Demonstrate an understanding of and ability to use emergent big data technologies. 3. CLO 3 Explain how appropriate machine learning approaches and techniques can be applied to solve given problems. 4. CLO 4 Use machine learning models, methods, and big data technology and tools to complete a given big data analytics project Required Texts/Readings This class does not have a single textbook. Instead, the students have to study material coming from various books, papers and other resources, all of which are free to download (for academic use). It is each student s responsibility to consult with the updated syllabus on Canvas in order to identify which readings cover the concepts that are taught each week. A list of reference textbooks is also provided for those who d like to get some background knowledge or seek more details on the topics covered in class. Textbooks [HKP] Data Mining: Concepts and Techniques, by Jiawei Han, Micheline Kamber and Jian Pei Morgan Kaufmann, Elsevier Inc. (2011), ISBN: 9780123814791 (available as ebook from the SJSU Library) [MMDS] Mining of Massive Datasets, by Jure Leskovec, Anand Rajaraman and Jeffrey Ullman, 2 nd edition, Cambridge University Press, December 2014 (download from http://infolab.stanford.edu/~ullman/mmds/book.pdf ) [ISLR] An Introduction to Statistical Learning with Applications In R, by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani, Springer Texts in Statistics, 2013 (download from http://wwwbcf.usc.edu/~gareth/isl/) Other Readings Papers, tutorial slides, articles and all other material that will be made available via Canvas Lecture slides (available via Canvas) Reference textbooks Data Mining, The Textbook, by Charu C. Aggarwal Springer (2015), ISBN: 9783319381169 Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 2 of 7

Recommender Systems: The textbook, by Charu Aggrawal Springer, 2016, ISBN 978-3-319-29659-3 (available as ebook from the SJSU Library) Machine Learning, by Tom M. Michell, McGraw Hill (1997), ISBN: 0070428077 Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, by Bing Liu Springer (2011), ISBN: 3540378812 Social Media Mining, An Introduction, by Reza Zafarni, Mohammad Ali Abasi, Huan Liu Cambridge University Press, 2014 (available to download at: http://dmml.asu.edu/smm/smm.pdf ) Other technology requirements / equipment / material Programming languages, platforms, as well as software applications and tools, such as Spark, Mahout, R/RStudio, Python, WEKA, Tableau, etc. that will be required for this class are either free to download, or the instructor will provide the students with academic licenses. Students will be informed in class and via Canvas ahead of time in order to install all required software. Grading Information SJSU classes are designed such that in order to be successful, it is expected that students will spend a minimum of forty-five hours for each unit of credit (normally three hours per unit per week), including preparing for class, participating in course activities, completing assignments, and so on. More details about student workload can be found in University Policy S12-3 at http://www.sjsu.edu/senate/docs/s12-3.pdf. Student Assessment In-class activities 5% Individual homework assignments 5% Quizzes 10% Term Project 25% 2 Midterm Exams 25% Final Exam (comprehensive) 30% Descriptions of Assignments/Exams In-class activities: Students will be evaluated based on their participation in in-class assignments. All students are required to write their names on the submitted work and/or submit their answers online using their unique IDs, shared with the instructors. Failing to do so, even if the student was indeed present in the class, will result in zero credit as the instructor is unable to verify the student s claims. Moreover, students whose names appear on submitted work, but were not in class, as well as the students who submitted their name on their behalf are violating the academic integrity policy and will be reported immediately to the office of Student Conduct and Ethical Development. Individual Written/Programming Assignments and Quizzes: Students will be provided with details describing the assignments and how they will be graded every week. These assignments will be in-class or take-home written assignments, in-class or take-home lab assignments, and/or presentation assignments for research papers or articles. Students will also have to answer to quizzes that will be based on the homework assignment that is due that day. The worst quiz grade will not be counted towards the final pop quiz grade of each student ( worst-one out policy ). Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 3 of 7

Term Project: Groups of 2-3 students will be formed to work on a term-long group project related to data or web mining. The project has deliverables throughout the semester. The quality and completeness of all the deliverables will be considered in grading the projects. All projects will be demonstrated in class. The project details will be announced by the instructor and posted on the course s web site well before the deadlines. Each group member is expected to participate in every phase of the project. The final grade of each member will be proportional to his/her participation in the group, as assessed by the instructor and the student s peers. Each member should be able to answer questions regarding the project, present some part of the project demo, and participate in the system implementation and the writing of the technical reports. The term project will be graded on the basis of the following three components: a) project implementation, b) project report, c) project demonstration. Grading will be rubric-based. Exams: Exams will be a combination of multiple choice and short answer questions and will be based on the individual assignments and course material covered in class. The final exam is comprehensive and the date is determined by the University s Final Examination Schedule. NOTE that University policy F69-24 at http://www.sjsu.edu/senate/docs/f69-24.pdf states that Students should attend all meetings of their classes, not only because they are responsible for material discussed therein, but because active participation is frequently essential to insure maximum benefit for all members of the class. Attendance per se shall not be used as a criterion for grading. Determination of Grades The final grades will be calculated based on the following: (A+) >= 98, (A) >= 94 and <98 (A-) >= 90 and <94 (B+) >=85 and <90 (B) >= 75 and <85 (B-) >= 70 and <75 (C+)>= 68 and <70, (C) >=64 and <68 (C-) >= 60 and <64, (D) >=50 and <60, (F) < 50 No late assignments will be accepted. An extension will be granted only if a student has serious and compelling reasons that can be proven by an independent authority (e.g. doctor s note if the student has been sick). The exam dates are final. All students have the right, within a reasonable time, to know their academic scores, to review their gradedependent work, and to be provided with explanations for the determination of their course grades. Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 4 of 7

Classroom Protocol You are expected to arrive in time for class. While in class you need to turn off your cellphone unless directed otherwise by your instructor. Laptop/tablet/smart phone use is allowed only for activities related to the class. Please be considerate of your fellow students. University Policies Per University Policy S16-9, university-wide policy information relevant to all courses, such as academic integrity, accommodations, etc. will be available on Office of Graduate and Undergraduate Programs Syllabus Information web page at http://www.sjsu.edu/gup/syllabusinfo/ Department Policies Students who do not provide documentation of having satisfied the class prerequisite or co-requisite requirements (if any) by the second class meeting will be dropped from the class. All non-proctored report (or similarly sized) assignments in courses where some of the final grade depends on prose writing will be submitted to Turnitin.com. Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 5 of 7

CMPE/SE 188 / Machine Learning for Big Data, Spring 2017, Course Schedule The schedule (and related dates/readings/assignments) is tentative and subject to change with fair notice. In case of guest lectures the syllabus will be updated accordingly. Any changes will be announced in due time in class and on the course s web site (Canvas). The students are obliged to consult the most updated and detailed version of the reading material and syllabus, which will be posted on Canvas. Course Schedule Week Date Topics, Readings, Assignments, Deadlines 1 1/26 Introduction to CMPE 188 2 1/31 Introduction to Machine Learning 2/2 Machine Learning in the big data era 3-5 2/7 Supervised Learning: Linear Regression, K-Nearest Neighbors, Decision Trees, Naïve Bayes, SVM, Outlier detection, Ensemble Methods, Evaluation 2/9 2/14 2/16 2/21 2/23 6 2/28 Scaling for big data: Dimensionality Reduction, Spark 3/2 MIDTERM 1 7-8 3/7 Unsupervised Learning: K-Means, Hierarchical Clustering 3/9 3/14 3/16 9 3/21 Recommendation systems Content-based Collaborative Filtering, User- and 3/23 Item-based Collaborative Filtering Week of 3/27 SPRING BREAK 10-11 4/4 Recommendation systems Scaling for big data: Latent factor Collaborative Filtering/Matrix Factorization 4/6 Evaluation methods 4/11 11 4/13 MIDTERM 2 12 4/18 Unsupervised Learning: Association Rules Mining 4/20 13 4/25 Social Network Analysis 4/27 Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 6 of 7

Week Date Topics, Readings, Assignments, Deadlines 14 5/2 Deep Learning: Convolutional Neural Networks 5/4 15-16 5/9 Project Presentations 5/11 Finals Week 5/26 Monday, May 22, 9:45-12 FINAL EXAM Machine Learning for Big Data, CMPE 188-01, Spring 2017 Page 7 of 7