Data Mining for Business Analytics. ISOM 3360 (L1): Spring 2017

Similar documents
The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CSL465/603 - Machine Learning

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Learning From the Past with Experiment Databases

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

95723 Managing Disruptive Technologies

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

CS Machine Learning

Spring 2015 Natural Science I: Quarks to Cosmos CORE-UA 209. SYLLABUS and COURSE INFORMATION.

GEOG 473/573: Intermediate Geographic Information Systems Department of Geography Minnesota State University, Mankato

Introduction to Forensic Drug Chemistry

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

BA 130 Introduction to International Business

FINANCE 3320 Financial Management Syllabus May-Term 2016 *

Python Machine Learning

Advanced Corporate Coaching Program (ACCP) Sample Schedule

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

(Sub)Gradient Descent

Assignment 1: Predicting Amazon Review Ratings

SYLLABUS- ACCOUNTING 5250: Advanced Auditing (SPRING 2017)

MGT/MGP/MGB 261: Investment Analysis

Ryerson University Sociology SOC 483: Advanced Research and Statistics

Name: Giovanni Liberatore NYUHome Address: Office Hours: by appointment Villa Ulivi Office Extension: 312

CTE Teacher Preparation Class Schedule Career and Technical Education Business and Industry Route Teacher Preparation Program

COURSE LISTING. Courses Listed. Training for Cloud with SAP SuccessFactors in Integration. 23 November 2017 (08:13 GMT) Beginner.

Lecture 1: Machine Learning Basics

*In Ancient Greek: *In English: micro = small macro = large economia = management of the household or family

Introduction to Psychology

Reducing Features to Improve Bug Prediction

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Applications of data mining algorithms to analysis of medical data

SYLLABUS: RURAL SOCIOLOGY 1500 INTRODUCTION TO RURAL SOCIOLOGY SPRING 2017

COURSE WEBSITE:

A Case Study: News Classification Based on Term Frequency

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

MMOG Subscription Business Models: Table of Contents

Statistics and Data Analytics Minor

ACC : Accounting Transaction Processing Systems COURSE SYLLABUS Spring 2011, MW 3:30-4:45 p.m. Bryan 202

Class Mondays & Wednesdays 11:00 am - 12:15 pm Rowe 161. Office Mondays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

Data Structures and Algorithms

MTH 141 Calculus 1 Syllabus Spring 2017

CS/SE 3341 Spring 2012

Driving Author Engagement through IEEE Collabratec

International Environmental Policy Spring :374:315:01 Tuesdays, 10:55 am to 1:55 pm, Blake 131

Universidade do Minho Escola de Engenharia

Multivariate k-nearest Neighbor Regression for Time Series data -

INTRODUCTION TO DECISION ANALYSIS (Economics ) Prof. Klaus Nehring Spring Syllabus

Spring Course Syllabus. Course Number and Title: SPCH 1318 Interpersonal Communication

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler


EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Active Learning. Yingyu Liang Computer Sciences 760 Fall

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Required Materials: The Elements of Design, Third Edition; Poppy Evans & Mark A. Thomas; ISBN GB+ flash/jump drive

FY16 UW-Parkside Institutional IT Plan Report

ECO 3101: Intermediate Microeconomics

Introduction to Information System

Department of Legal Assistant Education THE SOONER DOCKET. Enroll Now for Spring 2018 Courses! American Bar Association Approved

Department of Anthropology ANTH 1027A/001: Introduction to Linguistics Dr. Olga Kharytonava Course Outline Fall 2017

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Education and Training Committee, 19 November Standards of conduct, performance and ethics communications plan

Physics Experimental Physics II: Electricity and Magnetism Prof. Eno Spring 2017

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

CRITICAL THINKING AND WRITING: ENG 200H-D01 - Spring 2017 TR 10:45-12:15 p.m., HH 205

FRESNO COUNTY INTELLIGENT TRANSPORTATION SYSTEMS (ITS) PLAN UPDATE

Health Sciences and Human Services High School FRENCH 1,

Activities, Exercises, Assignments Copyright 2009 Cem Kaner 1

THE GEORGE WASHINGTON UNIVERSITY Department of Economics. ECON 1012: PRINCIPLES OF MACROECONOMICS Prof. Irene R. Foster

Speech Emotion Recognition Using Support Vector Machine

LIN 6520 Syntax 2 T 5-6, Th 6 CBD 234

MGMT 3280: Strategic Management

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Course Development Using OCW Resources: Applying the Inverted Classroom Model in an Electrical Engineering Course

PSYC 2700H-B: INTRODUCTION TO SOCIAL PSYCHOLOGY

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Lecture 1: Basic Concepts of Machine Learning

Department of Accounting ACC Fundamentals of Financial Accounting Fall, 2015 Syllabus

Time series prediction

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Probability and Statistics Curriculum Pacing Guide

Business Administration

ECON 442: Economic Development Course Syllabus Second Semester 2009/2010

MAR Environmental Problems & Solutions. Stony Brook University School of Marine & Atmospheric Sciences (SoMAS)

Rule Learning With Negation: Issues Regarding Effectiveness

Switchboard Language Model Improvement with Conversational Data from Gigaword

Probabilistic Latent Semantic Analysis

Master of Philosophy (MPhil) and Doctor of Philosophy (PhD) Programs in Information Systems

Theory of Probability

COMM 210 Principals of Public Relations Loyola University Department of Communication. Course Syllabus Spring 2016

MGMT 5303 Corporate and Business Strategy Spring 2016

Australian Journal of Basic and Applied Sciences

Mathematics Program Assessment Plan

ACCOUNTING FOR MANAGERS BU-5190-OL Syllabus

Please read this entire syllabus, keep it as reference and is subject to change by the instructor.

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Course Syllabus for Math

FISK. 2016/2018 Undergraduate Bulletin

Transcription:

Data Mining for Business Analytics ISOM 3360 (L1): Spring 2017 Course Name Data Mining for Business Analytics Course Code ISOM 3360 No. of Credit 3 Credits Exclusion(s) COMP 4331 Prerequisite(s) ISOM 2010 Professor Jing Wang, ISOM Contact Office: LSK 4044 Tel: 3469-2125 Email: jwang@ust.hk Begin subject: [ISOM3360] Office Hours By appointment Course Schedule and Classroom Lecture: Tue, Thur 3:00 4:20pm (LSK 1034) Lab 1: Thur 4:30 5:20pm (LSK G005) Lab 2: Fri 3:00 3:50pm (LSK G005) Lab 3: Fri 10:30 11:20am (LSK G005) Course Webpage Accessible from Canvas Teaching Assistant Sophie Gu (LSK 6031) Tel: 2358-7645 Email: imsophie@ust.hk TA Office Hours By appointment 1. Course Overview This course will change the way you think about data and its role in business. Businesses, governments, and individuals create massive collections of data as a byproduct of their activity. Increasingly, decision-makers rely on intelligent technology to analyze data systematically to improve decision-making. In many cases, automating analytical and decision-making processes is necessary because of the volume of data and the speed with which new data are generated. In virtually every industry, data mining has been widely used across various business units such as marketing, finance and management to improve decision making. In this course, we discuss specific scenarios, including the use of data mining to support decisions in customer relationship management (CRM), market segmentation, credit risk management, e-commerce, financial trading and search engine strategies. The course will explain with real-world examples the uses and some technical details of various data mining techniques. The emphasis primarily is on understanding the business application of data mining techniques, and secondarily on the variety of techniques. We will discuss the mechanics of how the methods work only if it is necessary to understand the general concepts and business applications. You will establish analytical thinking to the problems and understand that proper application of technology is as much an art as it is a science. The course is designed for students with various backgrounds -- the class does not require any technical skills or prior knowledge. After taking this course you should:

1. Approach business problems data-analytically (intelligently). Think carefully & systematically about whether & how data can improve business performance. 2. Be able to interact competently on the topic of data mining for business intelligence. Know the basics of data mining processes, techniques, & systems well enough to interact with business analysts, marketers, and managers. Be able to envision data-mining opportunities. 3. Be able to identify the right BI tools/techniques for various business problems. Gain handson experience in using popular BI tools and get ready for the job positions that require familiarities with the BI tools. 2. Lecture Notes and Readings Lecture notes For most classes I will hand out lecture notes, which will outline the primary material for the class. Other readings are intended to supplement the material we learn in class. They give alternative perspectives and additional details about the topics we cover: Supplemental readings Supplemental readings posted to Canvas or distributed in class. Supplemental books (optional): Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management, third Edition, by Michael Berry and Gordon Linoff, Wiley, 2011 ISBN: 0470650931 Data Science for Business: What you need to know about data mining and data-analytic thinking, by Foster Provost, Tom Fawcett, O'Reilly Media, 2013 ISBN: 1449361323 3. Requirements and Grading Your grades will be determined based on class and lab participation, homework assignments, the midterm exam, and the final exam. Component Percentage Class Participation 3% Lab Participation 7% Homework Assignments (3) 30% Midterm Exam 30% Final Exam 30%

4. Important Notes on the Lab Session This is primarily a lecture-based course, but lab participation is an essential part of the learning process in the form of active practice. You are NOT going to learn without practicing the data analysis yourselves. During the lab session, I will expect you to be entirely devoted to the class by following the instructions. And you should actively link the empirical results you obtained during the lab to the concepts you learned in the lectures. During the Lab session, you will gain hands-on experience with the (award-winning) toolkit Weka (http://www.cs.waikato.ac.nz/ml/weka/), and a very popular online BI service from Microsoft. 5. Homework Assignment and Exams There will be a total of 3 individual homework assignments, each comprising questions to be answered and hands-on tasks. Completed assignments must be handed in via Canvas prior to the start of the class on the due date. Assignments will be graded and returned promptly. Turn in your assignment early if there is any uncertainty about your ability to turn it in on the due date. Assignments up to 24 hours late will have their grade reduced by 25%; assignments up to one week late will have their grade reduced by 50%. After one week, late assignments will receive no credit. This course will have two closed-book exams. The midterm exam will test issues covered in the first half of the course. The final exam will cover the classes in the second half of the course. Review sessions will be scheduled to help you prepare for these examinations. The midterm exam is tentatively scheduled on March 21 st (7:00-9:00pm). Let me know as early as possible if there is any unavoidable conflict. The final exam will be held during the final examination period; the date will be announced later in the semester.

Tentative Schedule of Lectures and Labs This schedule is tentative and may be adjusted as the semester progresses. Week Date Topics Due 1 Feb 2 Overview of the Course 2 3 4 Feb 7 Feb 9 Feb 14 Feb 16 Feb 21 Feb 23 Data Mining and Relation to Other Data Analytic Techniques Data Mining Basics Decision Tree Learning Business Application: Predicting Customer Default Overfitting and Model Selection More on Evaluation: Cost-Sensitive Learning 5 Feb 28 Linear Regression Homework 1 Due Mar 2 Logistic Regression 6 Mar 7 Naïve Bayes Classifier Mar 9 Business Application: Financial News Trading 7 Mar 14 Association Rule Learning Business Application: Basket Analysis Mar 16 Midterm Review 8 Mar 21 [No Class] Midterm Exam (7:00-9:00pm) Mar 23 9 Mar 28 Clustering Methods Business Application: Customer Segmentation Mar 30 Nearest Neighbor Classification Homework 2 Due 10 April 4 Ching Ming Festival (No Class) April 6 Business Application: Recommender Systems in E-Commerce 11 April 11 Ensemble Learning April 13 Mid-Term Break (No Class) 12 April 18 Mid-Term Break (No Class) April 20 Search Engine Technology 13 April 25 Search Engine Marketing April 27 Social Network Analysis 14 May 2 TBD [for Synchronization] May 4 Neural Networks and Deep Learning Homework 3 Due 15 May 9 Final Exam Review

Lab Session Schedule Number Date Topics 1 Feb. 9&10 Data visualization (Excel) 2 Feb. 16&17 Weka introduction and Decision tree (Weka) 3 Feb. 23&24 Microsoft Azure introduction & Decision tree (Azure) 4 Mar. 2&3 Cost-sensitive learning 5 Mar. 9&10 Linear Regression and Logistic Regression (Weka & Azure) 6 Mar. 16&17 Naïve Bayes (Weka) Mar. 23&24 Cancelled for Midterm Week 7 Mar. 30&31 Text Mining (Weka) 8 Apr. 6&7 Sentiment Analysis (Azure) 9 Apr. 20&21 Association Rule (Weka) & Clustering (Weka) 10 Apr. 27&28 KNN (Weka) & Collaborative Filtering (Azure) 11 May 4&5 TBC