CS4780/ Machine Learning

Similar documents
CSL465/603 - Machine Learning

Lecture 1: Basic Concepts of Machine Learning

(Sub)Gradient Descent

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Python Machine Learning

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Lecture 1: Machine Learning Basics

CS 100: Principles of Computing

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Probabilistic Latent Semantic Analysis

CS 3516: Computer Networks

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Assignment 1: Predicting Amazon Review Ratings

CS Course Missive

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

MTH 215: Introduction to Linear Algebra

Course Content Concepts

Foothill College Summer 2016

PBHL HEALTH ECONOMICS I COURSE SYLLABUS Winter Quarter Fridays, 11:00 am - 1:50 pm Pearlstein 308

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

IST 440, Section 004: Technology Integration and Problem-Solving Spring 2017 Mon, Wed, & Fri 12:20-1:10pm Room IST 202

CS Machine Learning

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

Human Emotion Recognition From Speech


Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

CS 101 Computer Science I Fall Instructor Muller. Syllabus

Switchboard Language Model Improvement with Conversational Data from Gigaword

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

*In Ancient Greek: *In English: micro = small macro = large economia = management of the household or family

Navigating the PhD Options in CMS

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

BA 130 Introduction to International Business

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

STA2023 Introduction to Statistics (Hybrid) Spring 2013

Course Syllabus for Math

Office Hours: Mon & Fri 10:00-12:00. Course Description

GIS 5049: GIS for Non Majors Department of Environmental Science, Policy and Geography University of South Florida St. Petersburg Spring 2011

MGT/MGP/MGB 261: Investment Analysis

Probability and Game Theory Course Syllabus

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

CS177 Python Programming

CS/SE 3341 Spring 2012

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology

Speech Emotion Recognition Using Support Vector Machine

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Grading Policy/Evaluation: The grades will be counted in the following way: Quizzes 30% Tests 40% Final Exam: 30%

Data Structures and Algorithms

Mktg 315 Marketing Research Spring 2015 Sec. 003 W 6:00-8:45 p.m. MBEB 1110

Syllabus for CHEM 4660 Introduction to Computational Chemistry Spring 2010

Math Techniques of Calculus I Penn State University Summer Session 2017

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

TEACHING ASSISTANT TBD

BUAD 425 Data Analysis for Decision Making Syllabus Fall 2015

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

Natural Language Processing: Interpretation, Reasoning and Machine Learning

CIS Introduction to Digital Forensics 12:30pm--1:50pm, Tuesday/Thursday, SERC 206, Fall 2015

MKTG 611- Marketing Management The Wharton School, University of Pennsylvania Fall 2016

MAE Flight Simulation for Aircraft Safety

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Syllabus Foundations of Finance Summer 2014 FINC-UB

Discriminative Learning of Beam-Search Heuristics for Planning

ASTRONOMY 2801A: Stars, Galaxies & Cosmology : Fall term

AGN 331 Soil Science Lecture & Laboratory Face to Face Version, Spring, 2012 Syllabus

General Physics I Class Syllabus

Syllabus ENGR 190 Introductory Calculus (QR)

Course Guide and Syllabus for Zero Textbook Cost FRN 210

FINN FINANCIAL MANAGEMENT Spring 2014

Artificial Neural Networks written examination

Indian Institute of Technology, Kanpur

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Instructor Dr. Kimberly D. Schurmeier

Psychology 102- Understanding Human Behavior Fall 2011 MWF am 105 Chambliss

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

Carolina Course Evaluation Item Bank Last Revised Fall 2009

Beginning and Intermediate Algebra, by Elayn Martin-Gay, Second Custom Edition for Los Angeles Mission College. ISBN 13:

Food Products Marketing

Welcome to. ECML/PKDD 2004 Community meeting

Texas A&M University - Central Texas PSYK PRINCIPLES OF RESEARCH FOR THE BEHAVIORAL SCIENCES. Professor: Elizabeth K.

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Please read this entire syllabus, keep it as reference and is subject to change by the instructor.

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Reducing Features to Improve Bug Prediction

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Time series prediction

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

SOUTHERN MAINE COMMUNITY COLLEGE South Portland, Maine 04106

Course Policies and Syllabus BUL3130 The Legal, Ethical, and Social Aspects of Business Syllabus Spring A 2017 ONLINE

Transcription:

CS4780/5780 - Machine Learning Fall 2014 Thorsten Joachims Cornell University Department of Computer Science

Outline of Today Who we are? Prof: Thorsten Joachims TAs: Daniel Sedra, Shuhan Wang, Karthik Raman, Tobias Schnabel, Jisun Jung, ++ Consultants: TBD What is learning? Why should a computer be able to learn? Examples of machine learning (ML). What drives research in and use of ML today? Syllabus Administrivia

(One) Definition of Learning Definition [Mitchell]: A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.

Syllabus Instance-Based Learning : k-nearest neighbor, collaborative filtering Decision Trees : TDIDT, attribute selection, pruning and overfitting Linear Rules : Perceptron, logistic regression, linear regression, duality Support Vector Machines : optimal hyperplane, margin, kernels, stability Generative Models : naïve Bayes, linear discriminant analysis Hidden Markov Models : probabilistic model, estimation, Viterbi Structured Output Prediction : predicting sequences, rankings, etc. Statistical Learning Theory : PAC learning, VC dimension, error bounds Online Learning : experts, bandits, online mistake bounds Clustering : HAC Clustering, k-means, mixture of Gaussians Recommendation: similarity-based methods, matrix factorization, etc. ML Experimentation: hypothesis tests, cross validation, resampling

Textbook and Course Material Main Textbooks Tom Mitchell, "Machine Learning", McGraw Hill, 1997. CS4780 Course Pack from Campus Store Additional References (optional) Kevin Murphy, Machine Learning a Probabilistic Perspective, MIT Press, 2012. See other references on course web page Course Notes Writing on blackboard Slides available on course homepage Video of lecture available from last year

Pre-Requisites and Related Courses Pre-Requisites Programming skills (e.g. CS 2110) Basic linear algebra (e.g. MATH 2940) Basic probability theory (e.g. CS 2800) Short exam to test prereqs (via CMS) Related Courses CS4700: Foundations of Artificial Intelligence CS4758: Robot Learning CS4300: Information Retrieval CS4740: Natural Language Processing CS6780: Advanced Machine Learning CS6784: Advanced Topics in Machine Learning CS6740: Advanced Language Technologies CS6782: Probabilistic Graphical Models

Homework Assignments Assignments 5 homework assignments Some problem sets, some programming and experiments Policies Assignments are due at the beginning of class on the due date in hardcopy. Code must be submitted via CMS by the same deadline. Assignments turned in late will be charged a 1 percentage point reduction of the cumulated final homework grade for each period of 24 hours for which the assignment is late. Everybody has 5 free late days. Use them wisely. No assignments will be accepted after the solutions have been made available (typically 3-5 days after deadline). Typically collaboration of two students (see each assignment for detailed collaboration policy). We run automatic cheating detection. Must state all sources of material used in assignments or project. Please review Cornell Academic Integrity Policy!

Exams and Quizzes In-class Quizzes A few per semester No longer than 5 minutes Exams Two Prelim exams October 16 (week of fall break) November 25 (week of thanksgiving break) In class No final exam

Final Project Organization Self-defined topic related to your interests and research Groups of 3-4 students Each group has TA as advisor Deliverables Project proposal (week after fall break) Meetings with TA to discuss progress Poster presentation (last week of classes) Project report (December 10) Peer review (December 15)

Grading Deliverables 2 Prelim Exams (50% of Grade) Final Project (15% of Grade) Homeworks (~5 assignments) (25% of Grade) Quizzes (in class) (5% of Grade) PreReq Exam (2% of Grade) Participation (3% of Grade) Outlier elimination For homeworks and quizzes, the lowest grade is replaced by the second lowest grade.

How to Get in Touch Online Course Homepage (slides, video, references, policies, office hours) http://www.cs.cornell.edu/courses/cs4780/2014fa/ Piazza forum (questions and comments) CMS (homeworks and grades) Email Addresses Thorsten Joachims: tj@cs.cornell.edu Tobias Schnabel: tbs49@cornell.edu [homework and solutions] Karthik Raman: kr339@cornell.edu [projects] Daniel Sedra: dms422@cornell.edu [office hours, piazza, video] Shuhan Wang: sw788@cornell.edu [late submissions, regrades, CMS] Office Hours Thorsten Joachims: Thursdays 2:40pm 4:00pm, 418 Gates Hall Other office hours: See course homepage