CIS 519/419 Applied Machine Learning

Similar documents
(Sub)Gradient Descent

CS 446: Machine Learning

Python Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lecture 1: Basic Concepts of Machine Learning

CSL465/603 - Machine Learning

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Lecture 1: Machine Learning Basics

CIS Introduction to Digital Forensics 12:30pm--1:50pm, Tuesday/Thursday, SERC 206, Fall 2015

Welcome to. ECML/PKDD 2004 Community meeting

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

CS Machine Learning

Medium Term Plan English Year

Laboratorio di Intelligenza Artificiale e Robotica

CS 3516: Computer Networks

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

CS 101 Computer Science I Fall Instructor Muller. Syllabus

CS 100: Principles of Computing

Grade Band: High School Unit 1 Unit Target: Government Unit Topic: The Constitution and Me. What Is the Constitution? The United States Government

been each get other TASK #1 Fry Words TASK #2 Fry Words Write the following words in ABC order: Write the following words in ABC order:


Syllabus ENGR 190 Introductory Calculus (QR)

Data Structures and Algorithms

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

CS Course Missive

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

Laboratorio di Intelligenza Artificiale e Robotica

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Semi-Supervised Face Detection

ASTRONOMY 2801A: Stars, Galaxies & Cosmology : Fall term

ECON 442: Economic Development Course Syllabus Second Semester 2009/2010

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Artificial Neural Networks written examination

KOMAR UNIVERSITY OF SCIENCE AND TECHNOLOGY (KUST)

Australian Journal of Basic and Applied Sciences

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Syllabus Foundations of Finance Summer 2014 FINC-UB

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Instructor. Darlene Diaz. Office SCC-SC-124. Phone (714) Course Information

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Beginning and Intermediate Algebra, by Elayn Martin-Gay, Second Custom Edition for Los Angeles Mission College. ISBN 13:

Rule Learning With Negation: Issues Regarding Effectiveness

MATH 108 Intermediate Algebra (online) 4 Credits Fall 2008

Topic: Making A Colorado Brochure Grade : 4 to adult An integrated lesson plan covering three sessions of approximately 50 minutes each.

Computer Science 141: Computing Hardware Course Information Fall 2012

Physics 270: Experimental Physics

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

BIOS 104 Biology for Non-Science Majors Spring 2016 CRN Course Syllabus

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

The Entrepreneurial Mindset Syllabus

Axiom 2013 Team Description Paper

CS177 Python Programming

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

Human Emotion Recognition From Speech

Abstractions and the Brain

Answer the following questions in complete sentences on a separate sheet of paper:

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Computerized Adaptive Psychological Testing A Personalisation Perspective

Henry Tirri* Petri Myllymgki

ASTR 102: Introduction to Astronomy: Stars, Galaxies, and Cosmology

CIS 2 Computers and the Internet in Society -

Content-free collaborative learning modeling using data mining

MTH 215: Introduction to Linear Algebra

Knowledge-Based - Systems

INDES 350 HISTORY OF INTERIORS AND FURNITURE WINTER 2017

Instructor Experience and Qualifications Professor of Business at NDNU; Over twenty-five years of experience in teaching undergraduate students.

Interactive Whiteboard

Function Tables With The Magic Function Machine

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

A Case Study: News Classification Based on Term Frequency

Probabilistic Latent Semantic Analysis

The Heart of Philosophy, Jacob Needleman, ISBN#: LTCC Bookstore:

Firms and Markets Saturdays Summer I 2014

Instructor Dr. Kimberly D. Schurmeier

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Course Syllabus for Math

BCMA Instructional Agenda January 18-22, 2016

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

Cleveland State University Introduction to University Life Course Syllabus Fall ASC 101 Section:

Class Tuesdays & Thursdays 12:30-1:45 pm Friday 107. Office Tuesdays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

Unpacking a Standard: Making Dinner with Student Differences in Mind

CNS 18 21th Communications and Networking Simulation Symposium

Exposé for a Master s Thesis

Grade 6: Module 2A Unit 2: Overview

A survey of multi-view machine learning

CS224d Deep Learning for Natural Language Processing. Richard Socher, PhD

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Transcription:

CIS 519/419 Applied Machine Learning www.seas.upenn.edu/~cis519 Dan Roth danroth@seas.upenn.edu http://www.cis.upenn.edu/~danroth/ 461C, 3401 Walnut Slides were created by Dan Roth (for CIS519/419 at Penn or CS446 at UIUC), Eric Eaton for CIS519/419 at Penn, or from other authors who have made their ML slides available. 1

CIS(4,5)19: Applied Machine Learning Tuesday, Thursday: 1:30pm-3:00pm 101 Levine Office hours: Tue/Thur 4:30-5:30 pm [my office] 9 TAs Assignments: 5 Problems set (Python Programming) Weekly (light) on-line quizzes Weekly Discussion Sessions Mid Term Exam [Project] Final No real textbook: Registration to Class Go to the web site Be on Piazza Mitchell/Flach/Other Books/ Lecture notes /Literature 2

CIS519: Today What is Learning? Who are you? What is CIS519 about? The Badges Game 3

An Owed to the Spelling Checker I have a spelling checker, it came with my PC It plane lee marks four my revue Miss steaks aye can knot sea. Eye ran this poem threw it, your sure reel glad two no. Its vary polished in it's weigh My checker tolled me sew. A checker is a bless sing, it freeze yew lodes of thyme. It helps me right awl stiles two reed And aides me when aye rime. Each frays come posed up on my screen Eye trussed to bee a joule... 4

Machine learning is everywhere 5

Applications: Spam Detection This is a binary classification task: Assign one of two labels (i.e. yes/no) to the input (here, an email message) Classification requires a model (a classifier) to determine which label to assign to items. In this class, we study algorithms and techniques to learn n Sentences Positive, Negative such models from data. Documents Labels n Documents Politics, Sports, Finance n Phrases Person, Location n Images cats,dogs, snakes n n Medical records Admit again soon/not..? 6

Comprehension (ENGLAND, June, 1989) - Christopher Robin is alive and well. He lives in England. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book. He made up a fairy tale land where Chris lived. His friends were animals. There was a bear called Winnie the Pooh. There was also an owl and a young pig, called a piglet. All the animals were stuffed toys that Chris owned. Mr. Robin made them come to life with his words. The places in the story were all near Cotchfield Farm. Winnie the Pooh was written in 1925. Children still love to read about Christopher Robin and his animal friends. Most people don't know he is a real person who is grown now. He has written two books of his own. They tell what it is like to be famous. 1. Christopher Robin was born in England. 2. Winnie the Pooh is a title of a book. 3. Christopher Robin s dad was a magician. 4. Christopher Robin must be at least 65 now. This is an Inference Problem; where is the learning? Page 7

8

Learning Learning is at the core of Understanding High Level Cognition Performing knowledge intensive inferences Building adaptive, intelligent systems Dealing with messy, real world data Analytics Learning has multiple purposes Knowledge Acquisition Integration of various knowledge sources to ensure robust behavior Adaptation (human, systems) Decision Making (Predictions) 9

Learning = Generalization H. Simon - Learning denotes changes in the system that are adaptive in the sense that they enable the system to do the task or tasks drawn from the same population more efficiently and more effectively the next time. The ability to perform a task in a situation which has never been encountered before 10

Learning = Generalization Mail thinks this message is junk mail. The learner has to be able to classify items it has never seen before. Not junk 11

Learning = Generalization Classification Medical diagnosis; credit card applications; hand-written letters; ad selection; sentiment assignment, Planning and acting Game playing (chess, backgammon, go); driving a car Skills (A robot) balancing a pole; playing tennis Common sense reasoning Natural language interactions The ability to perform a task in a situation which has never been encountered before What does the algorithm get as input? (features) Generalization depends on the Representation as much as it depends on the Algorithm used. 12

Same Population? New Zeeland In New York State, the longest period of daylight occurs during the month of. 13

Why Study Machine Learning? A breakthrough in machine learning would be worth ten Microsofts -Bill Gates, Chairman, Microsoft Machine learning is the next Internet -Tony Tether, Former Director, DARPA Machine learning is the hot new thing -John Hennessy, President, Stanford Web rankings today are mostly a matter of machine learning -Prabhakar Raghavan, Dir. Research, Yahoo Machine learning is going to result in a real revolution -Greg Papadopoulos, CTO, Sun Machine learning is today s discontinuity -Jerry Yang, CEO, Yahoo 14

Why Study Learning? Computer systems with new capabilities. Understand human and biological learning Understanding teaching better. Time is right. Initial algorithms and theory in place. Growing amounts of on-line data Computational power available. Necessity: many things we want to do cannot be done by programming. (Think about all the examples given earlier) 15

Learning is the future Learning techniques will be a basis for every application that involves a connection to the messy real world Basic learning algorithms are ready for use in applications today Prospects for broader future applications make for exciting fundamental research and development opportunities Many unresolved issues Theory and Systems While it s hot, there are many things we don t know how to do 16

Work in Machine Learning Artificial Intelligence; Theory; Experimental CS Makes Use of: Probability and Statistics; Linear Algebra; Theory of Computation; Related to: Philosophy, Psychology (cognitive, developmental), Neurobiology, Linguistics, Vision, Robotics,. Has applications in: AI (Natural Language; Vision; Planning; Robotics; HCI) q Very active field Engineering (Agriculture; Civil; ) And: what we Computer Science (Compilers; Architecture; Systems; data bases) don t know q The fundamental real world paradigms q Some From of the Internet most companies important to algorithmic Finance, Legal, ideas Retail,. q What to teach? q Modeling 17

Course Overview Introduction: Basic problems and questions A detailed example: Linear classifiers; key algorithmic idea Two Basic Paradigms: Discriminative Learning & Generative/Probablistic Learning Learning Protocols: Supervised; Unsupervised; Semi-supervised Algorithms Gradient Descent Decision Trees Linear Representations: (Perceptron; SVMs; Kernels) Neural Networks/Deep Learning Probabilistic Representations (naïve Bayes) Unsupervised /Semi supervised: EM Clustering; Dimensionality Reduction Modeling; Evaluation; Real world challenges Ethics 18

CIS(4,5)19: Applied Machine Learning Tuesday, Thursday: 1:30pm-3:00pm 101 Levine Office hours: Tue/Thur 4:30-5:30 pm [my office] 9 TAs Assignments: 5 Problems set (Python Programming) Weekly (light) on-line quizzes Weekly Discussion Sessions Mid Term Exam [Project] Final No real textbook: Registration to Class Go to the web site Be on Piazza Mitchell/Flach/Other Books/ Lecture notes /Literature 19

CIS519: Machine Learning What do you need to know: Some exposure to: Theory of Computation Probability Theory Linear Algebra Programming (Python) Participate, Ask Questions Homework 0 20

Cheating No. We take it very seriously. Homework: CIS 519: Policies Collaboration is encouraged But, you have to write your own solution/code. Late Policy: You have a credit of 4 days (4*24hours); That s it. Grading: Possible separate for 419/519. 40% - homework; ; 20%-final; 15%-midterm; 5% Quizzes [Projects: 20%] Questions? Class Web Page Note also the Schedule Page and our Notes 21

CIS519 on the web Check our class website: Schedule, slides, videos, policies http://www.seas.upenn.edu/~cis519/spring2018/ Sign up, participate in our Piazza forum: Announcements and discussions http://piazza.com/upenn/spring2018/cis419519 Check out our team Office hours [Optional] Discussion Sessions Scribing the Class [Good writers; Latex]? 22

What is Learning The Badges Game This is an example of the key learning protocol: supervised learning First question: Are you sure you got it? Why? Issues: Prediction or Modeling? Representation Problem setting Background Knowledge When did learning take place? Algorithm 23