APA - Machine Learning

Similar documents
Python Machine Learning

Lecture 1: Machine Learning Basics

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

(Sub)Gradient Descent

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Lecture 10: Reinforcement Learning

Artificial Neural Networks written examination

CSL465/603 - Machine Learning

Lecture 1: Basic Concepts of Machine Learning

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

CS Machine Learning

STA 225: Introductory Statistics (CT)

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Time series prediction

Probabilistic Latent Semantic Analysis

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Axiom 2013 Team Description Paper

MGT/MGP/MGB 261: Investment Analysis

Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

Learning From the Past with Experiment Databases

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Learning Methods for Fuzzy Systems

Generative models and adversarial training

PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Welcome to. ECML/PKDD 2004 Community meeting

SSE - Supervision of Electrical Systems

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Strategy and Design of ICT Services

Setting the Scene: ECVET and ECTS the two transfer (and accumulation) systems for education and training

Discriminative Learning of Beam-Search Heuristics for Planning

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Name of the PhD Program: Urbanism. Academic degree granted/qualification: PhD in Urbanism. Program supervisors: Joseph Salukvadze - Professor

Semi-Supervised Face Detection

Probability and Statistics Curriculum Pacing Guide

An OO Framework for building Intelligence and Learning properties in Software Agents

Reducing Features to Improve Bug Prediction

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Assignment 1: Predicting Amazon Review Ratings

Softprop: Softmax Neural Network Backpropagation Learning

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

TEACHING AND EXAMINATION REGULATIONS PART B: programme-specific section MASTER S PROGRAMME IN LOGIC

Mathematics Program Assessment Plan

CS/SE 3341 Spring 2012

The Boosting Approach to Machine Learning An Overview

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Computational Data Analysis Techniques In Economics And Finance

EGRHS Course Fair. Science & Math AP & IB Courses

Multivariate k-nearest Neighbor Regression for Time Series data -

Evolutive Neural Net Fuzzy Filtering: Basic Description

Mathematics. Mathematics

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Extracting Opinion Expressions and Their Polarities Exploration of Pipelines and Joint Models

Extending Place Value with Whole Numbers to 1,000,000

Introduction to Financial Accounting

INTERMEDIATE ALGEBRA PRODUCT GUIDE

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

OFFICE SUPPORT SPECIALIST Technical Diploma

A. What is research? B. Types of research

Measurement. When Smaller Is Better. Activity:

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

B.S/M.A in Mathematics

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

arxiv: v2 [cs.cv] 30 Mar 2017

Australian Journal of Basic and Applied Sciences

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Self Study Report Computer Science

FIN 448 Fundamental Financial Analysis

Foreign Languages. Foreign Languages, General

USF Course Change Proposal Global Citizens Project

A Case Study: News Classification Based on Term Frequency

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany

MTH 141 Calculus 1 Syllabus Spring 2017

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

AC : TEACHING COLLEGE PHYSICS

A survey of multi-view machine learning

ENVR 205 Engineering Tools for Environmental Problem Solving Spring 2017

arxiv: v1 [cs.lg] 15 Jun 2015

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

Human Emotion Recognition From Speech

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Speech Emotion Recognition Using Support Vector Machine

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

Guide to the Uniform mark scale (UMS) Uniform marks in A-level and GCSE exams

Statistics and Data Analytics Minor

Syllabus ENGR 190 Introductory Calculus (QR)

Probabilistic principles in unsupervised learning of visual structure: human data and a model

Truth Inference in Crowdsourcing: Is the Problem Solved?

Transcription:

Coordinating unit: 270 - FIB - Barcelona School of Informatics Teaching unit: 723 - CS - Department of Computer Science Academic year: Degree: 2018 BACHELOR'S DEGREE IN INFORMATICS ENGINEERING (Syllabus 2010). (Teaching unit Optional) ECTS credits: 6 Teaching languages: Catalan Teaching staff Coordinator: Others: - Luis Antonio Belanche Muñoz (belanche@cs.upc.edu) - Javier Béjar Alonso (bejar@cs.upc.edu) Prior skills Elementary notions of probability and statistics. Elementary linear algebra and real analysis Good programming skills in a high-level language Requirements - Prerequisite PE - Corequisite PROP Teaching methodology The topics exposed in the main classes are very well motivated and motivating (why is it important to know), and supplemented with many examples. The theory lessons introduce all the knowledge, techniques, concepts and results necessary to achieve a well-grounded body of knowledge. These concepts are put into practice in the problem sessions, and in the laboratory. Prior to each problem-solving session, the teacher proposes problems related to the current topic and the students have time to prepare them using their AA hours of personal work. In class, the students will be divided among several small groups depending on their number. The teacher provides guidance and eventually resolves questions, giving feedback and making the students progress in the resolution. The use of collaborative learning strategies is envisaged, in which one or more students take responsibility to lead the process. Students work again in these problems in their AA time and then deliver them. These deliveries are continued in time, have a uniform load and are evaluated. This strategy is also used to evaluate their skills for effective communication. The theory lessons are weekly (two hours). The two hours of laboratory classes are twice a month. The problem sessions are weekly. There is a deliverable practical work that solves a real problem (to be chosen by the student) that integrates all acquired knowledge and skills of the course. This practical work is also used to evaluate their skills for effective communication. Learning objectives of the subject 1.Formulate the problem of machine learning from data, and know the different machine learning tasks 2.Organize the flow for solving a machine learning problem, analyzing the possible options and choosing the most appropriate to the problem 3.Decide, defend and criticize a solution to a machine learning problem, arguing the strengths and weaknesses of the 1 / 10

approach 4.To compare, judge and interpret a set of results after making a hypothesis about a machine learning problem 5.Understand and know how to apply least squares techniques for solving supervised learning problems 6.Understand and know how to apply techniques for single and multilayer neural networks for solving supervised learning problems 7.Understand and know how to apply support vector machines for solving supervised learning problems 8.Understand and formulate different theoretical tools for the analysis, study and description of machine learning systems 9.Understand and know how to apply the basic techniques for solving unsupervised learning problems 10.Understand and know how to apply basic techniques for solving reinforcement learning problems 11.Understand the most important modern machine learning and computational learning techniques Study load Total learning time: 150h Theory classes: 30h 20.00% Practical classes: 15h 10.00% Laboratory classes: 15h 10.00% Guided activities: 6h 4.00% Self study: 84h 56.00% 2 / 10

Content Introduction to Machine Learning General information and basic concepts. Overview and approach to the problems tackled by machine learning techniques. Supervised (classification and regression), unsupervised (clustering) and semi-supervised (reinforcement and transductive) learning. Examples of modern applications. Unsupervised machine learning Definition and approaches for unsupervised machine learning. Clustering algorithms: EM algorithm and k-means algorithm. Supervised Machine Learning Theory The supervised Machine Learning problem setup. Classification and regression problems. Bias-variance tradeoff. Overfitting and underfitting. Generalization bounds. Vapnik-Chervonenkis dimension and complexity of a model. Model selection and feature selection. Supervised machine learning (I): classification problems The perceptron algorithm. Novikoff's theorem. Separations with maximum margin. Kernel functions. Support vector machines for classification. Error functions and classification by neural networks: multilayer perceptron and radial basis functions network. Supervised machine learning (II): regression problems Least squares: analytical and iterative methods. Error functions for regression and multilayer neural networks: multilayer perceptron and radial basis functions network. Support vector machines for regression. Supervised Machine Learning (III): ensembles of models 3 / 10

Bagging, boosting and ECOC. The bias/variance tradeoff revisited. Reinforcement learning Description of reinforcement learning. Markov processes. Bellman equations. Values and Temporal Difference methods. Q-learning and the Sarsa algorithm. Applications. Note on transductive learning. 4 / 10

Planning of activities Development of item 1 of the course Hours: 10h Theory classes: 2h Guided activities: 0h Self study: 6h The student sees an overview and basic concepts of machine learning as well as modern examples of application. 1, 11 Development of item 2 of the course Hours: 8h Theory classes: 2h Guided activities: 0h The teacher explains the theory of unsupervised machine learning, focusing on clustering algorithms. 1, 2, 9 Resolution of the problems of item 2 of the course Hours: 10h Guided activities: 2h Self study: 8h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 1, 3, 8 Development of item 3 of the course Hours: 12h Theory classes: 6h Guided activities: 0h 5 / 10

The teacher exposes the problem of supervised Machine Learning. Explains the differences between regression and classification problems, and the notions of bias-variance tradeoff, over and underfitting as well as other theoretical tools for models selection. 1, 8 Resolution of the problems of item 3 of the course Hours: 13h Guided activities: 3h Self study: 10h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 3, 4, 5, 6 Development of item 4 of the course Hours: 12h Theory classes: 6h Guided activities: 0h The teacher explains the basics of algorithms for separating hyperplanes: the Perceptron algorithm and maximum margin separation. The kernel functions and support vector machines for classification are introduced. Finally two neural networks for classification are covered: the multilayer perceptron and the radial basis function network. 1, 6, 7, 11 Resolution of the problems of item 4 of the course Hours: 13h Guided activities: 3h Self study: 10h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 6 / 10

2, 3, 4, 6 Development of item 5 of the course Hours: 13h Theory classes: 6h Guided activities: 1h The teacher introduces the methods for regression problems: basically least squares (analytical and iterative methods). Error functions for the regression case are introduced. Multilayer neural networks are adapted to the regression case. The support vector machine for regression is introduced. 1, 5, 6, 7 Resolution of the problems of item 5 of the course Hours: 13h Guided activities: 3h Self study: 10h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 3, 4, 7 Development of item 6 of the course Hours: 11h Theory classes: 4h Guided activities: 1h The teacher introduces the basic techniques for ensemble learning: bagging, boosting and ECOC, and explains them in light of the bias / variance tradeoff. 2, 8, 11 7 / 10

Resolution of the problems of item 6 of the course Hours: 10h Guided activities: 2h Self study: 8h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 2, 3, 4, 9, 10 Development of item 7 of the subject Hours: 11h Theory classes: 4h Guided activities: 1h The teacher explains the basics of reinforcement learning and its applications and briefly introduces transductive learning. 10, 11 Resolution of the problems of item 7 of the subject Hours: 10h Guided activities: 2h Self study: 8h The teacher poses up to 3 problems within the current topic and the students prepare them at home. In the class, the teacher solves difficulties, gives guidance towards full resolution of the problem and eventually answers general questions about the topic. The students work again on these problems and deliver them. 3, 4, 11 Final exam Hours: 3h Guided activities: 3h Self study: 0h 5, 6, 7, 8, 9, 10 8 / 10

Delivery of the practical work Hours: 0h Guided activities: 0h Self study: 0h 2, 3, 4, 5, 6, 7, 8, 9, 10 Qualification system The course can be passed with continuous assessment, as follows: NProbs = Average mark of problems completed during the course NPract = Mark for the practical work NPart = Mark used to evaluate skills for effective communication NF1 = 50% NProbs + 40% NPract + 10% NPart The course can also be passed with a final exam, as follows: NExF = Mark obtained in a final exam (during the exams period) NF2 = 40% NExF + 20% NProbs + 30% NPract + 10% NPart In any case, the final mark is the highest of the two: FINAL MARK = max (NF1, NF2) 9 / 10

Bibliography Basic: Alpaydin, E. Introduction to machine learning. 3rd ed. MIT Press, 2014. ISBN 9780262028189. Bishop, C.M. Pattern recognition and machine learning. Springer, 2006. ISBN 0387310738. Cherkassky, V.; Mulier, F. Learning from data: concepts, theory, and methods [on line]. 2nd ed. John Wiley, 2007Available on: <http://site.ebrary.com/lib/upcatalunya/docdetail.action?docid=10296149&p00=learning%20data%20concepts>. ISBN 0471681822. Haykin, S.S. Neural networks and learning machines. 3rd ed. Prentice Hall, 2009. ISBN 0131471392. Murphy, K. P. Machine learning : a probabilistic perspective. MIT Press, cop. 2012. ISBN 9780262018029. Complementary: Hastie, T.; Tibshirani, R.; Friedman, J. The elements of statistical learning: data mining, inference, and prediction. 2nd ed. Springer, 2009. ISBN 0387848576. Duda, R.O.; Hart, P.E.; Stork, D.G. Pattern classification. 2nd ed. John Wiley & Sons, 2001. ISBN 0471056693. Bishop, C.M. Neural networks for pattern recognition. Clarendon Press, 1995. ISBN 0198538642. Others resources: Hyperlink http://videolectures.net/top/computer_science/machine_learning/ http://cran.r-project.org/ http://www.academicearth.org/courses/machine-learning 10 / 10