Lecture 1 - Introduction. Machine Learning and Data Mining

Similar documents
Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Seminar - Organic Computing

Laboratorio di Intelligenza Artificiale e Robotica

An Introduction to Simio for Beginners

Laboratorio di Intelligenza Artificiale e Robotica

Probabilistic Latent Semantic Analysis

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Word Segmentation of Off-line Handwritten Documents

Lecture 1: Machine Learning Basics

Lecture 1: Basic Concepts of Machine Learning

On-Line Data Analytics

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Python Machine Learning

CSL465/603 - Machine Learning

Unit purpose and aim. Level: 3 Sub-level: Unit 315 Credit value: 6 Guided learning hours: 50

K 1 2 K 1 2. Iron Mountain Public Schools Standards (modified METS) Checklist by Grade Level Page 1 of 11

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

MGT/MGP/MGB 261: Investment Analysis

Speech Recognition at ICSI: Broadcast News and beyond

Software Maintenance

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Linking Task: Identifying authors and book titles in verbose queries

MINISTRY OF EDUCATION

GACE Computer Science Assessment Test at a Glance

Reinforcement Learning by Comparing Immediate Reward

A Case Study: News Classification Based on Term Frequency

Axiom 2013 Team Description Paper

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Emergency Management Games and Test Case Utility:

Circuit Simulators: A Revolutionary E-Learning Platform

MYCIN. The MYCIN Task

An OO Framework for building Intelligence and Learning properties in Software Agents

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

A Reinforcement Learning Variant for Control Scheduling

Universal Design for Learning Lesson Plan

Statistics and Data Analytics Minor

Data Fusion Models in WSNs: Comparison and Analysis

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Using dialogue context to improve parsing performance in dialogue systems

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

UCEAS: User-centred Evaluations of Adaptive Systems

A Case-Based Approach To Imitation Learning in Robotic Agents

Australian Journal of Basic and Applied Sciences

Top US Tech Talent for the Top China Tech Company

BPS Information and Digital Literacy Goals

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

An investigation of imitation learning algorithms for structured prediction

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Georgetown University at TREC 2017 Dynamic Domain Track

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Evolutive Neural Net Fuzzy Filtering: Basic Description

Medical Complexity: A Pragmatic Theory

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

Lecture 10: Reinforcement Learning

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Rule Learning With Negation: Issues Regarding Effectiveness

Machine Learning and Development Policy

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

5.7 Course Descriptions

Stages of Literacy Ros Lugg

CS Machine Learning

Taught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,

A study of speaker adaptation for DNN-based speech synthesis

prehending general textbooks, but are unable to compensate these problems on the micro level in comprehending mathematical texts.

While you are waiting... socrative.com, room number SIMLANG2016

Learning Methods for Fuzzy Systems

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning

AQUA: An Ontology-Driven Question Answering System

Predicting Outcomes Based on Hierarchical Regression

SOFTWARE EVALUATION TOOL

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Learning Methods in Multilingual Speech Recognition

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Managing Printing Services

MASTER S COURSES FASHION START-UP

Knowledge-Based - Systems

Emergent Narrative As A Novel Framework For Massively Collaborative Authoring

Welcome to. ECML/PKDD 2004 Community meeting

DOCTOR OF PHILOSOPHY HANDBOOK

Introduction to Simulation

Comparison of network inference packages and methods for multiple networks inference

Identification of Opinion Leaders Using Text Mining Technique in Virtual Community

Utilizing FREE Internet Resources to Flip Your Classroom. Presenter: Shannon J. Holden

The Strong Minimalist Thesis and Bounded Optimality

Rendezvous with Comet Halley Next Generation of Science Standards

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

The taming of the data:

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

MULTIMEDIA Motion Graphics for Multimedia

Universidade do Minho Escola de Engenharia

Using computational modeling in language acquisition research

Soft Computing based Learning for Cognitive Radio

Transcription:

CPSC-340: Machine Learning and Data Mining 1. CPSC-340: Machine Learning and Data Mining 2 Lecture 1 - Introduction OBJECTIVE: Understand the several ways in which the machine learning and data mining problems arise in practice. Machine Learning and Data Mining Nando de Freitas September 3, 2005 MACHINE LEARNING AND DATA MINING Machine Learning and Data Mining are the processes of deriving abstractions of the real world from a set of observations. Data mining focuses on databases. The resulting abstractions (models) are useful for 1. Making decisions under uncertainty. 2. Predicting future events. 3. Classifying massive quantities of data quickly. 4. Finding patterns (clusters, hierarchies, abnormalities, associations) in the data. 5. Developing autonomous agents (robots, game agents and other programs).

CPSC-340: Machine Learning and Data Mining 3 MACHINE LEARNING AND OTHER FIELDS Machine learning is closely related to many disciplines of human endeavor. For example: Information Theory : Compression: Models are compressed versions of the real world. Complexity: Suppose we want to transmit a message over a communication channel CPSC-340: Machine Learning and Data Mining 4 future. That is, they generalise well. Probability Theory : Modelling noise. Dealing with uncertainty: occlusion, missing data, synonymy and polisemy, unknown inputs. Sender data Channel data Receiver To gain more efficiency, we can compress the data and send both the compressed data and the model to decompress the data. Sender data comp. data Encoder model Channel comp. data Decoder data model Receiver There is a fundamental tradeoff between the amount of compression and the cost of transmitting the model. More complex models allow for more compression, but are expensive to transmit. Learners that balance these two costs tend to perform better in the

CPSC-340: Machine Learning and Data Mining 5 Statistics : Data Analysis and Visualisation: gathering, display and summary of data. Inference: drawing statistical conclusions from specific data. Computer Science : Theory. Database technology. Software engineering. Hardware. Optimisation : Searching for optimal parameters and models in constrained and unconstrained settings is ubiquitous in machine learning. CPSC-340: Machine Learning and Data Mining 6 question of fundamental importance to human beings. At the onset of Western philosophy, Plato and Aristotle distinguished between essential and accidental properties of things. The Zen patriarch, Bodhidharma also tried to get to the essence of things by asking what is that? in a serious sense, of course. Other Branches of Science : Game theory. Econometrics. Cognitive science. Engineering. Psychology. Biology. Philosophy : The study of the nature of knowledge (epistemology) is central to machine learning. Understanding the learning process and the resulting abstractions is a

CPSC-340: Machine Learning and Data Mining 7 APPLICATION AREAS Machine learning and data mining play an important role in the following fields: CPSC-340: Machine Learning and Data Mining 8 Computer Vision : Handwritten digit recognition (Le Cun), tracking, segmentation, object recognition. Software : Teaching the computer instead of programming it. Bioinformatics : Sequence alignment, DNA micro-arrays, drug design, novelty detection. Patients Genes

CPSC-340: Machine Learning and Data Mining 9 Robotics : State estimation, control, localisation and map building. CPSC-340: Machine Learning and Data Mining 10 Computer Graphics : Automatic motion generation, realistic simulation. E.g., style machines by Brand and Hertzmann: Electronic Commerce : Data mining, collaborative filtering, recommender systems, spam.

CPSC-340: Machine Learning and Data Mining 11 Computer Games : Intelligent agents and realistic games. CPSC-340: Machine Learning and Data Mining 12 Financial Analysis : Options and derivatives, forex, portfolio allocation. Medical Sciences : Epidemiology, diagnosis, prognosis, drug design. Speech : Recognition, speaker identification. Multimedia : Sound, video, text and image databases; multimedia translation, browsing, information retrieval (search engines).

CPSC-340: Machine Learning and Data Mining 13 TYPES OF LEARNING Supervised Learning We are given input-output training data {x 1:N,y 1:N }, where x 1:N (x 1,x 2,...,x N ). That is, we have a teacher that tell us the outcome y for each input x. Learning involves adapting the model so that its predictions ŷ are close to y. To achieve this we need to introduce a lossfunction that tells us how close ŷ is to y. Where does the loss function come from? x M odel ŷ After learning the model, we can apply it to novel inputs and study its response. If the predictions are accurate we have reason to believe the model is correct. We can exploit this during training by splitting the dataset into a training set and a test set. We learn the model with the training set and validate it with the test set. This is an example of a model selection technique knowns as cross-validation. CPSC-340: Machine Learning and Data Mining 14 What are the advantages and disadvantages of this technique? In the literature, inputs are also known as predictors, explanatory variables or covariates, while outputs are often referred to as responses or variates. Unsupervised Learning Here, there is not teacher. The learner must identify structures and patterns in the data. Many times, there is no single correct answer. Examples of this include image segmentation and data clustering. Semi-supervised Learning It s a mix of supervised and unsupervised learning. Reinforcement Learning Here, the learner is given a reward for an action performed in a particular environment. Human cognitive tasks as well

CPSC-340: Machine Learning and Data Mining 15 as simple motor tasks like balancing while walking seem to make use of this learning paradigm. RL, therefore, is likely to play an important role in graphics and computer games in the future. Active Learning World data P assive Learner M odel World data query Active Learner Model Active learners query the environment. Queries include questions and requests to carry out experiments. As an analogy, I like to think of good students as active learners! But, how do we select queries optimally? That is, what questions should we ask? What is the price of asking a question? Active learning plays an important role when establishing causal relationships.