Unsupervised Learning (Examples)

Similar documents
Version Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18

(Sub)Gradient Descent

CS Machine Learning

Evaluating Interactive Visualization of Multidimensional Data Projection with Feature Transformation

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Python Machine Learning

Generative models and adversarial training

Evolution of Symbolisation in Chimpanzees and Neural Nets

Learning Methods in Multilingual Speech Recognition

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Word Segmentation of Off-line Handwritten Documents

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Rule Learning With Negation: Issues Regarding Effectiveness

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Lecture 1: Machine Learning Basics

Rule Learning with Negation: Issues Regarding Effectiveness

SCICU Legislative Strategic Plan 2018

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Community Power Simulation

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Calibration of Confidence Measures in Speech Recognition

A Survey on Unsupervised Machine Learning Algorithms for Automation, Classification and Maintenance

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

Assessment of Student Academic Achievement

Probabilistic Latent Semantic Analysis

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Truth Inference in Crowdsourcing: Is the Problem Solved?

Australian Journal of Basic and Applied Sciences

Human Emotion Recognition From Speech

Mining Student Evolution Using Associative Classification and Clustering

Syllabus Education Department Lincoln University EDU 311 Social Studies Methods

WHEN THERE IS A mismatch between the acoustic

CSL465/603 - Machine Learning

Artificial Neural Networks written examination

Issues in the Mining of Heart Failure Datasets

STA 225: Introductory Statistics (CT)

Using Web Searches on Important Words to Create Background Sets for LSI Classification

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

The taming of the data:

DOMAIN MISMATCH COMPENSATION FOR SPEAKER RECOGNITION USING A LIBRARY OF WHITENERS. Elliot Singer and Douglas Reynolds

Corrective Feedback and Persistent Learning for Information Extraction

Robot Learning Simultaneously a Task and How to Interpret Human Instructions

arxiv: v2 [cs.cv] 30 Mar 2017

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

LA1 - High School English Language Development 1 Curriculum Essentials Document

Modeling function word errors in DNN-HMM based LVCSR systems

Disambiguation of Thai Personal Name from Online News Articles

Speech Recognition at ICSI: Broadcast News and beyond

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

supplemental materials

Probability Therefore (25) (1.33)

Multiple Measures Assessment Project - FAQs

Building a Semantic Role Labelling System for Vietnamese

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Presentation of the English Montreal School Board To Mme Michelle Courchesne, Ministre de l Éducation, du Loisir et du Sport on

Modeling function word errors in DNN-HMM based LVCSR systems

Students will analyze governmental institutions, political behavior, civic engagement, and their political and philosophical foundations.

Active Learning. Yingyu Liang Computer Sciences 760 Fall

AP Statistics Summer Assignment 17-18

Conference Presentation

P A S A D E N A C I T Y C O L L E G E SHARED GOVERNANCE

Laboratorio di Intelligenza Artificiale e Robotica

Probability and Statistics Curriculum Pacing Guide

A Comparison of Two Text Representations for Sentiment Analysis

Lessons on American Presidents.com

K-Medoid Algorithm in Clustering Student Scholarship Applicants

Summary of Special Provisions & Money Report Conference Budget July 30, 2014 Updated July 31, 2014

A study of speaker adaptation for DNN-based speech synthesis

Short Text Understanding Through Lexical-Semantic Analysis

Cooperative evolutive concept learning: an empirical study

Left, Left, Left, Right, Left

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Content Language Objectives (CLOs) August 2012, H. Butts & G. De Anda

CS 446: Machine Learning

Astronomy News. Activity developed at Cégep de Saint-Félicien By BRUNO MARTEL

First Grade Curriculum Highlights: In alignment with the Common Core Standards

Improving Simple Bayes. Abstract. The simple Bayesian classier (SBC), sometimes called

Professor Christina Romer. LECTURE 24 INFLATION AND THE RETURN OF OUTPUT TO POTENTIAL April 20, 2017

Generation of Attribute Value Taxonomies from Data for Data-Driven Construction of Accurate and Compact Classifiers

Self-Supervised Acquisition of Vowels in American English

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

GDP Falls as MBA Rises?

Lesson 12. Lesson 12. Suggested Lesson Structure. Round to Different Place Values (6 minutes) Fluency Practice (12 minutes)

AQUA: An Ontology-Driven Question Answering System

Talk About It. More Ideas. Formative Assessment. Have students try the following problem.

South Carolina College- and Career-Ready Standards for Mathematics. Standards Unpacking Documents Grade 5

Mathematics subject curriculum

Learning From the Past with Experiment Databases

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

BY-LAWS THE COLLEGE OF ENGINEERING AND COMPUTER SCIENCE THE UNIVERSITY OF TENNESSEE AT CHATTANOOGA

SARDNET: A Self-Organizing Feature Map for Sequences

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Chapter 2. University Committee Structure

Speech Emotion Recognition Using Support Vector Machine

Creating the Student Platform Fall 2008

Transcription:

Unsupervised Learning (Examples) Javier Béjar cbea Term 2010/2011 Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 1 / 25

Outline 1 Iris 2 Voting Records 3 Mushroom 4 Image Segmentation Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 2 / 25

Iris Iris Differentiate among three species of flowers (Iris) 4 continuous attributes Attributes: Measures of characteristics of the flowers 150 instances 3 classes 96 % accuracy for supervised learning Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 3 / 25

Iris Iris Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 4 / 25

Iris Iris - Expectation/maximization We use the EM algorithm looking for 3 clusters Clusters are relatively clear, accuracy is a little bit lower 0 1 2 <-- assigned to cluster 0 50 0 Iris-setosa 50 0 0 Iris-versicolor 14 0 36 Iris-virginica Cluster 0 <-- Iris-versicolor Cluster 1 <-- Iris-setosa Cluster 2 <-- Iris-virginica Incorrectly clustered instances : 14.0 9.3333 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 5 / 25

Iris Iris - Expectation/maximization Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 6 / 25

Iris Iris - K-means K-means algorithm looking of 3 clusters Clusters are relatively clear, but cluster intersection affects prediction 0 1 2 <-- assigned to cluster 0 50 0 Iris-setosa 47 0 3 Iris-versicolor 14 0 36 Iris-virginica Cluster 0 <-- Iris-versicolor Cluster 1 <-- Iris-setosa Cluster 2 <-- Iris-virginica Incorrectly clustered instances : 17.0 11.3333 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 7 / 25

Voting Records Voting Records Classify US senators by their voting 16 binary attributes Attributes: Vote of the senator to different proposals (budget, immigration, taxes, military aid,...) 435 instances 2 classes 96.3 % accuracy for supervised learning Visualization of the data set is very difficult (binary attributes!) Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 8 / 25

Voting Records Voting Records - PCA PCA is used to obtain a new set of attributes The data set does not holds the conditions to apply PCA (non gaussian data) The 3 first components explain the 60 % of the variance (the first one explains 45 %, All are needed to reach 95 % of variance) Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 9 / 25

Voting records - PCA Voting Records Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 10 / 25

Voting Records Voting Records - Expectation-maximization EM algorithm is applied looking for 2 clusters Clusters are not very clear, the error is large 0 1 <-- assigned to cluster 44 223 democrat 159 9 republican Cluster 0 <-- republican Cluster 1 <-- democrat Incorrectly clustered instances : 53.0 12.1839 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 11 / 25

Voting Records Voting Records - K-means K-means algorithm is applied looking for 2 clusters The error is larger because of the intersection among clusters 0 1 <-- assigned to cluster 50 217 democrat 157 11 republican Cluster 0 <-- republican Cluster 1 <-- democrat Incorrectly clustered instances : 61.0 14.023 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 12 / 25

Mushroom Mushroom Distinguish between poisonous and edible mushrooms 22 Attributes binary and nominal Attributes: Visible characteristics of the mushrooms About 8000 instances 2 classes 100 % accuracy for supervised learning Visualization using the original attributes is difficult (binary and nominal attributes!) Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 13 / 25

Mushroom Mushroom - PCA PCA is used to obtain a new set of attributes The data set does not holds the conditions to apply PCA (non gaussian data) The first 10 components explain only 50 % of the variance. Are necessary all to explain 95 % of the variance (PCA has 59 components). Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 14 / 25

Mushroom - PCA Mushroom Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 15 / 25

Mushroom - PCA Mushroom Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 16 / 25

Mushroom - PCA Mushroom Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 17 / 25

Mushroom - PCA Mushroom Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 18 / 25

Mushroom Mushroom - Expectation/maximization EM algorithm is applied looking for 2 clusters Clusters are not very clear, the error is large Probably it is more interesting to look for more clusters and analyze them (the data set has more structure than the supervised labels show) 0 1 <-- assigned to cluster 4208 0 e 836 3080 p Cluster 0 <-- e Cluster 1 <-- p Incorrectly clustered instances : 836.0 10.2905 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 19 / 25

Mushroom Mushroom - Expectation/maximization + attribute selection We are cheating :-) A wrapper using decision trees is used to find the relevant attributes (5 relevant attributes) EM algorithm is applied looking for 2 clusters 0 1 <-- assigned to cluster 4000 208 e 528 3388 p Cluster 0 <-- e Cluster 1 <-- p Incorrectly clustered instances : 736.0 9.0596 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 20 / 25

Mushroom Mushroom - K-means K-means algorithm is applied looking for 2 clusters The result is awful, intersection among classes is large, there is no good partition of the data 0 1 <-- assigned to cluster 1234 2974 e 2093 1823 p Cluster 0 <-- p Cluster 1 <-- e Incorrectly clustered instances: 3057.0 37.6292 % Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 21 / 25

Image Segmentation Clustering for Image Processing Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 22 / 25

Image Segmentation Clustering in image processing Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 23 / 25

Image Segmentation Clustering for Image Processing Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 24 / 25

Image Segmentation Clustering for Image Processing Javier Béjar cbea Unsupervised Learning (Examples) Term 2010/2011 25 / 25