Pattern Recognition Systems

Similar documents
Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Python Machine Learning

(Sub)Gradient Descent

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Lecture 1: Machine Learning Basics

Probabilistic Latent Semantic Analysis

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Speech Recognition at ICSI: Broadcast News and beyond

A Case Study: News Classification Based on Term Frequency

Rule Learning With Negation: Issues Regarding Effectiveness

Language Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus

CS Machine Learning

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Lecture 10: Reinforcement Learning

Word Segmentation of Off-line Handwritten Documents

Visit us at:

Improving Fairness in Memory Scheduling

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

INPE São José dos Campos

Knowledge Transfer in Deep Convolutional Neural Nets

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

Probability and Statistics Curriculum Pacing Guide

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Australian Journal of Basic and Applied Sciences

Rule Learning with Negation: Issues Regarding Effectiveness

Interpreting ACER Test Results

Learning Methods for Fuzzy Systems

Laboratorio di Intelligenza Artificiale e Robotica

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

Software Maintenance

Conversation Starters: Using Spatial Context to Initiate Dialogue in First Person Perspective Games

Going to School: Measuring Schooling Behaviors in GloFish

Economics 201 Principles of Microeconomics Fall 2010 MWF 10:00 10:50am 160 Bryan Building

Generative models and adversarial training

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

1.1 Examining beliefs and assumptions Begin a conversation to clarify beliefs and assumptions about professional learning and change.

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

elearning OVERVIEW GFA Consulting Group GmbH 1

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Axiom 2013 Team Description Paper

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

A Reinforcement Learning Variant for Control Scheduling

CSL465/603 - Machine Learning

Lecture 2: Quantifiers and Approximation

Calibration of Confidence Measures in Speech Recognition

WHEN THERE IS A mismatch between the acoustic

Learning Methods in Multilingual Speech Recognition

Learning From the Past with Experiment Databases

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Linking Task: Identifying authors and book titles in verbose queries

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Acquiring Competence from Performance Data

A study of speaker adaptation for DNN-based speech synthesis

Lecture 1: Basic Concepts of Machine Learning

DO YOU HAVE THESE CONCERNS?

An Introduction to Simio for Beginners

Evolution of Symbolisation in Chimpanzees and Neural Nets

Assignment 1: Predicting Amazon Review Ratings

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

Introducing the New Iowa Assessments Mathematics Levels 12 14

Teaching a Laboratory Section

Human Emotion Recognition From Speech

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

Researcher Development Assessment A: Knowledge and intellectual abilities

Artificial Neural Networks written examination

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Go fishing! Responsibility judgments when cooperation breaks down

Introduction. Educational policymakers in most schools and districts face considerable pressure to

Using computational modeling in language acquisition research

16.1 Lesson: Putting it into practice - isikhnas

Softprop: Softmax Neural Network Backpropagation Learning

Natural Language Processing. George Konidaris

Rule-based Expert Systems

Lesson Plan Title Aquatic Ecology

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Speech Emotion Recognition Using Support Vector Machine

BENCHMARK TREND COMPARISON REPORT:

Characteristics of Functions

MYCIN. The MYCIN Task

Evidence for Reliability, Validity and Learning Effectiveness

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Case study Norway case 1

School of Innovative Technologies and Engineering

Shockwheat. Statistics 1, Activity 1

Transcription:

Pattern Recognition Systems Dr. Shuang LIANG School of Software Engineering TongJi University Fall, 2012

Today s Topics An example Pattern recognition systems The design cycle Introduction Pattern Recognition, Fall 2012 Dr. Shuang LIANG, SSE, TongJi

Today s Topics An example Pattern recognition systems The design cycle Introduction Pattern Recognition, Fall 2012 Dr. Shuang LIANG, SSE, TongJi

An Example Problem Sorting incoming fish on a conveyor belt according to species Assume that we have only two kinds of fish Sea bass salmon

Decision Process Q 1: What kind of information can distinguish one species from the other? Length Width Weight Number and shape of fins Tail shape

Decision Process Q 2: What can cause problems during sensing? Lighting conditions Position of fish on the conveyor belt Camera noise

Decision Process Q 3: What are the steps in the process? Which ones? Capture image Isolate fish Take measurements How? Make decision

Selecting Features Assume a fisherman told us that a sea bass is generally longer than a salmon We can use length as a feature and decide between sea bass and salmon according to a threshold on length How can we choose this threshold?

Selecting Feature (cont.) Histograms of the length feature for two types of fish in training samples. How can we choose the threshold l* to make a reliable decision?

Selecting Features (cont.) Even though sea bass is longer than salmon on the average, there are many examples of fish where this observation does not hold What should I do? Try another feature Average lightness of the fish scales

Selecting Features (cont.) Histograms of the lightness feature for two types of fish in training samples It looks easier to choose the threshold x* but we still cannot make a perfect decision

Cost of Error We should also consider costs of different errors we make in our decisions For example, If the fish packing company knows that Customers who buy salmon will object vigorously if they see sea bass in their cans Customers who buy sea bass will not be unhappy if they occasionally see some expensive salmon in their cans How does this knowledge affect our decision?

Multiple Features Assume we also observed that sea bass are typically wider than salmon We can use two features in our decision Lightness: x 1 Width: x 2 Each fish image is now represented as a point (feature vector) x1 x = x2 In a two-dimensional feature space

Multiple Features (cont.) Scatter plot of lightness and width features for training samples. We can draw a decision boundary to divide the feature space into two regions Does it look better than using only lightness?

Multiple Features (cont.) Does adding more features always improve the results? Avoid unreliable features Be careful about correlations with existing features Be careful about measure costs Be careful about noise in the measurements Is there some curse for working in very high dimensions?

Decision Boundaries Can we do better with another decision rule? Any better one?

Decision Boundaries (cont.) More complex models result in more complex boundaries

Decision Boundaries (cont.) Different criteria lead to different decision boundaries

Decision Boundaries (cont.) Two aspects of concerns We may distinguish training samples perfectly but how can we predict how well we can generalize to unknown samples How can we manage the tradeoff between complexity of decision rules and their performance to unknown samples?

More on Complexity Regression example Plot of 10 sample points for the input variable x along with the corresponding target variable t. Green curve is the true function that generated the data

More on Complexity (cont.) Polynomial curve fitting Plots of polynomials having various orders shown as red curved, fitted to the set of 10 sample points 0 th order polynomial 1 st order polynomial

More on Complexity (cont.) Polynomial curve fitting Plots of polynomials having various orders shown as red curved, fitted to the set of 10 sample points 3 rd order polynomial 9 th order polynomial

More on Complexity (cont.)

More on Complexity (cont.) Polynomial curve fitting Plots of 9 th order polynomials fitted to 15 and 100 sample points

Today s Topics An example Pattern recognition systems The design cycle Introduction Pattern Recognition, Fall 2012 Dr. Shuang LIANG, SSE, TongJi

Pattern Recognition Systems Object / process diagram of a pattern recognition system

Pattern Recognition Systems (cont.) Data acquisition and sensing Measurements of physical variables Important issues Bandwidth Resolution Sensitivity Distortion SNR Latency

Pattern Recognition Systems (cont.) Pre-processing Removal of noise in data Isolation of patterns of interest from the background Feature extraction Finding a new representation in terms of features

Pattern Recognition Systems (cont.) Modeling learning and estimation Learning a mapping between features and pattern groups and categories Classification Using features and learned models to assign a pattern to a category

Pattern Recognition Systems (cont.) Post-processing Evaluation of confidence in decisions Exploitation of content to improve performance Combination of experts

Today s Topics An example Pattern recognition systems The design cycle Introduction Pattern Recognition, Fall 2012 Dr. Shuang LIANG, SSE, TongJi

The Design Cycle Collect Data Select Features Select Models Train Classifier Evaluate Classifier

The Design Cycle Select Features Select Models Train Classifier Evaluate Classifier

The Design Cycle (cont.) Data collection Collecting training and testing data How can we know when we have adequately large and representative set of samples?

The Design Cycle Collect Data Select Models Train Classifier Evaluate Classifier

The Design Cycle (cont.) Feature selection Domain dependence and prior information Computational cost and feasibility Discriminative features Similar values for similar patterns Different values for different patterns. Invariant features with respect to translation, rotation and scale Robust features with respect to occlusion, distortion, deformation, and variations in environment.

The Design Cycle Collect Data Select Features Train Classifier Evaluate Classifier

The Design Cycle (cont.) Model selection Domain dependence and prior information Definition of design criteria Parametric vs. non-parametric models Handling of missing features Computational complexity Types of models: templates, decision-theoretic or statistical, syntactic or structural, neural, and hybrid How can we know how close we are to the true model underlying the patterns?

The Design Cycle Collect Data Select Features Select Models Evaluate Classifier

The Design Cycle (cont.) Training How can we learn the rule from data? Supervised learning: a teacher provides a category label or cost for each pattern in the training set Unsupervised learning: the system forms clusters or natural groupings of the input patterns Reinforcement learning: no desired category is given but the teacher provides feedback to the system such as the decision is right or wrong.

The Design Cycle Collect Data Select Features Select Models Train Classifier

The Design Cycle (cont.) Evaluation How can we estimate the performance with training samples? How can we predict the performance with future data? Problems of overfitting and generalization.

Summary Pattern recognition techniques find applications in many areas: machine learning, statistics, mathematics, computer science, biology, etc. There are many sub-problems in the design process Many of these problems can indeed be solved More complex learning, searching and optimization algorithms are developed with advances in computer technology There remain many fascinating unsolved problems