Pattern Recognition. CSE 802 Michigan State University Spring 2008

Pattern Recognition
"The real power of human thinking is based on recognizing patterns. The better computers get at pattern recognition, the more humanlike they will become." (Ray Kurzweil, NY Times, Nov 24, 2003)

What is a Pattern?
"A pattern is the opposite of a chaos; it is an entity, vaguely defined, that could be given a name." (Watanabe)

Recognition
Identification of a pattern as a member of a category we already know or are familiar with.
Classification: assignment to known categories.
Clustering: creation of new categories.
[Figure: Category A and Category B; panels "Classification" and "Clustering"]

Pattern Recognition
Given an input pattern, make a decision about the category or class of the pattern.
Pattern recognition is a very broad subject with many applications.
In this course we will study a variety of techniques to solve P.R. problems and discuss their relative strengths and weaknesses.

Pattern Class
A collection of similar (not necessarily identical) objects.
A class is defined by class samples (paradigms, exemplars, prototypes, training/learning samples).
Inter-class variability.
Intra-class variability.

Pattern Class Model
Different descriptions, typically mathematical/statistical in form, for each class/population.
Given a pattern, choose the best-fitting model for it and then assign it to the class associated with that model.

Intra-class and Inter-class Variability
The letter T in different typefaces.
The same face under different expression and pose.

Interclass Similarity
Characters that look similar.
Identical twins.

Pattern Recognition
"Having been shown a few positive examples (and perhaps a few negative examples) of a pattern class, the system learns to tell whether or not a new object belongs in this class." (Watanabe)
Inferring a generality from a few exemplars.
COGNITION = formation of new classes.
RECOGNITION = known classes.

Pattern Recognition Applications
Problem | Input | Output
Speech recognition | Speech waveforms | Spoken words, speaker identity
Non-destructive testing | Ultrasound, eddy current, acoustic emission waveforms | Presence/absence of flaw, type of flaw
Detection and diagnosis of disease | EKG, EEG waveforms | Types of cardiac conditions, classes of brain conditions
Natural resource identification | Multispectral images | Terrain forms, vegetation cover
Aerial reconnaissance | Visual, infrared, radar images | Tanks, airfields
Character recognition (page readers, zip code, license plate) | Optical scanned image | Alphanumeric characters

Pattern Recognition Applications
Problem | Input | Output
Identification and counting of cells | Slides of blood samples, microsections of tissues | Type of cells
Inspection (PC boards, IC masks, textiles) | Scanned image (visible, infrared) | Acceptable/unacceptable
Manufacturing | 3-D images (structured light, laser, stereo) | Identify objects, pose, assembly
Web search | Key words specified by a user | Text relevant to the user
Fingerprint identification | Input image from fingerprint sensors | Owner of the fingerprint, fingerprint classes
Online handwriting retrieval | Query word written by a user | Occurrence of the word in the database

Pattern Recognition System
Challenges: representation, matching.
A pattern recognition system involves training and testing.

Difficulties of Representation
How should we model a face to account for the large intra-class variability?
John P. Frisby, Seeing: Illusion, Brain and Mind, Oxford University Press, 1980

Difficulties of Representation
"How do you instruct someone (or some computer) to recognize caricatures in a magazine, let alone find a human figure in a misshapen piece of work? A program that could distinguish between male and female faces in a random snapshot would probably earn its author a Ph.D. in computer science." (Penzias 1989)
A representation could consist of a vector of real-valued numbers, an ordered list of attributes, or parts and their relations.

Good Representation!
Should have some invariant properties (e.g., w.r.t. rotation, translation, scale).
Account for intra-class variations.
Ability to discriminate pattern classes of interest.
Robustness to noise/occlusion.
Lead to simple decision making (e.g., linear decision boundary).
Low cost (affordable).

Pattern Recognition System
Domain-specific knowledge: acquisition, representation.
Data acquisition: camera, ultrasound, MRI, ...
Preprocessing: image enhancement/restoration, segmentation.
Representation: features such as color, shape, texture.
Decision making: statistical/geometric pattern recognition, syntactic/structural pattern recognition, artificial neural networks.
Post-processing/context.

Pattern Recognition System Performance
Error rate (probability of misclassification) on independent test samples.
Speed.
Cost.
Robustness.
Reject option.
Return on investment.
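
The error rate and the reject option interact: refusing to decide on low-confidence samples lowers the error rate at the price of a higher reject rate. The sketch below illustrates that trade-off. The function name `evaluate`, the confidence scores, and the convention of measuring both rates over all test samples are assumptions made for this illustration, not something the slides specify.

```python
def evaluate(true_labels, posteriors, reject_threshold=0.0):
    """Return (error_rate, reject_rate) over all test samples.

    posteriors: one dict per sample mapping class label -> estimated
    confidence. A sample is rejected when its best confidence falls
    below reject_threshold (hypothetical rule for illustration).
    """
    errors = rejects = 0
    for truth, post in zip(true_labels, posteriors):
        label, confidence = max(post.items(), key=lambda kv: kv[1])
        if confidence < reject_threshold:
            rejects += 1          # no decision is made for this sample
        elif label != truth:
            errors += 1           # a decision was made and it was wrong
    n = len(true_labels)
    return errors / n, rejects / n


# Made-up test set: four samples, two classes.
truth = ["a", "b", "a", "b"]
scores = [
    {"a": 0.90, "b": 0.10},
    {"a": 0.60, "b": 0.40},
    {"a": 0.45, "b": 0.55},
    {"a": 0.20, "b": 0.80},
]
without_reject = evaluate(truth, scores)
with_reject = evaluate(truth, scores, reject_threshold=0.7)
```

On this toy data the two misclassified samples are exactly the low-confidence ones, so rejecting below 0.7 removes both errors while leaving half of the samples undecided.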

Fingerprint Classification
Assign fingerprints to one of several pre-specified types: Plain Arch, Tented Arch, Right Loop, Left Loop, Accidental, Pocket Whorl, Plain Whorl, Double Loop.

Fingerprint Enhancement
To address the problem of poor quality fingerprints.
[Figure: noisy image and enhanced image]

Segmentation: Face Detection *Theo Pavlidis, http://home.att.net/~t.pavlidis/comphumans/comphuman.htm

Segmentation: Face Detection Games Magazine, September 2001

Fish Classification Preprocessing will involve image enhancement, separating touching/occluding fishes and finding the boundary of the fish

Length Feature Training (design or learning) Samples

Lightness Feature
Overlap in the histograms is small compared to the length feature.

Two-dimensional Feature Space (Representation)
Cost of misclassification?
Two features together are better than the individual features.
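
A single-feature classifier of the kind discussed for length and lightness can be sketched as a threshold search that minimizes the training error. Everything specific below is assumed for illustration: the species names, the feature values, and the helper name `best_threshold` are not taken from the slides.

```python
def best_threshold(values, labels, low_class, high_class):
    """Return (threshold, training_errors) for the best single cut.

    Samples with a feature value above the threshold are assigned to
    high_class, the rest to low_class. Candidate thresholds are the
    midpoints between consecutive distinct feature values.
    """
    points = sorted(set(values))
    candidates = [(a + b) / 2 for a, b in zip(points, points[1:])]
    best = None
    for t in candidates:
        errors = sum(
            (high_class if v > t else low_class) != y
            for v, y in zip(values, labels)
        )
        if best is None or errors < best[1]:
            best = (t, errors)
    return best


# Hypothetical training samples: four fish of each species.
labels = ["salmon"] * 4 + ["sea bass"] * 4
length = [10, 12, 14, 16, 13, 15, 17, 19]            # heavily overlapping
lightness = [2.0, 3.0, 3.5, 4.0, 5.0, 6.0, 6.5, 7.0]  # well separated

length_cut = best_threshold(length, labels, "salmon", "sea bass")
lightness_cut = best_threshold(lightness, labels, "salmon", "sea bass")
```

With these made-up numbers no cut on length gets below two training errors, while lightness separates the samples perfectly, mirroring the smaller histogram overlap noted for the lightness feature.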

Complex Decision Boundary Issue of generalization

Boundary With Good Generalization Simplify the decision boundary!
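
The contrast between a complex boundary and one that generalizes can be illustrated with two classifiers trained on the same data. The choice of a 1-nearest-neighbor rule (complex boundary) and a nearest-class-mean rule (simple linear boundary) is made here for illustration only, and all data points are made up; one training sample is deliberately mislabeled to play the role of noise.

```python
def sq_dist(p, q):
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def nearest_neighbor(train, x):
    """1-NN: copy the label of the closest training sample."""
    return min(train, key=lambda item: sq_dist(item[0], x))[1]


def nearest_mean(train, x):
    """Assign x to the class whose sample mean is closest."""
    sums = {}
    for (px, py), label in train:
        sx, sy, n = sums.get(label, (0.0, 0.0, 0))
        sums[label] = (sx + px, sy + py, n + 1)
    means = {label: (sx / n, sy / n) for label, (sx, sy, n) in sums.items()}
    return min(means, key=lambda label: sq_dist(means[label], x))


def error_rate(classifier, train, samples):
    wrong = sum(classifier(train, x) != y for x, y in samples)
    return wrong / len(samples)


train = [
    ((1.0, 1.0), "A"), ((1.2, 0.8), "A"), ((0.8, 1.1), "A"), ((1.1, 1.3), "A"),
    ((3.0, 3.0), "B"), ((3.2, 2.8), "B"), ((2.9, 3.1), "B"),
    ((1.05, 0.85), "B"),   # noisy sample: labeled B inside the A cluster
]
test = [((1.05, 0.9), "A"), ((0.9, 1.0), "A"), ((3.1, 3.0), "B")]

nn_train = error_rate(nearest_neighbor, train, train)
nn_test = error_rate(nearest_neighbor, train, test)
mean_train = error_rate(nearest_mean, train, train)
mean_test = error_rate(nearest_mean, train, test)
```

The 1-NN rule fits the training set perfectly, noise included, and then misclassifies a test sample that falls next to the noisy point. The simpler rule accepts one training error and classifies all three test samples correctly.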

Feature Selection/Extraction
How many features, and which ones, should be used in constructing the decision boundary?
Some features may be redundant!
Curse of dimensionality: problems arise with too many features, especially when we have a small number of training samples.

Fruit Sorter
[Figure: decision boundaries separating cherries, apples, lemons, and grapefruits in a feature space with axes diameter and redness. Castleman, Digital Image Processing, Prentice-Hall, 1979]

General Purpose P.R. System
Humans have the ability to switch rapidly and seamlessly between different pattern recognition tasks.
It is very difficult to design a device that is capable of performing a variety of different classification tasks.

Cat vs. Dog

Sheep vs. Goat
Access control for water in areas with water shortage (e.g., the Australian outback); wildlife vs. livestock.
Install a gate that opens only when livestock enters (decision: deny or allow).
Identify livestock using a PR system:
Rugged outdoor camera captures the image.
Edge detection and outline tracing.
Match to a library of existing shape templates.
Open the gate when there is a match.
Prototype system by Dunn et al., U. South Queensland, Australia. They claim that sheep and goats can be separated with ~100% accuracy.
Vision Systems Design, November 2007 (www.vision-systems.com)

Supervised Classification Training samples are labeled

Unsupervised Classification Training samples are unlabeled
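
When training samples are unlabeled, categories have to be created from the data itself. One common way to do this is k-means clustering, sketched below. The slides do not name a specific clustering algorithm, so k-means, the function name `kmeans`, the data points, and the initial centers are all assumptions for illustration.

```python
def kmeans(points, centers, iterations=10):
    """Plain k-means on 2-D points, starting from the given centers.

    Returns (centers, clusters), where clusters[i] lists the points
    assigned to centers[i] in the last assignment step.
    """
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point goes to its closest center.
        clusters = [[] for _ in centers]
        for p in points:
            d = [(p[0] - c[0]) ** 2 + (p[1] - c[1]) ** 2 for c in centers]
            clusters[d.index(min(d))].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        centers = [
            (sum(x for x, _ in cl) / len(cl), sum(y for _, y in cl) / len(cl))
            if cl else c
            for cl, c in zip(clusters, centers)
        ]
    return centers, clusters


# Unlabeled, made-up samples forming two visible groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centers, clusters = kmeans(points, centers=[(0.0, 0.0), (10.0, 10.0)])
```

No labels are used anywhere; the two groups emerge from distances alone. The result depends on the number of clusters and on the initial centers, both of which are chosen by hand here.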

Models for Pattern Recognition
Template matching.
Statistical (geometric).
Syntactic (structural).
Artificial neural networks (biologically motivated?).
Hybrid approach.

Template Matching Template Input scene
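
Template matching can be sketched as sliding the template over the input scene and scoring each placement. The similarity measure used below, the sum of squared differences (SSD), is one common choice and an assumption here; the slides do not say which measure is intended. The tiny binary scene and template are made up.

```python
def match_template(scene, template):
    """Return (row, col, ssd) of the placement with the smallest SSD.

    scene and template are 2-D lists of numbers; the template must
    fit inside the scene.
    """
    th, tw = len(template), len(template[0])
    best = None
    for r in range(len(scene) - th + 1):
        for c in range(len(scene[0]) - tw + 1):
            ssd = sum(
                (scene[r + i][c + j] - template[i][j]) ** 2
                for i in range(th)
                for j in range(tw)
            )
            if best is None or ssd < best[2]:
                best = (r, c, ssd)
    return best


scene = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
]
template = [
    [1, 1],
    [1, 0],
]
location = match_template(scene, template)
```

The search only tries translations of a rigid template, which is why the approach depends on small intra-class variability: a rotated, scaled, or deformed instance of the pattern would not produce a low score.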

Deformable Template: Corpus Callosum Segmentation
Shape training set.
Prototype and variation learning.
Prototype registration to the low-level segmented image.
Prototype warping.

Statistical Pattern Recognition
Recognition: pattern -> Preprocessing -> Feature extraction -> Classification
Training: patterns + class labels -> Preprocessing -> Feature selection -> Learning

Representation
Each pattern is represented as a point in the d-dimensional feature space.
Features and their desired invariance properties are domain-specific.
Good representation leads to small intraclass variation, large interclass separation, and a simple decision rule.
[Figure: two scatter plots in a feature space with axes x_1 and x_2]

Invariant Representation
Invariance to: translation, rotation, scale, skew, deformation, color.
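
Two of the listed invariances, translation and scale, can be obtained for a 2-D point pattern by a simple normalization. The sketch below is one possible way to do it (subtract the centroid, divide by the root-mean-square distance from it); the function name and the example shape are assumptions, and rotation, skew, deformation, and color are not handled.

```python
import math


def normalize(points):
    """Map a 2-D point set to a translation- and scale-invariant form."""
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    # Translation invariance: express points relative to the centroid.
    centered = [(x - cx, y - cy) for x, y in points]
    # Scale invariance: divide by the RMS distance from the centroid.
    scale = math.sqrt(sum(x * x + y * y for x, y in centered) / n)
    return [(x / scale, y / scale) for x, y in centered]


shape = [(0.0, 0.0), (2.0, 0.0), (2.0, 1.0), (0.0, 1.0)]
# The same shape, three times larger and shifted.
moved = [(3 * x + 5, 3 * y - 2) for x, y in shape]

same = all(
    abs(a - c) < 1e-9 and abs(b - d) < 1e-9
    for (a, b), (c, d) in zip(normalize(shape), normalize(moved))
)
```

After normalization the original and the shifted, enlarged copy have the same coordinates up to rounding, so any classifier applied afterwards sees them as the same pattern.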

Structural Pattern Recognition
Decision-making when features are non-numeric or structural.
Describe complicated objects in terms of simple primitives and their structural relationships.
[Figure: a scene decomposed hierarchically into Object and Background, which are in turn decomposed into primitives labeled D, E, L, M, N, T, X, Y, Z]

Syntactic Pattern Recognition
Recognition: pattern -> Preprocessing -> Primitive, relation extraction -> Syntax, structural analysis
Training: patterns + class labels -> Preprocessing -> Primitive selection -> Grammatical, structural inference

Chromosome Grammars
Terminals: V_T = { , , , , } (five graphical primitives, shown as images on the slide)
Non-terminals: V_N = {A, B, C, D, E, F}
Pattern classes: Median, Submedian, Acrocentric, Telocentric

Chromosome Grammars
Image of human chromosomes.
Hierarchical-structure description of a submedian chromosome.

Artificial Neural Networks
Massive parallelism is essential for complex pattern recognition tasks (e.g., speech and image recognition).
Humans take only a few hundred milliseconds for most cognitive tasks; this suggests parallel computation in the human brain.
Biological networks achieve excellent recognition performance via dense interconnection of simple computational elements (neurons).
Number of neurons: $10^{10}$ to $10^{12}$
Number of interconnections per neuron: $10^{3}$ to $10^{4}$
Total number of interconnections: $10^{14}$

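Artificial Neural Networks
Nodes in neural networks are nonlinear, typically analog.
[Figure: inputs $x_1, x_2, \ldots, x_d$ with weights $w_1, \ldots, w_d$ feeding a single node with output $Y$]
$Y = f\left(\sum_{i=1}^{d} w_i x_i - \theta\right)$, where $\theta$ is the internal threshold or offset.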
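
A single node of this kind can be sketched as a weighted sum of the inputs, shifted by the internal threshold and passed through a nonlinearity. The exact functional form, the two activation functions, and the weights below are assumptions chosen for illustration.

```python
import math


def step(a):
    """Hard-limiting nonlinearity."""
    return 1 if a >= 0 else 0


def sigmoid(a):
    """Smooth (analog) nonlinearity."""
    return 1.0 / (1.0 + math.exp(-a))


def neuron(x, w, theta, f):
    """Output of one node: f(sum_i w_i * x_i - theta)."""
    return f(sum(wi * xi for wi, xi in zip(w, x)) - theta)


# Hand-picked weights and threshold that make the node compute logical AND.
inputs = [(0, 0), (0, 1), (1, 0), (1, 1)]
and_outputs = [neuron(x, w=(1, 1), theta=1.5, f=step) for x in inputs]

# The same weighted sum through a sigmoid gives a graded output.
analog_output = neuron((1, 1), w=(1, 1), theta=2.0, f=sigmoid)
```

With a step nonlinearity the node draws a linear decision boundary in the input space; the sigmoid version returns a value between 0 and 1 instead of a hard decision.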

Multilayer Perceptron
Feed-forward nets with one or more (hidden) layers between the input and output nodes.
A three-layer net can generate arbitrarily complex decision regions.
These nets can be trained by the backpropagation training algorithm.
[Figure: d inputs -> first hidden layer (NH1 input units) -> second hidden layer (NH2 input units) -> c outputs]
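
The effect of a hidden layer can be seen on XOR, a problem that no single linear boundary solves. The sketch below is a forward pass through a small feed-forward net with one hidden layer. The weights are set by hand for illustration rather than learned by backpropagation, and the step nonlinearity and layer sizes are assumptions.

```python
def step(a):
    return 1.0 if a >= 0 else 0.0


def layer(inputs, weights, thresholds):
    """One fully connected layer of threshold units.

    weights[k] holds the input weights of unit k; thresholds[k] is
    its internal threshold.
    """
    return [
        step(sum(w * x for w, x in zip(row, inputs)) - t)
        for row, t in zip(weights, thresholds)
    ]


def xor_net(x1, x2):
    # Hidden layer: unit 0 fires for "x1 OR x2", unit 1 for "x1 AND x2".
    hidden = layer([x1, x2], weights=[[1, 1], [1, 1]], thresholds=[0.5, 1.5])
    # Output unit fires for "OR and not AND", which is XOR.
    return layer(hidden, weights=[[1, -1]], thresholds=[0.5])[0]


xor_table = [xor_net(a, b) for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]]
```

Each hidden unit contributes one linear boundary, and the output unit combines them into a region that is not linearly separable in the original inputs.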

Utilizing Context How m ch info mation are y u mi sing Qvest

Constraining the Problem
Graffiti alphabet: Graffiti's modified alphabet is largely based on single pen strokes, starting at the dots. As soon as the pen is lifted from the screen, the letter is immediately translated into normal text. The letter X is the exception.

Comparing Pattern Recognition Models
Template matching: assumes very small intra-class variability; learning is difficult for deformable templates.
Syntactic: primitive extraction is sensitive to noise; describing a pattern in terms of primitives is difficult.
Statistical: assumption of a density model for each class.
Neural network: parameter tuning and local minima in learning.
In practice, statistical and neural network approaches work well.

Super Classifier Pool the evidence from component recognizers (classifier combination, mixture of experts, evidence accumulation)
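
One simple way to pool the evidence from component recognizers is a majority vote over their decisions. The sketch below assumes that rule; the slides list several combination schemes without singling one out, and the three threshold classifiers are hypothetical stand-ins for real recognizers.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by most component classifiers."""
    return Counter(predictions).most_common(1)[0][0]


def combine(classifiers, x):
    return majority_vote([classify(x) for classify in classifiers])


# Three hypothetical component classifiers on a 2-D feature vector.
classifiers = [
    lambda x: "A" if x[0] < 5 else "B",          # looks at feature 1 only
    lambda x: "A" if x[1] < 5 else "B",          # looks at feature 2 only
    lambda x: "A" if x[0] + x[1] < 9 else "B",   # looks at both features
]

decisions = [combine(classifiers, x) for x in [(4, 6), (4, 4.5), (6, 2)]]
```

For the first and third samples the components disagree and the vote settles the decision. An odd number of two-class components avoids ties; with more classes or an even number of voters a tie-breaking rule would be needed.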

Statistical Pattern Recognition
Patterns are represented in a feature space.
Statistical model for pattern generation in feature space.
Given training patterns from each class, the goal is to partition the feature space.

Approaches to Statistical Pattern Recognition
Prior information
  Complete: Bayes decision theory
  Incomplete:
    Supervised learning
      Parametric approach: "optimal" rules, plug-in rules
      Nonparametric approach: density estimation, geometric rules (K-NN, MLP)
    Unsupervised learning
      Parametric approach: mixture resolving
      Nonparametric approach: cluster analysis (hard, fuzzy)
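
As an illustration of the supervised, parametric branch, a plug-in rule estimates the parameters of an assumed density from labeled samples and substitutes them into the Bayes decision rule. The sketch below assumes one-dimensional Gaussian class densities and priors estimated from class frequencies; these modeling choices, the function names, and the data are all assumptions for this example.

```python
import math


def fit_gaussian(samples):
    """Maximum-likelihood estimates of mean and variance."""
    mean = sum(samples) / len(samples)
    var = sum((s - mean) ** 2 for s in samples) / len(samples)
    return mean, var


def log_gaussian(x, mean, var):
    return -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)


def train(data):
    """data: dict mapping class label -> list of 1-D training samples.

    Returns label -> (prior, mean, variance), all estimated from data.
    """
    total = sum(len(v) for v in data.values())
    return {label: (len(v) / total, *fit_gaussian(v)) for label, v in data.items()}


def classify(model, x):
    """Pick the class with the largest estimated log posterior."""
    return max(
        model,
        key=lambda c: math.log(model[c][0]) + log_gaussian(x, model[c][1], model[c][2]),
    )


model = train({"A": [1.0, 2.0, 3.0], "B": [6.0, 7.0, 8.0]})
labels = [classify(model, x) for x in [2.5, 4.4, 4.6, 6.0]]
```

With equal priors and equal estimated variances the decision boundary lies midway between the two estimated means, at 4.5 for this data. The rule is only as good as the assumed density model and the parameter estimates plugged into it.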

Summary
Pattern recognition is extremely useful for automatic decision making and for assisting human decision makers.
Pattern recognition is a very difficult problem.
Successful systems have been built in well-constrained domains.
No single technique/model is suited for all pattern recognition problems.
Use of object models, constraints, and context is necessary for identifying complex patterns.
Careful sensor design and feature extraction can lead to simple classifiers.

Key Concepts
Pattern class, representation, feature extraction, feature selection, invariance (rotation, translation, scale, deformation), preprocessing, segmentation, training samples, test samples, error rate, reject rate, curse of dimensionality.

Key Concepts
Supervised classification, decision boundary, unsupervised classification (clustering), density estimation, cost of misclassification/risk, feature space partitioning, generalization/overfitting, contextual information, multiple classifiers, prior knowledge.