CSCI 5521: Pattern Recognition. Prof. Paul Schrater

Business
Check to make sure you received the test email. If not, you are not officially registered.
Course web page: http://gandalf.psych.umn.edu/~schrater/schrater_lab/courses/pattrecog03/pattrecog.html
Prof. Paul Schrater Pattern Recognition CSCI 5521

Syllabus
Reading materials:
  Statistical Pattern Recognition, 2nd Ed., Andrew Webb
  Pattern Classification, 2nd Ed., Duda, Hart, Stork (select chapters will be posted for download)
  Neural Networks for Pattern Recognition, Bishop (select chapters will be posted for download)
  Papers posted on the web site. Downloads will be password protected.
Grading:
  50% on the homework assignments
  20% on the midterm
  30% on the final project

Syllabus (cont'd)
Final project: 10-15 page paper involving:
  1) Simulation or experiments. For example, implement a pattern recognition system for a particular application, e.g. digit classification, document clustering, etc.
  2) Literature survey (with critical evaluation) on a given topic.
  3) Theoretical work (detailed derivations, extensions of existing work, etc.)
Important dates:
  Sept. 23: Topic selection. One or two pages explaining the project, with a list of references.
  Nov. 4: Partial report (3 to 5 pages).
  Dec. 16: Final report (10 to 15 pages).
Students may work in groups of 2-3.

Policies/Procedures
DO NOT CHEAT.
Do NOT work in groups for homework.
Submit homework electronically.
Homework must be submitted before class on the day it is due.

Introduction to Pattern Recognition
  Syllabus
  What are Patterns?
  Pattern Recognition
  An Example
  Pattern Recognition Systems
  The Design Cycle
  Learning and Adaptation
  Conclusion

Examples of Patterns

Examples of Patterns
Natural or not?
How can we describe these patterns?

Shape Patterns
D'Arcy Thompson's suggestion of species change through continuous deformation.
This figure shows the effects of Alzheimer's Disease on the ventricular expansion rate measured from serial MRI.

Explaining Patterns
Voice Puppetry, M. Brand; SIGGRAPH 99

Pattern Examples

Pattern Examples
Natural language is a pattern.

What is a Pattern?
A set of instances that:
  Share some regularities and similarities.
  Are repeatable.
  Are observable, sometimes partially, using sensors with noise and distortions.
How do we define regularity?
How do we define similarity?
How do we define likelihood for the repetition of a pattern?
How do we model the sensors?
What is not a pattern?

Two Schools of Pattern Recognition
Generative methods: Bayesian school, pattern theory.
  1) Define patterns and regularities (graph spaces).
  2) Specify a likelihood model for how signals are generated from hidden structures.
  3) Learn probability models from an ensemble of signals.
  4) Make inferences.
Discriminative methods:
  The goal is to tell apart a number of patterns, say 100 people or 10 digits, directly, without understanding or mathematical description.
  You should not solve a problem to a greater extent than you need.
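As a hedged summary of the generative recipe, inference in step 4 is commonly written with Bayes' rule. The symbols here are assumed notation that does not appear on the slide: x is the observed signal and \omega_j is the hidden class or structure.

```latex
P(\omega_j \mid x) \;=\; \frac{p(x \mid \omega_j)\, P(\omega_j)}{\sum_{k} p(x \mid \omega_k)\, P(\omega_k)}
```

A generative method models the likelihood p(x | \omega_j) and the prior P(\omega_j) and then applies this rule. A discriminative method skips those models and learns the decision (or the posterior) directly.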

Pattern Recognition Applications
Build a machine that can recognize patterns:
  Speech recognition
  Fingerprint identification
  OCR (Optical Character Recognition)
  DNA sequence identification

An Example
Sorting incoming fish on a conveyor according to species using optical sensing.
Species:
  Sea bass
  Salmon

Problem Analysis
Set up a camera and take some sample images to extract features:
  Length
  Lightness
  Width
  Number and shape of fins
  Position of the mouth, etc.
This is the set of all suggested features to explore for use in our classifier!

Preprocessing
  Segment the fish from the background.
Feature extraction
  Image data from each fish is summarized by a feature extractor, whose purpose is to reduce the data by measuring certain features.
Classification
  The features are passed to a classifier, which uses them to decide which class the instance belongs to.
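The three stages above can be sketched as three small functions. This is a minimal illustration under stated assumptions, not the course's implementation: the "image" is a hypothetical nested list of gray levels, background pixels are assumed to be 0, and the lightness threshold of 5.0 is made up for the toy data.

```python
from statistics import mean


def segment(image, background=0):
    """Preprocessing: drop background pixels and rows that are all background."""
    return [
        [p for p in row if p != background]
        for row in image
        if any(p != background for p in row)
    ]


def extract_features(fish_pixels):
    """Feature extraction: reduce the pixel data to (lightness, width)."""
    lightness = mean(p for row in fish_pixels for p in row)
    width = max(len(row) for row in fish_pixels)
    return lightness, width


def classify(features, lightness_threshold=5.0):
    """Classification: assumed rule, lighter fish are called sea bass."""
    lightness, _width = features
    return "sea bass" if lightness > lightness_threshold else "salmon"


# Hypothetical 4x4 toy images (0 = background).
light_fish = [[0, 0, 0, 0], [0, 8, 9, 0], [0, 7, 8, 0], [0, 0, 0, 0]]
dark_fish = [[0, 2, 3, 0], [0, 1, 2, 0]]

light_label = classify(extract_features(segment(light_fish)))
dark_label = classify(extract_features(segment(dark_fish)))
```

Keeping the stages as separate functions mirrors the slide: each stage can be swapped out (a better segmenter, more features, a different classifier) without touching the others.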

Length alone is a poor feature!
Select the lightness as a possible feature.

Threshold decision boundary and cost relationship
Move our decision boundary toward smaller values of lightness in order to minimize the cost (reduce the number of sea bass that are classified as salmon!).
This is the task of decision theory.
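To make the threshold and cost relationship concrete, here is a small sketch that picks the lightness threshold with the lowest total misclassification cost. The lightness values and the cost weights are invented for illustration, and the rule "lightness at or below the threshold means salmon" is an assumption consistent with the slide.

```python
def total_cost(threshold, salmon, sea_bass,
               cost_bass_as_salmon=1.0, cost_salmon_as_bass=1.0):
    """Cost of the rule: lightness <= threshold -> salmon, else sea bass."""
    bass_as_salmon = sum(1 for x in sea_bass if x <= threshold)
    salmon_as_bass = sum(1 for x in salmon if x > threshold)
    return (cost_bass_as_salmon * bass_as_salmon
            + cost_salmon_as_bass * salmon_as_bass)


def best_threshold(salmon, sea_bass, **costs):
    """Try every observed lightness value as a threshold; keep the cheapest."""
    candidates = sorted(set(salmon) | set(sea_bass))
    return min(candidates, key=lambda t: total_cost(t, salmon, sea_bass, **costs))


# Hypothetical lightness measurements.
salmon = [1.0, 2.0, 3.0, 5.0, 5.5, 6.0]
sea_bass = [4.0, 7.0, 8.0, 9.0, 10.0, 11.0]

equal_cost_threshold = best_threshold(salmon, sea_bass)
skewed_cost_threshold = best_threshold(salmon, sea_bass, cost_bass_as_salmon=4.0)
```

On this toy data the best threshold is 6.0 when both errors cost the same, and it drops to 3.0 when calling a sea bass a salmon is made four times as costly, which is the movement toward smaller lightness values that the slide describes.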

Adopt the lightness and add the width of the fish.
Fish: x^T = [x_1, x_2], where x_1 is the lightness and x_2 is the width.
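With two features, each fish becomes a point in the (lightness, width) plane, and one simple way to separate the classes is a straight line. The sketch below assumes a linear rule g(x) = w·x + b; the weights, the bias, and the choice that larger values favor sea bass are all hypothetical and are not taken from the slides.

```python
def linear_decision(x, w=(1.0, 1.0), b=-10.0):
    """Classify a feature vector x = (lightness, width) with a linear boundary."""
    g = sum(wi * xi for wi, xi in zip(w, x)) + b  # g(x) = w . x + b
    return "sea bass" if g > 0 else "salmon"


label_a = linear_decision((7.0, 5.0))  # g = 7 + 5 - 10 = 2
label_b = linear_decision((3.0, 4.0))  # g = 3 + 4 - 10 = -3
```

Points with g(x) > 0 fall on one side of the line and are labeled sea bass; the rest are labeled salmon.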

We might add other features that are not correlated with the ones we already have. Take care not to reduce performance by adding noisy features.
Ideally, the best decision boundary is the one that provides optimal performance, as in the figure on the following slide.

However, our satisfaction is premature, because the central aim of designing a classifier is to correctly classify novel input.
This is the issue of generalization!
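The gap between performance on the training samples and on novel input can be shown with a classifier that simply memorizes its training set. The sketch below uses a 1-nearest-neighbor rule as a stand-in for an overly flexible decision boundary; the technique and all data points are assumptions chosen for illustration, including one atypical salmon placed among the sea bass.

```python
def nearest_neighbor_label(x, train):
    """Return the label of the training sample closest to x (squared distance)."""
    closest = min(train, key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x)))
    return closest[1]


def error_rate(samples, train):
    """Fraction of samples whose predicted label differs from the true label."""
    wrong = sum(1 for x, y in samples if nearest_neighbor_label(x, train) != y)
    return wrong / len(samples)


# Hypothetical (lightness, width) samples.
train = [
    ((2.0, 3.0), "salmon"),
    ((3.0, 4.0), "salmon"),
    ((7.0, 6.0), "salmon"),    # atypical salmon sitting among the sea bass
    ((6.0, 5.0), "sea bass"),
    ((8.0, 6.0), "sea bass"),
    ((8.0, 7.0), "sea bass"),
]
novel = [
    ((7.0, 6.2), "sea bass"),
    ((2.5, 3.5), "salmon"),
    ((8.0, 6.5), "sea bass"),
    ((6.8, 5.9), "sea bass"),
]

training_error = error_rate(train, train)
novel_error = error_rate(novel, train)
```

The memorizing rule makes no mistakes on the samples it was built from, yet it misclassifies half of the novel fish here, because it fits the atypical training sample exactly.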

Pattern Recognition Systems
Sensing
  Use of a transducer (camera or microphone).
  The PR system depends on the bandwidth, resolution, sensitivity, and distortion of the transducer.
Segmentation and grouping
  Patterns should be well separated and should not overlap.

Feature extraction
  Discriminative features.
  Features invariant with respect to translation, rotation, and scale.
Classification
  Use a feature vector provided by a feature extractor to assign the object to a category.
Post-processing
  Exploit context (input-dependent information other than from the target pattern itself) to improve performance.

The Design Cycle
  Data collection
  Feature choice
  Model choice
  Training
  Evaluation
  Computational complexity

Data Collection
How do we know when we have collected an adequately large and representative set of examples for training and testing the system?

Feature Choice
Depends on the characteristics of the problem domain.
Features should be simple to extract, invariant to irrelevant transformations, and insensitive to noise.

Model Choice
We may be unsatisfied with the performance of our fish classifier and want to jump to another class of model.

Training
Use data to determine the classifier.
There are many different procedures for training classifiers and choosing models.

Evaluation
Measure the error rate (or performance) and, if necessary, switch from one set of features to another.

Computational Complexity
What is the trade-off between computational ease and performance?
(How does an algorithm scale as a function of the number of features, patterns, or categories?)

Learning and Adaptation
Supervised learning
  A teacher provides a category label or cost for each pattern in the training set.
Unsupervised learning
  The system forms clusters or natural groupings of the input patterns.
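The difference between the two settings can be sketched on one-dimensional lightness values. In this illustration, the supervised function is given a label for every value and averages per class; the unsupervised function gets no labels and groups the values with a basic k-means loop. Both techniques, the data, and the starting centers are assumptions chosen for the example.

```python
from statistics import mean


def fit_centroids(values, labels):
    """Supervised: a teacher supplies a label for every training value."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(y, []).append(v)
    return {y: mean(vs) for y, vs in groups.items()}


def kmeans_1d(values, centers, iterations=10):
    """Unsupervised: form groupings around the centers without any labels."""
    centers = list(centers)
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Keep the old center if a cluster ends up empty.
        centers = [mean(c) if c else centers[i] for i, c in enumerate(clusters)]
    return centers


# Hypothetical lightness values.
values = [1.0, 2.0, 3.0, 7.0, 8.0, 9.0]
labels = ["salmon", "salmon", "salmon", "sea bass", "sea bass", "sea bass"]

supervised = fit_centroids(values, labels)
unsupervised = kmeans_1d(values, centers=[0.0, 10.0])
```

On this well-separated toy data both approaches find the same two group centers, but only the supervised result knows which center is "salmon" and which is "sea bass".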

Conclusion
The number, complexity, and magnitude of the sub-problems of pattern recognition are formidable.
Many of these sub-problems can indeed be solved.
Many fascinating unsolved problems still remain.