Pattern Classification and Clustering Spring 2006

Similar documents
Python Machine Learning

(Sub)Gradient Descent

Lecture 1: Machine Learning Basics

Artificial Neural Networks written examination

Learning Methods for Fuzzy Systems

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Axiom 2013 Team Description Paper

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Generative models and adversarial training

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Data Fusion Through Statistical Matching

CSL465/603 - Machine Learning

Issues in the Mining of Heart Failure Datasets

Speech Emotion Recognition Using Support Vector Machine

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

A study of speaker adaptation for DNN-based speech synthesis

Time series prediction

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Modeling function word errors in DNN-HMM based LVCSR systems

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Modeling function word errors in DNN-HMM based LVCSR systems

Softprop: Softmax Neural Network Backpropagation Learning

Human Emotion Recognition From Speech

arxiv: v2 [cs.cv] 30 Mar 2017

Control Tutorials for MATLAB and Simulink

Design Of An Automatic Speaker Recognition System Using MFCC, Vector Quantization And LBG Algorithm

Lecture 1: Basic Concepts of Machine Learning

Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Semi-Supervised Face Detection

BA 130 Introduction to International Business

Speech Recognition at ICSI: Broadcast News and beyond

Probabilistic Latent Semantic Analysis

WHEN THERE IS A mismatch between the acoustic

Calibration of Confidence Measures in Speech Recognition

School of Innovative Technologies and Engineering

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Australian Journal of Basic and Applied Sciences

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

INPE São José dos Campos

INTRODUCTION TO DECISION ANALYSIS (Economics ) Prof. Klaus Nehring Spring Syllabus

Reducing Features to Improve Bug Prediction

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Word Segmentation of Off-line Handwritten Documents

Evolution of Symbolisation in Chimpanzees and Neural Nets

COURSE SYNOPSIS COURSE OBJECTIVES. UNIVERSITI SAINS MALAYSIA School of Management

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

ECO 3101: Intermediate Microeconomics

Learning From the Past with Experiment Databases

CS177 Python Programming

Switchboard Language Model Improvement with Conversational Data from Gigaword

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

arxiv: v1 [cs.lg] 15 Jun 2015

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Laboratorio di Intelligenza Artificiale e Robotica

CS Machine Learning

Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition

STA 225: Introductory Statistics (CT)

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Why Did My Detector Do That?!

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

B. How to write a research paper

Evolutive Neural Net Fuzzy Filtering: Basic Description

Automatic Speaker Recognition: Modelling, Feature Extraction and Effects of Clinical Environment

Multi-Dimensional, Multi-Level, and Multi-Timepoint Item Response Modeling.

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

A survey of multi-view machine learning

Knowledge Transfer in Deep Convolutional Neural Nets

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

SARDNET: A Self-Organizing Feature Map for Sequences

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

An empirical study of learning speed in backpropagation

Speaker recognition using universal background model on YOHO database

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

UMass at TDT Similarity functions 1. BASIC SYSTEM Detection algorithms. set globally and apply to all clusters.

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Speaker Identification by Comparison of Smart Methods. Abstract

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Assignment 1: Predicting Amazon Review Ratings

Rule Learning With Negation: Issues Regarding Effectiveness

Test Effort Estimation Using Neural Network

HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Deep Neural Network Language Models

Evaluating Interactive Visualization of Multidimensional Data Projection with Feature Transformation

Distributed Learning of Multilingual DNN Feature Extractors using GPUs

ATW 202. Business Research Methods

A Case Study: News Classification Based on Term Frequency

Transcription:

Pattern Classification and Clustering Time: Spring 2006 Room: Instructor: Yingen Xiong Office: 621 McBryde Office Hours: Phone: 231-4212 Email: yxiong@cs.vt.edu URL: http://www.cs.vt.edu/~yxiong/pcc/ Detailed Description: The course introduces to classical and modern computational approaches to pattern classification and clustering. Topics covered include some or all of the following: the probability and statistical basis for pattern classification and clustering, Bayesian classification decision theory, density and parameter estimation, dimensionality reduction, nonparametric estimation and classification, linear discriminant functions, feature extraction, parametric and nonparametric clustering algorithms, principal component analysis, and classification using artificial neural networks. Emphasis will be on the applications to digital video and speech analysis and classification, target tracking Course Objectives: Introduce the basic mathematical and statistical techniques commonly used in pattern classification and clustering Provide the students with a variety of pattern classification and clustering algorithms and methods which they can apply to real-world problems. Prerequisites: Basic knowledge of Linear Algebra, Probability and Statistics Some knowledge of signal/image/video/speech processing. Experience with MATLAB and C++ Programming is desirable. Textbook: R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, 2 nd Edition, John Wiley and Sons, New York, 2001 (ISBN 0-471- 05669-3). References: C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 1995. K. Fukunaga, Introduction to Statistical Pattern Recognition, 2nd ed., Academic Pr, 1990. A.R. Webb, Statistical Pattern Recognition, 2 nd Edition, John Wiley and Sons, New York, 2002. R. J. Shalkoff, Pattern Recognition: Statistical, Structural, and Neural Approaches, John Wiley and Sons, 1992

S.M. Kay, Fundamentals of Statistical Signal Processing Estimation Theory, Prentice-Hall, Inc. Englewood Cliffs, NJ, 1993. B. Widrow, S.D. Stearns, Adaptive Signal Processing, Englewood Cliffs, N.J. Prentice- Hall, 1985. Course Outline: Introduction to Pattern Classification and Clustering Objective of Pattern Classification, Model of the pattern classification process, linear decision function, minimum-distance classification, approaches to pattern classification and clustering: statistical, neural and structural. Review of Some Basic Knowledge Probability and statistics: probability theory, conditional probability and Bayes rule, Random vectors, expectation, correlation, covariance. Linear algebra, linear transformations MATLAB Tutorial Review of some tools which need to be used to complete programming assignments. Students are highly encouraged to use MATLAB to implement their assignments and projects. Bayesian Classification Decision Theory Bayesian decision rules, Minimum error-rate classification, discriminant functions and decision boundaries, Bayes classifier for Gaussian patterns, linear and quadratic classifiers. Density and Parameter Estimation Maximum-likelihood estimation, Bayesian estimation Dimensionality Reduction The curse of dimensionality, principal component analysis, linear discriminants analysis. Nonparametric Estimation and Classification Parzen windows, K-nearest-neighbor classification, Non-parametric classification, density estimation, Parzen estimation. Linear Discriminant functions Linear discriminant, Perceptron learning, optimization by gradient descent, Support Vector Machine Clustering Algorithms Maximum-likelihood estimation and unsupervised learning, Mixture of Gaussian, K- means algorithm, hierarchical clustering, component analysis.

Introduction to Classification Using Artificial Neural Networks Single-layer networks, multilayer neural networks, feedforward operation, backpropagation algorithm, learning curves, neural networks classifiers. Grading: The course grade will be the weighted sum of four grades. Grading will be straight scale (90-100 A, 80-89 B, 70-79 C, 60-69 D, below 60 F). Homework: There will be 3-5 homework assignments and will require students to implement some of the algorithms covered during the semester and apply them. Homework assignments must be done individually. No collaboration on homework is allowed. Homework assignments will be done in MATLAB Exam: There will be a midterm exam and a final exam. All tests will be closed-books, closed-notes. The final exam may cover material from the entire course, but will emphasize material not covered on the mid-term. Project: The term project is due at the end of the semester and accounts for 40% of the course grade. Students will choose their own problem topic. Students will write a short proposal for the purpose of approval and feedback. It can be a comprehensive literature review or the implementation of the algorithms covered during the semester. Students are encouraged to propose projects related to their own research. To facilitate the completion of the project in a semester, it is advised that teams of 2-3 students work together. Students are highly encouraged to use MATLAB to implement their projects. Projects will be graded by their content (75%) and the quality of a classroom presentation (25%) at the end of the semester. Homework 30% Project 40% Midterm 10% Final Exam 20%

Course Schedule Week Date Topics Readings Assignments/activities 1 2 Introduction to Pattern Classification and Clustering: Problem, Model, Decision Function, and Approaches DHS Ch.1 Review of Statistics and Probability DHS A.4 Homework#1 assigned Review of Random Vectors, Expectation, Correlation, Covariance Review of Linear Algebra, Linear Transformations MATLAB Tutorial: Tool Box and Programming DHS A4, notes DHS A2 Notes 3 Bayesian Decision Rules, Minimun Error-rate Classification, Discriminant Functions and Decision Boundary DHS Ch. 2 4 5 6 7 8 9 10 11 12 13 14 15 Note: Bayes Classifiers for Gaussian Pattern, Linear and Quadratic Classifiers DHS Ch. 2 Homework#1 due Density and Parameter Estimation: Maximum- Likelihood Estimation DHS Ch.3 Homework#2 assigned Density and Parameter Estimation: Bayesian Estimation DHS Ch. 3 The Curse of Dimensionality, Fisher Linear Discriminant Analysis DHS Ch. 3 Principal Component Analysis DHS Ch. 3 Nonparametric Density Estimation DHS Ch.4 Parzen Window, K-nearest Neighbor Estimation DHS Ch. 4 Homework#2 due Nonparametric Classification, Parzen Estimation DHS Ch. 4 Homework#3 assigned Midterm Midterm Linear Discriminant, Percepton Learning DHS Ch. 5 Optimization by Gradient Descent, Support Vector Machine DHS Ch. 5 Mixture of Gaussian, Maximum-likelihood Estimation and Unsupervised Learning DHS Ch. 10 K-means Algorithm DHS Ch. 10 Homework#3 due Hierarchical Clustering DHS Ch. 10 Term project proposal due Componen Analysis DHS Ch. 10 Single Layer Networks DHS Ch. 6 Multilayer Neural Networks DHS Ch. 6 Neural Networks Classifiers DHS Ch. 6 Parameter Optimization Algorithm II CMB Ch. 7 Parameter Optimization Algorithm I CMB Ch. 7 Project Presentation I Project Presentation II Course Review Final Exam, 2 hours Project presentation I Project presentation II Final Exam 1. DHS--- R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, 2nd Edition, John Wiley and Sons, New York, 2001

2. CMB---C. M. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 1995