M3 - Machine Learning for Computer Vision

Similar documents
OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Python Machine Learning

(Sub)Gradient Descent

Human Emotion Recognition From Speech

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Rule Learning With Negation: Issues Regarding Effectiveness

The Evolution of Random Phenomena

Word Segmentation of Off-line Handwritten Documents

Phonetic- and Speaker-Discriminant Features for Speaker Recognition. Research Project

Lecture 1: Machine Learning Basics

Australian Journal of Basic and Applied Sciences

Math Grade 3 Assessment Anchors and Eligible Content

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

CS Machine Learning

Rule Learning with Negation: Issues Regarding Effectiveness

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Generative models and adversarial training

Assignment 1: Predicting Amazon Review Ratings

Probabilistic Latent Semantic Analysis

arxiv: v2 [cs.cv] 30 Mar 2017

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Hardhatting in a Geo-World

End-of-Module Assessment Task K 2

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Reducing Features to Improve Bug Prediction

TOPICS LEARNING OUTCOMES ACTIVITES ASSESSMENT Numbers and the number system

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Grade 6: Correlated to AGS Basic Math Skills

Mathematics Scoring Guide for Sample Test 2005

Active Learning. Yingyu Liang Computer Sciences 760 Fall

About How Good is Estimation? Assessment Materials Page 1 of 12

First Grade Standards

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Measurement. When Smaller Is Better. Activity:

The stages of event extraction

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Primary National Curriculum Alignment for Wales

Calibration of Confidence Measures in Speech Recognition

WHEN THERE IS A mismatch between the acoustic

Speech Recognition at ICSI: Broadcast News and beyond

Linking Task: Identifying authors and book titles in verbose queries

Interactive Whiteboard

Mathematics Success Level E

Broward County Public Schools G rade 6 FSA Warm-Ups

Detecting English-French Cognates Using Orthographic Edit Distance

Evaluating Interactive Visualization of Multidimensional Data Projection with Feature Transformation

Mathematics process categories

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Multi-Lingual Text Leveling

Missouri Mathematics Grade-Level Expectations

Memory-based grammatical error correction

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Semi-Supervised Face Detection

Alignment of Australian Curriculum Year Levels to the Scope and Sequence of Math-U-See Program

PRIMARY ASSESSMENT GRIDS FOR STAFFORDSHIRE MATHEMATICS GRIDS. Inspiring Futures

WiggleWorks Software Manual PDF0049 (PDF) Houghton Mifflin Harcourt Publishing Company

Answers: Year 4 Textbook 3 Pages 4 10

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Arizona s College and Career Ready Standards Mathematics

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Answer Key For The California Mathematics Standards Grade 1

An investigation of imitation learning algorithms for structured prediction

Similar Triangles. Developed by: M. Fahy, J. O Keeffe, J. Cooper

GCSE. Mathematics A. Mark Scheme for January General Certificate of Secondary Education Unit A503/01: Mathematics C (Foundation Tier)

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Using focal point learning to improve human machine tacit coordination

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Automatic Pronunciation Checker

Modeling function word errors in DNN-HMM based LVCSR systems

Forget catastrophic forgetting: AI that learns after deployment

Learning Methods for Fuzzy Systems

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Finding Translations in Scanned Book Collections

This scope and sequence assumes 160 days for instruction, divided among 15 units.

Switchboard Language Model Improvement with Conversational Data from Gigaword

Large Kindergarten Centers Icons

Unit 3: Lesson 1 Decimals as Equal Divisions

Managing the Student View of the Grade Center

South Carolina College- and Career-Ready Standards for Mathematics. Standards Unpacking Documents Grade 5

GACE Computer Science Assessment Test at a Glance

Disambiguation of Thai Personal Name from Online News Articles

Software Maintenance

Using computational modeling in language acquisition research

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Radius STEM Readiness TM

Semantic Segmentation with Histological Image Data: Cancer Cell vs. Stroma

KS1 Transport Objectives

2 nd grade Task 5 Half and Half

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

Page 1 of 11. Curriculum Map: Grade 4 Math Course: Math 4 Sub-topic: General. Grade(s): None specified

Transcription:

M3 - Machine Learning for Computer Vision Traffic Sign Detection and Recognition Adrià Ciurana Guim Perarnau Pau Riba

Index Correctly crop dataset Bootstrap Dataset generation Extract features Normalization Dimensionality reduction Data pre-processing Sliding window Detection Recognition Sign detection and recognition Get metrics (F1-Score, AUC) Visualize data Evaluation 2

Introduction 3

Motivation Module 1 project segmentation Per window results (669 images): Precision Accuracy Recall F1-Score Time / frame 47.88% 38.25% 65.55% 55.34% 0.73 s 4

Pipeline Image Initial Detector Round sign? Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation Square sign? Triangular sign? New background dataset = False Positive 5

Pipeline Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation 6

Pipeline Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation 7

Pipeline Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation 8

Pipeline Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation? 9

Pipeline Bootstrap Sliding Window Framework Segmentation Detection Recognition Evaluation 10

Dataset 1. http://btsd.ethz.ch/shareddata/ Dataset used: reduced BelgiumTS Dataset 1 (62 classes) Problems found: - Traffic signs in (supposedly) only background images: - Traffic signs not labeled but correctly detected: Assumption: - Do Not Care Object : types of signs that we will ignore (No penalization, No gain). 11

Crop training dataset BelgiumTS Dataset already cropped images: Problem: 1. Cropped images need to have a canonical size. 2. All signs must have the same height (vertical padding).

Crop training dataset Solution: make our own 32x32 crops with 4 vertical padding pixels. Original bounding box 4 pixels Expand BB Resize 32 32 Results: Special case: sign is at image boundary add boundary padding Boundary padding 13

Bootstrap = False Positive Background Images Round sign? New background dataset Square sign? Initial Detector Hard negatives Total Initial 9863 Hard Negatives 11647 Total 21510 Train a new model adding False Positives Triangular sign? 14

Segmentation Original Image Segmentation using YCbCr color space Morphology Possible sign Advantages Speed up SW Reduces False Positives 15

Segmentation However......we miss some signs! 16

Sliding window For each level of GP: Input image Gaussian pyramid Segmentation Integral Image Possible sign region Sliding window of the image and the integral image 17

Data augmentation Idea: Generate more positive samples for each class. Flip samples: Add more positive samples: Flip not desired in some cases: Blur samples: Smooth sudden changes. Gives the shape. Original (3,3) (5,5) (7,7) (9,9) 18

Dataset division for detection First idea: Background vs Signs Problem: Very different kinds of signs. Separation is not easy. Solution: Divide signs according to its shape: Up-triangle Down-triangle Horizontal Vertical Parking Round Stop Diamond rectangle rectangle No-flip No-flip No-flip 19

Detection Window Candidate Simples binaries classifiers Customized thresholds Feature Extraction vs BKGD vs BKGD vs BKGD vs BKGD vs BKGD vs BKGD > th OR > th OR > th OR > th ロ OR > th OR > th YES It is a traffic sign? NO 20

Non maximum suppression Multiple detection: Red: Ground truth Green: Detections Combine detections: Overlap > threshold Keep the best score. - Pascal Vallotton (Pascal) - Pedro Felzenwalb (Pedro) score(a)<score(b) - Technische Universität Darmstadt (TUD) 21

Recognition Class Multiclass: Yes Detections boxes Feature Extraction 14 Classes It is a traffic sign? No Delete from detections + Background (refinement step) 22

Evaluation Train Set Train and test: cropped images Signs are centered Same scale Per Window Train Model Per Image Test: Sliding window Translation Different scale Multiple detections 23

Evaluation - Detection Per window results: FEATURE DIMENSION CLASSIFIER SOLVER DESCRIPTOR REDUCTION DATA NORMALIZATION F1-SCORE HOG (4x4 pxc) No Yes 98.63% Faster! Linear HOG (8x8 pxc) No Yes 97.95% HOG (8x8 pxc) Yes (PCA) Yes 97.30% SVM HOG+ColorHist Yes (PCA) Yes 97.26% Color is not important Slower RBF HOG (8x8 pxc) No Yes 97.66% HOG+LBP No Yes 97.31% HOG Color Multichannel No Yes 96.98% LDA SVD LBP No Yes 96.26% 24

Evaluation - Detection Per image results: Blurring the images is key CLASSIFIER SOLVER FEATURE DESCRIPTOR DIMENSION REDUCTION SEGMENTATIO N BLUR IMAGES F1-SCORE Yes (LDA) Yes Yes 55.17% SVM Linear HOG Yes (LDA) Yes No 44.89% Yes (LDA) No No 24.86% No No No 21.49% CASCADE BOOSTED CLASSIFIE RS - Haar + Adaboost No No No 27.61% LDA and segmentation improve results and speed 25

Evaluation - Recognition MODEL SOLVE R FEATURE DESCRIPTO R DIMENSIO N REDUCTIO N F1-SCORE (PER WINDOW) F1-SCORE (PER IMAGE) SVM ECOC + SVM (ONE VS REST) Linear HOG Yes (LDA) 82.50% 64.68% NEURAL NETWORK - HOG No - 75.68% NN F1-Score Precision Recall Mean 56.22% 52.52% 76.15% Weighted 75.68% 81.05% 73.02% 26

Evaluation - Whole Pipeline Detection Recognition Detection (improved) 27

Video = Ground truth = Estimated sign = Do not care object Note: this video shows the final output of the recognition given the detection, not the detection by itself. 28

Conclusions Color segmentation and parallelization saved us time. LDA improves performance (both speed and results). Tricks learned: Correctly cropping the dataset Bootstrap Data augmentation Low results. M1 M3 F1-Score 55.34% 55.17% 29

30