SOME APPROACHES TO THE RECOGNITION OF HANDWRITTEN NUMERALS

Similar documents
Word Segmentation of Off-line Handwritten Documents

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Learning Methods for Fuzzy Systems

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Python Machine Learning

INPE São José dos Campos

Speech Emotion Recognition Using Support Vector Machine

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

An Online Handwriting Recognition System For Turkish

Evolutive Neural Net Fuzzy Filtering: Basic Description

Off-line handwritten Thai name recognition for student identification in an automated assessment system

Circuit Simulators: A Revolutionary E-Learning Platform

Human Emotion Recognition From Speech

Large vocabulary off-line handwriting recognition: A survey

Classification Using ANN: A Review

Reducing Features to Improve Bug Prediction

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Accepted Manuscript. Title: Region Growing Based Segmentation Algorithm for Typewritten, Handwritten Text Recognition

Lecture 1: Machine Learning Basics

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Calibration of Confidence Measures in Speech Recognition

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Artificial Neural Networks written examination

SARDNET: A Self-Organizing Feature Map for Sequences

Disambiguation of Thai Personal Name from Online News Articles

A study of speaker adaptation for DNN-based speech synthesis

Rule Learning With Negation: Issues Regarding Effectiveness

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Test Effort Estimation Using Neural Network

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Mining Association Rules in Student s Assessment Data

Given a directed graph G =(N A), where N is a set of m nodes and A. destination node, implying a direction for ow to follow. Arcs have limitations

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT

A Pipelined Approach for Iterative Software Process Model

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Rule Learning with Negation: Issues Regarding Effectiveness

Knowledge Transfer in Deep Convolutional Neural Nets

Software Maintenance

On-Line Data Analytics

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

CSL465/603 - Machine Learning

A Reinforcement Learning Variant for Control Scheduling

Switchboard Language Model Improvement with Conversational Data from Gigaword

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Artificial Neural Networks

Generative models and adversarial training

Speaker Identification by Comparison of Smart Methods. Abstract

School of Innovative Technologies and Engineering

GACE Computer Science Assessment Test at a Glance

Softprop: Softmax Neural Network Backpropagation Learning

Clouds = Heavy Sidewalk = Wet. davinci V2.1 alpha3

Modeling function word errors in DNN-HMM based LVCSR systems

CS Machine Learning

Issues in the Mining of Heart Failure Datasets

Problems of the Arabic OCR: New Attitudes

Chapter 2 Rule Learning in a Nutshell

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Modeling function word errors in DNN-HMM based LVCSR systems

Guide to the Uniform mark scale (UMS) Uniform marks in A-level and GCSE exams

Australian Journal of Basic and Applied Sciences

Knowledge-Based - Systems

(Sub)Gradient Descent

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

Corrective Feedback and Persistent Learning for Information Extraction

On the Combined Behavior of Autonomous Resource Management Agents

arxiv: v2 [cs.cv] 30 Mar 2017

Soft Computing based Learning for Cognitive Radio

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

Learning From the Past with Experiment Databases

Discriminative Learning of Beam-Search Heuristics for Planning

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach

Lecture 10: Reinforcement Learning

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

AQUA: An Ontology-Driven Question Answering System

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Why Did My Detector Do That?!

Visual CP Representation of Knowledge

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Data Fusion Models in WSNs: Comparison and Analysis

WHEN THERE IS A mismatch between the acoustic

A Case-Based Approach To Imitation Learning in Robotic Agents

The A2iA Multi-lingual Text Recognition System at the second Maurdor Evaluation

Lecture 1: Basic Concepts of Machine Learning

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

An OO Framework for building Intelligence and Learning properties in Software Agents

Kamaldeep Kaur University School of Information Technology GGS Indraprastha University Delhi

Transcription:

SOME APPROACHES TO THE RECOGNITION OF HANDWRITTEN NUMERALS M.Hanmandlu Multimedia University, Jalan Multimedia, 63100,Cyberjaya, Selangor, Malaysia mhmandlu@hotmail.com M.Vamsi Krishna Dept. of Electronics & Communication Engineering, S.S.N.Engineering College, Ongole, A.P., India mvkrsna@hotmail.com P.Chaitanya Krishna Dept. of Electrical & Electronics Engineering, V.S.College of Engineering, Khammam, A.P., India chaitanyakrsna@rediff.com ABSTRACT The present work deals with the recognition of handwritten isolated numerals by utilizing a recent approach, which aims at tackling the variability in the writing styles. A two pronged approach involving pre-classification and the recognition has been followed in this paper. For the pre-classification of numerals, two approaches have been presented. The first is heuristic based and the second is stroke based. A recent feature extraction method, namely sector data method, which takes care of variability in the handwritten numerals, has been incorporated into the system thus the variability involved in the writing styles of different individuals is taken care of by extracting features from the sector based approach. The back propagation neural networks have been used in the recognition process using the features extracted from the sector-based approach. On the basis of recognition rates obtained with samples written by different individuals, it is concluded that the sector based approach is better suited for the recognition of numerals when pre-classification is made on the basis of strokes. 1. Introduction The recognition of handwritten numerals has many applications such as automatic Postal sorting, automatic bank cheque processing etc. The earlier systems known as Optical Character Recognition (OCR) systems that had been developed were confined to recognize only the printed or handwritten numerals of fixed size and fonts. But, the present study aims at producing a system, which could recognize numerals of any arbitrary size, shape and fonts. There are numerous approaches that address the problem and they vary in the features extracted from the graphical representation of the numerals. The system of Sridhar[8] recognizes both the hand printed and machine printed text for postal address interpretation. Fairhurst and Rahman[9] have used a new hybrid approach to recognize handwritten numerals. Ahmadi etal. [10] and Cho[11] have employed a multistage classifier for recognizing totally unconstrained handwritten numerals. Abuhaibiba and Ahmed [13] have generated a set of fuzzy constrained numerals graph

models. These can tolerate a large degree of flexibility by which the structures consisting of straight-line segments and loops (two stroke sequence) are constructed. Rocha and Pavlidis [14] have used shape analysis model for the recognition of numerals using the variation in numeral shapes from the prototypes. Classification of numeral shapes is defined as a minimization problem among the possible transformations that map an input shape to prototypical shapes. In this paper, we have experimented sector based approach [19] for the recognition of numerals. The motivation behind this work is to judge the potential of this approach for feature extraction and then use the neural networks for the recognition after they are trained with the features extracted. The organization of the paper is as follows: Section 2 briefly discusses the preprocessing steps used. The pre classification schemes are presented in section 3. Section 4 deals with the feature extraction and conclusions are in section 5. 2.Pre-processing of characters We have applied normalization and thinning in the pre-processing stage. Other steps such as slant correction, slope correction and pruning are not crucial for the present work though they can be experimented with. Binary image of a numeral Removal of redundant zeros Normaliza tion Slant Correction Thinning Fig 1: Preprocessing of numerals

2.1 Normalization and Thinning Normalization is required as the size of a numeral varies from person to person and even with the same person from time to time. Before thinning, the broken parts of numerals if any are joined. Usually the numeral 5 has broken parts. The broken part is joined to the rest of the body by extending it backwards until it meets the body. This backward direction is the direction opposite to the writing style. This is done in the following way. For this, a boundary scan of the broken part is done until we reach the point where we started. The binary file is scanned again to find any other boundary other than the marked boundary. If there exists some other boundary then it is joined with the marked boundary by finding the angle of the stroke associated with the marked boundary and extending the stroke till it meets the other boundary. Thinning is the process to extract and apply additional constraints on the pixel elements that need to be preserved such that a linear structure of the input image will be recaptured without destroying its connectivity. In the context of image processing, thinning is considered as an iterative procedure which removes points or layers of outline from a pattern until all the lines or curves are of unit width or of single pixel wide. Thus, the reduced pattern is known as the skeleton. A comprehensive survey of thinning methods is given in Lee and Suen [9]. Naccache and Singhal [5] made a study of fourteen thinning algorithms based on iterative erosion of boundary. A standard four-pass thinning procedure proposed by Rosenfield and Kak[12] has been used in this work. This algorithm is a specialized shrinking process which deletes points, whose removal does not locally disconnect their neighborhoods from the numeral matrix at each iteration and this guarantees connectivity property of the numeral. 3. Pre-classification The central issue in numeral recognition is the selection of proper features. In order to decide the identity of the numeral, features, which will exhibit the distinct characteristics of the numerals, are extracted. Ideally, the features should enable the system to discriminate correctly from one class of numerals from other classes.

3.1 Using a heuristic approach The structural features like end points, junction points and number of branches are found to vary from numeral to numeral. So, in this work, two methods are used for preclassification of numerals. In the first method, a preference level, say, zero, is set on the basis of distances extracted from the numerals. The numerals satisfying the preference levels are grouped under one category and the rest the grouped into another category. The proposed method partitions the normalized file into 16 parts with each part forming a matrix of size [5x4.]. From this matrix, for each pixel 1 encountered in the binary file, distance is calculated from the position of this pixel to the left-hand top corner of the matrix. The distances for all such pixels in the matrix are summed up. So, in all a normalized numeral yields 16 features which ma be zero and nonzero. The location of zero values in the positions designated in Table. 1 is used to categorize the numerals as per Table 1(a). For example, if position 7 and 10 contains zeros then the numeral is preclassified as 0. Then the features of numeral 0 extracted using the sector based approach are loaded into the neural network for verification. This pre-classification leads to the first category consisting of numerals 0,1,4,5 and 7. The remaining numerals belong to the second category. The scheme is also rotationally invariant. For other numerals preclassification is not possible and their features are directly used in the neural network for recognition. A few samples of handwritten numerals are shown in Fig.2. 3.2 Using stroke approach Here, numerals are grouped into four categories based on strokes. A stroke is the traversal of numeral from a junction having a odd number of branches or from an end point or from any pixel having 1 [16]. The staring point is chosen in the order of priority., i.e. first junction point, if not, the end point else any pixel containing 1. The traversal is made by finding the neighbor of each pixel and then marking the present pixel, so that, it is not traversed twice. Likewise the procedure of traversing and marking pixel is repeated.

4. The Proposed Sector based approach for Feature Extraction The approach of Hanmandlu et al.[15] for the recognition of handwritten capital letters of English text is adapted to the present numeral recognition. In this, the centroid of black pixels in the numeral array is calculated. On experimentation, the division of numeral into twelve sectors is found to be optimal. Then the total number of 1 s in the normalized numeral is calculated. In each sector, the sum of vector distances of all pixels 1 from the centroid is computed and normalized by dividing with the total number of pixels 1 s present in the numeral. Similarly, the normalized angles are computed for all the sectors. The centroid data is also normalized and is taken as additional two features. In total we have 26 features. This feature database is created for all input numeral sets. 4.1 Configuration of ANN s for different methods of preclassification Once the feature vector is chosen, the number of neurons in the input layer is fixed. The number of neurons in the output is also fixed based on the categorization of numerals. The preclassification using the preference levels suggests the number of output neurons to be employed. For example, if any one of 0,2,4,5,7 is the outcome of preclassification we need only one-output neurons. The number of numerals present in each category guides the structure of the network in the case of stroke sequence. For example, category-1 requires three output nodes, as there are seven numerals in this category. The configurations of ANN s for different preclassification methods are given in Table 3. 4.2 Results As different people write in different ways, we have used handwritten samples of numerals both for the training and testing purposes. Tables 4 & 5 give the number of samples used in the training and testing phases of recognition process and Tables 6 & 7 depict the recognition rates obtained for the two methods of preclassification respectively. The results demonstrate that the preclassification based on stroke method is better than that based on preference level method. It is found that recognition rate is drastically reduced without preclassification of numerals.

Fig. 2 : Samples of Handwritten Numerals 5. Conclusions This work presents a handwritten numeral recognition system. Two approaches are presented for the pre-classification of numerals. A recent feature extraction method, namely sector data method, which takes care of variability in the handwritten numerals, has been incorporated into the system. On the basis of recognition rates obtained with samples written by different individuals, we can conclude that the sector based approach is better suited for the recognition of numerals when preclassification is made on the basis of strokes. In this paper, nature of the stroke is not considered which may result in the improvement in the preclassification of the numerals. The two preclassification techniques can be combined resulting in better recognition rate and better utilization of memory. Then other methods proposed earlier [1,15] like Ring method and Fusion method that is a combination of both ring and sector methods can be used for feature extraction to ascertain further improvement in recognition rates. References 1. Din-Change Tsang, Hung-Pin Chiu, Jen-Cheih Cheng, Invariant Handwritten Chinese Character Recognition using fuzzy ring data, Image and Vision Computing, Vol. 14,pp. 647-657, 1996. 2. Jain, Anil K., Mao, jiang Chang, Artificial Neural Networks: a Training Methodologies: a Tutorial, computer, No 3, pp. 31-44, 1996. 3. Lam l., Lee S. W., Suen Chang Y., Thinning Methodologies: A Comprehensive Survey, IEEE Transactions on PAMI, Vol. 14, No. 9, pp. 869-885, 1992.

4. Lam L., Suen Chang Y., Structural Classification and Relaxation Matching of Totally Unconstrained Hand-written Zip Code Number, Pattern recognition, Vol. 21, No. 1,pp. 19-31, 1998. 5. Naccache N.J., Shinghal R., SPTA-A Proposed Algorithm for Thinning Binary Patterns, IEEE Trans. On Systems, Man and Cybernetics, Vol. 14, No.3, pp.409-418,1984. 6. Srihari, Sargur N., Recognition of Hand Printed and Machine Printed Text for Postal Address interpretation, Pattern Recognition Letters, Vol. 14, No.4, pp. 291-302, 1993. 7. Hwang, Young-Sup and Bang, Sung-Yang, Recognition of Unconstrained handwritten Numerals by a Radial basis function neural network classifier, Patterns Recognition Letters, Vol. 18, pp. 657-664, 1997. 8. Kimura, F. And Sridhar, M., Handwritten Numeral Recognition Based on Multiple Algorithms, Patterns Recognition, Vol. 24, No. 10, pp. 969-983, 1991. 9. Fairhurst, M.C. And Rahman, A. F. R., A New Hybrid Approach in Combining Multiple Experts to Recognize Handwritten Numerals, Pattern Recognition Letters, pp. 781-790, 1997. 10. Cao, J., Ahmadi, M., Shridhar, M., Recognition of handwritten numerals with multiple feature and multistage classifier, Patterns Recognition, 28(2), pp. 153, 1995. 11. Sung-Bae Cho, Neural Network Classifier for Recognizing totally Unconstrained handwritten Numerals, IEEE Transactions on Neural Networks, Vol. 8, No. 1, pp. 43-45, 1997. 12. Cheng, Fang-Hsuan, Wen-Hsing Hsu and Ming-Chuan Kuo, Recognition of Hand printed Chinese characters via stroke relaxation, Pattern Recognition, Vol. 26 No. 4, pp. 579-593, 1993. 13. Abuhaiba, I. S. I. And Ahmed, A Fuzzy Graph Theoretic Approach to Recognize the Totally Unconstrained Handwritten Numerals, Pattern Recognition, Vol. 26 No. 9, pp. 1335-1350, 1993. 14. Rocha, Jairo and T. Pavlidis, A Shape Analysis Model with Applications to a Character Recognition System, IEEE Transactions on Neural Networks, Vol. 16, No. 4, 1994 15. Hanmandlu, M., Murali Mohan, K.R., and Kumar, H., Neural-based Handwritten character recognition, Accepted for presentation in Fifth Int. Conf. On Document Analysis and Recognition ICDAR 99, Sept. 20-22, 1999, Bangalore, India 16. Sukhan Lee and Jack C. Pan, Offline Tracing and Representation of Signatures, IEEE Transactions on systems and Cybernetics, Vol. 22, No. 4, 1992.

APPENDIX Table 1: The position number of mark in the normalized file. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Table 1.1(a): Preference level for specific numerals Numeral Σ dist = 0 In positions 0 7&10 2 5&6 4 4&13 5&7 9 Table 2 : Preference classification of Numerals Category Numeral Number of strokes 1 0,1,2,5,6,7,9 1 2 1,2,7,9 2 3 3,4,8 3 4 8 4 Table 3 : Configuration of ANN`s for Sector data method Preclassification No.of input No. of No. of output Technique nodes hidden notes Nodes Preference level method 26 15 1 or 4 Stroke sequence 26 15 3 for cat. 1,2 for cat.2 Method 2 for cat. 3,1 for cat.4 Table 4 : handwritten samples used in training and testing for Numerals using stroke sequence method Category training Testing Sample Samples Without preference 1240 300 Level Withpreference level `0 120 40 `2 110 35 `4 140 40 `5 & `7 220 69 Other 750 150

Table 5: Handwritten samples used in training and Testing for Numerals using stroke sequence method Category training Testing Sample Samples 1 755 200 2 460 145 3 210 80 4 100 25 Table 6 : Recognition rates for numerals with/ without Preference levels Numerals Sector data Method Without preference Level 84% With preference level `0 90% `2 88% `4 91% `5 & `7 88% Others 86% Table 7: Recognition rates for numerals using stroke Sequence method for preclassification Numerals Sector data Method 1 87% 2 90% 3 94% 4 99%