GURMUKHI CHARACTER RECOGNITION USING NEURO- FUZZY WITH EIGEN FEATURE

Similar documents
OCR for Arabic using SIFT Descriptors With Online Failure Prediction

Word Segmentation of Off-line Handwritten Documents

Guide to Teaching Computer Science

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Human Emotion Recognition From Speech

Knowledge-Based - Systems

Arabic Orthography vs. Arabic OCR

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Python Machine Learning

For information only, correct responses are listed in the chart below. Question Number. Correct Response

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Learning Methods for Fuzzy Systems

Lecture 1: Machine Learning Basics

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

On-Line Data Analytics

Speech Emotion Recognition Using Support Vector Machine

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

Off-line handwritten Thai name recognition for student identification in an automated assessment system

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Modeling function word errors in DNN-HMM based LVCSR systems

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

CHALLENGES FACING DEVELOPMENT OF STRATEGIC PLANS IN PUBLIC SECONDARY SCHOOLS IN MWINGI CENTRAL DISTRICT, KENYA

Circuit Simulators: A Revolutionary E-Learning Platform

Evolutive Neural Net Fuzzy Filtering: Basic Description

Background Information. Instructions. Problem Statement. HOMEWORK INSTRUCTIONS Homework #3 Higher Education Salary Problem

McDonald's Corporation

BENG Simulation Modeling of Biological Systems. BENG 5613 Syllabus: Page 1 of 9. SPECIAL NOTE No. 1:

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Large vocabulary off-line handwriting recognition: A survey

Availability of Grants Largely Offset Tuition Increases for Low-Income Students, U.S. Report Says

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

Australian Journal of Basic and Applied Sciences

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Rule Learning With Negation: Issues Regarding Effectiveness

Perspectives of Information Systems

Data Fusion Models in WSNs: Comparison and Analysis

WHEN THERE IS A mismatch between the acoustic

Rule Learning with Negation: Issues Regarding Effectiveness

A Case Study: News Classification Based on Term Frequency

Problems of the Arabic OCR: New Attitudes

Test Administrator User Guide

Knowledge Transfer in Deep Convolutional Neural Nets

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Calibration of Confidence Measures in Speech Recognition

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Test Effort Estimation Using Neural Network

Training a Neural Network to Answer 8th Grade Science Questions Steven Hewitt, An Ju, Katherine Stasaski

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Kamaldeep Kaur University School of Information Technology GGS Indraprastha University Delhi

Reducing Features to Improve Bug Prediction

Linking Task: Identifying authors and book titles in verbose queries

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Modeling function word errors in DNN-HMM based LVCSR systems

International Journal of Innovative Research and Advanced Studies (IJIRAS) Volume 4 Issue 5, May 2017 ISSN:

Southern Wesleyan University 2017 Winter Graduation Exercises Information for Graduates and Guests (Updated 09/14/2017)

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

DEVM F105 Intermediate Algebra DEVM F105 UY2*2779*

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

Accepted Manuscript. Title: Region Growing Based Segmentation Algorithm for Typewritten, Handwritten Text Recognition

Pragmatic Constraints affecting the Teacher Efficacy in Ethiopia - An Analytical Comparison with India

Lesson Plan Art: Painting Techniques

Kentucky s Standards for Teaching and Learning. Kentucky s Learning Goals and Academic Expectations

Computerized Adaptive Psychological Testing A Personalisation Perspective

Physics 270: Experimental Physics

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

Abstractions and the Brain

School of Basic Biomedical Sciences College of Medicine. M.D./Ph.D PROGRAM ACADEMIC POLICIES AND PROCEDURES

Historical maintenance relevant information roadmap for a self-learning maintenance prediction procedural approach

SARDNET: A Self-Organizing Feature Map for Sequences

Soft Computing based Learning for Cognitive Radio

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Lecture 1: Basic Concepts of Machine Learning

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

CS Machine Learning

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Enduring Understandings: Students will understand that

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

arxiv: v1 [cs.cl] 2 Apr 2017

Improving the Quality of MT Output using Novel Name Entity Translation Scheme

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Learning Methods in Multilingual Speech Recognition

Unit 7 Data analysis and design

CREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT

INPE São José dos Campos

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Math Grade 3 Assessment Anchors and Eligible Content

Speaker Identification by Comparison of Smart Methods. Abstract

Artificial Neural Networks

Classification Using ANN: A Review

Knowledge management styles and performance: a knowledge space model from both theoretical and empirical perspectives

ADDIE MODEL THROUGH THE TASK LEARNING APPROACH IN TEXTILE KNOWLEDGE COURSE IN DRESS-MAKING EDUCATION STUDY PROGRAM OF STATE UNIVERSITY OF MEDAN

Faculty Athletics Committee Annual Report to the Faculty Council September 2014

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

Unit: Human Impact Differentiated (Tiered) Task How Does Human Activity Impact Soil Erosion?

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Automatic Speaker Recognition: Modelling, Feature Extraction and Effects of Clinical Environment

Blank Table Of Contents Template Interactive Notebook

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Transcription:

GAGANDEEP KAUR 1, MEHAK AGGARWAL 2 1 STUDENT IN LLRIET, MOGA 2 LECT. AT LLRIET, MOGA GURMUKHI CHARACTER RECOGNITION USING NEURO- FUZZY WITH EIGEN FEATURE Abstract: Optical character recognition, abbreviated as OCR, is the process of converting the images of handwritten, typewritten or printed text (usually captured by a scanner) into machine editable text or computer process able format, such as ASCII code. Many techniques has been proposed in this field the most of the work done in the field is in the transformation based feature extraction also some of the researcher use the ANN soft computing technique for the classification.in this work Offline character recognition for Gurmukhi lipi has been done using eigen feature. In order to establish correctness 3 different fonts of Gurmukhi lipi has been used. We have done training and testing over multiple samples. In proposed approach a trial has been made to give maximum accuracy in recognition. Keyword: OCR, ASCII,ANN, PCA I. INTRODUCTION recognition is a process, which associates a symbolic meaning with objects (letters, symbols and numbers) drawn on an image, i.e., character recognition techniques associate a symbolic identity with the image of a character. Mainly, character recognitionn machine takes the raw data that further implements the process of preprocessing of any recognition system. On the basis of that data acquisition process, character recognition system can Online Optical Handwritten recognition Printed Recoginitio Offline Optical Magenetic Handwritten recognition Printed be classified into following categories recognition can be categorized into following two parts: - Online Recognition Offline Recognition Off-line handwriting recognition refers to the process of recognizing words that have been scanned from a surface (such as a sheet of paper) and are stored digitally in grey scale format. After being stored, it is conventional to perform further processing to allow superior recognition. In case of DOI : 10.23883/IJRTER.2017.3440.D72RM 136

online handwritten character recognition, the handwriting is captured and stored in digital form via different means. Usually, a special pen is used in conjunction with an electronic surface. Gurumukhi script which is mainly used to write Punjabi language, and this is consist the 35 character, in 35 character 10 vowels, forming 41 consonants including 35 basic character, 10 modifiers, 6 additional modified consonants in figure2, 3 sound modifiers(semi-vowels), 3 half characters, which is lies at the feet of consonants II. LITERATURE REVIEW Madan J. et al, Development of a generic structural feature extraction method for printed Gurumukhi and similar scripts,. The problem is to develop a generalized set of features that can be easily applied on the input script like Gurumukhi or similar scripts. An approach had been devised in which the same feature set identified can be applied on various input scripts, rather than adding a new language each time. In this paper, generalized set of structural features had been developed that can be applied on Gurumukhi and similar input scripts. The results of proposed methodology gives better results on characters The experimental results demonstrate that fonts with reasonably thick stroke and which are made up of clear lines or curves. Rani S. et al, Recognition based classification of Gurmukhi manuscripts,in this paper a recognition based classifier for the Gurmukhi manuscripts into various categories has been proposed. Literature reveals that different degradation styles gives different kind of results. So it becomes difficult to identify such characters. This paper was extremely useful for researchers engaged in recognizing the manuscript documents in any script, because same kinds of problems can be found in manuscript document of any script of the world. Jindal P. et al, Line and Word Segmentation of handwritten text documents written in Gurmukhi Script using midpoint detection technique, In this paper a method for line and word segmentation has been proposed. Proposed work has been implemented on handwritten script. The main aim of this paper was Line and Word Segmentation of text document written in script using mid-point detection technique.. The proposed method was evaluated in terms of accuracy and compared with the existing methods. Two different handwritten scanned images had been taken as input images for line and word segmentation. The results of the proposed method were quite promising. @IJRTER-2017, All Rights Reserved 137

Aggarwal A. et al, Handwritten Gurmukhi character recognition,this paper deals with the offline recognition of handwritten Gurmukhi characters. In this two features of character i.e gradient and curvature of character image were computed. The extracted features were then fused together to form a composite feature vector containing both gradient and curvature information. Dimensionality of the generated composite feature vectors was set to 400. The efficiency of these feature sets was tested on a handwritten database of Gurmukhi characters containing 7000 sample character images. Kaur S. et al, Gurmukhi Printed Recognition using Hierarchical Centroid Method and SVM, proposed the system for the recognition of printed Gurmukhi character. Hierarchical centroid method is has been used for extracting the feature from images of printed characters. The main advantage of using this method was that it had provided different feature vector and could play important role for manuscript recognition. The dataset used in this study consists of 29 different font styles of the printed characters. The classification has been done by using Support Vector Machine. III. PROBLEM FORMULATION Automatic character recognition has been an active research area, which is intended with translating images of characters into a standard encoding scheme representing them. If an image could be converted into editable form, the modification and manipulation of the data for improvement in readability become easier. Many techniques has been proposed in this field the most of the work done in the field is in the transformation based feature extraction also some of the researcher use the ANN soft computing technique for the classification purpose but in the case of the transformation based recognition problem mainly come into work is that the transformed feature value are very less differentiable and cannot be recognizes easily and Also the ANN s problem is that it only classify well in case of the data already given in training module if any out of range data is there it is not efficient enough to recognize it so need to provide a system which will give better feature extraction and more refined classification. Proposed methodology works as described in the flow chart below. @IJRTER-2017, All Rights Reserved 138

IV. RESULTS AND DISCUSSIONS Result obtained after applying the process are as follows. The image given below shows a sample of inputs feed to the proposed system. Fig: Set of input s Figure shown below shows the result of selected character with eigen featuree Fig : Extraction of feature for some characters The figure shown below gives the inference system for the inputs. This is a combined inference system with 125 samples. ANFIS Structure Sum (5) anfis (sugeno) f(u) PCA Feature (5) 125 rules output (125) input3 (5) System anfis: 3 inputs, 1 outputs, 125 rules Fig: Inference system of inputs @IJRTER-2017, All Rights Reserved 139

The graph given below shows the accuracy graph of proposed system 100 90 80 70 60 A c c urac y 50 40 30 20 10 0 1 2 3 System Fig: Accuracy graph of proposed methodology V. CONCLUSION AND FUTURE SCOPE Printed Gurumukhi character recognition using neural networks is discussed here. It has been found that recognition of n Gurumukhi characters is a very difficult task. Following are main reasons for difficulty in recognition of Gurumukhi characters :- Some Gurumukhi characters are similar in shape. s can be written in different fonts. These facts are justified by the work done here. A small set of all Gurumukhi characters using back propagation neural network is trained, then testing was performed on other character set. The accuracy of network was very low. Then, some other character images in the old character set are added and trained the network using new sets. Then again testing was performed on some new image sets written by different people, and it was found that accuracy of the network increases slightly in some cases. Again some new character images into old character set are added (on which network was trained) and trained the network using this new set. The network is presented new character images and it has been seen that recognition increases, although at a slow rate. The result of the last training by 140 character set and testing with the all characters are presented. It can be concluded that as the network is trained with more number of sets, the accuracy of recognition of characters will increase definitely. Over the past three decades, many different methods have been explored by a large number of scientists to recognize characters. A variety of approaches have been proposed and tested by researchers in different parts of the world, including statistical methods, structural and syntactic methods and neural networks. No OCR in this world is 100% accurate till date. The recognition accuracy of the neural networks proposed here can be further improved. The number of character set used for training is reasonably low and the accuracy of the network can be increased by taking more training character sets. This approach of recognition is used for recognition of Gurumukhi characters only. In future work, this can be implemented for recognition of Gurumukhi words. REFERENCES Singh, G., &Sachan, M. (2015, September). Offline Gurmukhi script recognition using knowledge based approach & Multi-Layered Perceptron neural network. In Signal Processing, Computing and Control (ISPCC), 2015 International Conference on (pp. 266-271). IEEE. Grover, S., &Verma, A. (2016, August). Design for emotion detection of punjabi text using hybrid approach. In Inventive Computation Technologies (ICICT), International Conference on (Vol. 2, pp. 1-6). IEEE.. @IJRTER-2017, All Rights Reserved 140

I. Madan, J. S., Sidhu, R., & Sharma, D. V. (2016, March). Development of a generic structural feature extraction method for printed Gurumukhi and similar scripts. In Computing for Sustainable Global Development (INDIACom), 2016 3rd International Conference on (pp. 2285-2290). IEEE. II. Rani, S., &Lehal, G. S. (2016, March). Recognition based classification of Gurmukhi manuscripts. In Colossal Data Analysis and Networking (CDAN), Symposium on (pp. 1-5). IEEE. III. Jindal, P., & Jindal, B. (2015, December). Line and Word Segmentation of handwritten text documents written in Gurmukhi Script using mid point detection technique. In Recent Advances in Engineering & Computational Sciences (RAECS), 2015 2nd International Conference on (pp. 1-6). IEEE. IV. Aggarwal, A., & Singh, K. (2015, September). Handwritten Gurmukhi character recognition. In Computer, Communication and Control (IC4), 2015 International Conference on (pp. 1-5). IEEE. V. Singh, G., &Sachan, M. K. (2015, December). Data capturing process for online Gurmukhi script recognition system. In Computational Intelligence and Computing Research (ICCIC), 2015 IEEE International Conference on (pp. 1-4). IEEE. VI. Aggarwal, A., Singh, K., & Singh, K. (2015). Use of Gradient Technique for Extracting Features from Handwritten Gurmukhi s and Numerals. Procedia Computer Science, 46, 1716-1723. VII. Siddharth, K. S., Jangid, M., Dhir, R., & Rani, R. (2011). Handwritten Gurmukhi Recognition Using Statistical and Background Directional Distribution. International Journal on Computer Science and Engineering (IJCSE), 3(06), 2332-2345. VIII. Kaur, S., & Bhatia, R. (2016). Gurmukhi Printed Recognition using Hierarchical Centroid Method and SVM. International Journal of Computer Applications, 149(3). IX. Jindal, A., Dhir, R., & Rani, R. (2012). Diagonal Features and SVM Classifier for Handwritten Gurumukhi Recognition. International Journal of Advanced Research in Computer Science and Software Engineering, 2(5). X. Aggarwal, A., Singh, K., & Singh, K. (2015). Use of Gradient Technique for Extracting Features from Handwritten Gurmukhi s and Numerals. Procedia Computer Science, 46, 1716-1723. XI. Lehal, G. S., & Singh, C. (2000). A Gurmukhi script recognition system. In Pattern Recognition, 2000. Proceedings. 15th International Conference on(vol. 2, pp. 557-560). IEEE. XII. Kumar, M., Jindal, M. K., & Sharma, R. K. (2011, November). k-nearest neighbor based offline handwritten Gurmukhi character recognition. In Image Information Processing (ICIIP), 2011 International Conference on (pp. 1-4). IEEE. XIII. Mahto, M. K., Bhatia, K., & Sharma, R. K. (2015, March). Combined horizontal and vertical projection feature extraction technique for Gurmukhi handwritten character recognition. In Computer Engineering and Applications (ICACEA), 2015 International Conference on Advances in (pp. 59-65). IEEE. XIV. Singh, S., Aggarwal, A., &Dhir, R. (2012). Use of Gabor Filters for recognition of Handwritten Gurmukhi character. International Journal of Advanced Research in Computer Science and Software Engineering, 2(5). XV. Siddharth, K. S., Dhir, R., & Rani, R. (2011). Comparative Recognition of Handwritten Gurmukhi Numerals Using Different Feature Sets and Classifiers. In Proceedings of International Conference on Image Information Processing (ICIIP 2011).. XVI. Lehal, G. S., & Singh, C. (2000). A Gurmukhi script recognition system. In Pattern Recognition, 2000. Proceedings. 15th International Conference on(vol. 2, pp. 557-560). IEEE.. XVII. Davessar, N. M., Madan, S., & Singh, H. (2003, October). A hybrid approach to character segmentation of Gurmukhi script characters. In Applied Imagery Pattern Recognition Workshop, 2003. Proceedings. 32nd (pp. 169-173). IEEE. XVIII. Lehal, G. S., Singh, C., &Lehal, R. (2001). A shape based post processor for Gurmukhi OCR. In Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on (pp. 1105-1109). IEEE. XIX. Singh, P., &Budhiraja, S. (2012). Handwritten Gurmukhi Recognition Using Wavelet transforms. International Journal of Electronics, Communication & Instrumentation Engineering Research and Development (ljecierd), 2(3), 27-37. XX. Kumar, N., & Gupta, S. (2016). Offline Handwritten Gurmukhi Recognition: A Review. International Journal of Software Engineering and Its Applications, 10(5), 77-86. XXI. Kumar, M., Jindal, M. K., & Sharma, R. K. (2011, November). k-nearest neighbor based offline handwritten Gurmukhi character recognition. In Image Information Processing (ICIIP), 2011 International Conference on (pp. 1-4). IEEE. XXII. Sharma, D., &Jhajj, P. (2010). Recognition of isolated handwritten characters in Gurmukhi script. International Journal of Computer Applications, 4(8), 9-17. XXIII. Singh, S., Aggarwal, A., &Dhir, R. (2012). Use of Gabor Filters for recognition of Handwritten Gurmukhi character. International Journal of Advanced Research in Computer Science and Software Engineering, 2(5).Vol. 2, No. 5, Pp. 234-240, May 2012 @IJRTER-2017, All Rights Reserved 141