An Ocr System For Printed Nasta liq Script: A Segmentation Based Approach

Size: px
Start display at page:

Download "An Ocr System For Printed Nasta liq Script: A Segmentation Based Approach"

Transcription

1 An Ocr System For Printed Nasta liq Script: A Segmentation Based Approach Saeeda Naz, Arif Iqbal Umar, Saad Bin Ahmed,, Syed Hamad Shirazi, M. Imran Razzak,, Imran Siddiqi Department Of Information Technology, Hazara University, Mansehra, Pakistan Higher Education Department, KPK, Pakistan King Saud Bin Abdul Aziz University for Health Sciences, Riyadh, Saudi Arabia Department of Computer Science, Bahria University Islamabad, Pakistan { saeedanaz292, isaadahmed, mirpak, syedhamad, imransidiqqi}@gmail.com, arifiqbalumar@gmailcom Abstract Machine simulation of human reading has been a subject of intensive research for almost four decades. Automatic Urdu character recognition remains a challenging task due to its cursive nature despite the fact that the latest improvements in recognition methods and systems for Latin script are very promising. This work introduces a robust approach based on statistical models that provide solution for recognition of Urdu text Nasta liq style. Contrary to classical approaches which segment text into words, ligatures or characters, we intend to employ an implicit segmentation where text lines are recognized during segmentation. The developed system will be evaluated on standard Urdu text databases and compared with the state-ofthe-art recognition techniques proposed till date. I. INTRODUCTION Most people learn to read and write during their first few years of education. By the time they have grown out of childhood, they have already acquired very good reading and writing skills including the ability to read most of the texts either handwritten or printed, written in different fonts and styles. Even majority of people have no problems in reading light prints or heavy prints; upside down prints; advertisements in fancy font styles, calligraphic text; characters with flowery ornaments and missing parts. On the contrary, despite more than four decades of intensive research, the reading skill of the computer is still way behind that of human. In the recent years, there has been an unending demand for cursive/non-cursive Optical Character Recognition (OCR) systems, not only to facilitate the native speakers to readily use these OCRs for their mobile or tablet requirements, but also for the digitization of a large amount of legacy documents, such as holy books, magazines, newspapers, poetry books, and handwritten documents. Although a computationally intensive field, OCR has witnessed a significant improvement over the years. This is mainly due to the tremendous advances in the computational intelligence algorithms. The objective of character recognition is to imitate the human reading ability, with the human accuracy but with far higher speed. The target performance is at least five characters per second with a 99.9% recognition rate [1]. OCR is the most important component of various applications, such as document automation, verification of cheques, data entry applications, development of reading machines for visually handicapped and a large variety of many other banking and business applications. OCR is an active area of research and its importance is well established par rapport the disciplines of digital image processing, pattern recognition, artificial intelligence, database systems, natural language processing, human-machine interaction, and communications. These applications can perform well, if the characters from text images are classified and recognized accurately. Most of the commercial OCR applications are concerned with the machine printed Latin scripts having well-separated characters. Moreover, the OCR systems for printed Japanese and Chinese languages are also quite mature. The languages such as Arabic, Persian, Urdu, and Pashto are derived from the Arabic script and are read, written and spoken by a considerable proportion of population in the world. There are many font styles of cursive script, Nasta'liq, Kofi, Thuluth, Diwani, Riq'a, and Naskh to name a few. Among the aforementioned font styles, Naskh and Nasta'liq are the most important to mention wherein the former is preferred for Arabic, Persian, and Pashto languages and the latter is adopted for Urdu typesetting. Some commercial OCRs are available for printed Arabic characters but they have many technical problems, especially in the segmentation stage where the results are not enviable. For all practical purposes, the Urdu script is the superset of its Arabic and Persian counterparts. Recognition of printed Arabic text has received considerable research attention whereas surveys on recognition of Urdu text [2-5] reveal that very limited research efforts have been carried out towards the development of an Urdu OCR. This may be due to the complexities involved Nasta liq writing style [6]. Although Urdu and Arabic share many common attributes, the techniques developed for recognition of Arabic text cannot be directly applied to Urdu text due to complexity of writing style Nastaliq as compare to the Naskh writing style for Arabic. Challenges in recognition of Nasta liq Urdu text include diagonality, multiple baselines, high cursiveness and context sensitivity. Most of the studies on recognition of Urdu text use ligatures as the basic unit of recognition [31-32, 40-43]. The total number of unique Urdu ligatures is approximately 22,000 [39] and training classifiers to learn to discriminate such a large number of classes is a challenging task. Many studies use only a small subset of ligatures representing the frequently used ligatures. Among one of the very well-known ligature based approaches, the study presented by Javed and Hussain [10] on Offline Urdu OCR is evaluated on 1500 ligatures from a set of 5,000 frequently occurring ligatures comprising one to eight characters. HMM HTK toolkit is trained on these ligatures using DCT features ISBN: /14/$ IEEE 255

2 and a recognition rate of 93% is realized. The number of recognized ligatures (approximately 1500) is very less as compared to the total number of unique Urdu ligatures (approximately 22,000). Moreover, ligature based approaches are limited in the sense that new ligatures on which system is not trained cannot be recognized. To overcome the problem of a large number of classes in ligature based approaches, one of the solutions is to segment the ligatures in to characters and train classifiers to recognize the characters. This reduces the number of classes from total number of unique ligatures to total number of characters and their different shapes. The segmentation of ligatures into characters, however, itself is a challenging and error prone task [16]. To overcome these issues with ligature based and segmentation based recognition, a new trend is to employ implicit segmentation techniques where the text is recognized during segmentation phase itself. Moreover, the limited work on Urdu OCR reported in the literature has mostly been evaluated on non-standard datasets where the researchers would generate their own datasets for evaluation of the proposed techniques. This makes an objective comparison of different methods a challenging task. The main goal of the proposed research is to develop an implicit segmentation based Optical Character Recognition system for printed Urdu text written in Nasta liq font. The paper is organized as: section II is presented the related work in the field of OCR, its motivation and research problem. In section III, we have discussed the general steps involved in the development of an OCR along with discussion of the notable contributions to recognition of Urdu text followed by a discussion on our intended methodology. This section also present the dataset and the evaluation metric we plan to work with. Finally, we conclude the paper with some remarks and our future plan of study. II. RELATED WORK Character recognition techniques associate a Unicode with the image of a character. Based on the mode of input, OCR is classified as offline and online as illustrated in Fig. 1. [7, 8]. The offline OCR deals with the digitized images of text such as handwritten or machine printed. The digital image of text could be obtained from an optical scanner or a camera. In contrast, in the online OCR, the input text is written directly using a tablet, a PDA, or a stylus. The online character recognition is probably easier than its offline counterpart as more information is available, such as time information, stroke coordinates, and handwriting style of the user. A typical OCR system mainly comprises a combination of the following modules. Image acquisition Preprocessing Segmentation o o Feature extraction Segmentation free/holistic approach Segmentation based/ analytical approach Explicit Segmentation Implicit Segmentation o Structural Features o Statistical Features Recognition/classification Post-processing Fig. 1. Types of OCR The images of printed or handwritten documents are acquired using a scanner, camera or a digitizing tablet and are pre-processed before they could be fed to the subsequent modules. Pre-processing typically involves binarization, skew and slant detection and correction, noise removal and segmentation of text and non-text objects [9-14]. Depending upon the type of approach the segmentation step involves splitting the text into lines, words, ligatures, characters or strokes. This step is more crucial in cursive scripts like Arabic, Urdu, Persian, Pashto, Sindhi, Malay (Jawi), Uigher etc. As discussed earlier, the recognition techniques rely on one of segmentation-based or segmentation-free approaches [15-16]. In segmentation-free or holistic approaches, the system seeks to recognize the ligature or word as a whole without segmenting it further into characters or sub images. Generally, paragraphs in text are split into lines using horizontal projection or heuristics based methods. Text lines are then split into words or sub-words (ligatures) using vertical projections and connected component labeling etc. [17]. In segmentationbased or analytical approaches, ligatures are segmented into characters or strokes explicitly or implicitly. Segmentation-based approaches are further categorized into explicit and implicit segmentation. In explicit segmentation, the words or ligature are divided into characters or strokes [16]. Incorrect segmentation leads to misclassifications and results in reduced recognition rates. Correct segmentation of ligatures is in fact the major challenge in explicit segmentation based approaches [18-20]. In the implicit segmentation, words or ligatures are segmented into smaller units while being recognized without any accurate splitting path. Implicit segmentation is also termed as straight or recognition based segmentation. These methods scan the text images line by line from right to left and segments words into characters during/after recognition using 256

3 codebook entries or predefined classes or a set of features [20-24]. These approaches have been effective on highly cursive scripts and can also be employed in the development of a multilingual OCR. Segmentation is followed by the feature extraction step and a wide variety of statistical as well as structural features have been investigated in the literature. Structural features are typically computed by finding the extreme points and joining points [25] or considering the number of dots, position of the dots, presence of branches, loops or secondary strokes and the slope between the initial point and the final point [26, 32]. Statistical features, for which rich classifiers are available, are mostly preferred over structural features and a large number of techniques rely on statistical features including shape descriptors, contour based statistics, edge based features and other statistical measurements computed at word, ligature or character levels [26 30]. For recognition, a number of classifiers including hidden Markov models (HMM) [16, 24, 33, 34], artificial neural networks (ANN) [20, 25, 31, 36], support vector machine (SVM), nearest neighbor classifier (NN) or template matching [32 33] and decision tree classifier [37] have been extensively used. In some cases, the classification step is followed by postprocessing [9, 10] to improve the overall recognition accuracy of the system. After having discussed the general steps involved in an OCR system, we present the proposed solution in the next section. III. PROPOSED SOLUTION Our study is aimed at developing a robust optical character recognition system based on implicit segmentation. The main steps involved in our work are likely to include the following. Acquisition of printed Urdu text from UPTI database employed in our study. Extraction and selection of features which provide the best recognition rates for implicit segmentation based Urdu OCR for printed text. Recognition using state-of-the-art classifiers like recurrent neural network, hidden markov model, classifiers based on fuzzy logic or conditional random fields (CRF). A. Overview of Proposed System We intend to work on scanned images of text from UPTI dataset. The pre-processing in our case will comprise the traditional steps of de-noising, skew detection and correction and binarization. The text page will be segmented into lines using horizontal projection profiles complemented with some heuristics. We intend to employ implicit segmentation and use a set of statistical features. Features like projection and profiles, chain codes and zone based statistical measures etc. can be explored. Classification can be carried out using neural networks or hidden Markov models while a language model can also be integrated with the system to improve the overall recognition rates through dictionary validation. An overview of the intended methodology is presented in Fig. 2. The system is planned to be developed in MATLAB/Python on Windows platform. Our some efforts are reported in [44, 45]. Fig. 2. Proposed System B. Dataset Most of the existing Urdu OCR systems have been evaluated on custom developed databases. This makes a quantitative comparison of different methods a difficult task. The Image Understanding and Pattern Recognition Group (IUPR) at Technical University of Kaiserslautern, Germany, generated synthetic data of Urdu Nasta liq text from leading Urdu newspapers of Pakistan and termed it as UPTI dataset. We plan to evaluate our system on his standard dataset. C. Measurement Matric The developed recognition system is planned to be evaluated using graph edit distance. The character level accuracy will computed using: insertions + substituti on + deletions accuracy = totallengt hoftestset transcript ion IV. CONCLUSION This paper proposed an OCR system for printed Urdu text in Nasta liq script based on implicit segmentation. A set of statistical features will be extracted and fed to the classifier for recognition. The developed technique will also be evaluated on the standard UPTI database and will be compared with existing state-of-the-art Urdu OCRs. REFERENCES [1] V. Govindan and A. Shivaprasad, Character Recognition-A Review, Pattern Recognition, vol. 23, no. 7, pp , [2] S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, S.A. Madani and S.U. Khan, The optical character recognition of Urdu-like cursive scripts, Pattern Recognition, vol. 47, no. 3, pp ,

4 [3] S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, and H. Akbar, Arabic script based character segmentation: A review, In Computer and Information Technology (WCCIT), World Congress on, pp [4] S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, and S.Z. Khan, "Challenges in Baseline Detection of Arabic Script Based Languages." Springer International Publishing in Intelligent Systems for Science and Information, pp [5] S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, and H. Akbar, Challenges in baseline detection of cursive script languages, In Science and Information Conference (SAI), pp , [6] S. Naz, K. Hayat, M.I. Razzak, M.W. Anwar, and H. Akbar, Arabic script based language character recognition: Nasta'liq vs Naskh analysis, In Computer and Information Technology (WCCIT), World Congress on (pp. 1-7) [7] B. Al-Badr and S. A. Mahmoud, Survey and Bibliography of Arabic Optical Text Recognition, Signal Processing, vol. 41, no. 1, pp , [8] L.M. Lorigo and V. Govindaraju, Online Arabic Handwriting Recognition: A Survey, IEEE Trans. Pattern Analysis and Machine Intelligence, pp.8, no. 5, pp , [9] M. Naz Q. U. A. Akram and S. Hussain, Binarization and its Evaluation for Urdu Nastalique Document Images, Center for Language Engineering, Al-Khawarizmi Institute of Computer Science, Pakistan [10] F. Shafait, D. Keysers, and T. M. Breuel, Layout analysis of Urdu document images, In Multitopic Conference, pp , [11] R.J. Ramteke and I. K. Pathan. Noise Reduction in Urdu Document Image Spatial and Frequency Domain Approaches. In Proc. 4th International Conference on Signal and Image Processing 2012 (ICSIP'12), volume 222 of Lecture Notes in Electrical Engineering, pp Springer India, [12] D.S. Le, G. R. Thoma, and H. Wechsler. Automated Page Orientation and Skew Angle Detection for Binary Document Images. Pattern Recognition, vol. 27, no. 10: , [13] R.J. Ramteke, K. P. Imran, and S. C. Mehrotra. Skew Angle Estimation of Urdu Document Images: A Moments based Approach. International Journal of Machine Learning and Computing, vol.1, no. 1, pp. 7-12, [14] S. F. Rashid, S. S. Bukhari, F. Shafait, and T. M. Breuel. A Discriminative Learning Approach for Orientation Detection of Urdu Document Images. In Proc. 13th International Multitopic IEEE Conference (INMIC'09), pp. 1-5, [15] S.T. Javed and S. Hussain, Improving Nastalique Specific Pre- Recognition Process for Urdu OCR, In Proc. 13th International Multitopic IEEE Conference (INMIC'09), pp.1-6, [16] S. T. Javed, S. Hussain, A. Maqbool, S. Asloob, S. Jamil, and H. Moin, Segmentation Free Nastalique Urdu OCR, World Academy of Science, Engineering and Technology, vol 46, pp , [17] B. Al-Badr and S. A. Mahmoud, Survey and Bibliography of Arabic Optical Text Recognition, Signal Processing, vol. 41, no. 1, pp , [18] A.M. Zeki, The Segmentation Problem in Arabic Character Recognition The State of the Art, In Proc. 1st International Conference on Information and Communication Technologies (ICICT'05), pp , [19] Y.M. Alginahi, A survey on Arabic character segmentation, International Journal on Document Analysis and Recognition, pp. 1-22, [20] Z. Ahmad, J. K. Orakzai, and I. Shamsher, Urdu Compound Character Recognition using Feed Forward Neural Networks, In Proc. 2nd International Conference on Computer Science and Information Technology (ICCSIT'09), pp , [21] A. Ul-Hasan and S. B. Ahmed and F. Rashid and F. Shafait and T. M. Breuel, Offline Printed Urdu Nastaleeq Script Recognition with Bidirectional LSTM Networks, 12th International Conference on Document Analysis and Recognition (ICDAR'13), pp , [22] A. Graves and M. Liwicki and S. Fern and R. Bertolami and H. Bunke, and J. Schmidhuber, A Novel Connectionist System for Unconstrained Handwriting Recognition, IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, [23] M.S. Khorsheed, Recognising handwritten Arabic manuscripts using a single hidden Markov model, Pattern Recognition Letter, vol 24, pp , [24] M.S. Khorsheed, Offline recognition of omnifont Arabic text using the HMM ToolKit (HTK), Pattern Recognition Letter, vol 28, pp , [25] I. Shamsher, Z. Ahmad, J. K. Orakzai, and A. Adnan, OCR for Printed Urdu Script using Feed Forward Neural Network, Proc. World Academy of Science, Engineering and Technology, vol. 23, pp , [26] R.G. Casey and G. Nagy, Recursive Segmentation and Classification of Composite Character Patterns, In Proc. 6th International Conference on Pattern Recognition, vol. 2, pp , [27] F. Hussain and J. Cowell, Extracting Features from Arabic Characters, In Proc. 2nd International Conference on Computer Graphics and Imaging (CGIM'01), pp , [28] J. Cowell and F. Hussain, A Fast Recognition System for Isolated Arabic Characters, In Proc. 6th International Conference on Information Visualisation, pp , London, UK, [29] A. Muaz, Urdu Optical Character Recognition System, Master's thesis, National University of Computer & Emerging Sciences Lahore, Pakistan, [30] S.A. Hussain, S. Zaman, and M. Ayub, A Self Organizing Map based Urdu Nasakh Character Recognition, In Proc. International Conference on Emerging Technologies (ICET'09), pp , [31] D.B. Megherbi, S. M. Lodhi, and A. J. Boulenouar, Fuzzy-Logic- Model-based Technique with Application to Urdu Character Recognition, Proc. SPIE Applications of Artificial Neural Networks in Image Processing, vol. 3962, pp , [32] Z.A. Shah, Ligature based Optical Character Recognition of Urdu- Nastaleeq Font, In Proc. 6th International Multitopic IEEE Conference (INMIC'02), [33] S.A. Husain, A Multi-Tier Holistic Approach for Urdu Nastaliq Recognition, In Proc. 6th International Multitopic IEEE Conference (INMIC'02), pp , [34] M. Decerbo, E. MacRostie, and P. Natarajan, The BBN Byblos Pashto OCR system, In Proc. 1st ACM Workshop on Hardcopy Document Processing (HDP '04), pp , [35] R. Safabakhsh and P. Adibi, Nastaaligh Handwritten Word Recognition Using a Continuous-Density Variable-Duration HMM, The Arabian Journal for Science and Engineering, vol. 30, no. 1B, , [36] S.N. Nawaz, M. Sarfraz, A. Zidouri, and W. G. Al-Khatib, An Approach to Online Arabic Character Recognition using Neural Networks, In Proc. 10th International Conference on Electronics, Circuits and Systems (ICECS'03), vol 3, pp , [37] U. Pal and A. Sarkar, Recognition of Printed Urdu Script, In Proc. Seventh International Conference on Document Analysis and Recognition (ICDAR 2003), pp , [38] D.S. Guru, S. K. Ahmed, and K. Irfan, An Attempt towards Recognition of Handwritten Urdu Characters: A Decision Tree Approach, In Proc. National Conference on Computers and Information Technology (NCCIT'01), pp , [39] A.M. Jamil, Noori Nastaliq Revolution in Urdu composing, book, ELITE PUBLISHERS LTD for NOORI NASTALIQ FOUNDATION, [40] S.A. Sattar, A Technique for the Design and Implementation of an OCR for Printed Nastalique Text, PhD thesis, NED University of Engineering & Technology, Karachi, Pakistan, [41] U. Iftikhar, Recognition of Urdu Ligatures, Master's thesis, VIBOT Consortium and German Research Center for Arti_cial Intelligence (DFKI), [42] N. Sabbour, N. and F. Shafait, A segmentation-free approach to arabic and Urdu OCR. InIS&T/SPIE Electronic Imaging, pp N-86580, International Society for Optics and Photonics,

5 [43] G.S. Lehal and A. Rana. Recognition of Nastalique Urdu Ligatures. In Proceedings of the 4th International Workshop on Multilingual OCR. ACM, 2013 [44] S.B. Ahmed, S. Naz, Salahuddin, M.I. Razzak, A.A. Khan, A.I. Umar, UCOM Offline Dataset a Urdu Handwritten Dataset Generation, accepted in IAJIT, unpublished. [45] S.B. Ahmed, S. Naz, Salahuddin, M.I. Razzak, A.I. Umar, Handwritten Urdu Character Recognition using Recurrent Neural Networks:, Accepted in Neural Computing and Application, unpublished. 259

OCR for Arabic using SIFT Descriptors With Online Failure Prediction

OCR for Arabic using SIFT Descriptors With Online Failure Prediction OCR for Arabic using SIFT Descriptors With Online Failure Prediction Andrey Stolyarenko, Nachum Dershowitz The Blavatnik School of Computer Science Tel Aviv University Tel Aviv, Israel Email: stloyare@tau.ac.il,

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

Arabic Orthography vs. Arabic OCR

Arabic Orthography vs. Arabic OCR Arabic Orthography vs. Arabic OCR Rich Heritage Challenging A Much Needed Technology Mohamed Attia Having consistently been spoken since more than 2000 years and on, Arabic is doubtlessly the oldest among

More information

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System QuickStroke: An Incremental On-line Chinese Handwriting Recognition System Nada P. Matić John C. Platt Λ Tony Wang y Synaptics, Inc. 2381 Bering Drive San Jose, CA 95131, USA Abstract This paper presents

More information

Problems of the Arabic OCR: New Attitudes

Problems of the Arabic OCR: New Attitudes Problems of the Arabic OCR: New Attitudes Prof. O.Redkin, Dr. O.Bernikova Department of Asian and African Studies, St. Petersburg State University, St Petersburg, Russia Abstract - This paper reviews existing

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

Accepted Manuscript. Title: Region Growing Based Segmentation Algorithm for Typewritten, Handwritten Text Recognition

Accepted Manuscript. Title: Region Growing Based Segmentation Algorithm for Typewritten, Handwritten Text Recognition Title: Region Growing Based Segmentation Algorithm for Typewritten, Handwritten Text Recognition Authors: Khalid Saeed, Majida Albakoor PII: S1568-4946(08)00114-2 DOI: doi:10.1016/j.asoc.2008.08.006 Reference:

More information

Large vocabulary off-line handwriting recognition: A survey

Large vocabulary off-line handwriting recognition: A survey Pattern Anal Applic (2003) 6: 97 121 DOI 10.1007/s10044-002-0169-3 ORIGINAL ARTICLE A. L. Koerich, R. Sabourin, C. Y. Suen Large vocabulary off-line handwriting recognition: A survey Received: 24/09/01

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

An Online Handwriting Recognition System For Turkish

An Online Handwriting Recognition System For Turkish An Online Handwriting Recognition System For Turkish Esra Vural, Hakan Erdogan, Kemal Oflazer, Berrin Yanikoglu Sabanci University, Tuzla, Istanbul, Turkey 34956 ABSTRACT Despite recent developments in

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Rule Learning with Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX

More information

Off-line handwritten Thai name recognition for student identification in an automated assessment system

Off-line handwritten Thai name recognition for student identification in an automated assessment system Griffith Research Online https://research-repository.griffith.edu.au Off-line handwritten Thai name recognition for student identification in an automated assessment system Author Suwanwiwat, Hemmaphan,

More information

Data Fusion Models in WSNs: Comparison and Analysis

Data Fusion Models in WSNs: Comparison and Analysis Proceedings of 2014 Zone 1 Conference of the American Society for Engineering Education (ASEE Zone 1) Data Fusion s in WSNs: Comparison and Analysis Marwah M Almasri, and Khaled M Elleithy, Senior Member,

More information

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. IV (Nov Dec. 2015), PP 01-07 www.iosrjournals.org Longest Common Subsequence: A Method for

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 11/2007, ISSN 1642-6037 Marek WIŚNIEWSKI *, Wiesława KUNISZYK-JÓŹKOWIAK *, Elżbieta SMOŁKA *, Waldemar SUSZYŃSKI * HMM, recognition, speech, disorders

More information

Circuit Simulators: A Revolutionary E-Learning Platform

Circuit Simulators: A Revolutionary E-Learning Platform Circuit Simulators: A Revolutionary E-Learning Platform Mahi Itagi Padre Conceicao College of Engineering, Verna, Goa, India. itagimahi@gmail.com Akhil Deshpande Gogte Institute of Technology, Udyambag,

More information

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS Md. Tarek Habib 1, Rahat Hossain Faisal 2, M. Rokonuzzaman 3, Farruk Ahmed 4 1 Department of Computer Science and Engineering, Prime University,

More information

The A2iA Multi-lingual Text Recognition System at the second Maurdor Evaluation

The A2iA Multi-lingual Text Recognition System at the second Maurdor Evaluation 2014 14th International Conference on Frontiers in Handwriting Recognition The A2iA Multi-lingual Text Recognition System at the second Maurdor Evaluation Bastien Moysset,Théodore Bluche, Maxime Knibbe,

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

University of Groningen. Systemen, planning, netwerken Bosman, Aart

University of Groningen. Systemen, planning, netwerken Bosman, Aart University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines

Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Speech Segmentation Using Probabilistic Phonetic Feature Hierarchy and Support Vector Machines Amit Juneja and Carol Espy-Wilson Department of Electrical and Computer Engineering University of Maryland,

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

Mining Association Rules in Student s Assessment Data

Mining Association Rules in Student s Assessment Data www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama

More information

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING Yong Sun, a * Colin Fidge b and Lin Ma a a CRC for Integrated Engineering Asset Management, School of Engineering Systems, Queensland

More information

Knowledge Transfer in Deep Convolutional Neural Nets

Knowledge Transfer in Deep Convolutional Neural Nets Knowledge Transfer in Deep Convolutional Neural Nets Steven Gutstein, Olac Fuentes and Eric Freudenthal Computer Science Department University of Texas at El Paso El Paso, Texas, 79968, U.S.A. Abstract

More information

A study of speaker adaptation for DNN-based speech synthesis

A study of speaker adaptation for DNN-based speech synthesis A study of speaker adaptation for DNN-based speech synthesis Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King The Centre for Speech Technology Research (CSTR) University of Edinburgh,

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

Seminar - Organic Computing

Seminar - Organic Computing Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts

More information

Standards for Members of the American Handwriting Analysis Foundation

Standards for Members of the American Handwriting Analysis Foundation Standards for Members of the American Handwriting Analysis Foundation A. Purpose The purpose of this document is to provide a foundation for the development and evaluation of a set of standards for education,

More information

SARDNET: A Self-Organizing Feature Map for Sequences

SARDNET: A Self-Organizing Feature Map for Sequences SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu

More information

GACE Computer Science Assessment Test at a Glance

GACE Computer Science Assessment Test at a Glance GACE Computer Science Assessment Test at a Glance Updated May 2017 See the GACE Computer Science Assessment Study Companion for practice questions and preparation resources. Assessment Name Computer Science

More information

Lecture 1: Basic Concepts of Machine Learning

Lecture 1: Basic Concepts of Machine Learning Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010

More information

INPE São José dos Campos

INPE São José dos Campos INPE-5479 PRE/1778 MONLINEAR ASPECTS OF DATA INTEGRATION FOR LAND COVER CLASSIFICATION IN A NEDRAL NETWORK ENVIRONNENT Maria Suelena S. Barros Valter Rodrigues INPE São José dos Campos 1993 SECRETARIA

More information

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 98 (2016 ) 368 373 The 6th International Conference on Current and Future Trends of Information and Communication Technologies

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department

More information

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation SLSP-2016 October 11-12 Natalia Tomashenko 1,2,3 natalia.tomashenko@univ-lemans.fr Yuri Khokhlov 3 khokhlov@speechpro.com Yannick

More information

Reducing Features to Improve Bug Prediction

Reducing Features to Improve Bug Prediction Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science

More information

Multimedia Application Effective Support of Education

Multimedia Application Effective Support of Education Multimedia Application Effective Support of Education Eva Milková Faculty of Science, University od Hradec Králové, Hradec Králové, Czech Republic eva.mikova@uhk.cz Abstract Multimedia applications have

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

USER ADAPTATION IN E-LEARNING ENVIRONMENTS USER ADAPTATION IN E-LEARNING ENVIRONMENTS Paraskevi Tzouveli Image, Video and Multimedia Systems Laboratory School of Electrical and Computer Engineering National Technical University of Athens tpar@image.

More information

Timeline. Recommendations

Timeline. Recommendations Introduction Advanced Placement Course Credit Alignment Recommendations In 2007, the State of Ohio Legislature passed legislation mandating the Board of Regents to recommend and the Chancellor to adopt

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

Operational Knowledge Management: a way to manage competence

Operational Knowledge Management: a way to manage competence Operational Knowledge Management: a way to manage competence Giulio Valente Dipartimento di Informatica Universita di Torino Torino (ITALY) e-mail: valenteg@di.unito.it Alessandro Rigallo Telecom Italia

More information

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier IOSR Journal of Electronics and Communication Engineering (IOSR-JECE) e-issn: 2278-2834,p- ISSN: 2278-8735.Volume 10, Issue 2, Ver.1 (Mar - Apr.2015), PP 55-61 www.iosrjournals.org Analysis of Emotion

More information

Curriculum Vitae FARES FRAIJ, Ph.D. Lecturer

Curriculum Vitae FARES FRAIJ, Ph.D. Lecturer Current Address Curriculum Vitae FARES FRAIJ, Ph.D. Lecturer Department of Computer Science University of Texas at Austin 2317 Speedway, Stop D9500 Austin, Texas 78712-1757 Education 2005 Doctor of Philosophy,

More information

Evolutive Neural Net Fuzzy Filtering: Basic Description

Evolutive Neural Net Fuzzy Filtering: Basic Description Journal of Intelligent Learning Systems and Applications, 2010, 2: 12-18 doi:10.4236/jilsa.2010.21002 Published Online February 2010 (http://www.scirp.org/journal/jilsa) Evolutive Neural Net Fuzzy Filtering:

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

Automatic Pronunciation Checker

Automatic Pronunciation Checker Institut für Technische Informatik und Kommunikationsnetze Eidgenössische Technische Hochschule Zürich Swiss Federal Institute of Technology Zurich Ecole polytechnique fédérale de Zurich Politecnico federale

More information

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC On Human Computer Interaction, HCI Dr. Saif al Zahir Electrical and Computer Engineering Department UBC Human Computer Interaction HCI HCI is the study of people, computer technology, and the ways these

More information

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS 1 CALIFORNIA CONTENT STANDARDS: Chapter 1 ALGEBRA AND WHOLE NUMBERS Algebra and Functions 1.4 Students use algebraic

More information

On-Line Data Analytics

On-Line Data Analytics International Journal of Computer Applications in Engineering Sciences [VOL I, ISSUE III, SEPTEMBER 2011] [ISSN: 2231-4946] On-Line Data Analytics Yugandhar Vemulapalli #, Devarapalli Raghu *, Raja Jacob

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and Name Qualification Sonia Thomas Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept. 2016. M.Tech in Computer science and Engineering. B.Tech in

More information

Dropout improves Recurrent Neural Networks for Handwriting Recognition

Dropout improves Recurrent Neural Networks for Handwriting Recognition 2014 14th International Conference on Frontiers in Handwriting Recognition Dropout improves Recurrent Neural Networks for Handwriting Recognition Vu Pham,Théodore Bluche, Christopher Kermorvant, and Jérôme

More information

A virtual surveying fieldcourse for traversing

A virtual surveying fieldcourse for traversing Henny MILLS and David BARBER, UK Keywords: virtual, surveying, traverse, maps, observations, calculation Summary This paper presents the development of a virtual surveying fieldcourse based in the first

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction CLASSIFICATION OF PROGRAM Critical Elements Analysis 1 Program Name: Macmillan/McGraw Hill Reading 2003 Date of Publication: 2003 Publisher: Macmillan/McGraw Hill Reviewer Code: 1. X The program meets

More information

16 WEEKS STUDY PLAN FOR BS(IT)2 nd Semester

16 WEEKS STUDY PLAN FOR BS(IT)2 nd Semester 16 WEEKS STUDY PLAN FOR BS(IT)2 nd Semester COURSE: OBJECT ORIENTED PROGRAMMING Week Ch# Chapter Names 1 1 The Big Picture 2 2 C++ Programming Basics 3 3 Loops and Decisions 4 4 Structures 5 4 Structures

More information

Learning Methods for Fuzzy Systems

Learning Methods for Fuzzy Systems Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY F. Felip Miralles, S. Martín Martín, Mª L. García Martínez, J.L. Navarro

More information

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models Navdeep Jaitly 1, Vincent Vanhoucke 2, Geoffrey Hinton 1,2 1 University of Toronto 2 Google Inc. ndjaitly@cs.toronto.edu,

More information

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,

More information

Department of Computer Science GCU Prospectus

Department of Computer Science GCU Prospectus Department of Computer Science GCU Prospectus 2015 59 Introduction In recent years, the immense growth of numerous industries resulted in the instant need for young and vigorous IT professionals, who could

More information

LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 11 : 12 December 2011 ISSN

LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 11 : 12 December 2011 ISSN LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume ISSN 1930-2940 Managing Editor: M. S. Thirumalai, Ph.D. Editors: B. Mallikarjun, Ph.D. Sam Mohanlal, Ph.D. B. A. Sharada, Ph.D.

More information

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016 AGENDA Advanced Learning Theories Alejandra J. Magana, Ph.D. admagana@purdue.edu Introduction to Learning Theories Role of Learning Theories and Frameworks Learning Design Research Design Dual Coding Theory

More information

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma International Journal of Computer Applications (975 8887) The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma Gilbert M.

More information

DOES OUR EDUCATIONAL SYSTEM ENHANCE CREATIVITY AND INNOVATION AMONG GIFTED STUDENTS?

DOES OUR EDUCATIONAL SYSTEM ENHANCE CREATIVITY AND INNOVATION AMONG GIFTED STUDENTS? DOES OUR EDUCATIONAL SYSTEM ENHANCE CREATIVITY AND INNOVATION AMONG GIFTED STUDENTS? M. Aichouni 1*, R. Al-Hamali, A. Al-Ghamdi, A. Al-Ghonamy, E. Al-Badawi, M. Touahmia, and N. Ait-Messaoudene 1 University

More information

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Vijayshri Ramkrishna Ingale PG Student, Department of Computer Engineering JSPM s Imperial College of Engineering &

More information

A Vector Space Approach for Aspect-Based Sentiment Analysis

A Vector Space Approach for Aspect-Based Sentiment Analysis A Vector Space Approach for Aspect-Based Sentiment Analysis by Abdulaziz Alghunaim B.S., Massachusetts Institute of Technology (2015) Submitted to the Department of Electrical Engineering and Computer

More information

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Twitter Sentiment Classification on Sanders Data using Hybrid Approach IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com

More information

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,

More information

Axiom 2013 Team Description Paper

Axiom 2013 Team Description Paper Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association

More information

Tour. English Discoveries Online

Tour. English Discoveries Online Techno-Ware Tour Of English Discoveries Online Online www.englishdiscoveries.com http://ed242us.engdis.com/technotms Guided Tour of English Discoveries Online Background: English Discoveries Online is

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Using dialogue context to improve parsing performance in dialogue systems

Using dialogue context to improve parsing performance in dialogue systems Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,

More information

Abdul Rahman Chik a*, Tg. Ainul Farha Tg. Abdul Rahman b

Abdul Rahman Chik a*, Tg. Ainul Farha Tg. Abdul Rahman b Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 66 ( 2012 ) 223 231 The 8th International Language for Specific Purposes (LSP) Seminar - Aligning Theoretical Knowledge

More information

A student diagnosing and evaluation system for laboratory-based academic exercises

A student diagnosing and evaluation system for laboratory-based academic exercises A student diagnosing and evaluation system for laboratory-based academic exercises Maria Samarakou, Emmanouil Fylladitakis and Pantelis Prentakis Technological Educational Institute (T.E.I.) of Athens

More information

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE EE-589 Introduction to Neural Assistant Prof. Dr. Turgay IBRIKCI Room # 305 (322) 338 6868 / 139 Wensdays 9:00-12:00 Course Outline The course is divided in two parts: theory and practice. 1. Theory covers

More information

Physics 270: Experimental Physics

Physics 270: Experimental Physics 2017 edition Lab Manual Physics 270 3 Physics 270: Experimental Physics Lecture: Lab: Instructor: Office: Email: Tuesdays, 2 3:50 PM Thursdays, 2 4:50 PM Dr. Uttam Manna 313C Moulton Hall umanna@ilstu.edu

More information

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Master of Science (M.S.) Major in Computer Science 1 MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE Major Program The programs in computer science are designed to prepare students for doctoral research,

More information

Bootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition

Bootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition Bootstrapping Personal Gesture Shortcuts with the Wisdom of the Crowd and Handwriting Recognition Tom Y. Ouyang * MIT CSAIL ouyang@csail.mit.edu Yang Li Google Research yangli@acm.org ABSTRACT Personal

More information

Lecture 2: Quantifiers and Approximation

Lecture 2: Quantifiers and Approximation Lecture 2: Quantifiers and Approximation Case study: Most vs More than half Jakub Szymanik Outline Number Sense Approximate Number Sense Approximating most Superlative Meaning of most What About Counting?

More information

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform doi:10.3991/ijac.v3i3.1364 Jean-Marie Maes University College Ghent, Ghent, Belgium Abstract Dokeos used to be one of

More information

PROCEEDINGS OF SPIE. Double degree master program: Optical Design

PROCEEDINGS OF SPIE. Double degree master program: Optical Design PROCEEDINGS OF SPIE SPIEDigitalLibrary.org/conference-proceedings-of-spie Double degree master program: Optical Design Alexey Bakholdin, Malgorzata Kujawinska, Irina Livshits, Adam Styk, Anna Voznesenskaya,

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

Busuu The Mobile App. Review by Musa Nushi & Homa Jenabzadeh, Introduction. 30 TESL Reporter 49 (2), pp

Busuu The Mobile App. Review by Musa Nushi & Homa Jenabzadeh, Introduction. 30 TESL Reporter 49 (2), pp 30 TESL Reporter 49 (2), pp. 30 38 Busuu The Mobile App Review by Musa Nushi & Homa Jenabzadeh, Shahid Beheshti University, Tehran, Iran Introduction Technological innovations are changing the second language

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world Citrine Informatics The data analytics platform for the physical world The Latest from Citrine Summit on Data and Analytics for Materials Research 31 October 2016 Our Mission is Simple Add as much value

More information

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak

UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS. Heiga Zen, Haşim Sak UNIDIRECTIONAL LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK WITH RECURRENT OUTPUT LAYER FOR LOW-LATENCY SPEECH SYNTHESIS Heiga Zen, Haşim Sak Google fheigazen,hasimg@google.com ABSTRACT Long short-term

More information

An Evaluation of E-Resources in Academic Libraries in Tamil Nadu

An Evaluation of E-Resources in Academic Libraries in Tamil Nadu An Evaluation of E-Resources in Academic Libraries in Tamil Nadu 1 S. Dhanavandan, 2 M. Tamizhchelvan 1 Assistant Librarian, 2 Deputy Librarian Gandhigram Rural Institute - Deemed University, Gandhigram-624

More information