A REVIEW ON APPLICATIONS OF DATA MINING TECHNIQUES IN HIGHER EDUCATION

Similar documents
Mining Association Rules in Student s Assessment Data

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Australian Journal of Basic and Applied Sciences

Lecture 1: Machine Learning Basics

On-Line Data Analytics

Rule Learning With Negation: Issues Regarding Effectiveness

Speech Emotion Recognition Using Support Vector Machine

Rule Learning with Negation: Issues Regarding Effectiveness

Learning Methods for Fuzzy Systems

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Reducing Features to Improve Bug Prediction

Human Emotion Recognition From Speech

Applications of data mining algorithms to analysis of medical data

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Laboratorio di Intelligenza Artificiale e Robotica

CS Machine Learning

AUTOMATIC DETECTION OF PROLONGED FRICATIVE PHONEMES WITH THE HIDDEN MARKOV MODELS APPROACH 1. INTRODUCTION

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

A Reinforcement Learning Variant for Control Scheduling

Lecture 1: Basic Concepts of Machine Learning

Education: Professional Experience: Personnel leadership and management

Generative models and adversarial training

ROLE OF TEACHERS IN CURRICULUM DEVELOPMENT FOR TEACHER EDUCATION

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Laboratorio di Intelligenza Artificiale e Robotica

Guide to Teaching Computer Science

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Problems of the Arabic OCR: New Attitudes

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

K-Medoid Algorithm in Clustering Student Scholarship Applicants

A Case Study: News Classification Based on Term Frequency

Seminar - Organic Computing

Abstract. Janaka Jayalath Director / Information Systems, Tertiary and Vocational Education Commission, Sri Lanka.

Software Maintenance

DOES OUR EDUCATIONAL SYSTEM ENHANCE CREATIVITY AND INNOVATION AMONG GIFTED STUDENTS?

University of Toronto

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Knowledge-Based - Systems

Evolutive Neural Net Fuzzy Filtering: Basic Description

Word Segmentation of Off-line Handwritten Documents

Humboldt-Universität zu Berlin

Diploma in Library and Information Science (Part-Time) - SH220

Assignment 1: Predicting Amazon Review Ratings

Use of Online Information Resources for Knowledge Organisation in Library and Information Centres: A Case Study of CUSAT

Python Machine Learning

Evaluation of Teach For America:

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

A cognitive perspective on pair programming

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

Mathematics subject curriculum

Learning Methods in Multilingual Speech Recognition

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Learning From the Past with Experiment Databases

Cognitive Prior-Knowledge Testing Method for Core Development of Higher Education of Computing in Academia

Multimedia Application Effective Support of Education

Analyzing the Usage of IT in SMEs

Mining Student Evolution Using Associative Classification and Clustering

A study of speaker adaptation for DNN-based speech synthesis

Automating the E-learning Personalization

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

Time series prediction

Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing

An Evaluation of E-Resources in Academic Libraries in Tamil Nadu

Test Effort Estimation Using Neural Network

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

(Sub)Gradient Descent

FSL-BM: Fuzzy Supervised Learning with Binary Meta-Feature for Classification

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

The Comparative Study of Information & Communications Technology Strategies in education of India, Iran & Malaysia countries

Circuit Simulators: A Revolutionary E-Learning Platform

Abstractions and the Brain

INPE São José dos Campos

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany

Predicting Early Students with High Risk to Drop Out of University using a Neural Network-Based Approach

Details of educational qualifications

Axiom 2013 Team Description Paper

Computerized Adaptive Psychological Testing A Personalisation Perspective

National Survey of Student Engagement (NSSE)

Reinforcement Learning by Comparing Immediate Reward

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

CSL465/603 - Machine Learning

Robot manipulations and development of spatial imagery

ADDIE MODEL THROUGH THE TASK LEARNING APPROACH IN TEXTILE KNOWLEDGE COURSE IN DRESS-MAKING EDUCATION STUDY PROGRAM OF STATE UNIVERSITY OF MEDAN

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Ensemble Technique Utilization for Indonesian Dependency Parser

UNDERSTANDING THE INITIAL CAREER DECISIONS OF HOSPITALITY MANAGEMENT GRADUATES IN SRI LANKA

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Activities, Exercises, Assignments Copyright 2009 Cem Kaner 1

XXII BrainStorming Day

Transcription:

e-issn 2455 1392 Volume 2 Issue 5, May 2016 pp. 102-107 Scientific Journal Impact Factor : 3.468 http://www.ijcter.com A REVIEW ON APPLICATIONS OF DATA MINING TECHNIQUES IN HIGHER EDUCATION Prof. Prashant G. Tandale Bharati Vidyapeeth Deemed University, Institute of Management, Kolhapur, Maharashtra Abstract Now a days the important challenge that higher education faces, is reaching a stage to facilitate the universities in having more efficient, effective and accurate educational processes. Educational data mining is an emerging discipline that focuses on applying data mining tools and techniques to educationally related data. The data mining technology can discover the hidden patterns, associations, and anomalies from educational data. This knowledge can improve the decision making processes in higher educational systems. Data mining is considered as the most suited technology appropriate in giving additional insight into the lecturer, student, alumni, manager, and other educational staff behaviour and acting as an active automated assistant in helping them for making better decisions on their educational activities. This paper tries to take review of work done so far in this area of educational data mining so that a model could be developed for customized pedagogical strategies for the rural development in India in case of higher education. Keywords Educational Data Mining (EDM), Classification, WEKA, Decision Tree, Prediction, I. INTRODUCTION Education is an essential element for the betterment and progress of a country. It enables the people of a country civilized and well mannered. To date, higher educational organizations are placed in a very high competitive environment and are aiming to get more competitive advantages over the other competitors. To remain competitiveness among educational field, these organizations need deep and enough knowledge for a better assessment, evaluation, planning, and decision-making. The required knowledge cannot be gained from the tailor made software used now a days. Data mining incorporates a multitude of techniques from a variety of fields including databases, statistics, data visualization, machine learning and others. Educational data mining (EDM) is an emerging discipline that focuses on applying data mining tools and techniques to educationally related data. The discipline focuses on analyzing educational data to develop models for improving learning experiences and improving institutional effectiveness [14]. The data mining technology can discover the hidden patterns, associations, and anomalies from educational data. This knowledge can improve the decision making processes in higher educational systems. Data mining is considered as the most suited technology appropriate in giving additional insight into the all stake holders of the educational institute. The data mining techniques can help the institutes in extracting patterns like students having similar characteristics, Association of students attitude with performance, what factors will attract meritorious students and so on. The past several decades have witnessed a rapid growth in the use of data and knowledge mining as a means by which academic institutions extract useful hidden information in the student result repositories in order to improve students learning processes [5]. II. REVIEW OF LITERATURE A literature review on educational data mining topics such as student retention and attrition, personal recommender systems within education, and how data mining can be used to analyze course management system data. @IJCTER-2016, All rights Reserved 102

According to Paulraj and Ponniah, the main benefits of data mining to educational institutes are It provides an integrated and total view of an institute. It makes the institute s current and historical information easily available for the decision making. It provides the facility to students to get their different subject notes from a web enabled database. It provides the information about student s attendance. Students can get theirs results easily and very quickly. It helps to provide information about faculty like how many members are there in all the different departments etc [6]. Minaei-Bidgoli & others proposed an approach to classify students in order to predict their final year grade based on the features extracted from logged data in an educational web based system was reported. Data mining classification process was used in conjunction with genetic algorithm to improve the prediction accuracy. C. Romero and S. Ventura carried a survey in the field of education. They have described the types of users, types of educational environments and the data they provide. Also they have explained in their work the common tasks in the educational environment that have been resolved through data mining techniques [9]. Hua-long Zhao has done Multidimensional cube analysis by taking use of OLAP technology and has shown that the curriculum chosen by the students can depend upon many angles like teacher, semester and student. He has used Star model of data warehouse to the analysis of curriculum which can provide certain policy making support for different levels of education policy- maker in the school. Fadzilah Siraj and Mansour Ali Abdoulha have used data mining techniques for understanding student enrolment data. They have done comparative study of three predictive data mining techniques namely Neural Network, Logistic regression and Decision tree. The results obtained can be used by the planners to formulate proper plan for the university [12]. Shaeela Ayesha & others discusses data mining technique named k-means clustering is applied to analyze student s learning behavior. Here K-means clustering method is used to discover knowledge that come from educational environment [11]. S K Altaf Hussain Basha have identified the association between different attributes of educational environment i.e. the location of the college, type of college, different social groups, different courses etc. and thereby extract strong association rules. They used data mining technique of association rule mining to extract strong rules in educational environment that identifies students success patterns in different colleges in different social groups. Further they have processed the available data to find the pattern of support for these rules from time to time [1]. Ramasubramanian P. states that Rough Set Theory (RST) has found many interesting applications in academia. Currently there is an increasing interest in data mining and educational systems, making educational data mining as a new growing and promising area for the research community. Decision making of classroom processes is a very tedious process which involves observing a student s behavior, analyzing historical data, and estimating the effectiveness of pedagogical strategies. RST has been used in estimating the teaching result analysis with the help of concept map and brainstorming which can be used as a tool for educators [7]. Robertas analyzed student academic results for informatics course improvement, rank course topics following their importance for final course marks based on the strength of the association rules and proposed which course specific course topic should be improved to achieve higher student learning effectiveness and progress [8]. @IJCTER-2016, All rights Reserved 103

W.M.R. Tissera & others presents a real-world experiment conducted in an ICT educational institute in Sri Lanka. A series of data mining tasks are applied to find relationships between subjects in the undergraduate syllabi. This knowledge provides many insights into the syllabi of different educational programmes and results in knowledge critical in decision making that directly affects he quality of the educational programmes [13]. Hongjie Sun conducts a research on student learning result based on data mining. It is aimed at putting forward a rule-discovery approach suitable for the student learning result evaluation and applying it into practice so as to improve learning evaluation skills and finally better serve learning practicing [4]. S. Anupama Kumar and Dr. Vijayalakshmi M.N applied decision tree algorithm on student s internal assessment data to predict their performance in the final exam. The outcome of the decision tree predicted the number of students who are likely to fail or pass [10]. Monika Goyal used different types of rule based systems and have been applied to predict student s performance (mark prediction) in an e learning environment (using fuzzy association rules). Several regression techniques are used to predict student s marks like linear regression for predicting student s academic performance, stepwise linear regression for predicting time to be spent on a learning page, multiple linear regression for identifying variables that could predict success in courses and for predicting exam results in distance education courses [2]. According to Jaiwei Han, data mining is an interdisciplinary field of astronomy, business, computer science, economics and others to discover new patterns from large data sets. The actual data mining task is to analyze large quantities of data in order to extract previously unknown patterns such as groups of data records (cluster analysis), unusual records( anomaly detection) and dependencies (association rule mining) [3]. III. KNOWLEDGE DISCOVERY PROCESS Simply stated, data mining refers to extracting or mining" knowledge from large amounts of data. Data mining techniques are used to operate on large volumes of data to discover hidden patterns and relationships helpful in decision making. The sequences of steps identified in extracting knowledge from data are shown in Figure 1. Fig.1 Steps of extracting Knowledge from data @IJCTER-2016, All rights Reserved 104

The various techniques used in Data Mining are: A. Association analysis Association analysis is the discovery of association rules showing attribute-value conditions that occur frequently together in a given set of data. Association analysis is widely used for market basket or transaction data analysis. B. Classification and Prediction Classification is the processing of finding a set of models (or functions) which describe and distinguish data classes or concepts, for the purposes of being able to use the model to predict the class of objects whose class label is unknown. The derived model may be represented in various forms, such as classification (IF-THEN) rules, decision trees, mathematical formulae, or neural networks. Classification can be used for predicting the class label of data objects. However, in many applications, one may like to predict some missing or unavailable data values rather than class labels. This is usually the case when the predicted values are numerical data, and is often specifically referred to as prediction. IF-THEN rules are specified as IF condition THEN conclusion e.g. IF age=youth and gender=female then buys_dress=yes C. Clustering Analysis Unlike classification and predication, which analyze class labeled data objects, clustering analyzes data objects without consulting a known class label. In general, the class labels are not present in the training data simply because they are not known to begin with. Clustering can be used to generate such labels. The objects are clustered or grouped based on the principle of maximizing the intra-class similarity and minimizing the interclass similarity. That is, clusters of objects are formed so that objects within a cluster have high similarity in comparison to one another, but are very dissimilar to objects in other clusters. Each cluster that is formed can be viewed as a class of objects, from which rules can be derived D. Outlier Analysis A database may contain data objects that do not comply with the general behavior of the data and are called outliers. The analysis of these outliers may help in fraud detection and predicting abnormal values. IV. APPLICATIONS OF DATA MINING TECHNIQUES IN HIGHER EDUCATION 4.1 Predicting The Registration of Students in an Educational Programme Now a days educational organization are getting strong competition from other Academic competitors. To have an edge over other organizations, needs deep and enough knowledge for a better assessment, evaluation, planning, and decision making. Data Mining helps organizations to identify the hidden patterns in databases; the extracted patterns are then used to build data mining models, and hence can be used to predict performance and behavior with high accuracy. As a result of this, universities are able to allocate resources more effectively One of the application of data mining can be for example, to efficiently assign resources with an accurate estimate of how many male or female will register in a particular program by using the Prediction techniques. For this purpose Admissions to first year of UG courses data are collected from the BDVU, Institute of Management, Kolhapur and it is analyzed using WEKA- Data mining Tool and weka.classifiers.rules.zeror scheme. @IJCTER-2016, All rights Reserved 105

4.2 Predictor Training Data: International Journal of Current Trends in Engineering & Research (IJCTER) Year of Admission No. of Male students Admitted 2011 72 35 2012 53 14 2013 92 21 2014 144 40 2015 220? No. of Female students Admitted Female Students= 28 V. CONCLUSION In this paper we have taken a review of the work so far done in applying data mining techniques in higher education so that it can generate strategic information which will help the higher authorities in decision making. Since the application of data mining brings a lot of advantages in higher learning institution, it is recommended to apply these techniques in the areas like optimization of resources, prediction of admissions, etc. Bring the right change in the education will definitely help the rural development, because if education is given properly, students skills will be developed and which will ultimately help in nation development. REFERENCES [1] Basha, S. et al., Analyzing Education Data Through Association Rules: A Case Study, International Journal of Data Warehousing, Issue 3 vol.2, 53-64, 2011. [2] Goyal, M. et al., Applications of Data Mining in Higher Education, International Journal of Computer Science issues, 9 Vol. 2, 2012. [3] Han, J., Data Mining concepts and Techniques, Elsevier Publication, 2012. [4] Hongjie, S., Research on Student Learning Result System based on Data Mining, International Journal of Computer Science and Network Security, Issue 10 Vol.4, 2010. [5] Kumar,V. et al., Mining Association Rules in Students Assessment Data, International Journal of Computer Science Issues. Issue 9 Vol.5, 2012. [6] Paulraj, Data Warehousing Fundamentals: A Comprehensive Guide to IT Professionals. John Wiely & Sons. 2001 [7] Ramasubramanian, P. et al., Teaching result Analysis using Rough Sets and Data Mining, Journal of computing, Issue 1 Vol.1, 168-174, 2009. [8] Robertas, D., Analysis of Academic Results for Informatics Course Improvement using Association Rule Mining, Information Systems development towards a Service Provision Society, ISBN 978-0-387-84809-9, 357-363, Published by Springer US, 2009. [9] Romero, C. et al, Educational Data Mining: A Review of the State of the Art, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews. Issue 40 Vol.6, 2010. [10] S Anupama Kumar et al., Mining of student academic evaluation records in higher education. Recent Advances in Computing and Software Systems (RACSS), 2012 International Conference on.ieee, 67-70, 2012. [11] Shaeela, A.et al., Data Mining Model for Higher Education System, European Journal of Scientific Research,. Issue 43 Vol.1,.24-29,2010 [12] Siraj, F. et al. Uncovering hidden Information within University s Student Enrollment Data using Data Mining. Third Asia International Conference on Modeling and Simulation, 2009. [13] Tissera, W.et al., Discovery of Strongly Related Subjects in the Undergraduate Syllabi using Data Mining. IEEE International Conference on Information Acquisition, 2006 FLEXChip Signal Processor (MC68175/D), Motorola, 1996. [14] Varun Kumar et al., An Empirical Study of the Application of Data Mining Techniques in Higher Education, International Journal of Advanced Computer Science and Applications, Vol. 2, No. 3,2011. [15] www.mytechlabs.com @IJCTER-2016, All rights Reserved 106