Predicting Students' Performance: An EDM Approach

Sneha Kumari (1), Dr. Anupam Bhatia (2)
(1) M.Phil. Scholar, (2) Asstt. Professor
Deptt. of Computer Science and Applications, Chaudhary Ranbir Singh University, Jind (Haryana)

Abstract: Predicting students' performance is an integral part of an education system, because the overall growth of the system is directly related to the success rate of students in their examinations. There are therefore many situations in which student performance needs to be predicted. Data mining is a powerful tool that aims at discovering useful information in large collections of data, and different data mining techniques and models have been applied to this task. The main focus of this research work is to develop a predictive model, based on students' data, that can predict performance with a high accuracy rate. Classification is used to evaluate student performance and, among the many approaches available for data classification, the Naïve Bayes method is used in this work. It takes students' data as input and outputs their expected upcoming performance. Using the Naïve Bayes classification algorithm, we obtained a model with about 77.04% accuracy. The results help educational institutions and students understand how well or how poorly the students are likely to perform, so that a suitable learning strategy can be developed, teachers can give more assistance to the identified students, and performance can improve in future. This benefits both the students' individual results and the profile of the academic institution.

I. INTRODUCTION

Data Mining (DM), also known as Knowledge Discovery from Databases (KDD), is the field of discovering novel and potentially useful information from massive amounts of data [1]. The objective of data mining is to design methods that work efficiently with large data sets.
It has also been defined as the non-trivial process of identifying genuine, unique, potentially useful, and understandable patterns in data [2]. Data mining is widely used in education to manage educational data efficiently and to extract undiscovered knowledge from it. The application of data mining techniques in an educational setting is called Educational Data Mining (EDM). EDM is a nexus of Data Mining, Statistics, Machine Learning, and Psychometrics. The Educational Data Mining community website [3] defines EDM as follows: "Educational Data Mining is an emerging discipline, concerned with developing methods for exploring the unique type of data that come from educational settings and using those methods to better understand students and the settings which they learn in." According to Wikipedia, "Educational Data Mining (EDM) refers to techniques, tools, and research designed for automatically extracting meaning from large repositories of data generated by or related to people's learning activities in educational settings" [4].

EDM designs models, tasks, algorithms, and techniques to explore the large amounts of data that educational systems collect from educational environments, in order to discover new knowledge about students and understand them. Various data mining techniques have been used for this purpose, such as Decision Trees, Artificial Intelligence, Neural Networks, and others. The mined knowledge provides better insight into the educational process and helps to facilitate and upgrade it.

Predicting student academic performance has long been a key research area. Educational institutes aim to offer quality education to students, to better their behavior, and to improve the quality of managerial decisions. A high level of quality in education is achieved by discovering knowledge from educational information in order to study the main attributes that may affect students' performance.
The discovered knowledge helps academic planners in educational institutions by providing recommendations to improve their decision-making, improve students' academic performance, reduce failure rates, better understand students' behavior, assist instructors, and improve teaching. The ability to predict students' performance is very important in an educational environment. A student's academic performance depends on factors such as personal, social, and demographic data. Therefore, there are many situations where the performance of students needs to be predicted [5]. Predicting student performance with high accuracy is useful for identifying students with low academic achievement. In addition, the prediction results may help students understand how well or how poorly they are likely to perform, so that a suitable learning strategy can be developed, teachers can give more assistance to the identified students, and performance can improve in future. This benefits both the students' individual results and the profile of the academic institution. Accurate prediction of student success is one way to improve the quality of education and to make better educational services available.

A UGC Recommended Journal
II. DATA DESCRIPTION

Data Preparation

In this study, the data set is taken from [...]. It is an Excel sheet containing students' academic data. The data set contains 14 variables in 8 files, covering the 2008 to 2015 applicants, with approximately 5,00,000 records. The student data set was continuous and has been converted into nominal data.

Data Cleaning

The integrated data had missing values, i.e. attribute values that were missing or noisy. In this research, missing data elements are replaced with the attribute average (mean). Other options include replacing missing data elements with zero or the mode, or leaving them blank.

Data Selection and Transformation

The attributes Year, Federal race code, Gender, Disability flag, LEP flag, Disadvantaged flag, Cohort count, Diploma rate, and Dropout rate were considered relevant. Division number, Division name, School number, and School name were ignored as irrelevant to the analysis of student performance. The attribute Level code takes the values school, division, and state; for this research, the data at the school level code was selected. All the variables derived from the database are given in Table 1 for reference.
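The cleaning and conversion steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the paper performs these steps in RapidMiner, the function names `impute_with_mean` and `discretize` are hypothetical, and the choice of two equal-width bins with the labels `bin1` and `bin2` is an assumption, since the paper does not state how the continuous values were binned.

```python
from statistics import mean


def impute_with_mean(values):
    """Replace missing entries (None) with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute a column with no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


def discretize(values, n_bins, low, high):
    """Map continuous values in [low, high] to nominal labels 'bin1'..'binN'.

    Equal-width binning is an assumption made for this sketch.
    """
    width = (high - low) / n_bins
    labels = []
    for v in values:
        index = int((v - low) / width)
        index = max(0, min(index, n_bins - 1))  # keep v == high in the last bin
        labels.append(f"bin{index + 1}")
    return labels


# Hypothetical diploma-rate column with two missing entries.
diploma_rate = [92.0, None, 48.0, 75.0, None, 100.0]
cleaned = impute_with_mean(diploma_rate)
nominal = discretize(cleaned, n_bins=2, low=0.0, high=100.0)
print(cleaned)
print(nominal)
```

Mean imputation keeps the column average unchanged but reduces its variance, which is one reason the alternatives mentioned above (zero, the mode, or leaving the value blank) are sometimes preferred.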
Table 1 - Selected Attributes Description

| Attribute name | Description | Possible values |
|---|---|---|
| School Year | Calendar year | 1( ), 2( ), 3( ), 4( ), 5( ), 6( ), 7( ) |
| Federal Race Code | Student's category | 0 - Unspecified, 1 - American Indian, 2 - Asian, 3 - Black/African, 4 - Hispanic of any race, 5 - White, 6 - Other Pacific Islander, 99 - Two or more races |
| Gender | Student's sex | Male, Female |
| Disability Flag | Student with disability | Yes (1), No (0) |
| LEP Flag | Limited English Proficiency status | Yes (1), No (0) |
| Disadvantaged Flag | Student with economically disadvantaged status | Yes (1), No (0) |
| Cohort Count | Number of students in cohort | Between 0 and 500 |
| Diploma Rate | Graduation rate of students | Between 0 and 100 |
| Dropout Rate | Number of students who drop out | Between 0 and 100 |

The domain values for some of the variables were defined for the present investigation as follows.

Federal race code - The Federal Race Code identifies the racial category that most clearly reflects the student's recognition of his or her community, or with which the student most closely identifies. The valid values are 0 = Unspecified (used through the school year), 1 = American Indian/Alaska Native, 2 = Asian, 3 = Black or African American, 4 = Hispanic of any race, 5 = White, 6 = Native Hawaiian/Other Pacific Islander, 99 = Two or more races, non-Hispanic (added in ).

Disability flag - A person having an intellectual disability; hearing impairment, including deafness; speech or language impairment; visual impairment, including blindness; serious emotional disturbance; other health impairment; specific learning disability; deaf-blindness; or multiple disabilities, and who, by reason thereof, receives special education and related services. The valid values: Y = Yes or N = No.
LEP flag - Limited English Proficiency (LEP) status.

Disadvantaged flag - A flag that identifies a student as economically disadvantaged if the student meets any one of the following: 1) is eligible for Free/Reduced Meals, 2) receives TANF, 3) is eligible for Medicaid, or 4) is identified as either Migrant or experiencing Homelessness. The valid values: Y = Yes or N = No.

Cohort count - The number of students in the cohort. Virginia's graduation cohorts are defined as the group of students who enter the ninth grade for the first time together, with the expectation of graduating within four years.

Diploma rate - The graduation rate is the percentage of students in a cohort who earn a diploma within four years of entering the ninth grade.

Dropout rate - The dropout rate reflects the number of students who dropped out and did not re-enroll.

III. TOOLS AND TECHNIQUES

RAPID MINER

The RapidMiner tool is used for exploration, statistical analysis, and mining of the students' data. RapidMiner (formerly YALE, i.e. Yet Another Learning Environment) is Open Source Software (OSS) under an OSI-certified open source license. It may be used for text mining, machine learning, and predictive and business analysis, as well as for research, education, training, and application development, and it supports all steps of the data mining process. RapidMiner provides an easy-to-use drag-and-drop interface that requires no programming skills. RapidMiner Studio is a powerful visual design environment for rapidly building complete predictive analytic workflows. This all-in-one tool features hundreds of pre-defined data preparation and machine learning algorithms to support data science projects [12].

CLASSIFICATION - Classification is one of the most commonly applied supervised learning techniques. It employs a set of pre-classified examples to build a model that can classify the population of records at large [6]. The objective of classification is to predict a future outcome based on the existing data.
Classification is one of the problems most often studied by data mining and machine learning researchers. A classifier, or classification model, predicts categorical labels (classes) [7]. A classification model is constructed by analyzing the relationship between the attributes and the class of the objects in the training set. Such a model can be used to classify future objects and to develop an understanding of the classes of objects in the database. The classification process involves two steps: learning and classification. In the learning step, a model that describes a predetermined set of classes or concepts is built by examining a training data set. The models are generally in the form of classification rules. In the classification step, the model is tested on a different data set, which is used to estimate its predictive accuracy. If the accuracy is considered acceptable, the model can be applied to classify data for which the class label is not known in advance [8]. Educational institutes try to predict the future results of their registered students from previous and current student data, which makes classification one of the techniques best suited to educational analysis [9]. The basic techniques for classification are Decision Tree Induction, Bayesian Classification, and Neural Networks. Other approaches, such as Genetic Algorithms, Rough Sets, Fuzzy Logic, and Case-Based Reasoning, can also be used for classification.

NAÏVE BAYES - The Naïve Bayes algorithm is a statistical classifier that calculates a set of probabilities by counting the frequencies and combinations of values in a given data set [10]. Such classifiers can predict class membership probabilities, such as the probability that a given tuple belongs to a particular class. Naïve Bayes is a predictive approach: it makes predictions about the values of data using known results found in other data.
In addition, the output of a Naïve Bayes prediction model can easily be interpreted in human-understandable language. A Naïve Bayes (NB) classifier is a simple probabilistic classifier based on (a) Bayes' theorem, (b) strong (naive) independence assumptions, and (c) independent feature models. It is an important classifier in data mining and is applied in many real-world classification problems because of its high classification performance. An NB classifier can easily handle missing attribute values by simply omitting the corresponding probabilities for those attributes when calculating the likelihood of membership for each class. The NB classifier also requires class conditional independence, i.e. the effect of an attribute on a given class is independent of the values of the other attributes.
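To make the counting-based description above concrete, here is a minimal sketch of a categorical Naïve Bayes classifier in plain Python. It is not the RapidMiner operator used in this study. The class name `NaiveBayes`, the toy attributes, and the add-one smoothing term are assumptions introduced for illustration; the paper does not say whether or how zero counts are smoothed. Missing values (`None`) are handled as described above, by omitting the corresponding probability.

```python
from collections import Counter, defaultdict


class NaiveBayes:
    """Categorical naive Bayes built from frequency counts (illustrative)."""

    def fit(self, rows, labels):
        self.class_counts = Counter(labels)
        self.total = len(labels)
        # value_counts[(class, attribute)][value] -> number of training rows
        self.value_counts = defaultdict(Counter)
        self.attribute_values = defaultdict(set)
        for row, label in zip(rows, labels):
            for attribute, value in row.items():
                if value is None:  # a missing value contributes no count
                    continue
                self.value_counts[(label, attribute)][value] += 1
                self.attribute_values[attribute].add(value)
        return self

    def posterior(self, row):
        """Return P(class | row) for every class, normalised to sum to 1."""
        scores = {}
        for label, count in self.class_counts.items():
            score = count / self.total  # class prior P(Ci)
            for attribute, value in row.items():
                if value is None:  # omit the term for a missing attribute
                    continue
                counts = self.value_counts[(label, attribute)]
                n_values = len(self.attribute_values[attribute] | {value})
                # Add-one smoothing: an assumption, not stated in the paper.
                score *= (counts[value] + 1) / (sum(counts.values()) + n_values)
            scores[label] = score
        norm = sum(scores.values())
        return {label: s / norm for label, s in scores.items()}

    def predict(self, row):
        post = self.posterior(row)
        return max(post, key=post.get)


# Hypothetical toy records; the attribute names echo Table 1 only loosely.
rows = [
    {"gender": "F", "disadvantaged": "N"},
    {"gender": "M", "disadvantaged": "N"},
    {"gender": "F", "disadvantaged": "Y"},
    {"gender": "M", "disadvantaged": "Y"},
    {"gender": "F", "disadvantaged": None},
    {"gender": "M", "disadvantaged": "Y"},
]
labels = ["pass", "pass", "fail", "fail", "pass", "fail"]

model = NaiveBayes().fit(rows, labels)
print(model.predict({"gender": "F", "disadvantaged": "N"}))
print(model.posterior({"gender": None, "disadvantaged": "Y"}))
```

Multiplying one conditional probability per attribute is exactly where the class conditional independence assumption enters: no joint counts over combinations of attributes are ever needed.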
Naïve Bayes algorithm pseudo code

Given a training dataset $D = \{X_1, X_2, \ldots, X_n\}$, each data record is represented as $X_i = \{x_1, x_2, \ldots, x_n\}$. $D$ contains the attributes $\{A_1, A_2, \ldots, A_n\}$, and each attribute $A_i$ takes the attribute values $\{A_{i1}, A_{i2}, \ldots, A_{in}\}$. The attribute values can be discrete or continuous. $D$ also contains a set of classes $C = \{C_1, C_2, \ldots, C_m\}$. Each training instance $X \in D$ has a particular class label $C_i$. For a test instance $X$, the classifier predicts that $X$ belongs to the class with the highest posterior probability, conditioned on $X$. That is, the NB classifier predicts that the instance $X$ belongs to the class $C_i$ if and only if

$$P(C_i \mid X) > P(C_j \mid X) \quad \text{for } 1 \le j \le m,\ j \ne i.$$

The class $C_i$ for which $P(C_i \mid X)$ is maximized is called the maximum a posteriori hypothesis. By Bayes' theorem,

$$P(C_i \mid X) = \frac{P(X \mid C_i)\,P(C_i)}{P(X)}.$$

Since $P(X)$ is constant for all classes, only $P(X \mid C_i)\,P(C_i)$ needs to be maximized. If the class prior probabilities are not known, it is commonly assumed that the classes are equally likely, that is $P(C_1) = P(C_2) = \cdots = P(C_m)$, and we would therefore maximize $P(X \mid C_i)$. Otherwise, we maximize $P(X \mid C_i)\,P(C_i)$. The class prior probabilities are calculated by $P(C_i) = |C_{i,D}| / |D|$, where $|C_{i,D}|$ is the number of training instances in $D$ that belong to class $C_i$. Computing $P(X \mid C_i)$ for a dataset with many attributes is extremely expensive. The naive assumption of class conditional independence is therefore made to reduce the computation: the attributes are conditionally independent of one another, given the class label of the instance. Thus eq. 1 and eq. 2 are used to produce $P(X \mid C_i)$ [11].

$$P(X \mid C_i) = \prod_{k=1}^{n} P(x_k \mid C_i) \qquad (1)$$

$$P(X \mid C_i) = P(x_1 \mid C_i) \times P(x_2 \mid C_i) \times \cdots \times P(x_n \mid C_i) \qquad (2)$$

EVALUATION AND ANALYSIS

The analysis is performed using RapidMiner Studio. In this paper, Naïve Bayes in RapidMiner is used to construct the prediction model. The data is imported from the Excel file into RapidMiner using the Read CSV operator.
The data was retrieved using the Retrieve operator and passed to the Cross Validation operator. The Set Role and Discretize operators are used as preprocessing steps. Cross-validation is applied to evaluate the model and find its accuracy. The Cross Validation operator is a nested operator with two subprocesses, training and testing.

Testing and Training - These are the subprocesses of the validation operator (Figure 2). The training subprocess is used to train a model. The trained model is then applied in the testing subprocess, where its performance is also measured. In the training phase of cross-validation, the Naïve Bayes operator is used to train the model; in the testing phase, the Apply Model operator is used to test it. The Performance operator is used for performance evaluation.

Figure 1 - Model for Data Analysis
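The nested training and testing subprocesses described above correspond to ordinary k-fold cross-validation, which can be sketched as follows. This is a generic illustration, not RapidMiner's implementation: the helper names (`k_fold_indices`, `cross_validate`), the shuffling seed, and the value of `k` are assumptions, since the paper does not state the number of folds. A trivial majority-class learner stands in for the Naïve Bayes operator so that the sketch stays self-contained.

```python
import random
from collections import Counter
from statistics import mean, pstdev


def k_fold_indices(n_rows, k, seed=0):
    """Shuffle the row indices and deal them into k disjoint folds."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(rows, labels, k, train, apply_model, seed=0):
    """Return (mean accuracy, standard deviation) over k folds."""
    accuracies = []
    for test_idx in k_fold_indices(len(rows), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(rows)) if i not in held_out]
        # Training subprocess: fit on everything except the held-out fold.
        model = train([rows[i] for i in train_idx],
                      [labels[i] for i in train_idx])
        # Testing subprocess: apply the model and measure performance.
        predictions = [apply_model(model, rows[i]) for i in test_idx]
        correct = sum(p == labels[i] for p, i in zip(predictions, test_idx))
        accuracies.append(correct / len(test_idx))
    return mean(accuracies), pstdev(accuracies)


# Placeholder learner (an assumption): always predict the majority class.
def train_majority(rows, labels):
    return Counter(labels).most_common(1)[0][0]


def apply_majority(model, row):
    return model


rows = list(range(20))                    # stand-in feature rows
labels = ["pass"] * 14 + ["fail"] * 6     # hypothetical class labels
mean_acc, spread = cross_validate(rows, labels, k=5,
                                  train=train_majority,
                                  apply_model=apply_majority)
print(f"accuracy: {mean_acc:.2%} +/- {spread:.2%}")
```

Reporting a mean together with a spread across folds gives a figure of the same shape as the "accuracy +/- deviation" value that the Performance operator displays.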
Figure 2 - Testing and Training (Cross-Validation) of Model

RESULTS

The RapidMiner Performance operator provides several options for checking the validity of the model: accuracy, precision, recall, and AUC charts (AUC (optimistic), AUC, and AUC (pessimistic)). The model accuracy is 77.04% (Figure 3).

Figure 3 - Accuracy of the Model

Figure 4 - Precision of the Model
Figure 5 - Recall Table View

Figure 6 - AUC (Neutral)

Figure 7 - Description of Performance Vector
Evaluation of Performance

Table 2 shows the performance of Naïve Bayes in predicting the results, based on the training set that contains the students' data.

Table 2 - Performance Table of the Model

Accuracy: 77.04% +/- 1.09% (mikro: 77.04%)

| | True Range 1 | True Range 2 | Class Precision |
|---|---|---|---|
| Pred. Pass | 4189 | 1451 | 74.27% |
| Pred. Fail | 845 | 3515 | 80.62% |
| Class Recall | 83.21% | 70.78% | |

In this report, the experiment tries to determine how many students will fail the exam, so that attention can be focused on these students to improve their academic performance. Range 1 (pass) is the positive class and range 2 (fail) is the negative class. The data set consists of 10,000 student records, which the Naïve Bayes algorithm uses to classify the results. In the first row of the confusion matrix, in the true range 1 (pass) column, 4189 students are classified as positive (predicted to pass) and actually passed; these are true positives (TP). However, 1451 students are classified incorrectly: they actually failed but were predicted to pass, so they are false positives (FP). In the second row, in the true range 1 (pass) column, 845 instances are predicted as fail but actually passed; these are false negatives (FN). Finally, 3515 instances are predicted as fail and actually failed; these are true negatives (TN).

As seen in Table 2, TP = 4189, FP = 1451, FN = 845, TN = 3515.

$$\text{Sensitivity} = \frac{TP}{TP + FN} \times 100 = \frac{4189}{5034} \times 100 = 83.21\%$$

$$\text{Specificity} = \frac{TN}{TN + FP} \times 100 = \frac{3515}{4966} \times 100 = 70.78\%$$

$$\text{Accuracy} = \frac{TP + TN}{TP + FP + FN + TN} \times 100 = \frac{7704}{10000} \times 100 = 77.04\%$$

The model shows an accuracy of 77.04% with a margin of error of +/- 1.09%.

Precision - Of the cases that RapidMiner predicted as pass, 74.27% actually passed; of the cases it predicted as fail, 80.62% actually failed.

Recall - Of the students who actually passed, RapidMiner predicted 83.21% as pass; of the students who actually failed, it predicted 70.78% as fail. According to Table 2, 4966 of the 10,000 students actually failed, and the model identified 3515 of these 4966 correctly, which helps in improving students' performance.

AUC - Accuracy is also measured by the area under the ROC curve.
An area of 1 represents a perfect test; an area of 0.5 represents a worthless test. As shown in Figure 6, the AUC is 0.86, which is considered good.
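The percentages quoted in the evaluation can be re-derived from the four confusion-matrix counts reported in the text (TP = 4189, FP = 1451, FN = 845, TN = 3515). The short Python check below does this; the function name `confusion_metrics` and the dictionary keys are hypothetical names chosen for this sketch, with pass taken as the positive class as in the paper.

```python
def confusion_metrics(tp, fp, fn, tn):
    """Derive the standard rates from binary confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),     # recall of the pass class
        "specificity": tn / (tn + fp),     # recall of the fail class
        "precision_pass": tp / (tp + fp),
        "precision_fail": tn / (tn + fn),
    }


# Counts as reported in the evaluation section.
metrics = confusion_metrics(tp=4189, fp=1451, fn=845, tn=3515)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

Note that accuracy is the share of all 10,000 records classified correctly, (TP + TN) divided by the total; it is not the sum or the plain average of sensitivity and specificity.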
In terms of accuracy, precision, and recall, we conclude that Naïve Bayes produces an accuracy of 77.04%. The model also produced a precision of 80.62% (fail class) and a recall of 83.21% (pass class), which shows that it is possible to obtain a good prediction model.

IV. CONCLUSION AND FUTURE WORK

In this study, a model was developed based on selected input variables. Of all the input variables, some of the most influential factors were identified and used to predict students' academic performance. The Naïve Bayes classification algorithm was applied to predict student performance on the basis of the previous years' student database. The RapidMiner tool was used for exploration, statistical analysis, and mining of the student data, and its Cross Validation operator was used to perform the cross-validation process. From the above analysis, we conclude that Naïve Bayes produces an accuracy of 77.04%, a precision of 80.62% (fail class), and a recall of 83.21% (pass class), which shows that it is possible to obtain a good prediction model. The proposed methodology can be adopted to predict the performance of students and to help teachers as well as students enhance the quality of learning and student performance by taking significant decisions at the right time.

In future work, the study can be extended by including data of higher quality and with more information about the students, which might improve the performance of the current model, give more accurate predictions of student performance, and help to characterize student behavior. The work could also be carried out with other modern techniques to obtain a wider approach and more reliable outputs.

REFERENCES

1. S. G. Kulkarni, G. C. Rampure, and B. Yadav, "Understanding Educational Data Mining (EDM)".
2. M. Computing, "Predictive Data Mining: A Generalized Approach," vol. 3, no. 1.
3. "Home - International Educational Data Mining Society." [Online]. [Accessed: 22-May-2017].
4. EducationalDataMining.org.
5. S. Taruna and M. Pandey, "An Empirical Analysis of Classification Techniques for Predicting Academic Performance".
6. B. Ramageri, "Data mining techniques and applications," Indian J. Comput. Sci., vol. 1, no. 4.
7. A. D. Kumar and V. Radhika, "A Survey on Predicting Student Performance," vol. 5, no. 5.
8. J. Ruby and K. David, "Analysis of Influencing Factors in Predicting Students Performance Using MLP - A Comparative Study," Int. J. Innov. Res. Comput. Commun. Eng., vol. 3297, no. 2.
9. M. A. Al-Barrak and M. Al-Razgan, "Predicting Students Final GPA Using Decision Trees: A Case Study," Int. J. Inf. Educ. Technol., vol. 6, no. 7.
10. S. R. Dash and S. Dehuri, "Comparative Study of Different Classification Techniques for Post Operative Patient Dataset," Int. J. Innov. Res. Comput. Commun. Eng., vol. 1, no. 5.
11. P. Sharma, D. Singh, and A. Singh, "Classification Algorithms on a Large Continuous Random Dataset Using Rapid," IEEE Sponsored 2nd Int. Conf. Electron. Commun. Syst. (ICECS 2015).
12. RapidMiner Studio Manual.
More informationHuman Emotion Recognition From Speech
RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati
More informationApps4VA at JMU. Student Projects Featuring VLDS Data. Dr. Chris Mayfield. Department of Computer Science James Madison University
Apps4VA at JMU Student Projects Featuring VLDS Data Dr. Chris Mayfield Department of Computer Science James Madison University VLDS Insights June 30, 2015 One minute version 250 students from JMU Computer
More informationComing in. Coming in. Coming in
212-213 Report Card for Glenville High School SCHOOL DISTRICT District results under review by the Ohio Department of Education based upon 211 findings by the Auditor of State. Achievement This grade combines
More informationRule discovery in Web-based educational systems using Grammar-Based Genetic Programming
Data Mining VI 205 Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming C. Romero, S. Ventura, C. Hervás & P. González Universidad de Córdoba, Campus Universitario de
More informationUW-Waukesha Pre-College Program. College Bound Take Charge of Your Future!
UW-Waukesha Pre-College Program College Bound 2017 Take Charge of Your Future! This is a great program to increase your knowledge on various subjects. Students will be engaged in workshops and hands-on
More informationCSL465/603 - Machine Learning
CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am
More informationDetecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011
Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011 Cristian-Alexandru Drăgușanu, Marina Cufliuc, Adrian Iftene UAIC: Faculty of Computer Science, Alexandru Ioan Cuza University,
More informationEvaluating and Comparing Classifiers: Review, Some Recommendations and Limitations
Evaluating and Comparing Classifiers: Review, Some Recommendations and Limitations Katarzyna Stapor (B) Institute of Computer Science, Silesian Technical University, Gliwice, Poland katarzyna.stapor@polsl.pl
More informationLinking Task: Identifying authors and book titles in verbose queries
Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,
More informationWE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT
WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working
More informationScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 98 (2016 ) 368 373 The 6th International Conference on Current and Future Trends of Information and Communication Technologies
More informationMyths, Legends, Fairytales and Novels (Writing a Letter)
Assessment Focus This task focuses on Communication through the mode of Writing at Levels 3, 4 and 5. Two linked tasks (Hot Seating and Character Study) that use the same context are available to assess
More informationBasic Skills Initiative Project Proposal Date Submitted: March 14, Budget Control Number: (if project is continuing)
Basic Skills Initiative Project Proposal 2016-2017 Date Submitted: March 14, 2016 Check One: New Proposal: Continuing Project: X Budget Control Number: (if project is continuing) Control # 87-413 - EOPS
More informationWisconsin 4 th Grade Reading Results on the 2015 National Assessment of Educational Progress (NAEP)
Wisconsin 4 th Grade Reading Results on the 2015 National Assessment of Educational Progress (NAEP) Main takeaways from the 2015 NAEP 4 th grade reading exam: Wisconsin scores have been statistically flat
More informationComputerized Adaptive Psychological Testing A Personalisation Perspective
Psychology and the internet: An European Perspective Computerized Adaptive Psychological Testing A Personalisation Perspective Mykola Pechenizkiy mpechen@cc.jyu.fi Introduction Mixed Model of IRT and ES
More information12-month Enrollment
12-month Enrollment 2016-17 Institution: Potomac State College of West Virginia University (237701) Overview 12-month Enrollment Overview The 12-Month Enrollment component collects unduplicated student
More informationApplication for Postgraduate Studies (Research)
Application for Postgraduate Studies (Research) Please complete clearly. This form will be photocopied. Applicant Number (for office use only). For office use only: Admissions Office Admissions Tutor Interview
More informationApplication for Admission to Postgraduate Studies
Ref A Application for Admission to Postgraduate Studies Please read the attached notes before completing the application form Section A Personal Details (Please see notes) Surname / Family name Email Mr
More informationSwitchboard Language Model Improvement with Conversational Data from Gigaword
Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword
More informationPlease complete these two forms, sign them, and return them to us in the enclosed pre paid envelope.
Anatomical Donation Program Jack and Pearl Resnick Campus 1300 Morris Park Avenue, Rm F627N Bronx, NY 10461 Phone: 718.430.3142 Fax: 718.430.8997 anatomical.gifts@einstein.yu.edu We sincerely thank you
More informationFrank Phillips College. Accountability Report
Frank Phillips College Accountability Report January 2016 Accountability System, January 2016 1 of 22 Participation - Key Measures Enrollment 1. Fall Headcount (Unduplicated) Fall 2000 Fall 2014 Fall 2015
More informationREADY OR NOT? CALIFORNIA'S EARLY ASSESSMENT PROGRAM AND THE TRANSITION TO COLLEGE
READY OR NOT? CALIFORNIA'S EARLY ASSESSMENT PROGRAM AND THE TRANSITION TO COLLEGE Michal Kurlaender University of California, Davis Policy Analysis for California Education March 16, 2012 This research
More informationFOR TEACHERS ONLY. The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION PHYSICAL SETTING/PHYSICS
PS P FOR TEACHERS ONLY The University of the State of New York REGENTS HIGH SCHOOL EXAMINATION PHYSICAL SETTING/PHYSICS Thursday, June 21, 2007 9:15 a.m. to 12:15 p.m., only SCORING KEY AND RATING GUIDE
More informationHow do adults reason about their opponent? Typologies of players in a turn-taking game
How do adults reason about their opponent? Typologies of players in a turn-taking game Tamoghna Halder (thaldera@gmail.com) Indian Statistical Institute, Kolkata, India Khyati Sharma (khyati.sharma27@gmail.com)
More informationExperiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling
Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Notebook for PAN at CLEF 2013 Andrés Alfonso Caurcel Díaz 1 and José María Gómez Hidalgo 2 1 Universidad
More informationPEIMS Submission 3 list
Campus PEIMS Preparation SPRING 2014-2015 D E P A R T M E N T O F T E C H N O L O G Y ( D O T ) - P E I M S D I V I S I O N PEIMS Submission 3 list The information on this page provides instructions for
More informationImproving Simple Bayes. Abstract. The simple Bayesian classier (SBC), sometimes called
Improving Simple Bayes Ron Kohavi Barry Becker Dan Sommereld Data Mining and Visualization Group Silicon Graphics, Inc. 2011 N. Shoreline Blvd. Mountain View, CA 94043 fbecker,ronnyk,sommdag@engr.sgi.com
More informationAssessing and Providing Evidence of Generic Skills 4 May 2016
Assessing and Providing Evidence of Generic Skills 4 May 2016 Dr. Cecilia Ka Yuk Chan Head of Professional Development/ Associate Professor Centre for the Enhancement of Teaching and Learning (CETL) Tell
More informationIntroduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition
Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and
More informationIssues in the Mining of Heart Failure Datasets
International Journal of Automation and Computing 11(2), April 2014, 162-179 DOI: 10.1007/s11633-014-0778-5 Issues in the Mining of Heart Failure Datasets Nongnuch Poolsawad 1 Lisa Moore 1 Chandrasekhar
More informationSystem Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks
System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering
More informationCS 446: Machine Learning
CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt
More informationEducational Attainment
A Demographic and Socio-Economic Profile of Allen County, Indiana based on the 2010 Census and the American Community Survey Educational Attainment A Review of Census Data Related to the Educational Attainment
More informationA GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING
A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING Yong Sun, a * Colin Fidge b and Lin Ma a a CRC for Integrated Engineering Asset Management, School of Engineering Systems, Queensland
More informationTRANSFER APPLICATION: Sophomore Junior Senior
: Sophomore Junior Senior 2714 W Augusta Phone: 773.534.9718 Fax: 773.534.4022 Email: admissions@chiarts.org Web: www.chiarts.org CPS Mail Run: G.S.R. #35 FRESHMAN APPLICATION STEPS Thank you for your
More informationSpeech Emotion Recognition Using Support Vector Machine
Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,
More informationPredicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks
Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com
More informationIS FINANCIAL LITERACY IMPROVED BY PARTICIPATING IN A STOCK MARKET GAME?
21 JOURNAL FOR ECONOMIC EDUCATORS, 10(1), SUMMER 2010 IS FINANCIAL LITERACY IMPROVED BY PARTICIPATING IN A STOCK MARKET GAME? Cynthia Harter and John F.R. Harter 1 Abstract This study investigates the
More informationEvaluation of Teach For America:
EA15-536-2 Evaluation of Teach For America: 2014-2015 Department of Evaluation and Assessment Mike Miles Superintendent of Schools This page is intentionally left blank. ii Evaluation of Teach For America:
More informationUniversity of Utah. 1. Graduation-Rates Data a. All Students. b. Student-Athletes
University of Utah FRESHMAN-COHORT GRADUATION RATES All Students Student-Athletes # 2009-10 Graduation Rate 64% 64% Four-Class Average 61% 64% Student-Athlete Graduation Success Rate 87% 1. Graduation-Rates
More informationExperiment Databases: Towards an Improved Experimental Methodology in Machine Learning
Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning Hendrik Blockeel and Joaquin Vanschoren Computer Science Dept., K.U.Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium
More informationUniversity of Arizona
Annual Report Submission View Questionnaire (Edit) University of Arizona Annual Report Submission for the year 2009. Report has been submitted 1 times. Report was last submitted on 11/30/2009 7:12:09 PM.
More informationAPPLICANT INFORMATION. Area Code: Phone: Area Code: Phone:
MARQUETTE UNIVERSITY HEALTH CAREERS OPPORTUNITY PROGRAM College Science Enrichment Program (CSEP) & Pre-Enrollment Support Program (PESP) Website: http://www.mu.edu/hcop INSTRUCTIONS: Please type or print
More informationCLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH
ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department
More informationPsychometric Research Brief Office of Shared Accountability
August 2012 Psychometric Research Brief Office of Shared Accountability Linking Measures of Academic Progress in Mathematics and Maryland School Assessment in Mathematics Huafang Zhao, Ph.D. This brief
More informationClassification Using ANN: A Review
International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 7 (2017), pp. 1811-1820 Research India Publications http://www.ripublication.com Classification Using ANN:
More informationThe Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms
IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence
More informationPUBLIC INFORMATION POLICY
CALIFORNIA STATE POLYTECHNIC UNIVERSITY, POMONA Landscape Architecture College of Environmental Design PUBLIC INFORMATION POLICY Landscape Architecture Accreditation Board (LAAB) accredited programs are
More informationLaboratorio di Intelligenza Artificiale e Robotica
Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning
More informationAustralia s tertiary education sector
Australia s tertiary education sector TOM KARMEL NHI NGUYEN NATIONAL CENTRE FOR VOCATIONAL EDUCATION RESEARCH Paper presented to the Centre for the Economics of Education and Training 7 th National Conference
More informationSchool of Innovative Technologies and Engineering
School of Innovative Technologies and Engineering Department of Applied Mathematical Sciences Proficiency Course in MATLAB COURSE DOCUMENT VERSION 1.0 PCMv1.0 July 2012 University of Technology, Mauritius
More informationWelcome to. ECML/PKDD 2004 Community meeting
Welcome to ECML/PKDD 2004 Community meeting A brief report from the program chairs Jean-Francois Boulicaut, INSA-Lyon, France Floriana Esposito, University of Bari, Italy Fosca Giannotti, ISTI-CNR, Pisa,
More informationOrganizational Knowledge Distribution: An Experimental Evaluation
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 24 Proceedings Americas Conference on Information Systems (AMCIS) 12-31-24 : An Experimental Evaluation Surendra Sarnikar University
More information