DATA MINING MODEL FOR DOMAIN


Miss. Amruta R. Jadhav, Miss. Rohini V. Pillai
Students, Computer Engineering, NMIET, Pune, India

ABSTRACT: In engineering colleges today, domain selection for the final-year project is not taken seriously enough, and the manual selection procedure consumes an unnecessarily long time (around 2-3 months). Final-year students must either find the details of each domain on their own or put their queries to the respective teachers, and it is tedious and time consuming for teachers to explain the same domain to every individual group. This is where our system comes in. The system has a student module whose domain sub-module lets students browse the list of domain types and read an in-depth description of each one. We also provide an aptitude test on the domains, so that a student's interest in a particular domain can be calculated. In this module students also enter their percentage marks (transcripts). This data is displayed to teachers in their login panel, together with the domain aptitude result of every individual student and the list of students who have domains in common. These results help teachers group students by domain. Teachers can also combine two domains that are related to each other (e.g. data mining, databases, network security), and with them the corresponding students. In the current system, the Training and Placement Officer (TPO) of a college collects the details of each campus drive and forwards the same mail to all students, even those who are not eligible for that drive. TPOs also keep hard copies of each student's data individually, so sorting eligible from non-eligible candidates is time consuming. Our system works smartly in this area as well.
We provide dynamic categorization, in which TPOs have the details of students such as academic marks and extracurricular activities (technical and non-technical). The TPO acts as a strong bridge between industry and students. Because the data is available on a web portal, the TPO can forward campus drive information only to the eligible students. A further module lets students share study-related material (PPTs, PDFs, audio, videos) with other students as well as with teachers.

KEYWORDS: Data Mining, Co-operative Learning, Domain Selection, C4.5

1. INTRODUCTION

Every year, many engineering students need to prepare themselves for their major/final-year project. The final-year project plays an important role in demonstrating the learning outcomes of the modules that students have taken during their studies. We are going to implement our own system to help with the domain selection process for students' final-year projects. An aptitude test based on the domains is conducted in the student module. The domain selection result is displayed in the teacher module so that teachers can understand it and divide students into groups according to the resulting domain. Another module, the TPO module, works as a bridge between students and companies. It helps find the students eligible for a particular campus drive in less time; only the students who are eligible for a drive are informed about it by mail. Based on the results, several recommendations are offered to improve student performance. The system is feasible both for handling the work of TPO centres accurately and for finding project topics in colleges. Among related methods, [9] presents a class of linear multi-regression models designed to create models that are personalized to each student and that take into account a large number of features relating to a student's past performance, course characteristics, and the student's engagement and effort.
These models estimate a small number of regression models that are shared across the different students, along with student-specific linear combination functions to facilitate personalization. A preliminary evaluation on a large set of students, courses, and activities shows that these models are capable of improving performance prediction.

In the current oral system of project selection, choosing an appropriate academic project is very time consuming, taking months for both students and teachers. Students facing it for the first time are often still confused about what a domain actually means. Due to lack of knowledge about the domains, most students choose a domain at random and then fail to understand and work in it. The system therefore aims at proper domain understanding and selection for academic BE projects. Its output is the resulting domain for each individual student according to the test results, as well as the groups of students who have a domain in common after the test. The marks entered by the students are visible to the TPO, who in turn can send information by mail to the students who meet the eligibility standards. The list and description of each domain give students a clear idea for the selection process. The aptitude test on domain selection removes much of the confusion and wasted time. It lets students understand their real interests and suggests a suitable domain in which a particular student can at least do research according to his/her interests. We also added one more module in which students can share their knowledge, or information related to education, with their classmates as well as teachers. At present the TPO always has to send campus information to all students. It often happens that students who do not meet the criteria of a drive still ask the respective TPO about it, so it is difficult to handle a large number of students while giving the same information every time.

Copyright to IJIRCCE. DOI: 10.15680/IJIRCCE.2016.0411001

2. RELATED WORK

References [1] and [2] describe the use of decision tree and rule induction in data mining applications. Of the methods for classification and regression that have been developed in the fields of pattern recognition, statistics, and machine learning, these are of particular interest for data mining because they use symbolic and interpretable representations. This aspect is useful in industrial and commercial applications [1], [2]. Bayesian network classifiers are applied to predict students' academic performance and to build a model.
This model helps to identify, at an early stage, drop-outs and students who need special attention, and allows the teacher to provide appropriate guidance. In addition, accurate prediction is useful in many different contexts: for example, identifying exceptional students for scholarships, and identifying weak students who are likely to fail, which is critical for allocating limited tutoring resources [3]. Reference [4] explains what data mining is and covers frequent pattern mining, clustering, classification, probabilistic classification, and decision tree classifiers with examples, which is useful for understanding data mining. Co-operative learning is used in teaching different subjects at various educational levels, from elementary to higher education. The objectives of [5] are to establish the individual learning preferences of Japanese college students and to study a variety of strategies that combine co-operative and individual learning. Reference [7] suggests one way of inducing learners' willingness to work with others, thus making co-operative learning successful and shifting many students towards favouring cooperation. Its disadvantage is that it is based only on co-operative learning; it does not focus on students' capabilities and interests according to the subjects in a particular domain area. Based on a collaborative learning experience with hundreds of students over three consecutive years, it was concluded that an approach using domain-independent learning that is transferable to current e-learning platforms helps both students and teachers to manage student collaboration better.
The approach draws on a domain-independent modelling method of collaborative learning based on data mining that helps clarify which user model issues are to be considered [8]. Regression analysis is used to examine the significance of four independent variables: cumulative grade point average prior to enrolling in intermediate accounting, grade in the introductory financial accounting class, grade in the introductory managerial accounting class, and score on a diagnostic assessment used to measure general financial accounting knowledge. Based on the results, several recommendations are offered to improve student performance [10].

3. SYSTEM ARCHITECTURE

Fig. 1 Proposed System

As shown in the architecture above, our system calculates each student's results according to their point of interest. Those results are shared with the staff and also stored in the database for further use. Every student user of the system can also share knowledge through the share-knowledge facility. The next module is for staff, who can share knowledge, get the test results of students, and track the details of the knowledge shared by every student. Every staff member can track which project domain is suggested for which student. In another module, the TPO gets help from the system in managing campus drives in the college. With the help of the system the TPO can view the details of every student, such as academic criteria, certifications, and extracurricular activities, and accordingly obtains classified student lists. This is very helpful for managing campus drives inside and outside the college. For all this, our system uses the C4.5 algorithm for data mining, together with the searching and sorting algorithms described below.

3.1 C4.5:

C4.5 is an algorithm developed by Ross Quinlan that is used to generate a decision tree. The decision trees generated by C4.5 can be used for classification, and for this reason C4.5 is often referred to as a statistical classifier. The algorithm has a few base cases:
1. All the samples in the list belong to the same class. When this happens, it simply creates a leaf node for the decision tree saying to choose that class.
2. None of the features provide any information gain. In this case, C4.5 creates a decision node higher up the tree using the expected value of the class.
3. An instance of a previously unseen class is encountered. Again, C4.5 creates a decision node higher up the tree using the expected value.

C4.5 is implemented recursively with the following sequence:
1. Check whether the algorithm satisfies the termination criteria.
2. Compute the information-theoretic criteria for all attributes.
3. Choose the best attribute according to the information-theoretic criteria.
4. Create a decision node based on the best attribute from step 3.
5. Induce (i.e. split) the dataset based on the newly created decision node from step 4.
6. For every sub-dataset from step 5, call the C4.5 algorithm to get a sub-tree (recursive call).
7. Attach the tree obtained in step 6 to the decision node from step 4.
8. Return the tree.

3.2 For Searching: Binary Search:

When the values are in sorted order, a better approach than scanning the values one by one is to use binary search. The algorithm for binary search starts by looking at the middle item x. If x is equal to v, the value being searched for, it stops and returns true. Otherwise, it uses the relative ordering of x and v to eliminate half of the array (if v is less than x, then it cannot be stored to the right of x in the array; similarly, if it is greater than x, it cannot be stored to the left of x). Once half of the array has been eliminated, the algorithm starts again by looking at the middle item in the remaining half. It stops when it finds v or when the entire array has been eliminated.

3.3 For Sorting: Quicksort:

Although the shell sort algorithm is significantly better than insertion sort, there is still room for improvement. One of the most popular sorting algorithms is quicksort. Quicksort executes in O(n log n) on average, and O(n^2) in the worst case. However, with proper precautions, worst-case behaviour is very unlikely. Quicksort is a non-stable sort. It is not an in-place sort, as stack space is required. For further reading, consult Cormen. The quicksort algorithm works by partitioning the array to be sorted, then recursively sorting each partition. In the partition step, one of the array elements is selected as a pivot value. Values smaller than the pivot value are placed to the left of the pivot, while larger values are placed to the right.

4. CONCLUSION

The system provides proper domain understanding and selection for academic BE projects. The list and description of each domain give students a clear idea for the selection process. The aptitude test lets students understand their real interests and provides the resulting domain in which a particular student can at least do research according to his/her interests. We also added one more module in which students can share their knowledge, or information related to education, with their classmates as well as teachers. This system can be used for college-level programs. The system addresses time and cost effectiveness. Although it is complex behind the scenes, it proves simple and useful to students, teachers, and TPOs (Training and Placement Officers). The resulting solution is the resulting domain according to the test results, for individual students as well as for groups of students who have domains in common after the test. The marks entered by the students are visible to the TPO, who in turn can send information by mail to the students who meet the eligibility standards.

5. FUTURE SCOPE

A system of this type can be created for 10th or 12th standard students, to help them select their future field of education by taking aptitude tests that assess their knowledge of a particular field. Such a system will help them identify their field of interest.

6. ACKNOWLEDGEMENT

I would like to extend my sincere and heartfelt gratitude to all those who have helped me in this endeavour with their active guidance, help, cooperation and encouragement.
I am extremely thankful to my project guide, Prof. Deepali Patil, for her valuable guidance and support in completing this project work, and I am also thankful to my project coordinator, Prof. Ashwini Jadhav. We would also like to thank our HOD, Prof. Shyamsunder Ingle, who always has enough time to solve everyone's problems. I also acknowledge, with a deep sense of reverence, my gratitude towards my parents and the members of my family, who have always supported me morally as well as economically.

7. REFERENCES

[1] P Amornsinlaphachai, Efficiency of data mining models to predict academic performance and a cooperative learning model. IEEE, 2016.
[2] Data mining with decision trees and decision rules, ACM.
[3] PVP Sundar, A comparative study for predicting students' academic performance using Bayesian Network Classifiers. IOSRJEN 2278-8719:3; 2013.
[4] Data mining and analysis, Library of Congress Cataloging in Publication Data, Zaki, Mohammed J., 2014.
[5] Combining co-operative learning and individual approach in Japanese college, Japanese College EFL Course, 2014.
[6] Designing a learning model using the STAD technique with a suggestion system to decrease learner's weakness, Elsevier Ltd., 2013.
[7] web-based learning environment, Elsevier Ltd., 2014.
[8] Content-free collaborative learning modelling using data mining, Artificial Intelligence Department, E.T.S.I.I., UNED, Ciudad Universitaria, 2013.
[9] Personalized Multi-Regression Models for Predicting Students' Performance in Course Activities, University of Minnesota, 2014.
[10] Determinants of Students' Performance in Intermediate Accounting, Journal of College Teaching and Learning, 2015.
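
The "information-theoretic criteria" used to choose the best attribute in Section 3.1 are entropy-based in C4.5, which ranks attributes by gain ratio (information gain divided by split information). The following is a minimal Python sketch of that criterion for categorical attributes only. The student records, attribute names, and function names are hypothetical and are not taken from the system described in this paper.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain_ratio(rows, attribute, target):
    """Information gain of splitting rows on attribute, divided by split information."""
    total = len(rows)
    # Group the target labels by the value each row has for the attribute.
    partitions = {}
    for row in rows:
        partitions.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(p) / total * entropy(p) for p in partitions.values())
    gain = entropy([row[target] for row in rows]) - remainder
    split_info = entropy([row[attribute] for row in rows])
    # An attribute with a single value cannot split the data.
    return gain / split_info if split_info > 0 else 0.0


def best_attribute(rows, attributes, target):
    """Pick the attribute with the highest gain ratio."""
    return max(attributes, key=lambda a: gain_ratio(rows, a, target))


# Hypothetical records: aptitude result, marks class, and suggested domain.
STUDENTS = [
    {"aptitude": "high", "marks": "first", "domain": "data mining"},
    {"aptitude": "high", "marks": "second", "domain": "data mining"},
    {"aptitude": "low", "marks": "first", "domain": "networking"},
    {"aptitude": "low", "marks": "second", "domain": "networking"},
]

CHOSEN = best_attribute(STUDENTS, ["aptitude", "marks"], "domain")
```

In this toy data the aptitude result separates the two domains completely while the marks class carries no information, so the sketch selects `aptitude` as the first decision node. A full C4.5 implementation would also handle continuous attributes, missing values, and pruning, which are omitted here.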

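
Sections 3.2 and 3.3 describe binary search and quicksort in prose only. The following minimal Python sketch illustrates both on a list of hypothetical percentage marks. The function names are illustrative, and the choice of the last element as pivot is an assumption, since the paper does not say how the pivot is selected.

```python
def binary_search(values, v):
    """Return True if v occurs in the sorted list values."""
    low, high = 0, len(values) - 1
    while low <= high:
        mid = (low + high) // 2
        x = values[mid]              # the middle item
        if x == v:
            return True
        if v < x:
            high = mid - 1           # v cannot be stored to the right of x
        else:
            low = mid + 1            # v cannot be stored to the left of x
    return False                     # the entire array has been eliminated


def partition(values, low, high):
    """Move values smaller than the pivot to its left; return the pivot's final index."""
    pivot = values[high]             # assumption: last element is the pivot
    i = low
    for j in range(low, high):
        if values[j] < pivot:
            values[i], values[j] = values[j], values[i]
            i += 1
    values[i], values[high] = values[high], values[i]
    return i


def quicksort(values, low=0, high=None):
    """Sort values by partitioning, then recursively sorting each partition."""
    if high is None:
        high = len(values) - 1
    if low < high:
        p = partition(values, low, high)
        quicksort(values, low, p - 1)    # the recursion is what needs stack space
        quicksort(values, p + 1, high)
    return values


# Hypothetical percentage marks entered by students.
MARKS = [72, 58, 91, 66, 80]
SORTED_MARKS = quicksort(list(MARKS))    # sort a copy, keep the input unchanged
FOUND = binary_search(SORTED_MARKS, 66)
```

Binary search is only valid on sorted data, which is why the sketch sorts the marks before searching them. With a fixed pivot position, already sorted input triggers the O(n^2) worst case; choosing the pivot at random or by median-of-three is a common precaution.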