DATA ANALYSIS IN ROAD ACCIDENTS USING ANN AND DECISION TREE
|
|
- Caroline Blankenship
- 5 years ago
- Views:
Transcription
1 International Journal of Civil Engineering and Technology (IJCIET) Volume 9, Issue 4, April 2018, pp , Article ID: IJCIET_09_04_023 Available online at ISSN Print: and ISSN Online: IAEME Publication Scopus Indexed DATA ANALYSIS IN ROAD ACCIDENTS USING ANN AND DECISION TREE Roop Kumar R PG Scholar, Department of Computer Science, CHRIST (Deemed to be University), Bengaluru, India Ramamurthy B Associate Professors, Department of Computer Science, CHRIST (Deemed to be University), Bengaluru, India. ABSTRACT Road accidents have become some of the main causes for fatal death globally. A report tells that road accident is the major cause for high death rate other than wars and diseases. A study by World Health Organization (WHO), Global status report on road safety 2015 says over 1.24 million people die every year due to road accidents worldwide and it even predicts by 2020 this number can even increase by 20-50%. This can affect the GDP of the Country, for developing countries this can affect adversely. This paper shows the use of data analytics techniques to build a prediction model for road accidents, so that these models can be used in real time scenario to make some policies and avoid accidents. This paper has identified the attributes which has high impact on accident severity class label. Keywords: Road accident, Data analysis, ANN, Decision tree, Machine learning, Prediction/Classification, Back Propagation. Cite this Article: Roop Kumar R and Ramamurthy B, Data Analysis in Road Accidents Using ANN and Decision Tree, International Journal of Civil Engineering and Technology, 9(4), 2018, pp INTRODUCTION Over 3,400 people die every day and 10,00,000 people are injured or disabled every year in road accident worldwide. These numbers are very large compared to the average death rate of the world. According to WHO, road accident is ranked as the 10 th leading cause for death in the year 2015 with 18 deaths per 1,00,000 [1]. India is Ranked as first in road death in the world [2]. Global status report on road safety 2013 say more than points are estimated number of death every year in India alone due to road accidents, these numbers are higher than the number of people killed through the wars in our country. There are18.9 people per 1,00,000 killed in accidents [3]. As a developing country, these road accidents can affect the country s economy editor@iaeme.com
2 Roop Kumar R and Ramamurthy B The proposed paper analyses the road accident data collected. Using various data analytics methodologies. Identifying the factors impacting the accident severity of the given data by using information gain measure and chi square measure. These road accidents can be prevented and their overall impact on the country and its development can reduce drastically if proper models are applied and policies made. Before deciding upon the policies and models we first have to analyse the causes and impact factors. This can be done by using Data Analysis techniques. The data then extracted should be converted into a more useful model or policy making algorithm, this part is where machine learning comes into picture. Data analysis is a technique which helps to discover different knowledge available in the data. It gives insight views of the data, so that the data becomes more informative and useful form for business or domain perspective. There different types of learning technique available like Supervised learning and Unsupervised learning. These techniques can be used in road accident data to find different views of the data. Machine learning is a branch of AI which helps to build intelligent system by making it learn from the existing cases, and experience for better system. Most of the techniques are nature inspired. These concepts are eventually available in Data analysis as different classifier and clustering algorithms; which can be used for building prediction model. These algorithms are self-learner where the system learns itself without any definitive program. This paper uses machine learning concepts for data analysis in road accidents to build an intelligent system that can predict future base on the available data used for training the system. Road accident have become some of the major health concerns due to increase in the fatal deaths and injuries. Road accident data is collected and analysis is done to find the pattern available in the data. Before starting the analysis, the data is first pre-processed according to the requirement of the user for future purpose. Identifying the intensity of the attribute in the accident from the available data set. The key objective of data analysis in road accident is to identify the key factor for the accident and form some policies that would reduce the accident level. There are four major factors which leads to road traffic and accidents such as [4]: Driver factors Vehicle factors Road factors Environment factors 2. LITERATURE SURVEY There are different data mining techniques available for analysis of road accidents. This paper compares the accuracy of different technique and models for road accident, proposed by different researchers. Some of the techniques are: Random forest, Rough set Decision tree Artificial neural network Naïve Bayes, etc. Of which Naïve Bayes, Decision tree gives better accuracy for the respective data [5] editor@iaeme.com
3 Data Analysis in Road Accidents Using ANN and Decision Tree The author in this paper identifying the key factors for accident by a proposed framework, the data is being pre-processed, clustered using clustering algorithms like Euclidean distance, dynamic time wrapping, triangle distance metric and hierarchical cluster analysis for a time series and identify the trend of accident using trend analysis for 2 different district and identify the similarities between them and come to conclusion. It uses trend analysis for identifying the trend in the pre-processed data set. It is difficult and time consuming to analyse every time series of every cluster. [6] This paper has an in-depth study focusing on the application of event analysis through investigating the accident details and reconstructing the scenario. The goal of this research study are as follow: 1. To identify the factors contributing based on the findings obtained from crash investigation and reconstruction by using a case study; 2. To apply an event analysis in establishing the links between the events to describe the crash scenario based on the available information. The proposed model makes a conceptual view of the accidents and then analyses the data for the accuracy. It makes in depth study about the accident. It identifies the key factory accurately because of the reconstruction of the accident from the report given from the people present in that place at that time. [7] This paper deals with prediction of traffic incident duration such that the driver gets prior information. Neural Network method of prediction is used to build the model. This paper uses the real traffic incident data for building the prediction model, 660 Records where used for training the model and 170 Records where used for testing the built model. The result generated had 85.35% of accuracy with the actual result. [8] This paper uses Rough set theory which is a kind of uncertainty analysis method. Initially the information decision table is created using the available data set, then simplified algorithm of rough set model is used to calculate the degree of the attributes, importance of different factors to their corresponding accident morphologies. [9] The researcher has used discovery algorithm to identify useful insight from road accident dataset having multiple attributes. Unlike classification learning, subgroup discovery pursues rules of not the accuracy but the generality and unusualness. Depending on the factors of the data we are focusing our attention to, we may combine multiple relevant features of interest to make a synthetic target feature, and give it to the subgroup discovery algorithm. After a set of rules is derived, some post processing steps are taken to make the ruleset more compact and easier to understand. [10] 3. METHODS The proposed paper uses two prediction models namely artificial neural network and decision tree for predicting the outcome of the accident data Artificial Neural Network (ANN) is supervised learning used for classification of the result based on the model built using training data. It is nature inspired concept imitating the brain cell, where every neuron individually processes data and provide input for next level of neuron. The common goal is to predict the values based on trained data. ANN is one of the important concepts of deep learning where large sized data set is used for building better classification model. Compared with all other prediction model ANN provides the best model in most of the cases. In the proposed paper neuralnet package in R is used [11], it uses back propagation method where the actual and predicted results are compared and based on the editor@iaeme.com
4 Roop Kumar R and Ramamurthy B incorrectness, the weight of the nodes are adjusted. This is also known as feedback method. [12] Decision Tree is one of the simple and commonly used prediction model. The tree consists multiple if else rule for classifying the output for a given input. Decision tree provides simple rules which is easy to understand. The model provides good results when the size of the data is limited. Decision tree resemble human reasoning and mapping of data [13]. The package used is C50 in R which is also rule based model, this model is an extended version of C4.5 package in R which uses Shannon entropy for better information gain node. the C50 package can handle both numeric and other non-numeric data types also. [14] The dataset used in this study is obtained from the UK data service. This data is the record of accidents that occurred in the year 2013 throughout UK. The dataset consists of records and 30 attributes. The class label attributes used in this data set for research purpose is Accident Severity. Figure 1 shows the procedure followed for building the model. Figure 1: Flow diagram of the model building process Data Collection: In this phase road accident related data for the year 2013 in United Kingdom is collected from UKDA, UK data service website, which stores data of the major surveys taken under UK government, provided for researchers and business analysts for research purpose. [15] Data Pre-processing: Pre-processing helps in transforming the data into required form. To reduce the complexity of the data, remove the unwanted attributes, that improves the accuracy of the result, Data sampling is performed to train and test the model for available data set. In this process 2 major methods are followed: Data field selection: Based on the Domain knowledge the fields are selected for further process in the analysis. Data Cleaning: The missing values and outliers in the data is removed so that the quality of the model is maintained. Data transformation is the end result of the Data pre-processing phase where the actual data is converted into the required form for analysis purpose editor@iaeme.com
5 Data Analysis in Road Accidents Using ANN and Decision Tree Table 1 Shows the list of attributes available in the data set before pre-processing Table 1 Attributes of the data set before pre-processing Accident Index Number Police Force Code Accident Severity(class label) Number of vehicles Number of Casualties Date, Month, Year Time(hour, Minute of the accident) Local Authority Location(OSGR)Northing,Easting 1 st Road details(road class, Road Speed limit Junction details number, Road type) Junction Control 2 nd Road details(road class, Road number) Pedestrian Crossing Details(human control, physical facilities) Light condition Weather condition Road surface condition Special condition at site Carriage way hazards Did police attend the Scene To reduce the complexity of the model built, the dimensionality is reduced in data based on the domain knowledge and other dimensionality reduction techniques. Table 2 shows the list of attributes used for analysis. The attributes where modified for model building. Table 2 Pre-processed attributes details used for analysis Attribute Name Possible values Number of Vehicles 1:one vehicle; 2:two vehicles;3:three vehicles; 4: four or more vehicles(vehicles involved in accident) Quartile 1:january-march; 2:april-june; 3:july-september; 4:october-december Time period 0:day ;2:night State 1: England;2: Wale;3: Scotland 1 st Road class 1-6: road classes available in UK Junction details 0-9: details of the nearby junction Junction control 0-4: junction control details 2 nd Road class 1-6: road classes available in UK Pedestrian Crossing-Physical Facilities 0:false; 1: true Light conditions 1: day light; 4: darkness-lights lit; 5: darkness-lights unlit; 6: darkness-no lighting; 7: darkness-lighting unknown Weather condition 1-6: weather condition in the accident area Road surface condition 0-5:condition of the road Special conditions at site 0:false;1:true Accident severity(class label) 1:Fatal;2:Serious;3:Slight Model building: Using Artificial Neural Network and Decision tree algorithms models are built. There are two level process that occurs while building a model where data is split into two proportion. Training level: Prediction model is built based on the previous data available for the specific domain. Testing level: The accuracy of the model built in training level is measured by comparing the actual classification value and predicted value. Model extraction: The built model is extracted and used for future prediction of the given scenario. Identifying the key factors / attributes that affect the class label, accident severity. 4. EXPERIMENT RESULTS The experiment conducted used 4841 records and 14 attribute including the class label. Since the data is not evenly distributed, sampling was performed based on the different criteria correlating with the class label. The result achieved using the C50 method in R tool using all attributes was 79.8% which is better when compared with the paper proposed by Dipo T editor@iaeme.com
6 Roop Kumar R and Ramamurthy B Akomolafe et al [16] which gave 77.7% using ID3 method in decision tree. C50 has more additional feature than ID3 method. The number of rules generated is 292.The following table shows the attribute usage of in the decision tree. Table 3 provides the list of attributes and their usage percentage in building the model. Table 3 Decision tree attribute usage Attribute name Percentage Light Conditions 100% 1 st Road Class 92.80% Quartile 90.71% 2 nd Road Class 90.82% Number of Vehicles 86.93% State 74.11% Pedestrian Crossing-Physical Facilities 56.16% Time period 53.02% Junction Details 45.62% Junction control 37.69% Road Surface Conditions 27.69% Weather Conditions 26.16% Neuralnet method was implemented for the same data set for classifying the accident severity, the error reached in steps with 4 nodes in the hidden layer. The accuracy of this model is 79% the data even include the time attribute to find the trend [6]. Liping Guan et al proposed work where various details about accident was collected and NN model was built to predict accident duration [8]. But the proposed model looks the data in different dimension. To predict the accident severity which is more important, based on these pattern preventive measures can be taken. Figure2 shows the model generated using the backpropagation algorithm in ANN. Figure 2 Neural network model built for road accident editor@iaeme.com
7 Data Analysis in Road Accidents Using ANN and Decision Tree 5. CONCLUSIONS This paper shows the use of Machine learning concepts for building a prediction model in road accident data. Two basic models are used, artificial neural network and decision tree. Both models emulate human thought process. Since the data used was real time records building a generic model with higher accuracy is more difficult. So data sampling is performed on the dataset in pre-processing phase based on the class label, State, date. Based on the decision tree generated, rules are extracted. The experiment shows ANN gives better performance compared with the decision tree for the same sample. As the countries are moving towards digital era, when accident data is recorded in a tabular format with appropriate attributes, these data can be used for data analysis for a particular region, so appropriate policies and preventive measure can be taken. REFERENCES [1] World Health Organization, Global Health Observatory (GHO) data, 19 July [2] Auto economic times, India ranks first in road deaths in the world. 19 July 2017, [3] Global status report on road safety2013: supporting a decade of action, World Health Organization,2013 [4] Rui Tian and Zhaosheng Yang, Method of road traffic accidents causes analysis based on Data Mining, IEEE, conference paper, 2010 [5] Maninder Singh and Amrit Kaur, A Review on road accident in traffic system using data mining techniques, International Journal of Science and research, 2016, 5(1), pp [6] Sachin Kumar and Durga Toshniwal, A novel framework to analyze road accident time series data, SpringerOpen, 3(8), 2016 [7] Mouyid Bin Islam and Kunnawee Kanitpong, Identification of Factors in Road Accidents Through In-Depth Accident Analysis, IATSS Research, 2008, 32(2), pp [8] Liping Guan et al, Traffic Incident Duration Prediction Based on Artificial Neural Network, International Conference on Intelligent Computation Technology and Automation, 2010 [9] Tao Gang et al, Cause Analysis of Traffic Accidents Based on Degrees of Attribute Importance of Rough Set, 2015, pp [10] Jeongmin Kim et al, Mining Traffic accident data by subgroup discovery using combinatorial targets, IEEE, 2015 [11] Cran.r, neuralnet package,7 January [12] KurtHornik et al, Multilayer feedforward networks are universal approximators, Elsevier,1989, 2(5), pp editor@iaeme.com
8 Roop Kumar R and Ramamurthy B [13] S. B. Kotsiantis, Decision trees: a recent overview, Springer, 2011, 39(4), pp [14] Cran.r, C50 package,7 January [15] Department for Transport. Road Accident Statistics Branch, Road Accident Data, 2013 [computer file]. Colchester, Essex: UK Data Archive [distributor], September SN: 7550, [16] Dipo T. Akomolafe, Akinbola Olutayo, Using Data Mining Technique to Predict Cause of Accident and Accident Prone Locations on Highways, American Journal of Database Theory and Application, 2012, 1(1), pp [17] Jasvinder Singh, Mahipal Singh, Anil Baliram Ghubade and Manjinder Singh Analytical Hierarchy Process for Ro ad Accident of Motorcycle in India: A Case Study. International Journal of Mechanical Engineering and Technology, 8(7), 2017, pp [18] Aishwarya Patil and Deepthi Das, Comparative Analysis and Suggestion of Architectures for Reduction of Road Accidents, International Journal of Civil Engineering and Technology, 9(3), 2018, pp editor@iaeme.com
Python Machine Learning
Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled
More informationModule 12. Machine Learning. Version 2 CSE IIT, Kharagpur
Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should
More informationRule Learning With Negation: Issues Regarding Effectiveness
Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United
More informationRule Learning with Negation: Issues Regarding Effectiveness
Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX
More informationThe 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X
The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,
More informationAustralian Journal of Basic and Applied Sciences
AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Feature Selection Technique Using Principal Component Analysis For Improving Fuzzy C-Mean
More informationModeling user preferences and norms in context-aware systems
Modeling user preferences and norms in context-aware systems Jonas Nilsson, Cecilia Lindmark Jonas Nilsson, Cecilia Lindmark VT 2016 Bachelor's thesis for Computer Science, 15 hp Supervisor: Juan Carlos
More informationOn-Line Data Analytics
International Journal of Computer Applications in Engineering Sciences [VOL I, ISSUE III, SEPTEMBER 2011] [ISSN: 2231-4946] On-Line Data Analytics Yugandhar Vemulapalli #, Devarapalli Raghu *, Raja Jacob
More informationPredicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks
Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com
More informationLecture 1: Machine Learning Basics
1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3
More informationHuman Emotion Recognition From Speech
RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati
More informationA Neural Network GUI Tested on Text-To-Phoneme Mapping
A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis
More informationQuickStroke: An Incremental On-line Chinese Handwriting Recognition System
QuickStroke: An Incremental On-line Chinese Handwriting Recognition System Nada P. Matić John C. Platt Λ Tony Wang y Synaptics, Inc. 2381 Bering Drive San Jose, CA 95131, USA Abstract This paper presents
More informationMining Association Rules in Student s Assessment Data
www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama
More informationCS Machine Learning
CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing
More informationLearning Methods for Fuzzy Systems
Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8
More informationSystem Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks
System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering
More informationTwitter Sentiment Classification on Sanders Data using Hybrid Approach
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders
More informationIssues in the Mining of Heart Failure Datasets
International Journal of Automation and Computing 11(2), April 2014, 162-179 DOI: 10.1007/s11633-014-0778-5 Issues in the Mining of Heart Failure Datasets Nongnuch Poolsawad 1 Lisa Moore 1 Chandrasekhar
More informationTest Effort Estimation Using Neural Network
J. Software Engineering & Applications, 2010, 3: 331-340 doi:10.4236/jsea.2010.34038 Published Online April 2010 (http://www.scirp.org/journal/jsea) 331 Chintala Abhishek*, Veginati Pavan Kumar, Harish
More informationArtificial Neural Networks written examination
1 (8) Institutionen för informationsteknologi Olle Gällmo Universitetsadjunkt Adress: Lägerhyddsvägen 2 Box 337 751 05 Uppsala Artificial Neural Networks written examination Monday, May 15, 2006 9 00-14
More informationCLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH
ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department
More informationAxiom 2013 Team Description Paper
Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association
More informationAutomating the E-learning Personalization
Automating the E-learning Personalization Fathi Essalmi 1, Leila Jemni Ben Ayed 1, Mohamed Jemni 1, Kinshuk 2, and Sabine Graf 2 1 The Research Laboratory of Technologies of Information and Communication
More informationEvolution of Symbolisation in Chimpanzees and Neural Nets
Evolution of Symbolisation in Chimpanzees and Neural Nets Angelo Cangelosi Centre for Neural and Adaptive Systems University of Plymouth (UK) a.cangelosi@plymouth.ac.uk Introduction Animal communication
More informationReducing Features to Improve Bug Prediction
Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science
More informationKnowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute
Page 1 of 28 Knowledge Elicitation Tool Classification Janet E. Burge Artificial Intelligence Research Group Worcester Polytechnic Institute Knowledge Elicitation Methods * KE Methods by Interaction Type
More informationHIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION
HIERARCHICAL DEEP LEARNING ARCHITECTURE FOR 10K OBJECTS CLASSIFICATION Atul Laxman Katole 1, Krishna Prasad Yellapragada 1, Amish Kumar Bedi 1, Sehaj Singh Kalra 1 and Mynepalli Siva Chaitanya 1 1 Samsung
More informationAGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS
AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS 1 CALIFORNIA CONTENT STANDARDS: Chapter 1 ALGEBRA AND WHOLE NUMBERS Algebra and Functions 1.4 Students use algebraic
More informationEvolutive Neural Net Fuzzy Filtering: Basic Description
Journal of Intelligent Learning Systems and Applications, 2010, 2: 12-18 doi:10.4236/jilsa.2010.21002 Published Online February 2010 (http://www.scirp.org/journal/jilsa) Evolutive Neural Net Fuzzy Filtering:
More informationPh.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and
Name Qualification Sonia Thomas Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept. 2016. M.Tech in Computer science and Engineering. B.Tech in
More informationScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 98 (2016 ) 368 373 The 6th International Conference on Current and Future Trends of Information and Communication Technologies
More informationAssignment 1: Predicting Amazon Review Ratings
Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for
More informationCSL465/603 - Machine Learning
CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am
More informationA Case Study: News Classification Based on Term Frequency
A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center
More informationCourse Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE
EE-589 Introduction to Neural Assistant Prof. Dr. Turgay IBRIKCI Room # 305 (322) 338 6868 / 139 Wensdays 9:00-12:00 Course Outline The course is divided in two parts: theory and practice. 1. Theory covers
More informationDeep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach
#BaselOne7 Deep search Enhancing a search bar using machine learning Ilgün Ilgün & Cedric Reichenbach We are not researchers Outline I. Periscope: A search tool II. Goals III. Deep learning IV. Applying
More informationLaboratorio di Intelligenza Artificiale e Robotica
Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning
More informationWelcome to. ECML/PKDD 2004 Community meeting
Welcome to ECML/PKDD 2004 Community meeting A brief report from the program chairs Jean-Francois Boulicaut, INSA-Lyon, France Floriana Esposito, University of Bari, Italy Fosca Giannotti, ISTI-CNR, Pisa,
More informationWord Segmentation of Off-line Handwritten Documents
Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department
More informationDeveloping True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability
Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Shih-Bin Chen Dept. of Information and Computer Engineering, Chung-Yuan Christian University Chung-Li, Taiwan
More informationApplications of data mining algorithms to analysis of medical data
Master Thesis Software Engineering Thesis no: MSE-2007:20 August 2007 Applications of data mining algorithms to analysis of medical data Dariusz Matyja School of Engineering Blekinge Institute of Technology
More informationADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF
Read Online and Download Ebook ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Click link bellow and free register to download
More informationIntroduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition
Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and
More informationCS 446: Machine Learning
CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt
More informationINPE São José dos Campos
INPE-5479 PRE/1778 MONLINEAR ASPECTS OF DATA INTEGRATION FOR LAND COVER CLASSIFICATION IN A NEDRAL NETWORK ENVIRONNENT Maria Suelena S. Barros Valter Rodrigues INPE São José dos Campos 1993 SECRETARIA
More informationGenerative models and adversarial training
Day 4 Lecture 1 Generative models and adversarial training Kevin McGuinness kevin.mcguinness@dcu.ie Research Fellow Insight Centre for Data Analytics Dublin City University What is a generative model?
More informationWE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT
WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working
More informationImpact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees
Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Mariusz Łapczy ski 1 and Bartłomiej Jefma ski 2 1 The Chair of Market Analysis and Marketing Research,
More informationClassification Using ANN: A Review
International Journal of Computational Intelligence Research ISSN 0973-1873 Volume 13, Number 7 (2017), pp. 1811-1820 Research India Publications http://www.ripublication.com Classification Using ANN:
More informationUnit 7 Data analysis and design
2016 Suite Cambridge TECHNICALS LEVEL 3 IT Unit 7 Data analysis and design A/507/5007 Guided learning hours: 60 Version 2 - revised May 2016 *changes indicated by black vertical line ocr.org.uk/it LEVEL
More informationAnalysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems
Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems Ajith Abraham School of Business Systems, Monash University, Clayton, Victoria 3800, Australia. Email: ajith.abraham@ieee.org
More informationSeminar - Organic Computing
Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts
More informationMaximizing Learning Through Course Alignment and Experience with Different Types of Knowledge
Innov High Educ (2009) 34:93 103 DOI 10.1007/s10755-009-9095-2 Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Phyllis Blumberg Published online: 3 February
More informationProduct Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments
Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Vijayshri Ramkrishna Ingale PG Student, Department of Computer Engineering JSPM s Imperial College of Engineering &
More informationEdexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE
Edexcel GCSE Statistics 1389 Paper 1H June 2007 Mark Scheme Edexcel GCSE Statistics 1389 NOTES ON MARKING PRINCIPLES 1 Types of mark M marks: method marks A marks: accuracy marks B marks: unconditional
More informationLecture 1: Basic Concepts of Machine Learning
Lecture 1: Basic Concepts of Machine Learning Cognitive Systems - Machine Learning Ute Schmid (lecture) Johannes Rabold (practice) Based on slides prepared March 2005 by Maximilian Röglinger, updated 2010
More informationSINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,
More informationBusiness Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence
Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence COURSE DESCRIPTION This course presents computing tools and concepts for all stages
More informationOCR for Arabic using SIFT Descriptors With Online Failure Prediction
OCR for Arabic using SIFT Descriptors With Online Failure Prediction Andrey Stolyarenko, Nachum Dershowitz The Blavatnik School of Computer Science Tel Aviv University Tel Aviv, Israel Email: stloyare@tau.ac.il,
More informationMYCIN. The MYCIN Task
MYCIN Developed at Stanford University in 1972 Regarded as the first true expert system Assists physicians in the treatment of blood infections Many revisions and extensions over the years The MYCIN Task
More informationSARDNET: A Self-Organizing Feature Map for Sequences
SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu
More information*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE. Proceedings of the 9th Symposium on Legal Data Processing in Europe
*** * * * COUNCIL * * CONSEIL OFEUROPE * * * DE L'EUROPE Proceedings of the 9th Symposium on Legal Data Processing in Europe Bonn, 10-12 October 1989 Systems based on artificial intelligence in the legal
More informationLaboratorio di Intelligenza Artificiale e Robotica
Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning
More informationOPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS
OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,
More informationCircuit Simulators: A Revolutionary E-Learning Platform
Circuit Simulators: A Revolutionary E-Learning Platform Mahi Itagi Padre Conceicao College of Engineering, Verna, Goa, India. itagimahi@gmail.com Akhil Deshpande Gogte Institute of Technology, Udyambag,
More informationKamaldeep Kaur University School of Information Technology GGS Indraprastha University Delhi
Soft Computing Approaches for Prediction of Software Maintenance Effort Dr. Arvinder Kaur University School of Information Technology GGS Indraprastha University Delhi Kamaldeep Kaur University School
More informationMultimedia Courseware of Road Safety Education for Secondary School Students
Multimedia Courseware of Road Safety Education for Secondary School Students Hanis Salwani, O 1 and Sobihatun ur, A.S 2 1 Universiti Utara Malaysia, Malaysia, hanisalwani89@hotmail.com 2 Universiti Utara
More informationReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology
ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology Tiancheng Zhao CMU-LTI-16-006 Language Technologies Institute School of Computer Science Carnegie Mellon
More informationProbabilistic Latent Semantic Analysis
Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview
More informationAn OO Framework for building Intelligence and Learning properties in Software Agents
An OO Framework for building Intelligence and Learning properties in Software Agents José A. R. P. Sardinha, Ruy L. Milidiú, Carlos J. P. Lucena, Patrick Paranhos Abstract Software agents are defined as
More informationLearning From the Past with Experiment Databases
Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University
More informationArtificial Neural Networks
Artificial Neural Networks Andres Chavez Math 382/L T/Th 2:00-3:40 April 13, 2010 Chavez2 Abstract The main interest of this paper is Artificial Neural Networks (ANNs). A brief history of the development
More informationData Fusion Through Statistical Matching
A research and education initiative at the MIT Sloan School of Management Data Fusion Through Statistical Matching Paper 185 Peter Van Der Puttan Joost N. Kok Amar Gupta January 2002 For more information,
More informationSwitchboard Language Model Improvement with Conversational Data from Gigaword
Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword
More informationSoftprop: Softmax Neural Network Backpropagation Learning
Softprop: Softmax Neural Networ Bacpropagation Learning Michael Rimer Computer Science Department Brigham Young University Provo, UT 84602, USA E-mail: mrimer@axon.cs.byu.edu Tony Martinez Computer Science
More informationA cognitive perspective on pair programming
Association for Information Systems AIS Electronic Library (AISeL) AMCIS 2006 Proceedings Americas Conference on Information Systems (AMCIS) December 2006 A cognitive perspective on pair programming Radhika
More informationRule discovery in Web-based educational systems using Grammar-Based Genetic Programming
Data Mining VI 205 Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming C. Romero, S. Ventura, C. Hervás & P. González Universidad de Córdoba, Campus Universitario de
More informationLinking Task: Identifying authors and book titles in verbose queries
Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,
More informationCalibration of Confidence Measures in Speech Recognition
Submitted to IEEE Trans on Audio, Speech, and Language, July 2010 1 Calibration of Confidence Measures in Speech Recognition Dong Yu, Senior Member, IEEE, Jinyu Li, Member, IEEE, Li Deng, Fellow, IEEE
More informationLongest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. IV (Nov Dec. 2015), PP 01-07 www.iosrjournals.org Longest Common Subsequence: A Method for
More informationFragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing
Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing D. Indhumathi Research Scholar Department of Information Technology
More informationAUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS
AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS Md. Tarek Habib 1, Rahat Hossain Faisal 2, M. Rokonuzzaman 3, Farruk Ahmed 4 1 Department of Computer Science and Engineering, Prime University,
More information(Sub)Gradient Descent
(Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include
More informationAUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS
AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS R.Barco 1, R.Guerrero 2, G.Hylander 2, L.Nielsen 3, M.Partanen 2, S.Patel 4 1 Dpt. Ingeniería de Comunicaciones. Universidad de Málaga.
More informationOntologies vs. classification systems
Ontologies vs. classification systems Bodil Nistrup Madsen Copenhagen Business School Copenhagen, Denmark bnm.isv@cbs.dk Hanne Erdman Thomsen Copenhagen Business School Copenhagen, Denmark het.isv@cbs.dk
More informationGrade 6: Correlated to AGS Basic Math Skills
Grade 6: Correlated to AGS Basic Math Skills Grade 6: Standard 1 Number Sense Students compare and order positive and negative integers, decimals, fractions, and mixed numbers. They find multiples and
More informationUsing the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT
The Journal of Technology, Learning, and Assessment Volume 6, Number 6 February 2008 Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the
More informationCitrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world
Citrine Informatics The data analytics platform for the physical world The Latest from Citrine Summit on Data and Analytics for Materials Research 31 October 2016 Our Mission is Simple Add as much value
More informationCREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT
CREATING SHARABLE LEARNING OBJECTS FROM EXISTING DIGITAL COURSE CONTENT Rajendra G. Singh Margaret Bernard Ross Gardler rajsingh@tstt.net.tt mbernard@fsa.uwi.tt rgardler@saafe.org Department of Mathematics
More informationMedical Complexity: A Pragmatic Theory
http://eoimages.gsfc.nasa.gov/images/imagerecords/57000/57747/cloud_combined_2048.jpg Medical Complexity: A Pragmatic Theory Chris Feudtner, MD PhD MPH The Children s Hospital of Philadelphia Main Thesis
More informationOn Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC
On Human Computer Interaction, HCI Dr. Saif al Zahir Electrical and Computer Engineering Department UBC Human Computer Interaction HCI HCI is the study of people, computer technology, and the ways these
More informationMachine Learning from Garden Path Sentences: The Application of Computational Linguistics
Machine Learning from Garden Path Sentences: The Application of Computational Linguistics http://dx.doi.org/10.3991/ijet.v9i6.4109 J.L. Du 1, P.F. Yu 1 and M.L. Li 2 1 Guangdong University of Foreign Studies,
More informationMathematics process categories
Mathematics process categories All of the UK curricula define multiple categories of mathematical proficiency that require students to be able to use and apply mathematics, beyond simple recall of facts
More informationIndian Institute of Technology, Kanpur
Indian Institute of Technology, Kanpur Course Project - CS671A POS Tagging of Code Mixed Text Ayushman Sisodiya (12188) {ayushmn@iitk.ac.in} Donthu Vamsi Krishna (15111016) {vamsi@iitk.ac.in} Sandeep Kumar
More informationUniversidade do Minho Escola de Engenharia
Universidade do Minho Escola de Engenharia Universidade do Minho Escola de Engenharia Dissertação de Mestrado Knowledge Discovery is the nontrivial extraction of implicit, previously unknown, and potentially
More informationThe feasibility, delivery and cost effectiveness of drink driving interventions: A qualitative analysis of professional stakeholders
Abstract The feasibility, delivery and cost effectiveness of drink driving interventions: A qualitative analysis of Miss Hollie Wilson, Dr Gavan Palk, Centre for Accident Research & Road Safety Queensland
More informationCustomized Question Handling in Data Removal Using CPHC
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 29-34 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Customized
More informationRadius STEM Readiness TM
Curriculum Guide Radius STEM Readiness TM While today s teens are surrounded by technology, we face a stark and imminent shortage of graduates pursuing careers in Science, Technology, Engineering, and
More informationUnsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model
Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Xinying Song, Xiaodong He, Jianfeng Gao, Li Deng Microsoft Research, One Microsoft Way, Redmond, WA 98052, U.S.A.
More information