SCHEME OF COURSE WORK

Similar documents
Mining Association Rules in Student s Assessment Data

Python Machine Learning

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Reducing Features to Improve Bug Prediction

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Mining Student Evolution Using Associative Classification and Clustering

Learning From the Past with Experiment Databases

A Case Study: News Classification Based on Term Frequency

Australian Journal of Basic and Applied Sciences

Probability and Statistics Curriculum Pacing Guide

Rule Learning With Negation: Issues Regarding Effectiveness

Analysis: Evaluation: Knowledge: Comprehension: Synthesis: Application:

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

STA 225: Introductory Statistics (CT)

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Word Segmentation of Off-line Handwritten Documents

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

CS Machine Learning

Rule Learning with Negation: Issues Regarding Effectiveness

Linking Task: Identifying authors and book titles in verbose queries

Lecture 1: Basic Concepts of Machine Learning

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Statewide Framework Document for:

Large-Scale Web Page Classification. Sathi T Marath. Submitted in partial fulfilment of the requirements. for the degree of Doctor of Philosophy

Applications of data mining algorithms to analysis of medical data

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

AQUA: An Ontology-Driven Question Answering System

Beyond the Pipeline: Discrete Optimization in NLP

Grade 6: Correlated to AGS Basic Math Skills

Unit 7 Data analysis and design

EGRHS Course Fair. Science & Math AP & IB Courses

CSL465/603 - Machine Learning

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Lecture 10: Reinforcement Learning

Issues in the Mining of Heart Failure Datasets

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

(Sub)Gradient Descent

On-Line Data Analytics

K-Medoid Algorithm in Clustering Student Scholarship Applicants

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Generative models and adversarial training

Learning Methods for Fuzzy Systems

ScienceDirect. A Framework for Clustering Cardiac Patient s Records Using Unsupervised Learning Techniques

Knowledge Elicitation Tool Classification. Janet E. Burge. Artificial Intelligence Research Group. Worcester Polytechnic Institute

AP Statistics Summer Assignment 17-18

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Undergraduate Program Guide. Bachelor of Science. Computer Science DEPARTMENT OF COMPUTER SCIENCE and ENGINEERING

GACE Computer Science Assessment Test at a Glance

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

Assignment 1: Predicting Amazon Review Ratings

Extending Place Value with Whole Numbers to 1,000,000

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Master s Programme in Computer, Communication and Information Sciences, Study guide , ELEC Majors

Predicting Outcomes Based on Hierarchical Regression

Universidade do Minho Escola de Engenharia

Multi-Lingual Text Leveling

Theory of Probability

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Human Emotion Recognition From Speech

Feature Selection based on Sampling and C4.5 Algorithm to Improve the Quality of Text Classification using Naïve Bayes

A Topic Maps-based ontology IR system versus Clustering-based IR System: A Comparative Study in Security Domain

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

The Method of Immersion the Problem of Comparing Technical Objects in an Expert Shell in the Class of Artificial Intelligence Algorithms

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach

Julia Smith. Effective Classroom Approaches to.

OFFICE SUPPORT SPECIALIST Technical Diploma

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Cross-lingual Short-Text Document Classification for Facebook Comments

Probabilistic Latent Semantic Analysis

Kenya: Age distribution and school attendance of girls aged 9-13 years. UNESCO Institute for Statistics. 20 December 2012

Mathematics subject curriculum

AUTOMATED FABRIC DEFECT INSPECTION: A SURVEY OF CLASSIFIERS

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Axiom 2013 Team Description Paper

(I couldn t find a Smartie Book) NEW Grade 5/6 Mathematics: (Number, Statistics and Probability) Title Smartie Mathematics

CS 446: Machine Learning

Conference Presentation

Knowledge-Based - Systems

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

A Comparison of Two Text Representations for Sentiment Analysis

Analysis of Emotion Recognition System through Speech Signal Using KNN & GMM Classifier

MASTER OF SCIENCE (M.S.) MAJOR IN COMPUTER SCIENCE

Statistics and Data Analytics Minor

Ensemble Technique Utilization for Indonesian Dependency Parser

Word learning as Bayesian inference

Lecture 1: Machine Learning Basics

Degree Qualification Profiles Intellectual Skills

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS APPLIED MECHANICS MET 2025

Data Modeling and Databases II Entity-Relationship (ER) Model. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Customized Question Handling in Data Removal Using CPHC

Exposé for a Master s Thesis

Indian Institute of Technology, Kanpur

COMPARISON OF TWO SEGMENTATION METHODS FOR LIBRARY RECOMMENDER SYSTEMS. by Wing-Kee Ho

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Transcription:

SCHEME OF COURSE WORK Department of CSE Course Title : Data Warehousing and Data mining Course Outcomes (COs): Program Outcomes (POs): Course Code : 13IT2114 L P C 4 0 3 Programme: : M.Tech. Specialization: : CSE Semester :Ist Semester Prerequisites : DBMS Courses to which it is a prerequisite : Text Mining 1 Use the appropriate abstract data type for formulating solutions for the given problem. 2 Describe priority queues using heaps and compare the complexities of various sorting algorithms 3 Examine the solution for dynamic equivalence problem using find and smart union algorithms and discover solutions for various graph problems. 4 Extrapolate various algorithm design techniques with examples and compute amortized analysis for skew heaps, binomial queues, splay trees. 5 Apply various advanced data structures like red-black trees, heap, AA trees, k-d trees etc in relevant application needed. A graduate of M.Tech CSE Specialization will be able to 1 Graduates will demonstrate knowledge in core subjects of Computer Science and Engineering and the ability to learn independently. 2 Graduates will demonstrate the ability to design a software application or process that meets desired Specifications within the constraints. 3 Graduates will demonstrate the ability to solve problems relevant to industries and research organizations. 4 Graduates will develop innovative thinking capabilities to promote research in core and trans-disciplinaryareas. 5 Graduates will be familiar with modern engineering software tools and equipment to analyze computer science and engineering problems. 6 Graduates will demonstrate the ability to collaborate with engineers of other disciplines and work on projects requiring multidisciplinary skills. 7 Graduates will acquire project management and finance control abilities. 8 Graduates will be able to communicate effectively in both verbal and written forms. 9 Graduates will engage themselves in lifelong learning in the context of rapid technological changes in computer science and engineering

10 Graduates will demonstrate an appreciation of ethical and social responsibilities in professional and societal context. 11 Graduates will demonstrate the ability in carrying out tasks independently and by reflective learning. Course Outcome versus Program Outcomes: COs PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 CO1 M M S S S M S CO2 S S S S M M S CO3 S S S S M M S CO4 S S S S M S CO5 S S S S M S Assessment Methods: Assignment / Quiz / Seminar / Case Study / Mid-Test / End Exam

Teaching-Learning and Evaluation Week TOPIC / CONTENTS Course Sample questions TEACHING- Assessment Outcomes LEARNING Method & STRATEGY Schedule 1 Introduction:Data mining-on what CO-1 1.Define Data mining. Lecture / Discussion Assignment kinds of Data, Data Mining Functionalities, Classification of Data Mining systems, Data Mining Task Primitives 2.Compare and contrast Data, information and knowledge. 2 Integration of a Data Mining CO-1. Lecture / Discussion Assignment System with a Database or Data 1. What is the difference Warehouse System, Major issues in between prediction and Data Mining. classification 3 Data Preprocessing: Descriptive CO-1 1.What is the need of Lecture / Discussion data summarization, Data Cleaning, Data Integration and Transformation, Preprocessing. Problem solving 4 Data Reduction, Discretization and CO-1 1. What are the phases in Lecture / Discussion Assignment Concept Hierarchy Generation. preprocessing. Problem solving 5 Data Warehouse and OLAP CO-2 1.What is the model used Lecture Technology: Multidimensional for construction of a Data Model, Data warehouse. WarehouseArchitecture 2. Whar are the ways in which the warehouse may be coupled with the data mining system 3. Expand OLAP system 4. Give the difference between OLAP and OLTP. Problem solving

6 Data Warehouse Implementation, CO-2 1.What are the application Lecture / Discussion Assignment From Data Warehousing to Data areas of OLAP system. Mining Problem solving 7 Data Cube Computation and Data CO-2 1 Explain BUC algorithm. Lecture / Discussion Generalization: Efficient methods for Data Cube Computation, Further Development of Data Cube and OLAP Technology, Attribute- Oriented Induction. Problem solving Assignment 8 Mining Frequent Patterns, CO-3 1.Differnce between Lecture / Discussion Assignment Problem solving Association and Correlations: Basic association and correlation. concepts, Efficient and Scalable Frequent Itemset Mining Methods 2. When is an item said to be frequent. 3. Define support and confidence. 9 CO-3 10 Mining Various kinds of CO-3 1. What is the purpose of Lecture / Discussion Assignment Association Rules mining frequent item sets. 2.What re the drawbacks of apriori algorithm. 11 From Association Mining to CO-3 1 What are constraints imposed Lecture / Discussion Assignment Correlation Analysis, Constraint Based Association over assoc rules. 12 Classification and Prediction-1: CO-4 1.Give the formula for Lecture / Discussion Assignment Issues Regarding Classification and gainratio. Prediction, Classification by Decision Tree Induction 2. What is bayes rule. 3. Give the formula for error in back propagation classification. 13 Bayesian Classification,Rule- CO-4 1. What is the basic ides in Lecture / Discussion Assignment Based Classification, Classification ID3 algorithm. by

Backpropagation. 2.What is training set. 14 Classification and Prediction-2: CO-4 1 What are the different types of Lecture / Discussion Assignment Support Vector Machines, Association Classification, Other Classification Methods SVM s. 15 Prediction, Accuracy and Error CO-4 1 Give the formula for accuracy. Lecture / Discussion Assignment Measures, Evaluating the Accuracy of a Classifier or Predictor. 16 Cluster Analysis Introduction CO-5 1.Define cluster. Lecture / Discussion Assignment :Types of Data in Cluster Analysis, A Categorization of Major 2. Give the formula for Clustering Methods, Partitioning precision and recall. Methods, Hierarchical Methods, 3. What is clusterability. 17 Density-Based Methods, Grid- CO-5 1. Mention different types Lecture / Discussion Assignment Based Methods, Model-Based of clustering techniques Clustering Methods, Outlier Analysis 1.Give example for Partional clustering. 18 19/20 END EXAM