Weka: Naïve Bayes Classifier(s)

Size: px
Start display at page:

Download "Weka: Naïve Bayes Classifier(s)"

Transcription

1 Lecture 06: LAB Assignment Weka: Naïve Bayes Classifier(s) ACKNOWLEDGEMENTS: Our lab assignment today has been inspired by the following lab projects: past tense dataset + decision trees: < > and spam dataset + naïve bayes < >. INFO Required reading for Lecture 6 and matching Lab Assignment: - - Daume (2015): Ch 7 from page 103 to page 110. Requirement: be open to discuss the main topics and/or the main problems of the lab assignment with one or more randomly- chosen classmate(s). If you have a problem or deep insight, do not keep it to yourself: share it! You might solve the problem or you might get an even deeper insight J Your original way of thinking will be enriched by discussions with peers. Execution time: approx. 2-3 hours. ATT: datasets can be downloaded from here: < Learning objectives In this lab assignment you are going to: explore the behavior of Naïve Bayes classifier(s) (as implemented in weka) on linguistic data: o spam dataset, past tense dataset o Naïve Bayes o (NaiveBayesSimple) Pondering about our previous experiences In our previous weka lab assignments, we used two different datasets, namely the iris dataset and the past tense dataset. These two datasets represent: n iris flowers: small (150 instances), balanced distributions of instances across three classes (50 instances per class), numerical attributes/features (measurements in cm), categorical class labels (the names of the iris species) n past tense inflections: largish (4330 instances), many classes (42 class labels), nominal attributes (phonemes), highly unbalanced. You explored the past tense dataset and you correctly figured out the following facts: n It is list of: o verbs (ex: <

2 o phonemes (ex:< o classes (< o For your convenience, you can now browse the list here (< tense.dat>) and see if it really matches your intuitions about the dataset. n In summary we can say that the past tense dataset contains verb lemmas, where the class to be predicted is characterized by past tense formation rules. n This is a first example of how machine learning can help out in solving linguistic issues, although J48 does not seem the ideal classifier for this dataset. J48 implements a decision tree model following the C4.5 algorithm. C4.5 is an algorithm used to generate a decision tree, which was developed by Ross Quinlan. C4.5 is an extension of Quinlan's earlier ID3 algorithm. The ID3 algorithm uses "Information Gain" measure. The C4.5 uses "Gain Ratio" measure. (optional reading: ( linux1.temple.edu/~giorgio/cis587/readings/id3- c45.html) If you look at the picture below, you can see that weka cites the reference of the implemented classifier. J48 makes use of entropy, which gives us the information about "degree of doubt". J48 selects the attribute for classifying by comparing the information gains. The following is the quick summarize of the J48 algorithm: 1. For each attribute, compute its entropy with respect to the class attribute. 2. Compute and select the attribute (say A) with highest gain ratio. 3. Divide the data into separate sets according to the values of A. 4. Build a tree with each branch represents an attribute (A) value. 5. For each subtree, repeat this process from step At each iteration, one attribute gets removed from consideration. 7. The process stops when there are no attributes left to consider, or when all the data being considered in a subtree have the same value for the class attribute. 8. Inductive bias: Shorter trees are preferred over larger trees. Trees that place high gain ratio attributes close to the root are preferred over those that do not (cf Lecture 1: What is machine learning? Slide 31: J48/C4.5 and its antecedent ID3 have the same inductive bias). This means that the default parameters of J48 prune the trees drastically (read again the definition of inductive bias Lecture 1: What is machine learning? Slide 30. You have now experienced in practice what inductive bias is). It is possible to change these parameters and get an unpruned tree. We are not going into

3 this exploration right now but if you read carefully all the parameters you will see a confidencefactor (ie the C) and an unpruded parameters. (see picture below). J48 can deal with both nominal (past tense dataset) and numeric (iris dataset) attributes. However, please remember that this is not always the case, since some classifiers do not have this flexibility, for example linear classifiers. In today s lab assignment we will explore the behaviour of weka's Naive Bayes implementations. NB is neither a linear classifier, nor a divide and conquer classifier, it is a probabilistic classifier. How does NB behave with linguistic datasets? Let's carry out this exploration today... Preliminaries (Repetition) Preprocess Tab In the Preprocess tab, you can review the data you are working with. In the left section, it outlines the information and all attributes of the dataset. By selecting each attribute, the right section of the Explorer window will also give you the information about the data in that attribute. There is also a visual way of examining the data which you can see by clicking the Visualize All button. Sometimes, the visualization is a very powerful tool for reviewing the dataset. Classify Tab In the Classify tab, you can create a model by using Choose to select a model. If you hover on the name of a classifier when you press Choose, you will see a tooltip

4 containing the main information about the classifier you are hovering on (see picture below). After the desired model has been chosen, we have to tell WEKA how to evaluate the model that has been built. In the Test options frame, the option Use training set means that we want to use the data supplied in the ARFF file loaded. The other three choices are Supplied test set, where you can supply a different set of data to build the model; Cross- validation, which lets WEKA build a model based on subsets of the supplied data and then average them out to create a final model; and Percentage split, where WEKA takes a percentile subset of the supplied data to build a final model. Tasks G tasks: pls provide comprehensive answers to all the questions below: (1) Start Weka, launch the Explorer window and select the "Preprocess" tab. Open the past tense dataset. Select the Classify tab to get into the Classification tab of Weka. Click on Choose and hover on to NaiveBayes. Read the tooltip and then select the classifier. In Test options, select 10- fold cross validation and hit Start. Evaluate the performance (ie. read the output, ie the evaluation measures that we have studied so far) of the NB classifier on the past tense dataset. What are your conclusions? (2) Go back to your previous lab (when we used the decision tree classifier J48 on the past tense dataset). Compare the performance of J48 and NaiveBayes. What are your conclusions? If you should recommend one of the two classifiers based on weka output, which one would you recommend?

5 (3) Open the spambase dataset in the Preprocessing panel. How many classes, instances and attributes can be found in the spambase.arff dataset? What type of attributes and what kind of classes? (4) Run both J48 and NaïveBayes on the spambase dataset. Compare and discuss their behavior on the two different datasets. (5) Theoretical question: Why is naïve Bayes classification called naïve? Briefly outline the major ideas of naïve Bayes classification. VG tasks: pls provide comprehensive answers to all the questions below: (6) Run NaiveBayesSimple on the past tense dataset. You will get an error. Can you make a try and interpret what causes this error? (tips: google the error and compare the descriptions of the NaïveBayseSimple classifier against the description of the NaiveBayes classifier). What are your conclusions? (7) Run NaiveBayesSimple on the spambase dataset. You will get an error. Can you make a try and interpret what causes this error? (tips: google the error and compare the descriptions of the NaïveBayseSimple classifier against the description of the NaiveBayes classifier). What are your conclusions? To be submitted A written report (at least 1 page) containing the reasoned answers to the tasks and questions above and a short section where you summarize your reflections and experience. Submit the report in PDF format to santinim@stp.lingfil.uu.se no later than Fri 27 Nov 2015, 1pm (13:00). Naming conventions Please, name your pdf report in this way (it will be easier for me to organize and archive them): surname_name_lecturenumberlab_report.pdf (ex: santini_marina_lecture06lab_report.pdf).

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

Houghton Mifflin Online Assessment System Walkthrough Guide

Houghton Mifflin Online Assessment System Walkthrough Guide Houghton Mifflin Online Assessment System Walkthrough Guide Page 1 Copyright 2007 by Houghton Mifflin Company. All Rights Reserved. No part of this document may be reproduced or transmitted in any form

More information

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Twitter Sentiment Classification on Sanders Data using Hybrid Approach IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Rule Learning with Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX

More information

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE Pierre Foy TIMSS Advanced 2015 orks User Guide for the International Database Pierre Foy Contributors: Victoria A.S. Centurino, Kerry E. Cotter,

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

An Introductory Blackboard (elearn) Guide For Parents

An Introductory Blackboard (elearn) Guide For Parents An Introductory Blackboard (elearn) Guide For Parents Prepared: July 2010 Revised: Jan 2013 By M. A. Avila Introduction: Blackboard is a course management system widely used in educational settings. At

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

i>clicker Setup Training Documentation This document explains the process of integrating your i>clicker software with your Moodle course.

i>clicker Setup Training Documentation This document explains the process of integrating your i>clicker software with your Moodle course. This document explains the process of integrating your i>clicker software with your Moodle course. Center for Effective Teaching and Learning CETL Fine Arts 138 mymoodle@calstatela.edu Cal State L.A. (323)

More information

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0

Intel-powered Classmate PC. SMART Response* Training Foils. Version 2.0 Intel-powered Classmate PC Training Foils Version 2.0 1 Legal Information INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE,

More information

Test How To. Creating a New Test

Test How To. Creating a New Test Test How To Creating a New Test From the Control Panel of your course, select the Test Manager link from the Assessments box. The Test Manager page lists any tests you have already created. From this screen

More information

CS 446: Machine Learning

CS 446: Machine Learning CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt

More information

Outreach Connect User Manual

Outreach Connect User Manual Outreach Connect A Product of CAA Software, Inc. Outreach Connect User Manual Church Growth Strategies Through Sunday School, Care Groups, & Outreach Involving Members, Guests, & Prospects PREPARED FOR:

More information

Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate

Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate NESA Conference 2007 Presenter: Barbara Dent Educational Technology Training Specialist Thomas Jefferson High School for Science

More information

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models

Netpix: A Method of Feature Selection Leading. to Accurate Sentiment-Based Classification Models Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models 1 Netpix: A Method of Feature Selection Leading to Accurate Sentiment-Based Classification Models James B.

More information

Creating a Test in Eduphoria! Aware

Creating a Test in Eduphoria! Aware in Eduphoria! Aware Login to Eduphoria using CHROME!!! 1. LCS Intranet > Portals > Eduphoria From home: LakeCounty.SchoolObjects.com 2. Login with your full email address. First time login password default

More information

(Sub)Gradient Descent

(Sub)Gradient Descent (Sub)Gradient Descent CMSC 422 MARINE CARPUAT marine@cs.umd.edu Figures credit: Piyush Rai Logistics Midterm is on Thursday 3/24 during class time closed book/internet/etc, one page of notes. will include

More information

Welcome to California Colleges, Platform Exploration (6.1) Goal: Students will familiarize themselves with the CaliforniaColleges.edu platform.

Welcome to California Colleges, Platform Exploration (6.1) Goal: Students will familiarize themselves with the CaliforniaColleges.edu platform. Welcome to California Colleges, Platform Exploration (6.1) Goal: Students will familiarize themselves with the CaliforniaColleges.edu platform. Lesson Time Options This lesson requires one 45-60 minute

More information

ACCESSING STUDENT ACCESS CENTER

ACCESSING STUDENT ACCESS CENTER ACCESSING STUDENT ACCESS CENTER Student Access Center is the Fulton County system to allow students to view their student information. All students are assigned a username and password. 1. Accessing the

More information

NCAA Eligibility Center High School Portal Instructions. Course Module

NCAA Eligibility Center High School Portal Instructions. Course Module NCAA Eligibility Center High School Portal Instructions Course Module www.eligibilitycenter.org Click here to enter the High School Portal Before logging in, you can peruse the resource page or look at

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Parent s Guide to the Student/Parent Portal

Parent s Guide to the Student/Parent Portal Nova Scotia Public Education System Parent s Guide to the Student/Parent Portal Revision Date: The Student/Parent Portal is your gateway into the classroom of the children associated to your account. The

More information

Assignment 1: Predicting Amazon Review Ratings

Assignment 1: Predicting Amazon Review Ratings Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for

More information

EdX Learner s Guide. Release

EdX Learner s Guide. Release EdX Learner s Guide Release Nov 18, 2017 Contents 1 Welcome! 1 1.1 Learning in a MOOC........................................... 1 1.2 If You Have Questions As You Take a Course..............................

More information

16.1 Lesson: Putting it into practice - isikhnas

16.1 Lesson: Putting it into practice - isikhnas BAB 16 Module: Using QGIS in animal health The purpose of this module is to show how QGIS can be used to assist in animal health scenarios. In order to do this, you will have needed to study, and be familiar

More information

Emporia State University Degree Works Training User Guide Advisor

Emporia State University Degree Works Training User Guide Advisor Emporia State University Degree Works Training User Guide Advisor For use beginning with Catalog Year 2014. Not applicable for students with a Catalog Year prior. Table of Contents Table of Contents Introduction...

More information

The Revised Math TEKS (Grades 9-12) with Supporting Documents

The Revised Math TEKS (Grades 9-12) with Supporting Documents The Revised Math TEKS (Grades 9-12) with Supporting Documents This is the first of four modules to introduce the revised TEKS for high school mathematics. The goals for participation are to become familiar

More information

Urban Analysis Exercise: GIS, Residential Development and Service Availability in Hillsborough County, Florida

Urban Analysis Exercise: GIS, Residential Development and Service Availability in Hillsborough County, Florida UNIVERSITY OF NORTH TEXAS Department of Geography GEOG 3100: US and Canada Cities, Economies, and Sustainability Urban Analysis Exercise: GIS, Residential Development and Service Availability in Hillsborough

More information

The following information has been adapted from A guide to using AntConc.

The following information has been adapted from A guide to using AntConc. 1 7. Practical application of genre analysis in the classroom In this part of the workshop, we are going to analyse some of the texts from the discipline that you teach. Before we begin, we need to get

More information

Student Handbook. This handbook was written for the students and participants of the MPI Training Site.

Student Handbook. This handbook was written for the students and participants of the MPI Training Site. Student Handbook This handbook was written for the students and participants of the MPI Training Site. Purpose To enable the active participants of this website easier operation and a thorough understanding

More information

Moodle MyFeedback update April 2017

Moodle MyFeedback update April 2017 Moodle MyFeedback update April 2017 Jessica Gramp j.gramp@ucl.ac.uk Moodle My Feedback Report Allows students and staff to easily view grades & feedback across Moodle courses. It is available from Moodle.org

More information

MOODLE 2.0 GLOSSARY TUTORIALS

MOODLE 2.0 GLOSSARY TUTORIALS BEGINNING TUTORIALS SECTION 1 TUTORIAL OVERVIEW MOODLE 2.0 GLOSSARY TUTORIALS The glossary activity module enables participants to create and maintain a list of definitions, like a dictionary, or to collect

More information

Schoology Getting Started Guide for Teachers

Schoology Getting Started Guide for Teachers Schoology Getting Started Guide for Teachers (Latest Revision: December 2014) Before you start, please go over the Beginner s Guide to Using Schoology. The guide will show you in detail how to accomplish

More information

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering

More information

Quick Reference for itslearning

Quick Reference for itslearning Quick Reference for itslearning Frequently Asked Questions... 2 How do I access itslearning?... 2 Who can I contact if I get a problem?... 2 Where can I get help?... 2 Can I get itslearning in my language?...

More information

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages

Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages Iterative Cross-Training: An Algorithm for Learning from Unlabeled Web Pages Nuanwan Soonthornphisaj 1 and Boonserm Kijsirikul 2 Machine Intelligence and Knowledge Discovery Laboratory Department of Computer

More information

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning

Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning Experiment Databases: Towards an Improved Experimental Methodology in Machine Learning Hendrik Blockeel and Joaquin Vanschoren Computer Science Dept., K.U.Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium

More information

IBCP Language Portfolio Core Requirement for the International Baccalaureate Career-Related Programme

IBCP Language Portfolio Core Requirement for the International Baccalaureate Career-Related Programme IBCP Language Portfolio Core Requirement for the International Baccalaureate Career-Related Programme Name Student ID Year of Graduation Start Date Completion Due Date May 1, 20 (or before) Target Language

More information

U of S Course Tools. Open CourseWare (OCW)

U of S Course Tools. Open CourseWare (OCW) Open CourseWare (OCW) January 2014 Overview: Open CourseWare works by using the Public Access settings in your or Blackboard course. This document explains how to configure these basic settings for your

More information

CSL465/603 - Machine Learning

CSL465/603 - Machine Learning CSL465/603 - Machine Learning Fall 2016 Narayanan C Krishnan ckn@iitrpr.ac.in Introduction CSL465/603 - Machine Learning 1 Administrative Trivia Course Structure 3-0-2 Lecture Timings Monday 9.55-10.45am

More information

PowerTeacher Gradebook User Guide PowerSchool Student Information System

PowerTeacher Gradebook User Guide PowerSchool Student Information System PowerSchool Student Information System Document Properties Copyright Owner Copyright 2007 Pearson Education, Inc. or its affiliates. All rights reserved. This document is the property of Pearson Education,

More information

Online ICT Training Courseware

Online ICT Training Courseware Computing Guide THE LIBRARY www.salford.ac.uk/library Online ICT Training Courseware What materials are covered? Office 2003 to 2007 Quick Conversion Course Microsoft 2010, 2007 and 2003 for Word, PowerPoint,

More information

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,

More information

Chapter 2 Rule Learning in a Nutshell

Chapter 2 Rule Learning in a Nutshell Chapter 2 Rule Learning in a Nutshell This chapter gives a brief overview of inductive rule learning and may therefore serve as a guide through the rest of the book. Later chapters will expand upon the

More information

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Notebook for PAN at CLEF 2013 Andrés Alfonso Caurcel Díaz 1 and José María Gómez Hidalgo 2 1 Universidad

More information

Automating Outcome Based Assessment

Automating Outcome Based Assessment Automating Outcome Based Assessment Suseel K Pallapu Graduate Student Department of Computing Studies Arizona State University Polytechnic (East) 01 480 449 3861 harryk@asu.edu ABSTRACT In the last decade,

More information

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University 06.11.16 13.11.16 Hannover Our group from Peter the Great St. Petersburg

More information

Course Groups and Coordinator Courses MyLab and Mastering for Blackboard Learn

Course Groups and Coordinator Courses MyLab and Mastering for Blackboard Learn Course Groups and Coordinator Courses MyLab and Mastering for Blackboard Learn MyAnthroLab MyArtsLab MyDevelopmentLab MyHistoryLab MyMusicLab MyPoliSciLab MyPsychLab MyReligionLab MySociologyLab MyThinkingLab

More information

Preferences...3 Basic Calculator...5 Math/Graphing Tools...5 Help...6 Run System Check...6 Sign Out...8

Preferences...3 Basic Calculator...5 Math/Graphing Tools...5 Help...6 Run System Check...6 Sign Out...8 CONTENTS GETTING STARTED.................................... 1 SYSTEM SETUP FOR CENGAGENOW....................... 2 USING THE HEADER LINKS.............................. 2 Preferences....................................................3

More information

Filing RTI Application by your own

Filing RTI Application by your own We at filertinow.com file RTIs anywhere in India. Filing RTI through us is an easy 3 minutes process. Our experts have information about RTI filing for thousands of government offices across the country

More information

Odyssey Writer Online Writing Tool for Students

Odyssey Writer Online Writing Tool for Students Odyssey Writer Online Writing Tool for Students Ways to Access Odyssey Writer: 1. Odyssey Writer Icon on Student Launch Pad Stand alone icon on student launch pad for free-form writing. This is the drafting

More information

Beyond the Pipeline: Discrete Optimization in NLP

Beyond the Pipeline: Discrete Optimization in NLP Beyond the Pipeline: Discrete Optimization in NLP Tomasz Marciniak and Michael Strube EML Research ggmbh Schloss-Wolfsbrunnenweg 33 69118 Heidelberg, Germany http://www.eml-research.de/nlp Abstract We

More information

DegreeWorks Advisor Reference Guide

DegreeWorks Advisor Reference Guide DegreeWorks Advisor Reference Guide Table of Contents 1. DegreeWorks Basics... 2 Overview... 2 Application Features... 3 Getting Started... 4 DegreeWorks Basics FAQs... 10 2. What-If Audits... 12 Overview...

More information

Memory-based grammatical error correction

Memory-based grammatical error correction Memory-based grammatical error correction Antal van den Bosch Peter Berck Radboud University Nijmegen Tilburg University P.O. Box 9103 P.O. Box 90153 NL-6500 HD Nijmegen, The Netherlands NL-5000 LE Tilburg,

More information

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING SISOM & ACOUSTICS 2015, Bucharest 21-22 May THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING MarilenaăLAZ R 1, Diana MILITARU 2 1 Military Equipment and Technologies Research Agency, Bucharest,

More information

CHANCERY SMS 5.0 STUDENT SCHEDULING

CHANCERY SMS 5.0 STUDENT SCHEDULING CHANCERY SMS 5.0 STUDENT SCHEDULING PARTICIPANT WORKBOOK VERSION: 06/04 CSL - 12148 Student Scheduling Chancery SMS 5.0 : Student Scheduling... 1 Course Objectives... 1 Course Agenda... 1 Topic 1: Overview

More information

TIPS PORTAL TRAINING DOCUMENTATION

TIPS PORTAL TRAINING DOCUMENTATION TIPS PORTAL TRAINING DOCUMENTATION 1 TABLE OF CONTENTS General Overview of TIPS. 3, 4 TIPS, Where is it? How do I access it?... 5, 6 Grade Reports.. 7 Grade Reports Demo and Exercise 8 12 Withdrawal Reports.

More information

Reducing Features to Improve Bug Prediction

Reducing Features to Improve Bug Prediction Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science

More information

Getting Started Guide

Getting Started Guide Getting Started Guide Getting Started with Voki Classroom Oddcast, Inc. Published: July 2011 Contents: I. Registering for Voki Classroom II. Upgrading to Voki Classroom III. Getting Started with Voki Classroom

More information

INTERMEDIATE ALGEBRA Course Syllabus

INTERMEDIATE ALGEBRA Course Syllabus INTERMEDIATE ALGEBRA Course Syllabus This syllabus gives a detailed explanation of the course procedures and policies. You are responsible for this information - ask your instructor if anything is unclear.

More information

Office of Planning and Budgets. Provost Market for Fiscal Year Resource Guide

Office of Planning and Budgets. Provost Market for Fiscal Year Resource Guide Office of Planning and Budgets Provost Market for Fiscal Year 2017-18 Resource Guide This resource guide will show users how to operate the Cognos Planning application used to collect Provost Market raise

More information

Using SAM Central With iread

Using SAM Central With iread Using SAM Central With iread January 1, 2016 For use with iread version 1.2 or later, SAM Central, and Student Achievement Manager version 2.4 or later PDF0868 (PDF) Houghton Mifflin Harcourt Publishing

More information

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Chinese Language Parsing with Maximum-Entropy-Inspired Parser Chinese Language Parsing with Maximum-Entropy-Inspired Parser Heng Lian Brown University Abstract The Chinese language has many special characteristics that make parsing difficult. The performance of state-of-the-art

More information

GACE Computer Science Assessment Test at a Glance

GACE Computer Science Assessment Test at a Glance GACE Computer Science Assessment Test at a Glance Updated May 2017 See the GACE Computer Science Assessment Study Companion for practice questions and preparation resources. Assessment Name Computer Science

More information

CS177 Python Programming

CS177 Python Programming CS177 Python Programming Recitation 1 Introduction Adapted from John Zelle s Book Slides 1 Course Instructors Dr. Elisha Sacks E-mail: eps@purdue.edu Ruby Tahboub (Course Coordinator) E-mail: rtahboub@purdue.edu

More information

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

How to analyze visual narratives: A tutorial in Visual Narrative Grammar How to analyze visual narratives: A tutorial in Visual Narrative Grammar Neil Cohn 2015 neilcohn@visuallanguagelab.com www.visuallanguagelab.com Abstract Recent work has argued that narrative sequential

More information

Driving Author Engagement through IEEE Collabratec

Driving Author Engagement through IEEE Collabratec Driving Author Engagement through IEEE Collabratec Gianluca Setti 2013-2014 IEEE Vice President for Publication Services and Products Professor of Engineering, University of Ferrara gianluca.setti@unife.it

More information

Introduction to the Revised Mathematics TEKS (2012) Module 1

Introduction to the Revised Mathematics TEKS (2012) Module 1 Introduction to the Revised Mathematics TEKS (2012) Module 1 This is the first of four modules to introduce the Revised TEKS for grades K 8. The goals for participation are to become familiar with the

More information

THESIS GUIDE FORMAL INSTRUCTION GUIDE FOR MASTER S THESIS WRITING SCHOOL OF BUSINESS

THESIS GUIDE FORMAL INSTRUCTION GUIDE FOR MASTER S THESIS WRITING SCHOOL OF BUSINESS THESIS GUIDE FORMAL INSTRUCTION GUIDE FOR MASTER S THESIS WRITING SCHOOL OF BUSINESS 1. Introduction VERSION: DECEMBER 2015 A master s thesis is more than just a requirement towards your Master of Science

More information

Best Colleges Main Survey

Best Colleges Main Survey Best Colleges Main Survey Date submitted 5/12/216 18::56 Introduction page 1 / 146 BEST COLLEGES Data Collection U.S. News has begun collecting data for the 217 edition of Best Colleges. The U.S. News

More information

For Jury Evaluation. The Road to Enlightenment: Generating Insight and Predicting Consumer Actions in Digital Markets

For Jury Evaluation. The Road to Enlightenment: Generating Insight and Predicting Consumer Actions in Digital Markets FACULDADE DE ENGENHARIA DA UNIVERSIDADE DO PORTO The Road to Enlightenment: Generating Insight and Predicting Consumer Actions in Digital Markets Jorge Moreira da Silva For Jury Evaluation Mestrado Integrado

More information

Model Ensemble for Click Prediction in Bing Search Ads

Model Ensemble for Click Prediction in Bing Search Ads Model Ensemble for Click Prediction in Bing Search Ads Xiaoliang Ling Microsoft Bing xiaoling@microsoft.com Hucheng Zhou Microsoft Research huzho@microsoft.com Weiwei Deng Microsoft Bing dedeng@microsoft.com

More information

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)

Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems) Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems) If searching for the ebook Multisensor Data Fusion: From Algorithms and Architectural

More information

Storytelling Made Simple

Storytelling Made Simple Storytelling Made Simple Storybird is a Web tool that allows adults and children to create stories online (independently or collaboratively) then share them with the world or select individuals. Teacher

More information

SECTION 12 E-Learning (CBT) Delivery Module

SECTION 12 E-Learning (CBT) Delivery Module SECTION 12 E-Learning (CBT) Delivery Module Linking a CBT package (file or URL) to an item of Set Training 2 Linking an active Redkite Question Master assessment 2 to the end of a CBT package Removing

More information

Field Experience Management 2011 Training Guides

Field Experience Management 2011 Training Guides Field Experience Management 2011 Training Guides Page 1 of 40 Contents Introduction... 3 Helpful Resources Available on the LiveText Conference Visitors Pass... 3 Overview... 5 Development Model for FEM...

More information

Indian Institute of Technology, Kanpur

Indian Institute of Technology, Kanpur Indian Institute of Technology, Kanpur Course Project - CS671A POS Tagging of Code Mixed Text Ayushman Sisodiya (12188) {ayushmn@iitk.ac.in} Donthu Vamsi Krishna (15111016) {vamsi@iitk.ac.in} Sandeep Kumar

More information

Switchboard Language Model Improvement with Conversational Data from Gigaword

Switchboard Language Model Improvement with Conversational Data from Gigaword Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword

More information

Connect Microbiology. Training Guide

Connect Microbiology. Training Guide 1 Training Checklist Section 1: Getting Started 3 Section 2: Course and Section Creation 4 Creating a New Course with Sections... 4 Editing Course Details... 9 Editing Section Details... 9 Copying a Section

More information

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

Poster Presentation Best Practices. Kuba Glazek, Ph.D. Methodology Expert National Center for Academic and Dissertation Excellence Los Angeles

Poster Presentation Best Practices. Kuba Glazek, Ph.D. Methodology Expert National Center for Academic and Dissertation Excellence Los Angeles Poster Presentation Best Practices Kuba Glazek, Ph.D. Methodology Expert National Center for Academic and Dissertation Excellence Los Angeles Outline Background Scholarship and career goals Overview of

More information

Preparing a Research Proposal

Preparing a Research Proposal Preparing a Research Proposal T. S. Jayne Guest Seminar, Department of Agricultural Economics and Extension, University of Pretoria March 24, 2014 What is a Proposal? A formal request for support of sponsored

More information

LMS - LEARNING MANAGEMENT SYSTEM END USER GUIDE

LMS - LEARNING MANAGEMENT SYSTEM END USER GUIDE LMS - LEARNING MANAGEMENT SYSTEM (ADP TALENT MANAGEMENT) END USER GUIDE August 2012 Login Log onto the Learning Management System (LMS) by clicking on the desktop icon or using the following URL: https://lakehealth.csod.com

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining Dave Donnellan, School of Computer Applications Dublin City University Dublin 9 Ireland daviddonnellan@eircom.net Claus Pahl

More information

MyUni - Turnitin Assignments

MyUni - Turnitin Assignments - Turnitin Assignments Originality, Grading & Rubrics Turnitin Assignments... 2 Create Turnitin assignment... 2 View Originality Report and grade a Turnitin Assignment... 4 Originality Report... 6 GradeMark...

More information

Truth Inference in Crowdsourcing: Is the Problem Solved?

Truth Inference in Crowdsourcing: Is the Problem Solved? Truth Inference in Crowdsourcing: Is the Problem Solved? Yudian Zheng, Guoliang Li #, Yuanbing Li #, Caihua Shan, Reynold Cheng # Department of Computer Science, Tsinghua University Department of Computer

More information

Create Quiz Questions

Create Quiz Questions You can create quiz questions within Moodle. Questions are created from the Question bank screen. You will also be able to categorize questions and add them to the quiz body. You can crate multiple-choice,

More information

Android App Development for Beginners

Android App Development for Beginners Description Android App Development for Beginners DEVELOP ANDROID APPLICATIONS Learning basics skills and all you need to know to make successful Android Apps. This course is designed for students who

More information

ecampus Basics Overview

ecampus Basics Overview ecampus Basics Overview 2016/2017 Table of Contents Managing DCCCD Accounts.... 2 DCCCD Resources... 2 econnect and ecampus... 2 Registration through econnect... 3 Fill out the form (3 steps)... 4 ecampus

More information

Genre classification on German novels

Genre classification on German novels Genre classification on German novels Lena Hettinger, Martin Becker, Isabella Reger, Fotis Jannidis and Andreas Hotho Data Mining and Information Retrieval Group, University of Würzburg Email: {hettinger,

More information

Applications of data mining algorithms to analysis of medical data

Applications of data mining algorithms to analysis of medical data Master Thesis Software Engineering Thesis no: MSE-2007:20 August 2007 Applications of data mining algorithms to analysis of medical data Dariusz Matyja School of Engineering Blekinge Institute of Technology

More information

Online Testing - Quick Troubleshooting Tips

Online Testing - Quick Troubleshooting Tips Online Testing - Quick Troubleshooting Tips This document outlines quick troubleshooting tips for some common issues related to online testing that may impact the Test Coordinators/ Administrators or the

More information

Interactive Whiteboard

Interactive Whiteboard 50 Graphic Organizers for the Interactive Whiteboard Whiteboard-ready graphic organizers for reading, writing, math, and more to make learning engaging and interactive by Jennifer Jacobson & Dottie Raymer

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

Dyslexia and Dyscalculia Screeners Digital. Guidance and Information for Teachers

Dyslexia and Dyscalculia Screeners Digital. Guidance and Information for Teachers Dyslexia and Dyscalculia Screeners Digital Guidance and Information for Teachers Digital Tests from GL Assessment For fully comprehensive information about using digital tests from GL Assessment, please

More information