Search Engines. Gertjan van Noord. August 31, 2017

Size: px
Start display at page:

Download "Search Engines. Gertjan van Noord. August 31, 2017"

Transcription

1 Search Engines Gertjan van Noord August 31, 2017

2 About the course Information about the course is available from:

3 Search Engines / Information Retrieval Individuals, administrations, organizations have lots of digital information how to organize and store it? how to retrieve documents? how to retrieve info inside them? An IR system is a tool to facilitate retrieval of such information

4 Search Engines / Information Retrieval Here is what the book says: Information retrieval is finding material (usually documents) of an unstructured nature (usually text) that satisfies an information need from within large collections (usually stored on computers).

5 Search Engines / Information Retrieval Important aspects of this definition: finding material large collections unstructured, usually text information need

6 Finding material documents part of documents tweets facts (e.g., what is the Dutch name for peregrine falcon )...

7 Large collections Large world-wide-web or parts of it Twitter specific collections (legal, medical,... ) your own computer information within a company (manuals Boeing)

8 Unstructured arbitray text versus databases even text is always structured somewhat beyond text: image, sound, video (not in this course)

9 Database search versus Information Retrieval structured versus unstructured search of fields versus full text search exact search versus inexact order of results alfanumerical versus order of results by goodness (... )

10 Information Need Is not the same as the query you type in Google In web search: find specific web-site find information about some topic find concrete answer to a question...

11 Information Need and Evaluation Search engines produce often a lot of results When are you satisfied with the results? How can we evaluate a system?

12 Evaluation: Precision and recall Key statistics for evaluation with a test set (fixed questions, set of documents, evaluations of documents for the queries available). Precision: what fraction of the system results are relevant to the information need? Recall: what fraction of the relevant documents in the collection were returned by the system?

13 Information Need and Evaluation Precision and Recall are widely used evaluation metrics. It depends on your information need if these metrics are really appropriate.

14 Course objectives IR terminology IR models and IR processes Understanding of indexing methods, querying, retrieving, ranking, evaluation Practical experience by implementing basic techniques in Python Learn about program efficiency: what happens in case you apply particular algorithms for very large datasets

15 Chapter 1: Boolean retrieval The first IR systems were Boolean systems. Queries are formulated with the Boolean operators AND, OR and NOT. Examples: Brutus AND Caesar (Brutus OR Caesar) AND NOT Cleopatra Brutus OR (Caesar AND NOT Cleopatra) NOT Brutus

16 Information from documents Each document in the collection needs a unique identifier Each document is tokenized. Tokenization is the process of splitting a text into separate tokens (terms, words). This is harder than you might think. For a boolean system: we need to know which tokens (terms, words) are present in which document

17 Term document incidence matrix For a boolean system: we need to know which tokens (terms, words) are present in which document. Doc1 Doc2 Doc3 Doc4 Antony Brutus Caesar Cleopatra What to do for a query such as: Antony AND Brutus AND NOT Cleopatra

18 Term document incidence matrix For a huge collection, this matrix becomes very big (how big?). What would the row for Cleopatra look like in a real system? More compact representation: only represent the 1 s.

19 How big? Example: Dutch Wikipedia. 256 Million words, 16 Million sentences, almost 5 Million articles. Number of different words. Types. Vocabulary. 3,5 Million (!). The term document incidence matrix would contain 5 Million 3,5 Million bits, i.e., more than 2 terrabytes.

20 Alternative representation: Inverted File For each term (type, word), have an ordered list of document identifiers The ordered list of document identifiers is called postings list Antony Brutus Caesar Cleopatra [Doc1,Doc2] [Doc1,Doc2,Doc3] [Doc1,Doc2,Doc4] [Doc1] What to do for a query such as: Antony AND Brutus AND NOT Cleopatra

21 How big? Dump of the Dutch Wikipedia. 256 Million words, 16 Million sentences, almost 5 Million articles. Number of different words. Types. Vocabulary. 3,5 Million (!). The term document incidence matrix would contain 5 Million 3,5 Million bits, is more than 2 terrabytes. The length of all posting lists together: 256 Million. 3 bytes for each document id. 256 Million * 3 amounts to something like 800 megabytes.

22 Efficient merging algorithm if posting lists are sorted In order to find the common elements in two ordered lists (intersection): l1 points at the next element of list1 l2 points at the next element of list2 if list1[l1] = list2[l2]: add the common element to the result, and move l1 and l2 one step forward otherwise move either l1 or l2 one step forward (whichever is the smallest). stop if either list is exhausted

23 Efficient merging algorithm if posting lists are sorted. Example. Example: list1: [1,6,12,14,20,21]; list2: [2,10,14,16,21]. Initially, l1 points to first element of list1; l2 points to first element of list2. The result is intially empty, of course. l1 list1 list2 l2 result ==> 1 2 <==

24 Efficient merging algorithm if posting lists are sorted. Example (1) l1 list1 list2 l2 result ==> 1 2 <==

25 Efficient merging algorithm if posting lists are sorted. Example (2) l1 list1 list2 l2 result 1 2 <== ==>

26 Efficient merging algorithm if posting lists are sorted. Example (3) l1 list1 list2 l2 result 1 2 <== 6 10 ==>

27 Efficient merging algorithm if posting lists are sorted. Example (4) l1 list1 list2 l2 result <== ==>

28 Efficient merging algorithm if posting lists are sorted. Example (5) l1 list1 list2 l2 result ==> <==

29 Efficient merging algorithm if posting lists are sorted. Example (6) l1 list1 list2 l2 result <== ==>

30 Efficient merging algorithm if posting lists are sorted. Example (7) l1 list1 list2 l2 result <== ==>

31 Efficient merging algorithm if posting lists are sorted. Example (8) l1 list1 list2 l2 result ==> <== 21

32 Efficient merging algorithm if posting lists are sorted. Example (9) l1 list1 list2 l2 result <== ==> 21

33 Efficient merging algorithm if posting lists are sorted To find identical elements in two ordered lists, the number of steps depends on the size of both lists. If the length of the two lists are x and y, then the number of steps roughly is x + y. The number of steps is linear in the length of the input: if the input is ten times as long, then computation takes about ten times as long. If the lists are unordered, the number of steps roughly is x y. The number of steps is quadratic in the length of the input. If the input is ten times as long, computation takes about hundred times as long. This is an important difference in case x and y become large. x y x + y x y

34 Intersection of two ordered lists in Python (1): the naive way def intersect (l1,l2 ): return [ obj for obj in l1 if obj in l2 ] or, equivalently, without the use of list-comprehension: def intersect (l1,l2 ): sol = [] for obj in l1: if obj in l2: sol. append ( obj ) return sol You have to compare each element in l1 with each element in l2 If those lists 11 and l2 have n and m elements respectively, then you have to perform n m comparisons You do not use the information that the lists are ordered

35 Can we do better? Yes of course.

36 Intersection of two ordered lists in Python (2): merge def intersect (l1,l2 ): it1 = iter (l1) it2 = iter (l2) sol = [] n1 = next ( it1 ) n2 = next ( it2 ) while True : try : if n1 == n2: sol. append (n1) n1 = next ( it1 ) n2 = next ( it2 ) else : if n1 < n2: n1 = next ( it1 ) else : n2 = next ( it2 ) except StopIteration : return sol

37 Intersection of two ordered lists in Python (2): merge Go through both lists, from top to bottom, in parallel Number of steps: n+m

38 Intersection of two ordered lists in Python (3): sets Use Python s set datastructure Intersection of two sets is built-in (and very fast) This is efficient as long as you can work with sets throughout; it is not recommended to convert lists to sets for the sole purpose of computing the intersection aset = set (l1) bset = set (l2) intersect_set = aset & bset

39 Experiment de timeit module in Python library l1 and l2 both have N elements, intersection is empty timings in second for N=10000, N=20000, N= lists (naive) lists (merge) lists converted to sets sets

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics

Web as Corpus. Corpus Linguistics. Web as Corpus 1 / 1. Corpus Linguistics. Web as Corpus. web.pl 3 / 1. Sketch Engine. Corpus Linguistics (L615) Markus Dickinson Department of Linguistics, Indiana University Spring 2013 The web provides new opportunities for gathering data Viable source of disposable corpora, built ad hoc for specific purposes

More information

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2 1 CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2 Peter A. Chew, Brett W. Bader, Ahmed Abdelali Proceedings of the 13 th SIGKDD, 2007 Tiago Luís Outline 2 Cross-Language IR (CLIR) Latent Semantic Analysis

More information

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks

System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks System Implementation for SemEval-2017 Task 4 Subtask A Based on Interpolated Deep Neural Networks 1 Tzu-Hsuan Yang, 2 Tzu-Hsuan Tseng, and 3 Chia-Ping Chen Department of Computer Science and Engineering

More information

Pre-Algebra A. Syllabus. Course Overview. Course Goals. General Skills. Credit Value

Pre-Algebra A. Syllabus. Course Overview. Course Goals. General Skills. Credit Value Syllabus Pre-Algebra A Course Overview Pre-Algebra is a course designed to prepare you for future work in algebra. In Pre-Algebra, you will strengthen your knowledge of numbers as you look to transition

More information

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

The Internet as a Normative Corpus: Grammar Checking with a Search Engine The Internet as a Normative Corpus: Grammar Checking with a Search Engine Jonas Sjöbergh KTH Nada SE-100 44 Stockholm, Sweden jsh@nada.kth.se Abstract In this paper some methods using the Internet as a

More information

10.2. Behavior models

10.2. Behavior models User behavior research 10.2. Behavior models Overview Why do users seek information? How do they seek information? How do they search for information? How do they use libraries? These questions are addressed

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

How to organise Quality Events

How to organise Quality Events How to organise Quality Events Dear locals, The Quality Assurance Committee s role is to help and support the locals in organising quality events. Quality is the term frequently used in education but often

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

Detecting English-French Cognates Using Orthographic Edit Distance

Detecting English-French Cognates Using Orthographic Edit Distance Detecting English-French Cognates Using Orthographic Edit Distance Qiongkai Xu 1,2, Albert Chen 1, Chang i 1 1 The Australian National University, College of Engineering and Computer Science 2 National

More information

New Features & Functionality in Q Release Version 3.1 January 2016

New Features & Functionality in Q Release Version 3.1 January 2016 in Q Release Version 3.1 January 2016 Contents Release Highlights 2 New Features & Functionality 3 Multiple Applications 3 Analysis 3 Student Pulse 3 Attendance 4 Class Attendance 4 Student Attendance

More information

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio SCSUG Student Symposium 2016 Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio Praneth Guggilla, Tejaswi Jha, Goutam Chakraborty, Oklahoma State

More information

Information Retrieval

Information Retrieval Information Retrieval Suan Lee - Information Retrieval - 02 The Term Vocabulary & Postings Lists 1 02 The Term Vocabulary & Postings Lists - Information Retrieval - 02 The Term Vocabulary & Postings Lists

More information

Switchboard Language Model Improvement with Conversational Data from Gigaword

Switchboard Language Model Improvement with Conversational Data from Gigaword Katholieke Universiteit Leuven Faculty of Engineering Master in Artificial Intelligence (MAI) Speech and Language Technology (SLT) Switchboard Language Model Improvement with Conversational Data from Gigaword

More information

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE Mingon Kang, PhD Computer Science, Kennesaw State University Self Introduction Mingon Kang, PhD Homepage: http://ksuweb.kennesaw.edu/~mkang9

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

Manipulative Mathematics Using Manipulatives to Promote Understanding of Math Concepts

Manipulative Mathematics Using Manipulatives to Promote Understanding of Math Concepts Using Manipulatives to Promote Understanding of Math Concepts Multiples and Primes Multiples Prime Numbers Manipulatives used: Hundreds Charts Manipulative Mathematics 1 www.foundationsofalgebra.com Multiples

More information

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Twitter Sentiment Classification on Sanders Data using Hybrid Approach IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 4, Ver. I (July Aug. 2015), PP 118-123 www.iosrjournals.org Twitter Sentiment Classification on Sanders

More information

Postprint.

Postprint. http://www.diva-portal.org Postprint This is the accepted version of a paper presented at CLEF 2013 Conference and Labs of the Evaluation Forum Information Access Evaluation meets Multilinguality, Multimodality,

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

GACE Computer Science Assessment Test at a Glance

GACE Computer Science Assessment Test at a Glance GACE Computer Science Assessment Test at a Glance Updated May 2017 See the GACE Computer Science Assessment Study Companion for practice questions and preparation resources. Assessment Name Computer Science

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

Finding Translations in Scanned Book Collections

Finding Translations in Scanned Book Collections Finding Translations in Scanned Book Collections Ismet Zeki Yalniz Dept. of Computer Science University of Massachusetts Amherst, MA, 01003 zeki@cs.umass.edu R. Manmatha Dept. of Computer Science University

More information

Mathematics process categories

Mathematics process categories Mathematics process categories All of the UK curricula define multiple categories of mathematical proficiency that require students to be able to use and apply mathematics, beyond simple recall of facts

More information

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department

More information

CS 100: Principles of Computing

CS 100: Principles of Computing CS 100: Principles of Computing Kevin Molloy August 29, 2017 1 Basic Course Information 1.1 Prerequisites: None 1.2 General Education Fulfills Mason Core requirement in Information Technology (ALL). 1.3

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Field Experience Management 2011 Training Guides

Field Experience Management 2011 Training Guides Field Experience Management 2011 Training Guides Page 1 of 40 Contents Introduction... 3 Helpful Resources Available on the LiveText Conference Visitors Pass... 3 Overview... 5 Development Model for FEM...

More information

Controlled vocabulary

Controlled vocabulary Indexing languages 6.2.2. Controlled vocabulary Overview Anyone who has struggled to find the exact search term to retrieve information about a certain subject can benefit from controlled vocabulary. Controlled

More information

CS 101 Computer Science I Fall Instructor Muller. Syllabus

CS 101 Computer Science I Fall Instructor Muller. Syllabus CS 101 Computer Science I Fall 2013 Instructor Muller Syllabus Welcome to CS101. This course is an introduction to the art and science of computer programming and to some of the fundamental concepts of

More information

Cal s Dinner Card Deals

Cal s Dinner Card Deals Cal s Dinner Card Deals Overview: In this lesson students compare three linear functions in the context of Dinner Card Deals. Students are required to interpret a graph for each Dinner Card Deal to help

More information

IMGD Technical Game Development I: Iterative Development Techniques. by Robert W. Lindeman

IMGD Technical Game Development I: Iterative Development Techniques. by Robert W. Lindeman IMGD 3000 - Technical Game Development I: Iterative Development Techniques by Robert W. Lindeman gogo@wpi.edu Motivation The last thing you want to do is write critical code near the end of a project Induces

More information

Math 96: Intermediate Algebra in Context

Math 96: Intermediate Algebra in Context : Intermediate Algebra in Context Syllabus Spring Quarter 2016 Daily, 9:20 10:30am Instructor: Lauri Lindberg Office Hours@ tutoring: Tutoring Center (CAS-504) 8 9am & 1 2pm daily STEM (Math) Center (RAI-338)

More information

Speak Up 2012 Grades 9 12

Speak Up 2012 Grades 9 12 2012 Speak Up Survey District: WAYLAND PUBLIC SCHOOLS Speak Up 2012 Grades 9 12 Results based on 130 survey(s). Note: Survey responses are based upon the number of individuals that responded to the specific

More information

1 Instructional Design Website: Making instruction easy for HCPS Teachers Henrico County, Virginia

1 Instructional Design Website: Making instruction easy for HCPS Teachers Henrico County, Virginia 1 Instructional Design Website: Making instruction easy for HCPS Teachers Short Overview The teachers of Henrico County Public Schools had many resources available to them but the resources were scattered

More information

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Xinying Song, Xiaodong He, Jianfeng Gao, Li Deng Microsoft Research, One Microsoft Way, Redmond, WA 98052, U.S.A.

More information

A DISTRIBUTIONAL STRUCTURED SEMANTIC SPACE FOR QUERYING RDF GRAPH DATA

A DISTRIBUTIONAL STRUCTURED SEMANTIC SPACE FOR QUERYING RDF GRAPH DATA International Journal of Semantic Computing Vol. 5, No. 4 (2011) 433 462 c World Scientific Publishing Company DOI: 10.1142/S1793351X1100133X A DISTRIBUTIONAL STRUCTURED SEMANTIC SPACE FOR QUERYING RDF

More information

Outreach Connect User Manual

Outreach Connect User Manual Outreach Connect A Product of CAA Software, Inc. Outreach Connect User Manual Church Growth Strategies Through Sunday School, Care Groups, & Outreach Involving Members, Guests, & Prospects PREPARED FOR:

More information

READY TO WORK PROGRAM INSTRUCTOR GUIDE PART I. LESSON TITLE: Precision Measurement Guided Discussion

READY TO WORK PROGRAM INSTRUCTOR GUIDE PART I. LESSON TITLE: Precision Measurement Guided Discussion READY TO WORK PROGRAM INSTRUCTOR GUIDE PART I LESSON TITLE: Precision Measurement METHOD: Lecture, Guided Discussion EDUCATIONAL OBJECTIVE: Given different precision measurement tool names and drawings,

More information

Outline. Web as Corpus. Using Web Data for Linguistic Purposes. Ines Rehbein. NCLT, Dublin City University. nclt

Outline. Web as Corpus. Using Web Data for Linguistic Purposes. Ines Rehbein. NCLT, Dublin City University. nclt Outline Using Web Data for Linguistic Purposes NCLT, Dublin City University Outline Outline 1 Corpora as linguistic tools 2 Limitations of web data Strategies to enhance web data 3 Corpora as linguistic

More information

Getting Started with Deliberate Practice

Getting Started with Deliberate Practice Getting Started with Deliberate Practice Most of the implementation guides so far in Learning on Steroids have focused on conceptual skills. Things like being able to form mental images, remembering facts

More information

Computer Science 1015F ~ 2016 ~ Notes to Students

Computer Science 1015F ~ 2016 ~ Notes to Students Computer Science 1015F ~ 2016 ~ Notes to Students Course Description Computer Science 1015F and 1016S together constitute a complete Computer Science curriculum for first year students, offering an introduction

More information

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus

CS 1103 Computer Science I Honors. Fall Instructor Muller. Syllabus CS 1103 Computer Science I Honors Fall 2016 Instructor Muller Syllabus Welcome to CS1103. This course is an introduction to the art and science of computer programming and to some of the fundamental concepts

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

Library Reference Services textbook Chapter 7

Library Reference Services textbook Chapter 7 Library Reference Services textbook Chapter 7 Goals of Reference Services Directly aid individual customers (library patrons) in their quest for information, to resolve their research needs and/or assist

More information

Creating an Online Test. **This document was revised for the use of Plano ISD teachers and staff.

Creating an Online Test. **This document was revised for the use of Plano ISD teachers and staff. Creating an Online Test **This document was revised for the use of Plano ISD teachers and staff. OVERVIEW Step 1: Step 2: Step 3: Use ExamView Test Manager to set up a class Create class Add students to

More information

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions. to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about

More information

SCT Banner Student Fee Assessment Training Workbook October 2005 Release 7.2

SCT Banner Student Fee Assessment Training Workbook October 2005 Release 7.2 SCT HIGHER EDUCATION SCT Banner Student Fee Assessment Training Workbook October 2005 Release 7.2 Confidential Business Information --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

More information

Virginia Commonwealth University Retrospective Concussion Diagnostic Interview - Blast. (dd mmm yyyy)

Virginia Commonwealth University Retrospective Concussion Diagnostic Interview - Blast. (dd mmm yyyy) VCUrCDI -Blast Virginia Commonwealth University Retrospective Concussion Diagnostic Interview - Blast Interviewer: Potential Concussive Event (PCE) Label 1. PCE setting 2. Date of PCE Civilian Sector Military;

More information

ScienceDirect. Malayalam question answering system

ScienceDirect. Malayalam question answering system Available online at www.sciencedirect.com ScienceDirect Procedia Technology 24 (2016 ) 1388 1392 International Conference on Emerging Trends in Engineering, Science and Technology (ICETEST - 2015) Malayalam

More information

Preferences...3 Basic Calculator...5 Math/Graphing Tools...5 Help...6 Run System Check...6 Sign Out...8

Preferences...3 Basic Calculator...5 Math/Graphing Tools...5 Help...6 Run System Check...6 Sign Out...8 CONTENTS GETTING STARTED.................................... 1 SYSTEM SETUP FOR CENGAGENOW....................... 2 USING THE HEADER LINKS.............................. 2 Preferences....................................................3

More information

The Role of String Similarity Metrics in Ontology Alignment

The Role of String Similarity Metrics in Ontology Alignment The Role of String Similarity Metrics in Ontology Alignment Michelle Cheatham and Pascal Hitzler August 9, 2013 1 Introduction Tim Berners-Lee originally envisioned a much different world wide web than

More information

PowerTeacher Gradebook User Guide PowerSchool Student Information System

PowerTeacher Gradebook User Guide PowerSchool Student Information System PowerSchool Student Information System Document Properties Copyright Owner Copyright 2007 Pearson Education, Inc. or its affiliates. All rights reserved. This document is the property of Pearson Education,

More information

Mathematics subject curriculum

Mathematics subject curriculum Mathematics subject curriculum Dette er ei omsetjing av den fastsette læreplanteksten. Læreplanen er fastsett på Nynorsk Established as a Regulation by the Ministry of Education and Research on 24 June

More information

Software Maintenance

Software Maintenance 1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories

More information

The Flaws, Fallacies and Foolishness of Benchmark Testing

The Flaws, Fallacies and Foolishness of Benchmark Testing Benchmarking is a great tool for improving an organization's performance...when used or identifying, then tracking (by measuring) specific variables that are proven to be "S.M.A.R.T." That is: Specific

More information

When Student Confidence Clicks

When Student Confidence Clicks When Student Confidence Clicks Academic Self-Efficacy and Learning in HE Fabio R. Aricò 1 ACKNOWLEDGEMENTS UEA-HEFCE Widening Participation Teaching Fellowship HEA Teaching Development Grant Scheme 2 ETHICAL

More information

LEGO MINDSTORMS Education EV3 Coding Activities

LEGO MINDSTORMS Education EV3 Coding Activities LEGO MINDSTORMS Education EV3 Coding Activities s t e e h s k r o W t n e d Stu LEGOeducation.com/MINDSTORMS Contents ACTIVITY 1 Performing a Three Point Turn 3-6 ACTIVITY 2 Written Instructions for a

More information

Performance Analysis of Optimized Content Extraction for Cyrillic Mongolian Learning Text Materials in the Database

Performance Analysis of Optimized Content Extraction for Cyrillic Mongolian Learning Text Materials in the Database Journal of Computer and Communications, 2016, 4, 79-89 Published Online August 2016 in SciRes. http://www.scirp.org/journal/jcc http://dx.doi.org/10.4236/jcc.2016.410009 Performance Analysis of Optimized

More information

Moodle 2 Assignments. LATTC Faculty Technology Training Tutorial

Moodle 2 Assignments. LATTC Faculty Technology Training Tutorial LATTC Faculty Technology Training Tutorial Moodle 2 Assignments This tutorial begins with the instructor already logged into Moodle 2. http://moodle.lattc.edu/ Faculty login id is same as email login id.

More information

Are You Ready? Simplify Fractions

Are You Ready? Simplify Fractions SKILL 10 Simplify Fractions Teaching Skill 10 Objective Write a fraction in simplest form. Review the definition of simplest form with students. Ask: Is 3 written in simplest form? Why 7 or why not? (Yes,

More information

Use of Online Information Resources for Knowledge Organisation in Library and Information Centres: A Case Study of CUSAT

Use of Online Information Resources for Knowledge Organisation in Library and Information Centres: A Case Study of CUSAT DESIDOC Journal of Library & Information Technology, Vol. 31, No. 1, January 2011, pp. 19-24 2011, DESIDOC Use of Online Information Resources for Knowledge Organisation in Library and Information Centres:

More information

CS 446: Machine Learning

CS 446: Machine Learning CS 446: Machine Learning Introduction to LBJava: a Learning Based Programming Language Writing classifiers Christos Christodoulopoulos Parisa Kordjamshidi Motivation 2 Motivation You still have not learnt

More information

A Pumpkin Grows. Written by Linda D. Bullock and illustrated by Debby Fisher

A Pumpkin Grows. Written by Linda D. Bullock and illustrated by Debby Fisher GUIDED READING REPORT A Pumpkin Grows Written by Linda D. Bullock and illustrated by Debby Fisher KEY IDEA This nonfiction text traces the stages a pumpkin goes through as it grows from a seed to become

More information

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE Pierre Foy TIMSS Advanced 2015 orks User Guide for the International Database Pierre Foy Contributors: Victoria A.S. Centurino, Kerry E. Cotter,

More information

Radius STEM Readiness TM

Radius STEM Readiness TM Curriculum Guide Radius STEM Readiness TM While today s teens are surrounded by technology, we face a stark and imminent shortage of graduates pursuing careers in Science, Technology, Engineering, and

More information

Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing

Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing D. Indhumathi Research Scholar Department of Information Technology

More information

The Writing Process. The Academic Support Centre // September 2015

The Writing Process. The Academic Support Centre // September 2015 The Writing Process The Academic Support Centre // September 2015 + so that someone else can understand it! Why write? Why do academics (scientists) write? The Academic Writing Process Describe your writing

More information

Once your credentials are accepted, you should get a pop-window (make sure that your browser is set to allow popups) that looks like this:

Once your credentials are accepted, you should get a pop-window (make sure that your browser is set to allow popups) that looks like this: SCAIT IN ARIES GUIDE Accessing SCAIT The link to SCAIT is found on the Administrative Applications and Resources page, which you can find via the CSU homepage under Resources or click here: https://aar.is.colostate.edu/

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Chapter 4 - Fractions

Chapter 4 - Fractions . Fractions Chapter - Fractions 0 Michelle Manes, University of Hawaii Department of Mathematics These materials are intended for use with the University of Hawaii Department of Mathematics Math course

More information

The Indices Investigations Teacher s Notes

The Indices Investigations Teacher s Notes The Indices Investigations Teacher s Notes These activities are for students to use independently of the teacher to practise and develop number and algebra properties.. Number Framework domain and stage:

More information

School of Innovative Technologies and Engineering

School of Innovative Technologies and Engineering School of Innovative Technologies and Engineering Department of Applied Mathematical Sciences Proficiency Course in MATLAB COURSE DOCUMENT VERSION 1.0 PCMv1.0 July 2012 University of Technology, Mauritius

More information

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working

More information

University of Groningen. Systemen, planning, netwerken Bosman, Aart

University of Groningen. Systemen, planning, netwerken Bosman, Aart University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Notetaking Directions

Notetaking Directions Porter Notetaking Directions 1 Notetaking Directions Simplified Cornell-Bullet System Research indicates that hand writing notes is more beneficial to students learning than typing notes, unless there

More information

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling

Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Experiments with SMS Translation and Stochastic Gradient Descent in Spanish Text Author Profiling Notebook for PAN at CLEF 2013 Andrés Alfonso Caurcel Díaz 1 and José María Gómez Hidalgo 2 1 Universidad

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

Julia Smith. Effective Classroom Approaches to.

Julia Smith. Effective Classroom Approaches to. Julia Smith @tessmaths Effective Classroom Approaches to GCSE Maths resits julia.smith@writtle.ac.uk Agenda The context of GCSE resit in a post-16 setting An overview of the new GCSE Key features of a

More information

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1 Notes on The Sciences of the Artificial Adapted from a shorter document written for course 17-652 (Deciding What to Design) 1 Ali Almossawi December 29, 2005 1 Introduction The Sciences of the Artificial

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Dublin City Schools Mathematics Graded Course of Study GRADE 4 I. Content Standard: Number, Number Sense and Operations Standard Students demonstrate number sense, including an understanding of number systems and reasonable estimates using paper and pencil, technology-supported

More information

How to set up gradebook categories in Moodle 2.

How to set up gradebook categories in Moodle 2. How to set up gradebook categories in Moodle 2. It is possible to set up the gradebook to show divisions in time such as semesters and quarters by using categories. For example, Semester 1 = main category

More information

Social Media Journalism J336F Unique Spring 2016

Social Media Journalism J336F Unique Spring 2016 Social Media Journalism J336F Unique 07865 Spring 2016 Class: Online Professor: Robert Quigley Office hours: T-TH 10:30 to noon and by appointment Email: robert.quigley@austin.utexas.edu Personal social

More information

The Strong Minimalist Thesis and Bounded Optimality

The Strong Minimalist Thesis and Bounded Optimality The Strong Minimalist Thesis and Bounded Optimality DRAFT-IN-PROGRESS; SEND COMMENTS TO RICKL@UMICH.EDU Richard L. Lewis Department of Psychology University of Michigan 27 March 2010 1 Purpose of this

More information

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology Michael L. Connell University of Houston - Downtown Sergei Abramovich State University of New York at Potsdam Introduction

More information

South Carolina English Language Arts

South Carolina English Language Arts South Carolina English Language Arts A S O F J U N E 2 0, 2 0 1 0, T H I S S TAT E H A D A D O P T E D T H E CO M M O N CO R E S TAT E S TA N DA R D S. DOCUMENTS REVIEWED South Carolina Academic Content

More information

Generating Test Cases From Use Cases

Generating Test Cases From Use Cases 1 of 13 1/10/2007 10:41 AM Generating Test Cases From Use Cases by Jim Heumann Requirements Management Evangelist Rational Software pdf (155 K) In many organizations, software testing accounts for 30 to

More information

Introducing the New Iowa Assessments Mathematics Levels 12 14

Introducing the New Iowa Assessments Mathematics Levels 12 14 Introducing the New Iowa Assessments Mathematics Levels 12 14 ITP Assessment Tools Math Interim Assessments: Grades 3 8 Administered online Constructed Response Supplements Reading, Language Arts, Mathematics

More information

Text Compression for Dynamic Document Databases

Text Compression for Dynamic Document Databases Text Compression for Dynamic Document Databases Alistair Moffat Justin Zobel Neil Sharman March 1994 Abstract For compression of text databases, semi-static word-based methods provide good performance

More information

Home Access Center. Connecting Parents to Fulton County Schools

Home Access Center. Connecting Parents to Fulton County Schools Home Access Center Connecting Parents to Fulton County Schools What is Home Access Center? Website available to parents (and at site discretion, students) that is a real-time look at student data The data

More information

LEARNING AGREEMENT FOR STUDIES

LEARNING AGREEMENT FOR STUDIES LEARNING AGREEMENT FOR STUDIES The Student Last name (s) First name (s) Date of birth Nationality 1 Sex [M/F] Academic year 20../20.. Study cycle 2 Phone Subject area, Code 3 E-mail The Sending Institution

More information

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011 Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011 Cristian-Alexandru Drăgușanu, Marina Cufliuc, Adrian Iftene UAIC: Faculty of Computer Science, Alexandru Ioan Cuza University,

More information

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 (click on Math My Way tab) Math My Way Instructors:

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50  (click on Math My Way tab) Math My Way Instructors: This is a team taught directed study course. Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 www.psme.foothill.edu (click on Math My Way tab) Math My Way Instructors: Instructor:

More information

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Treebank mining with GrETEL. Liesbeth Augustinus Frank Van Eynde

Treebank mining with GrETEL. Liesbeth Augustinus Frank Van Eynde Treebank mining with GrETEL Liesbeth Augustinus Frank Van Eynde GrETEL tutorial - 27 March, 2015 GrETEL Greedy Extraction of Trees for Empirical Linguistics Search engine for treebanks GrETEL Greedy Extraction

More information

Objective: Add decimals using place value strategies, and relate those strategies to a written method.

Objective: Add decimals using place value strategies, and relate those strategies to a written method. NYS COMMON CORE MATHEMATICS CURRICULUM Lesson 9 5 1 Lesson 9 Objective: Add decimals using place value strategies, and relate those strategies to a written method. Suggested Lesson Structure Fluency Practice

More information