Bootstrapping: described and illustrated Comparing the standard mean, 10 and 20% trimmed means, & median

Size: px
Start display at page:

Download "Bootstrapping: described and illustrated Comparing the standard mean, 10 and 20% trimmed means, & median"

Transcription

1 Bootstrapping: described and illustrated Comparing the standard mean, 10 and 20% trimmed means, & median Before discussing the main topic, let us quickly review sampling distributions so that everyone is clear on the major theoretical background. Because the concept of a sampling distribution of a statistic (especially a mean) is so fundamental to bootstrapping what it s about, why it works as it does I want to review the following: The sampling distribution of the mean has three principal characteristics you should remember: (1) For any sample size n, the mean of all (!) sample means [drawn from a finite population] is necessarily equal to the population mean (such a statistic is said to be unbiased); (2) The variance of the distribution of all means (always) equals the population variance divided by n; and (perhaps surprisingly), (3) As sample size, n, grows larger, the shape of the sampling distribution of the mean tends toward that of a normal curve, regardless of the shape or form of the parent population. Thus, it is properly said that the distribution of the sample mean, necessarily has mean µ, and variance σ 2 /n, where these Greek letters stand for the population mean and variance respectively. Moreover, such a sampling distribution approaches normal form as the sample size n grows larger for [virtually] every population! It is the generality of the last point that is so distinctive. The square root of the variance σ 2 /n (written as σ/ n, so we may write σ mean = σ/ n) is called the standard error of the mean (i.e., the standard deviation of the sampling distribution of the mean, σ mean ); a term that is frequently encountered in statistical practice. What has just been stated are the principal results of the Central limit theorem. (Stop to note that the sampling distribution [of any statistic] is itself a population; but such a distribution is wholly distinct from the distribution of the parent population, or from the distribution of any particular sample. Because of its status as a population, the characteristics of a sampling distribution are generally denoted by Greek letters [consider that all possible samples of a given size were the sources of the statistics]. But don t confuse the sampling distribution (of any statistic and there is generally a different one for different statistics) with the parent population from which samples were drawn. The preceding points are fundamental to the bootstrap method (see below). But note that when we speak about a bootstrap distribution of a statistic we are talking about an approximate sampling distribution of a particular statistic (not just the mean!), based on a large number of bootstrap samples; and for each sample, the sampling is done with replacement from a particular sample-as-population. And each sample is of the same size as the original, i.e., n. Still that large number [1000 below] of statistics is far smaller than the total of all possible samples, which is generally n to the power n (n n ) in a bootstrap context, or N to the power n (N n ), for a finite population of size N.) Bootstrapping entails sampling with replacement from a vector (or rows of matrix or data frame; see next page, bottom for how to do in R), so that each bootstrap sample is always the same size as the original. But don t confine your thinking to just the mean as we begin to consider bootstrapping; in general, bootstrap distributions can be created for any statistic that can be computed and each statistic is based on a set of resampled data points. The following illustration begins from a vector y that contains n = 100 values, originally generated as a random sample from the t 3 distribution, i.e. t w/ 3 degrees of freedom, and then scaled to have a mean of 20 and a standard deviation of about 3. This accounts for the relatively long tails of y, compared with a Gaussian (normal) distribution that you see below. See plot of y

2 (which is both a sample and a population, depending on your point of view both ideas are relevant); its summary statistics (parameters?) are given below. Here is the R function I used to obtain four central tendency estimates for each of 1000 bootstrap samples: (Copy and paste means4 into your R session) means4 <- function(x,tr1=.1,tr2=.2) { # function that, given vector x, computes FOUR statistics to assess central tendency xm1 <- mean(x) xmt.1 <- mean(x, trim =tr1) # 10% trimmed mean xmt.2 <- mean(x,trim=tr2) # 20% trimmed mean xm.5 <- median(x) # 50% trimmed mean = median xms <- c(xm1, xmt.1,xmt.2, xm.5) xms=round(xms,2) list(xms=xms) } #the four means above are given as mean 1...mean4 below. Now, we use the bootstrap function from the library bootstrap: mns4.y <- bootstrap(y, nboots=1000,means4) command used for main bootstrap run (1000 replicates)[ nboot >= 1000 for good C.I s] I generated 1000 bootstrap replications of the four statistics [for library: bootstrap in R] All R commands given on page 4 below. Numerical summary of bootstrap results: >cbind(my.summary(y=x),my.summary(mns4.500)) pop. mean1 mean2 mean3 mean4 #mean1 is just conventional mean. means departures from 20 indicate bias s.d.s #first value is popul s.d./rest (bold vals=s.e. s) skewns of s.d.s are bootstrap s.e. s

3 will discuss! See plot next p. #key results in bold italics above AND BELOW. A second run, again w/ 1000 bootstrap replications, gave results: means # I ignore the pop. values here s.d.s as they did not change. skewns For practical purposes, identical. Following are the four bootstrap distributions for the first set: NB: The initial population was a long-tailed sample. It s use affords an opportunity to study the conventional sample mean as an estimate of the center of a distribution, when normality does not hold. We in fact see below that the conventional mean is the worst of the four estimators of the center of the distribution of the parent population, based on 1000 bootstrap samples. Remember initial sample as population had n=100 scores, so that n = 100 for each sample. Thus, the first s.e., for mean1, can be calculated by theory; that theory says divide the population s.d. by sqrt(n); here 3.03/sqrt(100)=.303. We are most informed by the computed standard error estimates; these quantify how well the different estimators of the center of this distribution work in relation to one another. To repeat, each bootstrap sample entails sampling WITH replacement from the elements of the initial data vector y. Each of the B = 1000 bootstrap samples contains n = 100 resampled scores, and all four statistics ( means ; three trimmed ) were computed for each bootstrap sample. The summary results, and especially standard error estimates, based on the bootstrap replicates are the principal results on which one will usually focus in studies like this. See the documentation for bootstrap for more information as to what this function does, or can do.

4 The first major book on the bootstrap was written by Bradley Efron, inventor of the bootstrap, and Tibshirani: An introduction to the bootstrap, There are now at least a dozen books, many of them technical, about bootstrapping. (The May 2003 issue of Statistical Science is devoted exclusively to articles on bootstrapping, for its 25 th anniversary.) Some things you may find useful about bootstrapping within the world of R: 1. A vector such as y, regarded as y[1:n], where one controls contents, e.g. y[c(1,3)] = 1 st and 3 rd elements of y; or y[n:1] presents y values in reverse order; or y[sample(1:n,n,repl=t)] yields a bootstrap sample of y, of size n; and the latter, repeated (using sampling WITH replacement), becomes a basis for bootstrap analysis. 2. A matrix such as yy, regarded as yy[1:n,1:p] (of order n x p) can be examined in parts using bracket notation; e.g. yy[1:3, ] displays the first 3 rows of yy; also, to sample the rows of yy, use yy[sample(1:n,n,repl=t), ], where comma in [, ] separates row and column designations. R commands used to get the preceding numerical results: > bt.mn4=bootstrap(x,nboot=1000,theta=means4) #x is called xt3 in Fig1 > bt.mns4=(as.data.frame(bt.mn4$theta)) #the output thetastar is of class list ; needs to be data.frame or a matrix for what follows. > bt.mns4=t(bt.mns4) #transpose of matrix (d.frame) bt.mns4 is taken for convenience (below) > gpairs(bt.mns4) #gpairs function is from package YaleToolkit > my.summary(bt.mns4) #I wrote my.summary. Copy it into your R session, or just use summary from R. my.summary <- function (xxx, dig = 3) { #xxx is taken to be input data.frame or matrix. xxx <- as.matrix(xxx) xm <- apply(xxx, 2, mean) s.d <- sqrt(apply(xxx, 2, var)) xs <- scale(xxx) sk <- apply(xs^3, 2, mean) kr <- apply(xs^4, 2, mean) - 3 rg <- apply(xxx, 2, range) sumry <- round(rbind(xm, s.d, sk, kr, rg), 3) dimnames(sumry)[1] <- list(c("means", "s.d.s", "skewns", "krtsis", "low", "high")) sumry <- round(sumry, dig) sumry } Bootstrapping sources in R R functions for bootstrapping can be found in the bootstrap and the boot library, so you should examine the help files for several of the functions in these libraries to see how to proceed. Note that bootstrap is a much smaller library than boot, and generally easier to use effectively. I recommend that you begin w/ the function bootstrap in the library of the same name, but the greater generality of boot will be most helpful in some more advanced situations. The help.search function will yield more packages that reference bootstrapping, so give that a try too. Naturally, many introductions and discussions can be found on the web; let s see what you like post a URL or two.

5 These pages summarize key points, and offer more general principles. Of particular interest is the concept of confidence intervals, and ways to use bootstrap methods to generate limits for CIs. Note the (deliberate) redundancy in what follows and what you have read above. The essence of bootstrapping: We begin by assuming a quantitative variable, for which we want to characterize or describe using numerous different statistics (means, medians, trimmed means, variances, sds, skewness, etc.). Our goal is to make inferences about parent population parameters using confidence intervals that have in turn been constructed using the information from a reasonably large number of computer generated bootstrap samples. (Take note: we will NOT introduce any mathematical theory here; all that follows involves computer intensive computation, but no theory as such.) 1. Begin from an initial sample, not too small (say, 30 or 40 cases at a minimum); this should be a random sample, or a sample for which we can reasonably argue that it reasonably re- presents some larger universe of scores that we shall think of as our parent population. Let y represent the sample data. 2. Decide what feature(s) of the parent population we would like to make inferences about ( center, spread, skewness, etc.); then, given one or two choices, say center and spread, decide on what statistics we want to use for inferences. We might have two, three or more alternative measures of each feature (e.g., four means for center; s.d.s and IQRs to assess spread, etc), a total of S statistics, say. One goal here is to compare various estimators with one another with respect to their purposes in helping to make inferences. We might also have begun with difference scores in our initial vector. 3. Compute and save each of these statistics for our initial sample of y values; we shall call them by a special name: bootstrap parameters (which are also statistics, see below). Reflect on this point since it is easy to be confused here. 3. Choose or write functions that we will be able to apply to each bootstrap sample, where each bootstrap sample is simply a sample drawn w/out replacement from the initial sample. The initial sample will now be regarded for the purposes of bootstrapping as our bootstrap population. (Note carefully that we must take care in what follows to distinguish the parent population from the bootstrap population. The latter population can be said to have bootstrap parameters that are also properly labeled as conventional sample statistics.) 4. Generate bootstrap samples a substantial number of times (say B = 500 to 1000 of these), where we save these bootstrap replicates (those that measure center, spread, skewness) for each of the bootstrap samples. Best to generate an array (matrix, of order B x S) that contains all of these; they shall be called replicate values for the respective statistics, and they will be the basis for the ultimate inferences. 5. Summarize the preceding table by columns, one for each statistic that relates to a particular feature of the initial bootstrap population (recalling that our bootstrap population began as our initial sample). Both numeric and graphical methods should usually be employed. 6. Compute and compare the (conventional) means of the replicate statistics (columns) with the bootstrap population parameters; the differences may positive or negative, and these differences measure bias. Ideally, we might seek zero bias, but small amounts of bias are usually tolerated, particularly if the biased statistics have compensating virtues, especially relatively small variation across the set of bootstrap samples. 7. Then compute and compare the s.d.s of the respective statistics; often the main goal of the entire bootstrapping study is to find which statistics have the smallest s.d.s (which is to say bootstrap standard errors) since these are the statistics that will have the narrowest confidence

6 intervals. If a statistic is found to be notably biased, we may want to adjust the statistics (nominally used as centers of our ultimate confidence intervals). 8. Generate the density distributions (histograms ok) and, more importantly, selected quantiles, of any bootstrap statistics (the bootstrap replicates) we generated. For, example if we aim to get a 95% interval for a trimmed mean, we find the 2.5% and the 97.5% quantile points of the distribution of that trimmed mean, and (supposing it has minimal bias) these become our confidence limits for a 95% interval. We will surely want to compare these limits with those for the conventional mean. Statistics with the narrowest CIs can usually be said to be best, particularly if they were found to be minimally biased. Similar methods are used for 99% CIs, etc. Graphics can be useful in this context, but be sure to note that all the information is based on a (rather arbitrary) initial sample, so care has to be taken not to misinterpret, or over-interpret results. 9. Summarize by comprehensively describing the main results, also noting that this methodology has bypassed normal theory methods that strictly speaking, apply only when normality assumptions can be invoked; moreover, we have made no assumptions about shapes or other features of the (putative [look it up!]) parent population. In particular, we have not assumed normality at any point, and we have gone (well) beyond the mean to consider virtually any statistic of interest (review these; add others). 10. Finally, recognize that the interpretation of any such bootstrap CI is essentially the same as that for a conventional CI gotten by normal theory methods. These ideas readily generalize to statistics that are vectors, such as vectors of regression coefficients. This means we are all free to invent and use bootstrap methods to study the comparative merits and demerits of a wide variety of statistics, without regard for whether they are supported by normal theory. We need not invoke normality assumptions, nor make any other so-called parametric assumptions in the process. The main thing to note is that any bootstrap sample drawn from a vector or matrix, is just a sample drawn with replacement from the ROWS of an initial sample data matrix; and (vector) bootstrap statistics are computed for each bootstrap matrix, analogous to what has been described above. The computation may be intense but with modern computers such operations are readily carried out for rather large matrices (thousands of rows, hundreds of columns) if efficient computational routines are used. Conventional statistics can be notably inferior to certain new counterparts, a point that needs often to be considered seriously.

Probability and Statistics Curriculum Pacing Guide

Probability and Statistics Curriculum Pacing Guide Unit 1 Terms PS.SPMJ.3 PS.SPMJ.5 Plan and conduct a survey to answer a statistical question. Recognize how the plan addresses sampling technique, randomization, measurement of experimental error and methods

More information

STA 225: Introductory Statistics (CT)

STA 225: Introductory Statistics (CT) Marshall University College of Science Mathematics Department STA 225: Introductory Statistics (CT) Course catalog description A critical thinking course in applied statistical reasoning covering basic

More information

Analysis of Enzyme Kinetic Data

Analysis of Enzyme Kinetic Data Analysis of Enzyme Kinetic Data To Marilú Analysis of Enzyme Kinetic Data ATHEL CORNISH-BOWDEN Directeur de Recherche Émérite, Centre National de la Recherche Scientifique, Marseilles OXFORD UNIVERSITY

More information

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Statewide Framework Document for:

Statewide Framework Document for: Statewide Framework Document for: 270301 Standards may be added to this document prior to submission, but may not be removed from the framework to meet state credit equivalency requirements. Performance

More information

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview Algebra 1, Quarter 3, Unit 3.1 Line of Best Fit Overview Number of instructional days 6 (1 day assessment) (1 day = 45 minutes) Content to be learned Analyze scatter plots and construct the line of best

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Ch 2 Test Remediation Work Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) High temperatures in a certain

More information

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES THE PRESIDENTS OF THE UNITED STATES Project: Focus on the Presidents of the United States Objective: See how many Presidents of the United States

More information

The Good Judgment Project: A large scale test of different methods of combining expert predictions

The Good Judgment Project: A large scale test of different methods of combining expert predictions The Good Judgment Project: A large scale test of different methods of combining expert predictions Lyle Ungar, Barb Mellors, Jon Baron, Phil Tetlock, Jaime Ramos, Sam Swift The University of Pennsylvania

More information

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design. Name: Partner(s): Lab #1 The Scientific Method Due 6/25 Objective The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

More information

Introduction to Simulation

Introduction to Simulation Introduction to Simulation Spring 2010 Dr. Louis Luangkesorn University of Pittsburgh January 19, 2010 Dr. Louis Luangkesorn ( University of Pittsburgh ) Introduction to Simulation January 19, 2010 1 /

More information

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Instructor: Mario D. Garrett, Ph.D.   Phone: Office: Hepner Hall (HH) 100 San Diego State University School of Social Work 610 COMPUTER APPLICATIONS FOR SOCIAL WORK PRACTICE Statistical Package for the Social Sciences Office: Hepner Hall (HH) 100 Instructor: Mario D. Garrett,

More information

A Comparison of Charter Schools and Traditional Public Schools in Idaho

A Comparison of Charter Schools and Traditional Public Schools in Idaho A Comparison of Charter Schools and Traditional Public Schools in Idaho Dale Ballou Bettie Teasley Tim Zeidner Vanderbilt University August, 2006 Abstract We investigate the effectiveness of Idaho charter

More information

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology Essentials of Ability Testing Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology Basic Topics Why do we administer ability tests? What do ability tests measure? How are

More information

On-the-Fly Customization of Automated Essay Scoring

On-the-Fly Customization of Automated Essay Scoring Research Report On-the-Fly Customization of Automated Essay Scoring Yigal Attali Research & Development December 2007 RR-07-42 On-the-Fly Customization of Automated Essay Scoring Yigal Attali ETS, Princeton,

More information

AP Statistics Summer Assignment 17-18

AP Statistics Summer Assignment 17-18 AP Statistics Summer Assignment 17-18 Welcome to AP Statistics. This course will be unlike any other math class you have ever taken before! Before taking this course you will need to be competent in basic

More information

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE Edexcel GCSE Statistics 1389 Paper 1H June 2007 Mark Scheme Edexcel GCSE Statistics 1389 NOTES ON MARKING PRINCIPLES 1 Types of mark M marks: method marks A marks: accuracy marks B marks: unconditional

More information

Truth Inference in Crowdsourcing: Is the Problem Solved?

Truth Inference in Crowdsourcing: Is the Problem Solved? Truth Inference in Crowdsourcing: Is the Problem Solved? Yudian Zheng, Guoliang Li #, Yuanbing Li #, Caihua Shan, Reynold Cheng # Department of Computer Science, Tsinghua University Department of Computer

More information

Why Did My Detector Do That?!

Why Did My Detector Do That?! Why Did My Detector Do That?! Predicting Keystroke-Dynamics Error Rates Kevin Killourhy and Roy Maxion Dependable Systems Laboratory Computer Science Department Carnegie Mellon University 5000 Forbes Ave,

More information

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

Introduction to Causal Inference. Problem Set 1. Required Problems

Introduction to Causal Inference. Problem Set 1. Required Problems Introduction to Causal Inference Problem Set 1 Professor: Teppei Yamamoto Due Friday, July 15 (at beginning of class) Only the required problems are due on the above date. The optional problems will not

More information

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt Certification Singapore Institute Certified Six Sigma Professionals Certification Courses in Six Sigma Green Belt ly Licensed Course for Process Improvement/ Assurance Managers and Engineers Leading the

More information

NIH Public Access Author Manuscript J Prim Prev. Author manuscript; available in PMC 2009 December 14.

NIH Public Access Author Manuscript J Prim Prev. Author manuscript; available in PMC 2009 December 14. NIH Public Access Author Manuscript Published in final edited form as: J Prim Prev. 2009 September ; 30(5): 497 512. doi:10.1007/s10935-009-0191-y. Using a Nonparametric Bootstrap to Obtain a Confidence

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Extending Place Value with Whole Numbers to 1,000,000

Extending Place Value with Whole Numbers to 1,000,000 Grade 4 Mathematics, Quarter 1, Unit 1.1 Extending Place Value with Whole Numbers to 1,000,000 Overview Number of Instructional Days: 10 (1 day = 45 minutes) Content to Be Learned Recognize that a digit

More information

Shockwheat. Statistics 1, Activity 1

Shockwheat. Statistics 1, Activity 1 Statistics 1, Activity 1 Shockwheat Students require real experiences with situations involving data and with situations involving chance. They will best learn about these concepts on an intuitive or informal

More information

School Size and the Quality of Teaching and Learning

School Size and the Quality of Teaching and Learning School Size and the Quality of Teaching and Learning An Analysis of Relationships between School Size and Assessments of Factors Related to the Quality of Teaching and Learning in Primary Schools Undertaken

More information

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Jana Kitzmann and Dirk Schiereck, Endowed Chair for Banking and Finance, EUROPEAN BUSINESS SCHOOL, International

More information

Proof Theory for Syntacticians

Proof Theory for Syntacticians Department of Linguistics Ohio State University Syntax 2 (Linguistics 602.02) January 5, 2012 Logics for Linguistics Many different kinds of logic are directly applicable to formalizing theories in syntax

More information

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and in other settings. He may also make use of tests in

More information

arxiv: v1 [math.at] 10 Jan 2016

arxiv: v1 [math.at] 10 Jan 2016 THE ALGEBRAIC ATIYAH-HIRZEBRUCH SPECTRAL SEQUENCE OF REAL PROJECTIVE SPECTRA arxiv:1601.02185v1 [math.at] 10 Jan 2016 GUOZHEN WANG AND ZHOULI XU Abstract. In this note, we use Curtis s algorithm and the

More information

Software Maintenance

Software Maintenance 1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories

More information

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education GCSE Mathematics B (Linear) Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education Mark Scheme for November 2014 Oxford Cambridge and RSA Examinations OCR (Oxford Cambridge

More information

How to Judge the Quality of an Objective Classroom Test

How to Judge the Quality of an Objective Classroom Test How to Judge the Quality of an Objective Classroom Test Technical Bulletin #6 Evaluation and Examination Service The University of Iowa (319) 335-0356 HOW TO JUDGE THE QUALITY OF AN OBJECTIVE CLASSROOM

More information

Lesson M4. page 1 of 2

Lesson M4. page 1 of 2 Lesson M4 page 1 of 2 Miniature Gulf Coast Project Math TEKS Objectives 111.22 6b.1 (A) apply mathematics to problems arising in everyday life, society, and the workplace; 6b.1 (C) select tools, including

More information

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4 Chapters 1-5 Cumulative Assessment AP Statistics Name: November 2008 Gillespie, Block 4 Part I: Multiple Choice This portion of the test will determine 60% of your overall test grade. Each question is

More information

Physics 270: Experimental Physics

Physics 270: Experimental Physics 2017 edition Lab Manual Physics 270 3 Physics 270: Experimental Physics Lecture: Lab: Instructor: Office: Email: Tuesdays, 2 3:50 PM Thursdays, 2 4:50 PM Dr. Uttam Manna 313C Moulton Hall umanna@ilstu.edu

More information

MGT/MGP/MGB 261: Investment Analysis

MGT/MGP/MGB 261: Investment Analysis UNIVERSITY OF CALIFORNIA, DAVIS GRADUATE SCHOOL OF MANAGEMENT SYLLABUS for Fall 2014 MGT/MGP/MGB 261: Investment Analysis Daytime MBA: Tu 12:00p.m. - 3:00 p.m. Location: 1302 Gallagher (CRN: 51489) Sacramento

More information

Probability estimates in a scenario tree

Probability estimates in a scenario tree 101 Chapter 11 Probability estimates in a scenario tree An expert is a person who has made all the mistakes that can be made in a very narrow field. Niels Bohr (1885 1962) Scenario trees require many numbers.

More information

Research Design & Analysis Made Easy! Brainstorming Worksheet

Research Design & Analysis Made Easy! Brainstorming Worksheet Brainstorming Worksheet 1) Choose a Topic a) What are you passionate about? b) What are your library s strengths? c) What are your library s weaknesses? d) What is a hot topic in the field right now that

More information

Enhancing Students Understanding Statistics with TinkerPlots: Problem-Based Learning Approach

Enhancing Students Understanding Statistics with TinkerPlots: Problem-Based Learning Approach Enhancing Students Understanding Statistics with TinkerPlots: Problem-Based Learning Approach Krongthong Khairiree drkrongthong@gmail.com International College, Suan Sunandha Rajabhat University, Bangkok,

More information

Role Models, the Formation of Beliefs, and Girls Math. Ability: Evidence from Random Assignment of Students. in Chinese Middle Schools

Role Models, the Formation of Beliefs, and Girls Math. Ability: Evidence from Random Assignment of Students. in Chinese Middle Schools Role Models, the Formation of Beliefs, and Girls Math Ability: Evidence from Random Assignment of Students in Chinese Middle Schools Alex Eble and Feng Hu February 2017 Abstract This paper studies the

More information

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Innov High Educ (2009) 34:93 103 DOI 10.1007/s10755-009-9095-2 Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Phyllis Blumberg Published online: 3 February

More information

Houghton Mifflin Online Assessment System Walkthrough Guide

Houghton Mifflin Online Assessment System Walkthrough Guide Houghton Mifflin Online Assessment System Walkthrough Guide Page 1 Copyright 2007 by Houghton Mifflin Company. All Rights Reserved. No part of this document may be reproduced or transmitted in any form

More information

Assignment 1: Predicting Amazon Review Ratings

Assignment 1: Predicting Amazon Review Ratings Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for

More information

Ohio s Learning Standards-Clear Learning Targets

Ohio s Learning Standards-Clear Learning Targets Ohio s Learning Standards-Clear Learning Targets Math Grade 1 Use addition and subtraction within 20 to solve word problems involving situations of 1.OA.1 adding to, taking from, putting together, taking

More information

Sector Differences in Student Learning: Differences in Achievement Gains Across School Years and During the Summer

Sector Differences in Student Learning: Differences in Achievement Gains Across School Years and During the Summer Catholic Education: A Journal of Inquiry and Practice Volume 7 Issue 2 Article 6 July 213 Sector Differences in Student Learning: Differences in Achievement Gains Across School Years and During the Summer

More information

Measures of the Location of the Data

Measures of the Location of the Data OpenStax-CNX module m46930 1 Measures of the Location of the Data OpenStax College This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 The common measures

More information

Universityy. The content of

Universityy. The content of WORKING PAPER #31 An Evaluation of Empirical Bayes Estimation of Value Added Teacher Performance Measuress Cassandra M. Guarino, Indianaa Universityy Michelle Maxfield, Michigan State Universityy Mark

More information

Visit us at:

Visit us at: White Paper Integrating Six Sigma and Software Testing Process for Removal of Wastage & Optimizing Resource Utilization 24 October 2013 With resources working for extended hours and in a pressurized environment,

More information

Interpreting ACER Test Results

Interpreting ACER Test Results Interpreting ACER Test Results This document briefly explains the different reports provided by the online ACER Progressive Achievement Tests (PAT). More detailed information can be found in the relevant

More information

On the Distribution of Worker Productivity: The Case of Teacher Effectiveness and Student Achievement. Dan Goldhaber Richard Startz * August 2016

On the Distribution of Worker Productivity: The Case of Teacher Effectiveness and Student Achievement. Dan Goldhaber Richard Startz * August 2016 On the Distribution of Worker Productivity: The Case of Teacher Effectiveness and Student Achievement Dan Goldhaber Richard Startz * August 2016 Abstract It is common to assume that worker productivity

More information

Introduction to the Practice of Statistics

Introduction to the Practice of Statistics Chapter 1: Looking at Data Distributions Introduction to the Practice of Statistics Sixth Edition David S. Moore George P. McCabe Bruce A. Craig Statistics is the science of collecting, organizing and

More information

NCEO Technical Report 27

NCEO Technical Report 27 Home About Publications Special Topics Presentations State Policies Accommodations Bibliography Teleconferences Tools Related Sites Interpreting Trends in the Performance of Special Education Students

More information

Probability Therefore (25) (1.33)

Probability Therefore (25) (1.33) Probability We have intentionally included more material than can be covered in most Student Study Sessions to account for groups that are able to answer the questions at a faster rate. Use your own judgment,

More information

B. How to write a research paper

B. How to write a research paper From: Nikolaus Correll. "Introduction to Autonomous Robots", ISBN 1493773070, CC-ND 3.0 B. How to write a research paper The final deliverable of a robotics class often is a write-up on a research project,

More information

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010) Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010) Jaxk Reeves, SCC Director Kim Love-Myers, SCC Associate Director Presented at UGA

More information

What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data

What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data What s in a Step? Toward General, Abstract Representations of Tutoring System Log Data Kurt VanLehn 1, Kenneth R. Koedinger 2, Alida Skogsholm 2, Adaeze Nwaigwe 2, Robert G.M. Hausmann 1, Anders Weinstein

More information

University of Groningen. Systemen, planning, netwerken Bosman, Aart

University of Groningen. Systemen, planning, netwerken Bosman, Aart University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Working Paper: Do First Impressions Matter? Improvement in Early Career Teacher Effectiveness Allison Atteberry 1, Susanna Loeb 2, James Wyckoff 1

Working Paper: Do First Impressions Matter? Improvement in Early Career Teacher Effectiveness Allison Atteberry 1, Susanna Loeb 2, James Wyckoff 1 Center on Education Policy and Workforce Competitiveness Working Paper: Do First Impressions Matter? Improvement in Early Career Teacher Effectiveness Allison Atteberry 1, Susanna Loeb 2, James Wyckoff

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Time and Place: MW 3:00-4:20pm, A126 Wells Hall Instructor: Dr. Marianne Huebner Office: A-432 Wells Hall

More information

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Sanket S. Kalamkar and Adrish Banerjee Department of Electrical Engineering

More information

Norms How were TerraNova 3 norms derived? Does the norm sample reflect my diverse school population?

Norms How were TerraNova 3 norms derived? Does the norm sample reflect my diverse school population? Frequently Asked Questions Today s education environment demands proven tools that promote quality decision making and boost your ability to positively impact student achievement. TerraNova, Third Edition

More information

STAT 220 Midterm Exam, Friday, Feb. 24

STAT 220 Midterm Exam, Friday, Feb. 24 STAT 220 Midterm Exam, Friday, Feb. 24 Name Please show all of your work on the exam itself. If you need more space, use the back of the page. Remember that partial credit will be awarded when appropriate.

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

Mathematics subject curriculum

Mathematics subject curriculum Mathematics subject curriculum Dette er ei omsetjing av den fastsette læreplanteksten. Læreplanen er fastsett på Nynorsk Established as a Regulation by the Ministry of Education and Research on 24 June

More information

An Introduction to Simio for Beginners

An Introduction to Simio for Beginners An Introduction to Simio for Beginners C. Dennis Pegden, Ph.D. This white paper is intended to introduce Simio to a user new to simulation. It is intended for the manufacturing engineer, hospital quality

More information

16.1 Lesson: Putting it into practice - isikhnas

16.1 Lesson: Putting it into practice - isikhnas BAB 16 Module: Using QGIS in animal health The purpose of this module is to show how QGIS can be used to assist in animal health scenarios. In order to do this, you will have needed to study, and be familiar

More information

Graduate Division Annual Report Key Findings

Graduate Division Annual Report Key Findings Graduate Division 2010 2011 Annual Report Key Findings Trends in Admissions and Enrollment 1 Size, selectivity, yield UCLA s graduate programs are increasingly attractive and selective. Between Fall 2001

More information

ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES

ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES Kevin Stange Ford School of Public Policy University of Michigan Ann Arbor, MI 48109-3091

More information

Mathacle PSet Stats, Concepts in Statistics and Probability Level Number Name: Date:

Mathacle PSet Stats, Concepts in Statistics and Probability Level Number Name: Date: 1 st Quarterly Exam ~ Sampling, Designs, Exploring Data and Regression Part 1 Review I. SAMPLING MC I-1.) [APSTATSMC2014-6M] Approximately 52 percent of all recent births were boys. In a simple random

More information

Exploring Derivative Functions using HP Prime

Exploring Derivative Functions using HP Prime Exploring Derivative Functions using HP Prime Betty Voon Wan Niu betty@uniten.edu.my College of Engineering Universiti Tenaga Nasional Malaysia Wong Ling Shing Faculty of Health and Life Sciences, INTI

More information

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur) Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur) 1 Interviews, diary studies Start stats Thursday: Ethics/IRB Tuesday: More stats New homework is available

More information

12- A whirlwind tour of statistics

12- A whirlwind tour of statistics CyLab HT 05-436 / 05-836 / 08-534 / 08-734 / 19-534 / 19-734 Usable Privacy and Security TP :// C DU February 22, 2016 y & Secu rivac rity P le ratory bo La Lujo Bauer, Nicolas Christin, and Abby Marsh

More information

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District Report Submitted June 20, 2012, to Willis D. Hawley, Ph.D., Special

More information

GCE. Mathematics (MEI) Mark Scheme for June Advanced Subsidiary GCE Unit 4766: Statistics 1. Oxford Cambridge and RSA Examinations

GCE. Mathematics (MEI) Mark Scheme for June Advanced Subsidiary GCE Unit 4766: Statistics 1. Oxford Cambridge and RSA Examinations GCE Mathematics (MEI) Advanced Subsidiary GCE Unit 4766: Statistics 1 Mark Scheme for June 2013 Oxford Cambridge and RSA Examinations OCR (Oxford Cambridge and RSA) is a leading UK awarding body, providing

More information

Generating Test Cases From Use Cases

Generating Test Cases From Use Cases 1 of 13 1/10/2007 10:41 AM Generating Test Cases From Use Cases by Jim Heumann Requirements Management Evangelist Rational Software pdf (155 K) In many organizations, software testing accounts for 30 to

More information

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili: Postimputation Module WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015 Ricopili Overview Ricopili Overview postimputation, 12 steps 1) Association analysis 2) Meta analysis

More information

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

CHAPTER 4: REIMBURSEMENT STRATEGIES 24 CHAPTER 4: REIMBURSEMENT STRATEGIES 24 INTRODUCTION Once state level policymakers have decided to implement and pay for CSR, one issue they face is simply how to calculate the reimbursements to districts

More information

success. It will place emphasis on:

success. It will place emphasis on: 1 First administered in 1926, the SAT was created to democratize access to higher education for all students. Today the SAT serves as both a measure of students college readiness and as a valid and reliable

More information

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL. Centre Number Candidate Number For Examiner s Use Surname Other Names Candidate Signature Examiner s Initials Mathematics Unit Statistics 4 Tuesday 24 June 2014 General Certificate of Education Advanced

More information

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011 CAAP Content Analysis Report Institution Code: 911 Institution Type: 4-Year Normative Group: 4-year Colleges Introduction This report provides information intended to help postsecondary institutions better

More information

Professor Christina Romer. LECTURE 24 INFLATION AND THE RETURN OF OUTPUT TO POTENTIAL April 20, 2017

Professor Christina Romer. LECTURE 24 INFLATION AND THE RETURN OF OUTPUT TO POTENTIAL April 20, 2017 Economics 2 Spring 2017 Professor Christina Romer Professor David Romer LECTURE 24 INFLATION AND THE RETURN OF OUTPUT TO POTENTIAL April 20, 2017 I. OVERVIEW II. HOW OUTPUT RETURNS TO POTENTIAL A. Moving

More information

I N T E R P R E T H O G A N D E V E L O P HOGAN BUSINESS REASONING INVENTORY. Report for: Martina Mustermann ID: HC Date: May 02, 2017

I N T E R P R E T H O G A N D E V E L O P HOGAN BUSINESS REASONING INVENTORY. Report for: Martina Mustermann ID: HC Date: May 02, 2017 S E L E C T D E V E L O P L E A D H O G A N D E V E L O P I N T E R P R E T HOGAN BUSINESS REASONING INVENTORY Report for: Martina Mustermann ID: HC906276 Date: May 02, 2017 2 0 0 9 H O G A N A S S E S

More information

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s)) Ohio Academic Content Standards Grade Level Indicators (Grade 11) A. ACQUISITION OF VOCABULARY Students acquire vocabulary through exposure to language-rich situations, such as reading books and other

More information

School of Innovative Technologies and Engineering

School of Innovative Technologies and Engineering School of Innovative Technologies and Engineering Department of Applied Mathematical Sciences Proficiency Course in MATLAB COURSE DOCUMENT VERSION 1.0 PCMv1.0 July 2012 University of Technology, Mauritius

More information

Individual Differences & Item Effects: How to test them, & how to test them well

Individual Differences & Item Effects: How to test them, & how to test them well Individual Differences & Item Effects: How to test them, & how to test them well Individual Differences & Item Effects Properties of subjects Cognitive abilities (WM task scores, inhibition) Gender Age

More information

School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne

School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne Web Appendix See paper for references to Appendix Appendix 1: Multiple Schools

More information

Jason A. Grissom Susanna Loeb. Forthcoming, American Educational Research Journal

Jason A. Grissom Susanna Loeb. Forthcoming, American Educational Research Journal Triangulating Principal Effectiveness: How Perspectives of Parents, Teachers, and Assistant Principals Identify the Central Importance of Managerial Skills Jason A. Grissom Susanna Loeb Forthcoming, American

More information

On-Line Data Analytics

On-Line Data Analytics International Journal of Computer Applications in Engineering Sciences [VOL I, ISSUE III, SEPTEMBER 2011] [ISSN: 2231-4946] On-Line Data Analytics Yugandhar Vemulapalli #, Devarapalli Raghu *, Raja Jacob

More information

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers. Approximate Time Frame: 3-4 weeks Connections to Previous Learning: In fourth grade, students fluently multiply (4-digit by 1-digit, 2-digit by 2-digit) and divide (4-digit by 1-digit) using strategies

More information

Corpus Linguistics (L615)

Corpus Linguistics (L615) (L615) Basics of Markus Dickinson Department of, Indiana University Spring 2013 1 / 23 : the extent to which a sample includes the full range of variability in a population distinguishes corpora from archives

More information

American Journal of Business Education October 2009 Volume 2, Number 7

American Journal of Business Education October 2009 Volume 2, Number 7 Factors Affecting Students Grades In Principles Of Economics Orhan Kara, West Chester University, USA Fathollah Bagheri, University of North Dakota, USA Thomas Tolin, West Chester University, USA ABSTRACT

More information

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410) JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD 21218. (410) 516 5728 wrightj@jhu.edu EDUCATION Harvard University 1993-1997. Ph.D., Economics (1997).

More information

WORK OF LEADERS GROUP REPORT

WORK OF LEADERS GROUP REPORT WORK OF LEADERS GROUP REPORT ASSESSMENT TO ACTION. Sample Report (9 People) Thursday, February 0, 016 This report is provided by: Your Company 13 Main Street Smithtown, MN 531 www.yourcompany.com INTRODUCTION

More information