ECO220 Mid-Term Test (June 29, 2005) Page 1 of 5 SOLUTIONS. T (1) Adding up the heights of the bars in a frequency histogram yields the sample size.

Similar documents
Probability and Statistics Curriculum Pacing Guide

STA 225: Introductory Statistics (CT)

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Measures of the Location of the Data

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

AP Statistics Summer Assignment 17-18

STAT 220 Midterm Exam, Friday, Feb. 24

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Lesson M4. page 1 of 2

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Shockwheat. Statistics 1, Activity 1

Broward County Public Schools G rade 6 FSA Warm-Ups

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

4-3 Basic Skills and Concepts

May To print or download your own copies of this document visit Name Date Eurovision Numeracy Assignment

Statewide Framework Document for:

Mathematics subject curriculum

Algebra 2- Semester 2 Review

Theory of Probability

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Level 1 Mathematics and Statistics, 2015

Visit us at:

Using Proportions to Solve Percentage Problems I

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

Financing Education In Minnesota

Unit 3: Lesson 1 Decimals as Equal Divisions

Grade 6: Correlated to AGS Basic Math Skills

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

Statistical Studies: Analyzing Data III.B Student Activity Sheet 7: Using Technology

Introduction to Causal Inference. Problem Set 1. Required Problems

Paper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER

Effectiveness of McGraw-Hill s Treasures Reading Program in Grades 3 5. October 21, Research Conducted by Empirical Education Inc.

Office Hours: Mon & Fri 10:00-12:00. Course Description

Assignment 1: Predicting Amazon Review Ratings

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Options for Updating Wyoming s Regional Cost Adjustment

GCE. Mathematics (MEI) Mark Scheme for June Advanced Subsidiary GCE Unit 4766: Statistics 1. Oxford Cambridge and RSA Examinations

Probability Therefore (25) (1.33)

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Cal s Dinner Card Deals

Individual Differences & Item Effects: How to test them, & how to test them well

Math 121 Fundamentals of Mathematics I

Student s Edition. Grade 6 Unit 6. Statistics. Eureka Math. Eureka Math

An Introduction to Simio for Beginners

Functional Skills Mathematics Level 2 assessment

Multiplication of 2 and 3 digit numbers Multiply and SHOW WORK. EXAMPLE. Now try these on your own! Remember to show all work neatly!

Julia Smith. Effective Classroom Approaches to.

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

Fourth Grade. Reporting Student Progress. Libertyville School District 70. Fourth Grade

Why Did My Detector Do That?!

Diagnostic Test. Middle School Mathematics

A Program Evaluation of Connecticut Project Learning Tree Educator Workshops

Informal Comparative Inference: What is it? Hand Dominance and Throwing Accuracy

The Effect of Written Corrective Feedback on the Accuracy of English Article Usage in L2 Writing

MGF 1106 Final Exam Review / (sections )

Technical Manual Supplement

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Trends in College Pricing

Lecture 1: Machine Learning Basics

Extending Place Value with Whole Numbers to 1,000,000

Lesson 12. Lesson 12. Suggested Lesson Structure. Round to Different Place Values (6 minutes) Fluency Practice (12 minutes)

Introduction to Simulation

Mathacle PSet Stats, Concepts in Statistics and Probability Level Number Name: Date:

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Mathematics process categories

ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES

How do adults reason about their opponent? Typologies of players in a turn-taking game

On-the-Fly Customization of Automated Essay Scoring

Statistics and Probability Standards in the CCSS- M Grades 6- HS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Managerial Decision Making

2013 TRIAL URBAN DISTRICT ASSESSMENT (TUDA) RESULTS

Hardhatting in a Geo-World

Alignment of Australian Curriculum Year Levels to the Scope and Sequence of Math-U-See Program

The Editor s Corner. The. Articles. Workshops. Editor. Associate Editors. Also In This Issue

Math Grade 3 Assessment Anchors and Eligible Content

(I couldn t find a Smartie Book) NEW Grade 5/6 Mathematics: (Number, Statistics and Probability) Title Smartie Mathematics

UNIT ONE Tools of Algebra

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

BENCHMARK TREND COMPARISON REPORT:

Redirected Inbound Call Sampling An Example of Fit for Purpose Non-probability Sample Design

Amiens Whistler Principal s News

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

Functional Maths Skills Check E3/L x

JOB OUTLOOK 2018 NOVEMBER 2017 FREE TO NACE MEMBERS $52.00 NONMEMBER PRICE NATIONAL ASSOCIATION OF COLLEGES AND EMPLOYERS

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Go fishing! Responsibility judgments when cooperation breaks down

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

Executive Guide to Simulation for Health

Mathematics Scoring Guide for Sample Test 2005

Transcription:

ECO220 Mid-Term Test (June 29, 2005) Page of 5 SOLUTIONS T () Adding up the heights of the bars in a frequency histogram yields the sample size. F (2) In general, as the sample size increases you should reduce the number of bins in a density histogram. F (3) The standard error of the population mean is given by: T (4) The interquartile range is robust to outliers.. N F (5) A 90% confidence interval estimator is more likely to exclude the true population parameter with a sample size of 0 compared to a sample size of 00. F (6) Inferences should not be based on very small samples because estimators from such samples are biased. T (7) If is a normal random variable then variable. Y 2 * 0 e is also a normal random T (8) If, 2, 3, 4, 5 are independent random variables with identical E is equivalent to ] probability distributions, then [ 5] E [ 5 i. i F (9) If, 2, 3, 4, 5 are independent random variables with identical probability distributions, then V[ 5] is equivalent to V [ 5 i ]. i F (0) If a purely random sample is selected and the sample does not suffer from nonresponse or selection bias, then you can conclude that you have experimental data and not observational data. b () In which case is it reasonable to expect that the sample median will be greater than the sample mean? b (2) The sample standard deviation and the coefficient of variation of the data shown in this histogram is closest to which of the following? e (3) How many extra units should the firm expect to sell if it runs 0 additional promotions? a (4) If the baggage of all passenger tiers is handled in the same way, what is the probability (to the nearest hundredth) that of the 0 people who had their bags lost by the airline, none of them are super elite?

ECO220 Mid-Term Test (June 29, 2005) Page 2 of 5 c (5) Consider a population with a variance of 00. To obtain a 95% interval estimator of the population mean such that the difference between the upper and lower confidence limits is no more than 2 units would require a sample size of at least: c (6) Consider the data summarized in the tabulation below. Which of the following is a reasonable inference? d (7) Which of the following statements is most plausible? e (8) Based on these survey results what is the point estimate of the total dollar amount (includes bonuses and salary increases) that the company should set aside if it wants to meet its employees expectations? b (9) Which of the following scatter diagrams is plausible when and Y are measured in 000 s of dollars: d (0) What is the s.d. of the amount of additional compensation (includes bonus and salary increase) expected by employees? b () If on average Q is million units, then what is the expected value of total costs (TC)? b (2) For which of these populations would a sample size of 0,000 yield the narrowest (most precise) 95% confidence interval estimator of the population mean? d (3) Consider the sample of data shown in this graph. The 95% confidence interval estimator (rounded to nearest tenth) of μ is: e (4) Suppose and Y are independent. ~ U[0, 0] and Y ~ U[-0, 0], where U[a, b] denotes the uniform distribution and its parameters. If a sample of 00 observations of each is taken, how is the sum of the sample means distributed? a (5) Suppose that on an average Saturday night during peak season, 24 of a hotel s 304 rooms request room service. What is the probability that on a given Saturday night during peak season 35 or more rooms request room service? () Call Diane s sales and Dan s sales Y. Must find V[ + Y]. V[ + Y] V[] + V[Y] + 2COV[,Y] E[] 0.5* + 0.5*2.5 E[Y] 0.6* + 0.4*2.4 V[] 0.5(.5) 2 + 0.5(2.5) 2 0.25 V[Y] 0.6(.4) 2 + 0.4(2.4) 2 0.24 COV[,Y] 0.4(.5)(.4) + 0.(.5)(2.4) + 0.2(2.5)(.4) + 0.3(2.5)(2.4) 0. V[ + Y] 0.25 + 0.24 + 2*0. 0.69 0.69 0.5 0.83; The s.d. of total sales per month is 0.83 sales.

ECO220 Mid-Term Test (June 29, 2005) Page 3 of 5 (2) (a) The population contains an observation for each high school student, where the actual number of caffeinated soft drinks that each purchases in a day from any vending machine on school property is recorded. This population would have a discrete distribution because the number of caffeinated soft drinks that a high school student could potentially purchase in a day is: 0,, 2, 3, up to some maximum that is not very large (can certainly be no more than the capacity of the vending machines). This population would have the property that there would be a large number of 0 s (students that don t purchase) and lots of students buying or maybe 2 drinks. Given that the number of successes (purchasing a drink) is likely to be small one may be tempted to say that it will be Poisson distributed. Across students the important assumption is that all students are independence draws from the SAME Poisson experiment (has the same lambda parameter). If difference students have different tastes (different lambda's), which would be likely as not all students have the same tastes for caffeinated soft drinks, then the data will have overdispersion (variance > mean) and hence will not be Poisson distributed, which we talked about in Lecture 4. This is another reason to say that it is not likely to be Poisson. However, purchases are not likely to be independent (if you already bought one recently then you are less likely to buy another one in the subsequent time period). The Binomial distribution is also inappropriate because of the violation of independence and because it is not possible to identify a fixed number of trials. (Even if you wanted to call the breaks in between classes trials this would not work because there is the possibility that during a single trial a student could buy more than one can.) Hence, other than to note that the population will be discrete, and note its properties (lots of zeros and small integers) we cannot say it will follow a named distribution that we have studied. (Note: the normal distribution would be completely inappropriate. The population is very discrete: most students would buy 0, or 2 drinks. Also, it is not possible for a student to buy a negative number of drinks which is likely what the normal approximation would predict.) (b) The population parameter of interest is the population mean (μ). A plausible estimator of this parameter is the sample mean (-bar). (c) () Selection Bias #: All students in the relevant population do NOT have an equal chance of being included in the sample. Those students that do not purchase caffeinated soft drinks have 0 probability of being included in the sample. This is because even among students that visit the vending machine, this researcher is ONLY selecting those that actually purchase a CAFFEINATED beverage. In addition, students that do not make ANY purchases from the vending machine also have 0 probability of being included in the sample. This introduces a selection bias. In this case NO ONE in the collected sample will have purchased 0 caffeinated soft drinks. This is a big problem given that lots of students in the target population will purchase 0. (2) Selection Bias #2: Even among students that purchase a caffeinated beverage, the probability of inclusion in the sample is NOT equal. Those students that purchase more caffeinated soft drinks would have a higher probability of being included in the sample: they would be visiting the vending machine more frequently and hence have multiple chances to

ECO220 Mid-Term Test (June 29, 2005) Page 4 of 5 be included in the random sample. You might remember from class that this bias has a name: an avidity bias (more avid people have a higher probability of inclusion in the sample). (3) Selection Bias #3: The researcher is standing next to a particular vending machine in the high school. It is likely that different types of students visit different vending machines, which are located in different areas of the school. (For example if the machine the researcher is standing next to is near the gymnasium then the sample would include a disproportionate number of athletic students.) Again, not all students would have an equal probability of inclusion: the type of students that use that particular vending machine would be disproportionately represented. (4) Target Population not equal to Sampled Population: The researcher is only sampling students from a particular high school. However, the assignment was to study a typical high school student. To the extent that the school that the researcher selected is different or special in any way (which it likely is given that there is considerable variance across school districts) than this sample will reflect that difference. Students at other high schools have no chance of being included in this sample. (5) Researcher is asking wrong question. (i) Researcher is asking how many the student already drank. However, those students that are surveyed earlier in the day would give lower numbers than those surveyed later in the day. The assignment asked how many caffeinated soft drinks would be purchased in an entire day by a typical high school student. If you ask a student how many they had so far and the day is not over yet then you have not collected the correct piece of information. (ii) Researcher is asking about soft drinks rather than caffeinated soft drinks. (iii) Researcher is asking how many the student already drank rather than how many the student purchased. (d) The sample mean would be a biased estimator of the population mean. It is likely that the sample mean would be biased upward: the reason is that we are excluding all students that drink zero from our sample and disproportionately sampling students that drink large numbers of soft drinks. The sample mean would also not be a consistent estimator of the population mean. The reason is that there are many systematic biases present in the sampling plan and survey instrument (discussed in (c)). These will not go away by simply collecting a larger sample size. For example, collecting data on 450 randomly selected students instead of 50 will do NOTHING to address the biases in the data collection process. Hence in this case the estimator will be biased and inconsistent. 0 7 P ( < 0) P Z < P Z < 27 (3) (a) (.35) 0. 0890

ECO220 Mid-Term Test (June 29, 2005) Page 5 of 5 (b) μ P( μ n 7 27 30 < 0) P Z < 3 3 0 3 0 7 27 30 3 0 0.9487 P( Z < 7.30) 0 (c) ( Y < 0) 2* 0. 6 ( 2) P Note: The probability is found using the formula for the area of a rectangle: A b*h. (d) μ P( Y (6 + ( 2)) μ 7 2 2 (6 ( 2)) 2 n 30 < 0) P Z < 8 2 30 0 7 P( Z 27 30 8 2 3 0 3 3 0 < 7.30) 0 0.9487 (e) In (a) and (c) we obtained different answer for the probability of obtaining a single observation less than zero. The reason is that in (a) we used the normal distribution and in (c) we used the uniform distribution. Even though the population distributions in both cases have the same mean and variance, the probabilities were different because of the different shapes of these two distributions. In (b) and (d) we obtained the same answer for the probability of obtaining a sample mean with n 30 that is less than zero. The reason is the sampling distribution of the sample mean is the same in both cases: normal. It is normal in (b) because the population was normal. It is normal in (d) because of the Central Limit theorem. Hence, because the shape of the distributions is the same and the parameters are the same, the probabilities must be the same.