Normal Distributions Section 15.4

Similar documents
Introduction to the Practice of Statistics

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Measures of the Location of the Data

Probability and Statistics Curriculum Pacing Guide

Student s Edition. Grade 6 Unit 6. Statistics. Eureka Math. Eureka Math

Statistical Studies: Analyzing Data III.B Student Activity Sheet 7: Using Technology

STA 225: Introductory Statistics (CT)

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

A Guide to Adequate Yearly Progress Analyses in Nevada 2007 Nevada Department of Education

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Measurement. Time. Teaching for mastery in primary maths

On-the-Fly Customization of Automated Essay Scoring

Linking the Ohio State Assessments to NWEA MAP Growth Tests *

Interpreting ACER Test Results

If a measurement is given, can we convert that measurement to different units to meet our needs?

Sample Problems for MATH 5001, University of Georgia

Grade Dropping, Strategic Behavior, and Student Satisficing

Visit us at:

Probability Therefore (25) (1.33)

Grade 6: Correlated to AGS Basic Math Skills

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

Broward County Public Schools G rade 6 FSA Warm-Ups

Shockwheat. Statistics 1, Activity 1

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

Why Did My Detector Do That?!

AP Statistics Summer Assignment 17-18

MGF 1106 Final Exam Review / (sections )

Statewide Framework Document for:

Lesson M4. page 1 of 2

Diagnostic Test. Middle School Mathematics

Name: Class: Date: ID: A

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Association Between Categorical Variables

STAT 220 Midterm Exam, Friday, Feb. 24

College Pricing and Income Inequality

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

The Editor s Corner. The. Articles. Workshops. Editor. Associate Editors. Also In This Issue

Financing Education In Minnesota

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

Simple Random Sample (SRS) & Voluntary Response Sample: Examples: A Voluntary Response Sample: Examples: Systematic Sample Best Used When

Julia Smith. Effective Classroom Approaches to.

Centre for Evaluation & Monitoring SOSCA. Feedback Information

ASSESSMENT REPORT FOR GENERAL EDUCATION CATEGORY 1C: WRITING INTENSIVE

4 th Grade Number and Operations in Base Ten. Set 3. Daily Practice Items And Answer Keys

Discriminative Learning of Beam-Search Heuristics for Planning

School Size and the Quality of Teaching and Learning

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Probability estimates in a scenario tree

Educational Attainment

KeyTrain Level 7. For. Level 7. Published by SAI Interactive, Inc., 340 Frazier Avenue, Chattanooga, TN

Physics 270: Experimental Physics

Learning Disability Functional Capacity Evaluation. Dear Doctor,

Opinion on Private Garbage Collection in Scarborough Mixed

The Evolution of Random Phenomena

arxiv: v1 [cs.lg] 3 May 2013

Algebra 2- Semester 2 Review

Math 96: Intermediate Algebra in Context

learning collegiate assessment]

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

OFFICE SUPPORT SPECIALIST Technical Diploma

16.1 Lesson: Putting it into practice - isikhnas

Preliminary Chapter survey experiment an observational study that is not a survey

NEW NCAA Division I Initial-Eligibility Academic Requirements

Research Design & Analysis Made Easy! Brainstorming Worksheet

Fourth Grade. Reporting Student Progress. Libertyville School District 70. Fourth Grade

On the Distribution of Worker Productivity: The Case of Teacher Effectiveness and Student Achievement. Dan Goldhaber Richard Startz * August 2016

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

Independent Assurance, Accreditation, & Proficiency Sample Programs Jason Davis, PE

Mathematics Assessment Plan

The Federal Reserve Bank of New York

Lecture 1: Machine Learning Basics

Evaluation of a College Freshman Diversity Research Program

Foothill College Summer 2016

Number of students enrolled in the program in Fall, 2011: 20. Faculty member completing template: Molly Dugan (Date: 1/26/2012)

CHAPTER III RESEARCH METHOD

NCEO Technical Report 27

MINUTE TO WIN IT: NAMING THE PRESIDENTS OF THE UNITED STATES

College Pricing and Income Inequality

Going to School: Measuring Schooling Behaviors in GloFish

Progress Monitoring for Behavior: Data Collection Methods & Procedures

Managerial Decision Making

Mathacle PSet Stats, Concepts in Statistics and Probability Level Number Name: Date:

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

Multi-Lingual Text Leveling

Linking the Common European Framework of Reference and the Michigan English Language Assessment Battery Technical Report

Name Class Date. Graphing Proportional Relationships

EDEXCEL FUNCTIONAL SKILLS PILOT. Maths Level 2. Chapter 7. Working with probability

The Effect of Written Corrective Feedback on the Accuracy of English Article Usage in L2 Writing

Non intrusive multi-biometrics on a mobile device: a comparison of fusion techniques

School of Innovative Technologies and Engineering

Language properties and Grammar of Parallel and Series Parallel Languages

TRAVEL TIME REPORT. Casualty Actuarial Society Education Policy Committee October 2001

Functional Maths Skills Check E3/L x

Using Proportions to Solve Percentage Problems I

Dana Chisnell, UsabilityWorks Ethan Newby, Newby Research (consultant on statistics) Sharon Laskowski, NIST Svetlana Lowry, NIST

Learning Lesson Study Course

Transcription:

A B I L E N E C H R I S T I A N U N I V E R S I T Y Department of Mathematics Normal Distributions Section 15.4 Dr. John Ehrke Department of Mathematics Fall 2012

Measures of Relative Standing Sometimes you need to know the position of one data point relative to all other data points within a given population. For example, if you took a standardized test in high school, like the SAT or the ACT, you, but especially the college you applied to for entrance, will want to know how your score of 1500 on the SAT compares to someone else s score of 1450. Is 1500 really that much higher relatively speaking? What if the school wanted to compare your score versus another student s ACT score? They are different exams, so is there a way to compare them? If the school required you to score in the top 30% of all those who took the exam, does your score make the cut? We will answer these and other questions like them in this section. Slide 2/23 Dr. John Ehrke Lecture 8 Fall 2012

Probability Distribution Suppose you have a set of measurements on a continuous random variable, and you create a relative frequency histogram to describe the data set. For a small number of measurements, only a few classes is needed, but as more and more measurements are collected, the need for more classes (and therefore shorter class widths) becomes necessary. As the number of measurements becomes very large, and the class widths become very narrow, the relative frequency histogram appears more and more like a smooth curve. This smooth curve describes the probability distribution of the continuous random variable. Slide 3/23 Dr. John Ehrke Lecture 8 Fall 2012

Shape of a Normal Distribution The shape of the normal distribution depends on the mean and standard deviation. These three graphs have the same mean, but different standard deviations. As the standard deviation increases, the distribution becomes more spread out. Slide 4/23 Dr. John Ehrke Lecture 8 Fall 2012

Sample Z-Score The mean and standard deviation of a sample can be used to calculate a z-score, which measures the relative standing of a measurement in a data set. Definition The sample z-score is a measure of relative standing defined by z-score = x x s x. If the population mean, µ, and the population standard deviation σ are known, we use these in place of x and s x, respectively. What does a z-score tell you? Slide 5/23 Dr. John Ehrke Lecture 8 Fall 2012

College-Bound Student Data for SAT 2012 Data in this report are for high school graduates in the year 2012. Information is summarized for seniors who took the SAT at any time during their high school years through June 2012. If a student took the test more than once, the most recent score is used. The overall mean scores for the major areas are listed below. Example Draw distributions for each of the three main areas: Critical Reading, Mathematics, and Writing and determine the z-score associated with a reading score of 720, math score of 600, and writing score of 400. Slide 6/23 Dr. John Ehrke Lecture 8 Fall 2012

College-Bound Student Data for SAT 2012 Slide 7/23 Dr. John Ehrke Lecture 8 Fall 2012

Polling Question #19 Suppose you are given a set of data that is normally distributed with mean, x = 18 and standard deviation, s x = 5. Which of the following is true? (a) The z-score corresponding to the data item x = 7 is 2.2. (b) The data item x = 25 has a smaller z-score than x = 7. (c) The data item x = 25 is 1.4 standard deviations above the mean. (d) The data item x = 7 is 2.2 standard deviations above the mean. Slide 8/23 Dr. John Ehrke Lecture 8 Fall 2012

Polling Question #20 A student scores 60 on a vocabulary test and 80 on a grammar test. The data items for both tests are normally distributed. The vocabulary test has a mean of 50 and a standard deviation of 5. The grammar test has a mean of 72 and standard deviation of 6. Which of the following best describes the student s relative performance on the exams? (a) The student s relative performance was better on the vocabulary test. (b) The student s relative performance was better on the grammar test. (c) The student did equally well on both exams. (d) The student s relative performance cannot be determined. Slide 9/23 Dr. John Ehrke Lecture 8 Fall 2012

Empirical Rule Theorem (Empirical Rule) Given a distribution of measurements that is approximately mound-shaped: The interval (µ ± σ) contains approximately 68% of the data. The interval (µ ± 2σ) contains approximately 95% of the data. The interval (µ ± 3σ) contains approximately 99.7% of the data. Slide 10/23 Dr. John Ehrke Lecture 8 Fall 2012

Modified Empirical Rule Slide 11/23 Dr. John Ehrke Lecture 8 Fall 2012

Applying the Empirical Rule Example Load the bodytemp.txt file on your calculator. In this example, we will use the empirical rule to determine if the population is normally distributed. 1 Calculate the sample mean, x and the sample standard deviation, s x. 2 Use the Empirical rule to describe the distribution of data points between x ± s x, x ± 2s x, and x ± 3s x. 3 How does this estimation compare to the actual data? 4 Based on your observation, do you think the data is normally distributed? There are a wide variety of normality tests used by statisticians to answer the same kind of question we are attempting to answer in this question. While such tests are beyond the level of this course all require a large sample size, i.e. n > 20 and more realistically n > 50 to be of any great value. Slide 12/23 Dr. John Ehrke Lecture 8 Fall 2012

Polling Question #21 The scores on a test are normally distributed with a mean of 70 and standard deviation of 7. What percentage of students scored between a 63 and 84? (a) 13.5% (b) 84% (c) 47.5% (d) 81.5% Slide 13/23 Dr. John Ehrke Lecture 8 Fall 2012

The Normal Cumulative Distribution Function (CDF) There are three basic types of questions we are interested in concerning areas underneath the probability density function. P(x b) P(x a) P(a x b) Slide 14/23 Dr. John Ehrke Lecture 8 Fall 2012

The Basic Idea Example Consider a normally distributed set of data with mean, x = 10 and standard deviation, s x = 2. (a) Find the probability that x lies between 11 and 13.6. (b) Find the probability that x is greater than 12.2. (c) Find the probability that x is less than 8.7. Slide 15/23 Dr. John Ehrke Lecture 8 Fall 2012

Normal CDF Question Example Studies show that the gasoline usage of compact cars sold in the United States is normally distributed with mean 25.5 miles per gallon (mpg) and a standard deviation of 4.5 mpg. (a) What percentage of compacts get 20 mpg or more? (b) Which is more likely? Finding a compact car that gets more than 50 mpg, or finding a compact car that gets less than 10 mpg? (c) What percentage of compacts get between 16.5 and 34.5 mpg? Slide 16/23 Dr. John Ehrke Lecture 8 Fall 2012

Calculating Percentiles Suppose we changed the previous question up slightly and rather than ask what percentage of compact cars fell in certain intervals we asked you to find the miles per gallon a compact car must have to be considered in the top 5% among all other compacts produced during that year. In other words, find x 0 such that P(x x 0 ) = 0.95. This is the 95th percentile of the distribution, and can be found as follows: Solution: We begin by trying to calculate the z-score for such a point. z 0 = x 0 25.5 4.5 Since the value of z 0 corresponds to x 0, it must also have area.95 to its left. In the z-score table, we find that the z-score which corresponds to this point is z 0 = 1.64. Thus we have z 0 = x 0 25.5 4.5 = 1.64. Solving for x 0 we obtain x 0 = 32.88. So a car must get at least approximately 33 mpg to be in the top 5% among all compact cars. Slide 17/23 Dr. John Ehrke Lecture 8 Fall 2012

Inverse Normal Command (a) What z-score is the 80th percentile? (b) What z-score is the 30th percentile? (c) Between what two z-scores is the middle 40% of the data? (d) If a distribution has a mean of 40 and a standard deviation of 4, what is the 75th percentile? (e) If you know that the mean salary for your profession is $53,000 with a standard deviation of $2500, to what percentile does your salary $57,000 correspond? Slide 18/23 Dr. John Ehrke Lecture 8 Fall 2012

Grading on the Curve Example Sometimes grading on a curve isn t always a good thing. Suppose an instructor grades on a curve by assuming the test scores are normally distributed. If the average grade is 70 and the standard deviation is 8, answer the questions below if the instructor wishes to assign grades as follows: 10% A s, 20% B s, 40% C s, 20% D s, and 10% F s. (a) If you made a 75 on the exam, and the exam was graded on the curve described above, what grade would you receive? (b) What is the lowest grade you could make and still receive an A on the exam? (c) What is the cutoff for passing with a C? (d) To be considered in the top quarter of the class, what grade would you have to make? Slide 19/23 Dr. John Ehrke Lecture 8 Fall 2012

Grading on the Curve Slide 20/23 Dr. John Ehrke Lecture 8 Fall 2012

Polling Question #22 An honors program requires a student score in the top 2.5% on a particular exam to be considered for entry into the program. From past experience, out of all those who have taken the exam, the average test score is 100 and the standard deviation is 5. What is the minimum score a student must make to be considered by the honors program? Answers are rounded to the nearest whole number. (a) 110 (b) 108 (c) 90 (d) 100 Slide 21/23 Dr. John Ehrke Lecture 8 Fall 2012

Margin of Error for the Sample Mean The margin of error formula to calculate the margin of error of a sample mean, provided that we have a sample from a population that is normally distributed and know the population (or sample) standard deviation is given by E = z α/2 σ n and the associated confidence interval is CI = x ± z α/2 E α = level of confidence, usually α = 0.05 represents 1-0.05 =.95 = 95% confidence level z α/2 = this is the point on the standard normal for which α/2 area lies above this point σ = population standard deviation n = sample size Slide 22/23 Dr. John Ehrke Lecture 8 Fall 2012

Margin of Error and Confidence Intervals Example Use the margin of error formula above to calculate the margin of error in the following presidential poll where 700 likely voters (LV) were sampled: Obama Romney 48 % 52% Slide 23/23 Dr. John Ehrke Lecture 8 Fall 2012