Probability An Introduction with Applications

Similar documents
Probability and Statistics Curriculum Pacing Guide

STA 225: Introductory Statistics (CT)

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

Theory of Probability

Introduction to Simulation

Shockwheat. Statistics 1, Activity 1

CS/SE 3341 Spring 2012

Mathematics subject curriculum

UNIT ONE Tools of Algebra

Probability and Game Theory Course Syllabus

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Self Study Report Computer Science

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Montana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011

Julia Smith. Effective Classroom Approaches to.

An Introduction to Simio for Beginners

Physics 270: Experimental Physics

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

Syllabus ENGR 190 Introductory Calculus (QR)

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Mathematics process categories

Major Milestones, Team Activities, and Individual Deliverables

Learning Disability Functional Capacity Evaluation. Dear Doctor,

AP Statistics Summer Assignment 17-18

The Indices Investigations Teacher s Notes

EGRHS Course Fair. Science & Math AP & IB Courses

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Enhancing Learning with a Poster Session in Engineering Economy

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Lecture 1: Machine Learning Basics

Firms and Markets Saturdays Summer I 2014

Lecturing Module

Characteristics of Functions

Foothill College Summer 2016

Mathematics Success Level E

Cal s Dinner Card Deals

Designing a Computer to Play Nim: A Mini-Capstone Project in Digital Design I

Technical Manual Supplement

4.0 CAPACITY AND UTILIZATION

Science Fair Project Handbook

Mathematics Success Grade 7

Page 1 of 11. Curriculum Map: Grade 4 Math Course: Math 4 Sub-topic: General. Grade(s): None specified

Dublin City Schools Mathematics Graded Course of Study GRADE 4

learning collegiate assessment]

Lecture 10: Reinforcement Learning

Radius STEM Readiness TM

Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

Grade 6: Correlated to AGS Basic Math Skills

Grade 5 + DIGITAL. EL Strategies. DOK 1-4 RTI Tiers 1-3. Flexible Supplemental K-8 ELA & Math Online & Print

Mathematics Program Assessment Plan

Math 098 Intermediate Algebra Spring 2018

BADM 641 (sec. 7D1) (on-line) Decision Analysis August 16 October 6, 2017 CRN: 83777

Ascension Health LMS. SumTotal 8.2 SP3. SumTotal 8.2 Changes Guide. Ascension

Fourth Grade. Reporting Student Progress. Libertyville School District 70. Fourth Grade

Statewide Framework Document for:

Innovative Methods for Teaching Engineering Courses

The Good Judgment Project: A large scale test of different methods of combining expert predictions

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

THEORETICAL CONSIDERATIONS

Proof Theory for Syntacticians

Improving Conceptual Understanding of Physics with Technology

FIGURE IT OUT! MIDDLE SCHOOL TASKS. Texas Performance Standards Project

Learning Microsoft Office Excel

INFORMS Transactions on Education. Blitzograms Interactive Histograms

Python Machine Learning

Extending Place Value with Whole Numbers to 1,000,000

MGT/MGP/MGB 261: Investment Analysis

Build on students informal understanding of sharing and proportionality to develop initial fraction concepts.

Math Techniques of Calculus I Penn State University Summer Session 2017

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Visit us at:

How do adults reason about their opponent? Typologies of players in a turn-taking game

Highlighting and Annotation Tips Foundation Lesson

Rendezvous with Comet Halley Next Generation of Science Standards

Office Hours: Mon & Fri 10:00-12:00. Course Description

TEKS Resource System. Effective Planning from the IFD & Assessment. Presented by: Kristin Arterbury, ESC Region 12

MKTG 611- Marketing Management The Wharton School, University of Pennsylvania Fall 2016

D Road Maps 6. A Guide to Learning System Dynamics. System Dynamics in Education Project

WORK OF LEADERS GROUP REPORT

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand

TABE 9&10. Revised 8/2013- with reference to College and Career Readiness Standards

This scope and sequence assumes 160 days for instruction, divided among 15 units.

THEORY OF PLANNED BEHAVIOR MODEL IN ELECTRONIC LEARNING: A PILOT STUDY

White Paper. The Art of Learning

Classroom Connections Examining the Intersection of the Standards for Mathematical Content and the Standards for Mathematical Practice

Executive Guide to Simulation for Health

Stacks Teacher notes. Activity description. Suitability. Time. AMP resources. Equipment. Key mathematical language. Key processes

TEACHER'S TRAINING IN A STATISTICS TEACHING EXPERIMENT 1

Visual CP Representation of Knowledge

THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS

Curriculum Design Project with Virtual Manipulatives. Gwenanne Salkind. George Mason University EDCI 856. Dr. Patricia Moyer-Packenham

MTH 141 Calculus 1 Syllabus Spring 2017

Transcription:

Probability An Introduction with Applications 0.5 0.2 0 0 2 0 0 5 0.05 0.1 0 5 10 15 0 40 60 80 Gordon B. Hazen

Preface to the instructor This text is meant as an introduction to calculus-based probability, and can be used as a preparation for further material in statistics, stochastic processes or decision analysis. It has been used in this fashion in the industrial engineering department at Northwestern University for the past fifteen years. The text has several important features which distinguish it from typical probability texts: Linguistic presentation of random variables and events In this text, I bypass the traditional sample-space/ set-theoretic introduction to probability in favor of an intuitive linguistic presentation involving random variables and events. The set-theoretic foundation for probability is in my view a mathematical nicety which is never practically used in applications and need not appear at the introductory level (although it may be important for more advanced levels of instruction). An intuitive notion of random variable is taken as the primitive, and events are introduced as statements about random variables using the linguistic operators and, or, not rather than the set-theoretic operators of union, intersection, and complement. In my view, this is how experienced users of probability think. (For example, when considering whether two events A and B are disjoint, an experienced probabilist does not think What is the sample space? What subset of represents A? What subset represents B? Finally, is A B =? Rather he/she simply asks Can A and B occur simultaneously?, which is the way disjointness is presented in this text.) The goal is to move students more quickly towards examples of how probability is really used. Students are presented early in the course with examples and problems having multiple random variables. This early introduction is easy because, as just stated, random variables are taken as the primitives. Real probability models are filled with random variables, both independent and dependent, but typical probability textbooks do not introduce joint distributions until the middle or end of the course. Again, the goal is to move more quickly toward examples of real use of probability. Combinatorial probability, that is, the use of counting methods to calculate probabilities, n is almost completely bypassed, with the exception of the combination operator for k the binomial model. Combinatorial counting problems are peripheral to most important areas of probability and its applications, and the essential features of probability modeling can be presented without including this topic. The usual combinatorial approaches are replaced in this text with approaches based on multiplying conditional probabilities. Integration of Excel-based Monte Carlo simulation Spreadsheet-based Monte Carlo simulation is integrated throughout the text, both as a problem solving tool and as an instructional device. 2

Early in the text, I introduce a self-contained method for conducting Monte Carlo simultion using spreadsheet software such as Microsoft Excel. No other special software is required, and any reader with spreadsheet software should easily be able to perform a Monte Carlo simulation. I use Monte Carlo simulation examples and exercises to help build the reader s intuition about key probability concepts such as the convergence of relative frequencies and longterm averages to probabilities and expected values (the strong law of large numbers). Later on I use Monte Carlo simulation to reinforce intuition about probability density functions for continuous random variables, and to concretely illustrate the implications and meaning of the central limit theorem. I emphasize the usefulness of Monte Carlo simulation as a problem-solving tool, especially when algebraic methods are unwieldy. I employ examples and exercises in such applications as electrical power system reliability, hazardous waste transport, inventory modeling, and facility location. I discuss the application of elementary techniques of statistical inference to estimation problems arising in Monte Carlo simulation. I discuss in depth two important applications in which Monte Carlo simulation is important: activity networks, and probabilistic sensitivity analysis. Initial focus on discrete random variables I present all major concepts first using discrete random variables. Later, I present the basics of continuous random variables, noting that all previously introduced properties go through for continuous random variables as long as one replaces pmf by pdf and summation by integral. Again, the goal is to get to applications as quickly as possible without spending time on derivations which are not really used when applying probability tools. I present a table of all major properties in both discrete and continuous forms to assist the student in seeing the parallels. Emphasis on thinking conditionally One of the most important skills a student can acquire in preparation for advanced and applied uses of probability is the ability to think conditionally. Examples of this manner of thinking include the following. Independence is defined in the conditional sense P(A B) = P(A) rather than the multiplicative sense P(A B) = P(A)P(B). The intuitive view of independence that finding out whether B occurred does not influence the probability of A is emphasized. Too often I have encountered students in advanced courses who know only that independence means you can multiply probabilities. Understanding the conditional definition is crucial to being able to know when to invoke independence assumptions when constructing a probability model. Example problems which in many texts are solved by counting (such as calculating the probability of 4-of-a-kind in poker) are solved instead by using sequential multiplication rules for conditional probability. 3

A advanced chapter of the text is devoted to conditioning, including the total probability and total expectation rules, conditional independence, conditioning using the expectation operator, the conditional variance formula, and conditional extensions of probability rules. Emphasis on examples The text includes many different examples. My method for choosing examples focuses first on finding useful or interesting real situations in which a probability model might be helpful, and only secondly on devising an example which fits the concept currently under discussion. Because real problems are hard, this can often result in examples which are challenging to the novice. My approach is to present solutions in simple concrete (rather than abstract general) ways, and in exercises to have students mimic or incrementally revise solution approaches to examples which would otherwise be too difficult. Examples and exercises which I present include: Birthday coincidences, airline overbooking, the Windows game Minesweeper, attacking and defending in the board game Risk, poker, landslide risk analysis, free-throw shooting, majority voting, baseball batting performance, source-sink network reliability, testing for AIDS, the number of victories by series or round-robin winners, the Illinois lottery, examinations with repeats, evacuating a city, arrivals at an automated entrance gate, single-period inventory models, facility location, electrical power system reliability, and hazardous material transport. I also include a section devoted entirely to activity networks, and another to probabilistic sensitivity analysis. Use of graphical tools Throughout the text, I consistently use the event tree to give the student visual insight into concepts such as independence, the total probability rule, Bayes rule, and the binomial and geometric random variables. I also present the material on Poisson splitting graphically using a hybrid transition diagram/ event tree format. I use an intuitive presentation of the influence diagram to schematically portray Bayes rule using arrow reversals and to give intuitive meaning to the notion of conditional independence. Overall, I try to be as graphically helpful as possible, presenting bar charts of probability mass functions, plots of cdf s and pdf s, and relative frequency graphs for Monte Carlo simulation whenever possible. Tools and examples from decision analysis Event trees and influence diagrams are prominent tools from the field of decision analysis. Other decision-analytic tools are also discussed in this text, including sensitivity analysis, tornado diagrams, probabilistic sensitivity analysis, decision trees, expected utility, and of course, Bayes rule. This text is not intended to be an introduction to decision analysis, but a good preparation for further study can be found here. Building a student s intuition and highlighting common misconceptions Throughout the text, but especially in the beginning sections, I make a special effort to build intuition about fundamental notions and warn of common misconceptions. Here are some instances of this practice: 4

Defining and using random variables improperly defining random variables confusion of algebraic variables and random variables the distribution of a random variable Conditional probability Independence an intuitive view of conditional probability conditional probability and temporal order conditional probability in a Venn diagram misconceptions concerning conditional probability the intuitive view of independence the distinction between disjoint events and independent events pairwise versus collective independence independence is a relationship between random variables substituting a conditioning value dependent random variables which are uncorrelated intuition on independent Bernoulli trials Expectation and probability operators doing algebra inside the probability operator and expectation operators expectation of a function is not the function of the expectation mean of a product is not necessarily the product of the means the mean of a quotient is not the quotient of the means the variance of a sum is not always the sum of the variances Continuous random variables density functions as probability per unit length impossible events versus events having probability zero relationship between the pdf and the cdf lack of memory property for the exponential Recurring summaries of the role of important topics in probability modeling The text begins with a preview and overview in the form of a simple flow diagram describing how probability models are used to predict behavior of real-world systems subject to uncertainty. As a review or preview of each basic topic, the probability modeling flow diagram is revisited, 5

and the place of that topic within the diagram is highlighted and described. The purpose is to allow students, after concentrating on the details, to re-focus on the big picture of probability modeling. p. 3, in the overview and preview p.263, after the student has been introduced to independence, conditional probabililty, probability distributions, and expectation Real-world system whose behavior is uncertain Assumptions Data Probability model of the system Logical deduction Properties of the model Calculation Estimation How likely is a particular important event? What will be the average behavior of the system? Applications to stochastic processes, statistics and simulation A few words about what this text is not. In the concluding chapters, the reader will find short introductions to further topics in which probability modeling plays an important role: Poisson processes, statistical inference, and Monte Carlo simulation. I do not view this text as adequate by itself for courses in these topics. The most important features of Poisson processes are treated, but no further topics in stochastic processes are approached. Only large sample confidence intervals and hypothesis testing are covered, to illustrate the use of the normal distribution and the central limit theorem in statistics. A chapter on Bayesian versus classical statistics is included to give the beginning reader an entry point into this important and timely subject. And only spreadsheet-based Monte Carlo simulation and analysis is treated. For individual courses on any of these topics, there are a variety of appropriate textbooks. Using this text in a course I have used this text for a one-quarter (10-week) introductory course in probability in the engeering school at Northwestern University. There is more material here than will fit into a single quarter or semester, and chapters need not be covered in strictly sequential order. Figure A summarizes the precedence relationships between the chapters, and highlights material that is typically included in a one-quarter introduction. The text contains 360 exercises. Exercise solutions are available on request from the author. 6

Random Variables, Events and Probabilities 1 Basic Concepts 2 Conditional Probability and Independence 3 The Mean 4 More on Conditional Probability* Discrete Random Variables Appx: Mathematical Requirements 5 Probability Mass Functions 6 Repeated Independent Trials 7 The Expectation Operator 9 More on Conditioning* 8 Variance and Covariance Continuous Random Variables 10 Basic Properties of Continuous Random Variables 11 Further Properties of Continuous Random Variables 12 Important Continuous Random Variables Discrete and Continuous Random Variables 15 Overview of Important Discrete and Continuous Random Variables 14 The Poisson Process* 13 Sums of Random Variables Further Topics 17 Applications to Monte Carlo Simulation* 16 Applications in Statistical Inference* 18 Classical Versus Bayesian Inference* Figure A: Precedence relationships between chapters in this textbook. Arrows indicate what material from prior chapters is used in a given chapter. Dotted arrows indicate that prior material is not heavily used. Chapters outlined in bold form typical topics in a one-quarter or one-semester course. Starred chapters denote optional or advanced topics. 7

The electronic version of this text This textbook has been used for fifteen years in the required undergraduate probability course in the Department of Industrial Engineering and Management Sciences at Northwestern University. The electronic version has been available as an option for spring and fall 2013. It is not a supplement, summary, or set of lecture notes it is the complete version of a 591-page text. It is viewable on your computer or electronic device, and can be accessed at a reasonable fee through the website scribd.com via the link Probability an Introduction With Applications. The text is hyperlinked both to and from its table of contents, and also contains crosslinks within its body. The author Gordon Hazen is a professor in the Department of Industrial Engineering and Management Sciences at Northwestern University. He has taught courses in probability, statistics, stochastic processes, and decision analysis for three decades. His research focuses on stochastic and decision analytic applications in healthcare, and he has has extensive editorial service in decision analysis, medical decision making, and operations research. Contents of the text Preface to the instructor i Random Variables, Events, and Probabilities 1 1 Basic Concepts 2 2 Conditional Probability and Independence 53 3 The Mean 78 4 More on Conditional Probability* 97 Discrete Random Variables 162 5 Probability Mass Functions 163 6 Repeated Independent Trials 194 7 The Expectation Operator 232 8 Variance and Covariance 264 9 More on Conditioning* 301 Continuous Random Variables 342 10 Basic Properties of Continuous Random Variables 343 11 Further Properties of Continuous Random Variables 370 12 Important Continuous Random Variables 423 Discrete and Continuous Random Variables 451 13 Sums of Random Variables 452 14 The Poisson Process* 487 15 Overview of Important Discrete and Continuous Random Variables 501 8

Further Topics 527 16 Applications in Statistical Inference* 528 17 Applications of Monte Carlo Simulation* 539 18 Classical Versus Bayesian Inference* 561 Appendix: Discrete Mathematics Requirements 576 Bibliography 585 Index 586 9