ACE Experimental Design: An Introduction

Similar documents
STA 225: Introductory Statistics (CT)

Probability and Statistics Curriculum Pacing Guide

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Lecture 15: Test Procedure in Engineering Design

Analysis of Enzyme Kinetic Data

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Universityy. The content of

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

Lecture 1: Machine Learning Basics

Lecture 2: Quantifiers and Approximation

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

On-the-Fly Customization of Automated Essay Scoring

Research Design & Analysis Made Easy! Brainstorming Worksheet

Algebra 2- Semester 2 Review

Visit us at:

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Probabilistic Latent Semantic Analysis

Mathematics Success Grade 7

Statewide Framework Document for:

Detailed course syllabus

Python Machine Learning

An Introduction to Simio for Beginners

Math 96: Intermediate Algebra in Context

Introduction to Simulation

Peer Influence on Academic Achievement: Mean, Variance, and Network Effects under School Choice

School of Innovative Technologies and Engineering

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

MGT/MGP/MGB 261: Investment Analysis

Characterizing Mathematical Digital Literacy: A Preliminary Investigation. Todd Abel Appalachian State University

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

Assignment 1: Predicting Amazon Review Ratings

Introduction to the Practice of Statistics

Practical Research. Planning and Design. Paul D. Leedy. Jeanne Ellis Ormrod. Upper Saddle River, New Jersey Columbus, Ohio

elearning OVERVIEW GFA Consulting Group GmbH 1

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

An Introduction to Simulation Optimization

Ricopili: Postimputation Module. WCPG Education Day Stephan Ripke / Raymond Walters Toronto, October 2015

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

Enhancing Students Understanding Statistics with TinkerPlots: Problem-Based Learning Approach

Why Did My Detector Do That?!

Stopping rules for sequential trials in high-dimensional data

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)

CSC200: Lecture 4. Allan Borodin

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Mathematics subject curriculum

Scientific Method Investigation of Plant Seed Germination

Radius STEM Readiness TM

Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses

Grade 6: Correlated to AGS Basic Math Skills

PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia

Individual Differences & Item Effects: How to test them, & how to test them well

Mandarin Lexical Tone Recognition: The Gating Paradigm

CS Machine Learning

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

Generating Test Cases From Use Cases

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

TABLE OF CONTENTS TABLE OF CONTENTS COVER PAGE HALAMAN PENGESAHAN PERNYATAAN NASKAH SOAL TUGAS AKHIR ACKNOWLEDGEMENT FOREWORD

On the Combined Behavior of Autonomous Resource Management Agents

Using focal point learning to improve human machine tacit coordination

Learning By Asking: How Children Ask Questions To Achieve Efficient Search

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Let s think about how to multiply and divide fractions by fractions!

Science Fair Project Handbook

ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

Truth Inference in Crowdsourcing: Is the Problem Solved?

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Application of Multimedia Technology in Vocabulary Learning for Engineering Students

Office Hours: Mon & Fri 10:00-12:00. Course Description

Reduce the Failure Rate of the Screwing Process with Six Sigma Approach

STAT 220 Midterm Exam, Friday, Feb. 24

While you are waiting... socrative.com, room number SIMLANG2016

How and Why Has Teacher Quality Changed in Australia?

Discriminative Learning of Beam-Search Heuristics for Planning

Science Olympiad Competition Model This! Event Guidelines

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Green Belt Curriculum (This workshop can also be conducted on-site, subject to price change and number of participants)

WHEN THERE IS A mismatch between the acoustic

Procedia - Social and Behavioral Sciences 237 ( 2017 )

How do adults reason about their opponent? Typologies of players in a turn-taking game

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Teachable Robots: Understanding Human Teaching Behavior to Build More Effective Robot Learners

Evolutive Neural Net Fuzzy Filtering: Basic Description

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

12- A whirlwind tour of statistics

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

A Program Evaluation of Connecticut Project Learning Tree Educator Workshops

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Thesis-Proposal Outline/Template

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Tun your everyday simulation activity into research

PROMOTING QUALITY AND EQUITY IN EDUCATION: THE IMPACT OF SCHOOL LEARNING ENVIRONMENT

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

A Comparison of Charter Schools and Traditional Public Schools in Idaho

STABILISATION AND PROCESS IMPROVEMENT IN NAB

Transcription:

ACE Experimental Design: An Introduction Alan G. Isaac American University 2015-07-03 Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 1 / 42

Simulations as Experiments A simulation study is an experiment that needs to be designed. [kelton-1999-dse] Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 2 / 42

Purposes of Simulations Find a useful simple model of a complex process. understanding: why results emerge; which factors are important; how specific factors influence the results; hypothesis testing prediction: anticipate real-world responses to controllable factors control/optimization: search for optimal configuration of controllable factors search for robust configurations of controllable factors In this course, we emphasize the role of simulation modeling in producing new understanding. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 3 / 42

Experimental Design in CSS Need for planning: avoid time-consuming inefficiencies by adequate design stopping criteria (terminal or steady state) choice of metrics (moments, quantiles,...) role of randomness (seeds for PRNGs; use of common random numbers (CRNs)) Design of Experiments (DoE): rules for the effective collection of responses from experimental designs originates from real world experimentation; we apply DoE techniques to experiments with artificial computer worlds Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 4 / 42

Experimental Design in CSS Design of Experiments (DoE) is not used as widely or effectively in the practice of simulation as it should be. Possible explanations: ignorance: DoE literature is unfamiliar to CSS researchers (different specialization). myopia: CSS researchers focus on building not analyzing their models. empahsis: DoE literature primarily addresses a different audience (real-world experimentation) and not the needs of simulation research. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 5 / 42

Terminology factor an input or parameter (quantitative or qualitative) that can take on more than one value decision factors (controllable) noise factors (exogenous) factor levels values of factors (usually numerically coded) scenario a parameter configuration: specifies a factor level for each factor replicate a scenario + a random number sequence simulation maps a replicate into outputs (e.g., produces an observation or an observation time series) meta-model simplified representation of the scenariooutput relationship (e.g., a regression model) Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 6 / 42

Experimental Design in CSS responses experimental design focal outcome variations in response to treatment variations 1 an experimental framework 2 predictions of responses based on one or more hypotheses (what are the questions?) 3 a detailed description of how the hypotheses will be experimentally tested. the treatments the data collection the approach to data analysis Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 7 / 42

Experimental Design in CSS (cont.) experimental framework a simulation model that permits systematic variation in one or more treatment factors. hypothesis a testable statement about the relationship between treatment factors and responses (simulation results) treatment a configuration of treatment factors treatment factor a model parameter whose values can be manipulated by the experimenter. experiment multiple simulation runs with specified variation in treatment factors Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 8 / 42

Hypothesis Testing 1 predict the effects of changes in the treatment factors 2 run the simulation to confirm or disconfirm those predictions. requires a specification of the: treatment factor variation in the experiment, simulation results that will be used to test the hypothesis, criteria determining whether or not these results reject the hypothesis. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 9 / 42

Basic Components of Experimental Design decide what responses to examine (i.e., what will count as results) choose one or more treatment factors (i.e., model parameters to vary systematically) specify what values of the treatment factor you will consider specify the fixed values of all other model parameters. specify your hypotheses about the effect of your treatment factor(s) on your responses specify the number of replicates for each scenario specify the random number generator and seed(s) for each run specify the stopping criterion for the simulation (e.g., the number of iterations for each replicate) Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 10 / 42

Basic Components of Results Reporting For each value of the treatment parameter, make a table showing the following: the treatment value, the random seed, and the final values of your focus variables at iteration (i.e., the values when the simulation stops) For each value of the treatment parameter, report descriptive statistics for your focus variables. (At the very least, report the mean, standard deviation, and histogram.) Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 11 / 42

Design Matrix design matrix: specifies all considered combinations of factor levels. design point: a row of the design matrix; a scenario E.g., if we have just two factors (A and B) with levels A1, A2, B1, B2, B3 we end up with six scenarios. A A1 A1 A1 A2 A2 A2 B B1 B2 B3 B1 B2 B3 Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 12 / 42

Problem of Dimensionality Rule: if k is a big number, then 2 k is a very big number. If we have k factors each with just 2 possible levels, that gives us 2 k possible combinations. full-factorial experiment considers all 2 k scenarios fractional-factorial experiment: restrict the scenarios considered by imposing constraints (e.g., C=A*B) http://www.itl.nist.gov/ div898/handbook/pri/section3/pri334.htm Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 13 / 42

Response Surface images/kelton-response01.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 14 / 42

Response Surface: Contours images/kelton-response02.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 15 / 42

Estimated Response Surface images/kelton-response03.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 16 / 42

Estimated Response Surface: Contours images/kelton-response04.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 17 / 42

Estimated Response Surface images/kelton-response05.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 18 / 42

Estimated Response Surface: Contours images/kelton-response06.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 19 / 42

Response of Response How sensitive is the response to the factors? Direct approach (finite differences): Indirect approach: Single-run methods: make a run with the factor of interest set to a value perturb the factor value and make another run compute slope (difference quotient) fit a meta-model consider the partial derivatives perturbation analysis move the factor during the run, track new trajectory as well as trajectory if the perturbation had not been made Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 20 / 42

Meta-Model response = f(factors) E.g., summarize with a regression. Comment: we may have many different responses in a simulation, with a different meta-model for each. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 21 / 42

Optimum Response Indirect: Staged Black box fit a meta-model optimize it using calculus fit a meta-model of low dimensionality optimize it using calculus optimum determines a search region for further simulations simulation is a black box function optimization run by any external tool (e.g., NetLogo s Mathematica link) Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 22 / 42

Optimization using External Services images/kelton-optimum.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 23 / 42

Example: Schelling Segregation Model The simulation model is specified by our code but should be given a supporting description. Model parameters include: the size of the world (default: 8 by 8), the number of agent types (default: 2) the number of agents of each type (or, the total number of agents and their ratio) the happiness cutoff parameter the relative size of the different agent classes; the initial location of the agents. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 24 / 42

Example: Schelling Segregation Model We can pick any parameter as a treatment factor. Illustrative hypothesis: increases in the happiness cutoff parameter increase the likelihood of a segregated final outcome. Turning the hypothesis into a prediction: given 25 agents each of two agent types and an initial random distribution of agents in the world, a rise in the happiness cutoff increases the final (T=1000) segregation measure. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 25 / 42

Seeding the PRNG Note that this experimental design is probabilistic. Therefore, for each variation in the model s treatment factors, you must run multiple simulations. Typically, each run will be based on a different programmer-specified setting of a random seed for the pseudo-random number generator (PRNG) used in the simulation. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 26 / 42

Seeding the PRNG Specification of the PRNG and seed renders the results deterministic (i.e., capable of exact replication). Therefore, you must record as data the random seed used for each run (along with the values of your treatment-factors) and all user-specified or default settings for maintained parameter values and simulation control options. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 27 / 42

Recording the Data For each run, record the degree of segregation displayed by the resulting agent location pattern. Report descriptive statistics that summarize these experimental segregation findings. Based on these descriptive statistics, draw conclusions regarding whether or not your hypothesis appears to be supported by your observations. These descriptive statistics would typically include (at a minimum) the sample mean value, the sample standard deviation, and possibly also a histogram, for the degree of segregation observed across runs. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 28 / 42

Descriptive Statistics (Very Basic Review) Suppose you have conducted N runs of an experiment using N different seed values for your pseudo-random number generator. Suppose you have recorded N observations X1, X2,..., XN regarding some quantifiable experimental outcome of interest X (e.g., the degree of segregation). The sample mean is the arithmetic average value of your N observations. X = 1 N N X i i=1 Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 29 / 42

Descriptive Statistics (basic review continued) Variance (σ 2 ): the average squared deviation from the mean. standard deviation: the square root of the variance. often good measures of how dispersed the data are: a low variance means that the data are more tightly clustered around the mean. we usually use the standard deviation, because it has the same units as the mean, which makes it easier to interpret. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 30 / 42

Descriptive Statistics (review continued) For example, given observations X1,...,XN: σ 2 X = 1 N = 1 N = 1 N N i=1 N i=1 N i=1 (X i X) 2 (X 2 i 2X i X + X 2 ) σ X = X 2 i X 2 σ 2 X Note: there are two common measures of variance (and thus of standard deviation). A common alternative to the measure we adopt divides by (N-1) instead of N. See http://en.wikipedia.org/wiki/standard_deviation for details. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 31 / 42

Graphics histogram a standard graphic for simulation outcomes. puts observations into bins based on their values plots the number (or relative frequency) of values in each bin. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 32 / 42

Histogram You might construct a frequency histogram for N observations as follows. 1 Bin your observations into bins B1... BN 2 For each possible bin, Bi, let #Bi denote the number of observations falling in that bin. 3 Produce the bin relative frequencies bi = #Bi/N 4 plot the points (i, bi) This is your relative frequency histogram. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 33 / 42

Histogram Example Suppose N = 1000 and you have two bins, for outcomes above (bin 1) or below (bin 2) some segregation measure cutoff. Suppose 300 observation fall into bin 1. Then your frequency histogram plots two points: (1,.30) and (2,.70). Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 34 / 42

Histogram A histogram can indicate a failure of the data to have a central tendency. (Pareto Distribution) This is important for our interpretation of our descriptive statistics. E.g., sample mean and sample standard deviation are most useful observations cluster around the mean value. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 35 / 42

Schelling Model: Response Surface images/schelling-response01.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 36 / 42

Schelling Model: Response Estimate After looking at the response surface, seek a meta-model: y = beta 0 + beta 1 x 1 + β 2 x 2 + beta 3 x 2 1 + β 4 x 2 2 y pct_same_type x1 population_density x2 pct_same_type_desired Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 37 / 42

Schelling Model: Responses (blue) v. Fitted Values (yellow) images/schelling-response02.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 38 / 42

Schelling Model: Response Estimate import scikits.statsmodels as sm y = pct_same_type x1 = population_density x2 = pct_same_type_desired dummy = x2 >= 80 X = np.column_stack((x1, x2, x1**2, x2**2, dummy)) X = sm.add_constant(x) results = sm.ols(pct_same_type, X).fit() print results.summary() yf = results.fittedvalues Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 39 / 42

Schelling Model: Response Estimate ========================================== coefficient prob. ------------------------------------------ x1-73.2431 2.138E-08 x2 1.1802 1.080E-13 x3 40.3979 0.001051 x4-0.0074 3.685E-05 x5-28.4147 1.077E-12 const 73.8049 4.204E-45 ========================================== Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 40 / 42

Schelling Model: Responses (blue) v. Fitted Values (yellow) images/schelling-response03.png Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 41 / 42

References [kelton-1999-dse] Kelton, W David. (1999) "Designing simulation experiments". In (Eds.) Proceedings of the 31st conference on Winter simulation: Simulation---a bridge to the future - Volume 1, Phoenix, Arizona, United States: ACM. [kleijnen-2005-ijoc] Kleijnen, Jack P C, Susan M Sanchez, and Thomas W Lucas and Thomas M Cioppa. 2005. A User s Guide to the Brave New World of Designing Simulation Experiments. INFORMS Journal on Computing 17, 263--289. Alan G. Isaac (American University) ACE Experimental Design: An Introduction 2015-07-03 42 / 42