February Statistics: Multiple Regression in R

Size: px
Start display at page:

Download "February Statistics: Multiple Regression in R"

Transcription

1 February 2016 Statistics: Multiple Regression in R

2 February 2016 How to Use This Course Book This course book accompanies the face-to-face session taught at IT Services. It contains a copy of the slideshow and the worksheets. Software Used We might use Excel to capture your data, but no other software is required. Since this is a Concepts course, we will concentrate on exploring ideas and underlying concepts that researchers will find helpful in undertaking data collection and interpretation. Revision Information Version Date Author Changes made 1.0 February 2016 John Fresen Course book version 1

3 February 2016 Copyright The copyright of this document lies with Oxford University IT Services.

4 February 2016 Contents 1 Introduction What You Should Already Know What You Will Learn Your Resources for These Exercises Help and Support Resources What Next? Statistics Courses IT Services Help Centre... 3

5 Statistics: Concepts TRMSZ 1 Introduction Welcome to the course Multiple in R. This course introduces the concept of regression using Sir Francis Galton s Parent- Child height data and then extends the concept to multiple regression using real examples. The course has an applied focus and makes minimum used of mathematics. No derivations of formulae are presented What You Should Already Know We assume that you are familiar with entering and editing text, rearranging and formatting text - drag and drop, copy and paste, printing and previewing, and managing files and folders. The computer network in IT Services may differ slightly from that which you are used to in your College or Department; if you are confused by the differences, ask for help from the teacher What You Will Learn In this course we will cover the following topics: What is regression Simple linear regression Influential observations Multiple regression Model selection Post selection inference Cross valadation Where to get help. From problem to data to conclusions Topics covered in related Statistics courses, should you be interested, are given in Section IT Services

6 Statistics: Concepts TRMSZ 2 Your Resources for These Exercises The exercises in this handbook will introduce you to some of the tasks you will need to carry out when working with WebLearn. Some sample files and documents are provided for you; if you are on a course held at IT Services, they will be on your network drive H:\ (Find it under My Computer). During a taught course at IT Services, there may not be time to complete all the exercises. You will need to be selective, and choose your own priorities among the variety of activities offered here. However, those exercises marked with a star * should not be skipped. Please complete the remaining exercises later in your own time, or book for a Computer8 session at IT Services for classroom assistance (See section 8.2) Help and Support Resources You can find support information for the exercises on this course and your future use of WebLearn, as follows: WebLearn Guidance (This should be your first port of call) If at any time you are not clear about any aspect of this course, please make sure you ask John for help. If you are away from the class, you can get help and advice by ing the central address weblearn@it.ox.ac.uk. The website for this course including reading material and other material can be found at You are welcome to contact John about statistical issues and questions at john.fresen@gmail.com 2 IT Services

7 Statistics: Concepts TRMSZ 3 What Next? 3.1. Statistics Courses Now that you have a grasp of some basic concepts in Statistics, you may want to develop your skills further. IT Services offers further Statistics courses and details are available at In particular, you might like to attend the course Statistics: Introduction: this is a four-session module which covers the basics of statistics and aims to provide a platform for learning more advanced tools and techniques. Courses on particular discipline areas or data analysis packages include: Statistics: Designing clinical research and biostatistics SPSS: An introduction SPSS: An introduction to using syntax STATA: An introduction to d ata access and management STATA: Data manipulation and analysis STATA: Statistical, survey and graphical analyses 3.2. IT Services Help Centre The IT Services Help Centre at 13 Banbury Road is open by appointment during working hours, and on a drop-in basis from 6:00 pm to 8:30 pm, Monday to Friday. The Help Centre is also a good place to get advice about any aspect of using computer software or hardware. You can contact the Help Centre on (2)73200 or by on help@it.ox.ac.uk 3 IT Services

8

9 Your safety is important Where is the fire exit? Beware of hazards: Tripping over bags and coats Please report any equipment faults to us Let us know if you have any other concerns 2

10 Your comfort is important The toilets are along the corridor outside the lecture rooms The rest area is where you registered; it has vending machines and a water cooler The seats at the computers are adjustable You can adjust the monitors for height, tilt and brightness Session 1 The concept of regression from Galton Thanks to: Dave Baker, IT Services Jill Fresen, IT Services Jim Hanley, McGill University Ian Sinclair, REES Group Oxford 4

11 Sir Francis Galton (16 February January 1911) Sir Francis Galton was an incredible polymath Cousin of Charles Darwin. General: Genetics What do we inherit form our ancestors? Particular: Do tall parents have tall children and short parents, short children? i.e. Does the height of children depend on the height of parents? Data: Famous 1885 study: 205 sets of parents 928 offspring mph = average height of parents; ch = child height Galton Peas Experiment: Selected 700 pea pods of selected sizes average diam of parent peas ; average diam of child peas

12 Francis Galton: Do tall parents have tall children, short parents short children? Does height of child depend on height of parents? Frequency scatterplot of Galton Data Midparent

13 Galton data: boxplots of conditional distributions of child-ht conditional on parent-ht histograms of the marginal distributions of child-ht and parent-ht Plot of child-parent data sunflower plot of data parent-ht parent-ht Plot of data child-ht jittered Plot of distributions child-ht given parent-ht parent-ht parent-ht 72.5

14 Regression is a plot/trace of the means of the conditional distributions trace of actual means regression of Child on Midparent trace of linear regression means assumes means lie on a straight line Midparent superimposing actual and linear regressions Midparent Midparent The trace of actual means has no assumptions in it but end distributions have a lot of sampling variation because of the small number of observations in those distributions Linear regression stabilises that Linear regression model Linear regression model fitted to data Midparent Midparent Linear regression model assumes: 1. Conditional distributions are normal 2. Conditional means lie on a straight line 3. Conditional distributions all have same spread In words: the distribution of child height, conditional on a given midparent height, is normal, with means lying on the straight line, and constant spread In mathematics: This model can be extended in many ways 72 74

15 The Linear Model : ways. Here are three - there are more: can be extended in many 1. Model the mean by a more general function such as a polynomial or trigonometric function or Fourier series or radial basis functions or some nonparametric function 2. Model the variance as a function : 3. Generalize from the Normal distribution to the Exponential Family that includes: normal, exponential, gamma, chi-squared, beta, Dirichlet, Bernoulli, categorical, Poisson, Wishart, Inverse Wishart and many others. But in all cases we are modelling the mean and other parameters of conditional distributions. These are called Generalized Linear Models. In R : fits a linear model fits a generalized linear model fits a generalized additive model Does the average diam of child peas depend on average diam of parent peas? What are the sketches telling us? Would a linear regression model be suitable? An important point about this example is that in regression, it is the slope of the regression line that is important, not the intercept.

16 Go to Lecture 2 A: Example 1 UWC Analysis Detecting Influential Observations using R Influential observations may suggest your model is incorrect data point has been miss-recorded

17 Detecting Influential Observations in R Case1: Outlying in y-space, not x-space. Use studentized residuals to detect these. Case 2: Outlying in x-space, not y-space. High leverage point. Use hatvalues to detect this. Case 3: Outlying in both x-space and y-space. High leverage point. Studentized residuals Hatvalues DFFITS 17 DFBETAS Outliers in Y- space: To detect outliers in the Y-space we compute the studentized residuals, defined as: This is just the standardizing transformation on the residuals. Thus, we expect 99% of studentized residuals to lie between -3 and 3. Values outside of this range suggest possible outliers. In R these are plotted on a graph using the statement: 18

18 Outliers in X-space: To detect outliers in the X-space we compute the hatvalues, defined as: hatvalues = diagonal elements of the hat matrix diag( X( XT X) 1 XT ) The i-th hatvalue measures the distance of case i x-values from the centroid of the x-values. In R these are plotted on a graph using the statement: Some of these will be small, some intermediate, and some large. These are assessed in a relative sense there is, in - 19 We want to avoid the situation in which one observation dominates the others. DFFITS (means change (DF) in FITted values) DFFITS measures the effect or influence of removing the i th observation on the predicted value of the i-th observation, divided by a standardizing quantity. DFFITS i Yi Yi (i ) constant This is a local measure of influence it is only concerned with what is happens at case I if case I is removed from the data. In R these are plotted on a graph using the statement: Some of these will be small, some intermediate, and some large. These are assessed in a relative sense there is, in point. We want to avoid the situation in which one observation dominates the others. 20

19 Cooks Distance measures the effect or influence of removing the i th observation on all predicted, divided by a standardizing quantity to normalize it. Di (Y j Y j (i ) ) 2 constant This is a global measure of influence it considers the effect of removing case I on all predicted values. In R these are plotted on a graph using the statement: Some of these will be small, some intermediate, and some large. These are assessed in a relative sense there is, in - 21 DFBETAS (means change (DF) in the BETAS) DFBETAS measures the effect or influence of removing the i th observation on the estimated regression coefficients, divided by a standardizing quantity. DFBk (i ) k k (i ) constant In R these are plotted on a graph using the statement: is a pxn matrix. The entry in row I and column k represents the effect of removing the i-th observation on the k-th regression coefficient. Some of these will be small, some intermediate, and some large. These are assessed in a relative sense there is, in -

20 Go to Lecture 2 B UWC Analysis outliers and influential observations Model Selection The Executive Salary Data has the variables: lsalary exper educat bonus numemp assets board age profits internat sales Question: How do we select which variables to include in a model for estimating the mean log(salary)? still a vibrant research topic in statistics with many contentious issues controversy: Among competing hypotheses, the one with the fewest assumptions should be selected. Choose the simplest model that gives and adequate description of the data

21 include Akaike information criterion (AIC) Bayesian information criterion (BIC) AIC = -2*log(likelihood(model)) + 2*no predictors BIC = -2*log(likelihood(model)) + 2*no predictors*log(no obs) The first term -2*log(likelihood(model)) gets smaller with more predictors but gets penalized by the increased no of predictors and the increased sample size (no obs). we choose model with the smallest AIC (BIC) Recent excellent articles are: Statistical model choice by Gerda Claeskens (2016) Valid post selection inference 2013 Berk Brown Buja Zhang and Zhao Annals (2015) Problem: If we have k predictors, we have 2^k possible models (without considering interactions and transformations such as log, etc) For the Exec Salary Data we have 10 predictors so there are 2^10 = 1024 models to consider without interactions or transformations. We consider the Stepwise Selection: Forward selection, which involves starting with no variables in the model, testing the addition of each variable using a chosen model comparison criterion, adding the variable (if any) that improves the model the most, and repeating this process until none improves the model. Backward elimination, which involves starting with all candidate variables, testing the deletion of each variable using a chosen model comparison criterion, deleting the variable (if any) that improves the model the most by being deleted, and repeating this process until no further improvement is possible. Bidirectional elimination, a combination of the above, testing at each step for variables to be included or excluded.

22 Inference after Model Selection The following articles provide a great discussion post-selection inference and make suggestions for how to proceed: Ernst Wit, Edwin van den Heuvel and Jan-Willem Romeijn Statistica Neerlandica, doi: /j x Richard Berk, Lawrence Brown, Andreas Buja, Kai Zhang, and Linda Zhao (2013) Valid post-selection inference. Ann. Statist. Volume 41, Number 2, Two major problems that arise in the process of model selection are: First, the distributions of the estimated regression parameters are no longer valid. (This means that the tests and confidence intervals normally calculated are no longer valid.) Second, we should see how well the model works on unseen data. This might conceptually be achieved by splitting the data into two subsets, and a This is called cross-validation. Cross-validation is important in guarding against testing hypotheses suggested by the data (called "Type III errors")

23 Leave-p-out cross-validation Leave-p-out cross-validation (LpO CV) involves using p observations as the validation set and the remaining observations as the training set. This is repeated on all ways to cut the original sample on a validation set of p observations and a training set. Very expensive computing wise Leave-one-out cross-validation Leave-one -out cross-validation (LOOCV) is a particular case of leave-pout cross-validation with p = 1. LOOCV doesn't have the computational problem of general LpO crossvalidation. k-fold cross-validation In k-fold cross-validation, the original sample is randomly partitioned into k equal sized subsamples. Of the k subsamples, a single subsample is retained as the validation data for testing The cross-validation process is then repeated k times (the folds), with each of the k subsamples used exactly once as the validation data. The k results from the folds can then be averaged (or otherwise combined) to produce a single estimation 2-fold cross-validation = special case

24 Go to lecture 5 Example 4 Executive salary data

25 Lecture 3: Example 2: Multiple Regression Executive Salary Data Four variables Problem and Data This data set was collected by a large Human Resources Placement company specializing in the placement of Chief Executive Officers (CEO's). The objective of the research is to develop a model that will predict the average salary of CEO's based on a number of factors or variables associated with the executive and the company. Because of its skewed nature salary was converted to log-salary and recorded as lsalary to bring it closer to normality. The purpose of the model is to help guide executives as to the distribution of salaries (mean and standard deviation) earned by executives with similar qualifications and backgrounds. The data is given in execsaldata.xlxs, execsaldata.txt, execsaldata.csv. For this exercise we will only consider the following predictors: experience (exper); number of employees (numemp) ; number of years of formal education (educat) ; company assets in millions of dollars (assets) Model: lsalary = β 0 + β 1 exper + β 2 educat + β 3 numemp + β 4 assets + noise where the noise follows a normal distribution with a mean of zero and constant variance The data were obtained by taking a random sample of 100 companies listed on the New York Stock Exchange during 2008 and is given in a tab delimited file execsaldata.csv. But an excel version is also given. Assignment: Perform an analysis that will lead to a regression model for predicting the average salary of executives based on their experience and background in terms of the factors considered above. You might consider, inter-alia, the following points in your analysis but do not be restricted to this list. 1. Read the data into R and attach it. 2. Summarize the univariate marginal distributions of experience, number of employees, number of years of formal education and the company assets. 3. Provide the matrix of these variables, using the pairs statement, compute the correlation coefficients and comment on the plots and correlation matrix. 4. Fit the regression model to the data. Construct and interpret the SUMMARY and ANOVA tables. 5. How accurate is the prediction equation? Assess graphically the accuracy of the model by plotting the observed salary against the fitted values, say salhat. 6. Perform a graphical analysis of the residuals to assess if the assumptions on the error terms are reasonable. 7. Assess the possibility of outliers and influential observations. 8. Can you make recommendations about estimating the average salary of executives from the predictor variables considered here? 9. Can you criticize the data or the model?

26 Recommended Steps You may copy and paste into R lsalary exper educat bonus numemp assets board age profits internat sales Step 1 Step 2 Step 3 Read and attach data. My data is stored in the directory Data of my memory stick. One can specify any directory. esd = read.csv("e:/data/execsaldata.csv",header=t,sep=",") attach(esd) head(esd) names(esd) Plot marginal distributions of the response variable and the four predictor variables on a 2X3 scatterplot matrix par(mfrow=c(2,3)) hist(lsalary,prob=t,col="gray");lines(density(lsalary));rug(lsalary) hist(exper,prob=t,col="gray");lines(density(exper));rug(exper) hist(educat,prob=t,col="gray");lines(density(educat));rug(educat) hist(numemp,prob=t,col="gray");lines(density(numemp));rug(numemp) hist(assets,prob=t,col="gray");lines(density(assets));rug(assets) # notice that these are on vastly different scales Pairs plot (matrix of scatterplots) newdata <- cbind(lsalary,exper,educat,numemp,assets) pairs(newdata) Step 4 Can combine steps 2 and 3 (See help for pairs) panel.hist <- function(x,...) { usr <- par("usr"); on.exit(par(usr)) par(usr = c(usr[1:2], 0, 1.5) ) h <- hist(x, plot = FALSE) breaks <- h$breaks; nb <- length(breaks) y <- h$counts; y <- y/max(y) rect(breaks[-nb], 0, breaks[-1], y, col = "cyan",...) } pairs(newdata, panel = panel.smooth, diag.panel = panel.hist, cex.labels = 1.0, font.labels = 2) Step 5 Compute the correlation coefficients: cor(newdata) # or, to have fewer decimal places round(cor(newdata),3) Step 6 Fit the linear model to the data and get summary and anova: fit1 = lm(lsalary~exper+educat+numemp+assets) summary(fit1) anova(fit1) Step 7 Compute predicted values and residuals: salhat = fitted.values(fit1) res = residuals(fit1)

27 Step 8 # check the fit and assumptions on noise terms i.e. # are they normal and independent of predictor and model? par(mfrow=c(2,4)) plot(salhat,lsalary,xlim=c(10.6,12.1),ylim=c(10.6,12.1), main="observed vs fitted values") abline(0,1) #superimposes an ideal straight line plot(res) # sequential plot of residuals qqnorm(res);qqline(res) #normal probability plot of residuals plot(exper,res) #residuals vs exper plot(educat,res) #residuals vs eduac plot(numemp,res) #residuals vs numemp plot(assets,res) #residuals vs assets plot(salhat,res) #residuals vs fitted values Step 9 # Detecting possible outlying and influential observations # Are there outliers in the y-space or x-space? # Are there influential observations as measured by # DFFITS,Cook s Distance or DFBETAS? par(mfrow=c(3,3)) plot(rstudent(fit1),type="h") plot(hatvalues(fit1),type="h") plot(dffits(fit1),type="h") plot(cooks.distance(fit1),type="h") plot(dfbetas(fit1)[,1],type="h") plot(dfbetas(fit1)[,2],type="h") plot(dfbetas(fit1)[,3],type="h") plot(dfbetas(fit1)[,4],type="h") plot(dfbetas(fit1)[,5],type="h") # We might wish to plot hatvalues against cooks.distance and even # consider other combinations of plots. # # An interesting 2X2 matrix of plots is provided by plot(fit1) par(mfrow=c(2,2)) plot(fit1) Step 10 What are our conclusions? First interpret the diagnostic plots for assumptions on residuals and then consider the possibility of influential observations and outliers. Do the assumptions on the noise terms seem reasonable? If the diagnostics are satisfactory we then look at the summary and ANOVA to assess and interpret the model: What is the mean function and what is the spread about the mean function? Is the model a reasonable approximation to the data? Can we criticize that data or the model and make suggestions for further analysis or research?

28

29 Example 1: UWC Analysis Mathematical model: result = β 0 + β 1 rating + noise assume noise follows a normal distribution with a mean of zero and variance σ 2 We could write this as result rating~normal(β 0 + β 1 rating, σ 2 ) The conditional distribution of result given rating is normal with a mean of β 0 + β 1 rating and variance of σ 2 Suggested steps: Step 1 # Read data into R, attach data, print first 6 lines uwc = read.table("e:/data/uwcdata.csv",header = T,sep=",") attach(uwc) head(uwc) Step 2 # Plot marginal distributions par(mfrow=c(1,2)) #1 row by 2 cols graphics window hist(rating,prob=t,col="gray");density(rating);rug(rating) hist(result,prob=t,col="gray");density(result);rug(result) Step 3 # fit the linear model (linear regression model) fit1 <- lm(result ~ rating) # fit1 is an object generated by the routine lm containing a lot # of information about the fitted model. # The rest of the steps are simply accessing information in fit1 Step 4 # obtain summary and anova of fit # compute fitted values and residuals summary(fit1) anova(fit1) yhat <- fitted.values(fit1) # fitted values res <- residuals(fit1) # residuals Step 5 # various common plots put into a 2X3 matrix of scatter-plots to # check the fit and assumptions on noise terms i.e. # are they normal and independent of predictor and model? # # plot 1: scatterplot of data and superimposed fitted model - # only do this plot when there is only a single predictor # plot 2: scatterplot of observed values vs fitted values to see # how close the fitted values are to the observed data # do this plot no matter how many predictors # plot 3: plot of residuals (random or pattern?) # plot 4: Q-Q plot of residuals (are they approx normal?) # plot 5: residuals vs predictor (random or pattern?) # if there are many predictors we plot residuals against # each predictor in turn # plot 6: residuals vs fitted (random or pattern?) 1

30 par(mfrow=c(2,3)) plot(rating,result,ylim=c(0,100), pch=19,cex=1.5, main="result vs Rating UWC data/n showing pass mark and fitted model") abline(fit1) #superimposes fitted straight line abline(h=48,lty=2) # superimposes dashed horizontal line at 48 plot(yhat,result,xlim=c(0,100),ylim=c(0,100), main="observed vs fitted values") abline(0,1) #superimposes an ideal straight line plot(res) qqnorm(res);qqline(res) #normal probability plot of residuals plot(rating,res) #residuals vs predictor plot(yhat,res) #residuals vs fitted values Step 6 # Detecting possible outlying and influential observations # Are there outliers in the y-space or x-space? # Are there influential observations as measured by # DFFITS,Cook s Distance or DFBETAS? par(mfrow=c(2,3)) plot(rstudent(fit1),type="h") plot(hatvalues(fit1),type="h") plot(dffits(fit1),type="h") plot(cooks.distance(fit1),type="h") plot(dfbetas(fit1)[,1],type="h") plot(dfbetas(fit1)[,2],type="h") Step 7 What are our conclusions? First interpret the diagnostic plots for assumptions on residuals and then consider the possibility of influential observations and outliers. If the diagnostics are satisfactory we then look at the summary and ANOVA to assess and interpret the model: What is the mean function and what is the spread about the mean function? Can we criticize that data or the model and make suggestions for further analysis or research? 2

31 Lecture 3: Example 2: Multiple Regression Executive Salary Data Four variables Problem and Data This data set was collected by a large Human Resources Placement company specializing in the placement of Chief Executive Officers (CEO's). The objective of the research is to develop a model that will predict the average salary of CEO's based on a number of factors or variables associated with the executive and the company. Because of its skewed nature salary was converted to log-salary and recorded as lsalary to bring it closer to normality. The purpose of the model is to help guide executives as to the distribution of salaries (mean and standard deviation) earned by executives with similar qualifications and backgrounds. The data is given in execsaldata.xlxs, execsaldata.txt, execsaldata.csv. For this exercise we will only consider the following predictors: experience (exper); number of employees (numemp) ; number of years of formal education (educat) ; company assets in millions of dollars (assets) Model: lsalary = β 0 + β 1 exper + β 2 educat + β 3 numemp + β 4 assets + noise where the noise follows a normal distribution with a mean of zero and constant variance The data were obtained by taking a random sample of 100 companies listed on the New York Stock Exchange during 2008 and is given in a tab delimited file execsaldata.csv. But an excel version is also given. Assignment: Perform an analysis that will lead to a regression model for predicting the average salary of executives based on their experience and background in terms of the factors considered above. You might consider, inter-alia, the following points in your analysis but do not be restricted to this list. 1. Read the data into R and attach it. 2. Summarize the univariate marginal distributions of experience, number of employees, number of years of formal education and the company assets. 3. Provide the matrix of these variables, using the pairs statement, compute the correlation coefficients and comment on the plots and correlation matrix. 4. Fit the regression model to the data. Construct and interpret the SUMMARY and ANOVA tables. 5. How accurate is the prediction equation? Assess graphically the accuracy of the model by plotting the observed salary against the fitted values, say salhat. 6. Perform a graphical analysis of the residuals to assess if the assumptions on the error terms are reasonable. 7. Assess the possibility of outliers and influential observations. 8. Can you make recommendations about estimating the average salary of executives from the predictor variables considered here? 9. Can you criticize the data or the model?

32 Recommended Steps You may copy and paste into R lsalary exper educat bonus numemp assets board age profits internat sales Step 1 Step 2 Step 3 Read and attach data. My data is stored in the directory Data of my memory stick. One can specify any directory. esd = read.csv("e:/data/execsaldata.csv",header=t,sep=",") attach(esd) head(esd) names(esd) Plot marginal distributions of the response variable and the four predictor variables on a 2X3 scatterplot matrix par(mfrow=c(2,3)) hist(lsalary,prob=t,col="gray");lines(density(lsalary));rug(lsalary) hist(exper,prob=t,col="gray");lines(density(exper));rug(exper) hist(educat,prob=t,col="gray");lines(density(educat));rug(educat) hist(numemp,prob=t,col="gray");lines(density(numemp));rug(numemp) hist(assets,prob=t,col="gray");lines(density(assets));rug(assets) # notice that these are on vastly different scales Pairs plot (matrix of scatterplots) newdata <- cbind(lsalary,exper,educat,numemp,assets) pairs(newdata) Step 4 Can combine steps 2 and 3 (See help for pairs) panel.hist <- function(x,...) { usr <- par("usr"); on.exit(par(usr)) par(usr = c(usr[1:2], 0, 1.5) ) h <- hist(x, plot = FALSE) breaks <- h$breaks; nb <- length(breaks) y <- h$counts; y <- y/max(y) rect(breaks[-nb], 0, breaks[-1], y, col = "cyan",...) } pairs(newdata, panel = panel.smooth, diag.panel = panel.hist, cex.labels = 1.0, font.labels = 2) Step 5 Compute the correlation coefficients: cor(newdata) # or, to have fewer decimal places round(cor(newdata),3) Step 6 Fit the linear model to the data and get summary and anova: fit1 = lm(lsalary~exper+educat+numemp+assets) summary(fit1) anova(fit1) Step 7 Compute predicted values and residuals: salhat = fitted.values(fit1) res = residuals(fit1)

33 Step 8 # check the fit and assumptions on noise terms i.e. # are they normal and independent of predictor and model? par(mfrow=c(2,4)) plot(salhat,lsalary,xlim=c(10.6,12.1),ylim=c(10.6,12.1), main="observed vs fitted values") abline(0,1) #superimposes an ideal straight line plot(res) # sequential plot of residuals qqnorm(res);qqline(res) #normal probability plot of residuals plot(exper,res) #residuals vs exper plot(educat,res) #residuals vs eduac plot(numemp,res) #residuals vs numemp plot(assets,res) #residuals vs assets plot(salhat,res) #residuals vs fitted values Step 9 # Detecting possible outlying and influential observations # Are there outliers in the y-space or x-space? # Are there influential observations as measured by # DFFITS,Cook s Distance or DFBETAS? par(mfrow=c(3,3)) plot(rstudent(fit1),type="h") plot(hatvalues(fit1),type="h") plot(dffits(fit1),type="h") plot(cooks.distance(fit1),type="h") plot(dfbetas(fit1)[,1],type="h") plot(dfbetas(fit1)[,2],type="h") plot(dfbetas(fit1)[,3],type="h") plot(dfbetas(fit1)[,4],type="h") plot(dfbetas(fit1)[,5],type="h") # We might wish to plot hatvalues against cooks.distance and even # consider other combinations of plots. # # An interesting 2X2 matrix of plots is provided by plot(fit1) par(mfrow=c(2,2)) plot(fit1) Step 10 What are our conclusions? First interpret the diagnostic plots for assumptions on residuals and then consider the possibility of influential observations and outliers. Do the assumptions on the noise terms seem reasonable? If the diagnostics are satisfactory we then look at the summary and ANOVA to assess and interpret the model: What is the mean function and what is the spread about the mean function? Is the model a reasonable approximation to the data? Can we criticize that data or the model and make suggestions for further analysis or research?

34

35 Lecture 4: Example 5: Model Selection Executive Salary Data All variables Stepwise Regression in R Recommended Steps You may copy and paste into R lsalary exper educat bonus numemp assets board age profits internat sales Step 1 Read and attach data. My data is stored in the directory Data of my memory stick. One can specify any directory. esd = read.csv("e:/data/execsaldata.csv",header=t,sep=",") attach(esd) head(esd) names(esd) Step 2 Step 3 Step 4 # Invoke the MASS library that contains the stepaic function library(mass) Pairs plot (matrix of scatterplots) newdata <- c(exper,educat,numemp,assets,age,profits,sales) Can combine steps 2 and 3 (See help for pairs) panel.hist <- function(x,...) { usr <- par("usr"); on.exit(par(usr)) par(usr = c(usr[1:2], 0, 1.5) ) h <- hist(x, plot = FALSE) breaks <- h$breaks; nb <- length(breaks) y <- h$counts; y <- y/max(y) rect(breaks[-nb], 0, breaks[-1], y, col = "cyan",...) } pairs(newdata, panel = panel.smooth, diag.panel = panel.hist, cex.labels = 1.0, font.labels = 2) Step 5 Compute the correlation coefficients: round(cor(newdata),3) Step 6 Perform stepwise regression fit1 <- lm(low ~.,data=esd) esd.step <- stepaic(fit1, direction = "backward" ) Step 6 Fit the linear model to the data and get summary and anova: fit2 <- lm(lsalary ~ exper+educat+bonus+numemp+assets+board+age+ profits+internat+sales) summary(fit2) anova(fit2)

36 Step 7 Step 7 Step 8 Fit reduced model take out non-significant terms fit3 <- lm(lsalary ~ exper+educat+bonus+numemp+assets) summary(fit3) anova(fit3) Compute predicted values and residuals: salhat = fitted.values(fit3) res = residuals(fit3) # check the fit and assumptions on noise terms i.e. # are they normal and independent of predictor and model? par(mfrow=c(3,3)) plot(salhat,lsalary,xlim=c(10.6,12.1),ylim=c(10.6,12.1), main="observed vs fitted values") abline(0,1) #superimposes an ideal straight line plot(res) # sequential plot of residuals qqnorm(res);qqline(res) #normal probability plot of residuals plot(exper,res) #residuals vs exper plot(educat,res) #residuals vs educat plot(bonus,res) #residuals vs educat plot(numemp,res) #residuals vs numemp plot(assets,res) #residuals vs assets plot(salhat,res) #residuals vs fitted values Step 9 # Detecting possible outlying and influential observations # Are there outliers in the y-space or x-space? # Are there influential observations as measured by # DFFITS,Cook s Distance or DFBETAS? par(mfrow=c(3,3)) plot(rstudent(fit3),type="h") plot(hatvalues(fit3),type="h") plot(dffits(fit3),type="h") plot(cooks.distance(fit3),type="h") plot(dfbetas(fit3)[,1],type="h") plot(dfbetas(fit3)[,2],type="h") plot(dfbetas(fit3)[,3],type="h") plot(dfbetas(fit3)[,4],type="h") plot(dfbetas(fit3)[,5],type="h") plot(dfbetas(fit3)[,6],type="h") Step 10 What are our conclusions? First interpret the diagnostic plots for assumptions on residuals and then consider the possibility of influential observations and outliers. Do the assumptions on the noise terms seem reasonable? If the diagnostics are satisfactory we then look at the summary and ANOVA to assess and interpret the model: What is the mean function and what is the spread about the mean function? Is the model a reasonable approximation to the data? Can we criticize that data or the model and make suggestions for further analysis or research?

37 Lecture 5: Inference after model selection The following articles provide a great discussion post-selection inference and make suggestions for how to proceed: Ernst Wit, Edwin van den Heuvel and Jan-Willem Romeijn (2012) All models are wrong... : an introduction to model uncertainty. Statistica Neerlandica, doi: /j x Richard Berk, Lawrence Brown, Andreas Buja, Kai Zhang, and Linda Zhao (2013) Valid postselection inference. Ann. Statist. Volume 41, Number 2, Two major problems that arise in the process of model selection are: First, the distributions of the estimated regression parameters are no longer valid. (This means that the tests and confidence intervals normally calculated are no longer valid.) Second, we should see how well the model works on unseen data. This might conceptually be achieved by splitting the data into two subsets, a training set a validation set This is called cross-validation. Cross-validation is important in guarding against testing hypotheses suggested by the data (called "Type III errors") Exhaustive cross-validation Exhaustive cross-validation methods are cross-validation methods which learn and test on all possible ways to divide the original sample into a training and a validation set. Leave-p-out cross-validation Leave-p-out cross-validation (LpO CV) involves using p observations as the validation set and the remaining observations as the training set. This is repeated on all ways to cut the original sample on a validation set of p observations and a training set. LpO cross-validation requires to learn and validate C_p^n times (where n is the number of observations in the original sample). So as soon as n is quite big it becomes impossible to calculate. Leave-one-out cross-validation Leave-one-out cross-validation (LOOCV) is a particular case of leave-p-out cross-validation with p = 1. LOOCV doesn't have the calculation problem of general LpO cross-validation because C_1^n=n. k-fold cross-validation In k-fold cross-validation, the original sample is randomly partitioned into k equal sized subsamples. Of the k subsamples, a single subsample is retained as the validation data for testing the model, and the remaining k 1 subsamples are used as training data. The cross-validation process is then repeated k times (the folds), with each of the k subsamples used exactly once as the validation data. The k results from the folds can then be averaged (or otherwise combined) to produce a single estimation

38 2-fold cross-validation This is the simplest variation of k-fold cross-validation. Also called holdout method.[8] For each fold, we randomly assign data points to two sets d0 and d1, so that both sets are equal size (this is usually implemented by shuffling the data array and then splitting it in two). We then train on d0 and test on d1, followed by training on d1 and testing on d0. Cross-validation only yields meaningful results if the validation set and training set are drawn from the same population and only if selection biases are controlled. Cross validation in R Example 5: Executive salary data esd = read.csv("e:/data/execsaldata.csv",header=t,sep=",") attach(esd) head(esd) fit3 <- lm(lsalary ~ exper+educat+bonus+numemp+assets) xmatrix <- as.matrix(cbind(exper,educat,bonus,numemp,assets)) library(cvtools) cvfit(fit3, lsalary ~ exper+educat+bonus+numemp+assets, data=esd, y=lsalary,x=xmatrix, cost = rmspe, K = 5, R=1,foldType = "consecutive")

39 Practical 1: Ht Wt UCLA Data The dataset UCLA ht wt sample.csv contains 250 records of human heights and weights. These were obtained by taking a random sample of 250 from the original sample of children in 1993 by a Growth Survey of children from birth to 18 years of age recruited from Maternal and Child Health Centres (MCHC) and schools and were used to develop Hong Kong's current growth charts for weight, height, weight-for-age, weight-for-height and body mass index (BMI). To reduce the size of the data for this exercise a random sample of 250 rows were generated and stored in UCLA ht wt sample.csv. Large data sets require a different treatment that we won t cover here. The columns contain the new index (new.index), the original index (index), the ht and wt. Use the UWC analysis to guide your R code. Step 1 # Read data into R, attach data, print first 6 lines d <- read.csv("e:/data/ucla ht wt sample.csv",header=t) Step 2 # Plot marginal distributions of height and weight # Provide the boxplot of the conditional distributions of weight # given height boxplot(wt~ht,range=0,varwidth=t,col= "gray", main= "Boxplots of the distributions\n of weight given height", xlab="height (in)", ylab="weight (lb)") Step 3 # fit the linear model of wt on ht Step 4 # obtain summary and anova of fit # compute fitted values and residuals Step 5 # various common plots put into a 2X3 matrix of scatter-plots to # check the fit and assumptions on noise terms i.e. # are they normal and independent of predictor and model? # # plot 1: scatterplot of data and superimposed fitted model - # only do this plot when there is only a single predictor # plot 2: scatterplot of observed values vs fitted values to see # how close the fitted values are to the observed data # do this plot no matter how many predictors # plot 3: plot of residuals (random or pattern?) # plot 4: Q-Q plot of residuals (are they approx normal?) # plot 5: residuals vs predictor (random or pattern?) # if there are many predictors we plot residuals against # each predictor in turn # plot 6: residuals vs fitted (random or pattern?) Step 6 # Detecting possible outlying and influential observations # Are there outliers in the y-space or x-space? # Are there influential observations as measured by # DFFITS,Cook s Distance or DFBETAS?

40 Step 7 What are our conclusions? First interpret the diagnostic plots for assumptions on residuals and then consider the possibility of influential observations and outliers. If the diagnostics are satisfactory we then look at the summary and ANOVA to assess and interpret the model: What is the mean function and what is the spread about the mean function? Can we criticize that data or the model and make suggestions for further analysis or research?

41 Practical 2: Salamander Problem and Data The data set for this assignment was obtained from Bill Peterman during the fall of 2005, then a postgraduate student in Ecology and Conservation at University of Missouri. See his webpage at The data given in the appendix were collected on 45 salamanders to ascertain the time to anesthetization (seconds) when submerged in different concentrations of Tricaine Methanesulfonate, or MS-222 for short. It is a fine white powder that easily dissolves in water. The salamanders were placed in a container with the solution and were completely submerged. The temperature of the water-anesthetic solution (MS- 222 was the anesthetic) was measured in degrees Celsius. The covariates considered were snout vent length (sl) measured in millimeters, total length (tl) measured in millimeters, mass measured in grams, ph of the solution, the temperature. The study was motivated because Bill needed to insert electronic tracking devices into the salamanders so that they could be easily tracked. However, he could find no guidelines about the concentration required for anesthetization for salamanders. The objective was to develop a model to predict the time required for anesthetization in terms of the concentration, the size of the salamander as measured by the mass. We will ignore the ph and temperature. Model Building Considerations It seems appropriate to exclude temperature and ph from the analysis because these were strongly correlated with the concentration. Further it seemed sensible to use mass rather than either snout length (sl) or total length (tl) in the analysis. (Because of their high correlation only one of these measurements would be included and mass is by far the more reliable and intuitively appealing measurement.) After considering the scatterplots of time to anesthetization (anes [measured in minutes]) against concentration (conc [mg/l]), it seemed that the analysis should be based on log transformations of both anesthetization time, concentration and mass. Thus, the complete model to be contemplated is Analysis: ln( anes ) 0 1 ln( conc) 2ln( mass) Perform an analysis that will lead to a regression model for predicting the average time to anesthetization in terms of concentration, mass and ph. Suggestions for the analysis step of the Salamander Data (approximate Steps): 1 Read data into R. 2 Analyse the marginal distributions of original/untransformed data and comment on these. (e.g. stem-and-leaf, summary, Q-Qplots, histograms, etc.) 3 Obtain scatterplot and correlation matrices of original data and comment on these. 4 Transform data to logs: log(anes), log(conc) and log(mass). 5 Check that the transformed data are approximately normal. (At this stage we are not so interested in the means and sd s but in the shape of their distributions are they approximately normal?.) 6 Repeat step 3 for the transformed data as a precursor to the model fitting. 7 Fit and assess the contemplated models: Model 1 ln( anes ) 0 1 ln( conc) 2ln( mass) 8 provide the ANOVA and summary table, perform the checks that the assumptions on the error terms are reasonable for that model and perform the usual diagnostic plots looking for outliers in the Y-space, the X-space, the DFFITS, Cooks Distance, and DFBETAS. 9 Interpretations 10 Criticisms and recommendations of experiment, data and the model 1

42 Picture of salamander species used in the anesthetization study: Pictures by Bill Peterman: 2

43 Practical 4: Birthweight data The data JRHbirthwt.csv for this exercise comes from a recent research project at the John Radcliff Hospital. The 17 variables are: Age PAPPA hcg NT trisomy Parity (categorical) BMI Smoking (categorical) Ethnicity (categorical) Conception Gestation in weeks (won t use this) Gestation in days Delivery (categorical) Centile Birthwt PET2 (categorical) G3M (categorical) The objective is to develop a model to predict birthweight from the other variables. I plotted all the marginal distributions and re-coded to eliminate sparse categories, and then converted the categorical variables to factors: Ethnicity.new <- 1*(Ethnicity==1)+2*(Ethnicity==2)+1*(Ethnicity>2) Ethnicity.new <- as.factor( Ethnicity.new) PET.new <-1*(PET2==1)+2*(PET2>1) PET.new <- as.factor(pet.new) G3M.new <- 1*(G3M==1)+2*(G3M>1) G3M.new <- as.factor(g3m.new) Smoking <- as.factor(smoking) Parity.new <- 0*(Parity==0)+1*(Parity==1)+1*(Parity==2) Parity.new <- as.factor(parity.new) Conception.new <- 1*(Conception==1)+2*(Conception>1) Conception.new <- as.factor(conception.new) Perform a stepwise regression on these variables. Fit the best resulting model Obtain the summary and anova Select your model Assess assumptions and influential observations What are your conclusions?

Probability and Statistics Curriculum Pacing Guide

Probability and Statistics Curriculum Pacing Guide Unit 1 Terms PS.SPMJ.3 PS.SPMJ.5 Plan and conduct a survey to answer a statistical question. Recognize how the plan addresses sampling technique, randomization, measurement of experimental error and methods

More information

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point. STT 231 Test 1 Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point. 1. A professor has kept records on grades that students have earned in his class. If he

More information

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview Algebra 1, Quarter 3, Unit 3.1 Line of Best Fit Overview Number of instructional days 6 (1 day assessment) (1 day = 45 minutes) Content to be learned Analyze scatter plots and construct the line of best

More information

Statewide Framework Document for:

Statewide Framework Document for: Statewide Framework Document for: 270301 Standards may be added to this document prior to submission, but may not be removed from the framework to meet state credit equivalency requirements. Performance

More information

STA 225: Introductory Statistics (CT)

STA 225: Introductory Statistics (CT) Marshall University College of Science Mathematics Department STA 225: Introductory Statistics (CT) Course catalog description A critical thinking course in applied statistical reasoning covering basic

More information

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2006 A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements Donna S. Kroos Virginia

More information

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS 1 CALIFORNIA CONTENT STANDARDS: Chapter 1 ALGEBRA AND WHOLE NUMBERS Algebra and Functions 1.4 Students use algebraic

More information

CS Machine Learning

CS Machine Learning CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing

More information

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE Pierre Foy TIMSS Advanced 2015 orks User Guide for the International Database Pierre Foy Contributors: Victoria A.S. Centurino, Kerry E. Cotter,

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4 Chapters 1-5 Cumulative Assessment AP Statistics Name: November 2008 Gillespie, Block 4 Part I: Multiple Choice This portion of the test will determine 60% of your overall test grade. Each question is

More information

Ryerson University Sociology SOC 483: Advanced Research and Statistics

Ryerson University Sociology SOC 483: Advanced Research and Statistics Ryerson University Sociology SOC 483: Advanced Research and Statistics Prerequisites: SOC 481 Instructor: Paul S. Moore E-mail: psmoore@ryerson.ca Office: Sociology Department Jorgenson JOR 306 Phone:

More information

School Size and the Quality of Teaching and Learning

School Size and the Quality of Teaching and Learning School Size and the Quality of Teaching and Learning An Analysis of Relationships between School Size and Assessments of Factors Related to the Quality of Teaching and Learning in Primary Schools Undertaken

More information

Python Machine Learning

Python Machine Learning Python Machine Learning Unlock deeper insights into machine learning with this vital guide to cuttingedge predictive analytics Sebastian Raschka [ PUBLISHING 1 open source I community experience distilled

More information

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Instructor: Mario D. Garrett, Ph.D.   Phone: Office: Hepner Hall (HH) 100 San Diego State University School of Social Work 610 COMPUTER APPLICATIONS FOR SOCIAL WORK PRACTICE Statistical Package for the Social Sciences Office: Hepner Hall (HH) 100 Instructor: Mario D. Garrett,

More information

The Good Judgment Project: A large scale test of different methods of combining expert predictions

The Good Judgment Project: A large scale test of different methods of combining expert predictions The Good Judgment Project: A large scale test of different methods of combining expert predictions Lyle Ungar, Barb Mellors, Jon Baron, Phil Tetlock, Jaime Ramos, Sam Swift The University of Pennsylvania

More information

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics

Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics 5/22/2012 Statistical Analysis of Climate Change, Renewable Energies, and Sustainability An Independent Investigation for Introduction to Statistics College of Menominee Nation & University of Wisconsin

More information

AP Statistics Summer Assignment 17-18

AP Statistics Summer Assignment 17-18 AP Statistics Summer Assignment 17-18 Welcome to AP Statistics. This course will be unlike any other math class you have ever taken before! Before taking this course you will need to be competent in basic

More information

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design. Name: Partner(s): Lab #1 The Scientific Method Due 6/25 Objective The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.

More information

On-the-Fly Customization of Automated Essay Scoring

On-the-Fly Customization of Automated Essay Scoring Research Report On-the-Fly Customization of Automated Essay Scoring Yigal Attali Research & Development December 2007 RR-07-42 On-the-Fly Customization of Automated Essay Scoring Yigal Attali ETS, Princeton,

More information

Research Design & Analysis Made Easy! Brainstorming Worksheet

Research Design & Analysis Made Easy! Brainstorming Worksheet Brainstorming Worksheet 1) Choose a Topic a) What are you passionate about? b) What are your library s strengths? c) What are your library s weaknesses? d) What is a hot topic in the field right now that

More information

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved. Exploratory Study on Factors that Impact / Influence Success and failure of Students in the Foundation Computer Studies Course at the National University of Samoa 1 2 Elisapeta Mauai, Edna Temese 1 Computing

More information

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016 EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016 Instructor: Dr. Katy Denson, Ph.D. Office Hours: Because I live in Albuquerque, New Mexico, I won t have office hours. But

More information

Grade 6: Correlated to AGS Basic Math Skills

Grade 6: Correlated to AGS Basic Math Skills Grade 6: Correlated to AGS Basic Math Skills Grade 6: Standard 1 Number Sense Students compare and order positive and negative integers, decimals, fractions, and mixed numbers. They find multiples and

More information

Assignment 1: Predicting Amazon Review Ratings

Assignment 1: Predicting Amazon Review Ratings Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for

More information

Analysis of Enzyme Kinetic Data

Analysis of Enzyme Kinetic Data Analysis of Enzyme Kinetic Data To Marilú Analysis of Enzyme Kinetic Data ATHEL CORNISH-BOWDEN Directeur de Recherche Émérite, Centre National de la Recherche Scientifique, Marseilles OXFORD UNIVERSITY

More information

Functional Skills Mathematics Level 2 assessment

Functional Skills Mathematics Level 2 assessment Functional Skills Mathematics Level 2 assessment www.cityandguilds.com September 2015 Version 1.0 Marking scheme ONLINE V2 Level 2 Sample Paper 4 Mark Represent Analyse Interpret Open Fixed S1Q1 3 3 0

More information

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Ch 2 Test Remediation Work Name MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) High temperatures in a certain

More information

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE Edexcel GCSE Statistics 1389 Paper 1H June 2007 Mark Scheme Edexcel GCSE Statistics 1389 NOTES ON MARKING PRINCIPLES 1 Types of mark M marks: method marks A marks: accuracy marks B marks: unconditional

More information

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

Visit us at:

Visit us at: White Paper Integrating Six Sigma and Software Testing Process for Removal of Wastage & Optimizing Resource Utilization 24 October 2013 With resources working for extended hours and in a pressurized environment,

More information

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade The third grade standards primarily address multiplication and division, which are covered in Math-U-See

More information

OFFICE SUPPORT SPECIALIST Technical Diploma

OFFICE SUPPORT SPECIALIST Technical Diploma OFFICE SUPPORT SPECIALIST Technical Diploma Program Code: 31-106-8 our graduates INDEMAND 2017/2018 mstc.edu administrative professional career pathway OFFICE SUPPORT SPECIALIST CUSTOMER RELATIONSHIP PROFESSIONAL

More information

Why Did My Detector Do That?!

Why Did My Detector Do That?! Why Did My Detector Do That?! Predicting Keystroke-Dynamics Error Rates Kevin Killourhy and Roy Maxion Dependable Systems Laboratory Computer Science Department Carnegie Mellon University 5000 Forbes Ave,

More information

Math 96: Intermediate Algebra in Context

Math 96: Intermediate Algebra in Context : Intermediate Algebra in Context Syllabus Spring Quarter 2016 Daily, 9:20 10:30am Instructor: Lauri Lindberg Office Hours@ tutoring: Tutoring Center (CAS-504) 8 9am & 1 2pm daily STEM (Math) Center (RAI-338)

More information

12- A whirlwind tour of statistics

12- A whirlwind tour of statistics CyLab HT 05-436 / 05-836 / 08-534 / 08-734 / 19-534 / 19-734 Usable Privacy and Security TP :// C DU February 22, 2016 y & Secu rivac rity P le ratory bo La Lujo Bauer, Nicolas Christin, and Abby Marsh

More information

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS Václav Kocian, Eva Volná, Michal Janošek, Martin Kotyrba University of Ostrava Department of Informatics and Computers Dvořákova 7,

More information

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab Instructor: Tim Biblarz Office: Hazel Stanley Hall (HSH) Room 210 Office hours: Mon, 5 6pm, F,

More information

PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia

PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT by James B. Chapman Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment

More information

Diagnostic Test. Middle School Mathematics

Diagnostic Test. Middle School Mathematics Diagnostic Test Middle School Mathematics Copyright 2010 XAMonline, Inc. All rights reserved. No part of the material protected by this copyright notice may be reproduced or utilized in any form or by

More information

Science Fair Project Handbook

Science Fair Project Handbook Science Fair Project Handbook IDENTIFY THE TESTABLE QUESTION OR PROBLEM: a) Begin by observing your surroundings, making inferences and asking testable questions. b) Look for problems in your life or surroundings

More information

Individual Differences & Item Effects: How to test them, & how to test them well

Individual Differences & Item Effects: How to test them, & how to test them well Individual Differences & Item Effects: How to test them, & how to test them well Individual Differences & Item Effects Properties of subjects Cognitive abilities (WM task scores, inhibition) Gender Age

More information

learning collegiate assessment]

learning collegiate assessment] [ collegiate learning assessment] INSTITUTIONAL REPORT 2005 2006 Kalamazoo College council for aid to education 215 lexington avenue floor 21 new york new york 10016-6023 p 212.217.0700 f 212.661.9766

More information

Measurement. When Smaller Is Better. Activity:

Measurement. When Smaller Is Better. Activity: Measurement Activity: TEKS: When Smaller Is Better (6.8) Measurement. The student solves application problems involving estimation and measurement of length, area, time, temperature, volume, weight, and

More information

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010) Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010) Jaxk Reeves, SCC Director Kim Love-Myers, SCC Associate Director Presented at UGA

More information

Comparison of network inference packages and methods for multiple networks inference

Comparison of network inference packages and methods for multiple networks inference Comparison of network inference packages and methods for multiple networks inference Nathalie Villa-Vialaneix http://www.nathalievilla.org nathalie.villa@univ-paris1.fr 1ères Rencontres R - BoRdeaux, 3

More information

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website Sociology 521: Social Statistics and Quantitative Methods I Spring 2012 Wed. 2 5, Kap 305 Computer Lab Instructor: Tim Biblarz Office hours (Kap 352): W, 5 6pm, F, 10 11, and by appointment (213) 740 3547;

More information

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410) JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD 21218. (410) 516 5728 wrightj@jhu.edu EDUCATION Harvard University 1993-1997. Ph.D., Economics (1997).

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne

School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne Web Appendix See paper for references to Appendix Appendix 1: Multiple Schools

More information

EDEXCEL FUNCTIONAL SKILLS PILOT TEACHER S NOTES. Maths Level 2. Chapter 4. Working with measures

EDEXCEL FUNCTIONAL SKILLS PILOT TEACHER S NOTES. Maths Level 2. Chapter 4. Working with measures EDEXCEL FUNCTIONAL SKILLS PILOT TEACHER S NOTES Maths Level 2 Chapter 4 Working with measures SECTION G 1 Time 2 Temperature 3 Length 4 Weight 5 Capacity 6 Conversion between metric units 7 Conversion

More information

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working

More information

American Journal of Business Education October 2009 Volume 2, Number 7

American Journal of Business Education October 2009 Volume 2, Number 7 Factors Affecting Students Grades In Principles Of Economics Orhan Kara, West Chester University, USA Fathollah Bagheri, University of North Dakota, USA Thomas Tolin, West Chester University, USA ABSTRACT

More information

Paper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER

Paper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER 259574_P2 5-7_KS3_Ma.qxd 1/4/04 4:14 PM Page 1 Ma KEY STAGE 3 TIER 5 7 2004 Mathematics test Paper 2 Calculator allowed Please read this page, but do not open your booklet until your teacher tells you

More information

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering Time and Place: MW 3:00-4:20pm, A126 Wells Hall Instructor: Dr. Marianne Huebner Office: A-432 Wells Hall

More information

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand

Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand Grade 2: Using a Number Line to Order and Compare Numbers Place Value Horizontal Content Strand Texas Essential Knowledge and Skills (TEKS): (2.1) Number, operation, and quantitative reasoning. The student

More information

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and

More information

Mathematics process categories

Mathematics process categories Mathematics process categories All of the UK curricula define multiple categories of mathematical proficiency that require students to be able to use and apply mathematics, beyond simple recall of facts

More information

Radius STEM Readiness TM

Radius STEM Readiness TM Curriculum Guide Radius STEM Readiness TM While today s teens are surrounded by technology, we face a stark and imminent shortage of graduates pursuing careers in Science, Technology, Engineering, and

More information

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210 1 State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210 Dr. Michelle Benson mbenson2@buffalo.edu Office: 513 Park Hall Office Hours: Mon & Fri 10:30-12:30

More information

Algebra 2- Semester 2 Review

Algebra 2- Semester 2 Review Name Block Date Algebra 2- Semester 2 Review Non-Calculator 5.4 1. Consider the function f x 1 x 2. a) Describe the transformation of the graph of y 1 x. b) Identify the asymptotes. c) What is the domain

More information

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes Centre No. Candidate No. Paper Reference 1 3 8 0 1 F Paper Reference(s) 1380/1F Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier Monday 6 June 2011 Afternoon Time: 1 hour

More information

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014 UNSW Australia Business School School of Risk and Actuarial Studies ACTL5103 Stochastic Modelling For Actuaries Course Outline Semester 2, 2014 Part A: Course-Specific Information Please consult Part B

More information

Software Maintenance

Software Maintenance 1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories

More information

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt Certification Singapore Institute Certified Six Sigma Professionals Certification Courses in Six Sigma Green Belt ly Licensed Course for Process Improvement/ Assurance Managers and Engineers Leading the

More information

Characteristics of Functions

Characteristics of Functions Characteristics of Functions Unit: 01 Lesson: 01 Suggested Duration: 10 days Lesson Synopsis Students will collect and organize data using various representations. They will identify the characteristics

More information

Physics 270: Experimental Physics

Physics 270: Experimental Physics 2017 edition Lab Manual Physics 270 3 Physics 270: Experimental Physics Lecture: Lab: Instructor: Office: Email: Tuesdays, 2 3:50 PM Thursdays, 2 4:50 PM Dr. Uttam Manna 313C Moulton Hall umanna@ilstu.edu

More information

Discovering Statistics

Discovering Statistics School of Psychology Module Handbook 2015/2016 Discovering Statistics Module Convenor: Professor Andy Field NOTE: Most of the questions you need answers to about this module are in this document. Please

More information

Ohio s Learning Standards-Clear Learning Targets

Ohio s Learning Standards-Clear Learning Targets Ohio s Learning Standards-Clear Learning Targets Math Grade 1 Use addition and subtraction within 20 to solve word problems involving situations of 1.OA.1 adding to, taking from, putting together, taking

More information

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District Report Submitted June 20, 2012, to Willis D. Hawley, Ph.D., Special

More information

Minitab Tutorial (Version 17+)

Minitab Tutorial (Version 17+) Minitab Tutorial (Version 17+) Basic Commands and Data Entry Graphical Tools Descriptive Statistics Outline Minitab Basics Basic Commands, Data Entry, and Organization Minitab Project Files (*.MPJ) vs.

More information

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y Department of Mathematics, Statistics and Science College of Arts and Sciences Qatar University S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y A m e e n A l a

More information

Informal Comparative Inference: What is it? Hand Dominance and Throwing Accuracy

Informal Comparative Inference: What is it? Hand Dominance and Throwing Accuracy Informal Comparative Inference: What is it? Hand Dominance and Throwing Accuracy Logistics: This activity addresses mathematics content standards for seventh-grade, but can be adapted for use in sixth-grade

More information

Honors Mathematics. Introduction and Definition of Honors Mathematics

Honors Mathematics. Introduction and Definition of Honors Mathematics Honors Mathematics Introduction and Definition of Honors Mathematics Honors Mathematics courses are intended to be more challenging than standard courses and provide multiple opportunities for students

More information

Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate

Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate NESA Conference 2007 Presenter: Barbara Dent Educational Technology Training Specialist Thomas Jefferson High School for Science

More information

InCAS. Interactive Computerised Assessment. System

InCAS. Interactive Computerised Assessment. System Interactive Computerised Assessment Administered by: System 015 Carefully follow the instructions in this manual to make sure your assessment process runs smoothly! InCAS Page 1 2015 InCAS Manual If there

More information

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and in other settings. He may also make use of tests in

More information

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers.

Focus of the Unit: Much of this unit focuses on extending previous skills of multiplication and division to multi-digit whole numbers. Approximate Time Frame: 3-4 weeks Connections to Previous Learning: In fourth grade, students fluently multiply (4-digit by 1-digit, 2-digit by 2-digit) and divide (4-digit by 1-digit) using strategies

More information

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics 2017-2018 GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics Entrance requirements, program descriptions, degree requirements and other program policies for Biostatistics Master s Programs

More information

Math 098 Intermediate Algebra Spring 2018

Math 098 Intermediate Algebra Spring 2018 Math 098 Intermediate Algebra Spring 2018 Dept. of Mathematics Instructor's Name: Office Location: Office Hours: Office Phone: E-mail: MyMathLab Course ID: Course Description This course expands on the

More information

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Dublin City Schools Mathematics Graded Course of Study GRADE 4 I. Content Standard: Number, Number Sense and Operations Standard Students demonstrate number sense, including an understanding of number systems and reasonable estimates using paper and pencil, technology-supported

More information

Foothill College Summer 2016

Foothill College Summer 2016 Foothill College Summer 2016 Intermediate Algebra Math 105.04W CRN# 10135 5.0 units Instructor: Yvette Butterworth Text: None; Beoga.net material used Hours: Online Except Final Thurs, 8/4 3:30pm Phone:

More information

Simple Random Sample (SRS) & Voluntary Response Sample: Examples: A Voluntary Response Sample: Examples: Systematic Sample Best Used When

Simple Random Sample (SRS) & Voluntary Response Sample: Examples: A Voluntary Response Sample: Examples: Systematic Sample Best Used When Simple Random Sample (SRS) & Voluntary Response Sample: In statistics, a simple random sample is a group of people who have been chosen at random from the general population. A simple random sample is

More information

CENTRAL MAINE COMMUNITY COLLEGE Introduction to Computer Applications BCA ; FALL 2011

CENTRAL MAINE COMMUNITY COLLEGE Introduction to Computer Applications BCA ; FALL 2011 CENTRAL MAINE COMMUNITY COLLEGE Introduction to Computer Applications BCA 120-03; FALL 2011 Instructor: Mrs. Linda Cameron Cell Phone: 207-446-5232 E-Mail: LCAMERON@CMCC.EDU Course Description This is

More information

Houghton Mifflin Online Assessment System Walkthrough Guide

Houghton Mifflin Online Assessment System Walkthrough Guide Houghton Mifflin Online Assessment System Walkthrough Guide Page 1 Copyright 2007 by Houghton Mifflin Company. All Rights Reserved. No part of this document may be reproduced or transmitted in any form

More information

Teaching a Laboratory Section

Teaching a Laboratory Section Chapter 3 Teaching a Laboratory Section Page I. Cooperative Problem Solving Labs in Operation 57 II. Grading the Labs 75 III. Overview of Teaching a Lab Session 79 IV. Outline for Teaching a Lab Session

More information

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only. Calculus AB Priority Keys Aligned with Nevada Standards MA I MI L S MA represents a Major content area. Any concept labeled MA is something of central importance to the entire class/curriculum; it is a

More information

Answers To Hawkes Learning Systems Intermediate Algebra

Answers To Hawkes Learning Systems Intermediate Algebra Answers To Hawkes Learning Free PDF ebook Download: Answers To Download or Read Online ebook answers to hawkes learning systems intermediate algebra in PDF Format From The Best User Guide Database Double

More information

Montana's Distance Learning Policy for Adult Basic and Literacy Education

Montana's Distance Learning Policy for Adult Basic and Literacy Education Montana's Distance Learning Policy for Adult Basic and Literacy Education 2013-2014 1 Table of Contents I. Introduction Page 3 A. The Need B. Going to Scale II. Definitions and Requirements... Page 4-5

More information

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and

A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and A Decision Tree Analysis of the Transfer Student Emma Gunu, MS Research Analyst Robert M Roe, PhD Executive Director of Institutional Research and Planning Overview Motivation for Analyses Analyses and

More information

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

CHAPTER 4: REIMBURSEMENT STRATEGIES 24 CHAPTER 4: REIMBURSEMENT STRATEGIES 24 INTRODUCTION Once state level policymakers have decided to implement and pay for CSR, one issue they face is simply how to calculate the reimbursements to districts

More information

Student s Edition. Grade 6 Unit 6. Statistics. Eureka Math. Eureka Math

Student s Edition. Grade 6 Unit 6. Statistics. Eureka Math. Eureka Math Student s Edition Grade 6 Unit 6 Statistics Eureka Math Eureka Math Lesson 1 Lesson 1: Posing Statistical Questions Statistics is about using data to answer questions. In this module, the following four

More information

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention

A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention A Simple VQA Model with a Few Tricks and Image Features from Bottom-up Attention Damien Teney 1, Peter Anderson 2*, David Golub 4*, Po-Sen Huang 3, Lei Zhang 3, Xiaodong He 3, Anton van den Hengel 1 1

More information

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 (click on Math My Way tab) Math My Way Instructors:

Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50  (click on Math My Way tab) Math My Way Instructors: This is a team taught directed study course. Foothill College Fall 2014 Math My Way Math 230/235 MTWThF 10:00-11:50 www.psme.foothill.edu (click on Math My Way tab) Math My Way Instructors: Instructor:

More information

UNIT ONE Tools of Algebra

UNIT ONE Tools of Algebra UNIT ONE Tools of Algebra Subject: Algebra 1 Grade: 9 th 10 th Standards and Benchmarks: 1 a, b,e; 3 a, b; 4 a, b; Overview My Lessons are following the first unit from Prentice Hall Algebra 1 1. Students

More information

Exploring Derivative Functions using HP Prime

Exploring Derivative Functions using HP Prime Exploring Derivative Functions using HP Prime Betty Voon Wan Niu betty@uniten.edu.my College of Engineering Universiti Tenaga Nasional Malaysia Wong Ling Shing Faculty of Health and Life Sciences, INTI

More information

Introducing the New Iowa Assessments Mathematics Levels 12 14

Introducing the New Iowa Assessments Mathematics Levels 12 14 Introducing the New Iowa Assessments Mathematics Levels 12 14 ITP Assessment Tools Math Interim Assessments: Grades 3 8 Administered online Constructed Response Supplements Reading, Language Arts, Mathematics

More information

School of Innovative Technologies and Engineering

School of Innovative Technologies and Engineering School of Innovative Technologies and Engineering Department of Applied Mathematical Sciences Proficiency Course in MATLAB COURSE DOCUMENT VERSION 1.0 PCMv1.0 July 2012 University of Technology, Mauritius

More information

Theory of Probability

Theory of Probability Theory of Probability Class code MATH-UA 9233-001 Instructor Details Prof. David Larman Room 806,25 Gordon Street (UCL Mathematics Department). Class Details Fall 2013 Thursdays 1:30-4-30 Location to be

More information