Standard Setting in a Small Scale OSCE: A Comparison of the Modified Borderline-Group Method and the Borderline Regression Method
|
|
- Horatio Davis
- 6 years ago
- Views:
Transcription
1 Advances in Health Sciences Education (2006) 11: Ó Springer 2006 DOI /s Standard Setting in a Small Scale OSCE: A Comparison of the Modified Borderline-Group Method and the Borderline Regression Method TIMOTHY J. WOOD 1, *, SUSAN M. HUMPHREY-MURTO 2 and GEOFFREY R. NORMAN 3 1 Medical Council of Canada and Faculty of Medicine, University of Ottawa K1G-3H7 ON, Ottawa, Canada; 2 Faculty of Medicine, University of Ottawa ; 3 Faculty of Medicine McMaster University (*author for correspondence, Phone: (613) ; Fax: (613) ; twood@mcc.ca) (Received 14 May 2004; accepted 24 May 2005) Abstract. When setting standards, administrators of small-scale OSCEs often face several challenges, including a lack of resources, a lack of available expertise in statistics, and difficulty in recruiting judges. The Modified Borderline-Group Method is a standard setting procedure that compensates for these challenges by using physician examiners and is easy to use making it a good choice for small scale OSCEs. Unfortunately, the use of this approach may introduce a new challenge. Because a small scale OSCE has a small number of examinees, there may be few examinees in the borderline range, which could introduce an unintentional bias. A standard setting method called The Borderline Regression Method will be described. This standard setting method is similar to the Modified Borderline-Group Method but incorporates a linear regression approach allowing the cut score to be set using the scores from all examinees and not from a subset. The current study uses confidence intervals to analyze the precision of cut scores derived from both approaches when applied to a small scale OSCE. Key words: standard setting, OSCE Although a large number of methods for setting standards on performance examinations exist (Cusimano, 1996), there is no gold standard. Several studies confirm that various methods produce different cut scores and examination administrators must chose a defensible yet feasible method for their examination. Ideally, the method chosen would produce the most accurate result, but in small-scale university-based OSCEs, there are several additional constraints Administrators may only have limited access to experts in psychometrics or statistics or have few resources for data entry and analysis making complex statistical analyses required by some standard
2 116 TIMOTHY J. WOOD ET AL. setting methods difficult to perform. In addition, finding clinicians who are able to devote time to the extensive standard setting procedures required by some standard setting methods is increasingly difficult. For the last six years, the University of Ottawa Medical School has used the Modified Borderline-Group Method to set the cut score in a 2nd year student OSCE, a clerkship OSCE, and an Internal Medicine Resident OSCE (Humphrey-Murto and MacFadyen, 2002; MacFadyen, 1996). This standard setting method has also been used by the University of Otago, New Zealand (Wilkinson et al., 2001) and by the Medical Council of Canada (MCC) for the MCC Qualifying Examination Part II (Dauphinee et al., 1997; Smee and Blackmore, 2001). With the Modified Borderline-Group Method, a physician examiner evaluates a examinee s performance at a station by completing a stationspecific checklist and then a rating on a global rating scale. The number of points on the scale can vary as long as there is a cohort of examinees labeled as borderline. The MCC and the University of Ottawa use six point scales with adjective descriptors corresponding to inferior, poor, borderline unsatisfactory, borderline satisfactory, good, and excellent. To determine a cut score for a station, the mean checklist score for the cohort of examinees rated as borderline is calculated and then applied to all examinees. By averaging the checklist scores of the Borderline Satisfactory and Borderline Unsatisfactory groups, it is assumed that this corresponds to a examinee exactly at the pass/fail cut point between the two categories. The sum of the station cut scores becomes the cut score for the overall exam. For the small-scale OSCE administrator, there are several benefits to using the Modified Borderline-Group Method as a standard setting method, it does not require any complex statistical procedures and the cut point is easy to calculate. It is also based on actual examinee performance rather than a review of checklist items, so it appears to have higher face validity than other methods. Finally, it is an efficient use of the clinicians time since the evaluation occurs at the time of the exam. Although the Modified Borderline-Group Method meets the needs of small-scale OSCE administrators, there may be some potential problems. For example, because the Modified Borderline-Group Method only uses the checklist scores from those examinees rated as borderline, it does not use all of the data This may not be a problem for a large scale OSCE like the MCCQE Part II, but for a small scale OSCE, there is the risk that a cut score could be based on a relatively small number of examinees, which would result in an increased amount of statistical error associated with the cut score. Another potential problem is related to the calculation of the mean of the checklist scores. For any given station, the cut score will always be toward the extreme of the distribution of checklist scores no
3 STANDARD SETTING IN A SMALL SCALE OSCE 117 matter how the borderline group is chosen (i.e., as an average over two categories as in the present example or as a single category such as borderline). The reason is that typically there will be more candidates with scores at the high end of the category than at the low end (except of course for statistical fluctuations). Consequently, any average of scores within the group will always result in a computed mean checklist score corresponding to an individual who is higher than the middle of the category. So the mean score computing by averaging over the borderline category will result in a consistent bias upwards introduced into the computed cut point score. Recent research has described a standard setting procedure called a Borderline Regression Method that uses a linear regression approach to set cut scores (Kramer et al., 2003; see also MacFadyen, 1996 and Woehr et al., 1991) This method is very similar to the Modified Borderline-Group Method but rather than selecting out a cohort of borderline examinees and calculating their mean checklist score, this method regresses all of the examinees checklist scores onto their global ratings to produce a linear equation. By inserting the midpoint of the global rating scale corresponding to the borderline group(s) (e.g., 3.5 on the current six-point scale) into the equation, a corresponding predicted checklist score can be determined. This predicted score becomes the cut score for the station. The advantage of the Borderline Regression Method is that it uses all of the examinee data for setting the pass mark and not just the scores from examinees rated as borderline. In addition, this approach will be less susceptible to variation due to unequal weighting of examinees in the borderline groups. The borderline regression approach has also been compared to a standard set using two different Angoff procedures and was found to have less variance associated with it than either of two Angoff procedures (Kramer et al., 2003). Because this regression approach to standard setting is similar to the Modified Borderline-Group Method it also has all of the same advantages including using actual examinee performances, having better face validity, and being an efficient use of the clinicians time. The Borderline Regression Method is a bit more complicated in that an OSCE administrator needs to be able to run a linear regression but this can be done easily in common statistical and spreadsheet programs. The question remains therefore as to whether the use of a regression approach actually leads to a more accurate decision compared to the Modified Borderline-Group Method. The purpose of this study is to compare the accuracy of the two standard setting methods. A 95% confidence interval around the cut scores for each station will be used to determine the accuracy of the pass/fail decision. This study will add to the existing literature because these methods of standard setting have not been compared for accuracy and feasibility with the small scale OSCE administrator in mind.
4 118 TIMOTHY J. WOOD ET AL. Methods Design A 10 station OSCE using physician examiners and standardized patients was administered to 59 clinical clerks at the University of Ottawa in Eight stations involved patient encounters and two stations used written questions. Only the patient encounter stations were used for this analysis. Analysis Descriptive statistics including the number of examinees, station cut scores, and pass rates for each standard setting method were calculated. To determine the accuracy of the station cut scores, 95% confidence intervals were calculated. For the Modified Borderline-Group Method, the confidence interval was calculated using formulas from any introductory statistics book (196*standard error). For cut scores derived from the Borderline Regression Method, a linear regression in which the checklist scores were regressed onto the ratings was first conducted for each station. The confidence interval for the resulting regression lines was calculated as follows (Kleinbaum et al., 1988): sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 S ¼ S YjX Y^ XO n þ ðx O þ XÞ 2 ðn 1ÞS 2 X where S is the standard error of the regression line, S YjX is the standard Y^ XO error of the estimate, n is the number of examinees, X O is the cut score, X is the mean of the examinees scores, and S 2 X is the variance associated with the examinees scores. After generating the standard error of the regression line, the confidence intervals are calculated by multiplying the standard error of the regression line by the t value at p = 0.05 and d.f. = n)2. Because the Borderline Regression approach is a relatively novel method of setting a standard, it was decided that some diagnostic tests should be conducted to determine if a linear regression was justified. First, residuals for each station were also analyzed to ensure that there were no outliers. Second, a lack of fit test was conducted for each station to determine if a linear regression was appropriate (Dixon and Massey, 1969) This latter analysis tests whether the means for the groups (as defined by the 6 points on the rating scale) are located in a straight line. For each station, the Sum of Squares(residual) term, available from the regression analysis, is determined. A second Sum of Squares(pure error) term was computed from a one way ANOVA using the values on the rating scale as a grouping factor. This latter term estimates the random error within each rating category. A Sum of Squares(lack of fit) term was then computed by subtracting the Sum of Squares(residual) term from the Sum of
5 STANDARD SETTING IN A SMALL SCALE OSCE 119 Squares(pure error) term. A Mean Square(lack of fit) term was then created by dividing the Sum of Squares(lack of fit) by k)2 (where k is the number of points on rating scale). The F ratio (F(k)2, n)k)) for the lack of fit test is calculated by dividing the Mean Square(lack of fit) by the Mean Square error term available from the one way ANOVA. Results An analysis of the residuals for each station indicated that scores from three examinees were outliers and therefore these scores were removed from three stations. As shown in Table I, the lack of fit test for each station revealed non-significant results. This indicates that a linear relationship existed between the checklist scores and the global ratings for each station and therefore the linear analyses were justified. Table II displays the number of examinees, mean score, pass rate, and 95% confidence interval for each station as a function of standard setting method. Comparing across standard setting methods, the cut score derived using the Borderline Regression Method was, on average, 0.14 points lower (M = 5.14 vs. M = 5.28, respectively), and was lower than the cut score derived using the Modified Borderline-Group Method on six of the eight stations. The pass rate for the regression approach was, on average, 4% higher (M = 71% vs. M = 67%, Table I. Values used to calculate a lack of fit test for each station Station N SSres SS(err) SS(lof) d.f.(res) d.f.(err) d.f.(lof) MS(lof) MS(err) F Sig n = number of examinees. SSres = SS residual from the regression analysis. SS(err) = SS residual from an ANOVA with scale as grouping factor. SS(lof) = SS(res) ) SS(err). d.f.(res) = n)2. d.f.(error) = n ) k (k = number of items on scale). d.f.(lof) = d.f.(res) ) d.f.(err) = k)2. MS(lof) = SS(lof)/d.f.(lof). MS(err) = error term from an ANOVA with scale as grouping factor. F = MS(lof)/MS(err).
6 120 TIMOTHY J. WOOD ET AL. Table II. Number of examinees, cut score, pass rate and 95% confidence interval for each standard setting method Station Modified Borderline-group method Regression method N Cut score Pass rate (%) Confidence interval N Cut score Pass rate (%) Confidence interval ± ± ± ± ± ± ± ± ± ± ± ± ± ± ± ±0.29 overall ± ±0.39 Checklist scores range from 0 to 10. The number of examinees for the Modified Borderline-Group Method correspond to those examinees rated as borderline whereas the number of examinees for the Regression Method correspond to all examinees. respectively), and the two approaches differed on five of the eight stations. More importantly, the 95% confidence intervals were smaller for the cut scores derived using the regression approach than the Modified Borderline- Group Method, by an average of 0.09 (M = 0.39 vs. M = 0.48, respectively), t = 2.93, p < Interestingly, the differences between the two approaches were quite large for Station 9. Using the Modified Borderline-Group Method, the cut score was 5.49 with 69% of the examinees passing. With the Borderline Regression approach the cut score was 4.79 and 92% of the examinees passed. A subsequent analysis of this station revealed that of the 12 examinees rated as borderline using the Modified Borderline-Group Method, all 12 had been rated as borderline satisfactory and therefore the cut score the station was set relatively high. This station demonstrates one of the potential problems of using the Modified Borderline-Group Method when there are few examinees. Discussion In this study, we describe two standard setting methods appropriate for use with a small scale OSCE, discuss the strengths and potential weakness of both methods and demonstrate that a cut score derived from a Borderline Regression method was more accurate than one derived using the Modified Borderline-Group Method.
7 STANDARD SETTING IN A SMALL SCALE OSCE 121 There were relatively small differences in the cut scores and pass rates derived from both methods with cut scores determined using a regression approach being slightly lower for most stations. This is anticipated based on the earlier discussion, which suggests that simple averaging of borderline groups will lead to generally biased results. In addition, there was decreased statistical error using the regression estimates. There are other advantages to using a regression approach. First, as demonstrated by Station 9 on the OSCE, a cut score determined using the Modified Borderline-Group Method is more susceptible to variations in the distribution of scores in the borderline groups than is the regression approach. Second, because the borderline group(s) are in the lower tail of the overall distribution, the actual distribution of scores within the groups(s) will usually be skewed to the left so that the average will be biased on the high side. Linear regression uses values across a continuous dimension and therefore avoids this computational bias. The principal disadvantage to using the Borderline Regression approach is related to the statistical complexity. We performed a number of exploratory analyses of the regression approach including a residual analysis to look for outliers, a lack of fit test, and the calculation of the 95% confidence interval associated with the regression line. These analyses were done primarily because this was a new method of standard setting and we wanted to ensure that its use met the assumptions associated with linear regression and to determine the accuracy of the decision compared to the Modified Borderline- Group Method. These analyses are beyond what the typical university OSCE administrator would need to do. Simple linear regression analysis is quite easy to perform and can be conducted in most popular statistics packages available to users. The improved accuracy of a cut score calculated using a regression-based approach, compares favorably to a study reported by Kramer et al. (2003). Using senior post-graduate trainees and general practioners as examinees on an OSCE, they calculated the root mean squared error (RMSE) term derived from a generalizability analysis to determine the amount of error associated with cut scores calculated using a regression approach and two types of Angoff procedures. The RMSE associated with the regression approach was less than the RMSE associated with the Angoff procedures. In addition, the overall pass rates of the examinees were more credible than those associated with the Angoff procedures. In summary, small-scale OSCE administrators have various constraints that dictate which standard setting method is used to determine a cut score. The Modified Borderline-Group Method has many advantages in that it is easy to use, doesn t require a great deal of statistical support, and is an efficient and defensible method of standard setting. Despite these advantages,
8 122 TIMOTHY J. WOOD ET AL. potential problems can occur when it is applied to a small scale OSCEs. A linear regression approach, similar in nature to the Modified Borderline- Group Method, demonstrated all of the benefits of the latter standard setting method and also proved to have less statistical error associated with it. Before implementing the Borderline Regression Method at the University of Ottawa, future studies will track, across time and examinations, the pass/fail marks derived from both methods. The possibility of extending this approach to a high stakes clinical examination like that used by the Medical Council of Canada will also be investigated. References Cusimano, M. (1996). Standard setting in medical education. Academic Medicine 71: s112 s120. Dauphinee, W.D., Blackmore, D.E., Smee, S.M., Rothman, A.I. & Reznick, R.K. (1997). Using the judgments of physician examiners in setting standards for a national multi-centre high stakes OSCE. Advances in Health Science Education: Theory and Practice 2: Dixon, W.F & Massey, F.J. (1969). Introduction to Statistical Analysis. New York: McGraw Hill. Humphrey-Murto, S. & MacFadyen, J.C. (2002). Standard Setting: A comparison of case-author and modified borderline-group methods in a small-scale OSCE. Academic Medicine 77: Kleinbaum, D.G., Kupper, L.L. & Muller, K.E. (1988). Applied Regression Analysis and Other Multivariable Methods. Belmont, CA: Duxbury Press. Kramer, A., Muitjens, A., Jansen, K., Dusman, H., Tan, L. & van der Vleuten, C. (2003). Comparison of a rational and an empirical standard setting procedure for an OSCE. Medical Education 37: MacFadyen, I.J.C. (1996). A Modified Borderline Groups Method to Establish Case-Based Pass/Fail Decisions for an Undergraduate Objective Structured Clinical Exam: Exploring Issues of Validity. Masters dissertation, University of Illinois at Chicago, Chicago. Smee, S.M. & Blackmore, D.E. (2001). Setting standards for an objective structured clinical examination: the borderline group method gains ground on Angoff. Medical Education 35: Wilkinson, T.J., Newble, D.I. & Frampton, C.M. (2001). Standard setting in an objective structured clinical examination: use of global ratings of borderline performance to determine the passing score. Medical Education 35: Woehr, D.J., Arthur, W. & Fehrmann, M.L (1991). An empirical comparison of cutoff score methods for content-related and criterion-related validity settings. Educational and Psychological Measurement 51:
RESEARCH ARTICLES Objective Structured Clinical Examinations in Doctor of Pharmacy Programs in the United States
RESEARCH ARTICLES Objective Structured Clinical Examinations in Doctor of Pharmacy Programs in the United States Deborah A. Sturpe, PharmD American Journal of Pharmaceutical Education 2010; 74 (8) Article
More informationAlgebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview
Algebra 1, Quarter 3, Unit 3.1 Line of Best Fit Overview Number of instructional days 6 (1 day assessment) (1 day = 45 minutes) Content to be learned Analyze scatter plots and construct the line of best
More informationAGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS
AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS 1 CALIFORNIA CONTENT STANDARDS: Chapter 1 ALGEBRA AND WHOLE NUMBERS Algebra and Functions 1.4 Students use algebraic
More informationVOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.
Exploratory Study on Factors that Impact / Influence Success and failure of Students in the Foundation Computer Studies Course at the National University of Samoa 1 2 Elisapeta Mauai, Edna Temese 1 Computing
More informationThe Objective Structured Clinical Examination (OSCE): AMEE Guide No. 81. Part II: Organisation & Administration
Medical Teacher ISSN: 0142-159X (Print) 1466-187X (Online) Journal homepage: http://www.tandfonline.com/loi/imte20 The Objective Structured Clinical Examination (OSCE): AMEE Guide No. 81. Part II: Organisation
More informationProbability and Statistics Curriculum Pacing Guide
Unit 1 Terms PS.SPMJ.3 PS.SPMJ.5 Plan and conduct a survey to answer a statistical question. Recognize how the plan addresses sampling technique, randomization, measurement of experimental error and methods
More information4.0 CAPACITY AND UTILIZATION
4.0 CAPACITY AND UTILIZATION The capacity of a school building is driven by four main factors: (1) the physical size of the instructional spaces, (2) the class size limits, (3) the schedule of uses, and
More informationLecture 1: Machine Learning Basics
1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3
More informationEducational Leadership and Policy Studies Doctoral Programs (Ed.D. and Ph.D.)
Contact: Susan Korach susan.korach@du.edu Morgridge Office of Admissions mce@du.edu http://morgridge.du.edu/ Educational Leadership and Policy Studies Doctoral Programs (Ed.D. and Ph.D.) Doctoral (Ed.D.
More informationSchool Size and the Quality of Teaching and Learning
School Size and the Quality of Teaching and Learning An Analysis of Relationships between School Size and Assessments of Factors Related to the Quality of Teaching and Learning in Primary Schools Undertaken
More informationThe lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.
Name: Partner(s): Lab #1 The Scientific Method Due 6/25 Objective The lab is designed to remind you how to work with scientific data (including dealing with uncertainty) and to review experimental design.
More informationLinking the Common European Framework of Reference and the Michigan English Language Assessment Battery Technical Report
Linking the Common European Framework of Reference and the Michigan English Language Assessment Battery Technical Report Contact Information All correspondence and mailings should be addressed to: CaMLA
More informationStatewide Framework Document for:
Statewide Framework Document for: 270301 Standards may be added to this document prior to submission, but may not be removed from the framework to meet state credit equivalency requirements. Performance
More informationOn-the-Fly Customization of Automated Essay Scoring
Research Report On-the-Fly Customization of Automated Essay Scoring Yigal Attali Research & Development December 2007 RR-07-42 On-the-Fly Customization of Automated Essay Scoring Yigal Attali ETS, Princeton,
More informationInstructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100
San Diego State University School of Social Work 610 COMPUTER APPLICATIONS FOR SOCIAL WORK PRACTICE Statistical Package for the Social Sciences Office: Hepner Hall (HH) 100 Instructor: Mario D. Garrett,
More informationMeasurement. When Smaller Is Better. Activity:
Measurement Activity: TEKS: When Smaller Is Better (6.8) Measurement. The student solves application problems involving estimation and measurement of length, area, time, temperature, volume, weight, and
More informationSETTING STANDARDS FOR CRITERION- REFERENCED MEASUREMENT
SETTING STANDARDS FOR CRITERION- REFERENCED MEASUREMENT By: Dr. MAHMOUD M. GHANDOUR QATAR UNIVERSITY Improving human resources is the responsibility of the educational system in many societies. The outputs
More informationTHE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS
THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS ELIZABETH ANNE SOMERS Spring 2011 A thesis submitted in partial
More informationExtending Place Value with Whole Numbers to 1,000,000
Grade 4 Mathematics, Quarter 1, Unit 1.1 Extending Place Value with Whole Numbers to 1,000,000 Overview Number of Instructional Days: 10 (1 day = 45 minutes) Content to Be Learned Recognize that a digit
More informationTechnical Manual Supplement
VERSION 1.0 Technical Manual Supplement The ACT Contents Preface....................................................................... iii Introduction....................................................................
More informationUnderstanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)
Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010) Jaxk Reeves, SCC Director Kim Love-Myers, SCC Associate Director Presented at UGA
More informationBENCHMARK TREND COMPARISON REPORT:
National Survey of Student Engagement (NSSE) BENCHMARK TREND COMPARISON REPORT: CARNEGIE PEER INSTITUTIONS, 2003-2011 PREPARED BY: ANGEL A. SANCHEZ, DIRECTOR KELLI PAYNE, ADMINISTRATIVE ANALYST/ SPECIALIST
More informationCS Machine Learning
CS 478 - Machine Learning Projects Data Representation Basic testing and evaluation schemes CS 478 Data and Testing 1 Programming Issues l Program in any platform you want l Realize that you will be doing
More informationIntroduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition
Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and
More informationHow to Judge the Quality of an Objective Classroom Test
How to Judge the Quality of an Objective Classroom Test Technical Bulletin #6 Evaluation and Examination Service The University of Iowa (319) 335-0356 HOW TO JUDGE THE QUALITY OF AN OBJECTIVE CLASSROOM
More informationUse of the Kalamazoo Essential Elements Communication Checklist (Adapted) in an Institutional Interpersonal and Communication Skills Curriculum
Use of the Kalamazoo Essential Elements Communication Checklist (Adapted) in an Institutional Interpersonal and Communication Skills Curriculum Barbara L. Joyce, PhD Timothy Steenbergh, PhD Eric Scher,
More informationThe My Class Activities Instrument as Used in Saturday Enrichment Program Evaluation
Running Head: MY CLASS ACTIVITIES My Class Activities 1 The My Class Activities Instrument as Used in Saturday Enrichment Program Evaluation Nielsen Pereira Purdue University Scott J. Peters University
More informationNCEO Technical Report 27
Home About Publications Special Topics Presentations State Policies Accommodations Bibliography Teleconferences Tools Related Sites Interpreting Trends in the Performance of Special Education Students
More informationMulti-Lingual Text Leveling
Multi-Lingual Text Leveling Salim Roukos, Jerome Quin, and Todd Ward IBM T. J. Watson Research Center, Yorktown Heights, NY 10598 {roukos,jlquinn,tward}@us.ibm.com Abstract. Determining the language proficiency
More informationEvidence for Reliability, Validity and Learning Effectiveness
PEARSON EDUCATION Evidence for Reliability, Validity and Learning Effectiveness Introduction Pearson Knowledge Technologies has conducted a large number and wide variety of reliability and validity studies
More informationMultiple regression as a practical tool for teacher preparation program evaluation
Multiple regression as a practical tool for teacher preparation program evaluation ABSTRACT Cynthia Williams Texas Christian University In response to No Child Left Behind mandates, budget cuts and various
More informationOVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE
OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE Mark R. Shinn, Ph.D. Michelle M. Shinn, Ph.D. Formative Evaluation to Inform Teaching Summative Assessment: Culmination measure. Mastery
More informationSection 3.4 Assessing barriers and facilitators to knowledge use
Section 3.4 Assessing barriers and facilitators to knowledge use France Légaré, MD, PhD Canada Research Chair in Implementation of Shared Decision Making in Primary Care Centre de recherche, Hôpital St-François
More informationGrade 6: Correlated to AGS Basic Math Skills
Grade 6: Correlated to AGS Basic Math Skills Grade 6: Standard 1 Number Sense Students compare and order positive and negative integers, decimals, fractions, and mixed numbers. They find multiples and
More informationAnalysis of Enzyme Kinetic Data
Analysis of Enzyme Kinetic Data To Marilú Analysis of Enzyme Kinetic Data ATHEL CORNISH-BOWDEN Directeur de Recherche Émérite, Centre National de la Recherche Scientifique, Marseilles OXFORD UNIVERSITY
More informationLahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017
Instructor Syed Zahid Ali Room No. 247 Economics Wing First Floor Office Hours Email szahid@lums.edu.pk Telephone Ext. 8074 Secretary/TA TA Office Hours Course URL (if any) Suraj.lums.edu.pk FINN 321 Econometrics
More informationHierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationHow do we balance statistical evidence with expert judgement when aligning tests to the CEFR?
How do we balance statistical evidence with expert judgement when aligning tests to the CEFR? Professor Anthony Green CRELLA University of Bedfordshire Colin Finnerty Senior Assessment Manager Oxford University
More informationMontana Content Standards for Mathematics Grade 3. Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011
Montana Content Standards for Mathematics Grade 3 Montana Content Standards for Mathematical Practices and Mathematics Content Adopted November 2011 Contents Standards for Mathematical Practice: Grade
More informationINTERNAL MEDICINE IN-TRAINING EXAMINATION (IM-ITE SM )
INTERNAL MEDICINE IN-TRAINING EXAMINATION (IM-ITE SM ) GENERAL INFORMATION The Internal Medicine In-Training Examination, produced by the American College of Physicians and co-sponsored by the Alliance
More informationMaximizing Learning Through Course Alignment and Experience with Different Types of Knowledge
Innov High Educ (2009) 34:93 103 DOI 10.1007/s10755-009-9095-2 Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Phyllis Blumberg Published online: 3 February
More informationMath 098 Intermediate Algebra Spring 2018
Math 098 Intermediate Algebra Spring 2018 Dept. of Mathematics Instructor's Name: Office Location: Office Hours: Office Phone: E-mail: MyMathLab Course ID: Course Description This course expands on the
More informationCHAPTER 4: REIMBURSEMENT STRATEGIES 24
CHAPTER 4: REIMBURSEMENT STRATEGIES 24 INTRODUCTION Once state level policymakers have decided to implement and pay for CSR, one issue they face is simply how to calculate the reimbursements to districts
More informationTun your everyday simulation activity into research
Tun your everyday simulation activity into research Chaoyan Dong, PhD, Sengkang Health, SingHealth Md Khairulamin Sungkai, UBD Pre-conference workshop presented at the inaugual conference Pan Asia Simulation
More informationEvaluation of a College Freshman Diversity Research Program
Evaluation of a College Freshman Diversity Research Program Sarah Garner University of Washington, Seattle, Washington 98195 Michael J. Tremmel University of Washington, Seattle, Washington 98195 Sarah
More informationLearning Disability Functional Capacity Evaluation. Dear Doctor,
Dear Doctor, I have been asked to formulate a vocational opinion regarding NAME s employability in light of his/her learning disability. To assist me with this evaluation I would appreciate if you can
More informationMGT/MGP/MGB 261: Investment Analysis
UNIVERSITY OF CALIFORNIA, DAVIS GRADUATE SCHOOL OF MANAGEMENT SYLLABUS for Fall 2014 MGT/MGP/MGB 261: Investment Analysis Daytime MBA: Tu 12:00p.m. - 3:00 p.m. Location: 1302 Gallagher (CRN: 51489) Sacramento
More informationInterdisciplinary Journal of Problem-Based Learning
Interdisciplinary Journal of Problem-Based Learning Volume 6 Issue 1 Article 9 Published online: 3-27-2012 Relationships between Language Background, Secondary School Scores, Tutorial Group Processes,
More informationTeachers development in educational systems
Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 47 ( 2012 ) 250 255 CY-ICER 2012 Teachers development in educational systems Sooan Laei* Kermanshah Branch, Islamic Azad
More informationStrategy for teaching communication skills in dentistry
Strategy for teaching communication in dentistry SADJ July 2010, Vol 65 No 6 p260 - p265 Prof. JG White: Head: Department of Dental Management Sciences, School of Dentistry, University of Pretoria, E-mail:
More informationMathematics process categories
Mathematics process categories All of the UK curricula define multiple categories of mathematical proficiency that require students to be able to use and apply mathematics, beyond simple recall of facts
More informationFINAL EXAMINATION OBG4000 AUDIT June 2011 SESSION WRITTEN COMPONENT & LOGBOOK ASSESSMENT
L-UNIVERSITÀ TA MALTA Msida Malta SKOLA MEDIKA Sptar Mater Dei Prof. Charles Savona-Ventura MD, DScMed, FRCOG, AccrCOG, MRCPI Head Department of Obstetrics & Gynaecology UNIVERSITY OF MALTA Msida Malta
More informationTHEORY OF PLANNED BEHAVIOR MODEL IN ELECTRONIC LEARNING: A PILOT STUDY
THEORY OF PLANNED BEHAVIOR MODEL IN ELECTRONIC LEARNING: A PILOT STUDY William Barnett, University of Louisiana Monroe, barnett@ulm.edu Adrien Presley, Truman State University, apresley@truman.edu ABSTRACT
More informationRyerson University Sociology SOC 483: Advanced Research and Statistics
Ryerson University Sociology SOC 483: Advanced Research and Statistics Prerequisites: SOC 481 Instructor: Paul S. Moore E-mail: psmoore@ryerson.ca Office: Sociology Department Jorgenson JOR 306 Phone:
More informationFurther, Robert W. Lissitz, University of Maryland Huynh Huynh, University of South Carolina ADEQUATE YEARLY PROGRESS
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationEDUCATIONAL ATTAINMENT
EDUCATIONAL ATTAINMENT By 2030, at least 60 percent of Texans ages 25 to 34 will have a postsecondary credential or degree. Target: Increase the percent of Texans ages 25 to 34 with a postsecondary credential.
More informationIntroduction to Questionnaire Design
Introduction to Questionnaire Design Why this seminar is necessary! Bad questions are everywhere! Don t let them happen to you! Fall 2012 Seminar Series University of Illinois www.srl.uic.edu The first
More informationThe Good Judgment Project: A large scale test of different methods of combining expert predictions
The Good Judgment Project: A large scale test of different methods of combining expert predictions Lyle Ungar, Barb Mellors, Jon Baron, Phil Tetlock, Jaime Ramos, Sam Swift The University of Pennsylvania
More informationPsychometric Research Brief Office of Shared Accountability
August 2012 Psychometric Research Brief Office of Shared Accountability Linking Measures of Academic Progress in Mathematics and Maryland School Assessment in Mathematics Huafang Zhao, Ph.D. This brief
More informationAn Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District
An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District Report Submitted June 20, 2012, to Willis D. Hawley, Ph.D., Special
More informationInstructions and Guidelines for Promotion and Tenure Review of IUB Librarians
Instructions and Guidelines for Promotion and Tenure Review of IUB Librarians Approved by the IUB Library Faculty June 2012. Future amendment by vote of Bloomington Library Faculty Council. Amended August
More informationRendezvous with Comet Halley Next Generation of Science Standards
Next Generation of Science Standards 5th Grade 6 th Grade 7 th Grade 8 th Grade 5-PS1-3 Make observations and measurements to identify materials based on their properties. MS-PS1-4 Develop a model that
More informationKarla Brooks Baehr, Ed.D. Senior Advisor and Consultant The District Management Council
Karla Brooks Baehr, Ed.D. Senior Advisor and Consultant The District Management Council This paper aims to inform the debate about how best to incorporate student learning into teacher evaluation systems
More informationAssignment 1: Predicting Amazon Review Ratings
Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for
More informationUsing the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the SAT
The Journal of Technology, Learning, and Assessment Volume 6, Number 6 February 2008 Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees Cognitive Skills in Algebra on the
More informationThe patient-centered medical
Primary Care Residents Want to Learn About the Patient- Centered Medical Home Gerardo Moreno, MD, MSHS; Julia Gold, MD; Maureen Mavrinac, MD BACKGROUND AND OBJECTIVES: The patient-centered medical home
More informationNumeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C
Numeracy Medium term plan: Summer Term Level 2C/2B Year 2 Level 2A/3C Using and applying mathematics objectives (Problem solving, Communicating and Reasoning) Select the maths to use in some classroom
More informationPractical Research. Planning and Design. Paul D. Leedy. Jeanne Ellis Ormrod. Upper Saddle River, New Jersey Columbus, Ohio
SUB Gfittingen 213 789 981 2001 B 865 Practical Research Planning and Design Paul D. Leedy The American University, Emeritus Jeanne Ellis Ormrod University of New Hampshire Upper Saddle River, New Jersey
More informationApplications from foundation doctors to specialty training. Reporting tool user guide. Contents. last updated July 2016
Applications from foundation doctors to specialty training Reporting tool user guide last updated July 2016 Contents Overview... 2 Purpose of the reports... 2 The reports can be found on the GMC website:...
More informationMathematics subject curriculum
Mathematics subject curriculum Dette er ei omsetjing av den fastsette læreplanteksten. Læreplanen er fastsett på Nynorsk Established as a Regulation by the Ministry of Education and Research on 24 June
More informationvalue equivalent 6. Attendance Full-time Part-time Distance learning Mode of attendance 5 days pw n/a n/a
PROGRAMME APPROVAL FORM SECTION 1 THE PROGRAMME SPECIFICATION 1. Programme title and designation Orthodontics 2. Final award Award Title Credit ECTS Any special criteria value equivalent MSc Orthodontics
More informationFunctional Maths Skills Check E3/L x
Functional Maths Skills Check E3/L1 Name: Date started: The Four Rules of Number + - x May 2017. Kindly contributed by Nicola Smith, Gloucestershire College. Search for Nicola on skillsworkshop.org Page
More informationOFFICE OF ENROLLMENT MANAGEMENT. Annual Report
2014-2015 OFFICE OF ENROLLMENT MANAGEMENT Annual Report Table of Contents 2014 2015 MESSAGE FROM THE VICE PROVOST A YEAR OF RECORDS 3 Undergraduate Enrollment 6 First-Year Students MOVING FORWARD THROUGH
More informationPaper 2. Mathematics test. Calculator allowed. First name. Last name. School KEY STAGE TIER
259574_P2 5-7_KS3_Ma.qxd 1/4/04 4:14 PM Page 1 Ma KEY STAGE 3 TIER 5 7 2004 Mathematics test Paper 2 Calculator allowed Please read this page, but do not open your booklet until your teacher tells you
More informationDetailed course syllabus
Detailed course syllabus 1. Linear regression model. Ordinary least squares method. This introductory class covers basic definitions of econometrics, econometric model, and economic data. Classification
More informationJulia Smith. Effective Classroom Approaches to.
Julia Smith @tessmaths Effective Classroom Approaches to GCSE Maths resits julia.smith@writtle.ac.uk Agenda The context of GCSE resit in a post-16 setting An overview of the new GCSE Key features of a
More informationA Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many
Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.
More informationIf we want to measure the amount of cereal inside the box, what tool would we use: string, square tiles, or cubes?
String, Tiles and Cubes: A Hands-On Approach to Understanding Perimeter, Area, and Volume Teaching Notes Teacher-led discussion: 1. Pre-Assessment: Show students the equipment that you have to measure
More informationMYCIN. The MYCIN Task
MYCIN Developed at Stanford University in 1972 Regarded as the first true expert system Assists physicians in the treatment of blood infections Many revisions and extensions over the years The MYCIN Task
More informationFunctional Skills Mathematics Level 2 assessment
Functional Skills Mathematics Level 2 assessment www.cityandguilds.com September 2015 Version 1.0 Marking scheme ONLINE V2 Level 2 Sample Paper 4 Mark Represent Analyse Interpret Open Fixed S1Q1 3 3 0
More informationProcess Evaluations for a Multisite Nutrition Education Program
Process Evaluations for a Multisite Nutrition Education Program Paul Branscum 1 and Gail Kaye 2 1 The University of Oklahoma 2 The Ohio State University Abstract Process evaluations are an often-overlooked
More informationHow People Learn Physics
How People Learn Physics Edward F. (Joe) Redish Dept. Of Physics University Of Maryland AAPM, Houston TX, Work supported in part by NSF grants DUE #04-4-0113 and #05-2-4987 Teaching complex subjects 2
More informationLecture 2: Quantifiers and Approximation
Lecture 2: Quantifiers and Approximation Case study: Most vs More than half Jakub Szymanik Outline Number Sense Approximate Number Sense Approximating most Superlative Meaning of most What About Counting?
More informationMachine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler
Machine Learning and Data Mining Ensembles of Learners Prof. Alexander Ihler Ensemble methods Why learn one classifier when you can learn many? Ensemble: combine many predictors (Weighted) combina
More informationPHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018
1 PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018 Department Of Psychology and Behavioural Sciences AARHUS UNIVERSITY Course coordinator: Anne Scharling Rasmussen Lectures: Ali Amidi (AA), Kaare Bro
More informationGuidelines for Writing an Internship Report
Guidelines for Writing an Internship Report Master of Commerce (MCOM) Program Bahauddin Zakariya University, Multan Table of Contents Table of Contents... 2 1. Introduction.... 3 2. The Required Components
More informationImproving Conceptual Understanding of Physics with Technology
INTRODUCTION Improving Conceptual Understanding of Physics with Technology Heidi Jackman Research Experience for Undergraduates, 1999 Michigan State University Advisors: Edwin Kashy and Michael Thoennessen
More informationSociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website
Sociology 521: Social Statistics and Quantitative Methods I Spring 2012 Wed. 2 5, Kap 305 Computer Lab Instructor: Tim Biblarz Office hours (Kap 352): W, 5 6pm, F, 10 11, and by appointment (213) 740 3547;
More informationSURVIVING ON MARS WITH GEOGEBRA
SURVIVING ON MARS WITH GEOGEBRA Lindsey States and Jenna Odom Miami University, OH Abstract: In this paper, the authors describe an interdisciplinary lesson focused on determining how long an astronaut
More informationGRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics
2017-2018 GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics Entrance requirements, program descriptions, degree requirements and other program policies for Biostatistics Master s Programs
More informationSector Differences in Student Learning: Differences in Achievement Gains Across School Years and During the Summer
Catholic Education: A Journal of Inquiry and Practice Volume 7 Issue 2 Article 6 July 213 Sector Differences in Student Learning: Differences in Achievement Gains Across School Years and During the Summer
More informationThe Effect of Written Corrective Feedback on the Accuracy of English Article Usage in L2 Writing
Journal of Applied Linguistics and Language Research Volume 3, Issue 1, 2016, pp. 110-120 Available online at www.jallr.com ISSN: 2376-760X The Effect of Written Corrective Feedback on the Accuracy of
More informationReference to Tenure track faculty in this document includes tenured faculty, unless otherwise noted.
PHILOSOPHY DEPARTMENT FACULTY DEVELOPMENT and EVALUATION MANUAL Approved by Philosophy Department April 14, 2011 Approved by the Office of the Provost June 30, 2011 The Department of Philosophy Faculty
More informationMay To print or download your own copies of this document visit Name Date Eurovision Numeracy Assignment
1. An estimated one hundred and twenty five million people across the world watch the Eurovision Song Contest every year. Write this number in figures. 2. Complete the table below. 2004 2005 2006 2007
More informationWE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT
WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working
More informationGCSE English Language 2012 An investigation into the outcomes for candidates in Wales
GCSE English Language 2012 An investigation into the outcomes for candidates in Wales Qualifications and Learning Division 10 September 2012 GCSE English Language 2012 An investigation into the outcomes
More informationExecutive Guide to Simulation for Health
Executive Guide to Simulation for Health Simulation is used by Healthcare and Human Service organizations across the World to improve their systems of care and reduce costs. Simulation offers evidence
More informationThe Impact of Postgraduate Health Technology Innovation Training: Outcomes of the Stanford Biodesign Fellowship
Annals of Biomedical Engineering, Vol. 45, No. 5, May 2017 (Ó 2016) pp. 1163 1171 DOI: 10.1007/s10439-016-1777-1 The Impact of Postgraduate Health Technology Innovation Training: Outcomes of the Stanford
More informationKansas Adequate Yearly Progress (AYP) Revised Guidance
Kansas State Department of Education Kansas Adequate Yearly Progress (AYP) Revised Guidance Based on Elementary & Secondary Education Act, No Child Left Behind (P.L. 107-110) Revised May 2010 Revised May
More information