Quantitative Research Critique

The article by Zane Olina and Howard J. Sullivan, Effects of Classroom Evaluation Strategies on Student Achievement and Attitudes, describes a quasi-experimental study investigating the effect of different evaluation strategies on student performance and attitude. During a 12-lesson instructional program called Learning Explorations, student classes were subject to one of three evaluation strategies: 1) no evaluation; 2) formative teacher evaluations; or 3) formative teacher evaluations plus formative self-evaluations. The authors rated the final reports produced by the students and collected survey data to determine whether these assessment strategies resulted in differences in student performance and attitude.

Review of the literature

The literature review cites four studies dealing with the positive impact of formative evaluation on student performance. Two other studies are cited that show no effect of teacher evaluation on student performance. A list of effective evaluation characteristics is drawn from five works and used by the authors in the development of their own teacher-evaluation instruments (Olina and Sullivan, 2002, page 63). Nine studies showing positive results from the use of student self-evaluation methods were included in the review. One article comparing teacher evaluation to student self-evaluation and peer-evaluation methods found no performance differences among the three assessment models, but significant differences in motivational levels (Olina and Sullivan, page 62).

For this level of research the literature review was adequate. The authors also referenced previous literature to defend their design of the treatment instruments (Olina and Sullivan, page 63), and again in their choice to use the quality of the students' final reports as a criterion measure (Olina and Sullivan, page 63).

Research hypotheses

The environment of the Latvian school system is described to provide a general context for the research. It speaks to need and audience: little formative evaluation is practiced or encouraged in Latvian schools. The authors provide a broad statement of purpose from which they develop three research questions to investigate. In the introduction to the article they identify the criterion measures that will be used to investigate the first two research questions, which deal with student performance, but they fail to mention the post-survey that will be used to collect data on student attitudes. In the Criterion Measures section of the article two additional measures, student attitude surveys and teacher attitude surveys, are identified. The teacher attitude surveys do not seem to apply directly to any of the research statements.

Broad Concepts Statement: The present study investigated the effects of teacher evaluation and student self-evaluation on student posttest scores, the quality of student research reports, and student attitudes.

Specific Research Statements and Criterion Measures:

1. Does teacher evaluation have a positive effect on student performance?
   Criterion measures: scores on post-test; quality of student research reports.
2. Does the combination of teacher evaluation and student self-evaluation have a different effect on student performance than teacher evaluation alone?
   Criterion measures: scores on post-test; quality of student research reports.
3. Does the combination of teacher evaluation and student self-evaluation have a different effect on student attitudes than teacher evaluation alone?
   Criterion measures: student attitude surveys; teacher attitude surveys.

Participants

The description of the sampling method would have been much clearer if the authors had stated that the student classes used in the study were convenience samples: intact classes taught by the six teachers involved. That would also have required the authors to identify the limitations of such groups:

"As it does not represent any group apart from itself, it does not seek to generalize about the wider population; for a convenience sample that is an irrelevance. The researcher, of course, must take pains to report this point that the parameters of generalizability in this type of sample are negligible." (Cohen, 2000, page 103)

The authors state that the twelve classes selected for the study were representative of both rural and urban areas and of varied socio-economic backgrounds. The classes were from five schools in different regions of Latvia. A significant problem follows from this. Because the classes are treated as six subject groups, with one teacher and two classes being assigned a particular treatment, the 186 students are already divided into dissimilar subject groups before the study begins. Since assignment of treatment is per teacher, two classes of rural students may be prescribed self-assessment as a treatment, whereas two classes of urban students may be prescribed no treatment. Differences in post-tests and post-surveys between these groups could stem from the treatments or from their respective urban or rural cultures.

Procedure

The application of treatments as described by the authors is unintelligible:

"In order to assign teachers to treatments, the researcher ranked all pairs of classes for each teacher from the highest achieving to the lowest achieving, based on the student 9th Grade Graduation Exam scores in mathematics and the Latvian language. The pairs of classes for each teacher were divided into high-achieving and low-achieving classes using a median split. Teachers with classes from each group were then randomly assigned to one of the three treatments." (Olina and Sullivan, page 66)
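One possible reading of the quoted procedure can be sketched in code. This is only an illustration under stated assumptions, not the authors' confirmed method: it assumes each teacher's pair of classes is summarized by a single mean exam score, that the median split is applied across the six teachers, and that each treatment is then randomly given to one teacher in each half. The function name `assign_treatments`, the teacher labels, and all scores are hypothetical.

```python
import random
from statistics import median

def assign_treatments(teacher_scores, treatments, seed=0):
    """Median-split teachers on prior achievement, then randomize within each half.

    Assumed interpretation only; the article's wording admits other readings.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    cutoff = median(teacher_scores.values())
    high = sorted(t for t, s in teacher_scores.items() if s > cutoff)
    low = sorted(t for t, s in teacher_scores.items() if s <= cutoff)

    assignment = {}
    for stratum in (high, low):
        shuffled = list(treatments)
        rng.shuffle(shuffled)
        # Pair each teacher in the stratum with a distinct treatment.
        for teacher, treatment in zip(stratum, shuffled):
            assignment[teacher] = treatment
    return assignment

# Hypothetical mean exam scores for each teacher's pair of classes.
teacher_scores = {"T1": 71.0, "T2": 64.5, "T3": 58.0,
                  "T4": 77.5, "T5": 61.0, "T6": 69.0}
treatments = ["no evaluation", "teacher evaluation",
              "teacher plus self-evaluation"]

assignment = assign_treatments(teacher_scores, treatments)
for teacher in sorted(assignment):
    print(teacher, "->", assignment[teacher])
```

Under this reading every treatment is delivered by one higher-achieving and one lower-achieving teacher, which balances prior achievement across treatments but still leaves each treatment tied to only two teachers.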

It is difficult to determine whether the application of the treatments was indeed random. Ranking "all pairs of classes for each teacher" involves ranking only two classes for each teacher. Next, "the pairs of classes for each teacher were divided into high-achieving and low-achieving classes using a median split" might mean that the students were actually reassigned within the classes, but could also mean that the two classes were labeled using a median score. Finally, "teachers with classes from each group were then randomly assigned to one of the three treatments." The wording here suggests that after dividing a pair of classes on a median split, there were teachers who somehow managed not to have a class from each group. At this point, anyone critiquing this study is forced to make his or her own assumptions and move on. This lack of clarity would certainly make it very difficult for someone to replicate the study.

Before the experiment begins, another extraneous variable is introduced that makes the three treatment groups unequal:

"All teachers received the same version of the instructional program. Teachers in the no-evaluation group received no additional instructions for use of the program. Teachers in the remaining two treatments received additional instructions describing the evaluation procedures that they were expected to complete for their evaluation condition." (Olina and Sullivan, page 66)

Instructors, and possibly students, who received additional training on evaluation procedures would have a better understanding of the objectives of the program, and this could affect their performance and the results of the study.

Instrumentation

The four criterion measures used in this experiment are: scores on the post-test, quality of student research reports, student attitude surveys, and teacher attitude surveys.
The authors describe each of these in detail and refer to alignment with widely accepted standards, such as the interrater reliability scores for the rating of students' projects (Olina and Sullivan, page 65) and Cronbach's alpha scores for internal reliability of the post-test (page 65, para 2). That the teacher attitude survey does not address any of the specific research questions posed is the only significant weakness in this section of the article.

Results

Tables showing statistical results for mean project report and posttest scores by treatment group are provided. Mean ratings for eight distinct statements on the student attitude survey are also provided and listed by treatment group. A narrative of responses from the teacher attitude surveys is provided, although these responses do not address any of the specific research questions posed. Classroom observations are also reported. These observations were not put forth as a criterion measure, and the fact that visits took place raises questions of how, if at all, they may have affected the progress and results of the experiment.

Discussion and Conclusions

The discussion and conclusions provide recommendations for classroom application and future studies, which are derived from the results. However, the limitations and lack of external validity posed by the choice of convenience sampling, together with the confusing procedures listed for treatment application, mean that the study may not be generalized or easily reproduced. Some limitations of the study, such as the lack of a self-assessment-only treatment and the unfamiliarity of the teaching and assessment methods to the Latvian teachers, are pointed out. However, the disparate instruction given to the treatment groups is not pointed out as a confounding variable. Overall, the study suffers from a lack of consistency among the convenience samples and a lack of control of independent variables.
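The concern that pre-existing group differences can masquerade as treatment effects can be made concrete with a small arithmetic sketch. All numbers below are made up for illustration and are not taken from the article: the baselines for urban and rural classes are hypothetical, and the true treatment effect is deliberately set to zero.

```python
from statistics import mean

# Hypothetical baseline post-test means by setting (illustrative values only).
SITE_BASELINE = {"urban": 70.0, "rural": 62.0}
TRUE_TREATMENT_EFFECT = 0.0  # assume the treatment does nothing at all

def class_mean(site, treated):
    """Expected class mean: site baseline plus the (zero) treatment effect."""
    return SITE_BASELINE[site] + (TRUE_TREATMENT_EFFECT if treated else 0.0)

def naive_effect(design):
    """Treated-minus-control difference in class means for a given design.

    `design` is a list of (site, treated) pairs, one pair per intact class.
    """
    treated = [class_mean(site, True) for site, flag in design if flag]
    control = [class_mean(site, False) for site, flag in design if not flag]
    return mean(treated) - mean(control)

# Treatment tied to setting: both urban classes treated, both rural untreated.
confounded = [("urban", True), ("urban", True),
              ("rural", False), ("rural", False)]
# Treatment balanced across settings.
balanced = [("urban", True), ("rural", True),
            ("urban", False), ("rural", False)]

confounded_gap = naive_effect(confounded)
balanced_gap = naive_effect(balanced)
print(confounded_gap)  # 8.0
print(balanced_gap)    # 0.0
```

In the first design the entire 8-point gap comes from the urban and rural baselines, yet it would be read as a treatment effect. This is the situation the critique warns about when treatments are assigned to whole teachers and their intact classes.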

References

Cohen, Louis; Manion, Lawrence; and Morrison, Keith. (2000). Research Methods in Education (5th ed.). RoutledgeFalmer (London, England).

Olina, Zane; Sullivan, Howard J. (2002). Effects of Classroom Evaluation Strategies on Student Achievement and Attitudes. Educational Technology Research and Development 50, no. 3, pages 61-75.

Perry, Lenora; Crocker, Robert. (2006). Module 2: Introduction to Quantitative Research. Education 6100: Research Designs and Methods in Education. Memorial University of Newfoundland. Retrieved Feb 14, 2007 from course lecture notes.

Perry, Lenora; Crocker, Robert. (2006). Module 3: Experimental and Quasi-experimental Research. Education 6100: Research Designs and Methods in Education. Memorial University of Newfoundland. Retrieved Feb 14, 2007 from course lecture notes.