Goals of Evaluation. Evaluates level of system functionality. Evaluates effect of interface on user. Identifies specific problems

Similar documents
SOFTWARE EVALUATION TOOL

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

English Language Arts Summative Assessment

Behavior List. Ref. No. Behavior. Grade. Std. Domain/Category. Social/ Emotional will notify the teacher when angry (words, signal)

Software Maintenance

M55205-Mastering Microsoft Project 2016

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Seminar - Organic Computing

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

Multimedia Courseware of Road Safety Education for Secondary School Students

Star Math Pretest Instructions

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq

White Paper. The Art of Learning

Learning Lesson Study Course

University of Groningen. Systemen, planning, netwerken Bosman, Aart

Aviation English Solutions

CAFE ESSENTIAL ELEMENTS O S E P P C E A. 1 Framework 2 CAFE Menu. 3 Classroom Design 4 Materials 5 Record Keeping

Practical Research. Planning and Design. Paul D. Leedy. Jeanne Ellis Ormrod. Upper Saddle River, New Jersey Columbus, Ohio

Does the Difficulty of an Interruption Affect our Ability to Resume?

Thesis-Proposal Outline/Template

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

MASTER OF ARTS IN APPLIED SOCIOLOGY. Thesis Option

TA Script of Student Test Directions

K 1 2 K 1 2. Iron Mountain Public Schools Standards (modified METS) Checklist by Grade Level Page 1 of 11

PETER BLATCHFORD, PAUL BASSETT, HARVEY GOLDSTEIN & CLARE MARTIN,

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

Mastering Team Skills and Interpersonal Communication. Copyright 2012 Pearson Education, Inc. publishing as Prentice Hall.

TA Certification Course Additional Information Sheet

Running head: DELAY AND PROSPECTIVE MEMORY 1

Finding, Hiring, and Directing e-learning Voices Harlan Hogan, E-learningvoices.com

MENTORING. Tips, Techniques, and Best Practices

Renaissance Learning P.O. Box 8036 Wisconsin Rapids, WI (800)

Smarter Balanced Assessment System

Rover Races Grades: 3-5 Prep Time: ~45 Minutes Lesson Time: ~105 minutes

Exercise Format Benefits Drawbacks Desk check, audit or update

Running head: THE INTERACTIVITY EFFECT IN MULTIMEDIA LEARNING 1

DSTO WTOIBUT10N STATEMENT A

Instructional Supports for Common Core and Beyond: FORMATIVE ASSESMENT

PLCs - From Understanding to Action Handouts

OFFICE OF DISABILITY SERVICES FACULTY FREQUENTLY ASKED QUESTIONS

STEPS TO EFFECTIVE ADVOCACY

HARPER ADAMS UNIVERSITY Programme Specification

Simulation in Maritime Education and Training

OVERVIEW OF CURRICULUM-BASED MEASUREMENT AS A GENERAL OUTCOME MEASURE

AUTHORITATIVE SOURCES ADULT AND COMMUNITY LEARNING LEARNING PROGRAMMES

Circuit Simulators: A Revolutionary E-Learning Platform

Planning a research project

What is beautiful is useful visual appeal and expected information quality

1 Use complex features of a word processing application to a given brief. 2 Create a complex document. 3 Collaborate on a complex document.

Introduction to Mobile Learning Systems and Usability Factors

Introduction to Questionnaire Design

Unit purpose and aim. Level: 3 Sub-level: Unit 315 Credit value: 6 Guided learning hours: 50

Team Love <3. Because it s all about heart.

Non-Secure Information Only

What to Do When Conflict Happens

Web-based Learning Systems From HTML To MOODLE A Case Study

An Introduction to Simio for Beginners

Course Law Enforcement II. Unit I Careers in Law Enforcement

Planning a Dissertation/ Project

NATIONAL CENTER FOR EDUCATION STATISTICS RESPONSE TO RECOMMENDATIONS OF THE NATIONAL ASSESSMENT GOVERNING BOARD AD HOC COMMITTEE ON.

Modeling user preferences and norms in context-aware systems

MASTER S THESIS GUIDE MASTER S PROGRAMME IN COMMUNICATION SCIENCE

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Appendix L: Online Testing Highlights and Script

EMPOWER Self-Service Portal Student User Manual

10 Tips For Using Your Ipad as An AAC Device. A practical guide for parents and professionals

Delaware Performance Appraisal System Building greater skills and knowledge for educators

Case study Norway case 1

A GENERIC SPLIT PROCESS MODEL FOR ASSET MANAGEMENT DECISION-MAKING

Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge

Newlands Girls School

Ph.D. in Behavior Analysis Ph.d. i atferdsanalyse

Renaissance Learning 32 Harbour Exchange Square London, E14 9GE +44 (0)

Extending Learning Across Time & Space: The Power of Generalization

Examity - Adding Examity to your Moodle Course

POL EVALUATION PLAN. Created for Lucy Learned, Training Specialist Jet Blue Airways

Medical Complexity: A Pragmatic Theory

Tutor Coaching Study Research Team

Application of Virtual Instruments (VIs) for an enhanced learning environment

Scenario Design for Training Systems in Crisis Management: Training Resilience Capabilities

Just Because You Can t Count It Doesn t Mean It Doesn t Count: Doing Good Research with Qualitative Data

Cognitive Thinking Style Sample Report

Monitoring Metacognitive abilities in children: A comparison of children between the ages of 5 to 7 years and 8 to 11 years

1. Professional learning communities Prelude. 4.2 Introduction

Human Factors Engineering Design and Evaluation Checklist

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

Age Effects on Syntactic Control in. Second Language Learning

ACCT 3400, BUSN 3400-H01, ECON 3400, FINN COURSE SYLLABUS Internship for Academic Credit Fall 2017

HEROIC IMAGINATION PROJECT. A new way of looking at heroism

ACCOMMODATIONS FOR STUDENTS WITH DISABILITIES

How to read a Paper ISMLL. Dr. Josif Grabocka, Carlotta Schatten

Smarter ELA/Literacy and Mathematics Interim Comprehensive Assessment (ICA) and Interim Assessment Blocks (IABs) Test Administration Manual (TAM)

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Connect Microbiology. Training Guide

Critical Thinking in the Workplace. for City of Tallahassee Gabrielle K. Gabrielli, Ph.D.

Ministry of Education, Republic of Palau Executive Summary

elearning OVERVIEW GFA Consulting Group GmbH 1

MYCIN. The embodiment of all the clichés of what expert systems are. (Newell)

Transcription:

Evaluation 0 Tests usability and functionality of system 0 Occurs in laboratory, field and/or in collaboration with users 0 Evaluates both design and implementation 0 Should be considered at all stages in the design life cycle

Goals of Evaluation Evaluates level of system functionality Evaluates effect of interface on user Identifies specific problems

Evaluating Designs

Cognitive Walkthrough Proposed by Polson et al. evaluates design on how well it supports user in learning task usually performed by expert in cognitive psychology expert walks though design to identify possible problems using psychological principles forms used to guide analysis

Cognitive Walkthrough (ctd) 0 For each task walkthrough considers what impact will interaction have on user? what cognitive processes are required? what learning problems may occur? 0 Analysis focuses on goals and knowledge: does the design lead the user to make the correct goals?

Heuristic Evaluation 0 Proposed by Nielsen and Molich. usability criteria (heuristics) are identified design examined by experts to see if these are violated 0 Examples of heuristics o system behaviour is predictable o system behaviour is consistent o feedback is provided 0 Heuristic evaluation `debugs' design. Debugging means identifying errors and fix them

Review-based evaluation 0 Results from the written review used to support or refute (disapprove) parts of design. 0 Care needed to ensure results are transferable to new design. 0 Model-based evaluation

Evaluating through user Participation

Laboratory studies 0 Advantages: specialist equipment available uninterrupted environment 0 Disadvantages: lack of context(environment) difficult to observe several users cooperating

Field Studies 0 Advantages: natural environment context retained (though observation may alter it) 0 Disadvantages: distractions noise

Evaluating Implementations

Experimental evaluation controlled evaluation of interactive behaviour evaluator chooses hypothesis(theory) to be tested a number of experimental conditions are considered which differ only in the value of some controlled variable. changes in behavioural measure are attributed to different conditions

Experimental factors Subjects o who representative, sufficient sample Variables o things to modify and measure Hypothesis o what you d like to show Experimental design o how you are going to do it

Variables 1. independent variable (IV) o characteristic changed to produce different conditions o e.g. interface style, number of menu items 2. dependent variable (DV) characteristics measured in the experiment e.g. time taken, number of errors.

Hypothesis (thesis or theory) 0 prediction of outcome 0 framed in terms of IV and DV e.g. error rate will increase as font size decreases 0 null hypothesis: 0 states no difference between conditions e.g. null hyp. = no change with font size

Experimental design 1. Within(inside) groups design o each subject performs experiment under each condition. o less costly and less likely to suffer from user variation. 2. Between groups design o o o each subject performs under only one condition more users required variation can bias results.

Analysis of data 0 Before you start to do any statistics: o look at data o save original data 0 Choice of statistical technique depends on o type of data o information required 0 Type of data o discrete - finite (fixed) number of values o continuous - any value

Analysis - types of test 0 Parametric assume normal distribution powerful 0 Non-parametric do not assume normal distribution less powerful more reliable 0 Likelihood table classify data by discrete attributes count number of data items in each group

Analysis of data (cont.) 0 What information is required? o is there a difference? o how big is the difference? o how accurate is the estimate?

Experimental studies on groups More difficult than single-user experiments Problems with: o subject groups o choice of task o data gathering o analysis

Subject groups larger number of subjects more expensive longer time to `settle down even more variation! difficult to timetable so often only three or four groups

The task 1.Difficult task 2.Medium task 3.Easy task

Data gathering several video cameras + direct logging of application Experimental Laboratory

Field studies Experiments ruled by group formation Field studies more realistic: work studied in context real action is situated action physical and social environment both crucial

Observational Methods

Think Aloud 0 user observed performing task 0 user asked to describe what s/he is doing and why, what s/he thinks is happening etc. 0 Advantages 0 simplicity - requires little expertise 0 can provide useful insight 0 can show how system is actually used 0 Disadvantages 0 Selective (careful) 0 act of describing may alter task performance

Cooperative evaluation 0 variation on think aloud 0 user collaborates in evaluation 0 both user and evaluator can ask each other questions throughout 0 Additional advantages less constrained and easier to use user is encouraged to criticize system explanation possible

Protocol analysis paper and pencil cheap, limited to writing speed audio good for think aloud, difficult to match with other protocols video accurate and realistic, needs special equipment computer logging automatic, large amounts of data difficult to analyze user notebooks coarse and subjective, useful insights, good for longitudinal studies 0 Mixed use in practice. 0 audio/video transcription difficult and requires skill.

automated analysis EVA 0 Workplace project 0 Post task walkthrough 0 user reacts on action after the event 0 Advantages o analyst has time to focus on relevant incidents o avoid unnecessary interruption of task 0 Disadvantages o lack of newness o may be post-hoc interpretation of events

post-task walkthroughs 0 transcript played back to participant for comment immediately fresh in mind delayed evaluator has time to identify questions 0 useful to identify reasons for actions and alternatives considered 0 necessary in cases where think aloud is not possible

Query Techniques

Interviews 0 analyst questions user on one-to -one basis usually based on prepared questions 0 informal, subjective and relatively cheap 0 Advantages o can be varied to suit context o issues can be explored more fully o can elicit user views and identify unanticipated problems 0 Disadvantages o very subjective o time consuming

Questionnaires 0 Set of fixed questions given to users 0 Advantages o quick and reaches large user group o can be analyzed more rigorously 0 Disadvantages o less flexible o less searching

Questionnaires (ctd) 0 Need careful design o what information is required? o how are answers to be analyzed?

Questionnaire 0 Styles of question 1. General - establish background of user 2. Open-ended 0 Can you suggest improvements to interface? 3. Scalar 0 It is easy to recover from mistakes. Disagree 1 2 3 4 5 Agree 4. Multi-choice 0 How do you most often get help with the system? Choose one. online manual contextual help command prompt ask a colleague 5. Ranked place a list of items in order

Physiological methods

eye tracking 0 head or desk mounted equipment tracks the position of the eye 0 eye movement reflects the amount of cognitive processing a display requires 0 measurements include 1. fixations: eye maintains stable position. Number and duration indicate level of difficulty with display 2. saccades: rapid eye movement from one point of interest to another 3. scan paths: moving straight to a target with a short fixation at the target is optimal

physiological measurements 0 emotional response linked to physical changes 0 these may help determine a user s reaction to an interface 0 measurements include: 0 heart activity, including blood pressure, volume and pulse. 0 activity of sweat glands 0 electrical activity in muscle 0 electrical activity in brain 0 some difficulty in interpreting these physiological responses - more research needed

Choosing an Evaluation Method when in process: style of evaluation: how objective: type of measures: level of information: level of interference: resources available: design vs. implementation laboratory vs. field subjective vs. objective qualitative vs. quantitative high level vs. low level obtrusive vs. unobtrusive time, subjects, equipment, expertise