A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

Similar documents
PROFESSIONAL TREATMENT OF TEACHERS AND STUDENT ACADEMIC ACHIEVEMENT. James B. Chapman. Dissertation submitted to the Faculty of the Virginia

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

CHALLENGES FACING DEVELOPMENT OF STRATEGIC PLANS IN PUBLIC SECONDARY SCHOOLS IN MWINGI CENTRAL DISTRICT, KENYA

Lecture 1: Machine Learning Basics

Chapters 1-5 Cumulative Assessment AP Statistics November 2008 Gillespie, Block 4

Detailed course syllabus

Knowledge management styles and performance: a knowledge space model from both theoretical and empirical perspectives

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Universityy. The content of

Analysis of Enzyme Kinetic Data

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

Digital Andrews University. Andrews University. Janine Monica Lim Andrews University

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Probability and Statistics Curriculum Pacing Guide

Predicting the Performance and Success of Construction Management Graduate Students using GRE Scores

APPENDIX A: Process Sigma Table (I)

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

STA 225: Introductory Statistics (CT)

Investment in e- journals, use and research outcomes

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

Python Machine Learning

Multiple regression as a practical tool for teacher preparation program evaluation

Comparison of EM and Two-Step Cluster Method for Mixed Data: An Application

Edexcel GCSE. Statistics 1389 Paper 1H. June Mark Scheme. Statistics Edexcel GCSE

Empowering Students Learning Achievement Through Project-Based Learning As Perceived By Electrical Instructors And Students

Doctor of Public Health (DrPH) Degree Program Curriculum for the 60 Hour DrPH Behavioral Science and Health Education

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Purdue Data Summit Communication of Big Data Analytics. New SAT Predictive Validity Case Study

Understanding and Interpreting the NRC s Data-Based Assessment of Research-Doctorate Programs in the United States (2010)

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

ABILITY SORTING AND THE IMPORTANCE OF COLLEGE QUALITY TO STUDENT ACHIEVEMENT: EVIDENCE FROM COMMUNITY COLLEGES

MASTER OF ARTS IN APPLIED SOCIOLOGY. Thesis Option

A THESIS. By: IRENE BRAINNITA OKTARIN S

Multi-Dimensional, Multi-Level, and Multi-Timepoint Item Response Modeling.

B.S/M.A in Mathematics

School of Innovative Technologies and Engineering

Linking the Ohio State Assessments to NWEA MAP Growth Tests *

Capturing and Organizing Prior Student Learning with the OCW Backpack

School Size and the Quality of Teaching and Learning

Why Did My Detector Do That?!

Cross-Year Stability in Measures of Teachers and Teaching. Heather C. Hill Mark Chin Harvard Graduate School of Education

THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF MATHEMATICS ASSESSING THE EFFECTIVENESS OF MULTIPLE CHOICE MATH TESTS

Assignment 1: Predicting Amazon Review Ratings

Math Placement at Paci c Lutheran University

February Statistics: Multiple Regression in R

GDP Falls as MBA Rises?

Assessing Stages of Team Development in a Summer Enrichment Program

learning collegiate assessment]

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Instrumentation, Control & Automation Staffing. Maintenance Benchmarking Study

A Program Evaluation of Connecticut Project Learning Tree Educator Workshops

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

Working Paper: Do First Impressions Matter? Improvement in Early Career Teacher Effectiveness Allison Atteberry 1, Susanna Loeb 2, James Wyckoff 1

Discovering Statistics

Psychometric Research Brief Office of Shared Accountability

IS FINANCIAL LITERACY IMPROVED BY PARTICIPATING IN A STOCK MARKET GAME?

Evaluation of a College Freshman Diversity Research Program

American Journal of Business Education October 2009 Volume 2, Number 7

A Comparison of Charter Schools and Traditional Public Schools in Idaho

On-the-Fly Customization of Automated Essay Scoring

Wenguang Sun CAREER Award. National Science Foundation

Evaluation of Teach For America:

CONSTRUCTION OF AN ACHIEVEMENT TEST Introduction One of the important duties of a teacher is to observe the student in the classroom, laboratory and

THE INFLUENCE OF COOPERATIVE WRITING TECHNIQUE TO TEACH WRITING SKILL VIEWED FROM STUDENTS CREATIVITY

Participation Rates: Fall 2012

Race, Class, and the Selective College Experience

Comparison of network inference packages and methods for multiple networks inference

Algebra 2- Semester 2 Review

Confirmatory Factor Structure of the Kaufman Assessment Battery for Children Second Edition: Consistency With Cattell-Horn-Carroll Theory

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

PEER EFFECTS IN THE CLASSROOM: LEARNING FROM GENDER AND RACE VARIATION *

Statewide Framework Document for:

University of Groningen. Systemen, planning, netwerken Bosman, Aart

School Competition and Efficiency with Publicly Funded Catholic Schools David Card, Martin D. Dooley, and A. Abigail Payne

CHMB16H3 TECHNIQUES IN ANALYTICAL CHEMISTRY

Grade 6: Correlated to AGS Basic Math Skills

THE IMPACT OF STATE-WIDE NUMERACY TESTING ON THE TEACHING OF MATHEMATICS IN PRIMARY SCHOOLS

Physics 270: Experimental Physics

Access Center Assessment Report

Meet the Preceptors. Jeremy Rose, Pharm.D., BCPS Preceptor for Infectious Diseases, Antimicrobial Stewardship

Does the Difficulty of an Interruption Affect our Ability to Resume?

Reinforcement Learning by Comparing Immediate Reward

Ryerson University Sociology SOC 483: Advanced Research and Statistics

Spanish IV Textbook Correlation Matrices Level IV Standards of Learning Publisher: Pearson Prentice Hall

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

Do First Impressions Matter? Predicting Early Career Teacher Effectiveness

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

MASTER OF PHILOSOPHY IN STATISTICS

SAMPLE AFFILIATION AGREEMENT

Reduce the Failure Rate of the Screwing Process with Six Sigma Approach

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

THEORY OF PLANNED BEHAVIOR MODEL IN ELECTRONIC LEARNING: A PILOT STUDY

Aalya School. Parent Survey Results

Effective Pre-school and Primary Education 3-11 Project (EPPE 3-11)

Abu Dhabi Indian. Parent Survey Results

Abu Dhabi Grammar School - Canada

Research Design & Analysis Made Easy! Brainstorming Worksheet

The Relationship of Grade Span in 9 th Grade to Math Achievement in High School

w o r k i n g p a p e r s

Transcription:

Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2006 A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements Donna S. Kroos Virginia Commonwealth University Follow this and additional works at: http://scholarscompass.vcu.edu/etd Part of the Physical Sciences and Mathematics Commons The Author Downloaded from http://scholarscompass.vcu.edu/etd/1172 This Thesis is brought to you for free and open access by the Graduate School at VCU Scholars Compass. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of VCU Scholars Compass. For more information, please contact libcompass@vcu.edu.

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Mathematical Sciences (Statistics) at Virginia Commonwealth University. Donna S. Kroos BS (Mathematics), Pennsylvania State University, 1980 MBA (Quantitative Decision Making), Rochester Institute of Technology, 1996 Director: Dr. James E. Mays, Associate Professor, Department of Statistical Sciences & Operations Research Virginia Commonwealth University Richmond, Virginia December, 2006

Acknowledgement There are many individuals who have been instrumental in this thesis project and hence deserving of recognition. First, I would like to thank each of my thesis committee members-dr. James E. Mays, Dr. Shelley A. Harris, and Dr. Edward L. Boone. Dr. Mays' statistical knowledge, insight and guidance throughout my research were extremely valuable. I am honored to have him both as a mentor and as a friend. I wish to thank Dr. Harris for granting me the opportunity to analyze the creatinine data from the "Pesticide Dose Monitoring in Turf Applicators" study. Her patience and candor in answering my many questions are greatly appreciated. Dr. Boone's knowledge of mixed model methodologies provided much guidance to my usage of this modeling technique. My appreciation also is extended to the National Institute of Occupational Safety and Health, Centers for Disease Control and Prevention, for the research grant that funded the "Pesticide Dose Monitoring in Turf Applicators" study. I would like to thank Ms. Kristen Wells and Ms. Diane Bishop, Department of Epidemiology and Community Health, Virginia Commonwealth University. Their assistance, in extracting the creatinine and subject data from the pesticide dose study data base, was instrumental to my thesis efforts. Lastly, I want to thank my husband Jim Kroos. I am, and always will be, grateful for his unwavering love and support throughout my academic pursuits and our life together.

Table of Contents List of Tables... vi.. List of Figures... vll... List of Abbreviations... VIII Abstract... x Introduction... 1... Objectives 4 Other Models to Predict 24-Hour Urinary Creatinine... 6 Turner and Cohn... 6 Moriyama et a1... 7 Kawasaki et a1... 9 Jones, Newstead, and Will... 9 Harris et a1... 10 Tanaka et a1... 11 Kamata and Tochikubo... 13 Penie, Porben, and Silverio... 14 Summary... 15... Study Data 17 Data Collection... 17

Data Calculations... 20 i v Participant Enrollments and Descriptive Statistics... 21 Model... 24 Methodology... 24 Covariance Structure Selection... 28 Determination of Model Predictor Factors... 33 Residual Analysis and Influence Diagnostics... 37 Model Validation... 41 Model Comparisons... 45 Conclusions... 1 References...-55 Appendices... 59 Appendix 1 : Imputing of 12-hour Creatinine Values for Follow-on Phase Data...... 59 Appendix 2: Covariance Matrix Structures' Descriptions... 61 Appendix 3: SAS Code for Covariance Matrix Structure Tests... 65 Appendix 4: Covariance Estimates by Type of Covariance Matrix Structure... 68 Appendix 5: Output from Covariance Analysis... 70 Appendix 6: AIC, AICC, BIC Calculations... 80 Appendix 7: Estimation of Fixed and Random Effects in the Mixed Model when using REML... 81

Appendix 8: Profile Plots. Box Plots and Multiple Comparisons by v Location... 82 Appendix 9: Initial Model Output... 86 Appendix 10: Final Model Output... 90 Appendix 11 : SAS Code for Final Model... 94 Appendix 12: Residual Plots for Final Model... 96 Appendix 13: Outliers and High Leverage Observations... 97 Appendix 14: Influence Diagnostics by Observation... 99 Appendix 15: SAS Code for Calculating O.ther Models' Predicted Values... 103 Appendix 16: Comparisons of Other Studies' Participants... 105 Vita... 107

List of Tables Table Page 1. Summary of Other Models used to Predict 24-hour Urinary Creatinine Level... 16 2. Study Enrollments by Location and Phase... 22... 3. Descriptive Statistics for Model Building Data 23... 4. Likelihood Ratio Tests for Covariance Matrix Structures 31 5. Information Criteria Results for Covariance Matrix Structures... 32 6. Descriptive Statistics for Model Validation Data... 42 7. MSPR Results for Models... 49 8. Correlation Coefficients (r,,.,,r,,,,l;,,,d) for Predicted and Actual Creatinine Values by Model... 50 9. Imputing Equations... 60

vii List of Figures Figure Page 1. Lag Plot from Unstructured Covariance Matrix... 29 2. Lag Plot Comparison of Covariance Matrix Structure Approaches... 33 3. Days of Pilot Phase used in Validation Data Set... 42 Predicted versus Actual 24-hour Creatinine Levels... 44 Plot of Creatinine versus Day of Collection for Location 1... 82 Plot of Creatinine versus Day of Collection for Location 2... 83 Plot of Creatinine versus Day of Collection for Location 3... 83 Plot of Creatinine versus Day of Collection for Location 4... 84 Plot of Creatinine versus Day of Collection for Location 5... 84 Box Plot of Creatinine by Location... 85 Residual Plots for Final Model... 96 Plot of Overall Influence by Subject and Day of Collection... 99 Plot of DFFITS by Subject and Day of Collection... 100 Plot of Cook's Distance by Subject and Day of Collection... 101 Plot of COVRATIO by Subject and Day of Collection... 102 Comparison of Studies' Subject Heights... 105 Comparison of Studies' Subject Weights... 106

List of Abbreviations AIC... A k a Information Criterion ANTE....Ante-dependence AR(1)... First Order Autoregressive.. BIC... Schwarz's Bayesian Information Criterion BMI... Body Mass Index BSA... Body Surface Area CAPD... Continuous Ambulatory Peritoneal Dialysis CDC... Centers for Disease Control and Prevention CS... Compound Symmetric EBLUE... Estimated Best Linear Unbiased Estimator EBLUP... Estimated Best Linear Unbiased Predictor GEE... Generalized Estimating Equations LBM... Lean Body Mass ML... Maximum Likelihood MSE... Mean Square Error MSPR... Mean Squared Prediction Error NIOSH... National Institute of Occupational Safety and Health

REML... Restricted Maximum Likelihood RLD... Restricted Likelihood Distance SD... Standard Deviation STL... Scientific Testing Labs TOEP....Toepli tz UN... Unstructured VCU... Virginia Commonwealth University

Abstract A MODEL TO PREDICT 24-HOUR URINARY CREATININE LEVEL USING REPEATED MEASUREMENTS By Donna S. Kroos A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science at Virginia Commonwealth University. Virginia Commonwealth University, 2006 Thesis Director: Dr. James E. Mays, Associate Professor, Department of Statistical Sciences & Operations Research Creatinine is a metabolic waste product, removed from the blood by the kidneys, and excreted in the urine. The measurement of creatinine is used in the assessment and monitoring of many medical conditions as well as in the determination or adjustment of absorbed dosage of pesticides. Earlier models to predict 24-hour urinary creatinine used ordinary least squares regression and assumed that the subjects' observations were uncorrelated. However, many of these studies had repeated creatinine measurements for each of their subjects. Repeated measures on the same subject frequently are correlated. Using data from the NIOSH-CDC "Pesticide Dose Monitoring in Turf Applicators" study, this thesis project built a model to predict 24-hour urinary creatinine using the Mixed Model methodology. A covariance structure, that permitted multiple observations for any one individual to be correlated, was identified and utilized. The predictive capabilities of this model were then compared to the earlier models investigated. X