SPPH 501 Analysis of Longitudinal & Correlated Data September, 2016

Similar documents
ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

MGT/MGP/MGB 261: Investment Analysis

MTH 215: Introduction to Linear Algebra

GEOG 473/573: Intermediate Geographic Information Systems Department of Geography Minnesota State University, Mankato

Multi-Dimensional, Multi-Level, and Multi-Timepoint Item Response Modeling.

Strategic Management (MBA 800-AE) Fall 2010

STA 225: Introductory Statistics (CT)

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

Introduction to Simulation

Design and Creation of Games GAME

COURSE SYNOPSIS COURSE OBJECTIVES. UNIVERSITI SAINS MALAYSIA School of Management

CS/SE 3341 Spring 2012

ECON 6901 Research Methods for Economists I Spring 2017

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

Ryerson University Sociology SOC 483: Advanced Research and Statistics

Math 181, Calculus I

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

BUS Computer Concepts and Applications for Business Fall 2012

CIS Introduction to Digital Forensics 12:30pm--1:50pm, Tuesday/Thursday, SERC 206, Fall 2015

TU-E2090 Research Assignment in Operations Management and Services

A Model to Predict 24-Hour Urinary Creatinine Level Using Repeated Measurements

CS Machine Learning

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

SPM 5309: SPORT MARKETING Fall 2017 (SEC. 8695; 3 credits)

ATW 202. Business Research Methods


Room: Office Hours: T 9:00-12:00. Seminar: Comparative Qualitative and Mixed Methods

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

Theory of Probability

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Office Hours: Mon & Fri 10:00-12:00. Course Description

Python Machine Learning

Office Hours: Day Time Location TR 12:00pm - 2:00pm Main Campus Carl DeSantis Building 5136

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

CTE Teacher Preparation Class Schedule Career and Technical Education Business and Industry Route Teacher Preparation Program

Course Prerequisite: CE 2407 Adobe Illustrator or equivalent experience

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Northeastern University Online Course Syllabus

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

ACC 362 Course Syllabus

CHEMISTRY 104 FALL Lecture 1: TR 9:30-10:45 a.m. in Chem 1351 Lecture 2: TR 1:00-2:15 p.m. in Chem 1361

PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

CS4491/CS 7265 BIG DATA ANALYTICS INTRODUCTION TO THE COURSE. Mingon Kang, PhD Computer Science, Kennesaw State University

Graduate Program in Education

Probability and Statistics Curriculum Pacing Guide

Gridlocked: The impact of adapting survey grids for smartphones. Ashley Richards 1, Rebecca Powell 1, Joe Murphy 1, Shengchao Yu 2, Mai Nguyen 1

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

BAYESIAN ANALYSIS OF INTERLEAVED LEARNING AND RESPONSE BIAS IN BEHAVIORAL EXPERIMENTS

Mastering Biology Test Answers

Lecture 1: Machine Learning Basics

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

ACC 380K.4 Course Syllabus

Chemistry Senior Seminar - Spring 2016

EGRHS Course Fair. Science & Math AP & IB Courses

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

Hierarchical Linear Models I: Introduction ICPSR 2015

Detailed course syllabus

ECE (Fall 2009) Computer Networking Laboratory

CEE 2050: Introduction to Green Engineering

LEAD 612 Advanced Qualitative Research Fall 2015 Dr. Lea Hubbard Camino Hall 101A

Summarize The Main Ideas In Nonfiction Text

Jeffrey Church and Roger Ware, Industrial Organization: A Strategic Approach, edition 1. It is available for free in PDF format.

MGMT3274 INTERNATONAL BUSINESS PROCESSES AND PROBLEMS

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

University of Cincinnati College of Medicine. DECISION ANALYSIS AND COST-EFFECTIVENESS BE-7068C: Spring 2016

Answer Key Applied Calculus 4

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)

Evaluation of ecodriving performances and teaching method: comparing training and simple advice

Mktg 315 Marketing Research Spring 2015 Sec. 003 W 6:00-8:45 p.m. MBEB 1110

Required Materials: The Elements of Design, Third Edition; Poppy Evans & Mark A. Thomas; ISBN GB+ flash/jump drive

MATH 1A: Calculus I Sec 01 Winter 2017 Room E31 MTWThF 8:30-9:20AM

George Mason University Graduate School of Education Education Leadership Program. Course Syllabus Spring 2006

Chapter 10 APPLYING TOPIC MODELING TO FORENSIC DATA. 1. Introduction. Alta de Waal, Jacobus Venter and Etienne Barnard

BUSI 2504 Business Finance I Spring 2014, Section A

Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Redirected Inbound Call Sampling An Example of Fit for Purpose Non-probability Sample Design

Department of Anthropology ANTH 1027A/001: Introduction to Linguistics Dr. Olga Kharytonava Course Outline Fall 2017

Firms and Markets Saturdays Summer I 2014

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

Lecture 1: Basic Concepts of Machine Learning

White Paper. The Art of Learning

MANA 7A97 - STRESS AND WORK. Fall 2016: 6:00-9:00pm Th. 113 Melcher Hall

BUSINESS FINANCE 4265 Financial Institutions

CRIME PREVENTION (CRIM 4040) Fall 2016

Business 712 Managerial Negotiations Fall 2011 Course Outline. Human Resources and Management Area DeGroote School of Business McMaster University

Clerical Skills Level I

Class Tuesdays & Thursdays 12:30-1:45 pm Friday 107. Office Tuesdays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

RETURNING TEACHER REQUIRED TRAINING MODULE YE TRANSCRIPT

Strategy and Design of ICT Services

4. Long title: Emerging Technologies for Gaming, Animation, and Simulation

Please read this entire syllabus, keep it as reference and is subject to change by the instructor.

Transcription:

SPPH 501 Analysis of Longitudinal & Correlated Data September, 2016 TIME & PLACE: Term 1, Tuesday, 1:30-4:30 P.M. LOCATION: SPPH, Room 143 INSTRUCTOR: OFFICE: Dr. Ying C MacNab SPPH, Room 134b TELEPHONE: (604) 822-5593 EMAIL: ying.macnab@ubc.ca OFFICE HOURS: By appointment TEACHING ASSISTANT: TBD COMPUTER LABS: Students in 501 receive priority to use the School computing lab on Tuesdays, Noon. - 1:30 P.M. Students are welcome to use their own rather than the lab computers. COURSE PHILOSOPHY AND OBJECTIVES: This course will introduce students to concepts and methods in the analysis of correlated data, with special emphasis on longitudinal and hierarchical data. By the end of the course students are expected to: 1. Recognize the types of study/sampling designs that give rise to correlated data. 2. Translate conceptual models relating health outcomes and their determinants into statistical models. 3. Have basic knowledge of generalized linear mixed models, Bayesian hierarchical models, and related methods of inference

4. Identify different analytic approaches to longitudinal/correlated data analysis and explain the advantages and disadvantages of each approach. PREREQUISITES: SPPH 400 and 500, 5002 or their equivalents. COURSE REFERENCES: The primary reference used in this course will be 1. Gelman A, and Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. 2007. Cambridge. though some material will also be taken from: 1. Fitzmaurice GM, Laird NM, and Ware JH. 2011. Applied Longitudinal Analysis (2nd Edition). Wiley. 2. Diggle PJ, Heagerty P, Liang KY, and Zeger SL. 2002. Analysis of Longitudinal Data (2nd Edition). Oxford University Press. Computing: 1. The Gelman text provides some examples in R. Additional materials will be distributed during the course. 2. WinBUGs manual (http://www.mrc-bsu.cam.ac.uk/bugs) Additional: 1. Goldstein H. 2003. Multilevel Statistical Models (3rd Edition). Arnold. (A pdf of an older version, which suffices for this course, is available for download from http://www.ats.ucla.edu/stat/examples/msm_goldstein/default.htm.). 2. Pinheiro JC, and Bates DM. 2000. Mixed Effects Models in S and S-PLUS. Springer. (As indicated by the title, this text uses S-PLUS as the computing environment. While many of the basic functions and commands in R and S-PLUS are identical, this is not the case for mixed effects models so you should not purchase this text for learning the computing. The book is listed here because although it is a text on computing, the examples used also illustrate many of the important concepts and theory important underlying mixed effects models.) COURSE NOTES: Course slides will be available on selected topics. Note that these are summary slides, not detailed content. Students are expected to have read the slides and related material prior to the class. In-class time will usually alternate between short presentations of basic ideas by the instructor and class discussion of these ideas.

STATISTICAL COMPUTING: Statistical analysis in this course will be illustrated using R and WinBUGs. R is a free software environment for statistical computing and can be downloaded through the R Project homepage: www.r-project.org. WinBUGs is a free statistical software for Bayesian analysis using Markov chain Monte Carlo (simulation) methods and can be downloaded at http://www.mrc-bsu.cam.ac.uk/bugs. Students are welcome to use other software packages (SAS, Stata, etc) instead of R to complete the assignments (limited support from the instructor and TA). COURSE EVALUATION: The course is graded on a Pass/Fail basis (68% required for a Pass). The components that go into the final grade are: Assignments: 30% Final Project: 50% (30% report, 20% class presentation) Class Participation: 20% ASSIGNMENTS: Assignments will be distributed periodically. The main aim of the assignments is to help students become comfortable with translating conceptual models into mathematical models. Late assignments are not accepted. Assignments should be typed or neatly written. Mathematical notation and language should be used very carefully. Marks will be deducted for improper use of notation/jargon, particularly in cases where it renders the answer vague. Students working with the software are encouraged to consult with each other about how to get things done but all written work must be completed individually. Present only those parts of the computer output that are relevant to your answer and highlight or underline the specific items of interest. Alternatively, transcribe those items to another page if you prefer. FINAL PROJECT: The due date for the write-up of the final project is December 6. There is considerable flexibility in the scope of the final project and students are

encouraged to propose and develop a project in consultation with the instructor. Examples of projects include: 1. Critiquing of longitudinal analyses used in published studies: Identify the type of design and the models used. How appropriate were the models to the questions of interest? What were the limitations of the design and analysis? What might have been done or could be done in the future to better address the questions? 2. Design of a longitudinal study: Identify a question which requires longitudinal data to answer. Discuss possible ways (and the advantages/disadvantages of each) of collecting data to answer the question. Propose specific models for analyzing the data with discussion of how each model is related to and accounts for various features of the data. 3. Analysis of a dataset: Starting with a given dataset and various questions of interest, perform the necessary analyses (graphical, modeling, diagnostics, interpretation) to address those questions. 4. Special topics: Develop a "mini-lecture" that enhances material on a topic (e.g. missing data, transition models, causal models, etc) that is not discussed in-depth in class. 5. Other software packages: Develop a "tutorial" to show how the analyses illustrated in class using R would be done using another software package. Compare and contrast the capabilities/limitations of the two packages. EXAMS: There are no exams for this course. COURSE OUTLINE: The following schedule is tentative. Changes may be made to accommodate the needs and interests of the students. Class #1 (Tuesday, Sep 6): Introduction and preliminaries. Online search for (preferably most recent) public health research papers that involve analysis of correlated data (longitudinal, multilevel, spatial, etc). Class #2 (Tuesday, Sep 13): Review of distributions, probability, conditional probability, expected value, correlation. Review of sampling designs and data structures. Sources of correlated data (hierarchical, longitudinal, spatial). Implications for statistical inference of various types (association, causality, prediction). Principles of regression modeling. Examples. Readings: Gelman, Ch 1 4, 9 Class #3 (Tuesday, Sep 20): Grouped data. Stratification and clustering. Levels of analysis. Fixed and random effects. Analysis of studies with observations at two time-points. Exploring and summarizing

grouped data. Readings: Gelman, Ch 11. Fitzmaurice, Ch 1 & 2. Diggle, Ch 1. Class #4 (Tuesday, Sep 27): Assignment #1 due Matrices. Correlation structures. Design and sample size considerations. Analytic approaches (marginal models, mixed effects models, Bayesian hierarchical models. Readings: Gelman, Ch 20. Fitzmaurice, Sec 7.1-7.7, Ch 3. Diggle, Ch 2 & 3. Class #5 (Tuesday, Oct 4): Linear mixed effects models. Modeling of mean and variance structures. ML and REML estimation. Matrix representation. Readings: Gelman, Ch 12 & 13. Fitzmaurice, Ch 8. Diggle, Ch 4 & 5. Class #6 (Tuesday, Oct 11): Outline of proposed term project due. Multilevel (linear) models: mixed effects model expression, Readings: Gelman, Ch 12 & 13. Fitzmaurice, Ch 8. Diggle, Ch 4 & 5. Class #7 (Tuesday, Oct 18): Review of generalized linear models (GLMs). Generalized linear mixed effects models (GLMMs), MLE-EM, PQL and MCMC estimation. Readings: Gelman, Ch 5, 6, 14 & 15. Fitzmaurice, Ch 10 & 12. Diggle, Ch 9. WinBUGs Examples Vol I Class #8 (Tuesday, Oct 24): Assignment #2 due. Midterm tutorial/q&a led by the T.A. (The instructor will be absent.) Class #9 (Tuesday, Nov 01): GLMM and Bayesian GLMM for multilevel data Readings: WinBUGs Examples Vol I, reading materials to be distributed Class #10 (Tuesday, Nov 08): Draft of projects due for distribution to class for background/review send directly to class email list. Bayesian disease mapping Readings: GeoBUGs manual, reading materials to be distributed Class #11 (Tuesday, Nov 15): Additional topics (continued) as time allows. Student presentations.

Class #12 (Tuesday, Nov 22): Student presentations. Class #13 (Tuesday, Dec 29): Student presentations.