FIE - Foundations of Statistical Inference

Similar documents
STA 225: Introductory Statistics (CT)

Probability and Statistics Curriculum Pacing Guide

Lecture 1: Machine Learning Basics

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

12- A whirlwind tour of statistics

Theory of Probability

Lahore University of Management Sciences. FINN 321 Econometrics Fall Semester 2017

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

CS/SE 3341 Spring 2012

Research Design & Analysis Made Easy! Brainstorming Worksheet

School Size and the Quality of Teaching and Learning

EDCI 699 Statistics: Content, Process, Application COURSE SYLLABUS: SPRING 2016

San José State University Department of Marketing and Decision Sciences BUS 90-06/ Business Statistics Spring 2017 January 26 to May 16, 2017

Algebra 1, Quarter 3, Unit 3.1. Line of Best Fit. Overview

A. What is research? B. Types of research

MASTER OF PHILOSOPHY IN STATISTICS

MGT/MGP/MGB 261: Investment Analysis

Mathematics subject curriculum

EGRHS Course Fair. Science & Math AP & IB Courses

STA2023 Introduction to Statistics (Hybrid) Spring 2013

The Effect of Written Corrective Feedback on the Accuracy of English Article Usage in L2 Writing

Hierarchical Linear Modeling with Maximum Likelihood, Restricted Maximum Likelihood, and Fully Bayesian Estimation

Introduction to the Practice of Statistics

PHD COURSE INTERMEDIATE STATISTICS USING SPSS, 2018

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

Technical Manual Supplement

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

Certified Six Sigma Professionals International Certification Courses in Six Sigma Green Belt

Management of time resources for learning through individual study in higher education

Office Hours: Mon & Fri 10:00-12:00. Course Description

Mathematics (JUN14MS0401) General Certificate of Education Advanced Level Examination June Unit Statistics TOTAL.

PREDISPOSING FACTORS TOWARDS EXAMINATION MALPRACTICE AMONG STUDENTS IN LAGOS UNIVERSITIES: IMPLICATIONS FOR COUNSELLING

Quantitative analysis with statistics (and ponies) (Some slides, pony-based examples from Blase Ur)

Ryerson University Sociology SOC 483: Advanced Research and Statistics

The Good Judgment Project: A large scale test of different methods of combining expert predictions

Individual Differences & Item Effects: How to test them, & how to test them well

Analysis of Enzyme Kinetic Data

Evaluation of Teach For America:

Ph.D. in Behavior Analysis Ph.d. i atferdsanalyse

NIH Public Access Author Manuscript J Prim Prev. Author manuscript; available in PMC 2009 December 14.

An overview of risk-adjusted charts

An Empirical Analysis of the Effects of Mexican American Studies Participation on Student Achievement within Tucson Unified School District

Peer Influence on Academic Achievement: Mean, Variance, and Network Effects under School Choice

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

JONATHAN H. WRIGHT Department of Economics, Johns Hopkins University, 3400 N. Charles St., Baltimore MD (410)

A Stochastic Model for the Vocabulary Explosion

Mandarin Lexical Tone Recognition: The Gating Paradigm

School of Innovative Technologies and Engineering

On-the-Fly Customization of Automated Essay Scoring

CHAPTER 4: REIMBURSEMENT STRATEGIES 24

Detailed course syllabus

Lecture 15: Test Procedure in Engineering Design

The influence of parental background on students academic performance in physics in WASSCE

The Impact of Mobile Telecommunication Services on Students Lives: Findings from a Comparative Study in South Africa and Nigeria

Reflective Teaching KATE WRIGHT ASSOCIATE PROFESSOR, SCHOOL OF LIFE SCIENCES, COLLEGE OF SCIENCE

Instructor: Matthew Wickes Kilgore Office: ES 310

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

A Program Evaluation of Connecticut Project Learning Tree Educator Workshops

Statewide Framework Document for:

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Self Study Report Computer Science

The Impact of Formative Assessment and Remedial Teaching on EFL Learners Listening Comprehension N A H I D Z A R E I N A S TA R A N YA S A M I

Stopping rules for sequential trials in high-dimensional data

STUDENT SATISFACTION IN PROFESSIONAL EDUCATION IN GWALIOR

Evidence for Reliability, Validity and Learning Effectiveness

THE INFLUENCE OF COOPERATIVE WRITING TECHNIQUE TO TEACH WRITING SKILL VIEWED FROM STUDENTS CREATIVITY

Grade 6: Correlated to AGS Basic Math Skills

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

On the Distribution of Worker Productivity: The Case of Teacher Effectiveness and Student Achievement. Dan Goldhaber Richard Startz * August 2016

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

Development of Multistage Tests based on Teacher Ratings

Introduction to Simulation

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

GDP Falls as MBA Rises?

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Learning From the Past with Experiment Databases

MTH 141 Calculus 1 Syllabus Spring 2017

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

Abstractions and the Brain

Classifying combinations: Do students distinguish between different types of combination problems?

AUTOMATED TROUBLESHOOTING OF MOBILE NETWORKS USING BAYESIAN NETWORKS

Introduction to Causal Inference. Problem Set 1. Required Problems

Multi-Dimensional, Multi-Level, and Multi-Timepoint Item Response Modeling.

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

An Introduction to Simio for Beginners

Systematic reviews in theory and practice for library and information studies

Learning Disability Functional Capacity Evaluation. Dear Doctor,

Knowledge management styles and performance: a knowledge space model from both theoretical and empirical perspectives

Answer Key Applied Calculus 4

Tun your everyday simulation activity into research

Python Machine Learning

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

The Search for Strategies to Prevent Persistent Misconceptions

SAT MATH PREP:

ENME 605 Advanced Control Systems, Fall 2015 Department of Mechanical Engineering

Conceptual and Procedural Knowledge of a Mathematics Problem: Their Measurement and Their Causal Interrelations

STAT 220 Midterm Exam, Friday, Feb. 24

Transcription:

Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2017 200 - FME - School of Mathematics and Statistics 715 - EIO - Department of Statistics and Operations Research 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND OPERATIONS RESEARCH (Syllabus 2013). (Teaching unit Optional) 5 Teaching languages: Spanish Teaching staff Coordinator: Others: ANTONIO MIÑARRO ALONSO Primer quadrimestre: ANTONIO MIÑARRO ALONSO - A LOURDES RODERO DE LAMO - A Prior skills The MESIO UPC-UB includes two compulsory subjects: Advanced Statistical Inference and Foundations of Statistical Inference. Advanced Statistical Inference is mandatory for all graduate students in statistics or mathematics (path 1) and Foundations of Statistical Inference is compulsory for all students from other degrees (path 2). Students from path 2 can choose Advanced Statistical Inference as optional. Students from path 1 can not choose Foundations of Statistical Inference. The course assumes a basic knowledge of the concepts of probability theory. The student should know and work with major discrete and continuous probability models: Poisson, Binomial, Exponential, Uniform, Normal. In particular the student should be able to use the cumulative distribution functions and density functions or probability mass, for calculating probabilities and population parameters of the main distributions. It is also assumed the skill to work with the expectation and variance of random variables. Finally, it is important to know and understand the implications of the central limit theorem. You can consult the following material: Statmedia free version: http://www.ub.edu/stat/grupsinnovacio/statmedia/demo/statmedia.htm Probabilidad y estadística de Evans, Michael J. (2005) Michael J. Evans (Autor) y Jeffrey Rosenthal Edit. Reverter http://www.reverte.com/motor?id_pagina=catalogo/ficha&idcategoria=6&idsubcategoria=47&idlibro=664 Morris H. DeGroot and Mark J. Schervish Probability and Statistics (4th Edition) Addison-Wesley (2010) ISBN 0-321-50046-6 http://www.pearsonhighered.com/pearsonhigheredus/educator/product/products_detail.page?isbn=0201524880 Degree competences to which the subject contributes Specific: 3. CE-4. Ability to use different inference procedures to answer questions, identifying the properties of different 1 / 8

estimation methods and their advantages and disadvantages, tailored to a specific situation and a specific context. 4. CE-6. Ability to use appropriate software to perform the necessary calculations in solving a problem. Transversal: 1. TEAMWORK: Being able to work in an interdisciplinary team, whether as a member or as a leader, with the aim of contributing to projects pragmatically and responsibly and making commitments in view of the resources that are available. 2. FOREIGN LANGUAGE: Achieving a level of spoken and written proficiency in a foreign language, preferably English, that meets the needs of the profession and the labour market. Teaching methodology Theory sessions The teacher explains the contents of the course with the help of computer presentations. Student participation will be encouraged through some questions and examples. Problem sessions By the end of each issue a session specially devoted to problems will take place. The list of problems will be available in advance on the intranet. Students should come to class with doubts related to the proposed problems in order to be solved by the teacher. Statistical laboratory Several statistical analyses will be carry out with the help of some scripts of R. Students will be proposed to solve several more extensive exercises with the help of the software. Learning objectives of the subject Students should achieve a good knowledge of the common language of statistical inference with both a theoretical and a practical basis. Students not only should to be able to use most of the statistical techniques but also they have to be able to learn new methodologies. Students should be able to use software R as a tool for the inferential process. As specific goals we have the following: Students should know the main sample techniques and the main sample distributions based on normal law and its use in statistical inference. Students should be able to apply some of the usual methods of estimation. Students should know the desirable properties of an estimator and verify if they are achieved by a given statistic. Students should understand the concept of confidence of an interval. They have to be able to construct the most usual intervals and compute the necessary sample size to achieve a given confidence and precision. Students should understand the methodology underlying the testing of hypotheses including the types of errors and the importance of sample size to make decisions with a good statistical basis. Students should be able to obtain estimates from a linear regression model and verify the validity of the assumptions of the model in order to discuss the results of a regression study. Students should understand the linear model of analysis of variance together with the sum of squares variance decomposition and solve the one-way model and the two-way model both with fix and random factors. 2 / 8

Study load Total learning time: 125h Hours large group: 30h 24.00% Hours medium group: 0h 0.00% Hours small group: 15h 12.00% Guided activities: 0h 0.00% Self study: 80h 64.00% 3 / 8

Content 1. Introduction to inference Learning time: 0h 30m Theory classes: 0h 30m Basic ideas of Statistical Inference. Theory sessions. Basic introduction to the main concepts of statistical inference and review of the necessary ideas of the Theory of Probability 2. Sampling Learning time: 2h 30m Theory classes: 2h 30m 2.1. Definition 2.2. Sampling methods 2.3. Random sampling 2.4. Sampling distributions 2.4.1. Exact and asymptotic sampling distributions 2.4.2. The distribution in sampling from a Normal Population 2.4.3. Distributions arising from Normal sampling 2.5. Simulating random samples Theory sessions. Problem sessions. Students should know the main sample techniques and the main sample distributions based on normal law and its use in statistical inference. 4 / 8

3. Parameter estimation Learning time: 6h Theory classes: 6h 3.1. Introduction, concept of estimator, point and confidence estimation. 3.2. Properties of point estimates: consistency, bias, efficiency, minimal variance, sufficiency, mean square error. 3.3. Methods to obtain estimates: moments, maximum likelihood, least squares, Bayes 3.4. Resampling methods: Bootstrap, Jacknife Theory sessions. Problem sessions Students should be able to apply some of the usual methods of estimation. Students should know the desirable properties of an estimator and verify if they are achieved by a given statistic. 4. Confidence Intervals Learning time: 4h 30m Theory classes: 4h 30m 4.1. Definition 4.2. Construction of intervals 4.3. Confidence level and sample size 4.4. Some confidence intervals 4.5. Asymptotic confidence intervals Theory sessions. Problem sessions. Statistical laboratory. Students should understand the concept of confidence of an interval. They have to be able to construct the most usual intervals and compute the necessary sample size to achieve a given confidence and precision. 5 / 8

5. Hypotheses testing Learning time: 12h Theory classes: 12h 5.1. Fundamental notions of hypotheses testing 5.1.1. From language to parametrical hypotheses 5.1.2. Null and alternative hypotheses 5.1.3. Decision rule: Critical region 5.2. Errors in hypotheses testing 5.2.1. Type I error: level of significance 5.2.2. Type II error: power of the test 5.2.3. Sample size 5.3. P-values 5.4. Some hypotheses tests 5.4.1. Likelihood ratio tests 5.4.2. Tests for normal populations 5.4.3. Tests on proportions 5.4.4. Chi-squared tests 5.4.5. Robust tests: tests based on ranks and permutation tests 5.5. Relation between confidence estimation and hypotheses testing 5.6. Multiple testing 5.7. Combining results from different tests 5.8. Bayesian hypothesis testing Theory sessions. Problem sessions. Statistical laboratory. Students should understand the methodology underlying the testing of hypotheses including the types of errors and the importance of sample size to make decisions with a good statistical basis. 6 / 8

6. The general linear model Learning time: 9h Theory classes: 9h 6.1. Introduction 6.2. Parameter estimation and hypotheses testing 6.3. Simple linear regression 6.3.1. Parameter estimation 6.3.2. Regression diagnostic 6.3.3. Hypotheses in regression 6.3.4. Model comparisons 6.3.5. Relationship between regression and correlation 6.3.6. Smoothing 6.4. Multiple regression 6.4.1. Parameter estimation 6.4.2. Regression diagnostic 6.4.3. Inference in multiple regression 6.4.4. Collinearity Theory sessions. Problem sessions. Students should be able to obtain estimates from a linear regression model and verify the validity of the assumptions of the model in order to discuss the results of a regression study. 7 / 8

7. ANOVA models Learning time: 10h 30m Theory classes: 10h 30m 7.1. One-way ANOVA 7.1.1. Linear model for one-way ANOVA 7.1.2. Null hypotheses 7.1.3. Factor effects 7.1.4. ANOVA diagnostics 7.1.5. Multiple comparison of means 7.2. Two-way ANOVA 7.2.1. Randomized blocks design 7.2.2. Two fixed factors ANOVA 7.2.3. Interpreting interactions 7.2.4. Two random factors ANOVA 7.2.5. Mixed effects model Theory sessions. Problem sessions. Statistical laboratory. Students should understand the linear model of analysis of variance together with the sum of squares variance decomposition and solve the one-way model and the two-way model both with fix and random factors. Qualification system Throughout the course students will be proposed to solve 3 small quizzes (CUEST). They will also be proposed to solve take-home exercises and deliver it within a specified period as discussed in the section on practical laboratory in teaching methodology (EJER). A final exam (EF) will take place on the date specified by the master direction. The grade of the course will be obtained as N =0.2 *CUEST+0.20*EJER+ 0.6*EF. Bibliography Basic: Casella, G.; Berger, Roger L. Statistical inference. 2nd ed. Duxbury: Pacific Grove, 2002. Rohatgi, Vijay K. Statistical Inference. New York: John Wiley & Sons, 1984. Sánchez, P., Baraza, X., Reverter, F. y Vegas, E. Métodos Estadísticos Aplicados. Texto docente 311. Barcelona: UB, 2006. Peña, Daniel. Estadística. Modelos y Métodos. 2 vols. 2ª ed. rev. Madrid: Alianza Universidad Textos, 1986-1991. DeGroot, Morris; Schervish, Mark. Probability and statistics. 4th ed. Pearson, 2012. ISBN 0321500466. Evans, Michael; Rosenthal, Jeffrey S. Probability and statistics : the science of uncertainty. 2nd ed. New York: W.H. Freeman and Company, cop. 2010. ISBN 1-4292-2462-2. De Groot, Morris H; Schervish, Mark J. Probability and statistics. 3rd. ed. Boston [etc.]: Addison-Wesley, cop. 2002. ISBN 0201524880. 8 / 8