Data Intensive Analysis

Similar documents
Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

Python Machine Learning

Navigating the PhD Options in CMS

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

1. Programme title and designation International Management N/A

CS Machine Learning

Programme Specification. MSc in International Real Estate

Programme Specification

PROGRAMME SPECIFICATION: MSc International Management (12 month)

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

SAT & ACT PREP. Evening classes at GBS - open to all Juniors!

STUDENT INFORMATION GUIDE MASTER'S DEGREE PROGRAMME ENERGY AND ENVIRONMENTAL SCIENCES (EES) 2016/2017. Faculty of Mathematics and Natural Sciences

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

MSc Education and Training for Development

Theory of Probability

Programme Specification

value equivalent 6. Attendance Full-time Part-time Distance learning Mode of attendance 5 days pw n/a n/a

Introduction to Simulation

Abstractions and the Brain

THE UNIVERSITY OF SYDNEY Semester 2, Information Sheet for MATH2068/2988 Number Theory and Cryptography

Biomedical Sciences (BC98)

Evolution of Symbolisation in Chimpanzees and Neural Nets

Doctor in Engineering (EngD) Additional Regulations

Foothill College Summer 2016

Computational Data Analysis Techniques In Economics And Finance

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Level 6. Higher Education Funding Council for England (HEFCE) Fee for 2017/18 is 9,250*

VOL. 3, NO. 5, May 2012 ISSN Journal of Emerging Trends in Computing and Information Sciences CIS Journal. All rights reserved.

PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)

Henley Business School at Univ of Reading

Identifying Novice Difficulties in Object Oriented Design

BSc Food Marketing and Business Economics with Industrial Training For students entering Part 1 in 2015/6

Ryerson University Sociology SOC 483: Advanced Research and Statistics

DOCTORAL SCHOOL TRAINING AND DEVELOPMENT PROGRAMME

A Case Study: News Classification Based on Term Frequency

Bachelor Programme Structure Max Weber Institute for Sociology, University of Heidelberg

PROGRAMME SPECIFICATION

Sociology 521: Social Statistics and Quantitative Methods I Spring 2013 Mondays 2 5pm Kap 305 Computer Lab. Course Website

2013 District STAR Coordinator Workshop

FINANCE 3320 Financial Management Syllabus May-Term 2016 *

Programme Specification. MSc in Palliative Care: Global Perspectives (Distance Learning) Valid from: September 2012 Faculty of Health & Life Sciences

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

ENEE 302h: Digital Electronics, Fall 2005 Prof. Bruce Jacob

March. July. July. September

Ericsson Wallet Platform (EWP) 3.0 Training Programs. Catalog of Course Descriptions

NATIONAL UNIVERSITIES COMMISSION SOCIAL SCIENCES

Note: Principal version Modification Amendment Modification Amendment Modification Complete version from 1 October 2014

PGCE Trainees' Handbook (With Post-16 Enhancement)

Followed by a 30 minute session for those interested in school social work placements and specialization

Spring 2014 SYLLABUS Michigan State University STT 430: Probability and Statistics for Engineering

Casual and Temporary Teacher Programs

Self Study Report Computer Science

Master's degree students

Educational Leadership and Policy Studies Doctoral Programs (Ed.D. and Ph.D.)

UNIVERSITY OF DAR-ES-SALAAM OFFICE OF VICE CHANCELLOR-ACADEMIC DIRECTORATE OF POSTGRADUATE STUDIUES

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

JANE ADDAMS COLLEGE REGISTRATION PACKET: SUMMER/FALL 2017

M.SC. BIOSTATISTICS PROGRAMME ( ) The Maharaja Sayajirao University of Baroda

Sociology 521: Social Statistics and Quantitative Methods I Spring Wed. 2 5, Kap 305 Computer Lab. Course Website

University of Groningen. Systemen, planning, netwerken Bosman, Aart


M.S. in Environmental Science Graduate Program Handbook. Department of Biology, Geology, and Environmental Science

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Programme Specification

The Program. Hands-on Workshop in Computational Biophysics. Prof. Klaus Schulten. Prof. Emad Tajkhorshid

PROGRAMME SPECIFICATION

The Keele University Skills Portfolio Personal Tutor Guide

Major Milestones, Team Activities, and Individual Deliverables

Programme Specification (Postgraduate) Date amended: 25 Feb 2016

TREATMENT OF SMC COURSEWORK FOR STUDENTS WITHOUT AN ASSOCIATE OF ARTS

Is there a limit to how often I can attempt the Irish Language Requirement by this method?

University of Cambridge: Programme Specifications POSTGRADUATE ADVANCED CERTIFICATE IN EDUCATIONAL STUDIES. June 2012

Detailed course syllabus

ME 4495 Computational Heat Transfer and Fluid Flow M,W 4:00 5:15 (Eng 177)

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Exploring the Development of Students Generic Skills Development in Higher Education Using A Web-based Learning Environment

Submission of a Doctoral Thesis as a Series of Publications

Time series prediction

BSc (Hons) Property Development

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

Diploma in Library and Information Science (Part-Time) - SH220

Programme Specification. BSc (Hons) RURAL LAND MANAGEMENT

PROGRAM AND EXAMINATION REGULATIONS FOR THE MASTER S PROGRAM IN INDUSTRIAL AND APPLIED MATHEMATICS

Faculty of Civil and Industrial Engineering ACADEMIC YEAR 2017/2018

A student diagnosing and evaluation system for laboratory-based academic exercises

Research computing Results

MASTERS IN GEOGRAPHY

TIMSS ADVANCED 2015 USER GUIDE FOR THE INTERNATIONAL DATABASE. Pierre Foy

Studies Arts, Humanities and Social Science Faculty

Computerized Adaptive Psychological Testing A Personalisation Perspective

Lecture 1: Machine Learning Basics

Graduate Program in Education

Graduate Handbook Linguistics Program For Students Admitted Prior to Academic Year Academic year Last Revised March 16, 2015

MGT/MGP/MGB 261: Investment Analysis

Integrating simulation into the engineering curriculum: a case study

Developing Students Research Proposal Design through Group Investigation Method

Programme Specification

Examination Timetables Series to Series

Transcription:

Data Intensive Analysis Programme Requirements: Mathematics & Statistics - Data Intensive Analysis - 2018/9 - August 2018 Data-Intensive Analysis - MSc MT4113 (15 credits) and MT5761 (15 credits) and MT5762 (15 credits) and MT5763 (15 credits) and MT5764 (15 credits) and ID5059 (15 credits) and Between 0 and 30 credits from Module List: CS5001 - CS5003, CS5044, CS5052 and (CS5099 (60 credits) or MT5099 (60 credits)) Compulsory modules: MT4113 Computing in Statistics SCOTCAT Credits: 15 SCQF Level 10 Semester 1 12.00 noon Mon (odd weeks) and Wed, 12.00 noon - 2.00 pm Fri The aim of this module is to teach computer programming skills, including principles of good programming practice, with an emphasis on statistical computing. Practical work focusses on the widely-used statistical language and environment R. Practical skills are developed through a series of computing exercises that include (1) modular programming; (2) manipulating data; (3) simulating data with specific statistical properties, (4) investigating behaviour of statistical procedures under failure of statistical assumptions. Before taking this module you must pass MT2508 Weekly contact: 1.5-hour lectures (x 10 weeks), 2-hour practical classes (x 10 weeks) 2-hour Written Examination = 40%, Coursework = 60% Re-assessment pattern: Module teaching staff: 1-hour 40 minute Written Examination = 40%, Coursework (4 new programming assignments) = 60% Prof L J Thomas Prof L Thomas Page 20.2.1

MT5761 Statistical Modelling SCOTCAT Credits: 15 SCQF Level 11 Semester 1 Mon, Tues, Thur, Fri 3:00-4:00 (lectures), Tues, Thur 4:00-5:00 (practicals) This applied statistics module covers the main aspects of linear models (LMs) and generalized linear models (GLMs). In each case the course describes model specification, various options for model selection, model assessment and tools for diagnosing model faults. Common modelling issues such as collinearity and residual correlation are also addressed, and as a consequence of the latter the Generalized Least squares (GLS) method is described. The GLM component has emphasis on models for count data and presence/absence data while GLMs for multinomial (sometimes called choice-based models) are also covered for nominal and ordinal response outcomes. The largest part of the course material is taught inside an environmental impact assessment case study with reality-based research objectives. Political and medical examples are used to illustrate the multinomial models. Undergraduates must have passed at least one of MT4113, MT4527, MT4528, MT4530, MT4531, MT4537, MT4539, MT4606, MT4608, MT4609, MT4614. You cannot take this module if you take MT4607 or take MT5753 Weekly contact: 4 lectures (x 5 weeks), 2 practicals (x 5 weeks) Scheduled learning: 30 hours 2-hour Written Examination = 50%, Coursework = 50% Re-assessment pattern: 2-hour Written Examination = 100% Module teaching staff: TBC Guided independent study: 117 hours MT5762 Introductory Data Analysis SCOTCAT Credits: 15 SCQF Level 11 Semester 1 Availability restrictions: Not available to Undergraduate students. Mon, Tue, Fri 2:00-3:30, Thur 3:30-5:00 This module provides coverage of essential statistical concepts and analysis methods relevant to commercial analysis. Specifically: the different types of data and their numerical/graphical treatment; basic probability theory and concepts of inference; fundamental statistical concepts with particular emphasis on sampling issues; basic statistical models and tests; linear models; introductory computer-intensive inference. This module is a short intensive course and is a core, preliminary, requirement for the MSc in Applied Statistics and Datamining. It covers material essential for study of the more advanced statistical methods encountered in subsequent modules. Coursework = 100% You cannot take this module if you take MT5756 Weekly contact: Four 1.5-hour lectures (x 5 weeks) Dr C R Donovan Module teaching staff: Dr D Donovan, Dr L Scott-Hayward Page 20.2.2

MT5763 Software for Data Analysis Mathematics & Statistics - Data Intensive Analysis - 2018/9 - August 2018 SCOTCAT Credits: 15 SCQF Level 11 Semester 1 Availability restrictions: Not available to Undergraduate students Mon, Tues, Fri 3:30-4:30 (lectures). Mon, Tues, Fri 4:30-5:30 (Practicals) This module covers the practical computing aspects of statistical data analysis, focussing on packages most widely used in the commercial sector (R, SAS, SPSS & Excel). We cover the accessing, manipulation, checking and presentation of data (visual and numerical). We fit various statistical models to data, with subsequent assessment, interpretation and presentation. Good practice and 'reproducible research' is covered, as is computer intensive inference and big data considerations. This module is a short intensive course and is a core, preliminary, requirement for the MSc in Applied Statistics and Datamining and the MSc in Data Intensive Analysis. It covers material essential for study of the more advanced statistical methods encountered in subsequent modules. Pass in MT1007 or MT3507 or MT3508 or be taking MT5762 You cannot take this module if you take MT5756 Weekly contact: Three 2-hour lecture/practical classes (x 5 weeks) Scheduled learning: 30 hours Coursework = 100% Re-assessment pattern: Coursework = 100% Module teaching staff: TBC Guided independent study: 120 hours MT5764 Advanced Data Analysis SCOTCAT Credits: 15 SCQF Level 11 Semester 2 Mon 12:00-1:00 Weeks 2, 4, 5, 8, 10 Tues, Thur 12:00-2:00, Weeks 1-10 (lectures) Tues 2:00-3:00 Weeks 2-9 (practicals) This module covers modern modelling methods for situations where the data fails to meet the assumptions of common statistical models and simple remedies do not suffice. This represents a lot of real world data. Methods covered include: nonlinear models; basic splines and Generalised Additive Models; LASSO and the Elastic Net; models for non-independent errors and random effects. Pragmatic data imputation is covered with associated issues. Computer intensive inference is considered throughout. Practical applications build sought-after skills in R and the commercial packages SAS. Undergraduates must pass MT4607 or MT5753 or MT5761 You cannot take this module if you take MT5757 Weekly contact: 2.5 hours of lectures lectures (Weeks 1-10) and 8 practicals over the semester. 2-hour Written Examination = 60%, Coursework = 40% Re-assessment pattern: 2-hour Written Examination = 100% Module teaching staff: TBC Page 20.2.3

ID5059 Knowledge Discovery and Datamining SCOTCAT Credits: 15 SCQF Level 11 Semester 2 11.00 am Mon (odd weeks), Wed and Fri Contemporary data collection can be automated and on a massive scale e.g. credit card transaction databases. Large databases potentially carry a wealth of important information that could inform business strategy, identify criminal activities, characterise network faults etc. These large scale problems may preclude the standard carefully constructed statistical models, necessitating highly automated approaches. This module covers many of the methods found under the banner of Datamining, building from a theoretical perspective but ultimately teaching practical application. Topics covered include: historical/philosophical perspectives, model selection algorithms and optimality measures, tree methods, bagging and boosting, neural nets, and classification in general. Practical applications build sought-after skills in programming (typically R, SAS or python). Weekly contact: Lectures, seminars, tutorials and practical classes. 2-hour Written Examination = 60%, Coursework = 40% Re-assessment pattern: 2-hour Written Examination = 60%, Existing Coursework = 40% Module teaching staff: Dr T W Kelsey Dr T Kelsey, Dr R Hoffmann One of: CS5099 Dissertation in Computer Science SCOTCAT Credits: 60 SCQF Level 11 Semester Full Year To be arranged. This module is an individually supervised MSc project on a topic in Computer Science. It results in a dissertation of no more than 15,000 words. Typically the dissertation comprises a review of related work, the extension of old or development of new ideas, software implementation and testing, analyses and evaluation. Students are required to give a presentation of their work. Module teaching staff: Requires admission to dissertation phase of msc and permission of the head of school You cannot take this module if you take CS5098 Weekly contact: Meeting with supervisor. Scheduled learning: 0 hours Coursework = 100% Guided independent study: 0 hours TBC Module coordinator(s): Director of Postgraduate Teaching - Computer Science (dopgt-cs@st-andrews.ac.uk) Page 20.2.4

Or: MT5099 Dissertation for MSc Programme/s SCOTCAT Credits: 60 SCQF Level 11 Semester Full Year At times to be arranged with the supervisor. Student dissertations will be supervised by members of the teaching staff who will advise on the choice of subject and provide guidance throughout the progress of the dissertation. The completed dissertation must be no more than 15,000 words. Weekly contact: Individual supervision Scheduled learning: 0 hours Dissertation = 100% Re-assessment pattern: No Re-Assessment Available Dr J D Mitchell Guided independent study: 0 hours Optional modules are available - see the pdf online called Computer Science optional modules 2018-2019 Page 20.2.5

Page 20.2.6