Computational Biology

Similar documents
Biology 1 General Biology, Lecture Sections: 47231, and Fall 2017

General Microbiology (BIOL ) Course Syllabus

GRADUATE STUDENT HANDBOOK Master of Science Programs in Biostatistics

CS 100: Principles of Computing

Office Hours: Mon & Fri 10:00-12:00. Course Description

University of Massachusetts Lowell Graduate School of Education Program Evaluation Spring Online

MATH 205: Mathematics for K 8 Teachers: Number and Operations Western Kentucky University Spring 2017

Class Meeting Time and Place: Section 3: MTWF10:00-10:50 TILT 221

Course Policies and Syllabus BUL3130 The Legal, Ethical, and Social Aspects of Business Syllabus Spring A 2017 ONLINE

GAT General (Analytical Reasoning Section) NOTE: This is GAT-C where: English-40%, Analytical Reasoning-30%, Quantitative-30% GAT

Full text of O L O W Science As Inquiry conference. Science as Inquiry

Chromatography Syllabus and Course Information 2 Credits Fall 2016

HCI 440: Introduction to User-Centered Design Winter Instructor Ugochi Acholonu, Ph.D. College of Computing & Digital Media, DePaul University

SYLLABUS. EC 322 Intermediate Macroeconomics Fall 2012

Social Media Marketing BUS COURSE OUTLINE

MTH 141 Calculus 1 Syllabus Spring 2017

SAMPLE SYLLABUS. Master of Health Care Administration Academic Center 3rd Floor Des Moines, Iowa 50312

MAT 122 Intermediate Algebra Syllabus Summer 2016

Course Content Concepts

Astronomy/Physics 1404 Introductory Astronomy II Course Syllabus

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Human Development: Life Span Spring 2017 Syllabus Psych 220 (Section 002) M/W 4:00-6:30PM, 120 MARB

State University of New York at Buffalo INTRODUCTION TO STATISTICS PSC 408 Fall 2015 M,W,F 1-1:50 NSC 210

Course Syllabus Solid Waste Management and Environmental Health ENVH 445 Fall Quarter 2016 (3 Credits)

Financial Accounting Concepts and Research

EECS 700: Computer Modeling, Simulation, and Visualization Fall 2014

Business Finance 3400 Introduction to Real Estate Autumn Semester, 2017

Outreach Connect User Manual

Computer Architecture CSC

Integral Teaching Fellowship Application Packet Spring 2018

We are strong in research and particularly noted in software engineering, information security and privacy, and humane gaming.

JN2000: Introduction to Journalism Syllabus Fall 2016 Tuesdays and Thursdays 12:30 1:45 p.m., Arrupe Hall 222

English 2319 British Literature Heroes, Villains, and Monsters in British Literature

This course has been proposed to fulfill the Individuals, Institutions, and Cultures Level 1 pillar.

Scottsdale Community College Spring 2016 CIS190 Intro to LANs CIS105 or permission of Instructor

Instructor: Khaled Kassem (Mr. K) Classroom: C Use the message tool within UNM LEARN, or

Psychology 101(3cr): Introduction to Psychology (Summer 2016) Monday - Thursday 4:00-5:50pm - Gruening 413

Syllabus ENGR 190 Introductory Calculus (QR)

BIOL Nutrition and Diet Therapy Blinn College-Bryan Campus Course Syllabus Spring 2011

Social Media Journalism J336F Unique Spring 2016

Office Hours: Day Time Location TR 12:00pm - 2:00pm Main Campus Carl DeSantis Building 5136

Penn State University - University Park MATH 140 Instructor Syllabus, Calculus with Analytic Geometry I Fall 2010

MAR Environmental Problems & Solutions. Stony Brook University School of Marine & Atmospheric Sciences (SoMAS)

An Introductory Blackboard (elearn) Guide For Parents

ECON492 Senior Capstone Seminar: Cost-Benefit and Local Economic Policy Analysis Fall 2017 Instructor: Dr. Anita Alves Pena

PSY 1012 General Psychology. Course Policies and Syllabus

CS Course Missive

Strategic Management (MBA 800-AE) Fall 2010

MKT ADVERTISING. Fall 2016

MBA 5652, Research Methods Course Syllabus. Course Description. Course Material(s) Course Learning Outcomes. Credits.

POFI 1349 Spreadsheets ONLINE COURSE SYLLABUS

ACTL5103 Stochastic Modelling For Actuaries. Course Outline Semester 2, 2014

Prerequisite: General Biology 107 (UE) and 107L (UE) with a grade of C- or better. Chemistry 118 (UE) and 118L (UE) or permission of instructor.

12-WEEK GRE STUDY PLAN

Language Acquisition Chart

INTRODUCTION TO CULTURAL ANTHROPOLOGY ANT 2410 FALL 2015

George Mason University Graduate School of Education Education Leadership Program. Course Syllabus Spring 2006

Class Tuesdays & Thursdays 12:30-1:45 pm Friday 107. Office Tuesdays 9:30 am - 10:30 am, Friday 352-B (3 rd floor) or by appointment

Foothill College Summer 2016

PSYCHOLOGY 353: SOCIAL AND PERSONALITY DEVELOPMENT IN CHILDREN SPRING 2006

Teaching a Discussion Section

Computational Data Analysis Techniques In Economics And Finance

Citrine Informatics. The Latest from Citrine. Citrine Informatics. The data analytics platform for the physical world

Syllabus: INF382D Introduction to Information Resources & Services Spring 2013

Evaluation of a College Freshman Diversity Research Program

San José State University

BIOS 104 Biology for Non-Science Majors Spring 2016 CRN Course Syllabus

National Survey of Student Engagement

Math 96: Intermediate Algebra in Context

M.S. in Environmental Science Graduate Program Handbook. Department of Biology, Geology, and Environmental Science

Physics XL 6B Reg# # Units: 5. Office Hour: Tuesday 5 pm to 7:30 pm; Wednesday 5 pm to 6:15 pm

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

GLBL 210: Global Issues

Spring 2015 IET4451 Systems Simulation Course Syllabus for Traditional, Hybrid, and Online Classes

FINN FINANCIAL MANAGEMENT Spring 2014

CS 3516: Computer Networks

Course Syllabus It is the responsibility of each student to carefully review the course syllabus. The content is subject to revision with notice.

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

AGED 4103 METHODS AND SKILLS OF TEACHING AND MANAGEMENT IN AGRICULTURAL EDUCATION

LEAD 612 Advanced Qualitative Research Fall 2015 Dr. Lea Hubbard Camino Hall 101A

GUIDELINES FOR HUMAN GENETICS

Marketing Management MBA 706 Mondays 2:00-4:50

COURSE DESCRIPTION PREREQUISITE COURSE PURPOSE

Syllabus - ESET 369 Embedded Systems Software, Fall 2016

DOCTOR OF PHILOSOPHY HANDBOOK

Poster Presentation Best Practices. Kuba Glazek, Ph.D. Methodology Expert National Center for Academic and Dissertation Excellence Los Angeles

TCH_LRN 531 Frameworks for Research in Mathematics and Science Education (3 Credits)

BIODIVERSITY: CAUSES, CONSEQUENCES, AND CONSERVATION

Syllabus Foundations of Finance Summer 2014 FINC-UB

Beginning and Intermediate Algebra, by Elayn Martin-Gay, Second Custom Edition for Los Angeles Mission College. ISBN 13:

The Heart of Philosophy, Jacob Needleman, ISBN#: LTCC Bookstore:

Office: CLSB 5S 066 (via South Tower elevators)


Accounting 312: Fundamentals of Managerial Accounting Syllabus Spring Brown

Social Media Journalism J336F Unique ID CMA Fall 2012

PSCH 312: Social Psychology

Assessment System for M.S. in Health Professions Education (rev. 4/2011)

UNA PROFESSIONAL ACCOUNTING PREP PROGRAM

Class Numbers: & Personal Financial Management. Sections: RVCC & RVDC. Summer 2008 FIN Fully Online

TEACHING ASSISTANT TBD

Transcription:

Computational Biology Spring 2017 Class meetings: Enzi STEM Building 155 3:10 5:00 p.m., Tuesday & Thursday Websites: www.uwyo.edu/buerkle/compbio uwyo.instructure.com Course numbers: Botany 4550, 5550 and Comp. Sci. 5010 Professor: Alex Buerkle T.A.: Mallory Lai Email: buerkle@uwyo.edu mstrong3@uwyo.edu Office: Aven Nelson 202 Aven Nelson 138 Office Hours: Wednesday and Friday 1 2 p.m. by appointment (and by appointment) Overview Most subdisciplines in modern biology involve the analysis of large amounts of data, inference based on probabilistic models, or both. This course exists to help students gain skills in data analysis and combines elements of applied computational science (data wrangling, computer systems, etc.) with probability and statistics. These are challenging aspects of modern research and can take years to master, which might be one reason why some people neglect to develop their skills in this area. However, you will be able to learn and do new useful things very quickly. Computational biology can open a world of possibilities for research and employment. The computational and analysis skills we will practice are increasingly imperative for modern biologists (e.g., see the mention of computation, statistics and quantitative analysis in many job advertisements). Without them the scope and ambition for biological research are unnecessarily constrained. In addition, the problems and computational approaches we will study are good application domains for students from outside biology who are interested in pursuing computational science. This course will be motivated by practical applications of probability, simple mathematics and computational tools to biology. In each section of the course we will begin with biological questions and then investigate computational methods for graphical and statistical analysis of real data sets. I have a few main goals in this class: 1) to discover the importance of probability, mathematics and computational methods in biology, 2) to understand philosophies and conceptual frameworks underlying commonly-used statistics and analytical methods, and 3) to become proficient with analytical and computational tools that can be applied to biological problems. I hope that by the end of the course you will have an appreciation of the diversity of applications of these analytical tools and the role they play in modern biology, and have begun to develop proficiency in these areas. 1

Course Materials 1. O. Jones, R. Maillardet, and A. Robinson. 2014. Introduction to Scientific Programming and Simulation Using R. 2 nd edition. CRC Press. Please acquire a paper copy or gain access to the free e-book copy through the UW Library. 2. Additional materials will be distributed in class or linked from the course website. 3. Optional supplemental and potentially useful materials: a. G. Grolemund and H. Wickham. 2016. R for Data Science. (see course webpage for URL) b. R. Peng. 2015. R Programming for Data Science. Leanpub. (see course webpage for URL) c. B. M. Bolker. 2008. Ecological Models and Data in R. Princeton University Press. (we will read and use several chapters from this book; I will provide pdf copies of chapters). d. S. H. D. Haddock and C. W. Dunn. 2011. Practical Computing for Biologists. Sinauer Associates. Assessment and grading Assessmentofyourworkinthiscoursewillbedoneregularlybyyou. Youwilldoindependent work (e.g., reading, exercises, problem solving) and determine through interactions with the class what you did correctly, where you need further study or have questions, and so forth. Several small assignments will be graded for completion (not accuracy) to offer an incentive to keep up with classwork. Given that this is an upper-level undergraduate or graduate course, it will be your responsibility to gauge your competency and understanding of these exercises. I will help motivate you by offering challenging and engaging course material, but your success will depend heavily on your personal motivation. Your performance will be evaluated formally based on three write-ups of lab projects, two exams, and several small assignments (graded for completion). In addition, graduate students will write and present short papers on specific topics in computational biology. Grading will be on a standard scale (i.e., 90 100=A, 80 89=B, etc.). Undergraduate Graduate Project papers 3 60 = 180 180 Exams 2 30 = 60 60 Small assignments 10 4 = 40 40 Graduate student project 30 280 310 2

Project papers Three of the four sections of the course will culminate in a project paper. The task is to report findings of a computational analysis and to interpret them in the context of the biological questions that motivated the study. The format and length of these will vary slightly from topic to topic. Project reports will be written individually, but they will be based on collaboration among students and the instructor. Due dates will be announced in class and posted on the course s WyoCourse website. Papers will be submitted electronically, via the WyoCourse website. Short exams Two short exams (scheduled for 28 March and 4 May) will be used to assess your understanding of major concepts. In practice people do statistics, mathematics and computation with the assistance of books, computers and colleagues. Thus, for the exams, I will ask you to address concepts and big-picture items, things that you would expect to be able to do without books, etc. Questions on the exam will call for short answers of a few sentences or short essays. We will discuss sample questions before the first exam to illustrate the types of questions I will ask. Small graded assignments Exercises and problem sets will be associated with several subjects and will be assigned regularly. Ten assignments will be graded for completion (will be listed on the course s website) and will be submitted electronically through the website. You will have opportunities to ask questions about assignments during class, but you will require time outside of class to complete the exercises. A subset of these graded assignments will be done as a group and we will compare these critically as one would for a bake- or cook-off at a fair. Note-taking and questions You will want to take notes on reading assignments, make outlines of important concepts, and make notes on computer code that is of use to you. In class meetings I will present lectures, answer questions, offer further explanations of material and facilitate discussions. The course s website is an additional resource for asking questions and discussing topics outside of class, and we will use it in various ways to interact outside of class. Graduate students Beyond the above requirements, each graduate student will write a short paper that introduces a specific topic in computational biology (the equivalent of two substantial 8 1 2 11 pages, plus any images, references and URLs to key resources, on an individual web page 3

associated with the course). Each student will write on a different topic. The purpose of this assignment is for the individual student to dig deeper into a particular subject, and to share the findings with the class and enrich our literacy in computational science. In addition to the short paper, graduate students will give short presentations on their findings to the class. The presentation and paper will account for 30 points and the course grades of graduate students will be calculated out of 310 possible points. The due date for the written assignment will be 6 April and students will choose a time for their presentations in the last 3 weeks of classes. Below is a list of suggested topics. Students are welcome to suggest and discuss additional topics with me: 1. Data archiving at Dryad and other public repositories 2. The National Science Foundation s requirement for data management plans 3. Network file systems for scientific computing 4. Types of parallel computing for science 5. High performance computing in a cluster environment (e.g., Mt. Moran computer at UW) 6. Tools and methods for reproducible research 7. Version control systems for documents, software development, and other collaborative work (cvs, svn, git, Google docs, etc.) 8. Hardware and software tools for secure data sharing and storage 9. Modern database technologies 10. Programming libraries for scientific computing (HDF5, GSL, etc.) 11. What, if anything, is a supercomputer? 12. What is Cloud computing and what is its relevance for academic computing? 13. The Cyverse (formerly, the iplant Collaborative) 14. A comparison of machine learning methods to parametric modeling Computer use and etiquette We are using a room full of computers for this class. Occasionally, a student has difficulty focusing on the class material and not being distracted by the many distractions that a networked computer offers. These distractions are typically detrimental to the attention of the individual user, but also to the students who sit nearby. Therefore I ask that you find a solution to manage and minimize these distractions for yourself. For example, I ask that you do not consult the web while I or anyone else is presenting material to the class, because it will interfere with your ability to follow the presenter. Likewise, there is no reason to use personal email or other messaging software during class. Of course, legitimate uses of these resources during class exist, particularly during those times when I ask you to use the computers. These include transferring a copy of a file to yourself or another student via email, consulting the web for computational science resources related to class work, taking notes, and trying out code as I present it on screen. Please use your best judgment and minimize distractions to yourself and others. 4

Additional Items The schedule of topics, assignments, and all other details in this syllabus are subject to change with fair warning, including announcements in class or via university email addresses and the course website. Any student who has a disability and is in need of classroom accommodations please contact the instructor and the University Disability Support Services. Students whose religious activities conflict with the class schedule should contact the instructor at the beginning of the semester to make alternative arrangements. Cheating and other forms of academic dishonesty are listed in University Regulation 802, Revision 2. If you are found to be engaged in academic misconduct, at a minimum you will receive no credit for that exam or assignment. Repeat or serious offenders can expect more serious consequences. Overview of Topics 1. Project I: Analysis of body weight and lifespan of laboratory mice Class meetings 2 8, 3.5 weeks Introduction to computational science Exploratory data analysis and simple tests of hypotheses Probability Theory I Introduction to R 2. Project II: Poisson processes and the distribution of palindromes in cytomegalovirus DNA Class meetings 9 14, 3 weeks Probability Theory II Poisson processes and associated distributions Statistical analysis of categorical data Paper on Project II, due 9 March 3. Project III: Improbable Wyoming: parameter estimation using likelihood and Bayesian methods Class meetings 15 20, 3 weeks Likelihood and Bayesian parameter estimation Numerical analysis Paper on Project III, due 11 April 4. Project IV: Discovery and quantification of features of a genome Class meetings 21 28, 4 weeks Data wrangling and extraction with large data sets Hierarchical Bayesian modeling Paper on Project IV, due by 9 May (Tuesday of final exam week) 5