Genetics 245 / Statistics 345

Computational Algorithms for Statistical Genetics

Spring 2009   ●   TuTh 11:00AM - 12:15PM      Alway Building Room M104

Professors Hua Tang and Nancy Zhang


Announcements

 

05/30/09: Nick will hold extra office hours on Monday, June 1, 2:30-3:30 pm.

05/28/09: Hua will move her office hours next week to Wed, 12-1 pm instead of Thurs 3:30-4:30.

05/05/09: You need to submit a half page proposal of your final project on May 14, along with your homework 2.

05/01/09: Homework 2 is posted [here]

04/16/09: Homework 1 is posted [here]

04/13/09: For this week only, Nick will hold office hour on Friday instead of Wednesday.

The first class meeting, 3/31, will be held in Alway M114.

Contacts

 

 

Office / Office hours

Instructors

Hua Tang (huatang)

Alway M335, 3:30-4:30 PM Thursdays

 

Nancy Zhang (nzhang)

Sequoia 141, 9:30-10:30 PM Tuesdays

TA

Nick Johnson (nickaj)

Alway M327, 2:30-3:30 PM Tuesdays

Course email

gen245@gmail.com

Please use this address for course-related emails 

Tentative Syallabus

Slides for the lectures will be available here, or at the mirroring site [here].

Week

 

Date

Topic

Lecturer

Reading

Slides / code

1

 

 

 

 

 

 

 

1

31-Mar

Introduction, course logistics

HT

Stat & Prob primer [by Woolfe et al.]

  Slides

 

2

2-Apr

Likelihood & Bayesian inference

NZ

R primer [by Emmanuel Paradis]

  Slides

 

 

 

 

 

 

 

 

3

7-Apr

EM

HT

Dempster et al. JRSSB, 1977.

  Slides | R code

 

4

9-Apr

HMM

HT

HMM tutorial by Rabiner

  Slides

3

 

 

 

 

 

 

 

5

14-Apr

HMM example I: fastPHASE

HT

Scheet & Stephens, AJHG, 2006.

  Slides

 

6

16-Apr

Monte Carlo integration, rejection method

HT

Notes by Stefan Weinzierl 

  Slides | R code

4

 

 

 

 

 

 

 

7

21-Apr

HMM example II: conserved element

NZ

Siepel et al. Genome Research, 2005.      Siepel et al. supplementary materials.

 Slides, notes

 

8

23-Apr

Rejection method example: coalescent

HT

Tavare et al. Genetics, 1997.

 Slides

5

 

 

 

 

 

 

 

9

28-Apr

Metropolis-Hastings algorithm

HT

Chib & Greenberg, American Statistician, 1995.

 Slides

 

10

30-Apr

Gibbs sampling

NZ

Casella & George, American Statistician, 1992.

 Slides

6

 

 

 

 

 

 

 

11

5-May

Gibbs sampling example

NZ

Liu et al., JASA, 1995.

 Slides, notes

 

12

7-May

Metropolis-Hastings example

HT

Pritchard et al., Genetics, 2000.

 Slides, movie

7

 

 

 

 

 

 

 

13

12-May

Bootstrap

NZ

Efron & Tibshirani. Stat. Sci, 1986.

 Slides, R code, mammal.txt

 

14

14-May

Bootstrap example: phylogenetic analysis

HT

Felsenstein, Evolution, 1985.

 Slides

8

 

 

 

 

 

 

 

15

19-May

Bootstrapping correlated data.  Example: genomic profiling 

NZ

Buhlmann et al. Statistical Sciences 2002

 Slides, Animation

 

16

21-May

Multiple hypothesis testing

NZ

Shaffer J, Annual Review of Psychology, 1995

Benjamini & Hochberg, JRSSB, 1995

 Slides, R code

9

 

 

 

 

 

 

 

17

26-May

Scan statistics & change point models

NZ

James et al. Biometrika, 1987

 Slides

 

18

28-May

Scan statistics example

NZ

Karlin & Brendel, Science, 1992

Feingold et al. AJHG, 1993.

 Slides

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Course requirements

There will be three assignments, and one final projects. All will require some programming.

Homework 1. (Due April 30)    Homework 1 Solutions

Homework 2. (Due May 14)     Homework 2 Solutions

Homework 3. (Due June 2)  For problem 3: X.txt,  XY.txt

Final Project  Guidelines (Due June 4)

 

Grading

Homeworks:

60%

Final Project:

40%

 

Pre-requisite

Basic probability at the level of Stats 116.

Textbook

There are no required texts for this class. Reading materials to complement the lectures will be posted here or distributed in class. Below is a partial list of books that covers some of the topics at a more advanced level.