Statistics 203: Introduction to Regression Models and ANOVA
Winter 2009 ● Tu Th 9:30-10:45 PM ● Sequoia Hall 200
Nancy R. Zhang ● nzhang atstanford ● Office Hours: Th 2:15-4:15 PM Sequoia Hall 141
A N N O U N C E M E N T S
The final exam can now be downloaded here. For data sets, see here.
Here are the solutions to the practice midterm.
Practice Midterm: This is a modification of a midterm from previous years. Your midterm will be similar in format, but with less emphasis on random effects and variable transformations, more emphasis on nested model testing. The midterm will be open book, open notes.
The R introductory session will be Friday, 1/16 4:00-5:15 PM in the Sequoia Hall Computer Lab (Room 211).
R introductory session slides by Yueh Wen Liao.
C O U R S E D E S C R I P T I O N
This course introduces statistical regression models and ANOVA. The content is very similar to STAT 191, but with more matrix theory.
We will cover the basic concepts behind these models, and apply them to the analysis of data sets. Please see the syllabus for more information.
P R E R E Q U I S I T E S
Basic probability and statistics at the level of Stat 200 and Stat 116. Basic linear algebra.
T A
YuehWen Liao (yuehwen@stanford) Office Hours: Mon, Wed 1-2 pm Sequoia Room 229T E X T B O O K S
Neter et al., Applied Linear Statistical Models, 5th Edition (Required)
Weisberg, Applied Linear Regression, 2nd Edition (Reference)
D A T A
Data sets used in this class are here.
L E C T U R E S (Materials will be posted here after every lecture.)
| Date | Materials |
| Tu 1/6 | Review. Slides, R examples. |
| Th 1/8 | Simple linear regression. Slides. |
| Tu 1/13 | Inference, diagnostics for linear regression. |
| Th 1/15 | Diagnostics for linear regression. Slides, R examples. |
| Tu 1/20 | Multiple diagnostics, ANOVA. Slides, R examples. |
| Th 1/22 | Multiple diagnostics, ANOVA Slides, R examples. |
| Tu 1/27 | Fixed and Random effects Slides, R examples |
| Th 1/29 | Fixed and Random effects Slides, R examples |
| Tu 2/3 | Variable transformations, Midterm Review Slides, R examples |
| Th 2/5 | Midterm Midterm solutions |
| Tu 2/10 | Weighted least squares, PCA. Slides, R examples |
| Th 2/12 | PCA, Model selection Slides, R examples |
| Tu 2/17 | Model selection. Slides, R examples |
| Th 2/19 | LARS, Logistic regression. Slides, R examples |
| Tu 2/24 | Logistic regression. Slides, R examples |
| Th 2/26 | Contingency tables. Slides, R examples |
| Tu 3/3 | Contingency tables. Slides, R examples |
| Th 3/5 | Regression with correlated errors (time series). Slides, R examples |
| Tu 3/10 | Take home final out (due Monday, 3/16). Download final exam. |
A S S I G N M E N T S
Assignments need to be handed in at the beginning of lecture on the due date. Solutions are posted the following day. You can be late by at most one day on at most 1 problem set, with deduction of 10% on grade.
| Due date | File | Solutions |
| Tues, 1/27 | Problem Set 1 | ps1sol.pdf |
| Thurs, 2/5 | Problem Set 2 | ps2sol.pdf |
| Tues, 2/24 | Problem Set 3 (data file: PatientSatisfaction.txt) | ps3sol.pdf |
| Thurs, 3/5 | Problem Set 4 | ps4sol.pdf |
R
We will be using R for most of the data analysis in this class. R can be freely downloaded here. If you are new to R, here is a brief introduction to the language. The Stats 141 page also has more extensive tutorials. Elizabeth Purdom's website also has some good resources.
Here is the R tutorial given by Jun Li from last year.
G R A D I N G
| Homeworks (4-5) | 40% |
| Midterm (in class) | 20% |
| Final (take home) | 40% |