| Instructors | Brian Junker 132E Baker Hall 268-2718 brian@stat.cmu.edu Office hours: T,Th 10-11am | Kim Sellers 228C Baker Hall 268-3967 ksellers@stat.cmu.edu Office Hours: M 2-3pm | 
|---|---|---|
| Class Schedule | Tuesday & Thursday, 3-4:20 p.m. 237B Baker Hall | |
| Syllabus & Policies | postscript form | |
| Teaching Assistant | Marnie Bertolet 132B Baker Hall 268-8591 mhb2@stat.cmu.edu office hours: Wed 3:30-4:30, Wean 5202 | 
 To
401 Home Page
To
401 Home Page
| Date | Subject Material | Reading | Homework | 
|---|---|---|---|
| w1 Tu 8/28 | Course introduction, Examples | ||
| w1 Th 8/30 | Introduction to Splus (Meet in Wean 5202) | MASS pp. 1-68: skim headings | HW1 -
REVISED (please read) Part A Solutions Part B Solutions | 
| w2 Tu 9/4 | Exploratory data analysis | RwG pp. 1-23 (Chapter 1) | |
| w2 Th 9/6 | Finishing examples of EDA with Splus | ||
| w3 THU 9/11 | CLASS CANCELLED | ||
| w3 Tu 9/13 | Simple linear regression | RwG pp. 29-59 (Chapter 2) | REVISED Reading (due 9/18) and Writing (due 9/20) Example of a reasonable answer | 
| w3 Th 9/13 | HW02 due Thurs Sept 20 (NO REVISION) SOLUTIONS | ||
| w4 Tu 9/18 | More simple regression, maybe some Mult. Regr. | RWG Ch 2, and pp 66-69 (start of Ch 3) | |
| w4 Th 9/20 | Matrix algebra and Multiple Regression | RWG, pp 65-77 RWG pp333-342 | HW03 due
Thurs Sept 27 Solutions (Due date changed to Tues Oct 2) | 
| w5 Tu 9/25 | Example of multiple regression: the wage data | ||
| w5 Th 9/27 | Example of mulitple regression: the fuel consumption data | ||
| w6 Tu 10/2 | Variable Selection I | RWG pp 77-84 | HW04 due Tue Oct 9 Solutions | 
| w6 Th 10/4 | Variable Selection II | Reading/writing
discussion exercise, due Thu Oct 11. Here is a partial outline of Alley Ch2 (optional) | |
| w7 Tu10/9 | Interactions and Dummy Variables (tentative) | RWG pp 84-101 | (MIDTERM EXAM MOVED TO TUE OCT 16) | 
| w7 Th 10/11 | Review for midterm exam | ||
| w8 Tu 10/16 | MIDTERM EXAM | ||
| w8 Th 10/18 | Regression Criticism I: Assumptions | RWG Ch 4 | HW05 (Due Oct 25) Solutions | 
| w9 Tu 10/23 | Regression Criticism II: Outliers and Influence | RWG Ch 4 | |
| w9 Th 10/25 | 401/402 Project Descriptions | Principal Investigator Visits | See description sheets. | 
| w10 Tu 10/30 | 401/402 Project Descriptions | Principal Investigator Visits | See project selection sheets. | 
| w10 Th 11/1 | Nonlinear regression I | RWG Ch 5 | |
| w11 Tu 11/6 | Nonlinear regression II | RWG Ch 5 | |
| w11 Th 11/8 | Robust Regression I | RWG Ch 6 | HW06
(Due Thu Nov 15) solutions | 
| w12 Tu 11/13 | Robust Regression II | RWG Ch 6 | |
| w12 Th 11/15 | Logistic Regression I | RWG Ch 7 | HW07 (Due Thu Nov 27) solutions | 
| w13 Tu 11/20 | Logistic Regression II (?) | RWG Ch 7 | |
| w13 Th 11/22 | THANKSGIVING | ||
| w14(*) 11/27 | Principal Components I | RWG Ch 8 | HW08 (Due Thu Dec 6) solutions | 
| w14 (*) 11/29 | Principal Components II | RWG Ch 8 | |
| w15(*) 12/4 | Factor Analysis (from Principal Components II) | RWG Ch 8 | |
| w15(*) 12/6 | Work on Projects in Wean computer cluster | ||
| w16 12/11 | Project papers due Fri Dec 14 | ||
| Finals Week | FINAL EXAM THU DEC 13, 830--1130 | 
 to
Top of Page   
To 401 Home Page
to
Top of Page   
To 401 Home Page
| Link | Description | More info | 
|---|---|---|
| concord1.dat | Concord household water use | concord1.txt | 
| widgets.dat | Widget data for HW 1, problem 2 | |
| seapart.dat | Seabird data (partial); for HW 1, problem 3 | See RwG p. 61 | 
| cane.dat | Sugar cane data for HW1, problem | cane.txt | 
| Prestige.dat | Job Prestige data from Canadian Census | Prestige.cbk (more info, and variable names) | 
| cyclist.html | Cylist data | |
| homedat.html | Home resale data | |
| fuel-cr.dat | Consumer reports data | Columns: Weight Disp Mileage Fuel Type | 
| fuel-alr.dat | Fuel consumption data for the 50 states | |
| grass.dat | Chemical factors in soil affecting spartina graass growth | grass.txt | 
| lead.dat | Lead toxicity data | |
| televisions.dat | Life expectancy data | televisions.txt | 
| wages.dat | CPS wage data | wages.txt | 
| cheese.dat | Cheese data | cheese.txt | 
| ozone.dat | Ozone data | ozone.txt | 
| salamander.dat | Salamander data | See RwG p. 104 | 
| cassava.dat | Cassava data | See RwG p. 137 | 
| deforest.dat | Deforestation data | See RwG p. 175 | 
| solrad.dat | Solar radiation data | See RwG p. 177 | 
| eggs.dat | Egg data | See RwG p. 180 | 
| kob.dat | Kob data | See RwG p. 213 | 
| titanic.dat | Titanic data | Columns: Class, Adult, Male, Survive | 
| streams.dat | streams data for hw 08 | |
| shetland.dat | shetland data for hw 08 | 
 to
Top of Page   
To 401 Home Page
to
Top of Page   
To 401 Home Page
| Link | Description | More info | 
|---|---|---|
| Avoiding statistical pitfalls | 8/28 Handout to be discussed Sep 6 in class. | Postscript file | 
| Splus on campus | 8/28 Where to run Splus on campus | Postscript file | 
| Depression and Sleep | 8/28 Handout(s) for breakout exercise | Postscript file | 
| Unix.ps | 8/30Unix Summary | |
| Sintro.ps | 8/30 Notes: Introduction to S-plus | |
| RWG Ch 1 handouts | 9/4 Handouts to accompany in-class S demo | |
| 9/11 CLASS CANCELLED | ||
| announce-week03.ps | 9/11 Announcement with Reading and Writing Assignments | |
| handouts | 9/13 lecture notes and handouts | |
| Simple regression | 9/18 lecture notes and breakout exercise | Postscript files | 
| Simple
regression in matrix form (Handed out in class 9/20; discussed 9/27) | 9/20 Handouts to accompany lecture | Postscript file. Please read the handout for next Tuesday's (9/27) lecture | 
| Multiple regression example: Fuel consumption data | 9/29 Handout to accompany lecture | Postscript file. | 
| Variable Selection I | 10/2 Handout to accompany lecture | Postscript file. | 
| Variable Selection II | 10/4 Handout to accompany lecture | Postsctipt file. | 
| Interactions and Dummy Variables | 10/9 | Postscript file. | 
| Case study: Personal Ozone Levels in Children | 10/11 | Postscrtipt file. | 
| 10/16 MIDTERM EXAM | ||
| Model Cricicism I: Assumptions | 10/18 Lecture notes and exercise | Postscript file. | 
| Model Criticism II: Influence Analysis | 10/23 Lecture notes and exercise | postscript file. | 
| 401/402 PROJECT DESCRIPTIONS | 10/25 Project presentations by investigators | postscript file | 
| 401/402 PROJECT SELECTION SHEETS | 10/30 Project presentations by investigators | postscript file | 
| Nonlinear Regression I (Ch 5 RWG) | 11/1 Lecture notes | postscript file | 
| Nonlinear Regression II (Ch 5 RWG) | 11/6 lecture notes and exercise | postscript file | 
| Robust Regression I (Ch 6 RWG) | 11/8 lecture notes and exercise | postscript file; see also breakout session. | 
| Robust Regresssion II (Ch 6 RWG) | 11/13 lecture notes and exercise | |
| Logistic Regression I (Ch 7 RWG) | 11/15 lecture notes and exercise | |
| Logistic Regression II (Ch 7 RWG) (?) | 11/20 lecture notes and exercise | |
| Principal components I (Ch 8, RWG) | 11/27 lecture notes | |
| Principal components II (Ch 8, RWG) | 11/29 and 12/4 lecture notes; includes Factor Analysis | corrected pp 7,8,9 for lecture notes | 
| Latex.ps | Intro to using Latex for reports | template.tex | 
 to
Top of Page   
To 401 Home Page
to
Top of Page   
To 401 Home Page
 
| Link | Description | More info | 
|---|---|---|
| S-Tutorial | Written by a CMU Student | |
| cheatsheet | Splus usage summary | |
| contents.html | Splus on-line tutorial | |
| SplusTips.html | Splus Tips and Links | |
| Splus in MSWord | How to convert a ps figure from Splus into tif format, for inclusion in MS Word docs | |
| symplot.q | function: symmetry plot | |
| ps.q | function: copy a plot to a .ps file | and optionally print it | 
| Sintro.q | commands: Sintro.q | Solutions to Splus Intro exercises | 
| ndhist.q | function: histogram plus normal density | |
| rwgbox.q | function: boxplots in RwG style | |
| qplot.q | function: quantile plot | |
| qqn.q | function: quantile-normal plot | |
| qqenv.q | function: new quantile-normal plot that includes 95% conf envelope | |
| scatbox.q | function: scattergram + marginal boxplots | |
| stdres.q | function: calculate standardized residuals (better check of normality of residuals) | |
| collin.q | function: check collinearity for a set of X's | R^2 for predicted each on all others | 
| sum.step.q | function: exhaustive model selection by AIC, BIC or adj. R^2 | |
| mypairs.q | function: pairs plot with boxplots for few categories | |
| DW.q | function: Durbin-Watson test | |
| spruce.q | sample code: for hw6 (dummy/interaction plotting) | |
| influence.q | function: influence measures | |
| partreg.q | function: partial regression influence plots | |
| CookPlot.q | function: residual vs. fit plot with Cook Distance | |
| medplot.q | function: Exploratory band regression | |
| logitinf.q | function: influence measures and residuals for logistic regression |