CHPR290/Stat266/HRP292-- Course Files, Readings, Examples


Week 1--Course Introduction; Matching Methods part 1 (intro and theory)
In the news
Going to concerts helps you live longer (UK)       Going to Concerts Can Help You Live Longer, (Newsweek)      Going to a concert is better for you than yoga (NYPost)   Science says gig-going can help you live longer and increases wellbeing (o2 UK)     Going to Concerts Is Good for Your Health (Variety)

Lecture Topics             Lecture 1 slide deck
1. Course outline and logistics
2. A matched observational study (DOS, Chap 7)
3. Study design versus inference
4. Basic tools of multivariate matching (DOS, Secs 8.1-8.4)

Text Readings
Rosenbaum DOS: Chapters 7 and 8 (8.1-8.4)
Additional Resources
Observational Studies according to Donald B. Rubin
   For objective causal inference, design trumps analysis Annals of Applied Statistics, Volume 2, Number 3 (2008), 808-840.    Rubin talk .   Another Rubin overview of matching: Matching Methods for Causal Inference Stuart, E.A. and Rubin, D.B. (2007). Best Practices in Quasi-Experimental Designs: Matching methods for causal inference. Chapter 11 (pp. 155-176) in Best Practices in Quantitative Social Science. J. Osborne (Ed.). Thousand Oaks, CA: Sage Publications.

Computing Corner: Extended Data Analysis Examples
Lalonde NSW data (DOS sec 2.1). Subclassification/Stratification and Full matching.
   Week 1 handout       Rogosa R-session (using R 3.3.3)        4/1/18 redo in R 3.4.4 (sparse)      pdf slides shown in class
     MatchIt vignette

Week 1 Review Questions
From Computing Corner
1.  In Week 1 Computing Corner with the Lalonde data (effect of job training on earnings), we started out (see R-session) by showing the ubiquitous [epidemiology to economics] analysis for observational data of an analysis of covariance, aka tossing the treatment variable and all the confounders into a regression equation predicting outcome and hoping for the best (c.f 2016 Week 1 in the news analyses: mom fish consumption on child cognition). The statement made in class (technical details week 1 stat209) is that regression does not "control" for confounders; instead the coefficient of treament (putative causal effect) is obtained from a straight-line regression of outcome on the residuals from a prediction of treatment by all the other predictors in the regression. Demonstrate that equivalence using the ancova in CC1.       
 Solution for Review Question 1
2. RQ1 uses the Week 1 Computing Corner Lalonde data (effect of job training on earnings) analysis of covariance: tossing the treatment variable and all the confounders into a regression equation predicting outcome and hoping for the best. Compare that ancova with an ancova the uses just the significant predictors of re78. Also compare with an ancova which uses the single available covariate/confounder having the highest correlation with outcome. Are these analyses consistent?       
 Solution for Review Question 2
From Lecture
3. We will be working a lot with matching based techniques. One of the best thinkers/writers on the topic of matching is Elizabeth Stuart from Johns Hopkins. For this problem, take a look at her paper: "Matching Methods for Causal Inference: A Review and a Look Forward." In lecture 01 you were introduced to "balance tables" (a.k.a. "Table 1") which summarizes the covariate distribution of the observations. A handful of questions: (a) as concisely as possible, state why we focus on balance assessments as part of our argumentation when attempting to perform causal inference, (b) in addition to a balance table, name other tools used to report balance, (c) why do we use standardized mean differences instead of p-values to assess balance when assessing the quality of a match design?, and (d) why is it kinda weird to use a p-value of the covariates in a randomized trial to assess balance?       
 Solution for Review Question 3
4. In lecture 1 we quickly outlined some of the big challenges to causal inference when using observational data (see slide 41, "There should be strong effort to show the two groups are similar..."). These challenges include: inclusion/exclusion of observations, observational units that may be completely missing (censored, survival bias), missing data, imbalances in observed data, and imbalances in unobserved data. We'll address each of these at different points in the course. But let's focus on the decision to include/exclude observations. What we're doing when matching -- i.e., removing observations that do not have adequate counterparts in the contrast group -- may seem a bit subversive. The intuition is: why "throw away" data? I think there are two reasons people worry about "throwing away data." First, it seems like limiting the kinds of observations in our study we may be losing the ability to generalize our conclusions to a wider swath of the population. The counter to that is: yes, we are trading off the ability to generalize (i.e., external validity) for the ability to make stronger claims about a candidate causal effect (i.e., internal validity). The second concern is that it seems like more data is better. Formulate a response to this concern. (Note: OMG, this question seems so nebulous. Yup. That's how this works; you're playing Big Kid academics now. We made sure to mention this argument during lecture 01, so you know it. It's a common statistical argument nowadays. If you want to read your way out of this one... here's a good paper.)       
 Solution for Review Question 4
From Computing Corner
5. Exercise in pair matching. In DOS Sec 2.1, Rosenbaum works with the randomized experiment data from NSW. In Week 1,2 Computing Corner we used the constructed observational study version of these data. Use the observational study data to do a version of the 1:1 matching in DOS section 2.1. Compare the balance improvement achieved from nearest neighbor matching with the full matching results in Computing Corner Week 1,2.       
 Solution for Review Question 5
6. For the fullmatch analysis done in the Lalonde class presentation weeks 1 and 2, the outcome comparison was carried out using lmer to average the treatment effects over the 104 subclasses. A hand-wavy analogy to the paired t-test here would be to use the mean difference within each subclass. Show that (because some of the subclasses are large) this simplified analysis doesn't well replicate the lmer results.       
 Solution for Review Question 6


Week 2-- Matching Methods Part 2 (implementation); Potential Outcomes and Study Design
In the news
Perennial item: cell phones and (brain) cancer. The latest flurry from National Toxicology Program (NTP).
Can Cellphones Cause Cancer? Experts Surprised By Latest Tests     Scientists see strong link between waves from cellphones, cancer in rats     Researchers find the cellphone-cancer risk is higher than originally thought
Scientific report (week5 Stat209)   Cell phones and cancer once again.    Cellphone Radiation Linked To Tumors In Male Rats, Government Study Says       NIH Tech Reports

Lecture Topics             Lecture 2 slide deck
1. Basic tools of multivariate matching (DOS, Secs 8.1-8.4)
2. Potential outcomes framework (DOS 2.2) 
3. Fisher's sharp null; permutation test (DOS 2.3) 
4. Various practical issues in matching (DOS, Chap 9)
Text Readings
Rosenbaum DOS: Chapter 2 (plus week1 items)
Additional Resources
From Donald B. Rubin
   First section of Basic Concepts of Statistical Inference for Causal Effects in Experiments and Observational Studies    Similar material Chaps 1 and 2 Causal Inference in Statistics, Social and Biomedical Sciences: An Introduction, Guido Imbens and Don Rubin linked on main page.

Computing Corner: Extended Data Analysis Examples
1.     Finish Week 1 material, Lalonde data, links in week 1
2.     Lindner data, Percutaneous Coronary Intervention with 'evidence based medicine'.
Percutaneous coronary intervention (PCI), commonly known as coronary angioplasty or simply angioplasty, is a non-surgical procedure used to treat the stenotic (narrowed) coronary arteries of the heart found in coronary heart disease.
Lindner data in package PSAgraphics
Use of Lindner data in Vignette JSS   PSAgraphics: An R Package to Support Propensity Score Analysis  Journal of Statistical Software February 2009, Volume 29, Issue 6. http://www.jstatsoft.org/
               Week 2 handout       Rogosa R-session        cc2 pdf slides Lindner example

Week 2 Review Questions
From Computing Corner
1. The JSS vignette for PSAgraphics (linked week 2 Computing Corner) does subclassification matching for Lindner data. Repeat their subclassification analyses and try out their balance displays and tests. They have some specialized functions. Compare with our basic approach.       
Lindner data  package PSAgraphics Vignette JSS           outcome analysis, Rogosa session
2. The Week 2 presentation showed an alternative propensity score analysis -- analysis of covariance with propensity score as covariate. A rough analogy is to ancova vs blocking (where blocking is our subclassification, say quintiles). Try out the basic (here logistic regression) ancova approach for the lifepres dichotomous outcome       
 Solution for Review Question 2

From Lecture
3. Modify Fisher's Sharp Null to reflect the null hypothesis that the treatment adds five units to the outcome under control. Build a small simulation (e.g., 10 observations) and construct a table that summarizes the potential outcomes. Randomize using a fair coin flip to assign treatment or control for each observational unit. Use the permutation test to assess your data set using (i) Fisher's Sharp Null and (ii) the null hypothesis that the treatment adds five units to the outcome under control.       
 Solution for Review Question 3

4. Building off of RQ#3 above, sort your observations so they are in ascending order based on the outcome under control. Randomize two at a time: one fair coin flip now assigns either the first or second observation to treatment (and the other to control). A second fair coin flip assigns either the third or the fourth observation to treatment (and the other to control). This continues so on and so forth. Use the appropriate permutation test to assess your data set using (i) Fisher's Sharp Null and (ii) the null hypothesis that the treatment adds five units to the outcome under control. Contrast the results here with the results from RQ#3.       
 Solution for Review Question 4




Week 3-- Planning Analysis

In the news
One extra glass of wine 'will shorten your life by 30 minutes', Drinking is as harmful as smoking and consuming more than five drinks a week lowers life expectancy, say researchers
Publication: Risk thresholds for alcohol consumption: combined analysis of individual-participant data for 599?912 current drinkers in 83 prospective studies The Lancet, Volume 391, No. 10129, p1513-1523, 14 April 2018 .
Caution, consider the source, from the fine folks that brought you Wakefield (vaccines and autism). Remember "In 1998, Andrew Wakefield and 12 of his colleagues published a case series in the Lancet, which suggested that the measles, mumps, and rubella (MMR) vaccine may predispose to behavioral regression and pervasive developmental disorder in children. Despite the small sample size (n=12), the uncontrolled design, see https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3136032/

Lecture Topics             Lecture 3 slide deck
                                      optmatch example
1. Finish up: Basic tools of multivariate matching (DOS, Secs 8.1-8.4)
2. Various practical issues in matching (DOS, Chap 9)
Text Readings
Rosenbaum DOS: Chapters 8 and 9
Additional Resources
Use of optmatch and RItools: Matching in R using the optmatch and RItools packages
Smoking study (Prochaska et al 2016)
Dealing with limited overlap in estimation of average treatment effects (Crump et al 2009)  (or see http://public.econ.duke.edu/~vjh3/working_papers/overlap.pdf )
Defining the Study Population for an Observational Study to Ensure Sufficient Overlap: A Tree Approach (Traskin & Small 2011)
CONSORT Statement   (randomized trials)
STROBE Statement   (observational studies)

Week 3 Review Questions:
From Lecture
1. In this class we've shown you a couple of tools to assess the adequacy of a matched set - for example: Love plots, balance tables, standardized mean differences, and histogram plots of fitted propensity scores (or covariates). Why haven't we shown you a statistical test? That's weird, right? A ton of researchers fall for this, failing to see why assessing balance using a hypothesis test in an observational study is problematic. There are a couple of valid critiques; try articulating at least one such critique. (Hint: think about how we calculate the SMD vs a standard error.) Once you've given it a go, check out Section 6.6 of this paper (great paper!) for a couple of solutions to this question.

2. In section 6.7 of that same paper, the authors say their preferred tool for assessing balance is an empirical QQ plot. What's a QQ plot? Compare and contrast the use of QQ plots and a balance table. Neither of these tools in dominate, so what are the benefits and drawbacks to each?       
 Solution for Review Question 2




Week 4-- Models for Observational Studies and Inverse Probability Weighting

In the news
From Stanford Reports:  Religion and grades, A Graduate School of Education study finds that adolescents who practice religion on a regular basis do better in school than those who don't.   Preprint: The Abider-Avoider Achievement Gap: The Association Between GPA and Religiosity in Public Schools


Lecture Topics             Lecture 4 slide deck
1. First model for observational studies (DOS, Sections 3.1-3.3)
2. Inverse probability weighting  (link)

Computing Corner: Extended Data Analysis Examples
Alternative propensity score analyses. Propensity score weighting: Inverse Probability of Treatment Weighting (IPTW). Treatment effect estimation without matching.
A thorough R exposition using the Lalonde data   A Practical Guide for Using Propensity Score Weighting in R Practical Assessment, Research & Evaluation, v20 n13 Jun 2015.
    Also    Cox Regression, comparison with full matching (Elizabeth Stuart)

      Rogosa R-session        pdf slides shown in class


Week 4 Review Questions
From Computing Corner
1. Try out the ATE IPTW analysis (done in week4 computing corner) for the dichotomous outcome lifepres in the Lindner data. Compare with full matching results shown in class.       
 Solution for Review Question 1

2. Try an ATT IPTW analysis for log(cardbill) outcome in the Lindner data.       
 Solution for Review Question 2

From Lecture
3. The Wilcoxon signed rank test takes as its input a fixed number, designate this number I, of matched pairs. The Wilcoxon signed rank test is a permutation test with a specific test statistic. Let's explore the behavior of its statistic compared to the behavior of the average of the within-pair differences. You can use the sample code provided to simulate (i.e., simulation 1 here). Consider playing around with the sd in the data generating functions to see the impact in the histograms.
Question: what happens when we introduce one really 'weird' data point in our matched sets? Compare what happens to the distributions for mean(y_t - y_c |matched pairs) vs the Wilcoxon rank sign test. The solution is in the comments in simulation 3 in the link above.



Week 5-- Randomized Experiments and Design Sensitivity

In the news
Hormones and money.    Alpha males hurt alpha: Hedge fund managers with high testosterone underperform   (CNBC) ; The cost of being an alpha male: Study reveals testosterone-fueled hedge fund managers like those in 'Wolf of Wall Street' are more likely to LOSE money, commit fraud, and get fired (Daily Mail) ;   Hedge Fund 'Beta Males' Produce More Alpha, New Report


Lecture Topics             Lecture 5 slide deck
1. Sensitivity analysis (DOS Sections 3.4-3.7 and 3.9)
2. Gamma sensitivity analysis (DOS 3.4-3.8)

Computing Corner: Extended Data Analysis Examples
Alternative computation of propensity scores (trees, boosting). Teamed with IPTW in twang package.
           Toolkit for Weighting and Analysis of Nonequivalent Groups: A tutorial for the twang package   Lalonde data, yet again.
                Rogosa twang and ATT session with Lalonde data         Week 4,5 slides
Additional Resources:
Package rpart .     An Introduction to Recursive Partitioning Using the RPART Routines
Package gbm .     Generalized Boosted Models: A guide to the gbm package
To come, sensitivity analysis computations: package rbounds, Rosenbaum packages sensivitymv and sensitivitymw   vignette:   Two R Packages for Sensitivity Analysis in Observational Studies


Week 5 Review Questions
From Computing Corner
1. Try out, using the Lalonde data (Week 1), the boosted regression approach to computing propensity scores using Ridgeway's (via Friedman) gbm package. Are the balance and overlap results improved compared to the logistic regression estimation shown in Week 1?       
 Solution for Review Question 1

2. Try out using the Lindner data shown in the PSAgraphics vignette (JSS linked week 2), the regression tree classification (use rpart) approach for propensity score estimation. Examine resulting propensity scores, balance for matching in six suclassifications, and outcome analysis for cardbill measure.       
 Solution for Review Question 2




Week 6-- Design Sensitivity and Planning Analysis

In the news
Blue light like that from smartphones linked to some cancers     Publication: Evaluating the Association between Artificial Light-at-Night Exposure and Breast and Prostate Cancer Risk in Spain   Environ Health Perspect; April 2018 DOI:10.1289/EHP1837


Lecture Topics             Lecture 6 slide deck
1. Using multiple outcomes - coherence and known null effects (DoS 5.2.3 and 5.2.4)
2. Using a second control group - mitigating bias (DoS 5.2.2)
3. Thick description 
Reading:   The causal impact of bail on case outcomes for indigent defendants in New York City


Computing Corner: Extended Data Analysis Examples
Sensitivity analysis computations:
package rbounds,
Rosenbaum pacakges sensivitymv and sensitivitymw   vignette:   Two R Packages for Sensitivity Analysis (examples from sections 2 and 3)in Observational Studies
                Rogosa sensitivity session                   CC_6 slides

Week 6 Review Questions
From Week 6 Computing Corner
1. Mercury example (2 controls) from section 3 and 6 of Rosenbaum vignette (linked in CC_6)
Fish often contains mercury. Does eating large quantities of fish increase levels of mercury in the blood? Data set mercury in the sensitivitymw package is from the 2009-2010 National Health and Nutrition Examination Survey (NHANES) and is the example in Rosenbaum (2014). There are 397 rows or matched triples and three columns, one treated with two controls. The values are methylmercury levels in blood. Column 1, Treated, describes an individual who had at least 15 servings of fish or shellfish in the previous month. Column 2, Zero, describes an individual who had 0 servings of fish or shellfish in the previous month. Column 3, One, describes an individual who had 1 serving of fish or shellfish in the previous month. In the comparison here, Zero and One are not distinguished; both are controls. Sets were matched for gender, age, education, household income, black race, Hispanic, and cigarette consumption.
a. describe the apparent effect of fish consumption and try out sensitivity analyses (for both tests and CI) for the apparent effect of fish. c.f Rosenbaum vignette sec 3.2
b. look at the effects of weighting (method w in the sensitivitymw manual) as theory and simulations suggest that a sensitivity analysis will be more powerful if matched sets with little variability are given little weight. c.f Rosenbaum vignette sec 6.3.       
 Solution for Review Question 1

2. Demonstration--see solution. Mechanics of setting up a matched data set for the sensitivity functions. Easiest to create the data set for the most common 1:1 matching situation (merge works without needing thought); steps for 1:1 matching setting below       
 Review Question 2 with 1:1 matching




Week 7-- Discontinuity Designs and Case-Noncase Studies

In the news
1.   How Chocolate May Actually Help Combat Diabetes .   Publication: Monomeric cocoa catechins enhance beta-cell function by increasing mitochondrial respiration  The Journal of Nutritional Biochemistry Volume 49, November 2017, Pages 30-41.
2. Climate skeptics more eco-friendly than global-warming alarmists   Publication: Believing in climate change, but not behaving sustainably: Evidence from a one-year longitudinal study. Journal of Environmental Psychology Volume 56, April 2018, Pages 55-62.


Lecture Topics             Lecture 7 slide deck
Regression discontinuity - Lee and Lemieux 2011
Case-noncase study - Breslow 1998


Computing Corner:                       Dose response functions (and multiple groups): Beyond Binary Treatments
package causaldrf     vignette:   Estimating Average Dose Response Functions Using the R Package causaldrf      Rnw file for vignette      dot-R file for vignette
                                          Rogosa session, causaldrf examples
also covariate balancing propensity score, package CBPS
Background publications:
The Propensity Score with Continuous Treatments
Causal Inference With General Treatment Regimes: Generalizing the Propensity Score, Journal of the American Statistical Association, Vol. 99, No. 467 (September), pp. 854-866.       
 Slides for Dose-Response, CC_7           also,   summary ADRF slide

    
Week 7 Review Questions
From Week 7 Computing Corner
1. Dose-Response functions. IPW (aka importance sampling) can't hit the curve? Can't hit anything??
In week 7 Computing Corner we showed results for ADRF (average dose-response function) estimates using Imbens very clever artificial data example from the linked causaldrf vignette (see also CC_7 slides).
IPW results (see Weeks 3 and 4 Computing Corner for examples for binary treatements) were notable in apparant bad bad performance (all other estimates did pretty well). Keep in mind this artificial data test is not even a "phase 2" hurdle, as we are given the selection variables (X_1, X_2) that are responsible for individuals selecting dose (here denoted by T) other than randomness.
As IPW is dominant in applications like long-term occupation exposures (to bad stuff), the dose-reponse setting is quite relevant. The artificial data ADRF has an important feature of a non-monotonic dip, reminiscent of alcohol or even salt (a bit above 0 is better than zero) for health outcomes. So for another look at IPW, I tried to make a much easier example, with basically a straight-line ADRF (just with a little wiggle) by limiting dose (T) to > .5.
So try out the comparison of the hi_estimate (shown in class) and the iptw_estimate both from the causaldrf package with the true ADRF from the artificial data construction using values T > .5 (about half the data).
Are we any happier with the value of IPW (importance sampling)? Solution indicates to me: "no", YMMV.       
 Solution for Review Question 1
Diffs-in-Diffs from lecture
2.  RCT   Time 1 Time 2 (pre-post) data, Differences in Differences analysis
Comparing groups on time-1, time-2 measurements: repeated measures anova vs lmer OR the t-test on improvement
Comparative Analyses of Pretest-Posttest Research Designs, Donna R. Brogan; Michael H. Kutner, The American Statistician, Vol. 34, No. 4. (Nov., 1980), pp. 229-232.   JSTOR link
Class Example Stat222 (week4), Stat209 (week9)     BK overview
     urea synthesis, BK data       data, long-form         BK plots (by group)

     2017 Analysis handout     Extended BK lmer analysis (ascii)
3.    Time 1 Time 2 observational data, Differences in Differences analysis.
We reuse some time-1, time-2 observational data generated to illustrate Lord's paradox (week 9, Stat209, lecture item 4) -- gender differences in weight gain. (The 'paradox' is solved by Holland, Wainer, Rubin using potential outcomes.) The set up for these artificial data is females gain, males no change
  corr .7 within gender, equal vars time1 time 2 within gender
means
                M               F
X (t1)         170            120
Y (t2)         170            130 
comparison of "gains" 170 - 170 - (130 - 120) = -10    negative effect males (females gain more).
ancova: 170 - 130 - .7*(170 - 120) = 5 positive male effect
So: does being male cause a student to gain weight or lose weight?   The solution illustrates forms of diffs-in-diffs analyses.
wide form for these data      long form for these data       
 Solution for Review Question 3




Week 8-- Instrumental Variable Methods for Randomized Controlled Trials

In the news
Better Diet Tied to Bigger Brains  Dutch study shows association between food and brain structure .
   Publication: Croll, Pauline H. et al Better diet quality relates to larger brain tissue volumes. Neurology (2018): Web. 20 May. 2018.


Lecture Topics             Lecture 8 slide deck
Encouragement design (Holland 1988 )
Instrumental variable methods for causal inference ( Baiocchi, Cheng and Small 2004)
Lecture 7 addendum: Case-control studies
Case-control overview (shown in class) from Encyclopedia of Public Health
Breslow NE. Statistics in epidemiology: the case-control study.J Am Stat Assoc. 1996 Mar;91(433):14-28
Carbonated Soft Drink Consumption and Risk of Esophageal Adenocarcinoma JNCI: Journal of the National Cancer Institute, Volume 98, Issue 1, 4 January 2006, Pages 72-75,
Smoking and Lung Cancer in Chap 18 of HSAUR3 (Handbook of Statistical Analysis Using R). Also driving and backpain data in Chap 7 HSAUR2
Some R-packages and resources: SensitivityCaseControl: Sensitivity Analysis for Case-Control Studies; multipleNCC: Inverse Probability Weighting of Nested Case-Control Data;    Two-phase designs in epidemiology   (Thomas Lumley) ;   Exact McNemar's Test and Matching Confidence Intervals


Computing Corner:                   Regression Discontinuity Designs     
    Example from rdd manual (Stat209 handout)     ascii version
Angrist-Lavy Maimondes (class size) data     sections 1.3, 3.2, 5.2.3, 5.3 DOS text
              read data ang = read.dta("http://stats.idre.ucla.edu/stat/stata/examples/methods_matter/chapter9/angrist.dta")
R-package--rdd;   Regression Discontinuity Estimation Author Drew Dimmery
Also Package rdrobust Title Robust data-driven statistical inference in Regression-Discontinuity designs       
 Slides for Regression Discontinuity CC
Regression Discontinuity Resources
       Stat209, Regression Discontinuity handout
William Trochim's Knowledge Base
Trochim W.M. & Cappelleri J.C. (1992). "Cutoff assignment strategies for enhancing randomized clinical trials." Controlled Clinical Trials, 13, 190-212.  pubmed link
Journal of Econometrics (special issue) Volume 142, Issue 2, February 2008, The regression discontinuity design: Theory and applications      Regression discontinuity designs: A guide to practice, Guido W. Imbens, Thomas Lemieux
    Also from Journal of Econometrics (special issue) Volume 142, Issue 2, February 2008, Waiting for Life to Arrive: A history of the regression-discontinuity design in Psychology, Statistics and Economics, Thomas D Cook
the original paper: Thistlewaite, D., and D. Campbell (1960): "Regression-Discontinuity Analysis: An Alternative to the Ex Post Facto Experiment," Journal of Educational Psychology, 51, 309-317.
Capitalizing on Nonrandom Assignment to Treatments: A Regression-Discontinuity Evaluation of a Crime-Control Program Richard A. Berk; David Rauma Journal of the American Statistical Association, Vol. 78, No. 381. (Mar., 1983), pp. 21-27. Jstor
Berk, R.A. & de Leeuw, J. (1999). "An evaluation of California's inmate classification system using a generalized regression discontinuity design." Journal of the American Statistical Association, 94(448), 1045-1052.  Jstor

To come: Instrumental Variable Methods: packages   AER(ivreg), ivpack, ivmodel

Week 8 Review Questions
Computing Exercises
1. Regression Discontinuity, classic "Sharp" design. Replicate the package rdd toy example: cutpoint = 0, sharp design, with treatment effect of 3 units (instead of 10). Try out the analysis of covariance (Rubin 1977) estimate and compare with rdd output and plot. Pick off the observations used in the Half-BW estimate and verify using t-test or wilcoxon.
Extra: try out also the rdrobust package for this sharp design.       
 Solution for Review Question 1

2. Systematic Assignment, "fuzzy design". Probabilistic assignment on the basis of the covariate.
i. Create artificial data with the following specification. 10,000 observations; premeasure (Y_uc in my session) gaussian mean 10 variance 1. Effect of intervention (rho) if in the treatment group is 2 (or close to 2) and uncorrelated with Y_uc. Probability of being in the treatment group depends on Y_uc but is not a deterministic step-function ("sharp design"): Pr(treatment|Y_uc) = pnorm(Y_uc, 10,1) . Plot that function.
ii. Try out analysis of covariance with Y_uc as covariate. Obtain a confidence interval for the effect of the treatment.
iii. Try out the fancy econometric estimators (using finite support) as in the rdd package. See if you find that they work poorly in this very basic fuzzy design example.
Extra: try out also the rdrobust package for this fuzzy design.       
 Solution for Review Question 2

3. Case-Control Study. Matched case-control study.
From epiDisplay v3.5.0.1 by Virasakdi Chongsuvivatwong Datasets on a matched case-control study of esophageal cancer
See also matched case-control study in the epiDisplay package manual. data(VC1to1)
Data from a matched case-control study testing whether smoking, drinking alcohol and working in the rubber industry (all dichotomous) are risk factors for oesophageal cancer. Each case was matched with his/her neighbours of the same sex and age group. The matching ratio in VC1to1 is 1:1 pair matching. The data are in long form (as you would get from MatchIt) with variable matset indicating subclass and case indicating cancer or not. Discussion and analysis is found in Chap 16 of the Epicalc Book
Publication: Chongsuvivatwong, V. 1990 A case-control study of esophageal cancer in Southern Thailand. J Gastro Hep 5:391-394.       
 Rogosa session for Review Question 3



Week 9 - Instrumental Variable Methods for Observational Studies

In the news
1.   Glasses wearers really are more intelligent - it's in the genes       Wear glasses? Study suggests you might be smarter than those who don't
Publication: Nature Communications. Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function
2.   Egg a day tied to lower risk of heart disease   Publication: Associations of egg consumption with cardiovascular disease in a cohort study of 0.5 million Chinese adults,  Heart  Published Online First: 21 May 2018.
3.   Workplace anxiety isn't always a bad thing -- in some cases it can help boost employee performance        A new study says your anxiety can make you better at your job   Publication: Journal of Applied Psychology


Lecture Topics             Lecture 9 slide deck
Instrumental variable methods for causal inference ( Baiocchi, Cheng and Small 2004)
Computing Corner:                  Instrumental Variables: IV basics and implementation.            Music: Wishin' and hopin'
                          IV handout           CC_9 slides         Rogosa IV sessions, examples
Additional resources:
AER package
ivmodel package    vignette

Week 9 Review Questions
Computing Exercises
1. Broken RCT: Compliance, measured or binary Compliance as a measured variable. In Stat209 week 7 we also examine compliance adjustments; both those based on a dichotomous compliance variable and the much much more common measured compliance (often unwisely dichotomized to match Rubin formulation). The Efron-Feldman study ( handout description) used a continuous compliance measure. An artificial data set a data frame containing Compliance, Group, and Outcome for Stat209 HW7 is constructed so that ITT for cholesterol reduction is about 20 (compliance .6) and effect of cholestyramine for perfect compliance is about 35. Try out some IV estimators for CACE. Obtain ITT estimate of group (treatment) effect with a confidence interval. Try using G as an instrument for the Y ~ comp regression. What does that produce? Alternatively use the Rubin formulation with a dichotomous compliance indicator defined as TRUE for compliance > .8 in these data. What is your CACE estimate. What assumptions did you make? Compare with ITT estimate. In this problem the ivreg function from AER package is used for IV estimation.       
 Solution for Review Question 1
More problem 1   1. Compliance data, IV analysis, imitating Efron-Feldman cholestyramine trial. Solution showed you the widely used ivreg function from package AER package. Redo the ivreg analyses using functions from the ivmodel package.       
 Solution for more Review Question 1

2. Observational study: Use the Card data, described in the ivmodel vignette, to carry out some basic IV analyses. Compare ivreg with some analyses using the ivmodel package.       
 Solution for Review Question 2
3. RCT, Encouragement Design. Sesamee Street. The CC example and Rogosa session, used postlet (letter recognition) as the outcome. The data also contains pretest measures (before encouragement), so an alternative outcome is improvement, post - pre, for letter recognition. Repeat the IV encouragement design analysis for the outcome improvement in letter recognition. Time1-Time2 data week4 of Stat222 (longitudinal research), and week9 Stat209.       
 Solution for Review Question 3