Abstract
Stratifying and matching by the propensity score are increasingly popular approaches to deal with confounding in medical studies investigating effects of a treatment or exposure. A more traditional alternative technique is the direct adjustment for confounding in regression models. This paper discusses fundamental differences between the two approaches, with a focus on linear regression and propensity score stratification, and identifies points to be considered for an adequate comparison. The treatment estimators are examined for unbiasedness and efficiency. This is illustrated in an application to real data and supplemented by an investigation on properties of the estimators for a range of underlying linear models. We demonstrate that in specific circumstances the propensity score estimator is identical to the effect estimated from a full linear model, even if it is built on coarser covariate strata than the linear model. As a consequence the coarsening property of the propensity score - adjustment for a one-dimensional confounder instead of a high-dimensional covariate - may be viewed as a way to implement a pre-specified, richly parametrized linear model. We conclude that the propensity score estimator inherits the potential for overfitting and that care should be taken to restrict covariates to those relevant for outcome.
Original language | English |
---|---|
Pages (from-to) | 5529-5544 |
Number of pages | 16 |
Journal | Statistics in Medicine |
Volume | 26 |
Issue number | 30 |
DOIs | |
Publication status | Published - 30 Dec 2007 |
Externally published | Yes |
Keywords
- Conditional independence
- Confounding
- Gauss-Markov theorem
- Linear models
- Propensity score