Comparison of propensity score (PS) weighting and trimming strategies to reduce variance and bias of treatment effect estimates: A simulation study

Til Stürmer; KJ Rothman; Alan R. Ellis; Richard Wyss; Mitchell Conover; Mark Lunt; Robert J Glynn

Comparison of propensity score (PS) weighting and trimming strategies to reduce variance and bias of treatment effect estimates

A simulation study

Stürmer, T., Rothman, KJ., Ellis, A. R., Wyss, R., Conover, M., Lunt, M., & Glynn, R. J. (2018). Comparison of propensity score (PS) weighting and trimming strategies to reduce variance and bias of treatment effect estimates: A simulation study. Pharmacoepidemiology and Drug Safety, 27(S2), Article 110. https://doi.org/10.1002/pds.4629

Copy citation

Abstract

Background: To reduce the variance of PS‐based treatment effect estimates, biostatisticians have proposed various covariate‐balancing weights and trimming the tails of the PS distribution. In parallel, pharmacoepidemiologists have proposed PS trimming to reduce confounding in the tails of the PS distribution where unmeasured factors can cause patients to be treated contrary to prediction.

Objectives: To compare the performance of PS weighting and trimming methods with respect to both variance and bias.

Methods: We simulated cohort studies with a binary intended treatment T as a function of 4 measured covariates (confounders and instruments, dichotomous and continuous). We mimicked treatment withheld and last‐resort treatment by adding two “unmeasured” dichotomous factors that caused treatment assignment to change for some patients in both tails of the PS distribution. The number of outcomes Y was simulated as a Poisson function of T, all confounders (including unmeasured), and 2 risk factors. We estimated the PS based on measured covariates using logistic regression and trimmed the tails of the PS distribution using 3 strategies (Crump et al., Biometrika 2009; Stürmer et al., AJE 2010; and Walker et al, Comp Eff Res 2013). After trimming, we re‐estimated the PS and then implemented various PS weights to estimate the treatment effect (rate ratio) in the population (ATE), the treated (ATT), the untreated (ATU), the overlap (ATO), and the matched (ATM) (Li et al., JASA 2017).

Results: With no unmeasured confounding and 20% treatment prevalence, relative efficiency (RE) versus the untrimmed ATE ranged from 61% (ATU, Walker) to 123% (ATO, untrimmed). Crump trimming improved efficiency for ATE (RE = 112%), but not for ATO (RE = 120% vs 123%). With unmeasured confounding leading to treatment withheld, ATO and ATM were more biased than ATE, and only Stürmer and Walker trimming reduced bias for all estimates. Mean squared error (MSE) was lowest for Stürmer trimming. With unmeasured confounding leading to last‐resort treatment, ATT, ATO, and ATM were less biased than ATE, and all trimming methods reduced bias. MSE was lowest for Crump trimming.

Conclusions: ATO and Crump trimming reduce the variance of PS‐weighted treatment effect estimates compared with ATE but change the causal interpretation. In settings where unmeasured confounding (eg, frailty) may lead physicians to withhold treatment, only Stürmer and Walker trimming reduce bias consistently in scenarios assessed.

Publications Info

To contact an RTI author, request a report, or for additional information about publications by our experts, send us your request.

publications@rti.org

RTI shares its evidence-based research - through peer-reviewed publications and media - to ensure that it is accessible for others to build on, in line with our mission and scientific standards.

Recent Publications

Article

Patient-reported outcome improvements following scalp hair regrowth among patients with Alopecia Areata: analysis of the ALLEGRO-2b/3 trial

December 2025

Article

Plain language summary of mortality rates of patients with Parkinson’s disease psychosis who were treated either with pimavanserin or with different second-generation (atypical) antipsychotics

December 2025

Article

Biological parenthood rates among men with sickle cell disease

December 2025

Article

Patterns of felt stigma among rural-dwelling people who use drugs: A latent class analysis

December 2025

Article

One voice and vision: How the RISE network built a collective identity as the foundation for strategic dissemination

December 2025

Article

Estimating community-level prevalence of opioid use disorder: Extrapolating from Medicaid claims data and other publicly available data sources in Ohio, USA

December 2025

Article

Experiences of parents who receive a false-positive CK-MM screening for their newborn

December 2025

Article

Evaluating the efficacy and safety of milrinone for prevention of post-patent ductus arteriosus closure syndrome (the MIDAS trial) in extremely preterm infants: A multicentre, double-masked, randomised, placebo-controlled trial

December 2025

View All Publications