论文标题
在回归校准中的协变量的因果选择,连续暴露次数不足
Causal Selection of Covariates in Regression Calibration for Mismeasured Continuous Exposure
论文作者
论文摘要
Rosner,Spiegelman和Willet开发的回归校准用于纠正由于连续暴露的测量误差而导致的偏差。该方法涉及两个模型:一个测量误差模型(MEM),该模型(MEM)将接触不高的暴露与真实暴露的暴露和结果模型有关,将接触不足的暴露与结果有关。但是,不存在确定每个模型中应包括哪些协变量的全面指导。在本文中,我们研究了因果推理框架下的最小和最有效的协变量调整集的选择。我们表明,为了纠正测量误差,研究人员必须在MEM和结果模型中调整真正暴露的任何常见原因(1)以及测量误差和结果的结果以及(2)。如果仅在主要研究中可用此类变量时,研究人员仍应在结果模型中对它们进行调整以减少偏差,前提是这些协变量最多与测量误差相关。我们还表明,调整所谓的预后变量,这些变量独立于真正的暴露和结果模型中的测量误差,可能会提高效率,同时调整仅与真正暴露的任何协变量,通常会导致现实设置中的效率损失。我们将拟议的协变量选择方法应用于卫生专业的后续研究数据集,以研究纤维摄入对心血管疾病的影响。最后,我们将最初提出的估计量扩展到允许协变量效果修改的非参数设置。
Regression calibration as developed by Rosner, Spiegelman and Willet is used to correct the bias in effect estimates due to measurement error in continuous exposures. The method involves two models: a measurement error model (MEM) relating the mismeasured exposure to the true exposure and an outcome model relating the mismeasured exposure to outcome. However, no comprehensive guidance exists for determining which covariates should be included in each model. In this paper, we investigate the selection of the minimal and most efficient covariate adjustment sets under a causal inference framework. We show that in order to correct for the measurement error, researchers must adjust for, in both MEM and outcome model, any common causes (1) of true exposure and the outcome and (2) of measurement error and the outcome. When such variable(s) are only available in the main study, researchers should still adjust for them in the outcome model to reduce bias, provided that these covariates are at most weakly associated with measurement error. We also show that adjusting for so called prognostic variables that are independent of true exposure and measurement error in outcome model, may increase efficiency, while adjusting for any covariates that are associated only with true exposure generally results in efficiency loss in realistic settings. We apply the proposed covariate selection approach to the Health Professional Follow-up Study dataset to study the effect of fiber intake on cardiovascular disease. Finally, we extend the originally proposed estimators to a non-parametric setting where effect modification by covariates is allowed.