论文标题

评论:2020年人口普查披露系统的政策评估的基本作用

Comment: The Essential Role of Policy Evaluation for the 2020 Census Disclosure Avoidance System

论文作者

Kenny, Christopher T., Kuriwaki, Shiro, McCartan, Cory, Rosenman, Evan T. R., Simko, Tyler, Imai, Kosuke

论文摘要

在“差异观点:围绕美国人口普查局使用差异隐私的使用的认知脱节”中,博伊德和萨拉西认为,对人口普查披露避免系统(DAS)的经验评估,包括我们发表的分析,未能评估2020年DAS的基准数据对2020年DAS的评估从来没有评估过人口的基础,这是人数的基数。在这篇评论中,我们解释了为什么政策评估是我们分析的主要目标,但在没有完美的基础真理的情况下仍然是有意义的。我们还指出,我们的评估利用了特定于十年型人口普查和重新分配数据的特征,例如在交换和选民归档种族识别的块级人口不变性,更好地近似于与地面真相进行比较。最后,我们表明,基于贝叶斯改进的姓氏地理编码的姓氏的准确统计预测,虽然并不违反差异隐私,但大大增加了人口普查局寻求保护的私人信息的披露风险。我们结论说,决策者必须面对数据效用和隐私保护之间的关键权衡,而仅认知脱节就不足以解释政策选择之间的分歧。

In "Differential Perspectives: Epistemic Disconnects Surrounding the US Census Bureau's Use of Differential Privacy," boyd and Sarathy argue that empirical evaluations of the Census Disclosure Avoidance System (DAS), including our published analysis, failed to recognize how the benchmark data against which the 2020 DAS was evaluated is never a ground truth of population counts. In this commentary, we explain why policy evaluation, which was the main goal of our analysis, is still meaningful without access to a perfect ground truth. We also point out that our evaluation leveraged features specific to the decennial Census and redistricting data, such as block-level population invariance under swapping and voter file racial identification, better approximating a comparison with the ground truth. Lastly, we show that accurate statistical predictions of individual race based on the Bayesian Improved Surname Geocoding, while not a violation of differential privacy, substantially increases the disclosure risk of private information the Census Bureau sought to protect. We conclude by arguing that policy makers must confront a key trade-off between data utility and privacy protection, and an epistemic disconnect alone is insufficient to explain disagreements between policy choices.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源