论文标题

性爱与公平的机器学习有什么关系?

What's Sex Got To Do With Fair Machine Learning?

论文作者

Hu, Lily, Kohler-Hausmann, Issa

论文摘要

关于机器学习中公平性的辩论主要集中在群体之间需要什么公平或非歧视的竞争定义上。但是,几乎没有关注一个小组。许多最近的“公平”方法要求一个人指定数据生成过程的因果模型。这些练习是一个隐性的本体论假设,即种族或性别群体只是一个具有给定特征的人的集合。我们通过探索因果模型中模块化的形式假设来证明这一点,该假设认为,一种因果途径捕获的依赖性是在任何其他途径上的干预措施不变的。性别的因果模型提出了两个实质性的主张:1)存在一个特征,性别与自己所拥有的特征,这是一个人的固有特征,它在因果关系中带来了世界外部的社会现象; 2)性别及其效果之间的关系可以通过哪种方式进行修改,而前者的特征仍然可以保留性别在我们世界中具有的含义。我们认为这种本体论的情况是错误的。性别据称“原因”的许多“效果”实际上是性别作为社会地位的构成特征。他们赋予了性特征的社会意义,含义恰恰是使性别歧视成为一种独特的道德上有问题的行动的原因。纠正此概念错误对如何使用模型来检测歧视具有许多影响。构型关系的形式图呈现出有关歧视推理的完全不同的途径。尽管因果图指导了复杂的模块化反事实的构建,但本构图将另一种反事实识别为对歧视的询问的核心:询问如果改变其非模块化特征,则询问群体的社会含义将如何改变。

Debate about fairness in machine learning has largely centered around competing definitions of what fairness or nondiscrimination between groups requires. However, little attention has been paid to what precisely a group is. Many recent approaches to "fairness" require one to specify a causal model of the data generating process. These exercises make an implicit ontological assumption that a racial or sex group is simply a collection of individuals who share a given trait. We show this by exploring the formal assumption of modularity in causal models, which holds that the dependencies captured by one causal pathway are invariant to interventions on any other pathways. Causal models of sex propose two substantive claims: 1) There exists a feature, sex-on-its-own, that is an inherent trait of an individual that causally brings about social phenomena external to it in the world; and 2) the relations between sex and its effects can be modified in whichever ways and the former feature would still retain the meaning that sex has in our world. We argue that this ontological picture is false. Many of the "effects" that sex purportedly "causes" are in fact constitutive features of sex as a social status. They give the social meaning of sex features, meanings that are precisely what make sex discrimination a distinctively morally problematic type of action. Correcting this conceptual error has a number of implications for how models can be used to detect discrimination. Formal diagrams of constitutive relations present an entirely different path toward reasoning about discrimination. Whereas causal diagrams guide the construction of sophisticated modular counterfactuals, constitutive diagrams identify a different kind of counterfactual as central to an inquiry on discrimination: one that asks how the social meaning of a group would be changed if its non-modular features were altered.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源