recsys公平度量指标：许多要使用，但要选择哪一个？

论文标题

recsys公平度量指标：许多要使用，但要选择哪一个？

RecSys Fairness Metrics: Many to Use But Which One To Choose?

论文作者

Smith, Jessie J., Beattie, Lex

论文摘要

近年来，推荐和排名系统在数字平台上变得越来越流行。但是，以前的工作强调了个性化系统如何对用户造成无意的危害。从业者要求指标来衡量和减轻生产系统中的这些危害。为了满足这一需求，Recsys社区已经引入和探讨了许多公平定义。不幸的是，这导致了可能的公平指标的扩散，从业者可以从中选择。指标的数量和复杂性的增加创造了从业者深入了解公平定义和实现的细微差别。此外，从业人员需要了解这些指标伴随着负责任实施的道德准则。最近的工作表明，道德准则存在扩散，并指出需要更多的实施指导，而不是单独的原则。各种各样的可用指标，再加上缺乏可接受的标准或实践中的共享知识，从而为从业人员导航带来了充满挑战的环境。在该职位论文中，我们关注研究界与从业者之间有关指标的可用性与将其付诸实践的能力的扩大差距。我们通过当前的工作解决了这一差距，该工作着重于开发方法，以帮助ML从业人员在选择公平指标的决策过程中进行建议和排名系统。在我们的迭代设计访谈中，我们已经发现，从业者在完善公平限制时需要实用和反思性指导。鉴于从业者面临越来越多的挑战，要利用正确的指标，同时平衡复杂的公平环境，这尤其重要。

In recent years, recommendation and ranking systems have become increasingly popular on digital platforms. However, previous work has highlighted how personalized systems might lead to unintentional harms for users. Practitioners require metrics to measure and mitigate these types of harms in production systems. To meet this need, many fairness definitions have been introduced and explored by the RecSys community. Unfortunately, this has led to a proliferation of possible fairness metrics from which practitioners can choose. The increase in volume and complexity of metrics creates a need for practitioners to deeply understand the nuances of fairness definitions and implementations. Additionally, practitioners need to understand the ethical guidelines that accompany these metrics for responsible implementation. Recent work has shown that there is a proliferation of ethics guidelines and has pointed to the need for more implementation guidance rather than principles alone. The wide variety of available metrics, coupled with the lack of accepted standards or shared knowledge in practice leads to a challenging environment for practitioners to navigate. In this position paper, we focus on this widening gap between the research community and practitioners concerning the availability of metrics versus the ability to put them into practice. We address this gap with our current work, which focuses on developing methods to help ML practitioners in their decision-making processes when picking fairness metrics for recommendation and ranking systems. In our iterative design interviews, we have already found that practitioners need both practical and reflective guidance when refining fairness constraints. This is especially salient given the growing challenge for practitioners to leverage the correct metrics while balancing complex fairness contexts.

下载PDF全文

下载文献需遵守相关版权规定

论文标题