论文标题

选择偏见校正的可视化通过动态重新加权

Selection-Bias-Corrected Visualization via Dynamic Reweighting

论文作者

Borland, David, Zhang, Jonathan, Kaul, Smiti, Gotz, David

论文摘要

来自复杂系统的大规模数据的收集和视觉分析,例如电子健康记录或ClickStream数据,在广泛的行业中变得越来越普遍。但是,这种回顾性的视觉分析容易产生各种选择偏置效应,尤其是对于在任何给定时间只能可视化的高维数据的高维数据。当分析人员在临时分析过程中动态应用过滤器或执行分组操作时,选择偏差的风险甚至更高。这些偏见效应威胁着视觉分析中发现的见解的有效性和概括性,作为决策的基础。过去的工作集中在偏见透明度上,帮助用户了解何时可能发生选择偏差。但是,通过缓解偏差来反驳选择偏差的影响通常保留给用户作为一个单独的过程完成。动态重新加权(DR)是一种新型的计算方法,用于选择偏置缓解措施,可帮助用户制作偏置校正的可视化。本文介绍了DR工作流程,引入了关键DR可视化设计,并提出了支持DR过程的统计方法。还报道了医疗领域的用例以及域专家用户访谈的发现。

The collection and visual analysis of large-scale data from complex systems, such as electronic health records or clickstream data, has become increasingly common across a wide range of industries. This type of retrospective visual analysis, however, is prone to a variety of selection bias effects, especially for high-dimensional data where only a subset of dimensions is visualized at any given time. The risk of selection bias is even higher when analysts dynamically apply filters or perform grouping operations during ad hoc analyses. These bias effects threatens the validity and generalizability of insights discovered during visual analysis as the basis for decision making. Past work has focused on bias transparency, helping users understand when selection bias may have occurred. However, countering the effects of selection bias via bias mitigation is typically left for the user to accomplish as a separate process. Dynamic reweighting (DR) is a novel computational approach to selection bias mitigation that helps users craft bias-corrected visualizations. This paper describes the DR workflow, introduces key DR visualization designs, and presents statistical methods that support the DR process. Use cases from the medical domain, as well as findings from domain expert user interviews, are also reported.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源