论文标题
临床风险预测,具有时间概率的不对称多任务学习
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning
论文作者
论文摘要
尽管最近的多任务学习方法已证明可以有效地改善深层神经网络的概括,但应谨慎使用它们,以进行至关重要的关键应用应用,例如临床风险预测。这是因为即使他们实现了提高的任务平均绩效,它们仍可能在单个任务上产生降低的绩效,这可能是至关重要的(例如,预测死亡率风险)。现有的非对称多任务学习方法通过从损失低的任务转移到高损失的任务来解决这一负面转移问题。但是,使用损失作为可靠性量度是有风险的,因为它可能是由于过度拟合而导致的。在时间序列预测任务的情况下,在特定时间段上学习的一项任务的知识(例如,预测败血症发作)可能在以后的时间段上学习另一个任务(例如,死亡率的预测),但每个时间段缺乏损失,因此很难在每个时间段上衡量可靠性。为了捕获时间序列数据中任务之间的这种动态变化的不对称关系,我们提出了一种新型的时间不对称的多任务学习模型,该模型基于功能级别的不确定性,将知识转移从某些任务/时间段转移到相关的不确定任务。我们针对时间序列预测的各种深度学习模型验证了多个临床风险预测任务的模型,我们的模型明显胜过表现,而没有任何负转移迹象。临床医生对学习知识图的进一步定性分析表明,它们有助于分析模型的预测。我们的最终代码可在https://github.com/anhtuan5696/tpamtl上找到。
Although recent multi-task learning methods have shown to be effective in improving the generalization of deep neural networks, they should be used with caution for safety-critical applications, such as clinical risk prediction. This is because even if they achieve improved task-average performance, they may still yield degraded performance on individual tasks, which may be critical (e.g., prediction of mortality risk). Existing asymmetric multi-task learning methods tackle this negative transfer problem by performing knowledge transfer from tasks with low loss to tasks with high loss. However, using loss as a measure of reliability is risky since it could be a result of overfitting. In the case of time-series prediction tasks, knowledge learned for one task (e.g., predicting the sepsis onset) at a specific timestep may be useful for learning another task (e.g., prediction of mortality) at a later timestep, but lack of loss at each timestep makes it difficult to measure the reliability at each timestep. To capture such dynamically changing asymmetric relationships between tasks in time-series data, we propose a novel temporal asymmetric multi-task learning model that performs knowledge transfer from certain tasks/timesteps to relevant uncertain tasks, based on feature-level uncertainty. We validate our model on multiple clinical risk prediction tasks against various deep learning models for time-series prediction, which our model significantly outperforms, without any sign of negative transfer. Further qualitative analysis of learned knowledge graphs by clinicians shows that they are helpful in analyzing the predictions of the model. Our final code is available at https://github.com/anhtuan5696/TPAMTL.