论文标题
基于主要梯度期望的可转移性估算
Transferability Estimation Based On Principal Gradient Expectation
论文作者
论文摘要
转移学习旨在通过转移源任务中获得的知识来提高目标任务的执行。标准方法是预训练,然后进行微调或线性探测。特别是,在预定义的任务下为特定目标域选择适当的源域对于提高效率和有效性至关重要。通常,通过估计可传递性来解决此问题。但是,现有方法无法在绩效和成本之间取消权衡。为了全面评估估计方法,我们总结了三个属性:稳定性,可靠性和效率。在他们的基础上,我们提出了主要梯度期望(PGE),这是一种评估可转移性的简单但有效的方法。具体而言,我们使用重新启动方案多次计算每个重量单位上的梯度,然后计算所有梯度的期望。最后,通过计算归一化主梯度的间隙来估计源和目标之间的可传递性。广泛的实验表明,所提出的指标优于所有特性的最新方法。
Transfer learning aims to improve the performance of target tasks by transferring knowledge acquired in source tasks. The standard approach is pre-training followed by fine-tuning or linear probing. Especially, selecting a proper source domain for a specific target domain under predefined tasks is crucial for improving efficiency and effectiveness. It is conventional to solve this problem via estimating transferability. However, existing methods can not reach a trade-off between performance and cost. To comprehensively evaluate estimation methods, we summarize three properties: stability, reliability and efficiency. Building upon them, we propose Principal Gradient Expectation(PGE), a simple yet effective method for assessing transferability. Specifically, we calculate the gradient over each weight unit multiple times with a restart scheme, and then we compute the expectation of all gradients. Finally, the transferability between the source and target is estimated by computing the gap of normalized principal gradients. Extensive experiments show that the proposed metric is superior to state-of-the-art methods on all properties.