论文标题
用于多功能图像合成的调制对比度
Modulated Contrast for Versatile Image Synthesis
论文作者
论文摘要
感知图像之间的相似性一直是各种视觉生成任务的长期存在和基本问题。主要的方法通过计算刻度的绝对偏差来测量图像间距离,这倾向于估计实例分布的中值并导致生成图像中的模糊和伪影。本文介绍了蒙斯(Monce),这是一种多功能度量,它引入了图像对比度,以学习校准的度量,以感知多方面的图像间距离。与香草的对比不同,这不论是在锚点中不分青红差异的样本,无论其相似性如何,我们建议根据与锚的相似性调整负样品的推动力,从而有助于从信息性的负样品中进行对比学习。由于图像距离测量中涉及多个贴片级对比目标,因此我们在蒙斯引入了最佳运输,以在多个对比目标上协同调节负样本的推动力。对多个图像翻译任务进行的广泛实验表明,所提出的Monce的表现大大优于各种普遍的指标。该代码可从https://github.com/fnzhan/monce获得。
Perceiving the similarity between images has been a long-standing and fundamental problem underlying various visual generation tasks. Predominant approaches measure the inter-image distance by computing pointwise absolute deviations, which tends to estimate the median of instance distributions and leads to blurs and artifacts in the generated images. This paper presents MoNCE, a versatile metric that introduces image contrast to learn a calibrated metric for the perception of multifaceted inter-image distances. Unlike vanilla contrast which indiscriminately pushes negative samples from the anchor regardless of their similarity, we propose to re-weight the pushing force of negative samples adaptively according to their similarity to the anchor, which facilitates the contrastive learning from informative negative samples. Since multiple patch-level contrastive objectives are involved in image distance measurement, we introduce optimal transport in MoNCE to modulate the pushing force of negative samples collaboratively across multiple contrastive objectives. Extensive experiments over multiple image translation tasks show that the proposed MoNCE outperforms various prevailing metrics substantially. The code is available at https://github.com/fnzhan/MoNCE.