对抗性的连续图像

论文标题

对抗性的连续图像

Adversarial Generation of Continuous Images

论文作者

Skorokhodov, Ivan, Ignatyev, Savva, Elhoseiny, Mohamed

论文摘要

在大多数现有的学习系统中，图像通常被视为2D像素阵列。然而，在另一个范式中获得流行，2D图像表示为隐式神经表示（INR） - 一个MLP，它预测了鉴于其（x，y）坐标的RGB像素值。在本文中，我们提出了两种用于构建基于INR的图像解码器的新型架构技术：分解的乘法调制和多尺度INR，并使用它们来构建最先进的连续图像gan。以前将INR适应图像生成的尝试仅限于类似MNIST的数据集，并且不扩展到复杂的现实世界数据。我们提出的INR-GAN架构将连续图像发生器的性能提高了几次，从而大大减少了连续图像gan和基于像素的图像之间的差距。除此之外，我们还探索了基于INR的解码器的几个令人兴奋的属性，例如开箱即用的超分辨率，有意义的图像空间插值，加速的低分辨率图像推理，能够推断出图像边界外部外部外部的能力以及强大的几何图。项目页面位于https://universome.github.io/inr-gan。

In most existing learning systems, images are typically viewed as 2D pixel arrays. However, in another paradigm gaining popularity, a 2D image is represented as an implicit neural representation (INR) - an MLP that predicts an RGB pixel value given its (x,y) coordinate. In this paper, we propose two novel architectural techniques for building INR-based image decoders: factorized multiplicative modulation and multi-scale INRs, and use them to build a state-of-the-art continuous image GAN. Previous attempts to adapt INRs for image generation were limited to MNIST-like datasets and do not scale to complex real-world data. Our proposed INR-GAN architecture improves the performance of continuous image generators by several times, greatly reducing the gap between continuous image GANs and pixel-based ones. Apart from that, we explore several exciting properties of the INR-based decoders, like out-of-the-box superresolution, meaningful image-space interpolation, accelerated inference of low-resolution images, an ability to extrapolate outside of image boundaries, and strong geometric prior. The project page is located at https://universome.github.io/inr-gan.

下载PDF全文

下载文献需遵守相关版权规定

论文标题