论文标题

对抗性的连续图像

Adversarial Generation of Continuous Images

论文作者

Skorokhodov, Ivan, Ignatyev, Savva, Elhoseiny, Mohamed

论文摘要

在大多数现有的学习系统中,图像通常被视为2D像素阵列。然而,在另一个范式中获得流行,2D图像表示为隐式神经表示(INR) - 一个MLP,它预测了鉴于其(x,y)坐标的RGB像素值。在本文中,我们提出了两种用于构建基于INR的图像解码器的新型架构技术:分解的乘法调制和多尺度INR,并使用它们来构建最先进的连续图像gan。以前将INR适应图像生成的尝试仅限于类似MNIST的数据集,并且不扩展到复杂的现实世界数据。我们提出的INR-GAN架构将连续图像发生器的性能提高了几次,从而大大减少了连续图像gan和基于像素的图像之间的差距。除此之外,我们还探索了基于INR的解码器的几个令人兴奋的属性,例如开箱即用的超分辨率,有意义的图像空间插值,加速的低分辨率图像推理,能够推断出图像边界外部外部外部的能力以及强大的几何图。项目页面位于https://universome.github.io/inr-gan。

In most existing learning systems, images are typically viewed as 2D pixel arrays. However, in another paradigm gaining popularity, a 2D image is represented as an implicit neural representation (INR) - an MLP that predicts an RGB pixel value given its (x,y) coordinate. In this paper, we propose two novel architectural techniques for building INR-based image decoders: factorized multiplicative modulation and multi-scale INRs, and use them to build a state-of-the-art continuous image GAN. Previous attempts to adapt INRs for image generation were limited to MNIST-like datasets and do not scale to complex real-world data. Our proposed INR-GAN architecture improves the performance of continuous image generators by several times, greatly reducing the gap between continuous image GANs and pixel-based ones. Apart from that, we explore several exciting properties of the INR-based decoders, like out-of-the-box superresolution, meaningful image-space interpolation, accelerated inference of low-resolution images, an ability to extrapolate outside of image boundaries, and strong geometric prior. The project page is located at https://universome.github.io/inr-gan.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源