检测视频中的面部面部重演

论文标题

检测视频中的面部面部重演

Detecting Face2Face Facial Reenactment in Videos

论文作者

Kumar, Prabhat, Vatsa, Mayank, Singh, Richa

论文摘要

视觉内容已成为每天在互联网上共享和上传的数十亿张图像和视频中明显的信息来源。这导致图像和视频的变化增加，使它们在全球观众更有益和引人注目。这些更改中的一些很简单，例如复制移动，并且很容易检测到，而其他复杂的改动（例如基于重演的深层）很难检测到。重新制定更改使源可以更改目标表达式并创建照片真实的图像和视频。尽管技术可以用于多种应用，但对自动重演的恶意使用具有很大的社会影响。因此，重要的是要开发检测技术来区分实际图像和视频。这项研究提出了一种基于学习的算法，用于检测基于重演的改变。所提出的算法使用一个多流网络，该网络学习区域工件，并在各种压缩水平上提供了稳健的性能。我们还为拟议网络的流平衡学习提供了损失函数。该性能是在公开可用的FaceForensics数据集上评估的。结果表明，对于NO，简单和硬压缩因子，最新的分类精度分别为99.96％，99.10％和91.20％。

Visual content has become the primary source of information, as evident in the billions of images and videos, shared and uploaded on the Internet every single day. This has led to an increase in alterations in images and videos to make them more informative and eye-catching for the viewers worldwide. Some of these alterations are simple, like copy-move, and are easily detectable, while other sophisticated alterations like reenactment based DeepFakes are hard to detect. Reenactment alterations allow the source to change the target expressions and create photo-realistic images and videos. While technology can be potentially used for several applications, the malicious usage of automatic reenactment has a very large social implication. It is therefore important to develop detection techniques to distinguish real images and videos with the altered ones. This research proposes a learning-based algorithm for detecting reenactment based alterations. The proposed algorithm uses a multi-stream network that learns regional artifacts and provides a robust performance at various compression levels. We also propose a loss function for the balanced learning of the streams for the proposed network. The performance is evaluated on the publicly available FaceForensics dataset. The results show state-of-the-art classification accuracy of 99.96%, 99.10%, and 91.20% for no, easy, and hard compression factors, respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题