DeepFake检测：人类与机器

论文标题

DeepFake检测：人类与机器

Deepfake detection: humans vs. machines

论文作者

Korshunov, Pavel, Marcel, Sébastien

论文摘要

Deepfake视频自动与其他人的面孔自动互换，而更现实的结果变得更容易产生。为了应对这种操纵的威胁，可能会构成我们对视频证据的信任，最近提出了一些大型Deepfake视频数据集和许多检测它们的方法。但是，尚不清楚普通人的逼真效果视频如何现实，以及算法在检测它们时是否比人类要好得多。在本文中，我们介绍了一项在类似众包的场景中进行的主观研究，该研究系统地评估了人类查看视频是否深层摄影的难度。为了进行评估，我们使用了从Facebook Deepfake数据库中预先选择的120个不同的视频（60个深击和60个原件），该数据库是在Kaggle的2020年Deepfake Technect Challenge Challenge中提供的。对于每个视频，一个简单的问题：“一个真实的视频中的人的面孔？”平均由19个幼稚的受试者回答。将主观评估的结果与基于Xception和效率（B4变体）神经网络的两种不同状态的Deepfake检测方法的性能进行了比较，这些神经网络已在其他两个大型公共数据库中进行了预先训练：Faceforensics ++的Google子集和最近的Celeb-DF数据集。评估表明，尽管人类的看法与机器的感知截然不同，但无论是成功还是以不同的方式被深层涂抹欺骗。具体而言，算法难以检测那些深击视频，人类受试者发现这些视频很容易发现。

Deepfake videos, where a person's face is automatically swapped with a face of someone else, are becoming easier to generate with more realistic results. In response to the threat such manipulations can pose to our trust in video evidence, several large datasets of deepfake videos and many methods to detect them were proposed recently. However, it is still unclear how realistic deepfake videos are for an average person and whether the algorithms are significantly better than humans at detecting them. In this paper, we present a subjective study conducted in a crowdsourcing-like scenario, which systematically evaluates how hard it is for humans to see if the video is deepfake or not. For the evaluation, we used 120 different videos (60 deepfakes and 60 originals) manually pre-selected from the Facebook deepfake database, which was provided in the Kaggle's Deepfake Detection Challenge 2020. For each video, a simple question: "Is face of the person in the video real of fake?" was answered on average by 19 naïve subjects. The results of the subjective evaluation were compared with the performance of two different state of the art deepfake detection methods, based on Xception and EfficientNets (B4 variant) neural networks, which were pre-trained on two other large public databases: the Google's subset from FaceForensics++ and the recent Celeb-DF dataset. The evaluation demonstrates that while the human perception is very different from the perception of a machine, both successfully but in different ways are fooled by deepfakes. Specifically, algorithms struggle to detect those deepfake videos, which human subjects found to be very easy to spot.

下载PDF全文

下载文献需遵守相关版权规定

论文标题