论文标题
频率思考:通过采矿频率了解线索的面对伪造检测
Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues
论文作者
论文摘要
随着现实的面部操纵技术取得了显着的进步,社会对这些技术潜在恶意滥用的关注提出了一个新兴的研究主题。但是,由于最近的进步能够超出人眼的感知能力,尤其是在压缩的图像和视频中,这是极具挑战性的。我们发现,具有频率意识的采矿伪造模式可以治愈,因为频率提供了一个互补的观点,可以很好地描述微妙的伪造文物或压缩误差。为了将频率引入面部伪造检测中,我们提出了一个新型的伪造网络(F3-NET)中的新频率,获得了两个不同但互补的频率感知线索的优点,1)频率吸引的分解图像组件和2)局部频率统计数据,通过我们的两流程协作性学习框架深入挖掘了伪造模式。我们将DCT用作应用的频域变换。通过全面的研究,我们表明,在具有挑战性的FaceForensics ++数据集中,提出的F3-NET在所有压缩质量方面的最新方法都显着优于最先进的方法,尤其是在低品质媒体上赢得了很大的领先优势。
As realistic facial manipulation technologies have achieved remarkable progress, social concerns about potential malicious abuse of these technologies bring out an emerging research topic of face forgery detection. However, it is extremely challenging since recent advances are able to forge faces beyond the perception ability of human eyes, especially in compressed images and videos. We find that mining forgery patterns with the awareness of frequency could be a cure, as frequency provides a complementary viewpoint where either subtle forgery artifacts or compression errors could be well described. To introduce frequency into the face forgery detection, we propose a novel Frequency in Face Forgery Network (F3-Net), taking advantages of two different but complementary frequency-aware clues, 1) frequency-aware decomposed image components, and 2) local frequency statistics, to deeply mine the forgery patterns via our two-stream collaborative learning framework. We apply DCT as the applied frequency-domain transformation. Through comprehensive studies, we show that the proposed F3-Net significantly outperforms competing state-of-the-art methods on all compression qualities in the challenging FaceForensics++ dataset, especially wins a big lead upon low-quality media.