论文标题

面部表达综合和识别的共同深入学习

Joint Deep Learning of Facial Expression Synthesis and Recognition

论文作者

Yan, Yan, Huang, Ying, Chen, Si, Shen, Chunhua, Wang, Hanzi

论文摘要

最近,基于深度学习的面部表达识别(FER)方法引起了很大的关注,通常需要大规模标记的训练数据。但是,公开可用的面部表达数据库通常包含少量标记的数据。在本文中,为了克服上述问题,我们提出了一个新颖的联合深入学习面部表达合成和有效FER的识别方法。更具体地说,提出的方法涉及两个阶段的学习程序。首先,预先训练面部表达合成生成对抗网络(FESGAN),以生成具有不同面部表情的面部图像。为了增加训练图像的多样性,Fesgan经过精心设计,以生成具有先前分布的新身份的图像。其次,在统一框架中,与预训练的fesgan共同学习表达识别网络。特别是,从识别网络计算出的分类损失用于同时优化识别网络和FESGAN的生成器的性能。此外,为了减轻真实图像和合成图像之间的数据偏差问题,我们提出了一类新型的实际数据引导的背部传播(RDBP)算法的类内部损失,以减少同一类图像的类内部变化,从而可以显着提高最终性能。公共面部表达数据库的广泛实验结果表明,与几种最先进的方法相比,该方法的优越性。

Recently, deep learning based facial expression recognition (FER) methods have attracted considerable attention and they usually require large-scale labelled training data. Nonetheless, the publicly available facial expression databases typically contain a small amount of labelled data. In this paper, to overcome the above issue, we propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER. More specifically, the proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions. To increase the diversity of the training images, FESGAN is elaborately designed to generate images with new identities from a prior distribution. Secondly, an expression recognition network is jointly learned with the pre-trained FESGAN in a unified framework. In particular, the classification loss computed from the recognition network is used to simultaneously optimize the performance of both the recognition network and the generator of FESGAN. Moreover, in order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm to reduce the intra-class variations of images from the same class, which can significantly improve the final performance. Extensive experimental results on public facial expression databases demonstrate the superiority of the proposed method compared with several state-of-the-art FER methods.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源