论文标题
私人半监督知识转移,以从嘈杂的标签中进行深度学习
Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy Labels
论文作者
论文摘要
经过大规模数据培训的深度学习模型已在许多实际任务中取得了令人鼓舞的表现。同时,发布在敏感数据集中培训的那些模型(例如医疗记录)可能会引起严重的隐私问题。为了解决这些问题,当前最新的方法之一是教师合奏或PATE的私人汇总,在提供强大的隐私保证的同时,在保留模型的实用性方面取得了有希望的结果。 Pate结合了一个对敏感数据培训的“教师模型”的合奏,并通过嘈杂的教师投票的噪声聚合将知识转移到“学生”模型中,以标记将培训学生模型的未标记的公共数据。但是,由于私人汇总,学生学到的知识或投票标签是嘈杂的。直接从嘈杂的标签中学习可以显着影响学生模型的准确性。 在本文中,我们提出了PATE ++机制,该机制将当前的高级噪声标签训练机制与原始PATE框架相结合,以提高其准确性。为了有效地整合它们的新型生成对抗网(GAN)的新结构。此外,我们开发了一种新型的嘈杂标签检测机制,用于半监督模型训练,以进一步提高使用嘈杂标签训练的学生模型表现。我们评估了有关时尚摄影者和SVHN的方法,以显示所有措施原始PATE的改进。
Deep learning models trained on large-scale data have achieved encouraging performance in many real-world tasks. Meanwhile, publishing those models trained on sensitive datasets, such as medical records, could pose serious privacy concerns. To counter these issues, one of the current state-of-the-art approaches is the Private Aggregation of Teacher Ensembles, or PATE, which achieved promising results in preserving the utility of the model while providing a strong privacy guarantee. PATE combines an ensemble of "teacher models" trained on sensitive data and transfers the knowledge to a "student" model through the noisy aggregation of teachers' votes for labeling unlabeled public data which the student model will be trained on. However, the knowledge or voted labels learned by the student are noisy due to private aggregation. Learning directly from noisy labels can significantly impact the accuracy of the student model. In this paper, we propose the PATE++ mechanism, which combines the current advanced noisy label training mechanisms with the original PATE framework to enhance its accuracy. A novel structure of Generative Adversarial Nets (GANs) is developed in order to integrate them effectively. In addition, we develop a novel noisy label detection mechanism for semi-supervised model training to further improve student model performance when training with noisy labels. We evaluate our method on Fashion-MNIST and SVHN to show the improvements on the original PATE on all measures.