论文标题
具有对抗性鲁棒性的生物学启发机制
Biologically Inspired Mechanisms for Adversarial Robustness
论文作者
论文摘要
尚未证明卷积神经网络在合理的计算和性能成本下对对抗性扰动非常强大。灵长类动物的视觉腹流似乎对视觉刺激中的小扰动是可靠的,但是引起这种强大感知的基本机制却尚不清楚。在这项工作中,我们研究了两个生物学上合理的机制在对抗鲁棒性中的作用。我们证明了灵长类动物视网膜进行的非均匀采样以及在每个偏心率下具有一系列接受场大小的多个接受场的存在,可以改善神经网络对小型对抗性扰动的鲁棒性。我们验证了这两种机制不会因梯度混淆而遭受损害,并通过消融研究研究了它们对对抗性鲁棒性的贡献。
A convolutional neural network strongly robust to adversarial perturbations at reasonable computational and performance cost has not yet been demonstrated. The primate visual ventral stream seems to be robust to small perturbations in visual stimuli but the underlying mechanisms that give rise to this robust perception are not understood. In this work, we investigate the role of two biologically plausible mechanisms in adversarial robustness. We demonstrate that the non-uniform sampling performed by the primate retina and the presence of multiple receptive fields with a range of receptive field sizes at each eccentricity improve the robustness of neural networks to small adversarial perturbations. We verify that these two mechanisms do not suffer from gradient obfuscation and study their contribution to adversarial robustness through ablation studies.