Adversarial robustness in deep learning: attacks on fragile neuronsPravin, C., Martino, I., Nicosia, G. and Ojha, V. ORCID: https://orcid.org/0000-0002-9256-1192 (2021) Adversarial robustness in deep learning: attacks on fragile neurons. In: 30th International Conference on Artificial Neural Networks, September 14-17, 2021, Bratislava, Slovakia (Online), pp. 16-28, https://doi.org/10.1007/978-3-030-86362-3_2.
It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing. To link to this item DOI: 10.1007/978-3-030-86362-3_2 Abstract/SummaryWe identify fragile and robust neurons of deep learning architectures using nodal dropouts of the first convolutional layer. Using an adversarial targeting algorithm, we correlate these neurons with the distribution of adversarial attacks on the network. Adversarial robustness of neural networks has gained significant attention in recent times and highlights intrinsic weaknesses of deep learning networks against carefully constructed distortion applied to input images. In this paper, we evaluate the robustness of state-of-the-art image classification models trained on the MNIST and CIFAR10 datasets against the fast gradient sign method attack, a simple yet effective method of deceiving neural networks. Our method identifies the specific neurons of a network that are most affected by the adversarial attack being applied. We, therefore, propose to make fragile neurons more robust against these attacks by compressing features within robust neurons and amplifying the fragile neurons proportionally.
Download Statistics DownloadsDownloads per month over past year Altmetric Deposit Details University Staff: Request a correction | Centaur Editors: Update this record |