Detailed Information


Robustifying models against adversarial attacks by Langevin dynamics

Full metadata record
dc.contributor.author: Srinivasan, Vignesh
dc.contributor.author: Rohrer, Csaba
dc.contributor.author: Marban, Arturo
dc.contributor.author: Mueller, Klaus-Robert
dc.contributor.author: Samek, Wojciech
dc.contributor.author: Nakajima, Shinichi
dc.date.accessioned: 2022-03-01T23:43:00Z
dc.date.available: 2022-03-01T23:43:00Z
dc.date.created: 2022-01-20
dc.date.issued: 2021-05
dc.identifier.issn: 0893-6080
dc.identifier.uri: https://scholar.korea.ac.kr/handle/2021.sw.korea/137429
dc.description.abstract: Adversarial attacks on deep learning models have compromised their performance considerably. As remedies, a number of defense methods have been proposed, which, however, have been circumvented by newer and more sophisticated attacking strategies. In the midst of this ensuing arms race, robustness against adversarial attacks remains a challenging problem. This paper proposes a novel, simple yet effective defense strategy in which off-manifold adversarial samples are driven towards high-density regions of the data-generating distribution of the (unknown) target class by the Metropolis-adjusted Langevin algorithm (MALA), with the perceptual boundary taken into account. To achieve this, we introduce a generative model of the conditional distribution of the inputs given labels, which can be learned through a supervised Denoising Autoencoder (sDAE) in alignment with a discriminative classifier. Our algorithm, called MALA for DEfense (MALADE), is equipped with significant dispersion: its projections are distributed broadly, which prevents white-box attacks from accurately aligning the input to create an effective adversarial sample. MALADE is applicable to any existing classifier, providing robust defense as well as off-manifold sample detection. In our experiments, MALADE exhibited state-of-the-art performance against various elaborate attacking strategies. (A minimal illustrative sketch of the Langevin update appears after this metadata record.)
dc.language: English
dc.language.iso: en
dc.publisher: PERGAMON-ELSEVIER SCIENCE LTD
dc.title: Robustifying models against adversarial attacks by Langevin dynamics
dc.type: Article
dc.contributor.affiliatedAuthor: Mueller, Klaus-Robert
dc.identifier.doi: 10.1016/j.neunet.2020.12.024
dc.identifier.scopusid: 2-s2.0-85100149772
dc.identifier.wosid: 000686896300001
dc.identifier.bibliographicCitation: NEURAL NETWORKS, v.137, pp.1 - 17
dc.relation.isPartOf: NEURAL NETWORKS
dc.citation.title: NEURAL NETWORKS
dc.citation.volume: 137
dc.citation.startPage: 1
dc.citation.endPage: 17
dc.type.rims: ART
dc.type.docType: Review
dc.description.journalClass: 1
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalResearchArea: Neurosciences & Neurology
dc.relation.journalWebOfScienceCategory: Computer Science, Artificial Intelligence
dc.relation.journalWebOfScienceCategory: Neurosciences
dc.subject.keywordAuthor: Adversarial examples
dc.subject.keywordAuthor: Robustness
dc.subject.keywordAuthor: Langevin dynamics
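
Illustrative sketch of the Langevin update described in the abstract

The defense described in the abstract drives an input toward high-density regions of the conditional distribution p(x | y) using Langevin dynamics, with the required score (gradient of the log density) estimated from the reconstruction residual of a supervised denoising autoencoder. The Python sketch below is a minimal illustration under those assumptions, not the authors' implementation: sdae_denoise is a hypothetical placeholder for a trained sDAE, the step size and noise scale are arbitrary, and the Metropolis accept/reject correction used by full MALA is omitted (so this is unadjusted Langevin dynamics).

import numpy as np

def sdae_denoise(x, y):
    # Hypothetical stand-in for a trained supervised denoising autoencoder.
    # For a real sDAE, the residual (denoised - input) is approximately
    # proportional to sigma^2 * grad_x log p(x | y), which is what the
    # Langevin update needs. Returning x unchanged keeps the sketch runnable.
    return x

def langevin_purify(x, y, steps=50, step_size=1e-2, sigma=0.1, rng=None):
    # Drive a (possibly adversarial) input toward high-density regions of
    # p(x | y) with unadjusted Langevin dynamics; the MALA accept/reject
    # step is omitted for brevity.
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=np.float64).copy()
    for _ in range(steps):
        score = (sdae_denoise(x, y) - x) / sigma ** 2  # approx. grad log p(x|y)
        noise = rng.standard_normal(x.shape)
        x = x + 0.5 * step_size * score + np.sqrt(step_size) * noise
    return x

# Toy usage: "purify" a perturbed input before handing it to a classifier.
x_adv = np.random.default_rng(0).standard_normal((28, 28))
x_purified = langevin_purify(x_adv, y=3)

The injected noise is what gives the procedure the "dispersion" mentioned in the abstract: repeated runs land on different nearby high-density points, so a white-box attacker cannot anticipate the exact purified input.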
Files in This Item
There are no files associated with this item.
Appears in Collections
Graduate School > Department of Artificial Intelligence > 1. Journal Articles
