research-article

Forming Adversarial Example Attacks Against Deep Neural Networks With Reinforcement Learning

Authors:

Matthew Akers,

Armon BartonAuthors Info & Claims

Computer, Volume 57, Issue 1

Pages 88 - 99

https://doi.org/10.1109/MC.2023.3324751

Published: 03 January 2024 Publication History

Abstract

We propose a novel reinforcement learning-based adversarial example attack, Adversarial Reinforcement Learning Agent, designed to learn imperceptible perturbation that causes misclassification when added to the input of a deep learning classifier.

References

[1]

I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning, 1st ed. Cambridge, MA, USA: MIT Press, 2016.

Digital Library

Google Scholar

[2]

I. J. Goodfellow, J. Shlens, and C. Szegedy, “Explaining and harnessing adversarial examples,” Mar. 2015. Accessed: Feb. 28, 2022. [Online]. Available: http://arxiv.org/abs/1412.6572

Google Scholar

[3]

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 2nd ed. Cambridge, MA, USA: MIT Press, 2018.

Google Scholar

[4]

V. Mnih, K. Kavukcuoglu, A. G. David Silver, I. Antonoglou, D. Wierstra, and M. A. Riedmiller, “Playing Atari with deep reinforcement learning,” 2013. [Online]. Available: http://arxiv.org/abs/1312.5602

Google Scholar

[5]

V. Mnih et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015.

Crossref

Google Scholar

[6]

D. Silver et al., “Mastering the game of Go with deep neural networks and tree search,” Nature, vol. 529, no. 7587, pp. 484–489, 2016.

Crossref

Google Scholar

[7]

D. Silver et al., “Mastering the game of Go without human knowledge,” Nature, vol. 550, no. 7676, pp. 354–359, Oct. 2017.

Crossref

Google Scholar

[8]

H. van Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning with double Q-learning,” Proc. AAAI Conf. Artif. Intell., vol. 30, no. 1, pp. 2094–2100, Mar. 2016.

Crossref

Google Scholar

[9]

Z. Wang, T. Schaul, M. Hessel, H. van Hasselt, M. Lanctot, and N. de Freitas, “Dueling network architectures for deep reinforcement learning,” in Proc. 33rd Int. Conf. Mach. Learn., 2016, pp. 1995–2003.

Google Scholar

[10]

A. Barton, “Defending neural networks against adversarial examples,” Ph.D. thesis, Dept. of Comp. Sci., Univ. of Texas Arlington, Arlington, TX, USA, 2018.

Google Scholar

[11]

A. Krizhevsky and G. Hinton, “Learning multiple layers of features from tiny images,” Univ. of Toronto, Toronto, ON, Canada, 2009. [Online]. Available: https://www.cs.toronto.edu/∼kriz/learning-features-2009-TR.pdf

Google Scholar

[12]

A. Madry, A. Makelov, L. Schmidt, D. Tsipras, and A. Vladu, “Towards deep learning models resistant to adversarial attacks,” Sep. 2019. Accessed: Feb. 28, 2022. [Online]. Available: http://arxiv.org/abs/1706.06083

Google Scholar

[13]

S.-M. Moosavi-Dezfooli, A. Fawzi, and P. Frossard, “DeepFool: A simple and accurate method to fool deep neural networks,” 2015. [Online]. Available: https://arxiv.org/abs/1511.04599

Google Scholar

[14]

N. Carlini and D. Wagner, “Towards evaluating the robustness of neural networks,” in Proc. IEEE Symp. Secur. Privacy, May 2017, pp. 39–57.

Crossref

Google Scholar

[15]

M. Andriushchenko, F. Croce, N. Flammarion, and M. Hein, “Square attack: A query-efficient black-box adversarial attack via random search,” in Proc. Eur. Conf. Comput. Vision (ECCV), Aug. 2020, pp. 484–501.

Google Scholar

[16]

F. Croce and M. Hein, “Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks,” in Proc. 37th Int. Conf. Mach. Learn., 2020, pp. 2206–2216. [Online]. Available: https://proceedings.mlr.press/v119/croce20b.html

Digital Library

Google Scholar

[17]

M.-I. Nicolae et al., “Adversarial robustness toolbox v1.2.0,” 2018. [Online]. Available: https://arxiv.org/pdf/1807.01069

Google Scholar

[18]

S. Zagoruyko and N. Komodakis, “Wide residual networks,” 2016. [Online]. Available: https://arxiv.org/abs/1605.07146

Google Scholar

[19]

G. Brockman et al., “OpenAI Gym,” 2016,.

Google Scholar

Index Terms

Forming Adversarial Example Attacks Against Deep Neural Networks With Reinforcement Learning
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
    2. Machine learning approaches
      1. Neural networks
2. Security and privacy

Index terms have been assigned to the content through auto-classification.

Recommendations

Robust Deep Reinforcement Learning with Adversarial Attacks
AAMAS '18: Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems

This paper proposes adversarial attacks for Reinforcement Learning (RL). These attacks are then leveraged during training to improve the robustness of RL within robust control framework. We show that this adversarial training of DRL algorithms like Deep ...
XSS adversarial example attacks based on deep reinforcement learning
Abstract
Cross-site scripting (XSS) attack is one of the most serious security problems in web applications. Although deep neural network (DNN) has been used in XSS attack detection and achieved unprecedented success, it is vulnerable to ...
Snooping Attacks on Deep Reinforcement Learning
AAMAS '20: Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems

Adversarial attacks have exposed a significant security vulnerability in state-of-the-art machine learning models. Among these models include deep reinforcement learning agents. The existing methods for attacking reinforcement learning agents assume the ...

Comments

Information & Contributors

Information

Published In

U.S. Government work not protected by U.S. copyright.

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 03 January 2024

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 21 Sep 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Abstract

References

Index Terms

Recommendations

Robust Deep Reinforcement Learning with Adversarial Attacks

XSS adversarial example attacks based on deep reinforcement learning

Snooping Attacks on Deep Reinforcement Learning

Comments

Information

Published In

Publisher

Publication History

Qualifiers

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

View options

Get Access

Login options

Full Access

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations