A hypothetical defenses-based training framework for generating transferable adversarial examples

Hao L, Hao K, Jin Y, Zhao H (2024)
Knowledge-Based Systems: 112602.

Zeitschriftenaufsatz | Veröffentlicht | Englisch
 
Download
Es wurden keine Dateien hochgeladen. Nur Publikationsnachweis!
Autor*in
Hao, Lingguang; Hao, Kuangrong; Jin, YaochuUniBi ; Zhao, Hongzhi
Abstract / Bemerkung
Transfer-based attacks utilize the proxy model to craft adversarial examples against the target model and make significant advancements in the realm of black-box attacks. Recent research suggests that these attacks can be enhanced by incorporating adversarial defenses into the training process of adversarial examples. Specifically, adversarial defenses supervise the training process, forcing the attacker to overcome greater challenges and produce more robust adversarial examples with enhanced transferability. However, current methods mainly rely on limited input transformation defenses, which apply only linear affine changes. These defenses are insufficient for effectively removing harmful content from adversarial examples, resulting in restricted improvements in their transferability. To address this issue, we propose a novel training framework named Transfer-based Attacks through Hypothesis Defense (TA-HD). This framework enhances the generalization of adversarial examples by integrating a hypothesis defense mechanism into the proxy model. Specifically, we propose an input denoising network as the hypothesis defense to effectively remove harmful noise from adversarial examples. Furthermore, we introduce an adversarial training strategy and design specific adversarial loss functions to optimize the input denoising network’s parameters. The visualization of the training process demonstrates the effective denoising capability of the hypothesized defense mechanism and the stability of the training process. Extensive experiments show that the proposed training framework significantly improves the success rate of transfer-based attacks by up to 19.9%. The code is available at https://github.com/haolingguang/TA-HD.
Erscheinungsjahr
2024
Zeitschriftentitel
Knowledge-Based Systems
Art.-Nr.
112602
ISSN
09507051
Page URI
https://pub.uni-bielefeld.de/record/2993318

Zitieren

Hao L, Hao K, Jin Y, Zhao H. A hypothetical defenses-based training framework for generating transferable adversarial examples. Knowledge-Based Systems. 2024: 112602.
Hao, L., Hao, K., Jin, Y., & Zhao, H. (2024). A hypothetical defenses-based training framework for generating transferable adversarial examples. Knowledge-Based Systems, 112602. https://doi.org/10.1016/j.knosys.2024.112602
Hao, Lingguang, Hao, Kuangrong, Jin, Yaochu, and Zhao, Hongzhi. 2024. “A hypothetical defenses-based training framework for generating transferable adversarial examples”. Knowledge-Based Systems: 112602.
Hao, L., Hao, K., Jin, Y., and Zhao, H. (2024). A hypothetical defenses-based training framework for generating transferable adversarial examples. Knowledge-Based Systems:112602.
Hao, L., et al., 2024. A hypothetical defenses-based training framework for generating transferable adversarial examples. Knowledge-Based Systems, : 112602.
L. Hao, et al., “A hypothetical defenses-based training framework for generating transferable adversarial examples”, Knowledge-Based Systems, 2024, : 112602.
Hao, L., Hao, K., Jin, Y., Zhao, H.: A hypothetical defenses-based training framework for generating transferable adversarial examples. Knowledge-Based Systems. : 112602 (2024).
Hao, Lingguang, Hao, Kuangrong, Jin, Yaochu, and Zhao, Hongzhi. “A hypothetical defenses-based training framework for generating transferable adversarial examples”. Knowledge-Based Systems (2024): 112602.
Export

Markieren/ Markierung löschen
Markierte Publikationen

Open Data PUB

Suchen in

Google Scholar