Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack

Wang, Yixu; Li, Jie; Liu, Hong; Wang, Yan; Wu, Yongjian; Huang, Feiyue; Ji, Rongrong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2105.00623v2 (cs)

[Submitted on 3 May 2021 (v1), revised 16 Jun 2021 (this version, v2), latest version 26 Sep 2022 (v3)]

Title:Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack

Authors:Yixu Wang, Jie Li, Hong Liu, Yan Wang, Yongjian Wu, Feiyue Huang, Rongrong Ji

View PDF

Abstract:Previous studies have verified that the functionality of black-box models can be stolen with full probability outputs. However, under the more practical hard-label setting, we observe that existing methods suffer from catastrophic performance degradation. We argue this is due to the lack of rich information in the probability prediction and the overfitting caused by hard labels. To this end, we propose a novel hard-label model stealing method termed \emph{black-box dissector}, which consists of two erasing-based modules. One is a CAM-driven erasing strategy that is designed to increase the information capacity hidden in hard labels from the victim model. The other is a random-erasing-based self-knowledge distillation module that utilizes soft labels from the substitute model to mitigate overfitting. Extensive experiments on four widely-used datasets consistently demonstrate that our method outperforms state-of-the-art methods, with an improvement of at most $8.27\%$. We also validate the effectiveness and practical potential of our method on real-world APIs and defense methods. Furthermore, our method promotes other downstream tasks, \emph{i.e.}, transfer adversarial attacks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2105.00623 [cs.CV]
	(or arXiv:2105.00623v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2105.00623

Submission history

From: Yixu Wang [view email]
[v1] Mon, 3 May 2021 04:12:31 UTC (896 KB)
[v2] Wed, 16 Jun 2021 04:05:53 UTC (1,397 KB)
[v3] Mon, 26 Sep 2022 15:31:11 UTC (1,572 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators