Learning Security Classifiers with Verified Global Robustness Properties

Chen, Yizheng; Wang, Shiqi; Qin, Yue; Liao, Xiaojing; Jana, Suman; Wagner, David

doi:10.1145/3460120.3484776

arXiv:2105.11363 (cs)

[Submitted on 24 May 2021 (v1), last revised 1 Dec 2021 (this version, v3)]

Title: Learning Security Classifiers with Verified Global Robustness Properties

Authors: Yizheng Chen, Shiqi Wang, Yue Qin, Xiaojing Liao, Suman Jana, David Wagner

Abstract: Many recent works have proposed methods to train classifiers with local robustness properties, which can provably eliminate classes of evasion attacks for most inputs, but not all inputs. Since data distribution shift is very common in security applications (it is often observed in malware detection, for example), local robustness cannot guarantee that the property holds for unseen inputs at the time the classifier is deployed. It is therefore more desirable to enforce global robustness properties, which hold for all inputs and are strictly stronger than local robustness properties.
In this paper, we present a framework and tools for training classifiers that satisfy global robustness properties. We define new notions of global robustness that are more suitable for security classifiers. We design a novel booster-fixer training framework to enforce global robustness properties. We structure our classifier as an ensemble of logic rules and design a new verifier to verify the properties. In our training algorithm, the booster increases the classifier's capacity, and the fixer enforces verified global robustness properties following counterexample guided inductive synthesis.
We show that we can train classifiers to satisfy different global robustness properties for three security datasets, and even multiple properties at the same time, with modest impact on the classifier's performance. For example, we train a Twitter spam account classifier to satisfy five global robustness properties, with a 5.4% decrease in true positive rate and a 0.1% increase in false positive rate, compared to a baseline XGBoost model that doesn't satisfy any property.
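
The counterexample-guided loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the two-feature integer domain, the `(feature, threshold, weight)` rule format, the monotonicity property on feature 0, the exhaustive verifier, and the weight-zeroing fixer are all assumptions chosen to keep the example small. It only shows the general shape of the idea: a booster step adds capacity, a verifier searches all inputs for a violation, and a fixer repairs the model until no counterexample remains.

```python
from itertools import product

# Hypothetical toy setup: two integer features, each in {0, 1, 2, 3}.
MAX_VALUE = 3
DOMAIN = list(product(range(MAX_VALUE + 1), repeat=2))
MONOTONE_FEATURE = 0  # assumed property: raising feature 0 never lowers the score


def score(rules, x):
    """Sum the weights of all rules (feature, threshold, weight) that fire on x."""
    return sum(w for f, t, w in rules if x[f] >= t)


def verify(rules):
    """Exhaustively check monotonicity over the whole (tiny) domain.

    Returns None if the property holds for every input, otherwise a
    counterexample pair (x, x_up) where x_up raises the feature by one.
    Checking single steps is enough because monotonicity is transitive.
    """
    for x in DOMAIN:
        if x[MONOTONE_FEATURE] == MAX_VALUE:
            continue
        x_up = list(x)
        x_up[MONOTONE_FEATURE] += 1
        x_up = tuple(x_up)
        if score(rules, x_up) < score(rules, x):
            return x, x_up
    return None


def fix(rules, counterexample):
    """Zero the negative-weight rules that switch on between x and x_up."""
    x, x_up = counterexample
    lo, hi = x[MONOTONE_FEATURE], x_up[MONOTONE_FEATURE]
    return [
        (f, t, 0.0) if (f == MONOTONE_FEATURE and lo < t <= hi and w < 0) else (f, t, w)
        for f, t, w in rules
    ]


def boost_and_fix(candidate_rules):
    """Add one rule at a time (booster), then repair until verified (fixer)."""
    rules = []
    for rule in candidate_rules:
        rules.append(rule)
        counterexample = verify(rules)
        while counterexample is not None:
            rules = fix(rules, counterexample)
            counterexample = verify(rules)
    return rules


# Hypothetical candidate rules; two of them would break monotonicity.
candidates = [(0, 1, 1.0), (0, 2, -0.5), (1, 2, 0.8), (0, 3, -0.2)]
verified_rules = boost_and_fix(candidates)
```

In this sketch the fixer always terminates, because every counterexample implies at least one negative-weight rule in the violating step, and each repair zeroes it. Exhaustive enumeration is only feasible because the domain is tiny; a verifier for a real classifier would have to reason about the model's structure rather than enumerate inputs.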

Comments:	ACM Conference on Computer and Communications Security (CCS) 2021 Best Paper Award Runner-Up
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2105.11363 [cs.CR]
	(or arXiv:2105.11363v3 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2105.11363
Related DOI:	https://doi.org/10.1145/3460120.3484776

Submission history

From: Yizheng Chen
[v1] Mon, 24 May 2021 15:46:20 UTC (381 KB)
[v2] Thu, 16 Sep 2021 05:10:06 UTC (380 KB)
[v3] Wed, 1 Dec 2021 20:17:02 UTC (380 KB)
