The Vulnerability Is in the Details: Locating Fine-grained Information of Vulnerable Code Identified by Graph-based Detectors

Cheng, Baijun; Wang, Kailong; Gao, Cuiyun; Luo, Xiapu; Li, Li; Guo, Yao; Chen, Xiangqun; Wang, Haoyu

Computer Science > Software Engineering

arXiv:2401.02737 (cs)

[Submitted on 5 Jan 2024 (v1), last revised 7 Sep 2024 (this version, v3)]

Title:The Vulnerability Is in the Details: Locating Fine-grained Information of Vulnerable Code Identified by Graph-based Detectors

Authors:Baijun Cheng, Kailong Wang, Cuiyun Gao, Xiapu Luo, Li Li, Yao Guo, Xiangqun Chen, Haoyu Wang

View PDF HTML (experimental)

Abstract:Vulnerability detection is a crucial component in the software development lifecycle. Existing vulnerability detectors, especially those based on deep learning (DL) models, have achieved high effectiveness. Despite their capability of detecting vulnerable code snippets from given code fragments, the detectors are typically unable to further locate the fine-grained information pertaining to the vulnerability, such as the precise vulnerability triggering this http URL this paper, we propose VULEXPLAINER, a tool for automatically locating vulnerability-critical code lines from coarse-level vulnerable code snippets reported by DL-based this http URL approach takes advantage of the code structure and the semantics of the vulnerabilities. Specifically, we leverage program slicing to get a set of critical program paths containing vulnerability-triggering and vulnerability-dependent statements and rank them to pinpoint the most important one (i.e., sub-graph) as the data flow associated with the vulnerability. We demonstrate that VULEXPLAINER performs consistently well on four state-of-the-art graph-representation(GP)-based vulnerability detectors, i.e., it can flag the vulnerability-triggering code statements with an accuracy of around 90% against eight common C/C++ vulnerabilities, outperforming five widely used GNN-based explanation approaches. The experimental results demonstrate the effectiveness of VULEXPLAINER, which provides insights into a promising research line: integrating program slicing and deep learning for the interpretation of vulnerable code fragments.

Subjects:	Software Engineering (cs.SE)
Cite as:	arXiv:2401.02737 [cs.SE]
	(or arXiv:2401.02737v3 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2401.02737

Submission history

From: Baijun Cheng [view email]
[v1] Fri, 5 Jan 2024 10:15:04 UTC (3,381 KB)
[v2] Wed, 21 Feb 2024 08:21:43 UTC (3,381 KB)
[v3] Sat, 7 Sep 2024 12:26:49 UTC (3,381 KB)

Computer Science > Software Engineering

Title:The Vulnerability Is in the Details: Locating Fine-grained Information of Vulnerable Code Identified by Graph-based Detectors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:The Vulnerability Is in the Details: Locating Fine-grained Information of Vulnerable Code Identified by Graph-based Detectors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators