research-article

Boosting fuzzer efficiency: an information theoretic perspective

Authors:

ESEC/FSE 2020: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering

Pages 678 - 689

https://doi.org/10.1145/3368089.3409748

Published: 08 November 2020 Publication History

Get Access

Abstract

In this paper, we take the fundamental perspective of fuzzing as a learning process. Suppose before fuzzing, we know nothing about the behaviors of a program P: What does it do? Executing the first test input, we learn how P behaves for this input. Executing the next input, we either observe the same or discover a new behavior. As such, each execution reveals ”some amount” of information about P’s behaviors. A classic measure of information is Shannon’s entropy. Measuring entropy allows us to quantify how much is learned from each generated test input about the behaviors of the program. Within a probabilistic model of fuzzing, we show how entropy also measures fuzzer efficiency. Specifically, it measures the general rate at which the fuzzer discovers new behaviors. Intuitively, efficient fuzzers maximize information.

From this information theoretic perspective, we develop Entropic, an entropy-based power schedule for greybox fuzzing which assigns more energy to seeds that maximize information. We implemented Entropic into the popular greybox fuzzer LibFuzzer. Our experiments with more than 250 open-source programs (60 million LoC) demonstrate a substantially improved efficiency and confirm our hypothesis that an efficient fuzzer maximizes information. Entropic has been independently evaluated and invited for integration into main-line LibFuzzer. Entropic now runs on more than 25,000 machines fuzzing hundreds of security-critical software systems simultaneously and continuously.

Supplementary Material

Auxiliary Teaser Video (fse20main-p597-p-teaser.mp4)

This is the presentation video for our ESEC/FSE'20 paper "Boosting Fuzzer Efficiency: An Information Theoretic Perspective" by Marcel Böhme, Valentin J. M. Manès, and Sang Kil Cha.

Download
17.46 MB

Auxiliary Presentation Video (fse20main-p597-p-video.mp4)

This is the presentation video for our ESEC/FSE'20 paper "Boosting Fuzzer Efficiency: An Information Theoretic Perspective" by Marcel Böhme, Valentin J. M. Manès, and Sang Kil Cha.

Download
244.87 MB

References

[1]

Abhishek Aarya, Oliver Chang, Max Moroz, Martin Barbella, and Jonathan Metzman. 2019. Open sourcing ClusterFuzz. https://security.googleblog.com/ 2019 /02/ open-sourcing-clusterfuzz.html. ( 2019 ). Accessed: 2020-09-30.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Boosting Fuzzer Efficiency: An Information Theoretic Perspective

Information Theory, Information View, and Software Testing

Grey-box concolic testing on binary code

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Badges

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations