Boosting Fuzzer Efficiency: An Information Theoretic Perspective (ESEC/FSE 2022 - ESEC/FSE 2020)

Write a Blog >>

Mon 14 - Fri 18 November 2022 Singapore

Who

Marcel Böhme, Valentin Manès, Sang Kil Cha

Track

ESEC/FSE 2022 ESEC/FSE 2020

Time Zone

The program is currently displayed in (GMT+08:00) Beijing, Chongqing, Hong Kong, Urumqi.

Use conference time zone: (GMT+08:00) Beijing, Chongqing, Hong Kong, UrumqiSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 16 Nov 2022 11:45 - 12:00 at SRC LT 52 - ESEC/FSE 20 Software Testing II Chair(s): Xi Zheng

Abstract

In this paper, we take the fundamental perspective of fuzzing as a learning process. Suppose before fuzzing, we know nothing about the behaviors of a program P: What does it do? Executing the first test input, we learn how P behaves for this input. Executing the next input, we either observe the same or discover a new behavior. As such, each execution reveals ”some amount” of information about P’s behaviors. A classic measure of information is Shannon’s entropy. Measuring entropy allows us to quantify how much is learned from each generated test input about the behaviors of the program. Within a probabilistic model of fuzzing, we show how entropy also measures fuzzer efficiency. Specifically, it measures the general rate at which the fuzzer discovers new behaviors. Intuitively, efficient fuzzers maximize information.

From this information theoretic perspective, we develop Entropic, an entropy-based power schedule for greybox fuzzing which assigns more energy to seeds that maximize information. We implemented Entropic into the popular greybox fuzzer LibFuzzer. Our experiments with more than 250 open-source programs (60 million LoC) demonstrate a substantially improved efficiency and confirm our hypothesis that an efficient fuzzer maximizes information. Entropic has been independently evaluated and invited for integration into main-line LibFuzzer. Entropic now runs on more than 25,000 machines fuzzing hundreds of security-critical software systems simultaneously and continuously.

Teaser Video:

Link to Preprint

https://mboehme.github.io/paper/FSE20.Entropy.pdf

DOI

https://doi.org/10.1145/3368089.3409748

Marcel Böhme

MPI-SP, Germany and Monash University, Australia

Germany

Valentin Manès

KAIST, South Korea

Sang Kil Cha