Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks?
Recent efforts to test deep learning systems have produced an intuitive and compelling test criterion called neuron coverage (NC), which resembles the notion of traditional code coverage. NC measures the proportion of neurons activated in a neural network, and it is implicitly assumed that increasing NC improves the quality of a test suite. In an attempt to automatically generate a test suite that increases NC, we design a novel diversity-promoting regularizer that can be plugged into existing adversarial attack algorithms. We then assess whether such attempts to increase NC can generate a test suite that (1) detects adversarial attacks successfully, (2) produces natural inputs, and (3) is unbiased toward particular class predictions. Contrary to expectation, our extensive evaluation finds that increasing NC actually makes it harder to generate an effective test suite: higher neuron coverage leads to fewer defects detected, less natural inputs, and more biased prediction preferences. Our results cast doubt on neuron coverage as a meaningful objective for generating tests for deep neural networks and call for a new test generation technique that considers defect detection, naturalness, and output impartiality in tandem.
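The definition of NC in the abstract (the proportion of neurons activated across a test suite) can be made concrete with a small sketch. The helper below is an illustrative, hypothetical implementation, not the paper's actual code: it assumes activations are supplied per layer as rows of per-input activation vectors, and counts a neuron as covered if any input drives it above a threshold.

```python
def neuron_coverage(layer_activations, threshold=0.0):
    """Fraction of neurons activated above `threshold` by at least one input.

    `layer_activations`: list of layers; each layer is a list of rows,
    where each row holds one input's activation values for that layer's
    neurons (rows = inputs, columns = neurons).
    """
    covered = total = 0
    for layer in layer_activations:
        num_neurons = len(layer[0])
        total += num_neurons
        for j in range(num_neurons):
            # a neuron counts as covered if any input in the suite
            # pushes its activation strictly above the threshold
            if any(row[j] > threshold for row in layer):
                covered += 1
    return covered / total

# toy suite: 3 inputs through a 3-neuron layer and a 2-neuron layer
acts = [
    [[0.1, -0.2, 0.0], [0.5, 0.0, -0.1], [0.2, -0.3, 0.0]],
    [[0.0, 0.7], [0.0, 0.9], [0.0, 0.4]],
]
print(neuron_coverage(acts))  # 2 of 5 neurons covered -> 0.4
```

A test generator aiming to raise NC would then search for inputs that flip the remaining uncovered neurons above the threshold, which is exactly the objective the paper's regularizer encourages and its evaluation questions.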
Tue 15 Nov (time zone: Beijing, Chongqing, Hong Kong, Urumqi)
14:00 - 15:30
Software Testing I (ESEC/FSE 2020) at SRC LT 52
Chair(s): Arie van Deursen (Delft University of Technology)
|Testing Self-Adaptive Software with Probabilistic Guarantees on Performance Metrics|
Claudio Mandrioli (Lund University, Sweden), Martina Maggio (Saarland University, Germany / Lund University, Sweden)
|Search-Based Adversarial Testing and Improvement of Constrained Credit Scoring Systems|
Salah Ghamizi, Maxime Cordy, Martin Gubri, Mike Papadakis, Andrey Boystov, Yves Le Traon (University of Luxembourg, Luxembourg), Anne Goujon (BGL BNP Paribas, Luxembourg)
|Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks?|
Fabrice Harel-Canada, Lingxiao Wang (University of California at Los Angeles, USA), Muhammad Ali Gulzar (Virginia Tech, USA), Quanquan Gu, Miryung Kim (University of California at Los Angeles, USA)
|When Does My Program Do This? Learning Circumstances of Software Behavior|
Alexander Kampmann, Nikolas Havrikov (CISPA, Germany), Ezekiel Soremekun (SnT, University of Luxembourg), Andreas Zeller (CISPA Helmholtz Center for Information Security)
|FrUITeR: A Framework for Evaluating UI Test Reuse|
Yixue Zhao (University of Massachusetts at Amherst), Justin Chen (Columbia University, USA), Adriana Sejfia, Marcelo Schmitt Laser (University of Southern California, USA), Jie M. Zhang (King's College London), Federica Sarro, Mark Harman (University College London), Nenad Medvidović (University of Southern California)