On the Effectiveness of Data Balancing Techniques in the Context of ML-based Test Case Prioritization
Regression testing is the cornerstone of quality assurance of software systems. However, executing regression test cases can impose significant computational and operational costs. In this context, Machine Learning-based Test Case Prioritization (ML-based TCP) techniques rank the execution of regression tests based on their probability of failures and execution time so that the faults can be detected as early as possible during the regression testing. Despite the recent progress of ML-based TCP, even the best reported ML-based TCP techniques can reach 90% or higher effectiveness in terms of Cost-cognizant Average Percentage of Faults Detected (APFDC) only in 20% of studied subjects. We argue that the imbalanced nature of used training datasets caused by the low failure rate of regression tests is one of the main reasons for this shortcoming. This work conducts an empirical study on applying 19 state-of-the-art data balancing techniques for dealing with imbalanced data sets in the TCP context, based on the most comprehensive publicly available datasets. The results demonstrate that data balancing techniques can improve the effectiveness of the best-known ML-based TCP technique for most subjects, with an average of 0.06 in terms of APFDC.
Fri 18 NovDisplayed time zone: Beijing, Chongqing, Hong Kong, Urumqi change
11:00 - 12:30 | |||
11:00 20mResearch paper | On the Effectiveness of Data Balancing Techniques in the Context of ML-based Test Case Prioritization PROMISE Jediael Mendoza , Jason Mycroft , Lyam Milbury , Nafiseh Kahani University of Carlton, Jason Jaskolka | ||
11:20 20mResearch paper | Profiling Developers to Predict Vulnerable Code Changes PROMISE Tugce Coskun Istanbul Technical University, Rusen Halepmollasi Istanbul Technical University, Khadija Hanifi Ericsson, Ramin Fadaei Fouladi Ericsson, Pinar Comak De Cnudde Ericsson, Ayse Tosun Istanbul Technical University | ||
11:40 20mResearch paper | Assessing the Quality of GitHub Copilot’s Code Generation PROMISE |