Data
SZZ: "When do changes Induce Fixed?"
The csv table with the list of those papers that have cited SZZ from Google Scholar is available here.
The csv table with the list of those papers that have cited SZZ from Semantic Scholar is available here.
SZZ-1: "Automatic identification of bug-introducing changes"
The csv table with the list of those papers that have cited SZZ-1 from Google Scholar is available here.
The csv table with the list of those papers that have cited SZZ-1 from Semantic Scholar is available here.
SZZ-2: "SZZ Revisited: Verifying When Changes Induce Fixes"
The csv table with the list of those papers that have cited SZZ-2 from Google Scholar is available here.
The csv table with the list of those papers that have cited SZZ-2 from Semantic Scholar is available here.
Procedure
In the SLR we only considered studies that use both the whole algorithm or the second part of the SZZ algorithm. We identified a subset of 192 papers that fullfilled with this criteria and from which we extracted data.
We have extracted the following context data information
- Paper ID
- Does it carry out a manual analysis?
- Does it use the SZZ, SZZ-1 or SZZ-2 algorithm?
- Does it aware of the TTV in the first part?
- Does it aware of the TTV in the second part?
- Does it provide a versioned implementation of the SZZ?
- Does it provide a detailed description of the data and methods used?
- Does it provide a replication package?
Results
The csv table with the data (listed below) of each paper is available here.
The references included in the paper whith the passed and deleted articles is available here.
The JSON files with the data of the papers analyzed after the filtering (2) are available here.
The algorithm to remove false positives is here. And, the algorithm to remove duplicates is here .
The chi-square script and the statistical results are here.
Authors and Contributors
Gregorio Robles (@grex), Jesus M. Gonzalez-Barahora (@jgbarah), and Gema Rodriguez (@gemarodri).