Dataset

The Equivalent Mutant Dataset created by MutantBench

DOI

Mutant dataset standard

We use RDF to define and store the metadata. This also ensures the FAIR data principles F1, F3, A1, I1, and I3. Although RDF is not designed to enforce the user to supply metadata and thus not enforcing for F2, MutantBench enables the user to include missing metadata during conversion.

This section goes into detail of this standard and how this is defined.

Program

The program (also original program) is the piece of software that does not have any mutants applied. In this standard, it is a subclass of the SoftwareSourceCode created by Schema, or schema:SoftwareSourceCode for short (https://schema.org/SoftwareSourceCode). This is to done to integrate with the existing and widely used language defined by Schema. Each program that we provide contains at least the following metadata (also called properties):

URI: A globally unique and persistent identifier. Same [name].[extension]. This restricts programs with the same name and programming language existing multiple times in the dataset.
name: The name of the file without the extension.
extension: The file extension
fileName: The name of the file including the extension.
programmingLanguage: The computer programming language.
codeRepository: Link to the source code of the program.

Operator

Each operator is defined using the class mb:Operator. Which is a subclass of the schema:Thing (https://schema.org/Thing). The operator standard includes the following metadata:

URI: A globally unique and persistent identifier. For this the abbriviation of the name is used. e.g. ABSI. For the full list, see Table~\ref{tab:mutantbench_operators}.
name: A descriptive name. Example: ``Absolute insertion" for the ABSI operator.
operatorAbbreviation: The short name, or abbreviation, of the mutant operator
operatorDescription: The text representation in LaTeX of the mathematical operation.
primitiveOperator: The primitive action of operator used as defined by by Y. Ma in "Description of muJava's Method-level Mutation Operators".
operatorClass: The class of the operator as defined by King and Offurr in "A fortran language system for mutation-based software testing".
operatorAction: The type of operation the operator will perform.

Mutant

Each mutant is defined using the class mb:Mutant}. This is a subclass of schema:SoftwareSourceCode (https://schema.org/SoftwareSourceCode). This comes with the following added properties:

URI: A globally unique and persistent identifier. Generated by using the concatenated string of the program name (without extension) with the mutant difference on the SHA-1 algorithm (https://en.wikipedia.org/wiki/SHA-1) in hexadecimal. Python example: hashlib.sha1((file_name + diff).encode()).hexdigest(). This is chosen for its ease of implementation.
citation: The original paper this mutant came from.
program: The program that the mutant originates from.
difference: The difference between the mutant and the program. Generated using the following bash command: diff -u0 --ignore-all-space --ignore-blank-lines [SoftwareSourceCode location] [mutant location]}.
equivalence: If the mutant is equivalent to the program this should be true. If it is not equivalent, it should be false. Else empty/unknown/none/null
operator: The operator(s) applied to the program to generate the mutant.
RIP: The RIP class of the mutant.

Mutation testing is a method of evaluating the effectiveness of a test suite. It does this by altering the program with a predefined set of operations, resulting in a mutant. The effectiveness of the test suite is then determined by the number of mutants the test suite correctly identifies to be altered. However, these alternations can sometimes lead to semantically equivalent programs, called equivalent mutants. These mutants can never be detected by a test suite, and thus incorrectly lowering the effectiveness of the test suite. The problem of these equivalent mutants is called the Equivalent Mutant Problem, or EMP.

Equivalent Mutant Problem is an ongoing research problem that led to the creation of multiple reusable non-standardized mutant datasets. These datasets can be used to measure and compare the performance of solutions to this problem. However, because these are non-standardized datasets without any benchmarking tool attached, cross-study evaluation is still tedious.

To tackle this problem, we propose MutantBench, a novel open-source comparison framework that is designed with a focus on the FAIR data principles and adoptability by the community. This contains four contributions. First, we design a dataset standard which aims to make datasets findable, accessible, interoperable, and reusable. Next, we convert existing datasets to this new standard. Which we then combine to create a mutant dataset containing 4400 mutants containing 1416 equivalent mutants. This increases the previously largest mutant dataset by more than a thousand equivalent mutants. Lastly, we create a benchmarking tool that uses the combined dataset to give a detailed report on the performance of solutions to the equivalent mutant problem.

This link contains the dataset with its standard, and the benchmarking tool can be found at https://github.com/MutantBench/MutantBench

Identifier
DOI	https://doi.org/10.23728/b2share.fd8e674385214fe9a327941525c31f53
Source	https://b2share.eudat.eu/records/fd8e674385214fe9a327941525c31f53
Metadata Access	https://b2share.eudat.eu/api/oai2d?verb=GetRecord&metadataPrefix=eudatcore&identifier=oai:b2share.eudat.eu:b2rec/fd8e674385214fe9a327941525c31f53

Provenance
Creator	van Hijfte, Lars
Publisher	EUDAT B2SHARE
Publication Year	2021
Rights	Public Domain Mark (PD); info:eu-repo/semantics/openAccess
OpenAccess	true
Contact	lars.vanhijfte(at)student.uva.nl, larsvanhijfte(at)hotmail.com, a.m.oprescu(at)uva.nl

Representation
Format	ttl
Size	1.7 MB; 2 files
Discipline	4.1.13 → Computer sciences → Software engineering