XCorpus – An executable corpus of Java programs

Dietrich JB; Schole H; Sui L; Tempero E

DOI

10.5381/jot.2017.16.4.a1

Open Access Location

http://www.jot.fm/issues/issue_2017_04/article1.pdf

Abstract

Empirical studies on code require standardized datasets of significant size extracted from real-world programs in order to be reproducible and generalisable. We argue that there is a need for such datasets that are executable and can therefore be used for experiments using both static and dynamic analysis. A harness for such a dataset should have high coverage in order to facilitate the construction of comprehensive models of program execution. We present XCorpus, a set of 76 executable, real-world Java programs, including a subset of 70 programs from the Qualitas Corpus. XCorpus uses a harness that combines built-in and generated test cases, resulting in branch coverage that is significantly better than what is available from DaCapo.