Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

ForkSim: Generating software forks for evaluating cross-project similarity analysis tools

: Svajlenko, Jeffrey; Roy, Chanchal K.; Duszynski, Slawomir


Adams, B. ; Institute of Electrical and Electronics Engineers -IEEE-; IEEE Computer Society:
IEEE 13th International Working Conference on Source Code Analysis and Manipulation, SCAM 2013. Proceedings : 22-23 September 2013, Eindhoven, the Netherlands
Los Alamitos: IEEE Computer Society, 2013
ISBN: 978-1-4673-5739-5
International Working Conference on Source Code Analysis and Manipulation (SCAM) <13, 2013, Eindhoven>
Fraunhofer IESE ()
software tool; software variant; evaluation; code generation; clone detection; variant analysis

Software project forking, that is copying an existing project and developing a new independent project from the copy, occurs frequently in software development. Analysing the code similarities between such software projects is useful as developers can use similarity information to merge the forked systems or migrate them towards a reuse approach. Several techniques for detecting cross-project similarities have been proposed. However, no good benchmark to measure their performance is available. We developed ForkSim, a tool for generating datasets of synthetic software forks with known similarities and differences. This allows the performance of cross-project similarity tools to be measured in terms of recall and precision by comparing their output to the known properties of the generated dataset. These datasets can also be used in controlled experiments to evaluate further aspects of the tools, such as usability or visualization concepts. As a demonstration of our tool, we evaluated the performance of the clone detector NiCad for similarity detection across software forks, which showed the high potential of ForkSim.