Skip to content

Latest commit

 

History

History
10 lines (7 loc) · 412 Bytes

README.md

File metadata and controls

10 lines (7 loc) · 412 Bytes

SimCorp.

This corpus contains different datasets of behaviorally equivalent C/C++ programs to evaluate their semantic similitude.

The datasets:

  • 6 Type-4 scenarios extracted from the BigCloneBench
  • 10 programs for sorting, aggregation, and search algorithms
  • 566 programs extracted from CodeForces solving 5 different problems