Table 1. Tasks and datapoints contained in each dataset split.
| Task name | Labeled dataset | Blind dataset |
|---|---|---|
| Binary similarity | 732,376 | 243,044 |
| Compiler provenance | 89,744 | 9,600 |
| Function naming | 120,640 | 9,600 |
| Signature recovery | 120,259 | 9,086 |
| Task name | Labeled dataset | Blind dataset |
|---|---|---|
| Binary similarity | 732,376 | 243,044 |
| Compiler provenance | 89,744 | 9,600 |
| Function naming | 120,640 | 9,600 |
| Signature recovery | 120,259 | 9,086 |