Skip to main content
. 2018 Jan 29;35(5):1037–1046. doi: 10.1093/molbev/msy014

Table 4.

Results of a Code Duplication Analysis Using the Simian Tool.

Code Lines Checked Files Checked Duplicate Lines Duplication % Blocks Files
PAML 22,200 17 1,210 5.5% 120 11
PHYML 42,786 73 5,878 13.7% 549 32
MrBayes 70,680 19 21,862 30.9% 1,680 10
RAxML 55,873 25 17,137 30.7% 1,304 22
SOAP 27,514 116 10,107 36.7% 527 72
Abyss 37,038 212 4,245 11.5% 441 71
MS 1,718 24 186 10.8% 21 9
SweepFinder 3,777 12 293 7.8% 28 3
MAFFT 45,045 72 28,630 63.6% 1,647 59
T-Coffee 82,758 196 19,345 23.4% 1,325 58
Prank 16,124 67 5,318 33.0% 462 43
BEAST 228,316 2,336 64,024 28.0% 4,786 1,151
BP&P 14,332 5 502 3.5% 56 3
Seq-Gen 3,244 44 206 6.4% 25 6
INDELible 9,840 7 1,954 19.9% 106 5
Gadget-2 9,770 31 3,314 33.9% 180 31

Note.—The column “Lines checked” refers to the total number of source lines and “Files checked” to the total number of source files analyzed with Simian. Note that, the “Lines checked” number is not identical to the LoC numbers reported in tables 2 and 3, since the Simian tool does not take header files into account. Column “Duplicate lines” provides the number of duplicate lines detected, “duplication %” the relative amount of code duplication, and “Blocks” provides the total number of contiguous duplicated blocks of code. Finally, column “Files” gives the number of files in which duplicated code was detected.