Table 5.
Grp | Group Name | Num Target | Best FM-Style Scoring
|
First FM-style Scoring
|
Win/Loss Scoring
|
|||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sum Z | Sum R | Avg Z | Avg R | Sum Z | Sum R | Avg Z | Avg R | Win F | P-value | Win R | ||||
A | Tp | |||||||||||||
169 | LEE*1,2 | 23 | 16.54 | 1 | 0.72 | 2 | 14.85 | 2 | 0.68 | 6 | 0.86 | 1.3E−32 | 1 | |
38 | nns*1,2 | 23 | 15.98 | 2 | 0.69 | 3 | 15.64 | 1 | 0.68 | 5 | 0.86 | 7.8E−32 | 2 | |
162 | McGuffin* | 23 | 8.85 | 3 | 0.38 | 7 | −0.93 | 5 | −0.04 | 12 | 0.67 | 2.5E−08 | 4 | |
345 | FUSION* | 22 | 3.25 | 4 | 0.24 | 10 | 7.44 | 3 | 0.43 | 7 | 0.59 | 3.1E−03 | 5 | |
420 | MULTICOM-CLUSTER | 23 | −0.24 | 5 | −0.01 | 12 | −2.42 | 6 | −0.11 | 13 | 0.48 | 6.7E−01 | 8 | |
479 | RBO Aleph | 21 | −2.58 | 6 | 0.07 | 11 | 2.33 | 4 | 0.30 | 9 | 0.56 | 4.3E−02 | 6 | |
65 | Jones-UCL | 21 | −5.27 | 7 | −0.06 | 13 | −2.96 | 7 | 0.05 | 10 | 0.52 | 2.5E−01 | 7 | |
80 | MeilerLab | 23 | −5.73 | 8 | −0.25 | 14 | −5.41 | 8 | −0.24 | 14 | 0.43 | 9.8E−01 | 9 | |
64 | BAKER1,2 | 10 | −13.73 | 9 | 1.23 | 1 | −14.99 | 9 | 1.10 | 1 | 0.96 | 1.4E−26 | 3 | |
357 | STAP | 21 | −16.97 | 10 | −0.62 | 15 | −15.40 | 10 | −0.54 | 15 | 0.24 | 1.0E+00 | 10 | |
32 | Legato | 21 | −17.24 | 11 | −0.63 | 16 | −17.90 | 11 | −0.66 | 17 | 0.24 | 1.0E+00 | 11 | |
41 | MULTICOM-NOVEL | 23 | −21.00 | 12 | −0.91 | 19 | −19.92 | 12 | −0.87 | 19 | 0.16 | 1.0E+00 | 12 | |
219 | Sternberg | 17 | −27.11 | 13 | −0.89 | 18 | −25.97 | 13 | −0.82 | 18 | 0.14 | 1.0E+00 | 13 | |
| ||||||||||||||
B | Tc | |||||||||||||
44 | LEER*1,2,3,4 | 24 | 29.94 | 1 | 1.25 | 1 | 30.37 | 1 | 1.27 | 1 | 0.95 | 7.8E−79 | 1 | |
169 | LEE*3,4 | 24 | 28.47 | 2 | 1.19 | 3 | 28.51 | 2 | 1.19 | 2 | 0.92 | 7.6E−66 | 2 | |
64 | BAKER1,2 | 23 | 25.30 | 3 | 1.19 | 2 | 24.68 | 4 | 1.16 | 3 | 0.88 | 4.0E−48 | 3 | |
38 | nns | 24 | 24.70 | 4 | 1.03 | 4 | 25.15 | 3 | 1.05 | 4 | 0.83 | 2.8E−38 | 4 | |
420 | MULTICOM-CLUSTER | 24 | 3.57 | 5 | 0.15 | 8 | 3.73 | 5 | 0.16 | 8 | 0.62 | 6.9E−06 | 8 | |
41 | MULTICOM-NOVEL | 24 | −10.29 | 6 | −0.43 | 13 | −10.35 | 6 | −0.43 | 14 | 0.38 | 1.0E+00 | 12 | |
276 | FLOUDAS A4 | 22 | −10.36 | 7 | −0.29 | 11 | −10.67 | 8 | −0.30 | 11 | 0.42 | 1.0E+00 | 10 | |
65 | Jones-UCL | 23 | −11.37 | 8 | −0.41 | 12 | −10.36 | 7 | −0.36 | 12 | 0.39 | 1.0E+00 | 11 | |
80 | MeilerLab | 24 | −15.92 | 9 | −0.66 | 17 | −15.83 | 9 | −0.66 | 17 | 0.24 | 1.0E+00 | 18 | |
345 | FUSION | 24 | −16.69 | 10 | −0.70 | 18 | −16.82 | 10 | −0.70 | 18 | 0.26 | 1.0E+00 | 17 | |
162 | McGuffin | 24 | −16.87 | 11 | −0.70 | 19 | −19.21 | 12 | −0.80 | 20 | 0.27 | 1.0E+00 | 16 | |
357 | STAP | 21 | −17.74 | 12 | −0.56 | 16 | −17.30 | 11 | −0.54 | 15 | 0.35 | 1.0E+00 | 13 | |
479 | RBO_Aleph | 21 | −23.16 | 13 | −0.82 | 20 | −21.05 | 13 | −0.75 | 19 | 0.20 | 1.0E+00 | 19 | |
428 | Laufer | 8 | −26.15 | 14 | 0.73 | 5 | −26.30 | 16 | 0.71 | 5 | 0.74 | 1.3E−08 | 5 | |
476 | Foldit | 9 | −26.28 | 15 | 0.41 | 6 | −26.06 | 14 | 0.44 | 6 | 0.70 | 1.3E−07 | 6 | |
342 | Anthropic Dreams | 9 | −26.66 | 16 | 0.37 | 7 | −26.25 | 15 | 0.42 | 7 | 0.69 | 1.7E−06 | 7 | |
186 | Void_Crushers | 9 | −28.97 | 17 | 0.11 | 9 | −30.68 | 17 | −0.08 | 10 | 0.58 | 2.8E−02 | 9 | |
32 | Legato | 21 | −31.74 | 18 | −1.23 | 21 | −30.83 | 18 | −1.18 | 21 | 0.05 | 1.0E+00 | 20 | |
40 | GoScience | 9 | −34.98 | 19 | −0.55 | 15 | −35.79 | 19 | −0.64 | 16 | 0.33 | 1.0E+00 | 14 | |
361 | Contenders | 7 | −37.33 | 20 | −0.48 | 14 | −36.66 | 20 | −0.38 | 13 | 0.33 | 1.0E+00 | 15 | |
| ||||||||||||||
C | Ts | |||||||||||||
64 | BAKER*1,2,3,4 | 19 | 25.96 | 1 | 1.37 | 1 | 25.21 | 1 | 1.33 | 1 | 0.90 | 2.9E−55 | 3 | |
44 | LEER1,2,3,4 | 19 | 22.24 | 2 | 1.17 | 2 | 22.33 | 3 | 1.18 | 3 | 0.92 | 2.8E−60 | 1 | |
169 | LEE1,2,3,4 | 19 | 22.17 | 3 | 1.17 | 3 | 22.52 | 2 | 1.19 | 2 | 0.92 | 2.8E−60 | 2 | |
38 | nns3 | 19 | 21.11 | 4 | 1.11 | 4 | 21.69 | 4 | 1.14 | 4 | 0.85 | 1.6E−41 | 4 | |
420 | MULTICOM-CLUSTER | 19 | 1.55 | 5 | 0.08 | 10 | 0.82 | 6 | 0.05 | 10 | 0.62 | 1.3E−05 | 7 | |
276 | FLOUDAS A4 | 18 | 0.98 | 6 | 0.17 | 9 | 1.41 | 5 | 0.19 | 7 | 0.65 | 1.3E−07 | 6 | |
357 | STAP | 19 | 0.33 | 7 | 0.02 | 11 | −0.25 | 7 | −0.01 | 11 | 0.60 | 3.0E−04 | 8 | |
428 | Laufer3 | 10 | −7.75 | 8 | 1.02 | 5 | −8.04 | 8 | 1.00 | 5 | 0.87 | 1.2E−26 | 5 | |
345 | FUSION | 19 | −10.38 | 9 | −0.55 | 18 | −10.28 | 10 | −0.54 | 20 | 0.32 | 1.0E+00 | 18 | |
41 | MULTICOM-NOVEL | 19 | −11.26 | 10 | −0.59 | 20 | −10.18 | 9 | −0.54 | 19 | 0.34 | 1.0E+00 | 17 | |
65 | Jones-UCL | 19 | −12.33 | 11 | −0.65 | 21 | −11.60 | 11 | −0.61 | 21 | 0.25 | 1.0E+00 | 19 | |
162 | McGuffin | 19 | −12.65 | 12 | −0.67 | 22 | −16.73 | 14 | −0.88 | 23 | 0.25 | 1.0E+00 | 20 | |
80 | MeilerLab | 19 | −13.55 | 13 | −0.71 | 23 | −13.24 | 12 | −0.70 | 22 | 0.24 | 1.0E+00 | 21 | |
310 | MUFOLD-R | 15 | −13.92 | 14 | −0.39 | 16 | −13.60 | 13 | −0.37 | 17 | 0.39 | 1.0E+00 | 13 | |
32 | Legato | 19 | −18.52 | 15 | −0.97 | 24 | −18.44 | 16 | −0.97 | 24 | 0.12 | 1.0E+00 | 22 | |
287 | RBO-Human | 11 | −18.80 | 16 | −0.25 | 13 | −18.38 | 15 | −0.22 | 13 | 0.49 | 5.6E−01 | 12 | |
219 | Sternberg | 19 | −20.19 | 17 | −1.06 | 25 | −20.23 | 17 | −1.06 | 25 | 0.08 | 1.0E+00 | 23 | |
476 | Foldit | 8 | −20.39 | 18 | 0.20 | 7 | −20.81 | 19 | 0.15 | 9 | 0.59 | 1.4E−02 | 10 | |
342 | Anthropic_Dreams | 8 | −20.53 | 19 | 0.18 | 8 | −20.73 | 18 | 0.16 | 8 | 0.61 | 3.8E−03 | 9 | |
186 | Void_Crushers | 8 | −22.71 | 20 | −0.09 | 12 | −22.77 | 20 | −0.10 | 12 | 0.50 | 5.0E−01 | 11 | |
479 | RBO_Aleph | 9 | −24.98 | 21 | −0.55 | 19 | −24.47 | 21 | −0.50 | 18 | 0.35 | 1.0E+00 | 16 | |
40 | GoScience | 8 | −25.06 | 22 | −0.38 | 14 | −24.81 | 22 | −0.35 | 16 | 0.36 | 1.0E+00 | 14 | |
361 | Contenders | 6 | −28.70 | 23 | −0.45 | 17 | −28.10 | 23 | −0.35 | 15 | 0.35 | 1.0E+00 | 15 | |
| ||||||||||||||
D | Tx | |||||||||||||
64 | BAKER*1,2 | 4 | 6.20 | 1 | 1.55 | 1 | 5.85 | 1 | 1.46 | 1 | 0.95 | 3.1E−14 | 1 | |
287 | RBO-Human | 4 | 3.31 | 2 | 0.83 | 2 | 3.07 | 2 | 0.77 | 2 | 0.78 | 6.1E−06 | 3 | |
169 | LEE1 | 4 | 3.06 | 3 | 0.76 | 3 | 2.91 | 4 | 0.73 | 4 | 0.82 | 3.8E−07 | 2 | |
38 | nns1 | 4 | 3.01 | 4 | 0.75 | 4 | 2.95 | 3 | 0.74 | 3 | 0.75 | 6.7E−05 | 4 | |
162 | McGuffin | 4 | 1.96 | 5 | 0.49 | 5 | −0.74 | 7 | −0.19 | 9 | 0.65 | 1.4E−02 | 6 | |
479 | RBO Aleph | 4 | 0.76 | 6 | 0.19 | 7 | 1.45 | 5 | 0.36 | 5 | 0.53 | 3.5E−01 | 8 | |
65 | Jones-UCL | 4 | 0.49 | 7 | 0.12 | 8 | 0.21 | 6 | 0.05 | 7 | 0.58 | 1.2E−01 | 7 | |
420 | MULTICOM-CLUSTER | 4 | −1.45 | 8 | −0.36 | 10 | −1.35 | 9 | −0.34 | 11 | 0.42 | 8.8E−01 | 10 | |
357 | STAP | 4 | −1.55 | 9 | −0.39 | 11 | −0.96 | 8 | −0.24 | 10 | 0.30 | 1.0E+00 | 14 | |
276 | FLOUDAS A4 | 4 | −1.58 | 10 | −0.40 | 12 | −1.46 | 10 | −0.37 | 12 | 0.38 | 9.5E−01 | 11 | |
345 | FUSION | 4 | −1.86 | 11 | −0.47 | 13 | −1.77 | 12 | −0.44 | 15 | 0.35 | 9.9E−01 | 13 | |
32 | Legato | 4 | −1.91 | 12 | −0.48 | 14 | −1.70 | 11 | −0.43 | 13 | 0.38 | 9.5E−01 | 12 | |
157 | FLOUDAS_A1 | 4 | −2.33 | 13 | −0.58 | 16 | −2.97 | 14 | −0.74 | 17 | 0.27 | 1.0E+00 | 15 | |
80 | MeilerLab | 3 | −2.55 | 14 | −0.18 | 9 | −1.89 | 13 | 0.04 | 8 | 0.44 | 7.7E−01 | 9 | |
42 | TASSER | 2 | −3.49 | 15 | 0.25 | 6 | −3.33 | 15 | 0.33 | 6 | 0.73 | 8.1E−03 | 5 | |
219 | Sternberg | 3 | −4.01 | 16 | −0.67 | 17 | −3.93 | 16 | −0.64 | 16 | 0.23 | 1.0E+00 | 16 | |
41 | MULTICOM-NOVEL | 4 | −4.70 | 17 | −1.18 | 18 | −3.95 | 17 | −0.99 | 19 | 0.00 | 1.0E+00 | 17 |
Same ranking by GDT_TS
Significant Best FM score by T-test
Significant Best FM score by Bootstrap
Significant Best TBM score by T-test
Significant Best TBM score by Bootstrap
Positive FM-style scores and win/loss fraction >=0.5 are shaded; top-ranked groups by best model scores (and any groups not significantly different using Bootstraps and T-tests on FM-style and TBM-style scores for Tc and Ts) are bolded.