Appendix 1—table 4. Statistical comparison of mean proportions of a flight spent exploring versus exploiting across treatments (experimental pairs, solo, and fixed-pairs controls) for the first 12 releases (generation 1) and for releases 13–60 (generation 2–5).
Entries report the proportion of exploration vs. exploitation for pairs of treatments as well as the results of two-sided two-sample Whitney–Mann–Wilcoxon rank-sum tests with continuity correction (p-value and W statistic) for differences in proportion of exploration. Significant p-values are reported in bold. Results of testing for the proportion of exploitation are equivalent and not repeated below.
| Releases | Dataset | Solo control | Fixed-pairs control |
|---|---|---|---|
| 1–12 | Experimental (generation 1) | Row: 36.7% vs. 63.3%Col: 34.2% vs. 65.8% | Row: 36.7% vs. 63.3%Col: 51.7% vs. 48.3% p <. 001 (W = 2230) |
| Solo control | – | Row: 36.7% vs. 63.3%Col: 34.2% vs. 65.8% p <.001 (W=1837) |
|
| 13–60 | Experimental (generations 2–5) | Row: 32.9% vs. 67.1%Col: 15.7% vs. 84.3% | Row: 32.9% vs. 67.1%Col: 29.3% vs. 70.7% p=.0456 (W=50472) |
| Solo control | – | Row: 15.7% vs. 84.3%Col: 29.3% vs. 70.7% p<.001 (W=31517) |