Fig. 3.
Average sequence Distinctiveness as a predictor of future changes in the prevalence of a lineage. (a) Correlation of the Distinctiveness of a lineage with its competitiveness in the geographic region (across 294 country/time data points). The Distinctiveness of a lineage, measured relative to the average of all sequences collected from the same region during the same time period, is predictive of future changes in prevalence. The ROC curve is shown for predicting an increase in prevalence of more than 20 percentage points between an initial 28-d time window and a subsequent 28-d time window starting 56 d in the future. (b) Distinctiveness calculated for only a subset of 66 positions involved in neutralizing antibody binding (orange) retains most of the predictive capacity. These positions were found to contribute disproportionately to the overall Distinctiveness, with ∼20-fold higher average Distinctiveness than average positions.
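The ROC analysis described in this caption can be illustrated with a minimal sketch. It assumes each data point carries a Distinctiveness score and the lineage prevalence (in percent) in the initial and the subsequent window. The function names, the record layout, and the toy values below are hypothetical and are not taken from the study; the area under the ROC curve is computed here with the standard pairwise (Mann-Whitney) formulation, which may differ from the procedure actually used for the figure.

```python
def label_increase(prev_initial, prev_future, threshold=20.0):
    """Return True if prevalence rose by more than `threshold` percentage points.

    Prevalences are given in percent (0-100). The 20-point default follows
    the caption; the function itself is an illustrative assumption.
    """
    return (prev_future - prev_initial) > threshold


def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U).

    Equals the probability that a randomly chosen positive example has a
    higher score than a randomly chosen negative one; ties count as 0.5.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, NOT values from the study:
# (distinctiveness, prevalence % in initial window, prevalence % in later window)
records = [
    (0.9, 5.0, 40.0),
    (0.7, 10.0, 35.0),
    (0.6, 20.0, 25.0),
    (0.4, 30.0, 55.0),
    (0.3, 50.0, 45.0),
    (0.1, 60.0, 30.0),
]

scores = [d for d, _, _ in records]
labels = [label_increase(p0, p1) for _, p0, p1 in records]
auc = roc_auc(scores, labels)
print(f"AUC = {auc:.3f}")
```

An AUC near 0.5 would indicate that Distinctiveness carries no information about future prevalence changes, whereas values approaching 1 indicate that lineages with higher Distinctiveness are consistently the ones that go on to increase.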
