Cell Reports Methods. 2023 Jul 25;3(8):100540. doi: 10.1016/j.crmeth.2023.100540

Using interpretable machine learning to extend heterogeneous antibody-virus datasets

Tal Einav 1,2,4,5,, Rong Ma 3,4
PMCID: PMC10475791  PMID: 37671020

Summary

A central challenge in biology is to use existing measurements to predict the outcomes of future experiments. For the rapidly evolving influenza virus, variants examined in one study will often have little to no overlap with other studies, making it difficult to discern patterns or unify datasets. We develop a computational framework that predicts how an antibody or serum would inhibit any variant from any other study. We validate this method using hemagglutination inhibition data from seven studies and predict 2,000,000 new values ± uncertainties. Our analysis quantifies the transferability between vaccination and infection studies in humans and ferrets, shows that serum potency is negatively correlated with breadth, and provides a tool for pandemic preparedness. In essence, this approach enables a shift in perspective when analyzing data from “what you see is what you get” into “what anyone sees is what everyone gets.”

Keywords: antibody-virus interactions, influenza, matrix completion, imputation, error estimation, serology, hemagglutination inhibition

Graphical abstract


Highlights

  • Predict unmeasured antibody-virus interactions across multiple studies

  • Empower direct comparisons between studies through expanded virus panels

  • Determine which criteria (age, exposure history) lead to distinct responses

  • Assess how accurately animal data predict the human antibody response

Motivation

To quantify the immune response against a rapidly evolving virus, groups routinely measure antibody inhibition against many virus variants. Over time, the variants being studied change, and there is a need for methods that infer missing interactions and distinguish between confident predictions and hallucinations. Here, we develop a matrix completion framework that uses patterns in antibody-virus inhibition to infer the value and confidence of unmeasured interactions. This same approach can combine general datasets—from drug-cell interactions to user movie preferences—that have partially overlapping features.


Einav and Ma develop a framework to infer unmeasured antibody-virus interactions. If three studies measure antibody inhibition against viruses 1–30, 10–40, and 20–50, respectively, then their approach predicts how any antibody inhibits all 50 viruses. As more datasets are combined, the number of predictions rapidly increases, and prediction accuracy improves.

Introduction

Our understanding of how antibody-mediated immunity drives viral evolution and escape relies upon painstaking measurements of antibody binding, inhibition, or neutralization against variants of concern.1 While antibodies can cross-react and inhibit multiple variants, viral evolution slowly degrades such immunity, leading to periodic reinfections that elicit new antibodies. To get an accurate snapshot of this complex response, we must not only measure inhibition against currently circulating strains but also against historical variants.2,3

Every antibody-virus interaction is unique because (1) the antibody response (serum) changes even in the absence of viral exposure and (2) for rapidly evolving viruses such as influenza, the specific variants examined in one study will often have little to no overlap with other studies (Figure 1). This lack of crosstalk hampers our ability to comprehensively characterize viral antigenicity, predict the outcomes of viral evolution, and determine the best composition for the annual influenza vaccine.4

Figure 1. Challenges of comparing antibody-virus datasets

(A) We develop a framework that predicts antibody responses (e.g., binding, hemagglutination inhibition [HAI], or neutralization) of any serum against viral variants from any other dataset, enabling direct cross-study comparison.

(B) Because each serum is unique and virus panels often only partially overlap, these expanded measurements are necessary to characterize the limits of the antibody response or quantify tradeoffs between key features, such as potency (the strength of a response) vs. breadth (how many viruses are inhibited).

In this work, we develop a new cross-study matrix completion algorithm that leverages patterns in antibody-virus inhibition data to infer unmeasured interactions. Specifically, we demonstrate that multiple datasets can be combined to predict the behavior of viruses that were entirely absent from one or more datasets (e.g., Figure 2A, predicting values for the green viruses in dataset 2 and the gray viruses in dataset 1). Whereas past efforts could only predict values for partially observed viruses within a single dataset (i.e., predicting the red squares for the blue/gray viruses in dataset 2 or the green/blue viruses in dataset 1),5,6,7 here we predict the behavior of viruses that do not have a single measurement in a dataset.

Figure 2. Combining datasets to predict values and uncertainties for missing viruses

(A) Schematic of data availability; two studies measure antibody responses against overlapping viruses (shades of blue) as well as unique viruses (green/gray). Studies may have different fractions of missing values (dark-red boxes) and measured values (gray). To test whether virus behavior can be inferred across studies, we predict the titers of a virus in dataset 1 (V0, gold squares), using measurements from the overlapping viruses (V1–Vn) as features in a random forest model.

(B) We train a decision tree model using a random subset of antibodies and viruses from dataset 2 (boxed in purple), cross-validate against the remaining antibody responses in dataset 2, and compute the root-mean-square error (RMSE, denoted by σTraining).

(C) Multiple decision trees are trained, and the average from the 5 trees with the lowest error is used as the model going forward. Applying this model to dataset 1 (which was not used during training) yields the desired predictions, whose RMSE is given by σActual. We repeat this process, withholding each virus in every dataset.

(D) To estimate the prediction error σActual (which we are not allowed to directly compute because V0’s titers are withheld), we define the transferability relation f2→1 between the training error σTraining in dataset 2 and actual error σActual in dataset 1 using the decision trees that predict viruses V1–Vn (without using V0). Applying this relation to the training error, f2→1(σTraining), estimates σActual for V0.

Algorithms that predict the behavior of large virus panels are crucial because they render the immunological landscape in higher resolution, helping to reveal which viruses are potently inhibited and which escape antibody immunity.3,4 For example, polyclonal human sera that strongly neutralize one virus may exhibit 10× weaker neutralization against a variant with one additional mutation.8 Given the immense diversity and rapid evolution of viruses, it behooves us to pool together measurements from different studies and build a more comprehensive description of serum behavior.

Even when each dataset is individually complete, many interactions can still be inferred by combining studies. The seven datasets examined in this work measured 60%–100% of interactions between their specific virus panel and sera, but against an expanded virus panel containing all variants, fewer than 10% of interactions were measured. Moreover, the missing entries are highly structured, with entire columns (representing viruses; Figure 2A) missing from each dataset. This introduces unique challenges because most matrix completion or imputation methods require missing entries to be randomly distributed,5,9,10,11,12,13 and the few methods tailored for structured missing data focus on special classes of generative models that are less effective in this context.14,15,16 In contrast, we construct a framework that harnesses the specific structure of these missing values, enabling us to predict over 2,000,000 new values comprising the remaining 90% of interactions.

The key feature we develop that enables matrix completion across studies is error quantification. Despite numerous algorithms to infer missing values, only a few methods exist that can estimate the error of these predictions under the assumption that missing values are randomly distributed,17,18 and to our knowledge, no methods can quantify error for general patterns of missing data. Because we do not know a priori whether datasets can inform one another, it is crucial to estimate the confidence of cross-study predictions. Our framework does so using a data-driven approach to quantify the individual error of each prediction so that users can focus on high-confidence inferences (e.g., those with ≤4-fold error) or search for additional datasets that would further reduce this uncertainty.

Our results provide guiding principles in data acquisition and promote the discovery of new mechanisms in several key ways: (1) Existing antibody-virus datasets can be unified to predict each serum against any virus, providing a massive expansion of data and fine-grained resolution of these antibody responses. (2) This expanded virus panel enables an unprecedented direct comparison of human ↔ ferret and vaccination ↔ infection studies, quantifying how distinct the antibody responses are in each category. (3) Using the expanded data, we explore the relation between two key features of the antibody response, showing the tradeoff between potency and breadth. (4) We demonstrate an application for pandemic preparedness, where the inhibition of a new variant measured in one study is immediately extrapolated to other datasets. (5) Our approach paves the way to rationally design virus panels in future studies, saving time and resources by measuring a substantially smaller set of viruses. In particular, we determine which viruses will be maximally informative and quantify the benefits of measuring each additional virus.

Although this work focuses on antibody-virus inhibition measurements for influenza, it readily generalizes to other viruses, other assays (e.g., using binding or neutralization), and more general applications involving intrinsically low-dimensional datasets.

Results

The low dimensionality of antibody-virus interactions empowers matrix completion

Given the vast diversity of antibodies, it is easy to imagine that serum responses cannot inform one another. Indeed, many factors, including age, geographic location, frequency/type of vaccinations, and infection history, shape the antibody repertoire and influence how it responds to a vaccine or a new viral threat.19,20,21,22,23

Yet much of the heterogeneity of antibody responses found through sequencing24 collapses when we consider functional behavior such as binding, inhibition, or neutralization against viruses.25,26 Previous work has shown that antibody-virus inhibition data are intrinsically low dimensional,27 which spurred applications ranging from antigenic maps to the recovery of missing values from partially observed data.5,6,7,28 However, these efforts have almost exclusively focused on individual datasets of ferret sera generated under controlled laboratory conditions, circumventing the many obstacles of predicting across heterogeneous human studies.

In the following sections, we develop a matrix completion algorithm that predicts measurements for a virus in dataset 1 (e.g., the virus-of-interest in Figure 2A, boxed in gold) by finding universal relationships between the other overlapping viruses and the virus-of-interest in dataset 2 and applying them to dataset 1. We first demonstrate the accuracy of matrix completion by withholding all hemagglutination inhibition (HAI) measurements from one virus in one dataset (Figure 2A, gold boxes) and using the other datasets to generate predictions ± errors, where each error quantifies the uncertainty of a prediction. Although we seek accurate predictions with low estimated error, it may be impossible to accurately predict some interactions (e.g., measurements of viruses from 2000–2010 may not be able to predict a distant virus from 1970), and those error estimates should be larger to faithfully reflect this uncertainty. After validating our approach on seven large serological studies, we apply matrix completion to greatly extend their measurements.

Cross-study matrix completion using a random forest

We first predict virus behavior between two studies before considering multiple studies. Figure 2 and Box 1 summarize the leave-one-out analysis, where a virus-of-interest V0 is withheld from one dataset (Figure 2A, blue virus boxed in gold). We create multiple decision trees using a subset of overlapping viruses V1, V2…Vn as features and a subset of antibody responses within dataset 2 for training (STAR Methods). These trees are cross-validated using the remaining antibody responses from dataset 2 to quantify each tree’s error σTraining, and we predict V0 in dataset 1 using the average of the values ± errors from the 5 trees with the lowest error (Figures 2B and 2C; Box 1).

Box 1. Predicting virus behavior (value ± error) across studies.

Input:

  • Dataset-of-interest D0 containing virus-of-interest V0 whose measurements we predict

  • Other datasets {Dj}, each containing V0 and at least 5 viruses Vj,1, Vj,2… that overlap with the D0 virus panel, used to extrapolate virus behavior

  • Antibody responses Aj,1, Aj,2… in each dataset Dj. When j ≠ 0, we only consider antibody responses with non-missing values against V0

Steps:

  • 1.
    For each Dj, create nTrees = 50 decision trees predicting V0 based on nFeatures = 5 other viruses and a fraction fSamples = 3/10 of sera
    • For robust training, we restrict attention to features with ≥80% non-missing values. If fewer than nFeatures viruses in Dj satisfy this criterion, do not grow decision trees for this dataset
    • Bootstrap sample (with replacement) both the viruses and antibody responses
    • Data are analyzed in log10 and row-centered on the features (i.e., for each antibody response in either the training set Dj or testing set D0, subtract the mean of the log10[titers] for the nFeatures viruses using all non-missing measurements) to account for systematic shifts between datasets. Row-centering is undone once decision trees make their predictions by adding the serum-dependent mean
    • Compute the cross-validation root-mean-square error (RMSE, σTraining) of each tree using the remaining 1 − fSamples fraction of samples in Dj
  • 2.
    Predict the (un-row-centered) values of V0 in D0 using the nBestTrees = 5 decision trees with the lowest σTraining
    • Trees only make predictions for sera in D0 where all nFeatures viruses are non-missing
    • Predict μj ± σj for each antibody response
      • μj = (mean value for nBestTrees predictions)
      • σj = fDj→D0(mean σTraining for nBestTrees trees), where the transferability fDj→D0 is computed by predicting Vj,1, Vj,2… in D0 using Dj (see Box 2)
  • 3.

    Combine predictions for V0 in D0 with all other datasets {Dj} using [Σj(μj/σj²)] / [Σj(1/σj²)] ± 1/[Σj(1/σj²)]^(1/2) (Equation 1)
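For concreteness, the sketch below illustrates the Box 1 procedure in Python, assuming log10 HAI titers are stored in pandas DataFrames with sera as rows and viruses as columns. It is a simplified illustration, not the released code: the helper names (grow_trees, predict_in_D0), the use of scikit-learn's DecisionTreeRegressor, and the sampling details are our assumptions, and the returned mean σTraining would still need to be passed through the transferability of Box 2 to obtain σj.

```python
# Minimal sketch of the Box 1 prediction step (illustrative, not the released code).
# Assumes log10 HAI titers stored in pandas DataFrames: rows = sera, columns = viruses.
import numpy as np
import pandas as pd
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)

def grow_trees(Dj, v0, feature_pool, n_trees=50, n_features=5, f_samples=0.3):
    """Train decision trees in Dj that predict virus v0 from viruses in feature_pool
    (the viruses shared between Dj and the target dataset D0)."""
    # Keep only candidate features with >=80% non-missing values in Dj (Box 1, step 1)
    candidates = [v for v in feature_pool if v != v0 and Dj[v].notna().mean() >= 0.8]
    if len(candidates) < n_features:
        return []                                 # too few well-measured viruses in Dj
    sera = Dj.index[Dj[v0].notna()]               # only sera with a measured titer for v0
    trees = []
    for _ in range(n_trees):
        feats = list(rng.choice(candidates, n_features, replace=False))
        train = rng.choice(sera, int(f_samples * len(sera)), replace=True)  # bootstrap sera
        test = sera.difference(train)
        sub = Dj.loc[train, feats + [v0]].dropna()
        # Row-center: subtract each serum's mean log10 titer over the feature viruses
        mu_train = sub[feats].mean(axis=1)
        tree = DecisionTreeRegressor().fit(sub[feats].sub(mu_train, axis=0),
                                           sub[v0] - mu_train)
        # Cross-validate on the remaining sera in Dj to get sigma_training (RMSE)
        held = Dj.loc[test, feats + [v0]].dropna()
        mu_held = held[feats].mean(axis=1)
        pred = tree.predict(held[feats].sub(mu_held, axis=0)) + mu_held
        trees.append((np.sqrt(np.mean((pred - held[v0]) ** 2)), feats, tree))
    return trees

def predict_in_D0(trees, D0, n_best=5):
    """Average the n_best lowest-error trees on the target dataset D0 (log10 titers)."""
    best = sorted(trees, key=lambda t: t[0])[:n_best]
    preds = []
    for _, feats, tree in best:
        rows = D0[feats].dropna()                 # predict only where all features exist
        mu = rows.mean(axis=1)                    # undo row-centering after predicting
        preds.append(tree.predict(rows.sub(mu, axis=0)) + mu)
    mean_sigma_training = np.mean([s for s, _, _ in best])
    # mean_sigma_training must still be passed through the transferability f (Box 2)
    return pd.concat(preds, axis=1).mean(axis=1), mean_sigma_training

# Example usage (names hypothetical):
# shared = [v for v in D2.columns if v in D1.columns and v != "A/Auckland/5/1996"]
# trees = grow_trees(D2, "A/Auckland/5/1996", shared)
# values, mean_sigma_training = predict_in_D0(trees, D1)
```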

One potential pitfall of this approach is that the estimated error σTraining derived from dataset 2 will almost always underestimate the true error for these predictions (σActual) in dataset 1 because the antibody responses in both studies may be very distinct (e.g., sera collected decades apart or from people/animals with different infection histories).

To correct for this effect, we estimate an upper bound for σActual by computing the transferability f2→1(x), which quantifies the accuracy of a relation found in dataset 2 (e.g., V0 = V1 + V2, although complex non-linear relations are allowed) when applied to dataset 1. More precisely, if a relation has error σTraining in dataset 2 and σActual in dataset 1, then the transferability gives an upper bound, f2→1(σTraining from dataset 2) ≥ σActual in dataset 1, that holds for the majority of decision trees. Thus, a low f2→1(σTraining from dataset 2) guarantees accurate predictions.

To calculate the transferability f2→1, we repeat the above algorithm, but rather than inferring values for V0, we predict each of the overlapping viruses V1–Vn measured in both datasets whose σTraining and σActual can be directly computed (Figure 2D; Box 2). We found that transferability was well characterized by a simple linear relationship (Figure S1; note that f2→1 represents an upper bound and not an equality). Finally, we apply this relation to the training error for virus V0 to estimate the prediction error in dataset 1, σPredict = f2→1(σTraining). In this way, both values and errors for V0 are inferred using a generic, data-driven approach that can be applied to diverse datasets.

Box 2. Computing the transferability fDj→D0 between datasets.

Input:

  • Datasets {Dj} that collectively include viruses V1, V2… Each virus must be included in at least two datasets

Steps:

  • For each dataset D0 in {Dj}, for each virus V0 in D0, for every other dataset Dj containing V0

  • Create nTrees = 50 decision trees predicting V0 based on nFeatures = 5 other viruses, as described in Box 1

  • For each tree, store the following:
    • D0, V0, and Dj used to construct the tree
    • Viruses used to train the tree
    • RMSE σTraining on the remaining 1 − fSamples fraction of samples in Dj
    • Predictions of V0’s values in D0
    • True RMSE σActual of these predictions for V0 in D0
  • When predicting V0 using Dj→D0 in Box 1, we compute fDj→D0 between σTraining and σActual by predicting the other viruses V1, V2…Vn that overlap between Dj and D0 (making sure to only use decision trees that exclude the withheld V0)
    • From the forest of decision trees above, find the top 10 trees for each virus predicted between Dj→D0 and plot σTraining vs. σActual for all trees (see Figure S1)
    • Find the best-fit line using perpendicular offsets, y = ax + b, where x = σTraining and y = σActual. Since there is scatter about this best-fit line, and because it is better to overestimate rather than underestimate error, we add a correction factor c = (RMSE between σActual and ax + b). Lastly, we expect that a decision tree’s error in another dataset will always be at least as large as its error on the training set (σActual ≥ σTraining), and hence we define fDj→D0(σTraining) = max(aσTraining + b + c, σTraining). This max term is important in a few cases where fDj→D0 has a very steep slope but some decision trees have small σTraining
    • Datasets with high transferability will have fDj→D0(σTraining) ≈ σTraining, meaning that viruses can be removed from D0 and accurately inferred from Dj. In contrast, two datasets with low transferability will have a nearly vertical line, ∂fDj→D0/∂σTraining ≫ 1, signifying that viruses will be poorly predicted between these studies
    • In the chord diagrams (Figures 4B and 5B), the width of the arc between datasets Dj and D0 is proportional to (∂fDj→D0/∂σTraining)⁻¹
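A minimal sketch of this transferability fit is given below, approximating the perpendicular-offset (total least squares) line with an SVD; the function name and the structure of the (σTraining, σActual) pairs are assumptions made for illustration.

```python
# Sketch of the Box 2 transferability fit (illustrative names and structure).
# pairs: list of (sigma_training, sigma_actual) values, in log10 units, collected from the
# top trees that predict the overlapping viruses V1...Vn between datasets Dj -> D0.
import numpy as np

def fit_transferability(pairs):
    x, y = np.asarray(pairs, dtype=float).T
    # Best-fit line with perpendicular (total least squares) offsets via an SVD
    xc, yc = x - x.mean(), y - y.mean()
    _, _, vt = np.linalg.svd(np.column_stack([xc, yc]), full_matrices=False)
    nx, ny = vt[-1]                      # normal vector of the best-fit line
    a = -nx / ny                         # slope
    b = y.mean() - a * x.mean()          # intercept
    # Correction factor: RMSE about the line, so error is over- rather than underestimated
    c = np.sqrt(np.mean((y - (a * x + b)) ** 2))
    def f(sigma_training):
        # f_{Dj->D0}(sigma) = max(a*sigma + b + c, sigma): never below the training error
        return np.maximum(a * sigma_training + b + c, sigma_training)
    return f, a

# Example: sigma_predict for the withheld virus V0 from its mean training error (log10 units)
# f, slope = fit_transferability(pairs); sigma_predict = f(0.3)
```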

Leave one out: Inferring virus behavior without a single measurement

To assess matrix completion across studies, we applied it to three increasingly difficult scenarios: (1) between two highly similar human vaccination studies, (2) between a human infection and human vaccination study, and (3) between a ferret infection and human vaccination study. We expected prediction accuracy to decrease as the datasets become more distinct, resulting in both a larger error (σActual) and larger estimated uncertainty (σPredict).

For these predictions, we utilized the Fonville influenza datasets consisting of six studies: four human vaccination studies (datasetVac,1–4), one human infection study (datasetInfect,1), and one ferret infection study (datasetFerret).20 In each study, sera were measured against a panel of H3N2 viruses using HAI. Collectively, these studies contained 81 viruses, and each virus was measured in at least two studies.

We first predicted values for the virus V0 = A/Auckland/5/1996 in the most recent vaccination study (datasetVac,4) using data from another vaccination study (datasetVac,3) carried out in the preceding year and in the same geographic location (Table S1). After training our decision trees, we found that the two studies had the best possible transferability (σPredict = fVac,3→Vac,4(σTraining) ≈ σTraining), suggesting that there is no penalty in extrapolating virus behavior between these datasets. More precisely, if there exist five viruses, V1–V5, that can accurately predict V0’s measurements in datasetVac,3, then V1–V5 will predict V0 equally well in datasetVac,4.

Indeed, we found multiple such decision trees that predicted V0’s HAI titers with σPredict = 2.0-fold uncertainty, meaning that each titer t is expected to lie between t/2 and t·2 with 68% probability (or, equivalently, that log10(t) has a standard deviation of log10(2)) (top panel in Figure 3A, gray bands represent σPredict). Notably, this estimated uncertainty closely matched the true error σActual = 1.7-fold. To put these results into perspective, the HAI assay has roughly 2-fold error (i.e., repeated measurements differ by 2-fold 50% of the time and by 4-fold 10% of the time; STAR Methods), implying that these predictions are as good as possible given experimental error.
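As a quick sanity check of this fold-error interpretation (an illustrative snippet, not part of the original analysis), a 2-fold uncertainty corresponds to a Gaussian in log10 space with standard deviation log10(2), so a titer falls within one such fold of its expected value about 68% of the time:

```python
# Fold-error interpretation check: with sigma = log10(2), a titer lies between t/2 and 2t
# (i.e., within one sigma in log10 space) about 68% of the time.
import numpy as np
from scipy.stats import norm

sigma = np.log10(2)                                   # 2-fold error in log10 units
prob = norm.cdf(sigma, scale=sigma) - norm.cdf(-sigma, scale=sigma)
print(f"P(t/2 <= titer <= 2t) = {prob:.3f}")          # ~0.683
```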

Figure 3. Predicting virus behavior between two datasets

Example predictions between two Fonville studies. Top: plots comparing predicted and withheld HAI measurements (which take the discrete values 5, 10, 20…). Estimated error is shown in two ways: (1) as vertical lines emanating from each point and (2) by the diagonal gray bands showing σPredict. Bottom: histograms of the standardized absolute prediction errors compared with a standard folded Gaussian distribution (black dashed line). The fraction of predictions within 1.0σ is shown at the top left, which can be compared with the expected 68% for the standard folded Gaussian distribution.

(A) Predicting A/Auckland/5/1996 between two human vaccination studies (datasetsVac,3→Vac,4).

(B) Predicting A/Netherlands/620/1989 between a human infection and human vaccination study (datasetsInfect,1→Vac,4).

(C) Predicting A/Victoria/110/2004 between a ferret infection and human vaccination study (datasetsFerret→Vac,4).

When we inferred every other virus between these vaccine studies (datasetsVac,3→Vac,4), we consistently found the same highly accurate predictions: σPredict ≈ σActual ≈ 2-fold (Figure S2A). As an alternative way of quantifying error, we plotted the distribution of predictions within 0.5, 1.0, 1.5… standard deviations from the measurement, which we compare against a folded Gaussian distribution (Figure 3A, bottom). For example, 82% of predictions were within 1 standard deviation, somewhat larger than the 68% expected for a Gaussian, confirming that prediction error was slightly overestimated.

We next predicted values for V0 = A/Netherlands/620/1989 between a human infection and vaccination study (datasetInfect,1→Vac,4). In this case, the predicted values were also highly accurate with true error σActual = 2.3-fold (Figure 3B; remaining viruses predicted in Figure S2B). When quantifying the uncertainty of these predictions, we found worse transferability of virus behavior (fInfect,1→Vac,4(σTraining) ≈ 2.8σTraining, where the larger prefactor of 2.8 indicates less transferability; STAR Methods), and hence we overestimated the prediction error as σPredict = 4.3-fold. Last, when we predicted values for V0 = A/Victoria/110/2004 between a ferret infection and human vaccination study (datasetFerret→Vac,4), our predictions had a larger true error, σActual = 4.4-fold (Figure 3C), than the inferences between human data, as expected. Moreover, poor transferability between these datasets led to a poorer guarantee of prediction accuracy, σPredict = 6.5-fold, indicative of larger variability when predicting between ferret and human data.

Importantly, we purposefully constructed σPredict to overestimate σActual when datasets X and Y exhibit disparate behaviors, since matching the average distribution of σPredict to σActual could lead to an unwanted underestimation of the true error. With our approach, a low σPredict guarantees accurate predictions. As we show in the following section, the estimated values and error become more precise when we use multiple datasets to infer virus behavior.

Combining influenza datasets to predict 200,000 measurements with ≤3-fold error

When multiple datasets are available to predict virus behavior in dataset 1, we obtain predictions ± errors (μj ± σj) from dataset 2→1, dataset 3→1, dataset 4→1… These predictions and their errors are combined using the standard Bayesian approach as

[Σj(μj/σj²)] / [Σj(1/σj²)] ± 1/[Σj(1/σj²)]^(1/2). (Equation 1)

The uncertainty term in this combined prediction has two key features. First, adding any additional dataset (with predictions μk ± σk) can only decrease the uncertainty. Second, if a highly uninformative dataset is added (with σk→∞), it will negligibly affect the cumulative prediction. Therefore, as long as the uncertainty estimates are reasonably precise, datasets do not need to be prescreened before matrix completion, and adding more datasets will always result in lower uncertainty.
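A minimal sketch of Equation 1 in code, combining per-dataset predictions μj ± σj (in log10 units) by inverse-variance weighting; the function name and example numbers are illustrative:

```python
# Sketch of Equation 1: combine per-dataset predictions mu_j +/- sigma_j (log10 units)
# by inverse-variance weighting.
import numpy as np

def combine_predictions(mus, sigmas):
    mus, sigmas = np.asarray(mus, dtype=float), np.asarray(sigmas, dtype=float)
    w = 1.0 / sigmas**2                               # weights: 1/sigma_j^2
    return np.sum(w * mus) / np.sum(w), 1.0 / np.sqrt(np.sum(w))

# Two datasets predicting log10 titers 2.0 +/- 0.3 and 2.3 +/- 0.6: the combined value is
# pulled toward the more certain dataset, and the combined uncertainty shrinks.
print(combine_predictions([2.0, 2.3], [0.3, 0.6]))    # ~(2.06, 0.27)
```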

To test the accuracy of combining multiple datasets, we performed leave-one-out analysis using all six Fonville studies, systematically withholding every virus in each dataset (311 virus-dataset pairs) and predicting the withheld values using all remaining data. Each dataset measured 35–300 sera against 20–75 viruses (with 81 unique viruses across all 6 studies) and had 0.5%–40% missing values (Figure 4A).

Figure 4. Validating prediction ± error quantification across 200,000 measurements

(A) We combined seven influenza datasets spanning human vaccination studies (blue boxes), human infection studies (green), and a ferret infection study (orange). Each virus in every dataset was withheld and predicted using the remaining data (shown schematically in gold in the top left box). We display each dataset (left; missing values in dark red and measurements in grayscale) and the collective predictions for all viruses in that dataset (right; gray diagonal bands show the average predicted error σPredict). The total number of predictions N from each dataset is shown above the scatterplots; when this number of points is too great to show, we subsampled each distribution evenly while maintaining its shape. The inset at the bottom right of each plot shows the probability density function (PDF) histogram of error measurements (y axis) that were within 0.5σ, 1.0σ, 1.5σ… (x-axis) compared with a standard folded Gaussian distribution (black curve). The fraction of predictions within 1.0σ is explicitly written and can be compared with the expected 68% for a standard folded Gaussian.

(B) Chord diagram representing the transferability between datasets. For each arc connecting dataset XY, transferability is shown near the outer circle of Y, with larger width representing greater transferability (Figures S1 and S4; STAR Methods).

Collectively, we predicted the 50,000 measurements across all six Fonville datasets with a low error of σActual = 2.1-fold (between the measured value and the left-hand side of Equation 1). Upon stratifying these predictions by dataset, we found that the four human vaccination studies were predicted with the highest accuracy (datasetsVac,1–4, σActual ≈ 2-fold), while the human infection study had slightly worse accuracy (datasetInfect,1, σActual = 2.7-fold) (Figure 4A). Remarkably, even the least accurate human → ferret predictions had ≤4-fold error on average (σActual = 3.4-fold), demonstrating the potential for these cross-study inferences. As negative controls, permutation testing as well as predictions based solely on virus sequence similarity led to nearly flat predictions with substantially larger error (Figure S3).

In addition to accurately predicting these values, the estimated error closely matched the true error in every human study (σPredict ≈ σActual, datasetsVac,1–4 and datasetInfect,1). The uncertainty of the ferret predictions was slightly overestimated (σPredict = 4.2-fold, datasetFerret); mathematically, this occurs because the upper envelope of σTraining-vs-σActual is steep, making σActual difficult to precisely determine (Figure S1).

We visualize the transferability between datasets using a chord diagram (Figure 4B), where wider bands connecting datasetsXY represent larger transferability (Figure S4; STAR Methods). As expected, there was high transferability between the human vaccine studies carried out in consecutive years (datasetsVac,1↔Vac,2 and datasetsVac,3↔Vac,4, Table S1) but generally less transferability across vaccine studies more than 10 years apart (datasetsVac,1↔Vac,3, datasetsVac,1↔Vac,4, datasetsVac,2↔Vac,3, or datasetsVac,2↔Vac,4).

Transferability is not necessarily symmetric because virus inhibition in dataset X could exhibit all patterns in dataset Y (leading to high transferability from XY) along with unique patterns not seen in dataset Y (resulting in low transferability from YX). For example, all human datasets displayed small transferability to the ferret data, whereas the ferret dataset accurately predicts the human datasetInfect,1; this suggests that the ferret responses show some patterns present in the human data but also display unique phenotypes. As another example, the human infection study carried out from 2007–2012 had high transferability from the human vaccine studies conducted in 2009 and 2010 (datasetVac,3/4→Infect,1) but showed smaller transferability in the reverse direction.

To show the generality of this approach beyond H3N2 HAI data, we predicted H1N1 virus neutralization across two monoclonal antibody datasets, finding an error σActual = 3.0–3.6-fold across measurements spanning two orders of magnitude (Figure S5). While these serum and monoclonal antibody results lay the foundation to compare datasets and quantify the impact of a person’s age, geographic location, and other features on the antibody response, they are not exhaustive characterizations; for example, additional human datasets may be able to more accurately predict these ferret responses. The strength of this approach lies in the fact that cross-study relationships are learned in a data-driven manner. As more datasets are added, the number of predictions between datasets increases, while the uncertainty of these predictions decreases.

Versatility of matrix completion: Predicting values from a distinct assay using only 5 overlapping viruses

To test the limits of our approach, we used the Fonville datasets to predict values from a large-scale serological dataset by Vinh et al.,25 where only 6 influenza viruses were measured against 25,000 sera. This exceptionally long and skinny matrix is challenging for several reasons. First, after entirely withholding a virus, only 5 other viruses remain to infer its behavior. Furthermore, only 4 of the 6 Vinh viruses had exact matches in the Fonville dataset; given this small virus panel, we utilized the remaining 2 viruses by associating them with the closest Fonville virus based on their hemagglutinin sequences (STAR Methods; sequences available in the GitHub repository). Associating functionally distinct viruses will result in poor transferability, and hence the validity of matching nearly homologous viruses can be directly assessed by comparing the transferability with or without these associations.

Second, the Vinh study used protein microarrays to measure serum binding to the HA1 subunit that forms the hemagglutinin head domain. While HAI also measures how antibodies bind to this head domain, such differences in the experimental assay could lead to fundamentally different patterns of virus inhibition, resulting in smaller transferability and higher error.

Third, there were only 1,200 sera across all Fonville datasets, and hence predicting the behavior of 25,000 Vinh sera will be impossible if they all exhibit distinct phenotypes. Indeed, any such predictions would only be possible if this swarm of sera is highly degenerate, the behavior of each Vinh virus can be determined from the remaining 5 viruses, and these same relations can be learned from the Fonville data. Last, we note one superficial difference: the Vinh data span a continuum of values, while the Fonville data take on discrete 2-fold increments, although this feature does not affect our algorithm.

After growing a forest of decision trees to establish the transferability between the Fonville and Vinh datasets (Figure S1), we predicted the 25,000 serum measurements for all 6 Vinh viruses with an average σActual = 3.2-fold error, demonstrating that even a small panel containing 5 viruses can be expanded to predict the behavior of additional strains (Figure 4A, datasetInfect,2).

Notably, 5 of these 6 viruses (which all circulated between 2003 and 2011) had a very low σPredict ≈ σActual ≈ 2- to 3-fold error (Figure S6). The final Vinh virus circulated three decades earlier (in 1968), and its larger prediction error was underestimated (σActual = 9.3-fold, σPredict = 3.8-fold). This highlights a shortcoming of any matrix completion algorithm; namely, that when a dataset contains one exceptionally distinct column (i.e., one virus circulating 30 years before all other viruses), its values will not be accurately predicted. These predictions would have improved had these six viruses been sampled uniformly between 1968 and 2011.

Leave multi out: Designing a minimal virus panel that maximizes the information gained per experiment

Given the accuracy of leave-one-out analysis and that only 5 viruses are needed to expand a dataset, we reasoned that these studies contain a plethora of measurements that could have been inferred by cross-study predictions. Pushing this to the extreme, we combined the Fonville and Vinh datasets and performed leave-multi-out analysis, where multiple viruses were simultaneously withheld and recovered. Future studies seeking to measure any set of viruses, V1–Vn, can use a similar approach to select the minimal virus panel that predicts their full data.

In the present search, we sought the minimum number of viruses needed to recover all Fonville and Vinh measurements with ≤4-fold error; we chose this threshold because it lets us remove dozens of viruses while being much smaller than the 1,000-fold range of the data. A virus was randomly selected from a dataset and added to the withheld list when its values, and those of all other withheld viruses, could be predicted with σPredict ≤ 4-fold (without using σActual to confirm these predictions; STAR Methods). In this way, 133 viruses were concurrently withheld, representing 15%–60% of the virus panels from every dataset or a total of N = 70,000 measurements (Figure 5A).
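A sketch of this greedy withholding loop is shown below; the predict_sigma helper, which stands in for the full Box 1/Box 2 pipeline, and the random ordering of candidates are assumptions made for illustration.

```python
# Sketch of the greedy leave-multi-out withholding loop (illustrative).
import random
import numpy as np

FOLD_THRESHOLD = np.log10(4)            # 4-fold error expressed in log10 units (~0.6)

def greedy_withhold(datasets, predict_sigma, seed=0):
    """datasets: dict name -> DataFrame of log10 titers (rows = sera, columns = viruses).
    predict_sigma(datasets, withheld, name, virus): estimated sigma_predict (log10 units)
    for `virus` in dataset `name` when every (dataset, virus) pair in `withheld` is removed."""
    candidates = [(name, v) for name, D in datasets.items() for v in D.columns]
    random.Random(seed).shuffle(candidates)
    withheld = []
    for name, virus in candidates:
        trial = withheld + [(name, virus)]
        # Keep the candidate only if it and every previously withheld virus stay predictable
        if all(predict_sigma(datasets, trial, d, v) <= FOLD_THRESHOLD for d, v in trial):
            withheld = trial
    return withheld
```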

Figure 5. Simultaneously predicting 133 viruses withheld from multiple datasets

(A) Viruses were concurrently withheld from each dataset (left, gold columns), and their 70,000 values were predicted using the remaining data. We withheld as many viruses as possible while still estimating a low error of σPredict ≤ 4-fold (blinding ourselves to actual measurements), and indeed, the actual prediction error was smaller than 4-fold in every dataset. As in Figure 4, plots and histograms show the collective predictions and error distributions. The plot label enumerates the number of concurrent predictions (and percent of data predicted).

(B) Chord diagram representing the transferability between datasets after withholding the viruses. For each arc connecting datasets XY, transferability is shown near the outer circle of Y, with larger width representing greater transferability (Figures S1 and S4; STAR Methods).

Even with this hefty withheld set, prediction error was only slightly larger than during leave-one-out analysis (σActual between 2.1- and 3.0-fold for the human datasets and σActual = 3.8-fold for the ferret data). This small increase is due to two competing factors. On one hand, prediction is far harder with fewer viruses. At the same time, our approach specifically withheld the most “redundant” viruses that could be accurately estimated (with σPredict ≤ 4-fold). These factors mostly offset one another so that the 70,000 measurements exhibited the desired σActual ≤ 4-fold.

The transferability between datasets, computed without the withheld viruses, was similar to the transferability between the full datasets (Figure 5B). Some connections were lost when there were <5 overlapping viruses between datasets, while other connections were strengthened when the patterns in the remaining data became more similar across studies. Notably, the ferret data now showed some transferability from vaccination datasetsVac,1/2, which resulted in smaller estimated error (σPredict = 2.9-fold) than in our leave-one-out analysis. This emphasizes that transferability depends on the specific viruses and sera examined and that some parts of the Fonville human dataset can better characterize ferret data. While this uncertainty underestimated the true error of the ferret predictions (σActual = 3.8-fold), both types of errors were within the desired 4-fold error threshold. Moreover, in all six human datasets, the estimated uncertainty σPredict closely matched the true error σActual, demonstrating significant potential in predicting virus behavior, especially between datasets of the same type such as human vaccine or infection studies.

Expanding datasets with 2 × 106 new measurements reveals a tradeoff between serum potency and breadth

In the previous section, we combined datasets to predict serum-virus HAI titers, validating our approach on 200,000 existing measurements. Future studies can immediately leverage the Fonville datasets to expedite their efforts. If a new dataset contains at least 5 Fonville viruses (green arrows/boxes in Figure 6A), then HAI values ± errors for the remaining Fonville viruses can be predicted. Viruses with an acceptably low error (purple in Figure 6A) can be added without requiring any additional experiments.

Figure 6. Expanding the Vinh dataset with 75 additional viruses

(A) If a new study contains at least 5 previously characterized viruses (green boxes and arrows), we can predict the behavior of all previously characterized viruses in the new dataset. Those with an acceptable error (e.g., ≤4-fold error boxed in purple) are used to expand the dataset.

(B) Distribution of the estimated uncertainty σPredict when predicting how each Fonville virus inhibits the 25,000 Vinh sera. Most viruses are estimated with ≤4-fold error.

(C) Estimated uncertainty of each virus. The six viruses on the left represent the Vinh virus panel. Colors at the bottom represent the year each virus circulated.

To demonstrate this process, we first focus on the Vinh dataset, where expansion will have the largest impact because the Vinh virus panel is small (6 viruses), but its serum panel is enormous (25,000 sera). By predicting the interactions between these sera and all 81 unique Fonville viruses, we add 2,000,000 new predictions (more than 10× the number of measurements in the original dataset).

For each Fonville virus V0 that was not measured in the Vinh dataset, we grew a forest of decision trees as described above, with the minor modification that the 5 features were restricted to the Vinh viruses to enable expansion. The top trees were combined with the transferability functions (Figure S1) to predict the values ± errors for V0 (Figure S7).

The majority of the added Fonville viruses (67 of 75) had tight predictions of σPredict ≤ 4-fold (Figure 6B). As expected, viruses circulating around the same time as the Vinh panel (1968 or 2003–2011) tended to have the lowest uncertainty, whereas the furthest viruses from the 1990s had the largest uncertainty (Figure 6C). To confirm these estimates, we restricted the Fonville datasets to these same 6 viruses and expanded out, finding that any virus with σPredict ≤ 6-fold prediction error (which applies to nearly all Vinh predictions) had a true error σActual ≤ 6-fold (Figure S8). We similarly expanded the Fonville datasets, adding 175 new virus columns across the six studies (Figure S7; extended datasets provided on GitHub). In addition, dimensionality reduction via uniform manifold approximation and projection (UMAP) recovered a linear trend from the oldest to newest viruses in both the Fonville and Vinh datasets; this trend is especially noteworthy in the latter case because we did not supply the circulation year for the 75 inferred viruses, yet we can discern its impact on the resulting data (Figure S9).

For each Vinh serum, this expansion fills in the 3.5-decade gap between 1968 and 2003 by predicting 47 additional viruses, as well as adding another 28 measurements between 2003 and 2011 (Figure 7A, new interactions highlighted in purple). We also predicted dozens of new viruses in the vaccine studies, and for some sera this increased resolution revealed a more jagged landscape than what was apparent from the direct measurements (Figure 7A). Although HAI titers tend to be similar for viruses circulating around the same time, exceptions do arise (e.g., A/Tasmania/1/1997 vs. A/Perth/5/1997 as well as A/Hanoi/EL201/2009 vs. A/Hanoi/EL134/2008 had >4-fold difference in their predicted titers), and our expanded data reveal these functional differences between variants.

Figure 7. Applications of cross-study predictions

(A) We predict HAI titers for 25,000 sera against the same set of 81 viruses, providing high-resolution landscapes that can be directly compared against each other. Representative responses are shown for datasetInfect,2 (top, serum 5130165 in GitHub), datasetVac,1 (center, subject 525), and datasetVac,3 (bottom, subject A028).

(B) Tradeoff between serum breadth and potency, showing that viruses spaced apart in time are harder to simultaneously inhibit. For every study and each possible set of viruses circulating within Δvirus years of each other, we calculate the highest HAImin (i.e., a serum exists with HAI titers ≥ HAImin against the entire set of viruses).

(C) Top: when a new variant emerges and is measured in a single study, we can predict its titers in all previous studies with ≥5 overlapping viruses. Bottom: example predicting how the newest variant in the newest vaccine dataset is inhibited by sera from a previous vaccine study (datasetsVac,4→Vac,3).

The expanded data also enable a direct comparison of sera across studies, something that is exceedingly difficult with the original measurements given that none of the 81 viruses were in all 7 datasets. Figure 7A shows that an antibody response may be potent against older strains circulating before 2000 but weak against newer variants (bottom), highly specific against strains from 1980–2000 with particular vulnerabilities to viruses from 1976 (center), or relatively uniform against the entire virus panel (top).

We next used the expanded data to probe a fundamental but often unappreciated property of the antibody response; namely, the tradeoff between serum potency and breadth. Given a set of viruses circulating within Δvirus years of each other (the top of Figure 7B shows an example with Δvirus years = 2), how potently can a serum inhibit all of these variants simultaneously? For any set of viruses spanning Δvirus years, we computed HAImin (the minimum titer against this set of viruses) for each serum and plotted the maximum HAImin in each dataset (Figure 7B). (While children born after the earliest circulating strains may have artificially smaller HAImin, every dataset contains adults born before the earliest strain, and we only report the largest potency in each study.) We find that HAImin decreases with Δvirus years, demonstrating that it is harder to simultaneously inhibit more diverse viruses. This same tradeoff was seen for monoclonal antibodies,29,30 and it suggests that efforts geared toward finding extremely broad and potentially universal influenza responses may run into an HAI ceiling.
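A sketch of this potency-breadth calculation on the expanded (log10) titer tables is given below; the function and variable names, and the sliding-window treatment of circulation years, are our own simplifications.

```python
# Sketch of the potency-breadth tradeoff: for each window of viruses circulating within
# delta_years of one another, HAI_min is a serum's weakest titer over that set, and the
# reported potency is the best HAI_min across sera.
import numpy as np
import pandas as pd

def max_haimin(titers, years, delta_years):
    """titers: DataFrame of (expanded) log10 HAI titers; years: virus -> circulation year."""
    yr = pd.Series(years)
    best = -np.inf
    for start in sorted(set(yr)):
        window = yr[(yr >= start) & (yr <= start + delta_years)].index
        complete = titers[window].dropna()        # sera measured/predicted on the whole set
        if complete.empty:
            continue
        hai_min = complete.min(axis=1)            # weakest titer per serum over the set
        best = max(best, hai_min.max())           # most potent serum for this window
    return 10 ** best                             # report in un-logged HAI units
```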

Toward pandemic preparedness

When two studies have high transferability, each serves as a conduit to rapidly propagate information. For example, if a new variant V0 emerges this year, the most pressing question is whether our preexisting immunity will inhibit this new variant or whether it is sufficiently distinct to bypass our antibody response.

Traditionally, antigenic similarity is measured by infecting ferrets with prior circulating strains and measuring their cross-reactivity to the new variant, yet the above analysis (and work by many others31,32) shows that ferret↔human inferences can be poor. Instead, we can rapidly assess the inhibition of V0 in multiple existing human cohorts that measured HAI against viruses V1–V5 by measuring a single additional human cohort against V0–V5 and then predicting V0’s titers in all other studies. As an example, consider the most recent virus strain in the latest vaccine dataset (A/Perth/16/2009 from vaccine study 4, carried out in 2010, around the time this variant emerged). Our framework predicts how all individuals in vaccine study 3 inhibit this variant with σActual = 2.4-fold error (Figure 7C).

Another recent application of pandemic preparedness tested the breadth of an influenza vaccine containing H1N1 A/Michigan/45/2015 by measuring the serum response against one antigenically distinct H1N1 A/Puerto Rico/8/1934 strain.33 Inferring additional virus behavior would provide greater resolution into the coverage and potential holes of an antibody response. As shown in Figure 7A, ≈5 measurements can extrapolate serum HAI against viruses circulating in multiple decades, providing this needed resolution from a small number of interactions.

Matrix completion via nuclear norm minimization poorly predicts behavior across studies

In this final section, we briefly contrast our algorithm against singular value decomposition (SVD)-based approaches, such as nuclear norm minimization (NNM), which are arguably the simplest and best-studied matrix completion methods. With NNM, missing values are filled by minimizing the sum of singular values of the completed dataset.

To compare our results, we reran our leave-multi-out analysis from Figure 5, simultaneously withholding 133 viruses and predicting their values using an established NNM algorithm from Einav and Cleary.7 The resulting predictions were notably worse, with σActual between 3.4- and 5.4-fold.

Because of two often neglected features of NNM, we find that our approach significantly outperforms this traditional route of matrix completion in predicting values for a completely withheld virus column. First, NNM is asymmetrical when predicting large and small values for a withheld virus. Consider a simple noise-free example where one virus’s measurements are proportional to another’s, (virus 2’s values) = m × (virus 1’s values) (Figure S10A shows m = 5). Surprisingly, even if provided with one perfect template for these measurements, NNM incorrectly predicts that (virus 2’s values) = (virus 1’s values) for any m ≥ 1 (Figure S10B). This behavior is exacerbated when multiple datasets are combined, emphasizing that NNM can catastrophically fail for very simple examples (Figures S10C and S10D). This artifact can be alleviated by first row-centering a dataset (subtracting the mean of the log10[titers] for each serum in Figure 2A), as in Box 1.
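As a small illustration of the row-centering step referenced above (a sketch with made-up titers and our own variable names), note that when one virus’s titers are a constant multiple of another’s, row-centering in log10 space turns that relation into a symmetric offset around each serum’s mean, which is the property that removes the asymmetry:

```python
# Row-centering in log10 space, as in Box 1 (made-up titers, illustrative names).
import numpy as np
import pandas as pd

titers = pd.DataFrame({"V1": [40, 160, 640], "V2": [80, 320, 1280]},
                      index=["serum A", "serum B", "serum C"], dtype=float)
log_titers = np.log10(titers)
row_means = log_titers.mean(axis=1)                 # serum-dependent offset
centered = log_titers.sub(row_means, axis=0)        # V1 and V2 now sit at -/+0.15 everywhere
restored = 10 ** centered.add(row_means, axis=0)    # undoing the centering recovers the titers
```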

Even with row-centering, a second artifact of NNM is that large swaths of missing values can skew matrix completion because relationships are incorrectly inferred between these missing values. Intuitively, all iterative NNM algorithms must initialize the missing entries (often either with 0 or the row/column means), so that after initialization, two viruses with very different behaviors may end up appearing identical across their missing values. For example, suppose we want to predict values for virus V0 from dataset X→Y and that “useful” viruses V1–V4 behave similarly to V0 in datasets X and Y. On the other hand, “useless” viruses V5–V8 are either not measured in dataset 2 or are measured against complementary sera; moreover, these viruses show very different behavior from V0 in dataset 1 (Figures S10E and S10F show a concrete example from Fonville). Ideally, matrix completion should ignore V5–V8 (given that they do not match V0 in dataset 2) and only use V1–V4 to infer V0’s values in dataset 1. In practice, NNM using V0–V8 results in poor predictions (Figures S10E and S10F). This behavior is disastrous for large serological datasets, where there can be >50% missing values when datasets are combined.

Our algorithm was constructed to specifically avoid both artifacts. First, we infer each virus’s behavior using a decision tree on row-centered data that does not exhibit the asymmetry discussed above. Second, we restrict our analysis to features that have ≥80% observed measurements to ensure that patterns detected are based on measurements rather than on missing data.

As another point of comparison, consider the leave-one-out predictions of the six Vinh viruses using the Fonville datasets. Whereas our algorithm yields tight predictions across the full range of values (Figure S6), NNM led to a nearly flat response, with all 25,000 sera incorrectly predicted to be the mean of the measurements (see Figure S11 in Einav and Cleary7). In addition, we utilized an existing SVD-based matrix completion method that quantifies the prediction uncertainty for each entry under the assumption that values are randomly missing from a dataset.18 Applying this method to the Fonville datasets resulted in predictions whose actual error was >20-fold larger than the estimated error, emphasizing the need for frameworks that specifically handle structured missing data.34

Discussion

By harnessing the wealth of previously measured antibody-virus interactions, we can catapult future efforts and design experiments that are far larger in size and scope. Here, we developed an algorithm that leverages patterns in HAI data to predict how a virus measured in one study would inhibit sera from another study without requiring any additional experiments. Even when the original studies only had a few overlapping viruses, the expanded datasets can be directly compared using all variants.

While it is understood that sera cross-react, exhibiting similar inhibition against nearly homologous variants, it is unclear whether there are universal relationships that hold across datasets. We introduce the notion of transferability to quantify how accurately local relations within one dataset map onto another dataset (Figure 4B; STAR Methods).35 Transferability is based on the functional responses of viruses, and it does not require side information, such as virus sequence or structure, although future efforts should quantify how incorporating such information reduces prediction error. In particular, incorporating sequence information could strengthen predictions when virus panels have little direct overlap but contain many nearly homologous variants.

It is rarely clear a priori when two datasets can inform one another; will differences in age, geographic location, or infection history between individuals fundamentally change how they inhibit viruses?36,37,38 Transferability directly addresses these questions. Through this lens, we compared the Fonville and Vinh studies, which utilized different assays, had different dynamic ranges, and used markedly different virus panels.20,25 We found surprisingly large transferability between human infection and vaccination studies. For example, vaccine studies from 1997/1998 (datasetVac,1/2) were moderately informed by the Vinh infection study from 2009–2015 (datasetInfect,2), even though none of the Vinh participants had ever been vaccinated (Figure 4B). Conversely, both infection studies we analyzed were well informed by at least one vaccine study (e.g., datasetInfect,1 was most informed by datasetsVac,3/4).

These results demonstrate that diverse cohorts can inform one another. Hence, instead of thinking about each serum sample as being entirely unique, large collections of sera may often exhibit surprisingly similar inhibition profiles. For example, the 1,200 sera in the Fonville datasets predicted the behavior of the 25,000 Vinh sera with ≤2.5-fold error on average, demonstrating that these Vinh sera were at least 20-fold degenerate.25 This corroborates recent work showing that different individuals often target the same epitopes,26 which should limit the number of distinct functional behaviors. As studies continue to measure sera in new locations, their transferability will quantify the level of heterogeneity across the world.

To demonstrate the scope of new antibody-virus interactions that can be inferred using available data, we predicted 2,000,000 new interactions between the Fonville and Vinh sera and their combined 81 H3N2 viruses. Upon stratifying by age, these landscapes can quantify how different exposure histories shape the subsequent antibody response.25,39 Given the growing interest in universal influenza vaccines that inhibit diverse variants, these high-resolution responses can examine the breadth of the antibody response both forwards in time against newly emerging variants and backwards in time to assess how rapidly immunity decays.3,23,40,41 We found that serum potency (the minimum HAI titer against a set of viruses) decreases for more distinct viruses (Figure 7B), as shown for monoclonal antibodies,7,29 suggesting that there is a tug-of-war between antibody potency and breadth. For example, a specific HAI target (e.g., responses with HAI ≥ 80 against multiple variants) may only be possible for viruses spanning 1–2 decades.

Our framework inspires new principles of data acquisition, where future studies can save time and effort by choosing smaller virus panels that are designed to be subsequently expanded (Figure 6A). One powerful approach is to perform experiments in waves. A study measuring serum inhibition against 100 viruses could start by measuring 5 of these viruses that are widely spaced out in time. With these initial measurements, we can compute the values ± errors of the remaining viruses as well as the next 5 maximally informative viruses, whose measurements will further decrease prediction error. Each additional wave of measurements serves as a test for the predictions, and experiments can stop once enough measurements match the predictions.
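One way such a wave-based design could be organized is sketched below; the stopping rule, the "measure the most uncertain viruses next" heuristic, and the predict_with_uncertainty and measure callables are illustrative assumptions rather than the authors' protocol.

```python
# Sketch of a wave-based experimental design (illustrative).
def design_in_waves(all_viruses, seed_panel, predict_with_uncertainty, measure,
                    wave_size=5, tolerance_log10=0.6):
    measured = list(seed_panel)
    data = {v: measure(v) for v in measured}                 # initial experiments
    while True:
        remaining = [v for v in all_viruses if v not in measured]
        preds = predict_with_uncertainty(data, remaining)    # {virus: (mu, sigma) in log10}
        if not remaining or all(sigma <= tolerance_log10 for _, sigma in preds.values()):
            return data, preds                               # predictions are precise enough
        next_wave = sorted(remaining, key=lambda v: preds[v][1], reverse=True)[:wave_size]
        for v in next_wave:                                  # measure the most uncertain viruses
            data[v] = measure(v)
            measured.append(v)
```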

Antibody-virus interactions underpin diverse efforts, from virus surveillance4 to characterizing the composition of antibodies within serum20,30,42,43 to predicting future antibody-virus coevolution.44,45 Although we focused on influenza HAI data, our approach can readily generalize to other inherently low-dimensional datasets, both in and out of immunology. In the context of antibody-virus interactions, this approach not only massively extends current datasets but also provides a level playing field where antibody responses from different studies can be directly compared using the same set of viruses. This shift in perspective expands the scope and utility of each measurement, enabling future studies to always build on top of previous results.

Limitations of the study

For cross-study antibody-virus predictions, there must be partial overlap in either the antibodies or viruses used across datasets. We only investigated cases where the virus panels overlapped, and we found that studies should contain ≥5 viruses (whose data can inform one another’s inhibition) for accurate predictions. For example, pre-pandemic H1N1, post-pandemic H1N1, and H3N2 would all minimally inform one another and should be considered separately (or else both the estimated and actual prediction error will be large). While we mostly investigated influenza HAI data, further work should extend this analysis to other viruses, other assays, and even to non-biological systems. In each context, this framework combines datasets to predict the value ± uncertainty of unmeasured interactions, and it circumvents issues of reproducibility or low-quality data (i.e., garbage in, garbage out) by explicitly computing intra- and inter-study relationships in a data-driven manner.

STAR★Methods

Key resources table

REAGENT or RESOURCE | SOURCE | IDENTIFIER

Deposited data

Fonville influenza datasets | Fonville et al.20 | https://doi.org/10.1126/science.1256427
Vinh influenza dataset | Vinh et al.25 | https://doi.org/10.1038/s41467-021-26948-8

Software and algorithms

Cross-study prediction algorithm | This paper | https://doi.org/10.5281/zenodo.8034507

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Tal Einav (tal.einav@lji.org).

Materials availability

This study did not generate new materials.

Method details

Datasets analyzed

Information about the Fonville and Vinh datasets (type of study, year conducted, and geographic location) is provided in Table S1. The number of sera, viruses, and missing measurements in each dataset is listed below the schematics in Figure 4. Every serum was unique, appearing in a single study. All Fonville viruses appeared in at least two studies (see Figure S7C for the distribution of viruses), enabling us to entirely remove a virus from one dataset and infer its behavior from another dataset.

Although the Vinh data contained H1N1 and H3N2 viruses, we only considered the H3N2 strains since this was the only subtype measured in the Fonville data. 4/6 of the Vinh viruses (H3N2 A/Wyoming/3/2003, A/Wisconsin/67/2005, A/Brisbane/10/2007, and A/Victoria/361/2011) were in the Fonville virus panels. We associated the remaining two viruses with the most similar Fonville strain based on HA sequence (Vinh virus A/Aichi/2/1968↔Fonville virus A/Bilthoven/16190/1968; Vinh virus A/Victoria/210/2009↔Fonville virus A/Hanoi/EL201/2009). While such substitutions may increase prediction error (which can be gauged through leave-one-out analysis), they also vastly increase the number of possible cross-study predictions.

Matrix completion on log10(HAI titers)

The hemagglutination inhibition (HAI) assay quantifies how potently an antibody or serum inhibits the ability of a virus to bind red blood cells. The value (or titer) for each antibody-virus pair corresponds to the maximum dilution at which an antibody inhibits this interaction, so that larger values represent a more potent antibody. This assay is traditionally done using a series of 2-fold dilutions, so that the HAI titers can equal 10, 20, 40…

As in previous studies, all analysis was done on log10(HAI titers) because experimental measurements span orders of magnitude, and taking the logarithm prevents biasing the predictions toward the largest values7 while also accounting for the declining marginal protection from increasing titers.46 Thus, when computing the distribution of errors (histograms in Figures 3, 4, and 5), each of M, μ, and σ are computed in log10. The only exception is that when presenting the numeric values of a prediction or its error, we did so in un-logged units so the value could be readily compared to experiments. To un-log a value, we exponentiate it base 10 (e.g., σPredict,log10 = 0.3 for log10 titers corresponds to an error of σPredict = 10^0.3 ≈ 2-fold, with “fold” indicating an un-logged number). The following sections always refer to M, μ, and σ in log10 units.
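The log10/fold convention can be summarized in a short illustrative Python sketch (the published analysis itself was carried out in Mathematica; the helper names below are ours):

```python
# Illustrative sketch of the log10 <-> fold-error convention.
import numpy as np

def rmse_log10(predicted_titers, measured_titers):
    """Root-mean-squared error computed on log10(HAI titers)."""
    diff = np.log10(predicted_titers) - np.log10(measured_titers)
    return np.sqrt(np.mean(diff ** 2))

def to_fold(sigma_log10):
    """Un-log an error: sigma_log10 = 0.3 corresponds to 10**0.3, roughly 2-fold."""
    return 10 ** sigma_log10

print(round(to_fold(0.3), 2))  # ~2.0
```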

In the Fonville dataset, we replaced lower or upper bounds by their next 2-fold increment (“<10”→5 and “≥1280”→2560). The Vinh dataset did not include any explicit bounded measurements, although their HAI titers were clipped to lie between 10 and 1810, as can be seen by plotting the values of any two viruses across all sera. Hence, the Vinh predictions in Figure 4 (DatasetInfect,2) contain multiple points on the left and right edges of the plot.
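As a minimal sketch of this convention (the parser below is an illustrative helper, not the original code), bounded titers can be mapped to numbers before taking log10:

```python
# Sketch of mapping bounded titers to numbers ("<10" -> 5, ">=1280" -> 2560).
def parse_titer(entry):
    s = str(entry).strip()
    if s.startswith("<"):
        return float(s[1:]) / 2                  # next 2-fold step below the bound
    if s.startswith(">=") or s.startswith("≥"):
        return float(s.lstrip(">=≥")) * 2        # next 2-fold step above the bound
    return float(s)

assert parse_titer("<10") == 5 and parse_titer("≥1280") == 2560
```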

Error of the hemagglutination (HAI) assay

In the Fonville 2014 study, analysis of repeated HAI measurements showed that the inherent error of the assay is log-normally-distributed with standard deviation σInherent ≈ 2-fold. This is shown by Figure S8B in Fonville et al.20 (neglecting the stack of not-determined measurements outside the dynamic range of the assay), where 40% of repeats had the same HAI value, 50% had a 2-fold discrepancy, and 10% had a 4-fold discrepancy.
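One way to see that these repeat discrepancies are consistent with an error of roughly 2-fold is a back-of-the-envelope root-mean-square of the log10 discrepancies; the short calculation below is our illustration, not the original analysis:

```python
# Back-of-the-envelope check: RMS of the reported log10 discrepancies between repeats.
import numpy as np

fractions   = np.array([0.4, 0.5, 0.1])       # same titer, 2-fold off, 4-fold off
discrepancy = np.log10([1.0, 2.0, 4.0])       # discrepancies in log10 units
sigma_log10 = np.sqrt(np.sum(fractions * discrepancy ** 2))
print(10 ** sigma_log10)                      # ~1.9-fold, consistent with ~2-fold
```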

Using decision trees to quantify the relationships between viruses

Decision trees are a simple, easily-interpretable, and well-studied form of machine learning. An advantage of decision trees is that they are fast to train and have out-of-the-box implementations in many programming languages. The predictions from decision trees are made even more robust by averaging over the 5 top trees to create a small random forest, and we use such a “random copse” in this work. Similar approaches averaging across multiple decision trees (as well as variations such as survival decision trees) have been applied in various biological settings including genomics data and cancer.47,48

As described in Box 1, we trained regression trees that take as input the row-centered log10(HAI titers) from viruses V1-V5 to predict another virus V0. These trees can then be applied in another dataset to predict V0 based on the values of V1-V5.

Row-centering means that if we denote the log10(titers) of V0-V5 to be t0-t5 with mean tavg, then the decision tree will take (t1-tavg, t2-tavg, t3-tavg, t4-tavg, t5-tavg) as input to predict t0-tavg. The value tavg (which will be different for each serum) is then added to this prediction to undo the row-centering. If any of the tj are missing (including t0 when we withhold V0’s values), we proceed in the same way but compute tavg as the average of the measured values. Row-centering enables the algorithm to handle systematic differences in data, including changes to the unit of measurement; for example, neutralization measurements in μg/mL or Molar would both be handled the same, since in log10 they are offset from each other by a constant shift that will be subtracted during row-centering. Similarly, if one serum is concentrated by 2x, all of its titers would increase by 2x but the relationships between viruses would remain the same; row-centering subtracts this extra concentration factor and yields the same analysis.
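The following minimal Python sketch illustrates row-centering followed by a regression tree on toy data; the library and variable names are ours (the authors' implementation is in Mathematica), and it is meant only to show the centering/un-centering step:

```python
# Minimal sketch of row-centering plus a regression tree on toy data (illustrative only).
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def row_center(log_titers):
    """Subtract each serum's mean over its measured (non-missing) log10 titers."""
    t_avg = np.nanmean(log_titers, axis=1, keepdims=True)
    return log_titers - t_avg, t_avg

rng = np.random.default_rng(0)
log_titers = rng.normal(2.0, 0.5, size=(50, 6))      # rows = sera, columns = V0..V5
centered, t_avg = row_center(log_titers)

# Train: predict centered V0 from centered V1-V5, then undo the row-centering.
tree = DecisionTreeRegressor(max_depth=3).fit(centered[:, 1:], centered[:, 0])
predicted_v0 = tree.predict(centered[:, 1:]) + t_avg[:, 0]
```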

We chose a random fraction fSamples = 3/10 of sera to train each decision tree when the HAI data were continuous (DatasetInfect,2). For the remaining datasets with discrete measurements, we grouped sera based on the HAI titer of their virus-of-interest V0 (either HAI = 5, 10, 20, 40, 80, 160, or ≥320), picked among these bins with uniform probability, and then randomly chose a serum within that bin. This prevents the uneven HAI distribution from overwhelming the model, since the majority of measurements are HAI = 5 with very few cases of HAI ≥ 320. This form of sampling minimally affected most predictions, but it improved the estimated error for human→ferret predictions (σPredict = 4.2x with this binning, σPredict = 6.4x when sampling sera uniformly without binning), since HAI values in the ferret dataset within the limit of detection are not skewed toward low titers.
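A sketch of this binned sampling is shown below; the bin handling (lumping HAI ≥ 320 into one bin) and function name are illustrative assumptions:

```python
# Sketch of binned serum sampling for discrete HAI data.
import numpy as np

def sample_sera_binned(v0_titers, n_samples, rng=None):
    """Pick an occupied HAI bin uniformly at random, then a serum uniformly within it."""
    rng = rng or np.random.default_rng(0)
    titers = np.minimum(np.asarray(v0_titers, dtype=float), 320)   # lump HAI >= 320
    bin_values = np.unique(titers)                                 # e.g., 5, 10, ..., 320
    picks = []
    for _ in range(n_samples):
        b = bin_values[rng.integers(len(bin_values))]
        members = np.flatnonzero(titers == b)
        picks.append(int(members[rng.integers(len(members))]))
    return picks
```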

When training our decision trees, we allow missing values for V1-V5 but not V0 (as shown by the schematic in Figure 2B), with these missing values replaced by the most likely value (i.e., mode-finding) given the known values in the training set. When applying a trained decision tree to other datasets, we only predicted a value for V0 when none of V1-V5 were missing (otherwise that decision tree was ignored). If all 5 top trees were ignored due to missing values, then no prediction was made for that virus V0 and serum combination.

Predicting the behavior of a new virus

As described in Box 1, the values for V0 predicted from dataset Dj (the Dj→D0 prediction) are based on the top 5 decision trees that predict V0 in Dj with the lowest σTraining. The value of V0 against any serum is given by the average value of the top 5 decision trees, while its error is given by the estimated error σPredict = fDj→D0(σTraining) of these top 5 trees, where fDj→D0 represents the transferability map (described in the next section). Thus, every prediction of V0 in D0 will have the same σPredict, unless some of the top 5 trees cannot cast a vote because their required input titers are missing (in which case the value ± error is computed using the average from the trees that can vote). In practice, the estimated error for V0 in D0 is overwhelmingly the same across all sera, as seen in Figure 3 where the individual error of each measurement is shown via error bars.
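A minimal sketch of this voting scheme is given below. Here `transfer` stands in for the fitted transferability map fDj→D0, and aggregating the five trees' training errors by their mean is our simplification; names and structure are illustrative, not the authors' code.

```python
# Sketch of combining the top 5 trees into a value +/- estimated error.
import numpy as np

def predict_v0(per_tree_features, top_trees, sigma_training, transfer):
    """Average the votes of the top trees; trees with a missing input abstain."""
    votes = [tree.predict(x.reshape(1, -1))[0]
             for tree, x in zip(top_trees, per_tree_features)
             if not np.isnan(x).any()]
    if not votes:
        return None, None                               # no prediction possible
    value = float(np.mean(votes))
    sigma_predict = float(transfer(np.mean(sigma_training)))
    return value, sigma_predict
```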

In Figures 4 and 5, we did not display the small fraction of measurements with HAI titers≥640 to better show the portions of the plots with the most points. However, the estimated error σPredict and true error σActual were computed using all data.

Transferability maps between datasets

Transferability quantifies how the error of a decision tree trained in dataset Dj translates into this tree’s error in dataset D0. Importantly, when predicting the behavior of a virus V0 in D0, we cannot access V0’s values and hence cannot directly compute the actual error of this tree.

To solve this problem, we temporarily ignore V0 and apply Box 1 to predict the titers of viruses measured in both D0 and Dj. Using the values of these viruses from both datasets, we can directly compare their σTraining in Dj against σActual in D0. We did not know a priori what the relationship would be between these two quantities, yet surprisingly, it turned out to be well-characterized by a simple linear relationship fDj→D0 (blue lines in Figure S1; if curves fall below the diagonal line, they are set to y = x since cross-study error should never fall below within-study error). As described in the following paragraph, these relations represent an upper bound, not a best fit, through the (σTraining, σActual) points, so that our estimated error σPredict ≡ fDj→D0(σTraining) ≥ σActual. Therefore, when we estimate a small σPredict we expect σActual to be small; a large σPredict may imply a large σActual, although we may also be pleasantly surprised with a smaller actual error.

Following Box 2, we obtain the best-fit line to these data using perpendicular offsets, which are more appropriate when we expect equal error in the x- and y-coordinates. To account for the scatter about this best-fit line, we add a vertical shift given by the RMSE of the deviations from the best-fit line, thereby ensuring that in highly-variable cases where some trees have small σTraining but large σActual (e.g., DatasetFerret→DatasetVac,1), we tend to overestimate rather than underestimate the error.
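The sketch below illustrates one way to construct such a map: a perpendicular-offset (total least squares) line through the (σTraining, σActual) points, shifted upward by the RMSE of the residuals (computed here as vertical residuals, which is our assumption) and clamped so the map never falls below y = x. All quantities are in log10 units, and the implementation is illustrative rather than the authors' code.

```python
# Sketch of fitting a transferability map via total least squares + RMSE shift + clamp.
import numpy as np

def fit_transferability(sigma_training, sigma_actual):
    x, y = np.asarray(sigma_training), np.asarray(sigma_actual)
    centered = np.column_stack([x - x.mean(), y - y.mean()])
    _, _, vt = np.linalg.svd(centered, full_matrices=False)   # principal axis = TLS line
    slope = vt[0, 1] / vt[0, 0]
    intercept = y.mean() - slope * x.mean()
    shift = np.sqrt(np.mean((y - (slope * x + intercept)) ** 2))  # vertical RMSE
    return lambda s: np.maximum(slope * s + intercept + shift, s)  # never below y = x
```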

To visualize the transferability maps between every pair of datasets, we construct a chord diagram where the arc connecting Dataset X and Y represents a double-sided arrow quantifying both the transferability from X→Y (thickness of the arc at Dataset Y) as well as the transferability from Y→X (thickness of the arc at Dataset X) (Figure S4). The width of each arc is equal to Δθ ≡ (2π/18.5)·(∂fDj→D0/∂σTraining)^(−1), so that the width is proportional to 1/slope of the transferability best-fit line from Figure S1. We used the factor 18.5 in the denominator so that the chord diagrams in Figures 4B and 5B would form nearly complete circles, and if more studies are added this denominator can be modified (increasing it would shrink all the arcs proportionally). Note that the size of the arcs in Figures 4B and 5B can be directly compared to one another, so that if the arc from X→Y is wider in one figure, it implies more transferability between these datasets. A chord connects every pair of studies, unless there were fewer than 5 overlapping viruses between the studies (e.g., between DatasetInfect,2 and DatasetVac,3/4), in which case the transferability could not be computed.

The transferability in Figures 4B and S1 represents all antibody-virus data, which is slightly different from the transferability maps we use when predicting virus V0 in dataset D0. When withholding a virus, we made sure to remove all trees from Figure S1 that use this virus as a feature. Although this can slightly change the best-fit line, in practice the difference is minor. However, when withholding multiple viruses in our leave-multi-out analysis, the number of datapoints in Figure S1 substantially decreased, and to counter this we trained additional decision trees (as described in the following section).

Leave-multi-out analysis

To withhold multiple viruses, we trained many decision trees using different choices of viruses V1-V5 to predict V0 in different datasets. Note that for leave-one-out analysis, we created 50,000 trees which provided ample relationships between the variants. However, when we withheld 133 viruses during the leave-multi-out analysis, we were careful to not only exclude decision trees predicting one of these withheld viruses (as V0), but to also exclude decision trees using any withheld virus in the feature set (in V1-V5). As a result, only 6,000 trees out of our original forest remained, and this smaller number of trees leads to higher σPredict and σActual error. Fortunately, this problem is easily countered by growing additional trees that specifically avoid the withheld viruses. Once these extra trees were grown, we applied Box 1 as before.

To find a minimal virus panel, we randomly chose one of the 317 virus-study pairs from the Fonville/Vinh datasets and added it to the list of withheld viruses, provided that all withheld entries could still be predicted with ≤4-fold error. We note that given a forest of decision trees, it is extremely fast to test whether a set of viruses all have σPredict ≤ 4-fold. However, as described above, as more viruses are withheld, our forest is trimmed, which leads to poorer estimations of σPredict. Hence we worked in stages, alternating between pruning the list of withheld viruses and growing more decision trees. Our procedure to find a minimal virus panel proceeded in three steps (a sketch of the underlying greedy loop follows the steps below).

  • Step 1: Choose Vinh viruses to withhold, and then choose viruses from the Fonville human studies. Because there are only 6 Vinh viruses, and removing any one of them from the Fonville datasets could preclude making any Vinh predictions, we first withheld 2 Vinh viruses. We then started withholding viruses from the Fonville human datasets (DatasetVac,1-4 and DatasetInfect,1) where we had the most decision trees.

  • Step 2: Create an additional random forest for the Fonville ferret dataset (DatasetFerret). This forest only used the non-withheld viruses from the other datasets as features. With this forest, choose additional viruses from the ferret dataset to withhold.

  • Step 3: Create additional random forests for the Fonville human datasets. Use the improved resolution provided by these new forests to determine if any of the previously withheld viruses now have σPredict > 4-fold and remove them. Finally, use the additional high-resolution forests to search for additional Fonville viruses to withhold.
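As referenced above, the core of this procedure is a greedy withholding loop, sketched below in illustrative Python (the staging across Steps 1–3 is omitted). The helper `estimate_sigma_predict(withheld)` is a hypothetical stand-in for re-running Box 1 after removing every tree that predicts, or uses as a feature, a withheld virus.

```python
# Hypothetical sketch of the greedy withholding loop for finding a minimal virus panel.
import numpy as np

def grow_withheld_list(virus_study_pairs, estimate_sigma_predict, fold_cutoff=4.0, rng=None):
    rng = rng or np.random.default_rng(0)
    withheld = []
    for idx in rng.permutation(len(virus_study_pairs)):
        candidate = withheld + [virus_study_pairs[idx]]
        sigmas = np.asarray(estimate_sigma_predict(candidate))   # fold error per withheld entry
        if np.all(sigmas <= fold_cutoff):                        # keep only if all stay <= 4-fold
            withheld = candidate
    return withheld
```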

Extending virus panels

To extend the Fonville and Vinh datasets, we grew another forest of decision trees. This forest differed from our leave-one-out analysis in two key ways: none of the data were withheld, and the feature set when expanding dataset D0 was restricted to only the viruses within D0. For example, to expand the Vinh dataset and predict one of the 81 − 6 = 75 Fonville viruses V0 (excluding the 6 viruses already in the Vinh data), we only searched for relationships between the six Vinh viruses and V0 across the Fonville datasets.

After growing these additional trees, we predicted the behavior of all 81 Fonville viruses against nearly every serum analyzed in the Fonville or Vinh datasets. The exceptions were sera such as those shown in the middle and bottom of DatasetsVac,1/2 (Figure S7); these sera were measured against so few viruses that our random forest contained no trees relating their available measurements to the remaining viruses. The expanded virus panels are available in the GitHub repository associated with this paper.

With the expanded panels, we computed the tradeoff between serum potency and breadth as follows. For every range of Δvirus years, we considered every interval within our dataset (1968–1970, 1969–1971, …, 2009–2011), provided that at least one virus in the panel circulated at the earliest and latest year to ensure that the virus set spanned this full range (e.g., we would not consider the interval 1971–1973 since we had no viruses from 1971 or 1973). For each interval, we took whichever of the 81 viruses circulated during that interval, and for each serum we computed the weakest response (minimum titer) against any virus in this set. Figure 7B plots the largest minimum titer (HAImin) found in each dataset for any interval of Δvirus years, demonstrating that serum potency decreases when inhibiting viruses spanning a broader range of time.
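A sketch of this interval-based calculation is given below, assuming `titers` is a (sera × viruses) array of HAI titers and `virus_years` gives each virus's circulation year; the function name and details are illustrative.

```python
# Sketch of the potency-breadth computation on an expanded panel.
import numpy as np

def largest_min_titer(titers, virus_years, delta_years):
    """Largest minimum titer (HAI_min) over all intervals spanning `delta_years`."""
    years = np.asarray(virus_years)
    best = 0.0
    for start in range(int(years.min()), int(years.max()) - delta_years + 1):
        end = start + delta_years
        in_window = (years >= start) & (years <= end)
        if (start not in years[in_window]) or (end not in years[in_window]):
            continue                                   # panel must span the full interval
        hai_min = titers[:, in_window].min(axis=1)     # weakest response of each serum
        best = max(best, float(hai_min.max()))         # most potent serum for this interval
    return best
```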

Quantification and statistical analysis

Statistical details can be found in the figure captions and the method details section above. Errors (σ) were calculated as the root-mean-squared error of log10(titers), which was then exponentiated (base 10) to un-log the result. All analysis was carried out in Mathematica.

Acknowledgments

We would like to thank Andrew Butler, Ching-Ho Chang, Bernadeta Dadonaite, David Donoho, and Katelyn Gostic for their input on this manuscript as well as the many authors of the manuscripts used to power this analysis. T.E. is a Damon Runyon Fellow supported by the Damon Runyon Cancer Research Foundation (DRQ 01-20). R.M. is supported by Professor David Donoho at Stanford University.

Author contributions

T.E. and R.M. conducted the research and wrote the paper.

Declaration of interests

The authors declare no competing interests.

Inclusion and diversity

We support inclusive, diverse, and equitable conduct of research.

Published: July 25, 2023

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.crmeth.2023.100540.

Supplemental information

Document S1. Figures S1–S10 and Table S1
mmc1.pdf (5.5MB, pdf)
Document S2. Article plus supplemental information
mmc2.pdf (12.6MB, pdf)

Data and code availability

  • Source data statement: This paper analyzes existing, publicly available data. The accession numbers for the datasets are listed in the key resources table.

  • Code statement: All original code has been deposited in GitHub (https://github.com/TalEinav/CrossStudyCompletion) and is publicly available as of the date of publication. The DOI is listed in the key resources table. This repository includes code to perform matrix completion in Mathematica and R, as well as the expanded Fonville and Vinh datasets shown in Figure S7.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

References

  • 1. Petrova V.N., Russell C.A. The evolution of seasonal influenza viruses. Nat. Rev. Microbiol. 2018;16:47–60. doi: 10.1038/nrmicro.2017.118.
  • 2. Kucharski A.J., Lessler J., Cummings D.A.T., Riley S. Timescales of influenza A/H3N2 antibody dynamics. PLoS Biol. 2018;16. doi: 10.1371/journal.pbio.2004974.
  • 3. Yang B., García-Carreras B., Lessler J., Read J.M., Zhu H., Metcalf C.J.E., Hay J.A., Kwok K.O., Shen R., Jiang C.Q., et al. Long term intrinsic cycling in human life course antibody responses to influenza A(H3N2): an observational and modelling study. Elife. 2022;11. doi: 10.7554/eLife.81457.
  • 4. Morris D.H., Gostic K.M., Pompei S., Bedford T., Łuksza M., Neher R.A., Grenfell B.T., Lässig M., McCauley J.W. Predictive modeling of Influenza shows the promise of applied evolutionary biology. Trends Microbiol. 2018;26:102–118. doi: 10.1016/j.tim.2017.09.004.
  • 5. Cai Z., Zhang T., Wan X.F. A computational framework for influenza antigenic cartography. PLoS Comput. Biol. 2010;6. doi: 10.1371/journal.pcbi.1000949.
  • 6. Ndifon W. New methods for analyzing serological data with applications to influenza surveillance. Influenza Other Respir. Viruses. 2011;5:206–212. doi: 10.1111/j.1750-2659.2010.00192.x.
  • 7. Einav T., Cleary B. Extrapolating missing antibody-virus measurements across serological studies. Cell Syst. 2022;13:561–573.e5. doi: 10.1016/j.cels.2022.06.001.
  • 8. Lee J.M., Eguia R., Zost S.J., Choudhary S., Wilson P.C., Bedford T., Stevens-Ayers T., Boeckh M., Hurt A.C., Lakdawala S.S., et al. Mapping person-to-person variation in viral mutations that escape polyclonal serum targeting influenza hemagglutinin. Elife. 2019;8. doi: 10.7554/eLife.49324.
  • 9. Candès E.J., Recht B. Exact matrix completion via convex optimization. Found. Comput. Math. 2009;9:717–772. doi: 10.1007/s10208-009-9045-5.
  • 10. Candes E.J., Tao T. The power of convex relaxation: near-optimal matrix completion. IEEE Trans. Inf. Theory. 2010;56:2053–2080. doi: 10.1109/tit.2010.2044061.
  • 11. Candes E.J., Plan Y. Matrix completion with noise. Proc. IEEE. 2010;98:925–936. doi: 10.1109/jproc.2009.2035722.
  • 12. Keshavan R.H., Montanari A., Oh S. Matrix completion from a few entries. IEEE Trans. Inf. Theory. 2010;56:2980–2998. doi: 10.1109/tit.2010.2046205.
  • 13. Little R.J.A., Rubin D.B. 3rd ed. Wiley; 2019. Statistical Analysis with Missing Data.
  • 14. Cai T., Cai T.T., Zhang A. Structured matrix completion with applications to genomic data integration. J. Am. Stat. Assoc. 2016;111:621–633. doi: 10.1080/01621459.2015.1021005.
  • 15. Xue F., Qu A. Integrating multisource block-wise missing data in model selection. J. Am. Stat. Assoc. 2021;116:1914–1927. doi: 10.1080/01621459.2020.1751176.
  • 16. Xue F., Ma R., Li H. Semi-supervised statistical inference for high-dimensional linear regression with blockwise missing data. Preprint at arXiv. 2021. doi: 10.48550/arxiv.2106.03344.
  • 17. Carpentier A., Klopp O., Löffler M., Nickl R. Adaptive confidence sets for matrix completion. Bernoulli. 2018;24:2429–2460. doi: 10.3150/17-bej933.
  • 18. Chen Y., Fan J., Ma C., Yan Y. Inference and uncertainty quantification for noisy matrix completion. Proc. Natl. Acad. Sci. USA. 2019;116:22931–22937. doi: 10.1073/pnas.1910053116.
  • 19. Kim J.H., Davis W.G., Sambhara S., Jacob J. Strategies to alleviate original antigenic sin responses to Influenza viruses. Proc. Natl. Acad. Sci. USA. 2012;109:13751–13756. doi: 10.1073/pnas.0912458109.
  • 20. Fonville J.M., Wilks S.H., James S.L., Fox A., Ventresca M., Aban M., Xue L., Jones T.C., Le N.M.H., Pham Q.T., et al. Antibody landscapes after influenza virus infection or vaccination. Science. 2014;346:996–1000. doi: 10.1126/science.1256427.
  • 21. Thompson M.G., Naleway A., Fry A.M., Ball S., Spencer S.M., Reynolds S., Bozeman S., Levine M., Katz J.M., Gaglani M. Effects of repeated annual inactivated Influenza vaccination among healthcare personnel on serum hemagglutinin inhibition antibody response to A/Perth/16/2009 (H3N2)-like virus during 2010-11. Vaccine. 2016;34:981–988. doi: 10.1016/J.VACCINE.2015.10.119.
  • 22. Gouma S., Kim K., Weirick M.E., Gumina M.E., Branche A., Topham D.J., Martin E.T., Monto A.S., Cobey S., Hensley S.E. Middle-aged individuals may be in a perpetual state of H3N2 influenza virus susceptibility. Nat. Commun. 2020;11:4566. doi: 10.1038/s41467-020-18465-x.
  • 23. Fox A., Carolan L., Leung V., Vu H., Phuong M., Khvorov A., Auladell M., Tseng Y.-Y., Thai P.Q., Barr I., et al. Opposing effects of prior infection versus prior vaccination on vaccine immunogenicity against Influenza A(H3N2) viruses. Viruses. 2022;14:470. doi: 10.3390/v14030470.
  • 24. Horns F., Vollmers C., Dekker C.L., Quake S.R. Signatures of selection in the human antibody repertoire: selective sweeps, competing subclones, and neutral drift. Proc. Natl. Acad. Sci. USA. 2019;116:1261–1266. doi: 10.1073/pnas.1814213116.
  • 25. Vinh D.N., Nhat N.T.D., de Bruin E., Vy N.H.T., Thao T.T.N., Phuong H.T., Anh P.H., Todd S., Quan T.M., Thanh N.T.L., et al. Age-seroprevalence curves for the multi-strain structure of influenza A virus. Nat. Commun. 2021;12:6680. doi: 10.1038/s41467-021-26948-8.
  • 26. Shrock E.L., Timms R.T., Kula T., Mena E.L., West A.P., Jr., Guo R., Lee I.-H., Cohen A.A., McKay L.G.A., Bi C., et al. Germline-encoded amino acid–binding motifs drive immunodominant public antibody responses. Science. 2023;380. doi: 10.1126/science.adc9498.
  • 27. Lapedes A., Farber R. The geometry of shape space: application to Influenza. J. Theor. Biol. 2001;212:57–69. doi: 10.1006/jtbi.2001.2347.
  • 28. Smith D.J., Lapedes A.S., de Jong J.C., Bestebroer T.M., Rimmelzwaan G.F., Osterhaus A.D.M.E., Fouchier R.A.M. Mapping the antigenic and genetic evolution of influenza virus. Science. 2004;305:371–376. doi: 10.1126/science.1097211.
  • 29. Creanga A., Gillespie R.A., Fisher B.E., Andrews S.F., Lederhofer J., Yap C., Hatch L., Stephens T., Tsybovsky Y., Crank M.C., et al. A comprehensive influenza reporter virus panel for high-throughput deep profiling of neutralizing antibodies. Nat. Commun. 2021;12:1722. doi: 10.1038/s41467-021-21954-2.
  • 30. Einav T., Creanga A., Andrews S.F., McDermott A.B., Kanekiyo M. Harnessing low dimensionality to visualize the antibody–virus landscape for influenza. Nat. Comput. Sci. 2022;3:164–173. doi: 10.1038/s43588-022-00375-1.
  • 31. Hensley S.E. Challenges of selecting seasonal influenza vaccine strains for humans with diverse pre-exposure histories. Curr. Opin. Virol. 2014;8:85–89. doi: 10.1016/j.coviro.2014.07.007.
  • 32. Xie H., Li L., Ye Z., Li X., Plant E.P., Zoueva O., Zhao Y., Jing X., Lin Z., Kawano T., et al. Differential effects of prior Influenza exposures on H3N2 cross-reactivity of human postvaccination sera. Clin. Infect. Dis. 2017;65:259–267. doi: 10.1093/cid/cix269.
  • 33. Arevalo C.P., Bolton M.J., Le Sage V., Ye N., Furey C., Muramatsu H., Alameh M.-G., Pardi N., Drapeau E.M., Parkhouse K., et al. A multivalent nucleoside-modified mRNA vaccine against all known influenza virus subtypes. Science. 2022;378:899–904. doi: 10.1126/science.abm0271.
  • 34. Hartford J., Graham D.R., Leyton-Brown K., Ravanbakhsh S. Deep models of interactions across sets. Preprint at arXiv. 2018. doi: 10.48550/arXiv.1803.02879.
  • 35. Huang R., Xu M., Zhu H., Chen C.Z., Zhu W., Lee E.M., He S., Zhang L., Zhao J., Shamim K., et al. Biological activity-based modeling identifies antiviral leads against SARS-CoV-2. Nat. Biotechnol. 2021;39:747–753. doi: 10.1038/s41587-021-00839-1.
  • 36. Lewnard J.A., Cobey S. Immune history and Influenza vaccine effectiveness. Vaccines. 2018;6:28. doi: 10.3390/vaccines6020028.
  • 37. Henry C., Zheng N.-Y., Huang M., Cabanov A., Rojas K.T., Kaur K., Andrews S.F., Palm A.-K.E., Chen Y.-Q., Li Y., et al. Influenza virus vaccination elicits poorly adapted B cell responses in elderly individuals. Cell Host Microbe. 2019;25:357–366.e6. doi: 10.1016/j.chom.2019.01.002.
  • 38. Dugan H.L., Guthmiller J.J., Arevalo P., Huang M., Chen Y.Q., Neu K.E., Henry C., Zheng N.Y., Lan L.Y.L., Tepora M.E., et al. Preexisting immunity shapes distinct antibody landscapes after influenza virus infection and vaccination in humans. Sci. Transl. Med. 2020;12:3601. doi: 10.1126/scitranslmed.abd3601.
  • 39. Cowling B.J., Lim W.W., Perera R.A.P.M., Fang V.J., Leung G.M., Peiris J.S.M., Tchetgen Tchetgen E.J. Influenza hemagglutination-inhibition antibody titer as a mediator of vaccine-induced protection for Influenza B. Clin. Infect. Dis. 2019;68:1713–1717. doi: 10.1093/cid/ciy759.
  • 40. Carter D.M., Darby C.A., Lefoley B.C., Crevar C.J., Alefantis T., Oomen R., Anderson S.F., Strugnell T., Cortés-Garcia G., Vogel T.U., et al. Design and characterization of a computationally optimized broadly reactive hemagglutinin vaccine for H1N1 Influenza viruses. J. Virol. 2016;90:4720–4734. doi: 10.1128/jvi.03152-15.
  • 41. Boyoglu-Barnum S., Ellis D., Gillespie R.A., Hutchinson G.B., Park Y.-J., Moin S.M., Acton O.J., Ravichandran R., Murphy M., Pettie D., et al. Quadrivalent influenza nanoparticle vaccines induce broad protection. Nature. 2021;592:623–628. doi: 10.1038/s41586-021-03365-x.
  • 42. Georgiev I.S., Doria-Rose N.A., Zhou T., Kwon Y.D., Staupe R.P., Moquin S., Chuang G.Y., Louder M.K., Schmidt S.D., Altae-Tran H.R., et al. Delineating antibody recognition in polyclonal sera from patterns of HIV-1 isolate neutralization. Science. 2013;340:751–756. doi: 10.1126/science.1233989.
  • 43. Lee J., Paparoditis P., Horton A.P., Frühwirth A., McDaniel J.R., Jung J., Boutz D.R., Hussein D.A., Tanno Y., Pappas L., et al. Persistent antibody clonotypes dominate the serum response to Influenza over multiple years and repeated vaccinations. Cell Host Microbe. 2019;25:367–376.e5. doi: 10.1016/j.chom.2019.01.010.
  • 44. Sheng J., Wang S. Coevolutionary transitions emerging from flexible molecular recognition and eco-evolutionary feedback. iScience. 2021;24:102861. doi: 10.1016/j.isci.2021.102861.
  • 45. Marchi J., Lässig M., Walczak A.M., Mora T. Antigenic waves of virus-immune coevolution. Proc. Natl. Acad. Sci. USA. 2021;118. doi: 10.1073/pnas.2103398118.
  • 46. Zhao X., Fang V.J., Ohmit S.E., Monto A.S., Cook A.R., Cowling B.J. Quantifying protection against Influenza virus infection measured by hemagglutination-inhibition assays in vaccine trials. Epidemiology. 2016;27:143–151. doi: 10.1097/ede.0000000000000402.
  • 47. Guo Y., Graber A., McBurney R.N., Balasubramanian R. Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms. BMC Bioinformatics. 2010;11:447. doi: 10.1186/1471-2105-11-447.
  • 48. Bohannan Z.S., Coffman F., Mitrofanova A. Random survival forest model identifies novel biomarkers of event-free survival in high-risk pediatric acute lymphoblastic leukemia. Comput. Struct. Biotechnol. J. 2022;20:583–597. doi: 10.1016/j.csbj.2022.01.003.
