Significance
In addition to binding to a desired target molecule, all antibody drugs must also meet a set of criteria regarding the feasibility of their manufacture, stability in storage, and absence of off-target stickiness. This suite of characteristics is often termed “developability.” We present here a comprehensive analysis of these properties for essentially the full set of antibody drugs that have been tested in phase-2 or -3 clinical trials, or are approved by the FDA. Surprisingly, many of the drugs or candidates in this set exhibit properties that indicate significant developability risks; however, the number of such red warning flags decreases with advancement toward approval. This reference dataset should help prioritize future drug candidates for development.
Keywords: monoclonal antibody, developability, biophysical properties, manufacturability, nonspecificity
Abstract
Antibodies are a highly successful class of biological drugs, with over 50 such molecules approved for therapeutic use and hundreds more currently in clinical development. Improvements in technology for the discovery and optimization of high-potency antibodies have greatly increased the chances for finding binding molecules with desired biological properties; however, achieving drug-like properties at the same time is an additional requirement that is receiving increased attention. In this work, we attempt to quantify the historical limits of acceptability for multiple biophysical metrics of “developability.” Amino acid sequences from 137 antibodies in advanced clinical stages, including 48 approved for therapeutic use, were collected and used to construct isotype-matched IgG1 antibodies, which were then expressed in mammalian cells. The resulting material for each source antibody was evaluated in a dozen biophysical property assays. The distributions of the observed metrics are used to empirically define boundaries of drug-like behavior that can represent practical guidelines for future antibody drug candidates.
Target binding is the predominant first concern in development of any drug. However, once a lead molecule attains the desired potency of biological modification, a suite of characteristics termed “developability” assumes critical importance. For monoclonal antibodies, these properties include high-level expression, high solubility, covalent integrity, conformational and colloidal stability, low polyspecificity, and low immunogenicity. The high cost of failing any of these criteria at a late stage in drug development has led to considerable efforts at predicting developability on the basis of sequence motifs and experimentally determined biophysical properties (1–15).
In a landmark study of small-molecule drugs over 2,000 molecules with United States Adopted Names (USAN) designations and known to have oral availability were collected and computationally analyzed (16). A simple set of thresholds, encapsulated as the “Lipinski rule of fives,” was formulated and has been used by many to prioritize small molecules for entry into clinical development. To date, analogous guiding principles for antibody drugs have not emerged—we therefore endeavor here to do so. By analogy to the Lipinski effort, we first collected the sequences of antibodies that had reached at least phase-2 trials and had USAN or WHO International Nonproprietary Names (INN) designations (137 in total as of the start of this project). As a common basis for comparison of intrinsic variable domain phenotypes we expressed each antibody as the human IgG1 isotype and formulated them in simple Hepes-buffered saline. Each antibody was then subjected to a battery of 12 different biophysical assays in common use for developability assessment.
Unexpectedly, for many of the measures the distribution of values was not symmetrically Gaussian, but instead was long-tailed, with several antibodies exhibiting values significantly higher (more problematic) than the mean. Because all antibodies in this study have been developed past phase 1, this result suggests that many of the currently available metrics suffer from a propensity for falsely predicting development failure. However, stratifying the dataset according to stage of development reveals that the number of such “red flags” (defined below) decreases for those molecules furthest along in the process, with approved antibodies trending toward fewer warnings.
Results
Sequence Characteristics of Antibody Drugs.
To compare antibody variable domain properties within a common context the chosen set of 137 antibodies was expressed as human IgG1 isotype (allele *01) with standard constant regions for kappa and lambda (alleles IGKC*01 and IGLC2*01, respectively) as appropriate. This necessarily means that the exact isotypes, let alone full chain sequences, of many of these antibodies are different from those of the actual drugs—however, comparisons strictly on the basis of variable domain differences are made possible by this study design. Perfectly replicating biological drug substances is notoriously complex and a field unto itself concerned with this task has emerged, known as biosimilar development. Creating over 130 biosimilars is clearly beyond the intended scope of this work. However, by reporting data for published sequences this study will enable others to replicate the results as well as perform their own sequence/function correlation analyses.
The antibodies chosen for this study are listed in Dataset S1; 48 antibodies were built from variable region sequences found in clinically approved antibodies (two of them approved so far only outside the United States), 42 are in the phase-3 or phase-2/3 stage, and the remaining 47 are in phase 2. A total of 124 have kappa light chains, and 13 are lambdas. Fifty-eight are classified as “fully human” (with -UMAB suffix) and 67 as “humanized” (with –ZUMAB suffix), and 12 have at least one “fully” nonhuman variable region (–XIMAB, –XIZUMAB, or –MONAB suffix). The variable region sequences for these antibodies are presented in Dataset S2.
Brief Description of Assays.
The criteria applied in selecting the suite of assays were twofold. First, a peer-reviewed publication exists of it’s being used for therapeutic antibody characterization, or it is a generally used assay (e.g., titers). Second, the assay was amenable to characterizing hundreds of antibodies and consumed <1 mg per assay of material. As such, it is more likely that these or related assays would be used in the early stages of an antibody discovery cascade. Assays performed in this study included assessment of antibody self-interaction by AC-SINS (affinity-capture self-interaction nanoparticle spectroscopy) (3, 9, 11) and CSI-BLI (clone self-interaction by bio-layer interferometry) (7), a variety of metrics of cross-interaction such as binding PSR (poly-specificity reagent) (8), or BVP (baculovirus particle) (4), CIC (cross-interaction chromatography) (1), and classic ELISA with a panel of commonly used antigens (17). Data on expression titer in HEK cells, melting temperature (Tm) of the Fab, hydrophobic interaction chromatography (HIC) and a related assay, SGAC-SINS (salt-gradient affinity-capture self-interaction nanoparticle spectroscopy) were also collected (18), as were data on standup monolayer adsorption chromatography (SMAC) (13). Finally, the percentage of monomeric species assessed by size-exclusion chromatography (SEC) in the context of an accelerated stability (AS) study completes the panel of assays. The experimental measurements obtained for the antibodies used in this study are presented in Dataset S3.
Biophysical Property Distributions Are Long-Tailed.
The distributions of measured values for 12 biophysical measures are presented in Fig. 1. We had assumed a priori that the success of these molecules in overcoming the considerable hurdles of high-level expression, formulation, and acceptable phase-1 clinical trial safety would yield a set of antibodies with uniformly favorable biophysical properties. However, for most of the measures we find this not to be the case, with a small but significant population of molecules exhibiting unusually unfavorable values for one or more measures. An example would be the PSR measurement, high values of which have been shown previously to correlate well with pharmacokinetics (PK) in mice (19, 20). In the earlier work (see, for example, figure 2A of ref. 19), a rough PSR cutoff above which antibodies tended to present faster clearance in mice was observed; when converted to the scaled values used in the present study, the cutoff is around 0.25. It is thus interesting that 33 of the 137 antibodies in the examined set have PSR values above this previously established cutoff. Such non-FcRn-, non-target-mediated systemic clearance is likely due to simple stickiness in disseminated tissues (19, 21, 22). It is therefore surprising that 24% of these antibodies, which have reached at least phase-2 stage, exhibit such high values for this nonspecific stickiness criterion. One possibility is that the actual drug substance exhibits different properties due to its particular expression, purification, formulation, or non-IgG1 isotype (11 of the 29 have IgG2 or IgG4 isotypes). To test this hypothesis, we measured the full biophysical property set of nine vialed monoclonal antibody drugs and found that, for the most part, the numerical values for these measures are not significantly different from those assessed with our preparations (Table S1); the respective PSR values show a linear correlation, r2, of 0.99.
Table S1.
Name | PSR | AC-SINS (Dlmax) | Fab Tm, °C | CIC, min | HIC RT, min | ELISA | BVP | SMAC RT, min | SGAC100 | AS | CSI-BLI, nm | Constant region comments |
Clinical samples | ||||||||||||
Infliximab | 0.00 | 30.04 | 65.0 | 8.70 | 10.20 | 0.89 | 1.21 | 8.84 | 0.00 | 0.07 | 0.05 | IgG1 [CH3] |
Golimumab | 0.23 | 30.04 | 70.0 | 11.16 | 11.65 | 0.98 | 1.81 | 12.06 | 0.00 | 0.00 | 0.00 | IgG1 |
Pembrolizumab | 0.26 | 30.04 | 65.5 | 10.42 | 11.17 | 2.04 | 4.12 | 13.23 | 0.00 | 0.02 | −0.04 | IgG4 |
Vedolizumab | 0.00 | 1.30 | 80.5 | 8.93 | 10.91 | 1.08 | 2.95 | 12.11 | 500.00 | 0.00 | −0.02 | IgG1 [lower hinge] |
Rituximab | 0.39 | 2.03 | 74.0 | 9.99 | 10.82 | 1.11 | 4.12 | 9.08 | 700.00 | 0.02 | 0.00 | IgG1 [CH1] |
Adalimumab | 0.00 | 1.26 | 74.5 | 8.81 | 8.82 | 1.09 | 1.31 | 8.65 | 1000.00 | 0.00 | −0.02 | IgG1 |
Denosumab | 0.01 | 6.71 | 73.0 | 8.63 | 8.37 | 1.14 | 1.13 | 8.67 | 900.00 | 0.00 | −0.03 | IgG2 |
Ipilimumab | 0.21 | 7.50 | 77.0 | 10.81 | 11.57 | 1.00 | 1.91 | 12.11 | 300.00 | 0.00 | −0.01 | IgG1 [CH1] |
Bevacizumab | 0.00 | 1.53 | 73.5 | 9.82 | 11.75 | 1.51 | 4.76 | 11.00 | 600.00 | 0.14 | −0.02 | IgG1 [CH3] |
HEK samples with matching variable region sequences (data from Dataset S3) | Isotype | |||||||||||
Infliximab | 0.00 | 29.65 | 64.5 | 9.01 | 10.36 | 1.04 | 1.37 | 8.89 | 0.00 | 0.18 | 0.05 | IgG1 |
Golimumab | 0.23 | 22.97 | 70.0 | 11.23 | 11.36 | 1.31 | 2.71 | 12.75 | 0.00 | 0.06 | −0.01 | |
Pembrolizumab | 0.30 | 5.62 | 66.0 | 10.92 | 11.07 | 1.28 | 2.23 | 13.48 | 500.00 | 0.03 | 0.01 | |
Vedolizumab | 0.00 | 0.39 | 80.5 | 8.99 | 10.94 | 1.15 | 1.58 | 12.31 | 600.00 | 0.07 | −0.02 | |
Rituximab | 0.38 | 2.13 | 69.0 | 10.15 | 10.80 | 1.19 | 2.93 | 9.11 | 700.00 | 0.03 | −0.01 | |
Adalimumab | 0.00 | 1.06 | 71.0 | 8.87 | 8.82 | 1.08 | 1.49 | 8.67 | 900.00 | 0.05 | −0.01 | |
Denosumab | 0.00 | 5.91 | 69.5 | 8.81 | 8.50 | 5.92 | 17.75 | 8.60 | 1,000.00 | 0.01 | 0.11 | |
Ipilimumab | 0.23 | 10.41 | 73.0 | 10.94 | 11.57 | 1.05 | 1.69 | 12.98 | 400.00 | 0.04 | 0.01 | |
Bevacizumab | 0.00 | 0.79 | 63.5 | 9.82 | 11.77 | 1.29 | 2.78 | 11.07 | 700.00 | 0.22 | −0.02 |
Assay values highlighted in bold are those that exceed the thresholds defined in Table 1. Differences in the constant regions of the heavy chain or the isotype are highlighted in italics in the last column.
The evidence is definitive that some unfavorable values within the ranges observed do not represent an insurmountable obstacle for ultimately successful clinical development of a given molecule. However, it would be difficult to argue that unfavorable values for any of these measures could not present at least some difficulties for successful drug development. Perhaps the flaws indicated by these attributes were overcome through creative and/or exhaustive efforts in cell-line development, cell-culture optimization, and formulation to result in a vialed drug that meets minimum standards for advancement. However, of course the impact of such fixes ends at the moment of parenteral administration, at which point negative impact on drug pharmacokinetics might result in negative clinical trial outcomes [see, for a possible example, the case of briakinumab vs. ustekinumab (20, 23)]. Intravenous administration, which still dominates the approved set, is less demanding of high concentrations and thus may be more tolerant of suboptimal aggregation behavior. Also, target-mediated clearance, a more “intrinsic” feature of the biology of the molecule, may dominate overall PK in patients.
Clustering of Biophysical Property Values.
It is likely that there is considerable redundancy in the information obtainable from the dozen measurements presented here. To examine this possibility, we performed an unsupervised clustering of the data and obtained the clusters shown in Fig. 2A. The Spearman rank order correlation of each pair of measures is presented graphically in Fig. 2B. Based on this analysis we have chosen five such clusters for further examination: (PSR, CSI, AC-SINS, CIC), (SGAC100, SMAC, HIC), (ELISA, BVP), (AS), and (HEK titer, Tm). On first inspection, it may be surprising to find the biological parameter of cell culture expression titer correlating most closely with the thermodynamic state variable of Tm. However, in fact this same correlation has been observed previously for secretion in yeast (24, 25) and mammalian cells (26). It should be noted that the titers obtained from transient HEK expression may not closely reflect productivity in optimized cell lines by stable transfection of typical production host cells such as CHO or murine myeloma lines. The assumption here is that for most cases, nevertheless, HEK transition titers are capturing intrinsic, sequence-dependent properties of an antibody that make it more likely that it will express in high levels. The observed correlation to Tm, consistent with literature precedent, supports the notion that this assumption is reasonable. A cluster is observed among SGAC100 (18), SMAC (13), and HIC, which are assays that measure the tendency of antibodies to either self-associate, or associate with a column matrix, under salt-stress conditions. The multiantigen nonspecificity ELISAs closely correlated with BVP assays that use the same ELISA plate readout—indicating a potential common plate-binding aspect to these assays. With respect to the largest cluster of (PSR, CSI, AC-SINS, CIC), we had a priori expected PSR and CIC to cluster together as measures of cross-interaction, and CSI and AC-SINS to cluster together as self-interaction measures. Because this was not the case, it can be inferred that this group of protein-binding assays measures a common property that differs to some extent from thermodynamic stability (HEK titer, Tm), hydrophobic interaction (SGAC100, SMAC, HIC), long-term aggregation propensity (AS), or ELISA plate binding (ELISA, BVP). It is unexpected that an AS did not quantitatively correlate closely with any of the other four groupings, although the limited dynamic range in the data for these largely well-behaved molecules limits the ability to explore correlation to other assays. Nonetheless, 19/24 of the antibodies with an AS result outside the acceptable range (discussed below) also had at least one out-of-range value within the (PSR, CSI, AC-SINS, CIC) and (SGAC100, SMAC, HIC) sets.
Threshold Warning Flag Approach.
To reconcile these assays’ discovery of purported developability defects with the success of these antibodies’ progression into clinical trials, we reframed the existence of high values for any of the measures in a given cluster simply as a “red flag,” indicating the potential presence of problems that perhaps had been resolved during bioprocess and formulation development. In keeping with the threshold used by Lipinski et al. (16), we chose the cutoff of 10% worst values among approved drugs as a threshold for a warning flag (Table 1). When antibodies at different stages of development were examined for the number of warning flags there was a progressive decrease in warning flags with advancement (Fig. 3A). Approved drugs had the fewest flags, with 65% (31) possessing no flags at all. By contrast, that number dropped to 40% (19) for the antibodies still being tested in phase-2 clinical trials, and nearly 32% (15) had over two warning flags in the developability assays. Examination of the trend in Fig. 3A gives the impression that lower warning flags correspond to higher probability of clinical trial advancement; however, of course phase-2 and -3 trials evaluate therapeutic efficacy rather than the abstract biophysical properties measured in these developability assays. The statistical significance of the results is further detailed in Fig. 3B, where distinct distributions between the approved set and the rest are demonstrated for the (PSR, CSI, ACSINS, CIC) and (ELISA, BVP) groups of assays. It is possible that excessive promiscuous antibody interactions previously shown to correlate with accelerated systemic clearance (4, 19, 21, 22) also limit efficacy by decreasing the amount of drug on target. In fact, an analogous trend has been consistently observed for small-molecule drugs, whereby lipophilicity and molecular weight decrease with clinical advancement (27, 28), a trend also attributed in part to affect relative drug levels on target. The Tm was not considered for the above analysis because the values were close to normally distributed and within an acceptable range for development. Additionally, we also excluded the titer measurements from transient HEK expression, because these measurements may not reflect production in stable mammalian cell lines, as discussed earlier. In addition, the distribution of boundary violation counts using only antibodies known to have been developed as IgG1 (77 of the 137) looks like the distribution obtained with the entire set (Fig. S1), suggesting that isotype differences are not a dominant factor to these results.
Table 1.
Group | Assay | Worst 10% threshold | Units (flag) |
Group 1 | PSR | 0.27 ± 0.06 | None (>) |
ACSINS | 11.8 ± 6.2 | Nanometers (wavelength change) (>) | |
CSI | 0.01 ± 0.02 | BLI response units (>) | |
CIC | 10.1 ± 0.5 | Retention time (min) (>) | |
Group 2 | HIC | 11.7 ± 0.6 | Retention time (min) (>) |
SMAC | 12.8 ± 1.2 | Retention time (min) (>) | |
SGAC-SINS | 370 ± 133 | Salt concentration (mM) (<) | |
Group 3 | BVP | 4.3 ± 2.2 | Fold-over-background (>) |
ELISA | 1.9 ± 1.0 | Fold-over-background (>) | |
Group 4 | AS | 0.08 ± 0.03 | Monomer percentage loss per day (>) |
It has been noted previously that antibodies discovered via phage display of fragments can display unfavorable biophysical characteristics relative to antibodies cloned from immunized mice (26, 29). Examination of the provenance of the antibodies in our dataset is consistent with this observation (Fig. 3C), in that the number of flags for antibodies engineered at some point by phage display are significantly higher than those derived from mammalian sources.
Antibody Clustering by Property Set.
To investigate whether the antibodies would partition into distinct sets based on measurements in the selected assays we performed hierarchical clustering on the dataset. The calculation of pairwise distances between antibodies using the vector of biophysical properties is detailed in Supporting Information. The result of these calculations is shown in Fig. 4, where we have overlaid rectangles corresponding to five observed distinct clusters. The two largest clusters consisted of 80 and 22 antibodies, respectively. The largest cluster exhibited more generally favorable biophysical properties as evidenced by the distribution of red flags shown in Fig. S2. This “clean” cluster includes 34 of the 48 approved antibodies in our dataset, indicating that the general property of exhibiting one or fewer red flags is associated with clinical success. The second-largest cluster was composed predominantly of phase-2 and phase-3 antibodies. The distribution of assay measurements for each cluster is shown in Fig. S3. The clustering was robust to a change in the parameters used for the calculation of pairwise distances (Fig. S4).
Discussion
In earlier publications from our laboratories we had described and emphasized the importance of a subset of high-throughput assays applicable to very large numbers of candidate antibodies (7, 8, 11, 18, 19). It is thus both practical and instructive to see how these assays, in particular, delineate the space of antibody drugs actually in clinical development. A natural application of the main results in this paper would be a prescription, analogous to Lipinski’s rule of fives, to filter or at least rank-order a set of candidates for possible development as therapeutics.
A consequence of the increased pace of development of antibody therapeutics has been the establishment of so-called platform approaches for process development and formulation (30, 31). Particularly favorable developability profiles for a given candidate are more likely to fit within the constraints of given platforms (30). Conversely, and in the best-case scenario, molecules with characteristics outside what is typically favorable are likely to require additional resources for establishment of successful process development and formulation. How the cutoffs defined in this work may relate to the features of specific development platforms is the subject of ongoing research. A closely related point is how establishment of early-stage developability assessment may affect the ultimate success rate of antibodies in the clinic (of course, related to factors distinct from the biomedical hypothesis a given drug is testing). Judging from the literature discussion on developability, one can estimate that these concepts may have begun to become prevalent around 2010 or later (1). Because our set of antibodies have started at least phase-2 clinical trials, it follows that the clear majority, if not all, of them started development before 2010 (for example, the associated patent priority dates on the sequences of the antibodies in this study are all before midyear of 2009).
A key goal of this work was to establish cutoffs for acceptable values of each of the assays performed, derived from the antibody drug space defined here. However, we were surprised to find values asymmetrically far into the undesirable tail of the histogram for many antibodies seemingly advancing successfully through clinical trials. Although we cannot rule out that some of these poor properties may be attributable to isotype differences with the actual drug product, or particulars of expression, purification, and formulation, we found that formulated drug versions of several of these antibodies gave values very similar to those of the preparations we tested (Table S1). Despite these exceptions, it is clear that the great majority of approved antibodies cluster within a priori favorable ranges of parameter space. We have used 90th percentiles in these metrics, using this subset of the data, to define a zone of most favorable properties. This is analogous to the original Lipinski work where reasonable boundaries were likewise drawn (16). Although the choice of a given percentile cutoff value is arbitrary, it is noteworthy that unsupervised hierarchical clustering also reveals natural groupings into “clean” and “less-clean” subsets (Fig. 4). As observed in Fig. 3A, the number of drug boundary violations exhibited by a given IgG tends to decrease with the stage of clinical advance of the corresponding drug (from which variable region sequences were obtained).
Because no single unfavorable assay we examined definitively predicted failure to advance to at least phase-2 clinical trials, a more holistic approach of weighted consideration of multiple biophysical properties is necessary. The practical problem faced by an antibody drug developer can be broken down to two parts: (i) which from among a small set of lead antibody molecules exhibiting similar bioactivity is the one predicted to be most likely to successfully navigate the subsequent development process and (ii) whether any danger signals apparent for that lead molecule are sufficiently menacing to necessitate further protein engineering or formulation efforts before advancing the program. We propose that weighted consideration of about a dozen biophysical assays with complementary and overlapping characteristics, such as those reported here, can be a useful tool for making these decisions.
Materials and Methods
Curated Set of Antibodies.
For antibodies, and biologicals in general, unlike for small molecules, full disclosure of chemical structure (amino acid sequence) did not become a requirement for attaining INN before 2006; thus, many antibody molecules that entered development in the early time frame do not have an “official” source for sequence information. Nonetheless, based on analysis of peer-reviewed and patent literature, a number of sequences for those antibodies were likewise collected. Details of the process including sources for all of the sequence information are included in Supporting Information.
Antibody Expression, Production, and Purification.
The 137 antibodies were expressed in HEK293 cells. The VH and VL encoding gene fragments (Integrated DNA Technologies) were subcloned into heavy- and light-chain pcDNA 3.4+ vectors (ThermoFisher). The variable region sequences are listed in Dataset S2. All mAbs were expressed as IgG1 isotype. The corresponding vectors were cotransfected into HEK293 suspension cells. After 6 d of growth, the cell culture supernatant was harvested by centrifugation and passed over Protein A agarose (MabSelect SuRe; GE Healthcare Life Sciences). The bound antibodies were then washed with PBS and eluted with buffer (200 mM acetic acid/50 mM NaCl, pH 3.5) into 1/8 volume 2 M Hepes, pH 8.0. The final products were buffer-exchanged into 25 mM Hepes and 150 mM sodium chloride, pH 7.3.
Approved Antibody Drugs.
The antibody drugs Remicade, Simponi, Keytruda, Entyvio, Mabthera, Humira, Xgeva, Yervoy, and Avastin (corresponding to infliximab, golimumab, pembrolizumab, vedolizumab, rituximab, adalimumab, denosumab, ipilimumab, and bevacizumab, respectively) were ordered from myoderm.com. All of these clinical samples were diluted down to 1 mg/mL with corresponding assay buffers before analysis. All samples were analyzed by reducing liquid chromatography-MS to confirm sequence match to both light chain and heavy chain after consideration of any differences in the constant regions (Dataset S3).
Biophysical Measurements.
Protocol details for CIC, PSR binding assay, CSI-BLI, AC-SINS, SGAC-SINS, Tm using differential scanning fluorescence (DSF), SMAC, accelerated stability SEC slope, HIC, BVP assay, and ELISA are all provided in Supporting Information.
Statistical Analysis.
All statistical analysis was completed with the assistance of R version 3.2.4. Although many metrics such as Tm and HIC seemed to show normal or log-normal distributions, there were many that did not, precluding the use of Pearson’s correlations to analyze the full dataset. As such, Spearman’s rank correlations were calculated for all pairwise combinations of antibody characteristics. Therefore, one can only conclude correlation between relative rankings on the assays, but not directly between the assays themselves. To assess statistical significance of correlations, an exact P value was calculated for each pairwise comparison. Two-by-two contingency tables were constructed for approval status vs. presence of flags within each group of assays. Fisher’s exact tests for count data were performed on these contingency tables and an exact P value (Fig. 3B) was calculated for the null hypothesis that the true odds ratio is unity. Additionally, an identical analysis was carried out for each assay individually, and the results are summarized in Table S2. To further assess whether the distributions of the assay measurements differed between the approved and nonapproved antibodies, the two-sample Anderson–Darling (33) test was performed. Table S3 lists the P values for rejecting the null hypothesis that the measurements arise from a common underlying distribution. The error bars in Table 1 and Fig. S5 were estimated using bootstrapping as implemented in the package boot.
Table S2.
Assay | P value |
PSR | 0.018 |
ACSINS | 0.045 |
CSI | 0.044 |
CIC | 0.42 |
HIC | 1.0 |
SMAC | 1.0 |
SGAC-SINS | 0.07 |
BVP | 0.004 |
ELISA | 0.017 |
AS | 0.16 |
Tm | 0.32 |
HEK titer | 0.13 |
P values are for testing the hypothesis that the true odds ratio is one for the 2 × 2 table of approved vs. 90% threshold exceeded.
Table S3.
Assay | P value |
PSR | 0.005 |
ACSINS | 0.03 |
CSI | 0.09 |
CIC | 0.34 |
HIC | 0.93 |
SMAC | 0.82 |
SGAC-SINS | 0.76 |
BVP | 0.001 |
ELISA | 0.003 |
AS | 0.03 |
Tm | 0.35 |
HEK titer | 0.38 |
P values are for testing the hypothesis that the two sets have values drawn from the same underlying distribution.
CIC
CIC was performed as described previously (1). In brief, the CIC column was prepared by coupling ∼30 mg of human serum polyclonal antibodies (I4506; Sigma) to a 1-mL HiTrap column (17-0716-01; GE Healthcare), followed by quenching with ethanolamine. Approximately 5 µg of each antibody was tested at a flow rate of 0.1 mL/min using PBS as a mobile phase on an Agilent 1100 series HPLC system.
PSR Binding Assay
The PSR assay was done as previously described (8). In short, soluble membrane proteins were prepared from CHO cells. The enriched membrane fraction was biotinylated using NHS-LC-Biotin (Pierce, 21336; Thermo Fisher). This polyspecificity reagent was incubated with IgG-presenting yeast, followed by washing. Then secondary labeling mix (Extravidin-R-PE, anti-human LC-FITC, and propidium iodide) was added to the mixture. Samples were analyzed on FACSCanto (BD Biosciences) using an HTS sample injector. Flow cytometry data were analyzed for median fluorescence intensity (MFI) in the R-PE channel to assess nonspecific binding. MFI values were normalized from 0 to 1 based on three reference antibodies exhibiting low, medium, and high PSR MFI values.
CSI-BLI
The CSI-BLI assay was carried out as previous described (7). Briefly, human IgG was loaded to an AHQ biosensor (ForteBio) to ∼1 nm, followed by sensor blocking with human IgG1 Fc. The self-association was performed at 1 µM solution concentration of IgG for 300 s on an Octet HTX system (ForteBio). The binding response from the association step was subtracted from that of a reference IgG (adalimumab).
AC-SINS
The AC-SINS assay was performed as described previously (3, 11). In short, gold nanoparticles (15705; Ted Pella Inc.) were coated with 80% capturing anti-human goat IgG Fc (109-005-098; Jackson ImmunoResearch) and 20% with polyclonal goat nonspecific antibody (005-000-003; Jackson ImmunoResearch). The antibodies of interest were then incubated with the particles for 2 h and the wavelength shift was measured using Molecular Devices SpectraMax M2 with SoftMax Pro6 software. The self-interacting clones show a higher wavelength shift away from the PBS sample.
SGAC-SINS
The SGAC-SINS assay was performed as described previously (18). In short, gold nanoparticles (15705; Ted Pella Inc.) were coated with 80% capturing anti-human goat IgG Fc (109-005-098; Jackson ImmunoResearch) and 20% with polyclonal goat nonspecific antibody (005-000-003; Jackson ImmunoResearch). The antibodies of interest were then incubated with the particles for 30 min. The samples were then diluted with varying ranges of ammonium sulfate (300–1,000 mM in 100-mM steps) and incubated for an additional 1.5 h. The wavelength shift was measured using Molecular Devices SpectraMax M2 with SoftMax Pro6 software. The SGAC100 was obtained by graphing the wavelength shifts of a sample against the ammonium sulfate concentration and extrapolating the concentration at which the shift was 560 nm. For antibodies where the shift was below 560 nm at the highest salt concentration a value of 1,000 mM was assigned.
Tm Using DSF
The Tm was determined using a CFX96 Real-Time System from BioRad, based on the protocol described earlier (32). Briefly, 20 µL of 1 mg/mL sample was mixed with 10 µL of 20× SYPRO orange. The plate was scanned from 40 °C to 95 °C at a rate of 0.5 °C/2 min. The Fab Tm was assigned using the first derivative of the raw data from the BioRad analysis software.
SMAC
The SMAC assay was performed as described previously (13). Briefly, 2 µg of samples were injected into a Zenix SEC-300 column (213300-4630; Sepax Technologies). A flow rate of 0.35 mL/min with the running buffer containing 150 mM sodium phosphate at pH 7.0 was used. Retention time for each sample was assigned based on the major peak.
AS SEC Slope
Samples were kept 1 mg/mL at 40 °C for 30 d in HBS (25 mM Hepes and 150 mM sodium chloride, pH 7.3). Time points were taken at day 0, 5, 20, and 30, and the samples were then analyzed by SEC (0022855; Tosoh Bioscience). For SEC analysis, the running buffer composition was 200 mM sodium phosphate and 250 mM sodium chloride, pH 7.0. A long-term stability slope was calculated from the percent aggregated, measured on the SEC.
HIC
The methodology for this assay was described previously (18). In brief, 5 μg IgG samples (1 mg/mL) were spiked in with a mobile phase A solution (1.8 M ammonium sulfate and 0.1 M sodium phosphate at pH 6.5) to achieve a final ammonium sulfate concentration of about 1 M before analysis. A Sepax Proteomix HIC butyl-NP5 column was used with a liner gradient of mobile phase A and mobile phase B solution (0.1 M sodium phosphate, pH 6.5) over 20 min at a flow rate of 1 mL/min with UV absorbance monitoring at 280 nm.
BVP Assay
The method used was adopted from Hötzel et al. (4). Briefly, 50 µL baculovirus particles (BlueSky Biotech) stock was diluted with equal volume of 50 mM sodium carbonate (pH 9.6) per well and incubated on ELISA plates (3369; Corning) at 4 °C overnight. The next day, unbound BVPs were aspirated from the wells. All remaining steps were performed at room temperature. One hundred microliters of blocking buffer (PBS with 0.5% BSA) was added and let incubate for 1 h before three washes with 100 µL of PBS. Next, 50 µL of 1 µM testing antibodies in blocking buffer was added to the wells and incubated for 1 h followed by six washes with 100 µL of PBS. Fifty microliters of diluted anti-human IgG-HRP conjugate (81-7120; ZyMax) was added to the wells and incubated for 1 h followed by six washes as before. Finally, 50 µL of TMB substrate (34021; Fisher Scientific) was added to each well and incubated for 10–15 min. The reactions were stopped by adding 50 µL of 2 M sulfuric acid to each well. The absorbance was read at 450 nm and BVP score determined by normalizing absorbance by control wells with no test antibody.
ELISA
The ELISA protocol we used followed previously established procedures (17). In brief, six different antigens, cardiolipin (50 μg/mL, C0563; Sigma), KLH (5 μg/mL, H8283; Sigma), LPS (10 μg/mL, tlrl-eblps; InvivoGen), ssDNA (1 μg/mL, D8899; Sigma), dsDNA (1 μg/mL, D4522; Sigma), and insulin (5 μg/mL, I9278; Sigma), were coated onto ELISA plates (3369; Corning) individually at 50 μL per well overnight at 4 °C. Plates were blocked with PBS with 0.5% BSA at room temperature (RT) for 1 h, followed by three washes with PBST (PBS plus 0.1% Tween 20). Fifty microliters of 100 nM testing antibody solution was added to each well and incubated at RT for 1 h. The rest of the steps followed the same procedures as in the BVP assay described above.
Antibody Clustering by Property Set.
The clustering of antibodies by the properties measured in the assays requires the calculation of a distance between each pair of antibodies. Because the measurements in each assay are on different scales, we used the following scaling procedure for each assay:
[S1] |
where is the scaled value, is the measurement in the assay, and , are the median, 80th percentile, and 20th percentile over the 137 antibodies, respectively. Additionally, because the assays can exhibit long-tailed distributions that can skew the pairwise distance computation, we instituted upper and lower bounds for at ±2 scaled units.
Following the calculation of the scaled values, the pairwise distances were calculated such that each group of assays, (PSR, CSI, AC-SINS, CIC), (SGAC100, SMAC, HIC), (ELISA, BVP), and (AS), contributed a single value to the final distance. For example, in the case of the (PSR, CSI, AC-SINS, CIC) assay group,
The final pairwise distance was calculated as
The pairwise distance matrix calculated in the above manner was used as an input to hierarchical clustering with an average linkage criterion used for merging clusters. The function hclust in package:stats in R 3.2.4 was used to perform the clustering.
To assess the robustness of the clustering procedure we varied the percentile cutoffs used in the denominator in Eq. S1 to and and increased the bounds for to ±3 scaled units. On repeating the clustering with these changes we did not observe materially different clusters emerge compared with the original procedure. The largest cluster contained a majority of the Approved antibodies and the second-largest cluster was dominated by phase-2 and phase-3 antibodies. The results are shown in Fig. S4.
Further Details on the BVP Assay.
The BVP score reported here is slightly different from the original method (4). In the earlier work, the BVP score was calculated as the ratio of signal from testing antibody binding to ELISA plate with vs. without BVP. This way, a higher BVP score is only connected to antibodies binding to the BVP, not to the ELISA plate or blocking material, such as BSA. We used a different approach to define background, measuring the signal when the plate was coated with BVP, but no testing antibody was added. This way, molecules that exhibit binding to the ELISA plate or BSA would generate a measurable signal. We considered nontarget binding behavior an undesirable property for the drug candidate. However, this could represent an overstringent criterion because the ELISA plate may represent an artificial surface that an antibody drug may never encounter during its “life cycle.”
Supplementary Material
Acknowledgments
We thank other Adimab LLC staff members for their many contributions.
Footnotes
Conflict of interest statement: All of the authors are employed by Adimab, LLC, whose business is the discovery of antibody drugs.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1616408114/-/DCSupplemental.
References
- 1.Jacobs SA, Wu SJ, Feng Y, Bethea D, O’Neil KT. Cross-interaction chromatography: A rapid method to identify highly soluble monoclonal antibody candidates. Pharm Res. 2010;27(1):65–71. doi: 10.1007/s11095-009-0007-z. [DOI] [PubMed] [Google Scholar]
- 2.Nishi H, et al. Fc domain mediated self-association of an IgG1 monoclonal antibody under a low ionic strength condition. J Biosci Bioeng. 2011;112(4):326–332. doi: 10.1016/j.jbiosc.2011.06.017. [DOI] [PubMed] [Google Scholar]
- 3.Sule SV, et al. High-throughput analysis of concentration-dependent antibody self-association. Biophys J. 2011;101(7):1749–1757. doi: 10.1016/j.bpj.2011.08.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hötzel I, et al. A strategy for risk mitigation of antibodies with fast clearance. MAbs. 2012;4(6):753–760. doi: 10.4161/mabs.22189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lauer TM, et al. Developability index: A rapid in silico tool for the screening of antibody aggregation propensity. J Pharm Sci. 2012;101(1):102–115. doi: 10.1002/jps.22758. [DOI] [PubMed] [Google Scholar]
- 6.Sule SV, Dickinson CD, Lu J, Chow CK, Tessier PM. Rapid analysis of antibody self-association in complex mixtures using immunogold conjugates. Mol Pharm. 2013;10(4):1322–1331. doi: 10.1021/mp300524x. [DOI] [PubMed] [Google Scholar]
- 7.Sun T, et al. High throughput detection of antibody self-interaction by bio-layer interferometry. MAbs. 2013;5(6):838–841. doi: 10.4161/mabs.26186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Xu Y, et al. Addressing polyspecificity of antibodies selected from an in vitro yeast presentation system: A FACS-based, high-throughput selection and analytical tool. Protein Eng Des Sel. 2013;26(10):663–670. doi: 10.1093/protein/gzt047. [DOI] [PubMed] [Google Scholar]
- 9.Geng SB, Cheung JK, Narasimhan C, Shameem M, Tessier PM. Improving monoclonal antibody selection and engineering using measurements of colloidal protein interactions. J Pharm Sci. 2014;103(11):3356–3363. doi: 10.1002/jps.24130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Jayaraman J, et al. Plasmonic measurements of monoclonal antibody self-association using self-interaction nanoparticle spectroscopy. Biotechnol Bioeng. 2014;111(8):1513–1520. doi: 10.1002/bit.25221. [DOI] [PubMed] [Google Scholar]
- 11.Liu Y, et al. High-throughput screening for developability during early-stage antibody discovery using self-interaction nanoparticle spectroscopy. MAbs. 2014;6(2):483–492. doi: 10.4161/mabs.27431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jarasch A, et al. Developability assessment during the selection of novel therapeutic antibodies. J Pharm Sci. 2015;104(6):1885–1898. doi: 10.1002/jps.24430. [DOI] [PubMed] [Google Scholar]
- 13.Kohli N, et al. A novel screening method to assess developability of antibody-like molecules. MAbs. 2015;7(4):752–758. doi: 10.1080/19420862.2015.1048410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Obrezanova O, et al. Aggregation risk prediction for antibodies and its application to biotherapeutic development. MAbs. 2015;7(2):352–363. doi: 10.1080/19420862.2015.1007828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sharma VK, et al. In silico selection of therapeutic antibodies for development: Viscosity, clearance, and chemical stability. Proc Natl Acad Sci USA. 2014;111(52):18601–18606. doi: 10.1073/pnas.1421779112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lipinski CA, Lombardo F, Dominy BW, Feeney PJ. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev. 2001;46(1-3):3–26. doi: 10.1016/s0169-409x(00)00129-0. [DOI] [PubMed] [Google Scholar]
- 17.Mouquet H, et al. Polyreactivity increases the apparent affinity of anti-HIV antibodies by heteroligation. Nature. 2010;467(7315):591–595. doi: 10.1038/nature09385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Estep P, et al. An alternative assay to hydrophobic interaction chromatography for high-throughput characterization of monoclonal antibodies. MAbs. 2015;7(3):553–561. doi: 10.1080/19420862.2015.1016694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kelly RL, et al. High throughput cross-interaction measures for human IgG1 antibodies correlate with clearance rates in mice. MAbs. 2015;7(4):770–777. doi: 10.1080/19420862.2015.1043503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kelly RL, et al. Target-independent variable region mediated effects on antibody clearance can be FcRn independent. MAbs. 2016;8(7):1269–1275. doi: 10.1080/19420862.2016.1208330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Datta-Mannan A, et al. The interplay of non-specific binding, target-mediated clearance and FcRn interactions on the pharmacokinetics of humanized antibodies. MAbs. 2015;7(6):1084–1093. doi: 10.1080/19420862.2015.1075109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sigounas G, Harindranath N, Donadel G, Notkins AL. Half-life of polyreactive antibodies. J Clin Immunol. 1994;14(2):134–140. doi: 10.1007/BF01541346. [DOI] [PubMed] [Google Scholar]
- 23.Schoch A, et al. Charge-mediated influence of the antibody variable domain on FcRn-dependent pharmacokinetics. Proc Natl Acad Sci USA. 2015;112(19):5997–6002. doi: 10.1073/pnas.1408766112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kowalski JM, Parekh RN, Mao J, Wittrup KD. Protein folding stability can determine the efficiency of escape from endoplasmic reticulum quality control. J Biol Chem. 1998;273(31):19453–19458. doi: 10.1074/jbc.273.31.19453. [DOI] [PubMed] [Google Scholar]
- 25.Kowalski JM, Parekh RN, Wittrup KD. Secretion efficiency in Saccharomyces cerevisiae of bovine pancreatic trypsin inhibitor mutants lacking disulfide bonds is correlated with thermodynamic stability. Biochemistry. 1998;37(5):1264–1273. doi: 10.1021/bi9722397. [DOI] [PubMed] [Google Scholar]
- 26.Wang F, et al. Somatic hypermutation maintains antibody thermodynamic stability during affinity maturation. Proc Natl Acad Sci USA. 2013;110(11):4261–4266. doi: 10.1073/pnas.1301810110. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 27.Leeson PD, Springthorpe B. The influence of drug-like concepts on decision-making in medicinal chemistry. Nat Rev Drug Discov. 2007;6(11):881–890. doi: 10.1038/nrd2445. [DOI] [PubMed] [Google Scholar]
- 28.Waring MJ, et al. An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat Rev Drug Discov. 2015;14(7):475–486. doi: 10.1038/nrd4609. [DOI] [PubMed] [Google Scholar]
- 29.Spencer S, Bethea D, Raju TS, Giles-Komar J, Feng Y. Solubility evaluation of murine hybridoma antibodies. MAbs. 2012;4(3):319–325. doi: 10.4161/mabs.19869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hartmann S, Kocher HP. Best practices in assessment of developability of biopharmaceutical candidates. In: Kumar S, Singh SK, editors. Developability of Biotherapeutics: Computational Approaches. CRC; Boca Raton, FL: 2016. pp. 155–174. [Google Scholar]
- 31.Warne NW. Development of high concentration protein biopharmaceuticals: The use of platform approaches in formulation development. Eur J Pharm Biopharm. 2011;78(2):208–212. doi: 10.1016/j.ejpb.2011.03.004. [DOI] [PubMed] [Google Scholar]
- 32.He F, Woods CE, Becker GW, Narhi LO, Razinkov VI. High-throughput assessment of thermal and colloidal stability parameters for monoclonal antibody formulations. J Pharm Sci. 2011;100(12):5126–5141. doi: 10.1002/jps.22712. [DOI] [PubMed] [Google Scholar]
- 33.Anderson TW, Darling DA. Asymptotic theory of certain “goodness of fit” criteria based on stochastic processes. Ann Math Stat. 1952;23(2):193–212. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.