Table 1. Antibody Liability Reference.
Name | Short name/Tag | Severity | Motif | Description | Citations |
---|---|---|---|---|---|
Deamidation (high) | DeAmdH | High | N[GS] in CDRs | Deamidation of Asparagine occurs in the following motifs: NG motif (Asparagine followed by Glycine) and NS motif (Asparagine followed by Serine). Such motifs are known to be associated with deamidation (type of degradation) and can result in reduced "shelf-life". | [7,16] |
Fragmentation (high) | FragH | High | DP in CDRs | Fragmentation occurs as cleavage at the interface between Aspartate and Proline. It is an example of a common motif that is susceptible to hydrolysis in response to pH. | [17] |
Isomerization | Isom | High | D[DGHST] in CDRs | Isomerization of Aspartate occurs in the following motifs: DD (Aspartate followed by aspartate), DG motif (Aspartate followed by Glycine), DH (Aspartate followed by Histidine), DS motif (Aspartate followed by Serine) and DT (Aspartate followed by Threonine). Such motifs are known to be connected to isomerization (type of degradation) and can cause a shorter “shelf-life” of antibodies. | [18] |
Missing Cyst (C) | mCys | High | C not present at 23 or 104 IMGT positions | Missing Cysteine occurs as cysteine absence at IMGT 23 or 104. Certain antibody sequence regions containing unpaired cysteines may result in structural changes, surface charges, or hydrophobicity. | [19] |
Extra Cys (C) | xCys | High | C present at different position then 23 or 104 IMGT positions | Extra Cysteine occurs as cysteine present at a different position than IMGT 23 or 104. Certain antibody sequence regions containing unpaired cysteines can change an antibody’s structure, apparent surface charges, or hydrophobicity. | [20] |
N-linked glycosylation (NXS/T, X not P) | Ngly | High | N[^P][ST] in variable fragment | N-linked glycosylation occurs as an addition of a sugar molecule. Reduced conformational stability and shorter "shelf-life" of antibody products are connected to asparagine linked glycosylation. Incidence of glycosylation in the CDRs can also directly impair antigen recognition and therefore lead to lower efficacy. | [21] |
Deamidation (medium) | DeAmdM | Medum | N[AHNT] in CDRs | Occurs in the following motifs: NA (Aspargine followed by Alanine), NH (Aspargine followed by Histidine), NN (Aspargine followed by Aspargine) and NT (Aspargine followed by Threonine). This type of deamidation is less common in comparison to the NG and NS motifs. | [7,16] |
Hydrolysis | Hydro | Medium | NP in CDRs | Hydrolysis gives rise to the DP motif as a result of the deamidation of Asparagine (N) to Aspartate (D). | [17] |
Fragmentation (medium) | FragM | Medium | TS in CDRs | Occurs as pH-dependent cleavage at the Threonine—Serine interface. | [17] |
Trp (W) oxidation | TrpOx | Medium | W in CDRs | Tryptophan oxidation is one of the Post-translational modifications (PTMs). | [22] |
Met (M) oxidation | MetOx | Medium | M in CDRs | Methionine oxidation occurs in the CDRs. Reduced binding affinity and quicker degradation of the antibody product are linked to oxidation in these particular spots. | [23] |
Deamidation (low) | DeAmdL | Low | [STK]N in CDRs | Occurs in the following motifs: SN (Serine followed by Aspargine), TN (Threonine followed by Aspargine), and KN (Lysine followed by Aspargine). This type of deamidation is less common than others. | [7,16] |
Integrin binding | IntBind | Low | GPR|RGD|RYD|LDV|DGE|KGD|NGR in fragment variable | Motifs for following integrin binding: αVβ3 (RGD|RYD|KGD|NGR), α4β1 (LDV), α2β1 (DGE) CD11c/CD18 (GPR). eight human integrins act as RGD receptors: α5β1, α8β1, αVβ1, αVβ3, αVβ5, αVβ6, αVβ8 and αIIbβ3 |
[24] |
We used antibody sequences from therapeutics, patents, GenBank, literature, and a large paired next-generation (NGS) sequencing dataset. The therapeutics and patents can be thought of as representatives of the clinical spectrum of sequences [25,26]. GenBank and literature are a mix of antibodies developed for scientific/therapeutic purposes [27,28]. The NGS dataset is a sample of the natural diversity [29]. We extracted unique heavy and light chain sequences from each source and we stratified them by detected organisms (human or non-human closest germline). In the case of the NGS and therapeutics datasets, the heavy and light chains were already paired. All other datasets were unpaired with heavy or light chain sequences.