Skip to main content
Wolters Kluwer - PMC COVID-19 Collection logoLink to Wolters Kluwer - PMC COVID-19 Collection
. 2023 Feb 20;46(3):75–88. doi: 10.1097/CJI.0000000000000455

COVID-19: Attacks Immune Cells and Interferences With Antigen Presentation Through MHC-Like Decoy System

Wenzhong Liu *,†,, Hualan Li
PMCID: PMC9987643  PMID: 36799912

Abstract

The high mortality of coronavirus disease 2019 is related to poor antigen presentation and lymphopenia. Cytomegalovirus and the herpes family encode a series of major histocompatibility complex (MHC)-like molecules required for targeted immune responses to achieve immune escape. In this present study, domain search results showed that many proteins of the severe acute respiratory syndrome coronavirus 2 virus had MHC-like domains, which were similar to decoys for the human immune system. MHC-like structures could bind to MHC receptors of immune cells (such as CD4+ T-cell, CD8+ T-cell, and natural killer-cell), interfering with antigen presentation. Then the oxygen free radicals generated by E protein destroyed immune cells after MHC-like of S protein could bind to them. Mutations in the MHC-like region of the viral proteins such as S promoted weaker immune resistance and more robust transmission. S 127–194 were the primary reason for the robust transmission of delta variants. The S 144–162 regulated the formation of S trimer. The mutations of RdRP: G671S and N: D63G of delta variant caused high viral load. S 62–80 of alpha, beta, lambda variants were the important factor for fast-spreading. S 616–676 and 1014–1114 were causes of high mortality for gamma variants infections. These sites were in the MHC-like structure regions.

Key Words: CD4+ T-cell, CD8+ T-cell, NK-cell, lymphopenia, delta variant, neutralizing antibody, N-terminal supersite

BACKGROUND

The lymphopenia of coronavirus disease 2019 (COVID-19) patients includes CD4+ T cells, CD8+ T cells, B cells, and natural killer (NK) cells, with the damage of CD8+ T cells being more significant.1,2 No obvious virus infection is detected in lymphocytes and mesenchymal cell.3 Lymphopenia at the initial appearance of COVID-19 is associated with poor prognosis.4 Lymphopenia and its severity are reliable predictors of the clinical outcome of COVID-19, including mortality, intensive care needs, and oxygen requirements.4 Besides, the high fatality rate of COVID-19 is related to the poor performance of major histocompatibility complex (MHC) II and the low coverage of MHC II.5 The quality of MHC II presented by T cells is an essential prerequisite for T-cell-dependent antibody production. The binding capacity of MHC I epitope load and severe acute respiratory syndrome coronavirus 2 (SARS-COV-2) peptide affects the immunity of T cells to infection.6 Therefore, MHC presentation is closely related to lymphopenia of COVID-19.

Lymphopenia, cell degeneration, necrosis, and atrophy are found in SARS and COVID-19 patients.7 The sort of lymphopenia and apoptosis between SARs and COVID-19 patients appears different. Namely, lymphopenia in SARs patients precedes apoptosis, while apoptosis in COVID-19 patients precedes lymphopenia.8 The frequency and activation of SARS-COV-2 specific CD8+ T cells increase during severe illness, highlighting differences in T-cell responses associated with disease progression.9 The number of regulatory T cells (Treg) has nothing to do with the severity of the disease, suggesting that T-cell exhaustion occurs in a process independent of Treg.10 SARS-COV-2 generates reactive oxygen species (ROS) through the combination of E protein and heme,11 in which hydroxyl free radicals can directly destroy the cell membrane and cause damage to immune cells. Immune cells would be directly attacked along the route of antigen recognition and antigen presentation. So, CD4+ T cells, CD8+ T cells, and NK cells are most likely to be destroy and apoptosis in this link of binding to MHC molecules.

Mononuclear macrophages ingest antigens and process them into antigen peptides. Then antigen peptides are combined with surface MHC molecules and are expressed on the cell surface, effectively presenting antigens to helper T lymphocytes. B lymphocytes also have a similar antigen presentation effect. T cells combine with MHC II/antigen to activate B cells. While the BCR of the memory B-cell binds to a specific antigen, the antigen is endocytosed by the B-cell. After these antigens are cut into fragments, they return to the cell membrane in a state combined with MHC molecules.12 T cells express CD4 or CD8 co-receptors. They recognize non-polymorphic regions of MHC protein on target cells and can bind to partial MHC protein regions.13 Helper T cells express CD4 and recognize MHC class II proteins, while cytotoxic T cells express CD8 and recognize MHC class I proteins.13 NK cells express inhibitory receptors (KIR) of MHC class I molecules. These inhibitory receptors include the human KIR [killer cell immunoglobulin (Ig)-like receptor].14 Another function of NK cells is recognizing and eliminating cells that cannot express their MHC class I molecules.15 It is interesting to note that, individuals with specific MHC alleles are less susceptible to severe forms of malaria.13 It means that the combination of immune cells and MHC is closely related to the susceptibility of certain diseases.

The MHC class II transactivator CIITA induces cell resistance to the Ebola virus and SARS-like coronavirus.16 However, the apparent CD4 conserved residues at the receptor-binding domain (RBD)-S1 site of SARS-COV-2 interrupt the CD4-MHC-II interaction for adaptive immune activation.17 The immunity of CD8+ T cells to SARS-COV-2 is related to the severity of COVID-19 and virus control. SARS-COV-2 evades CD8+ T-cell surveillance by mutation of the MHC I restricted epitope of CD8+ T cells.18 Mutant peptides exhibit reduced or abolished MHC I binding, which is related to the loss of recognition and functional response of CD8+ T cells isolated from human leukocyte antigen (HLA)-matched COVID-19 patients. However, the proportion of interferon-γ-producing cells in SARS-COV-2 specific CD8+ T cells expressing PD-1 is higher than that of PD-1 cells in multimer+ cells. The SARS-COV-2 specific CD8+ T cells expressing PD-1 are not exhausted and function normally.19 It meant SARS-COV-2 had evolved an MHC-like structure that could bind to the MHC receptor of immune cells. Immune cells that could not attach to the MHC-like form of the virus had survived.

Some viruses have acquired inhibitors that target the MHC class I antigen presentation pathway.20 The cytomegalovirus (CMV) and herpes family encode a series of key molecules required for a targeted immune response.21 All aspects of acute and chronic CMV disease may be controlled by antibodies, NK, and other cells of the innate immune system, as well as CD8+ T and CD4+ T-cell.22 About half of the identified genes in CMV23 and beta herpes virus24 have HCMV homologs.22 The m14425 and m145 family of CMV (m17, m145–m158),22 m157,26 UL3727 are all MHC I-like molecules. The Ly49H NK-cell activation receptor recognizes m157.28 Ly49 receptor binds m157 glycoprotein encoded by mouse CMV (MCMV).21 Human CMV(HCMV) UL18 binds inhibitory leukocyte immunoglobulin-like receptor R-1.29 Human CMV express and distribute a complete library of immune evasion factors for a single MHC class I target.30 Human CMV encodes glycoproteins homologous to MHC class I.31 The MHC class I homologs encoded by human CMV binds to endogenous peptides.32

Many viruses have evolved surprising strategies to interfere with the MHC class I antigen presentation pathway.33 After the initial NK response,34 the host will produce adaptive CD8+ T35 and CD4+ T36 cellular responses. Viral MHC class I molecules allow evasion of NK-cell effector responses in the body26 and contribute to immune evasion.22 Many studies have shown that MHC class I virus proteins interfere with infected cells recognizing, antigen processing, and presentation.22 The specific recognition of MHC by inhibitory KIR provides excellent protection against a decoy molecule of virus evolution.37 The diversity of the receptor system may be the result of this specific interaction between MHC and KIR molecules. However, NK cells in severely ill patients with COVID-19 are severely depleted. The protective function of inhibitory KIR shows signs of failure. It shows that some regions of human MHC have an irreplaceable role. The MHC-like structures of the virus were precisely in these areas. If a mutation site was in the MHC-like domain, the mutation enhanced the MHC decoy function. In other words, the human immune system hard to neutralize these MHC-like sites by producing antibodies. Otherwise, the antibodies could also bind to MHC proteins. Then the antibodies would affect normal MHC antigen presentation function, causing autoimmune diseases.

The N-terminal domain (NTD) of the S protein and the S2 membrane fusion region may be MHC-like structural sites for the challenging battle between the immune system and the virus. Most antibodies that recognize the SARS-CoV-2 S protein are directed against the RBD.38 Analysis of the human monoclonal antibody (mAb) library in the sera of convalescent patients showed that most anti-S antibodies recognize RBD, and a small portion of antibodies recognizes NTD.39 Some NTD-targeted mAbs can effectively inhibit SARS-CoV-2 infection in vitro; in vivo, the immune system uses neutralization and Fc-mediated effector function activities.40 Fc receptor cells generally include B cells, killer cells, and macrophages. Compared with neutralizing RBD targeting antibodies that recognize multiple nonoverlapping epitopes, effective NTD targeting neutralizing antibodies appear to target a single supersite:41 N17, N74, N122, and N149. However, popular variants will partially or completely escape the neutralization mediated by human mAbs targeting the antigen supersite (site i).39 The variants include B.1.1.7, B. 1.35, and P.1 pedigree. It is difficult for immune system antibodies to neutralize part of the mutation sites in the NTD and the fusion region of the S2 membrane.

In this present study, we used the domain search method to find that many proteins of the SARS-COV-2 virus have MHC-like structures. It indicates that SARS-COV-2 interferes with antigen presentation and attacks immune cells through the MHC-like systems. The SARS-COV-2 virus protein with MHC-like forms could interfere with the antigen presentation response by binding to the MHC receptor of immune cells. The SARS-COV-2 virus employees the MHC-like structures of the S protein as bait. After the SARS-COV-2 S protein binds to CD4+ T, CD8+ T, and NK cells, the oxygen free radicals (ROS) generated by the E protein destroys these immune cells, resulting in a decrease in the number of lymphocytes. Through the analysis of the MHC-like enhanced regions of existing popular variants, we found that: 127–194 and 144–162 areas of S MHC-like of delta variants were in the NTD; the 62–80 regions of S MHC-like of alpha, beta, lambda variants were also in the NTD; the 616–676 and 1014–1114 regions of S MHC-like of gamma variants were in the S2 membrane fusion region.

METHODS

Data Set

The Sequences of SARS-COV-2 Proteins

The SARS-COV-2 protein sequences came from the NCBI database. Including: S, E, N, M, ORF3a, ORF8, ORF7a, ORF7b, ORF6, ORF10, ORF1ab.

MHC-Related Sequence

We downloaded 18,112 protein sequences of MHC-related from the UniProt data set and searched keyword was “MHC.” The MHC-related sequences were compared with the viral proteins to search for the conserved domains.

Localized MEME Tool to Scan for Conserved Domains

The analysis steps are listed as follows:

  1. Download MEME from the official website and subsequently install in the virtual machine Ubuntu operating system. The virtual machine was VM 15.2.

  2. Download the SARS-COV-2 protein sequence from NCBI official website.

  3. Download the FASTA format sequence of MHC-related from Uniprot official website, respectively. The search keyword was “MHC.”

  4. For each sequence in all MHC-related protein, paired with each SARS-COV-2 protein sequence to generate fasta format files for MEME analysis.

  5. For the files generated in Step 4, a batch of 50,000 was used to create several batches. It was considered as the limited space of the virtual Ubuntu system.

  6. In Ubuntu, searched the conserved domains (E-value<=0.05) of SARS-COV-2 protein and MHC-related with MEME tools in batches.

  7. Collected the result files of conserved domains. Find the domain name corresponding to the motif from the UniProt database.

  8. We analyzed the domains’ activity of the each SARS-COV-2 protein according to the characteristics of the MHC-related protein domains.

RESULTS

We downloaded MHC-related sequences from the UniProt database. Then compared these sequences with the SARS-COV-2 protein sequences to find the domains related to MHC function. We merged the motif sequences according to the domains of the search results. Both MHC 1 and MHC 2 structures include Ig-like and MHC domains. If a viral protein could bind to the antigen peptide like the MHC protein, the viral protein would have both domains.

SARS-COV-2 Virus Proteins Had Ig-Like Domains

Ig-like domains are involved in multiple functions, including cell-to-cell recognition, cell surface receptors, muscle structure, and the immune system. We first listed Ig-like domains of viral proteins in Table 1. Table 1 shows the structural proteins (S, E, N, M) and nonstructural proteins (ORF3a, ORF6, ORF7a, ORF7b, ORF8, ORF10, ORF1ab) of SARS-COV-2 all have Ig-like domains. Ig-like (IPR032165) is a domain composed of ∼100 residues. Smaller domains (74–90 residues) are observed in several Ig-related molecules (CD2, CD4). The Ig-like motifs of ORF10, E, some subprotein of ORF1ab are the short. ORF7a Ig-like A, ORF8 Ig-like A, ORF3a Ig-like C, N Ig-like B and C, M Ig-like A and C, S Ig-like B and H, 3′-to-5′ exonuclease C, 3′-to-5′ exonuclease C, helicase B motifs are longer. The Ig-like structures may help the receptor of CD4+ T, CD8+ T, and NK-cell recognize the MHC-like area of the viral proteins.

TABLE 1.

Motifs of Ig-like domains of SARS-COV-2 virus proteins

Protein Alias Motif Start End
S A WFHAIH 64 69
B KVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEY 129 170
C DCTMYIC 737 743
D MQMAYR 900 905
E YHLMSFPQSAPHG 1047 1059
F HVTYVPAQEKNFTTAPAICHDGKAHFPRE 1064 1092
G THWFVTQRNFYEPQI 1100 1114
H DLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHY 1199 1272
E A AILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPD 32 72
M A WICLLQFAYANRNRFLYIIKLIFLWLLWPVTLACFVLAAVYRINWITGGIAIAMACLV 31 88
B MWSFNPE 109 115
C HHLGRCDIKDLPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIA 154 218
N A QGLPNNTASWFTALTQHGKED 43 63
B DQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIG 82 147
C LIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTW 291 330
D FKDQVILLNKHIDAYKTFPPTE 346 367
ORF3a A MDLFMR 1 6
B ASKIITLKKRWQ 59 70
C YLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYNS 107 162
D EHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQ 181 213
E HVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYD 227 265
ORF6 A MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLTENKYSQLDEEQPMEID 1 61
ORF7a A MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNSPFHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIVAAIVFITLCFTL 1 116
ORF7b A MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA 1 43
ORF8 A MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKWYIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRCSFYEDFLEYHDVRV 1 116
ORF10 A MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT 1 38
nsp2 A IDTKRGVYCCREHEHEIAWYTERSEKSYELQTPF 42 75
B CDHCGETSWQTGDFVKATCE 143 162
nsp3 A SHMYCSFY 100 107
B EDDYQGKPLEFGATSAALQPEEEQEEDW 134 161
C SEYTGNYQCGHYKHITSKE 1007 1025
D HKPIVWH 1169 1175
E HFISNSWLMWLIINLVQM 1539 1556
F YYVWKSYVHVVDGCNSSTCMMCYKRNRATRVE 1573 1604
G SHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPF 1888 1922
nsp4 A MRFRRAFGEYSH 302 313
B FLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRV 359 402
nsp6 A YFNMVYMPASWVMRIMTWLDM 80 100
nsp10 A SCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTC 72 103
B CTVCGMWKGYGCSCDQ 117 132
RNA-dependent RNA polymerase A RYFKYWDQTYHPNCVNCLDDRCI 285 307
B FYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIM 594 633
C SRYWEPEFYEAMYTPH 913 928
2′-O-ribose methyltransferase A EHSWNADLYKLMGHFAWWT 173 191
3C-like proteinase A YDCVSFCYMHHMELP 154 168
3′-to-5′ exonuclease A DMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIG 48 88
B PPPGDQFKHLIP 140 151
C TYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIG 223 300
D RHHANEYRLYLDAYNM 485 500
helicase A MPLSAPTLVPQEHYVRITG 233 251
B SAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYN 468 516

Ig indicates immunoglobulin; SARS-COV-2, severe acute respiratory syndrome coronavirus 2.

SARS-COV-2 Virus Proteins had MHC Domains

We listed MHC-like domains of viral proteins in Table 2. Table 2 shows that the structural proteins (S, E, N, M) and nonstructural proteins (ORF3a, ORF6, ORF7a, ORF7b, ORF8, ORF10, ORF1ab) of SARS-COV-2 have MHC_I-like_Ag-recog domains. Many proteins have the MHC_II_alpha and MHC_II_beta domains. N, ORF10, ORF3a, 2′-O-ribose methyltransferase, nsp10, nsp6, S and have MHC2-interact domains. N, ORF10, ORF3a, ORF8, ORF7b, 2′-O-ribose methyltransferase, nsp6, and S have MHCassoc_trimer domain. ORF10, ORF3a, 3′-to-5′ exonuclease, nsp4 has MHC_I_2 domain. S has the MHC_I_C domain.

TABLE 2.

MHC domains’ motifs of SARS-COV-2 virus proteins

Protein Domain Alias Motif Start End
S MHC_I_2 A CEFQFCNDPFLGVYYHKNNKSWMESE 131 156
B WPWYIW 1212 1217
MHC_I_C A KWPWYIWLGFIAGLIAIVMVTIMLCCM 1211 1237
MHC_II_alpha A CEFQFCNDPFLGVYYHKNNKSWMESEFRVYSS 131 162
B QIPFAMQMAYR 895 905
C LGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSC 1203 1243
MHC_II_beta A WFHAIHVSGTNGTKRFD 64 80
B VIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMD 127 178
C FAMQMAYRFN 898 907
D KMSECV 1028 1033
E YVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQR 1067 1107
F DLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPV 1199 1264
MHC_I-like_Ag-recog A VTWFHAIH 62 69
B VIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVF 127 194
C RFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGV 346 382
D SNKKFLPFQQFGRDIADTTDAVRDPQTLE 555 583
E NCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQT 616 676
F IPFAMQMAYR 896 905
G RAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQI 1014 1114
H QPELDSFKEELDKY 1142 1155
I ESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT 1195 1273
MHC2-interact A APAICHDGKAHFPRE 1078 1092
B WPWYIW 1212 1217
C IVMVTIMLCCMTSCCSCLKGCC 1227 1248
MHCassoc_trimer A YYHKNNKSWMESEFRVYSS 144 162
B AHFPREGVFVSNGTHW 1087 1102
C ELGKYEQYIKWPWYIW 1202 1217
E MHC_II_alpha A TLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLN 30 66
MHC_II_beta A FVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPD 23 72
MHC_I-like_Ag-recog A ALRLCAYCCNI 36 46
MHCassoc_trimer A CAYCCNI 40 46
M MHC_II_alpha A RCDIKDLPKE 158 167
MHC_II_beta A EELKKLLEQWN 11 21
B SMWSFNPETN 108 117
C HHLGRCDIKDLPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDN 154 216
MHC_I-like_Ag-recog A GHHLGRCDIKD 153 163
N MHC_II_alpha A DQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHI 82 146
B TDYKHWPQIAQFAPSASAFFGMSRIGMEVT 296 325
MHC_II_beta A SWFTALTQHGKEDLKFPRGQGVPIN 51 75
B QIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIV 83 158
C EQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPK 280 369
MHC_I-like_Ag-recog A RRPQGLPNNTASWFTALTQHGKEDL 40 64
B DDQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTTLPKGFY 81 172
C LIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKD 291 371
MHC2-interact A RWYFYYL 107 113
MHCassoc_trimer A WYFYYL 108 113
ORF3a MHC_I_2 A NFVRIIMRLWLCW 119 131
MHC_II_alpha A RIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPYN 122 161
MHC_II_beta A MDLFMR 1 6
B NFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIP 119 159
MHC_I-like_Ag-recog A INFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPY 118 160
B YNKIVDEPEEHVQIH 233 247
C NPVMEP 257 262
MHC2-interact A YFLCWHTNC 145 153
MHCassoc_trimer A IMRLWLCWKCRSKNP 124 138
B DANYFLCWHTNCYDYCIPYN 142 161
ORF6 MHC_II_alpha A DFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLTENKYSQ 6 51
MHC_II_beta A MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSLTENKY 1 49
MHC_I-like_Ag-recog A MFHLVD 1 6
B TFKVSIWNLDYIINLIIKNLSKSLTENKYSQ 21 51
ORF7a MHC_II_alpha A YEGNSPFH 40 47
MHC_II_beta A GTTVLLKEPCSSGTYEGNSPFHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFLIVAAIVFI 26 110
MHC_I-like_Ag-recog A CELYHYQECVRG 15 26
B YEGNSPFHPLADNK 40 53
C CPDGVKHVY 67 75
ORF7b MHC_II_alpha A DHNETCHA 36 43
MHC_II_beta A CFLAFLLFLVLIMLIIFWFSLELQDHNETCH 12 42
MHC_I-like_Ag-recog A FYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA 9 43
MHCassoc_trimer A MLIIFWFSLELQDHNETCH 24 42
ORF8 MHC_II_alpha A TTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKWYIRVGARKSAPLI 11 58
MHC_II_beta A TVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKWYIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINC 12 90
MHC_I-like_Ag-recog A ITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSKWYIRVGARKS 10 54
B DEAGSKSPIQYIDI 63 76
C NYTVSCLPFTINCQEPK 78 94
MHCassoc_trimer A DDPCPIHFYSKW 34 45
ORF10 MHC_I_2 A AFPFTIYSLLLCRMNSRNYIAQVDVVN 8 34
MHC_II_beta A MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT 1 38
MHC_I-like_Ag-recog A MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFN 1 36
MHC2-interact A MGYINVFAFPFTIYSLLLC 1 19
MHCassoc_trimer A MGYINVFAFPFTIYSLLLCRMNSRNYIA 1 28
nsp2 MHC_II_alpha A RGVYCCREHEHEIAW 46 60
MHC_II_beta A DTKRGVYCCREHEHEIAWYTERSEKSYELQTPF 43 75
MHC_II_beta B DGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQT 114 153
MHC_I-like_Ag-recog A FIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEI 41 77
MHC_I-like_Ag-recog B KLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNA 112 184
MHC_I-like_Ag-recog C CPACHNSEVGPEHSLAEYHN 190 209
nsp3 MHC_II_beta A DYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWS 1153 1197
MHC_II_beta B IMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECT 1528 1606
MHC_II_beta C CSARHIN 1876 1882
MHC_I-like_Ag-recog A ASHMYCSFYPPDEDEEEGDCEEEEF 99 123
MHC_I-like_Ag-recog B QPEEEQEEDW 152 161
MHC_I-like_Ag-recog C NEKQEILGTVSWNLREMLAHAEETR 544 568
MHC_I-like_Ag-recog D WCIRCLW 1190 1196
MHC_I-like_Ag-recog E SWLMWLIINLVQMAPISAMVRMYIFFASFYYVW 1544 1576
MHC_I-like_Ag-recog F RRSFYVYANGGKGFCKLHNWNCVNCDT 1613 1639
nsp4 MHC_I_2 A QWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRR 364 401
MHC_II_alpha A QWMVMFTPLVPFWI 364 377
MHC_II_beta A EYCRHGTCER 219 228
MHC_II_beta B HIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRR 362 401
MHC_I-like_Ag-recog A PVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQR 29 78
MHC_I-like_Ag-recog B FYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWF 351 393
nsp6 MHC_II_beta A QSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKH 27 62
MHC_II_beta B WVMRIMTWLDM 90 100
MHC_I-like_Ag-recog A SWVMRIMTWLDM 89 100
MHC2-interact A MVYMPASWVMRIMTWLDM 83 100
MHCassoc_trimer A GTHHWL 9 14
nsp7 MHC_I-like_Ag-recog A WAQCVQLHND 29 38
nsp8 MHC_I-like_Ag-recog A KSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTM 46 90
nsp10 MHC_II_beta A NMDQESFGGASCCLYCRCHIDHPNP 62 86
MHC_II_beta B WKGYGCSCDQLREPMLQ 123 139
MHC_I-like_Ag-recog A TPEANMDQESFGGASCCLYCRCHIDHPN 58 85
MHC2-interact A YCRCHIDHPNPKGFCD 76 91
RNA-dependent RNA polymerase MHC_II_alpha A RKHTTCCSLSHRFYR 640 654
MHC_II_alpha B YWEPEF 915 920
MHC_II_beta A ERLKLFDRYFKYWDQTYHPNCVNCLDDRCILH 278 309
MHC_II_beta B YSDVENPHLMGWDYPKCDRAMPNMLRIMA 606 634
MHC_II_beta C HPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVM 872 906
MHC_II_beta D SRYWEPEFYEAMYT 913 926
MHC_I-like_Ag-recog A SNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRL 78 119
MHC_I-like_Ag-recog B FDRYFKYWDQTYHPNCVNCLDDRCILH 283 309
MHC_I-like_Ag-recog C KFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGS 593 672
MHC_I-like_Ag-recog D KCWTETDLTKGPHEFCSQHTMLVKQGDDY 798 826
MHC_I-like_Ag-recog E LMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHT 854 929
2′-O-ribose methyltransferase MHC_II_alpha A TEHSWNADLYKLMGHFAWW 172 190
MHC_II_beta A PREQIDGYVMHANYIFWRNT 215 234
MHC_I-like_Ag-recog A HSWNADLYKLMGHFAWWT 174 191
MHC_I-like_Ag-recog B PREQIDGYVMHANYIFWR 215 232
MHC2-interact A MGHFAWWTAF 184 193
MHCassoc_trimer A MGHFAWW 184 190
3C-like proteinase MHC_II_beta A YMHHMEL 161 167
MHC_I-like_Ag-recog A YDCVSFCYMHHME 154 166
3′-to-5′ exonuclease MHC_I_2 A CWHHSIGFDYVYNPFMIDVQQW 226 247
MHC_II_beta A EGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRE 36 99
MHC_II_beta B CWHHSIGFDYVYNPFMIDVQQW 226 247
MHC_II_beta C AVCRHHANEYRLYLDAYNMMISAGFSLWVYKQ 482 513
MHC_I-like_Ag-recog A IPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATR 42 98
MHC_I-like_Ag-recog B DTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHM 222 315
MHC_I-like_Ag-recog C CRHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTF 484 523
endoRNAse MHC_I-like_Ag-recog A RNLQEFKPRSQMEIDFLELAMDEFIERYKLEGYAFEHI 198 235
helicase MHC_I-like_Ag-recog A RPFLCCKCCYDHVISTSH 22 39
MHC_I-like_Ag-recog B EPEYFNSVCRLMKTIGPDMFLGTCRR 418 443
MHC_I-like_Ag-recog C REFLTRNPAWRKAVFISPYNSQNA 497 520

MHC indicates major histocompatibility complex; SARS-COV-2, severe acute respiratory syndrome coronavirus 2.

We downloaded the functional descriptions of the relevant domains from the InterPro database.

The members of MHC_I_2 (PF14586) are called retinoic acid-inducible proteins. They are ligands that activate the immune receptor NKG2D. NKG2D is widely expressed on NK cells, T cells, and macrophages. MHC_I_C (PF06623) represents the C-terminal region of MHC class I antigen. MHC_I-like_Ag-recog (IPR011161) is an MHC class I antigen recognition sample. Class I MHC glycoproteins are expressed on the surface of all somatic nucleated cells, except neurons. MHC class I receptors present peptide antigens synthesized in the cytoplasm, including self-peptides (offered for self-tolerance) and foreign peptides (such as viral proteins). These antigens are produced by degraded protein fragments transported by the TAP protein (antigenic peptide transporter) to the endoplasmic reticulum, where they can bind to MHC I molecules and then transport them to the cell surface through the Golgi apparatus. MHC class I receptors display antigens recognized by cytotoxic T cells that can destroy virus-infected or malignant (self-peptide excess) cells. CD8+ T toxic cells and NK cells can recognize class I MHC proteins.

MHC_II_alpha (SM00920) is the alpha domain of class II histocompatibility antigen. MHC_II_beta (SM00921) is the beta domain of class II histocompatibility antigen. Class II MHC glycoproteins are expressed on the surface of antigen-presenting cells, including macrophages, dendritic cells, and B cells. MHC II protein presents extracellular peptide antigens derived from foreign substances such as bacteria. Proteins from pathogens are degraded into peptide fragments within the antigen-presenting cells. These fragments are sequestered into endosomes to bind to MHC class II proteins before being transported to the cell surface. MHC class II receptors display antigens for recognition by helper T cells and Inflammatory T cells. CD4+T helper cells recognize MHC class II proteins.

MHC2-interact (PF09307) is the interaction domain of class II invariant chain-related peptides and MHC2. Members of this family are found in class II invariant chain-related peptides. They are required for binding to the class II MHC in the MHC class II processing pathway. MHCassoc_trimer (PF08831) is an invariant chain trimerization domain related to class II MHC. The folding and positioning of MHC class II heterodimers require class II-related consistent chain peptides. This domain participates in the trimerization of the ectoderm and interferes with DM/class II binding. The trimeric protein forms a cylindrical shape, which is considered necessary for the interaction between the invariant and class II molecules.

We noticed that S protein could form a trimer structure and had three MHCassoc_trimer domains: MHCassoc_trimer A, B, and C. MHCassoc_trimer A is in S1 protein, but MHCassoc_trimer B and C in S2 protein. It represents that MHCassoc_trimer plays an important role in the formation of S protein trimer.

MHC-like Structures Had a Decoy Function Against the Immune System

Previous analysis shows that structural proteins and nonstructural proteins can bind to T (CD4+ T and CD8+ T) and NK immune cells through MHC-like structures. The binding prevented the MHC receptors of immune cells from securing to MHC, causing interference in antigen presentation. In addition, E protein generates oxygen free radicals (ROS) after attaching to heme, and the hydroxyl free radicals directly damaged cell membranes.11 After CD4+ T, CD8+ T, and NK cells were bound to the MHC-like structure of S protein, the hydroxyl free radicals generated by E protein destroyed these immune cell membranes. It caused immune cells to die because of oxidative stress. For these two reasons, the MHC-like structure of the SARS-COV-2 virus protein had a decoy function against immune cells.

We noticed that N protein could form a multimer, S protein could form a trimer structure, and E could form a pentameric channel structure. The ORF3a protein and ORF8 protein can form a dimer structure, respectively. The dimer of ORF3a protein (or ORF8 protein) has a groove structure. The Ig-like sites of the ORF3a protein do not fully overlap with the MHC-like areas (Tables 1, 2). However, Table 2 shows that the MHC II and MHC I of ORF3a are located at the “CWKCR” heme-binding area42 and upstream and downstream. But the sites of MHC structures are not near the groove structure on the ORF3a crystal structure view (PDBID: 6xdc). Therefore, ORF3a is unlikely to have the ability to bind antigen peptides.

The Ig-like structure of the transmembrane protein ORF8 overlaps with the MHC-like system. The dimer of ORF8 has no rod-like structure. Table 2 and the crystal structure view of ORF8 (PDBID: 7jtl) show that the MHC II and MHC I structures of ORF8 include sites near the groove structure. Therefore, ORF8 may trapp antigen peptides through the MHC structure and interfering with antigen presentation. Besides, ORF8 captures MHC-1 and reroutes to autophagosomes for degradation.43 Table 2 indicates that the MHC I-like domain of ORF8 is MHC_I-like_Ag-recog. ORF8 also has the MHCassoc_trimer domain. The MHC II-like domain of ORF8 overlaps with the MHC_I-like_Ag-recog and MHCassoc_trimer structures. So ORF8 may trap MHC I by MHC_I-like_Ag-recog and MHCassoc_trimer domains. Therefore, the MHC-like system of ORF8 has a decoy function for MHC I or antigen peptides.

MHC-like Enhanced Regions of S Protein Mutation

If the S mutation site was in the MHC-like domain, the mutation enhanced the MHC decoy function. Then the human immune system hard to neutralize these MHC-like sites by producing antibodies. Otherwise, it would affect the normal MHC antigen presentation function by combing MHC and the antibodies. On the basis of this principle, we analyzed several significant variants that were now popular to determine the MHC-like enhanced region of the S protein.

SARS-COV-2 Delta Variant

The B.1.617.2/Delta variant is highly confluent, especially in infected hamsters more pathogenic than the prototype SARS-COV-2.44 The virus is more infectious and directly reduces the efficacy of antibodies produced by infection and vaccines. It is the most prevalent and difficult mutant virus strain in the world.

B.1.617.2/Delta variant mutation sites include:45 T19R, G142D, E156G, F157Δ, R158Δ, L452R, T478K, D614G, P681R, D950N. Table 3 shows that G142D, E156G, F157Δ, and R158Δ are all in the MHC_II_alpha A, MHC_II_beta B, MHC_I-like_Ag-recog B domains. Both G142D and E156G are in the MHC_I_2 A domain. E156G, F157Δ, and R158Δ are all located in the MHCassoc_trimer A domain. Other mutation sites are not in the MHC-like domain. These four MHC-like domains are highly overlapping. Combining these four MHC-like domain sites, the MHC-like enhanced distribution area of B.1.617.2/Delta variant S protein is 127–194. It is in the NTD (14–305 residues, the S1 protein region).46 Among them, MHCassoc_trimer (144–162) plays an essential role in forming S trimer. The MHC-like enhanced distribution area of the S protein has MHC-I_like and MHC-II_like functions, so it can also bind to CD4+ T, CD8+ T, NK cells.

TABLE 3.

The S1 mutation sites of the B.1.617.2/Delta variant are in the MHC-like domains

Domain Alias Start End T19R G142D E156G F157Δ R158Δ L452R T478K D614G P681R D950N
MHC_I_2 A 131 156 V V
B 1212 1217
MHC_I_C A 1211 1237
MHC_II_alpha A 131 162 V V V V
B 895 905
C 1203 1243
MHC_II_beta A 64 80
B 127 178 V V V V
C 898 907
D 1028 1033
E 1067 1107
F 1199 1264
MHC_I-like_Ag-recog A 62 69
B 127 194 V V V V
C 346 382
D 555 583
E 616 676
F 896 905
G 1014 1114
H 1142 1155
I 1195 1273
MHC2-interact A 1078 1092
B 1212 1217
C 1227 1248
MHCassoc_trimer A 144 162 V V V
B 1087 1102
C 1202 1217

MHC indicates major histocompatibility complex.

SARS-COV-2 Gamma Variant

The seropositivity rate of SARS-COV-2 antibody is very high. There is a greater chance of infectivity and death. The S mutation sites of Gamma variants are:47,48 L18F, T20N, P26S, D138Y, R190S, K417T, E484K, N501Y, D614G, H655Y, T1027I, V1176F. Table 4 shows that D138Y is in the MHC_I_2 A, MHC_II_alpha A, MHC_II_beta B, and MHC_I-like_Ag-recog B domains. R190S, H655Y and T1027I are in MHC_I-like_Ag-recog B, E, G domains, respectively. Compared with the B.1.617.2/Delta variant, the MHC-like domain of the S protein of the SARS-COV-2 Gamma variant has two mutation points, H655Y, and T1027I. It shows that Gamma variant S participates in receptor-binding H655Y and participates in membrane fusion T1027I in the MHC-LIKE region. Therefore, the infection rate and mortality of Gamma variants are high. However, the Gamma variant does not have a mutation site located in the MHCassoc_trimer domain. It may not enhance the immune escape of the regulatory region of the S trimer. Therefore, the MHCassoc_trimer domain (144–162) is an important reason why the Delta variant spreads infection faster than the Gamma variant.

TABLE 4.

P.1/Gamma Variant S1 and S2 mutation sites are in the MHC-like domains

Domain Alias Start End L18F T20N P26S D138Y R190S K417T E484K N501Y D614G H655Y T1027I V1176F
MHC_I_2 A 131 156 V
B 1212 1217
MHC_I_C A 1211 1237
MHC_II_alpha A 131 162 V
B 895 905
C 1203 1243
MHC_II_beta A 64 80
B 127 178 V
C 898 907
D 1028 1033
E 1067 1107
F 1199 1264
MHC_I-like_Ag-recog A 62 69
B 127 194 V V
C 346 382
D 555 583
E 616 676 V
F 896 905
G 1014 1114 V
H 1142 1155
I 1195 1273
MHC2-interact A 1078 1092
B 1212 1217
C 1227 1248
MHCassoc_trimer A 144 162
B 1087 1102
C 1202 1217

MHC indicates major histocompatibility complex.

SARS-COV-2 Alpha Variant

B.1.1.7/Alpha variant has a more tremendous increase in the transmission rate than the earlier SARS-COV-2 virus. However, there is no significant difference in overall mortality. The mutation site of S in the B.1.1.7/Alpha variant is:49 ∆69–70, ∆144, ∆145, N501Y, A570D, D614G, P681H, T716I, S982A, D1118H. Table 5 shows that ∆69–70 is at MHC_II_beta A and MHC_I-like_Ag-recog A domains. ∆144 and ∆145 is in MHC_I_2 A, MHC_II_alpha A, MHC_II_beta B, MHC_I-like_Ag-recog B, MHCassoc_trimer A domains. A570D is in MHC_I-like_Ag-recog D domain. ∆144 and A570D are all in the S1 protein. There are no MHC-like domain mutations in the S2 protein. The mutation at position 144-145 is in the Alpha variant S. The mutation at position 156 is in the delta variant S. They are in the MHCassoc_trimer A domain. The mutations at position 144–162 may enhance the immune escape of the MHC-like region involved in receptor-binding and regulate trimer’s formation.

TABLE 5.

B.1.1.7/Alpha variant S1 mutation sites are in the MHC-like domains

Domain Alias Start End ∆69–70 ∆144–∆145 N501Y A570D D614G P681H T716I S982A D1118H
MHC_I_2 A 131 156 V
B 1212 1217
MHC_I_C A 1211 1237
MHC_II_alpha A 131 162 V
B 895 905
C 1203 1243
MHC_II_beta A 64 80 V
B 127 178 V
C 898 907
D 1028 1033
E 1067 1107
F 1199 1264
MHC_I-like_Ag-recog A 62 69 V
B 127 194 V
C 346 382
D 555 583 V
E 616 676
F 896 905
G 1014 1114
H 1142 1155
I 1195 1273
MHC2-interact A 1078 1092
B 1212 1217
C 1227 1248
MHCassoc_trimer A 144 162 V
B 1087 1102
C 1202 1217

MHC indicates major histocompatibility complex.

SARS-COV-2 Beta Variant

The vaccine is effective against the B.1.351/Beta variant. The mutation sites of B.1.351/Beta variant S are:50 L18F, D80A, D215G, LAL241–243∆, K417N, E484K, N501Y, D614G, A701V. Table 6 shows that D80A is in the MHC_II_beta A domains. Most of the other mutation sites are not in the MHC-like domains. It shows that most of the mutation sites do not affect the MHC-like domain.

TABLE 6.

The S1 mutation site of the B.1.351/Beta variant is located in the MHC-like domain

Domain Alias Start End L18F D80A D215G LAL241–243∆ K417N E484K N501Y D614G A701V
MHC_I_2 A 131 156
B 1212 1217
MHC_I_C A 1211 1237
MHC_II_alpha A 131 162
B 895 905
C 1203 1243
MHC_II_beta A 64 80 V
B 127 178
C 898 907
D 1028 1033
E 1067 1107
F 1199 1264
MHC_I-like_Ag-recog A 62 69
B 127 194
C 346 382
D 555 583
E 616 676
F 896 905
G 1014 1114
H 1142 1155
I 1195 1273
MHC2-interact A 1078 1092
B 1212 1217
C 1227 1248
MHCassoc_trimer A 144 162
B 1087 1102
C 1202 1217

MHC indicates major histocompatibility complex.

Mutation sites G75V, T76I of the SARS-COV-2 C.37/lambda51 variant are both in the MHC_II_beta A domain. So S 62–80 of SARS-COV-2 alpha, beta, lambda variants were the first MHC-like enhanced distribution area. The second MHC-like enhanced distribution area of S protein is 127–194, located in the NTD (14–305 residues) of S1 protein. MHCassoc_trimer (144–162) is a trimer of S Formation, which plays an important role. The third and fourth MHC-like enhanced distribution areas of S protein are MHC_I-like_Ag-recog E (616–676), MHC_I-like_Ag-recog G (1014–1114).

Mutations in MHC-Like Regions of RNA-Dependent RNA Polymerase and N Protein Causing High Viral Load of Delta Variant

The viral load of Delta variant patients is very high. It shows that the replication activity of the Delta variant of the SARS-COV-2 virus is very active. The viral proteins directly related to viral replication activities are orf1ab and N proteins. We searched the orf1ab and N protein mutation sites of five variants of Alpha, Beta, Gama, Delta, and Lambda from “CORONAVIRUS CORONAVIRUS ANTIVIRAL & RESISTANCE DATABASE” (https://covdb.stanford.edu/page/mutation-viewer). Then compared with Table 2 to find the MHC-like enhanced region sites of orf1ab and N proteins (Table 7). Table 7 shows that the orf1ab and N protein mutation sites of Alpha, Beta, and Gama variants are not in the MHC-like region. The N protein mutation site of the Lambda variant is also not in the MHC-like area. However, the mutation sites of Delta (orf1ab and N protein) and Lambda (orf1ab protein) are in the MHC-like region. The RdRP: G671S mutation site of Delta variant orf1ab is at MHC_I-like_Ag-recog C. The nsp3:F1569V of Delta variant orf1ab is at MHC_II_beta B and MHC_I-like_Ag-recog E. The D63G of Dalta variant N is in MHC_II_beta A, MHC_I-like_Ag-recog A. Both N and RNA-dependent RNA polymerase are directly related to virus replication. In addition, the interaction between N and Nsp3 is essential for connecting the viral genome for processing. So, Table 7 indicates that the mutations of RdRP : G671S and N : D63G enhanced the immune escape ability of the Delta variant virus during the replication process. Therefore, the replication activity for this variant is very active.

TABLE 7.

MHC-like enhancement sites of orf1ab and N proteins

Protein Variant Code Mutation site MHC-like enhancement site
orf1ab Alpha B.1.1.7 nsp3:T183I, nsp3:A890D, nsp3:I1412T, nsp6:SGF106-108, RdRp:P323L
Beta B.1.351 nsp2:T85I, nsp3:K837N, 3CL:K90R, nsp6:SGF106-108, RdRP:P323L
Gama P.1 nsp3:S370L, nsp3:K977Q, nsp6:SGF106-108, RdRP:P323L, nsp13: E341D
Delta B.1.617.2 nsp3:A488S, nsp3:P1228L, nsp3:P1469S, nsp4:V167L, nsp4:T492I, nsp6:T77A, RdRP:P323L, RdRP: G671S, nsp13:P77L, nsp14:A394V RdRP: G671S (MHC_I-like_Ag-recog C)
Lambda C.37 nsp3:T428I, nsp3:P1469S, nsp3:F1569V, nsp4:L438P, nsp4:T492I, 3CL:G15S, nsp6:SGF106-108, RdRP:P323L nsp3:F1569V(MHC_II_beta B, MHC_I-like_Ag-recog E)
N Alpha B.1.1.7 D3L, R203K, G204R, S235F
Beta B.1.351 T205I
Gama P.1 P80R, R203K, G204R
Delta B.1.617.2 D63G, R203M, G215G,D377Y D63G (MHC_II_beta A, MHC_I-like_Ag-recog A)
Lambda C.37 P13L, R203K, G204R, G214C

MHC indicates major histocompatibility complex.

DISCUSSION

Genetic Variation of MHC Protected Immune Cells

The MHC molecule is a cell surface protein complex encoded in the HLA locus.52 The genetic variation of the three MHC class I genes (HLA-A, -B, and -C genes) affect the susceptibility and severity of COVID-19 disease.53 The HLA gene complex is closely linked to genes, and there is little exchange between homologous chromosomes. HLA loci located on the same chromosome constitute a closely linked gene group (including HLA-I and II genes), called haplotype or haplotype. A haplotype is inherited as a unit. To the offspring, it is called haplotype genetics. However, HLA haplotypes are not the main risk/protective factor for SARS-COV-2 infection or severity in the Israeli population.54 It indicates that the genetic variation of MHC structure may protect immune cells that can bind to MHC to a certain extent.

In this present study, we found that many proteins of the SARS-COV-2 virus have MHC-like structures recognized by MHC receptors. CD4 or CD8 co-receptors expressed by T cells can bind to part of MHC proteins.13 The inhibitory receptors of NK cells can also bind to the MHC-1 receptor recognition structure. Therefore, the S protein of the SARS-COV-2 virus could bind to CD4+T, CD8+T, and NK cells through MHC-like structures. Then the ROS generated by the E protein destroyed these immune cells,11 resulting in a decrease in lymphocytes. The genetic variation of HLA may produce MHC molecules that could not bind to the viral MHC-like structure. It was helpful for immune cells to evade the attachment and positioning of SARS-COV-2 MHC-like proteins. In this situation, the antigen presentation response would not be disturbed, and immune cells (such as CD4+T, CD8+T, NK cells) would be protected from the virus’s ROS damage.

S Mutations in the MHC-Like Regions Promoted Weaker Immune Resistance and More Robust Transmission

If a mutation site was in the MHC-like domain, the mutation enhanced the MHC decoy function. It challenged the production of antibodies to neutralize these MHC decoy sites. If antibodies could attach to the MHC-like proteins, the antibodies could also bind to MHC proteins. Then the normal MHC antigen presentation function would be affected, and the body would appear autoimmune diseases. It is not a piece of good news for CD4+ T, CD8+ T, NK cells, and other immune cells that can bind to MHC. These immune cells could indiscriminately bind to MHC-like structures of the S protein, and were attacked by ROS from the E protein. Then the immune system could not effectively perform the antigen presentation for the SARS-COV-2 virus protein. It also could not produce the neutralizing antibody effectively. Moreover, the probability of infected cells was killed by immune cells would be significantly reduced.

This present study found that neutralizing antibodies were challenging to generated for mutations in S MHC-like regions 127–194 and 144–162. It occurred with delta variant infections. The delta variant was the SARS-COV-2 virus with a robust transmission. The S 62–80 mutations of SARS-COV-2 alpha, beta, lambda variants had a similar situation. McCallum et al39 found R246A substitution reduces the binding of S2L28, S2M28, and S2X333. This substitution significantly affected the binding of S2X28 and mAb 4A8. The L18F, D80A, D253G/Y, or S255F variants only abolish the combination of S2L28 and NTD. The L18F substitution exists in B.1.351 and P. 1 pedigree. The Y144 deletion abolished the binding to S2M28, S2X28, S2X333, and 4A8 instead of S2L28. It explains that these mAbs have lost the ability to neutralize the B.1.1.7 S pseudovirus, which contains this deletion. The H146Y mutant reduces S2M28, S2X28, especially the combination of 4A8. The binding of all site i-specific NTD mAbs to B.1.351 NTD is abolished, and 4A8 does not recognize this NTD variant. The evidence indicates that the NTD variants located in MHC-like regions 127–194 and 144–162 enhance the immune escape of the virus and increase the efficiency of virus transmission.

We also found that the immune system was challenging to generate neutralizing antibodies against mutations in S MHC-like 616–676 and 1014–1114 regions. It happened to gamma variant infections. The gamma variant was the SARS-COV-2 virus that caused high mortality. Chen et al55 find that specific mAbss have reduced or weakened neutralizing activity against B.1.351, B.1.1.28, B.1.617.1, and B.1.526 viruses in cell culture. And the neutralizing effect of antibodies against H655Y and T1027I mutation sites is not apparent.55 It shows that the variants in MHC-like regions 616–676 and 1014–1114 also strengthen the immune escape of the virus, and enhance the virus’s receptor engagement and membrane fusion ability.

CONCLUSION

The high mortality rate of COVID-19 is related to poor antigen presentation and lymphopenia. MHC genetic variations may protect immune cells. CMV and the herpes family encode a series of MHC-like molecules required for targeted immune responses to achieve immune escape. This present study used bioinformatics methods to study whether the SARS-COV-2 virus proteins also had MHC-like structures. The domain search results indicate that MHC receptors could recognize many proteins of the SARS-COV-2 virus because of their MHC-like domains. The MHC-like structures were equivalent to bait against the human immune system. We believed that the SARS-COV-2 virus proteins with MHC-like structures could bind to the MHC receptor of immune cells to interfere with the antigen presentation response. After the S protein was bound to CD4+T, CD8+T, and NK cells through MHC-like structures, ROS generated by the E protein destroyed these immune cells, decreasing the number of lymphocytes. Mutations in the MHC-like region of the proteins such as S protein promoted weaker immune resistance and more robust transmission. The mutations in the S MHC-like 127–194 and 144–162 regions were the reason for the entire transmission of delta variant. It is worth noting that the 144–162 region regulates the formation of S trimer. Mutations in S MHC-like 62–80 of SARS-COV-2 alpha, beta, lambda variants were one important factor for fast-spreading. The mutations in the S MHC-like 616–676 and 1014–1114 regions were causes of high mortality for gamma variants infections. The mutations of RdRP : G671S and N : D63G of delta variant caused high viral load.

Acknowledgments

CONFLICTS OF INTEREST/FINANCIAL DISCLOSURES

This work was funded by a grant from the Talent Introduction Project of Sichuan University of Science and Engineering (award number: 2018RCL20, grant recipient: W.Z.L.).

All authors have declared that there are no financial conflicts of interest with regard to this work.

Footnotes

Availability of data and material: the data sets and results supporting the conclusions of this article are available at: https://pan.baidu.com/s/1A0DlmP0po3QK_guBOzoWpw, code:x7c9. or: https://mega.nz/folder/M3p0nASC#CO8HgROV9Ydzr9d7RVEMEQ

Funding was obtained by W.Z.L. Besides, design, analysis, and writing are finished by W.Z.L., while data curation and manuscript check are undertaken by H.L.L. Both authors have read and agreed to the published version of the manuscript.

Contributor Information

Wenzhong Liu, Email: liuwz@suse.edu.cn.

Hualan Li, Email: lihualan@yibinu.edu.cn.

REFERENCES

  • 1.Huang C, Wang Y, Li X, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395:497–506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Xu Z, Shi L, Wang Y, et al. Pathological findings of COVID-19 associated with acute respiratory distress syndrome. Lancet Respir Med. 2020;8:420–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sami R, Fathi F, Eskandari N, et al. Characterizing the immune responses of those who survived or succumbed to COVID-19: can immunological signatures predict outcome? Cytokine. 2021;140:155439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lee J, Park S-S, Kim TY, et al. Lymphopenia as a biological predictor of outcomes in COVID-19 patients: a nationwide cohort study. Cancers (Basel). 2021;13:471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Liang C, Bencurova E, Psota E, et al. Population-predicted MHC class II Epitope presentation of SARS-CoV-2 structural proteins correlates to the case fatality rates of COVID-19 in different countries. Int J Mol Sci. 2021;22:2630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wilson EA, Hirneise G, Singharoy A, et al. Total predicted MHC-I epitope load is inversely associated with population mortality from SARS-CoV-2. Cell Rep Med. 2021;2:100221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Jafarzadeh A, Jafarzadeh S, Nozari P, et al. Lymphopenia an important immunological abnormality in patients with COVID-19: possible mechanisms. Scand J Immunol. 2021;93:e12967. [DOI] [PubMed] [Google Scholar]
  • 8.Yan S, Wu G. Is lymphopenia different between SARS and COVID-19 patients? FASEB J. 2021;35:e21245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Saini SK, Hersby DS, Tamhane T, et al. SARS-CoV-2 genome-wide T cell epitope mapping reveals immunodominance and substantial CD8+ T cell activation in COVID-19 patients. Sci Immunol. 2021;6:eabf7550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tavakolpour S, Rakhshandehroo T, Wei EX, et al. Lymphopenia during the COVID-19 infection: what it shows and what can be learned. Immunol Lett. 2020;225:31–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wenzhong L, Hualan L. COVID-19: captures iron and generates reactive oxygen species to damage the human immune system. Autoimmunity. 2021;54:213–224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.McHeyzer-Williams M, Okitsu S, Wang N, et al. Molecular programming of B cell memory. Nat Rev Immunol. 2012;12:24–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Alberts B, Johnson A, Lewis J, et al. T cells and MHC proteins. Molecular Biology of the Cell. 4th ed. Garland Science; 2002. [Google Scholar]
  • 14.Anfossi N, André P, Guia S, et al. 2006Human NK cell education by inhibitory receptors for MHC class I. Immunity, 25:331–342. [DOI] [PubMed] [Google Scholar]
  • 15.Ljunggren H-G, Kärre K. In search of the ‘missing self’: MHC molecules and NK cell recognition. Immunol Today. 1990;11:237–244. [DOI] [PubMed] [Google Scholar]
  • 16.Bruchez A, Sha K, Johnson J, et al. MHC class II transactivator CIITA induces cell resistance to Ebola virus and SARS-like coronaviruses. Science. 2020;370:241–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kumar SA, Dasgupta D, Wei H. An in-silico based clinical insight on the effect of noticeable CD4 conserved residues of SARS-CoV-2 on the CD4-MHC-II interactions. BioRxiv. 2020;6:161802. [Google Scholar]
  • 18.Agerer B, Koblischke M, Gudipati V, et al. SARS-CoV-2 escapes CD8 T cell surveillance via mutations in MHC-I restricted epitopes. BioRxiv. 2020;12:423507. [Google Scholar]
  • 19.Rha M-S, Jeong HW, Ko J-H, et al. PD-1-expressing SARS-CoV-2-specific CD8+ T cells are not exhausted, but functional in patients with COVID-19. Immunity. 2021;54:44–52.e43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.van de Weijer ML, Luteijn RD, Wiertz EJHJ. Viral immune evasion: lessons in MHC class I antigen presentation. Semin Immunol. 2015;27:125–137. [DOI] [PubMed] [Google Scholar]
  • 21.Arase H, Mocarski ES, Campbell AE, et al. Direct recognition of cytomegalovirus by activating and inhibitory NK cell receptors. Science. 2002;296:1323–1326. [DOI] [PubMed] [Google Scholar]
  • 22.Revilleza MJ, Wang R, Mans J, et al. How the virus outsmarts the host: function and structure of cytomegalovirus MHC-I-like molecules in the evasion of natural killer cell surveillance. J Biomed Biotechnol. 2011;2011:724607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Rawlinson WD, Farrell HE, Barrell BG. Analysis of the complete DNA sequence of murine cytomegalovirus. J Virol. 1996;70:8833–8849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Brocchieri L, Kledal TN, Karlin S, et al. Predicting coding potential from genome sequence: application to betaherpesviruses infecting rats and mice. J Virol. 2005;79:7570–7596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Natarajan K, Hicks A, Mans J, et al. Crystal structure of the murine cytomegalovirus MHC-I homolog m144. J Mol Biol. 2006;358:157–171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Pyzik M, Dumaine AA, Charbonneau B, et al. Viral MHC class I-like molecule allows evasion of NK cell effector responses in vivo. J Immunol. 2014;193:6061–6069. [DOI] [PubMed] [Google Scholar]
  • 27.Wyrwicz LS, Rychlewski L. Cytomegalovirus immediate early gene UL37 encodes a novel MHC-like protein. Acta Biochim Pol. 2008;55:67–74. [PubMed] [Google Scholar]
  • 28.Tripathy SK, Smith HRC, Holroyd EA, et al. Expression of m157, a murine cytomegalovirus-encoded putative major histocompatibility class I (MHC-I)-like protein, is independent of viral regulation of host MHC-I. J Virol. 2006;80:545–550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wilkinson GW, Tomasec P, Stanton RJ, et al. Modulation of natural killer cells by human cytomegalovirus. J Clin Virol. 2008;41:206–212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Halenius A, Gerke C, Hengel H. Classical and non-classical MHC I molecule manipulation by human cytomegalovirus: so many targets—but how many arrows in the quiver? Cell Mol Immunol. 2015;12:139–153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Beck S, Barrell BG. Human cytomegalovirus encodes a glycoprotein homologous to MHC class-I antigens. Nature. 1988;331:269–272. [DOI] [PubMed] [Google Scholar]
  • 32.Fahnestock ML, Johnson JL, Renny Feldman RM, et al. The MHC class I homolog encoded by human cytomegalovirus binds endogenous peptides. Immunity. 1995;3:583–590. [DOI] [PubMed] [Google Scholar]
  • 33.Ambagala APN, Solheim JC, Srikumaran S. Viral interference with MHC class I antigen presentation pathway: the battle continues. Vet Immunol Immunopathol. 2005;107:1–15. [DOI] [PubMed] [Google Scholar]
  • 34.Wiertz E, Hill A, Tortorella D, et al. Cytomegaloviruses use multiple mechanisms to elude the host immune response. Immunol Lett. 1997;57:213–216. [DOI] [PubMed] [Google Scholar]
  • 35.Reddehase MJ, Mutter W, Münch K, et al. CD8-positive T lymphocytes specific for murine cytomegalovirus immediate-early antigens mediate protective immunity. J Virol. 1987;61:3102–3108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.He X, Yoshida H, Minamishima Y. Analysis of the role of CD4+ T-cells during murine cytomegalovirus infection in different strains of mice. Virus Res. 1995;36:233–245. [DOI] [PubMed] [Google Scholar]
  • 37.Carrillo-Bustamante P, Keşmir C, De Boer RJ. Virus encoded MHC-like decoys diversify the inhibitory KIR repertoire. PLoS Comput Biol. 2013;9:e1003264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lok S-M. An NTD supersite of attack. Cell Host Microbe. 2021;29:744–746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.McCallum M, De Marco A, Lempp FA, et al. N-terminal domain antigenic mapping reveals a site of vulnerability for SARS-CoV-2. Cell. 2021;184:2332–2347.e2316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Suryadevara N, Shrihari S, Gilchuk P, et al. Neutralizing and protective human monoclonal antibodies recognizing the N-terminal domain of the SARS-CoV-2 spike protein. Cell. 2021;184:2316–2331.e2315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cerutti G, Guo Y, Zhou T, et al. Potent SARS-CoV-2 neutralizing antibodies directed against spike N-terminal domain target a single supersite. Cell Host Microbe. 2021;29:819–833.e817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Liu W, Li H. COVID-19: attacks the 1-beta chain of hemoglobin and captures the porphyrin to inhibit human heme metabolism. ChemRxiv Preprint chemrxiv. 2020:11938173.v9. [Google Scholar]
  • 43.Zhang Y, Zhang J, Chen Y, et al. The ORF8 protein of SARS-CoV-2 mediates immune evasion through potently downregulating MHC-I. Proc Natl Acad Sci. 2021;118:e2024202118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Saito A, Irie T, Suzuki R, et al. Enhanced fusogenicity and pathogenicity of SARS-CoV-2 Delta P681R mutation. Nature. 2022;602:300–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Arora P, Kempf A, Nehlmeier I, et al. Increased lung cell entry of B.1.617.2 and evasion of antibodies induced by infection and BNT162b2 vaccination. bioRxiv. 2021;6:449568. [Google Scholar]
  • 46.Xia S, Zhu Y, Liu M, et al. Fusion mechanism of 2019-nCoV and fusion inhibitors targeting HR1 domain in spike protein. Cell Mol Immunol. 2020;17:765–767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Naveca F, Nascimento V, Souza V, et al. Phylogenetic relationship of SARS-CoV-2 sequences from Amazonas with emerging Brazilian variants harboring mutations E484K and N501Y in the Spike protein. Virologic Org. 2021;1:1–8. [Google Scholar]
  • 48.Faria NR, Mellan TA, Whittaker C, et al. Genomics and epidemiology of the P. 1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 2021;372:815–821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Rambaut A, Loman N, Pybus O, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. Genom Epidemiol. 2020;9:1–5. [Google Scholar]
  • 50.Tegally H, Wilkinson E, Giovanetti M. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature. 2021;592:438–443. [DOI] [PubMed] [Google Scholar]
  • 51.Romero PE, Dávila-Barclay A, Salvatierra G, et al. The emergence of SARS-CoV-2 variant lambda (C.37) in South America. Microbiol Spectr. 2021;9:e0078921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Sanchez-Mazas A. A review of HLA allele and SNP associations with highly prevalent infectious diseases in human populations. Swiss Med Wkly. 2020;150:w20214–w20214. [DOI] [PubMed] [Google Scholar]
  • 53.Nguyen A, David JK, Maden SK, et al. Human leukocyte antigen susceptibility map for severe acute respiratory syndrome coronavirus 2. J Virol. 2020;94:e00510–e00520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ben Shachar S, Barda N, Manor S, et al. MHC haplotyping of SARS-CoV-2 patients: HLA subtypes are not associated with the presence and severity of COVID-19 in the Israeli population. J Clin Immunol. 2021;41:1154–1161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Chen RE, Winkler ES, Case JB, et al. In vivo monoclonal antibody efficacy against SARS-CoV-2 variant strains. Nature. 2021;596:103–108. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Immunotherapy (Hagerstown, Md. : 1997) are provided here courtesy of Wolters Kluwer Health

RESOURCES