Abstract
Structures of peptide-loaded major histocompatibility complex class I (pMHC-I) and class II (pMHC-II) complexes are similar. However, whereas pMHC-II complexes include similar-sized IIα and IIβ chains, pMHC-I complexes include a heavy chain (HC) and a single domain molecule β2-microglobulin (β2-m). Recently, we elucidated several pMHC-I and pMHC-II structures of primitive vertebrate species. In the present study, a comprehensive comparison of pMHC-I and pMHC-II structures helps to understand pMHC structural evolution and supports the earlier proposed—though debated—direction of MHC evolution from class II-type to class I. Extant pMHC-II structures share major functional characteristics with a deduced MHC-II-type homodimer ancestor. Evolutionary establishment of pMHC-I presumably involved important new functions such as (i) increased peptide selectivity by binding the peptides in a closed groove (ii), structural amplification of peptide ligand sequence differences by binding in a non-relaxed fashion, and (iii) increased peptide selectivity by syngeneic heterotrimer complex formation between peptide, HC, and β2-m. These new functions were associated with structures that since their establishment in early pMHC-I have been very well conserved, including a shifted and reorganized P1 pocket (aka A pocket), and insertion of a β2-m hydrophobic knob into the peptide binding domain β-sheet floor. A comparison between divergent species indicates better sequence conservation of peptide binding domains among MHC-I than among MHC-II, agreeing with more demanding interactions within pMHC-I complexes. In lungfishes, genes encoding fusions of all MHC-IIα and MHC-IIβ extracellular domains were identified, and although these lungfish genes presumably derived from classical MHC-II, they provide an alternative mechanistic hypothesis for how evolution from class II-type to class I may have occurred.
Keywords: MHC, evolution, structure, class II, class I, b2-microglobulin, peptide
Introduction
Major histocompatibility complex (MHC) molecules present peptide fragments of intracellularly digested protein antigens to T cells (1, 2). MHC class I (MHC-I) is expressed by most nucleated cell types and important for presentation of peptides from intracellular antigens to CD8+ cytotoxic T cells, which can kill the presenting cell in case of infection or cancerous mutations. MHC-II is expressed by professional antigen presenting cells and important for presentation of peptide fragments from digested endocytosed/phagocytosed antigens to CD4+ T cells, which help to decide on how to proceed with a possible immune response. Bjorkman et al. (3, 4) and Brown et al. (5) were the first to determine the structures of the ectodomains of pMHC-I and pMHC-II, respectively, namely of human pHLA-A2 and pHLA-DR1. The similarity between MHC-I and MHC-II, already evident from the sequences (6), was even more impressively observed in the structures (5). The ectodomain structures formed by both MHC classes consist of two membrane-distal domains which each constitute a similar half of a pseudo-symmetric unit consisting of a curved β-sheet topped by two antiparallel α-helix structures which leave a groove in between, and two membrane-proximal domains of the immunoglobulin superfamily (IgSF) C1 set. In MHC-I structures, a heavy chain (HC) comprising the two membrane-distal domains (I-α1 and I-α2), a membrane-proximal IgSF domain (I-α3), and a connecting peptide/transmembrane/cytoplasmic tail (CP/TM/CY) region, binds a free single IgSF domain molecule, β2-microglobulin (β2-m). On the other hand, MHC-II structures consist of two similarly sized molecules, the IIα and IIβ chains, which each possess a membrane-distal domain (II-α1 or II-β1), a membrane-proximal IgSF domain (II-α2 or II-β2), and a CP/TM/CY region. MHC-I molecules, within the groove—formed by their membrane-distal domains—which is closed at both ends, typically bind peptides of 8-11 amino acids (aa), most commonly of 9 aa, whereas MHC-II molecules typically bind peptides of 12-25 aa that extend beyond the ends of their open groove (7–9).
The above described structures and functions concern “classical” MHC-I and MHC-II and they are—except a few cases that represent gene losses [e.g (10, 11)]—found throughout jawed vertebrate species. The wide distribution and their conservation of ancestral traits suggest that these classical types are the evolutionary oldest types among known extant MHC-I and MHC-II types (12–15). However, at various times during evolution, classical MHC-I and MHC-II genes were duplicated and modified for encoding “nonclassical” MHC molecules which exhibit diverged features, a diversity of functions, and a more restricted distribution among species (12–18). In the present article, if not specified, “MHC-I” and “MHC-II” refer to the classical type molecules, with MHC-I referring to both HC and β2-m, and MHC-II referring to both MHC-IIα and MHC-IIβ.
In jawless fish and invertebrates, MHC genes or MHC-homologous genes have not been found, and MHC features as well as the distinctions between MHC-I and MHC-II probably were established during the early evolution of jawed vertebrates (19). The first MHC molecule in evolution probably was a homodimer with class II-like features, as postulated by Kaufman et al. (20) based on sequence comparisons [see also (6)] and considerations of parsimony. The model was further explored and supported in later studies [e.g., (21–23)]. In the model, after gene duplication and differentiation, the homodimer evolved into a class II-type heterodimer molecule that was ancestral to both extant MHC-I and MHC-II [ Figure 1 , the model shown here is essentially as proposed by Kaufman et al. (20, 21)]. The generation of MHC-I genes HC and β2-m from MHC-II-type genes involved exon shuffling events ( Figure 1 ). The model implies that the I-α1+ β2-m and II-α1+II-α2 domain sequences form a phylogenetic lineage, in the present study called the “a” lineage, while the I-α2+ I-α3 and II-β1+II-β2 sequences form the “b” lineage ( Figure 1 ). The model is consistent with sequence-based computerized phylogenetic tree analyses (22, 24) and with structural analysis of pMHC-I and pMHC-II complexes (4, 5). The present article refers to this model of MHC class evolution as the “II-to-I” model. Some scientists do not subscribe to this model and proposed that MHC-I organization came before MHC-II organization (25, 26), but such I-to-II model cannot explain the close phylogenetic relatedness between the I-α1/II-α1 and I-α2/II-β1 sequences and between the β2-m/II-α2 and I-α3/II-β2 sequences.
Structures of pMHC-I have now been determined throughout the major clades of jawed vertebrates including cartilaginous fish (27), bony fish (28, 29), amphibians (30), the birds chicken [e.g., (31–33)] and duck (34), and a variety of mammals [e.g., (4, 35–38)], whereas pMHC-II structures only have been determined for chicken (39), mouse [e.g., (40)], and human [e.g., (5)]; most of the here listed studies in primitive vertebrates were performed by our group. This recent accumulation of MHC structural information across widely divergent species invites a deeper analysis of MHC evolution.
Here, we present a comprehensive analysis of MHC sequences and structures, and a deduction of major evolutionary developments within the course of pMHC structural evolution. Our study supports the II-to-I model and is the first to analyze pMHC structural evolution beyond the issue of covalent domain organization. It concludes that pMHC-II structures share important similarities with the deduced ancestral MHC homodimer, whereas pMHC-I structures acquired a significantly new peptide binding mode in which β2-m plays a pivotal role.
Materials and Methods
Alignment of MHC Sequences and Phylogenetic Tree Analysis
The sequence alignment between various MHC-I and MHC-II sequences was made with the intention to, as good as possible, align evolutionarily corresponding residues. MHC sequences and structures were retrieved from databases at the National Center for Biotechnology Information (NCBI; https://www.ncbi.nlm.nih.gov/). Representative MHC sequences were aligned by hand [see also (12–14, 41)] based on similarities between sequences, considerations of likely evolutionary events, and structural comparisons of various pMHC-I and pMHC-II ( Supplementary File 1B ). For most parts of the β-strands and helices the alignments are unambiguous, but in many of the loop regions, and in some β-strand and helical regions with insertions or deletions, the best possible alignment is uncertain. That superimposing of structures does not provide definite clues for all parts of the alignment can be seen in Supplementary File 1B . Secondary structures as indicated in the sequence alignment figures were determined by DSSP software [https://swift.cmbi.umcn.nl/gv/dssp/ (42)]. Leader sequences were predicted using SignalP software [http://www.cbs.dtu.dk/services/SignalP/ (43)]. For alignment of lungfish MHC-IIabSol sequences only, we used CLUSTAL 2.1 software (https://www.genome.jp/tools-bin/clustalw). For phylogenetic tree analysis of lungfish MHC-IIabSol and classical MHC-II sequences the neighbor joining method was applied using MEGA7 software (44).
Calculations and Generation of Illustrations
Peptide-contacting residues were identified using the program CONTACT and were defined as residues containing an atom within 4.0 Å of the target partner (45). Structural illustrations and the electron density-related figures were generated using the PyMOL molecular graphics system (http://www.pymol.org/), and the same software system was used for making structural superposition figures by using the program “super” and to calculate dihedral angles of the pep56 backbone using the program “Measurement - Dihedrals”. PDBePISA software [http://www.ebi.ac.uk/msd-srv/prot_int/cgi-bin/piserver (46)] was used for interdomain contact analysis and for measuring the exposed surface areas (=Accessible Surface Area minus Buried Surface Area) of the peptide ligands. Unless mentioned otherwise, the PDB accessions of the pMHC structures shown as representative structures are: Shark UAA, nurse shark (Ginglymostoma cirratum) Gici-UAA*01, 6LUP; Carp UAA, grass carp (Ctenopharyngodon idella) UAA, 5Y91; Frog UAA, African clawed frog (Xenopus laevis) UAAg, 6A2B; Chicken (Gallus gallus) BF2*0401, 4E0R; HLA-A2, 3PWN; Chicken (Gallus gallus) BL2*01901, 6KVM; Mouse (Mus musculus) H2-Ag7, 1F3J; HLA-DR1, 1AQD. For comparisons of a large number of pMHC-Is, structures with the following PDB accession numbers were analyzed: HLA-A*0101 (4NQV), HLA-A*0201 (3PWN), HLA-A*0203 (3OX8), HLA-A*0206 (3OXR), HLA-A* A0207 (3OXS), HLA-A*0301 (3RL1), HLA-A*1101 (1X7Q), HLA-A*2402 (2BCK), HLA-A*6801 (4HX1), HLA-A*6802 (4HWZ), HLA-B*0701 (3VCL), HLA-B*0801 (1M05), HLA-B*1402 (3BVN), HLA-B*1501 (1XR8), HLA-B*1801 (4JQV), HLA-B*2705 (1HSA), HLA-B*2709 (1UXW), HLA-B*3501 (1A9E), HLA-B*3505 (4JRX), HLA-B*3508 (3VFR), HLA-B*3901 (4O2C), HLA-B*4103 (3LN4), HLA-B*4402 (1M6O), HLA-B*4403 (1N2R), HLA-B*4405 (1SYV), HLA-B*4601 (4LCY), HLA-B*5101 (1E27), HLA-B*5201 (3W39), HLA-B*5301 (1A1O), HLA-B*5701 (2RFX), HLA-B*5703 (2BVO), HLA-C*0801 (4NT6), HLA-C*CW3 (1EFX), HLA-C* CW4 (1IM9), H-2*Db (1WBX), H-2*Dd (3E6H), H-2* Kb (3TID), H-2*Kd (1VGK), H-2*Kk (1ZT1), H-2*Kw_M7 (3FOL), H-2*Ld (1LD9), RT1-Aa (1ED3), RT1-Ac (1KJV), Mamu-A1 (1ZVS), Mamu-B17 (3RWC), SLA-1*0401 (3QQ3), BoLA-N*1801 (3PWU), BF2*0401 (4E0R), BF2*2101 (3BEV), Xela-UAA (6A2B), Ctid-UAA (5Y91), Gici-UAA (6LUP). For comparisons of a large number of pMHC-IIs, those were: HLA-DR1 (1AQD), HLA-DR2 (1YMM), HLA-DR3 (1A6A), HLA-DR4 (1D5M), HLA-DR5 (2Q6W), HLA-DR14 (6ATZ), HLA-DR15 (5V4M), HLA-DQ2.3 (4D8P), HLA-DQ8 (1JK8), HLA-DQ0602 (1UVQ), HLA-DM (1HDM), HLA-DM-DE (4FQX), HLA-DM-DO (4I0P), HLA-DP2 (3LQZ), HLA-DP5 (3WEX), HLA-DP-TCR (4P4K), HLA-DQ1-TCR (3PL6), HLA-DQ2-TCR (4OZH), HLA-DQ2.5-TCR (4OZF), I-Ab (1LNU), I-Ad (1IAO), I-Ak (1IAK), I-Ag (1ES0), I-Au (1K2d), I-EK (1FNG), Chicken-B-LA (6KVM).
The PDB accession numbers and residue positions of the non-MHC C1 set IgSF domains compared in Figure 10B were as follows: Ig-L, 1A4J A/113-217; Ig-H-C1, 1MCP L/119-220; IgNAR-C1, 4Q97 A/139-241; IgNAR-C2, 4Q9B A/243-343; TCR-beta, 1BD2 E/124-247; HFE, 1A6Z C/181-275; MIC, 1B3J A/181-274; M144-A3, 1PQZ A/142-242; ZAG-A3, 1T7V A/186-277; FcRn, 1EXU A/180-267; SIRPa-C1, 2WNG A/116-222.
Amplification and Sequencing of MHC-IIabSol for Slender Lungfish (Protopterus dolloi)
A slender lungfish (Protopterus dolloi) was obtained from Meitosuien Co. Ltd. (Nagakute, Aichi, Japan). The animal was handled according to the Guidelines for the Management of Laboratory Animals in Fujita Health University. Total RNA was isolated from kidney by use of “TRIzol” (Gibco) following the instructions of the manufacturer, except that the protocol was repeated for additional removal of DNA and proteins. SuperScript III reverse transcriptase kit (Invitrogen) with random hexamer primers was used for the construction of cDNA. PCR was conducted using ExTaq DNA polymerase (Takara) with 40 PCR cycles and primers MHC-IIabSol-F, 5’-AGAACCGTTTGGCACTGGGATC, and MHC-IIabSol-R, 5’-TTCTGAAGCACATCAGTAATACTGCCTG. The amplified fragment was inserted into plasmid vector, followed by transformation to E.coli, and sequence analysis of multiple clones (to minimize the chance of PCR or sequencing artefacts) by using BigDye Terminator v3.1 Sequencing Standard kit (Applied Biosystems) and 3100Avant/3130xl Genetic Analyzer (Applied Biosystems Life Technologies, Foster City, CA, USA). The purified MHC-IIabSol sequence was confirmed by its identity with SRA reports, and was deposited to GenBank as accession MT909553.
Results
Overall pMHC-I and pMHC-II Structures, and Introduction of a Nomenclature for Similar Domains
A comparison of five and three representative pMHC-I and MHC-II structures, respectively, of different species, reveals an overall similarity in organization ( Figure 2A ). Their most readily observed distinction is the confinement of the peptide within a closed groove in the case of pMHC-I (4, 5). For the phylogeny of the species whose pMHC structures are shown in Figure 2A , see Figure 2B . As reported previously [e.g., (19)], the absence of MHC genes in jawless vertebrates and invertebrates, combined with the presence of classical MHC-I and classical MHC-II in all extant classes of jawed vertebrates, concludes a relatively rapid emergence and differentiation of MHC molecules early in jawed vertebrate evolution ( Figure 2B ).
To allow convenient comparisons between similar type domains of pMHC-I and pMHC-II structures, we introduced a nomenclature denoting I-α1 and II-α1 as “pa” domains (MHC peptide binding domain a), I-α2 and II-β1 as “pb” domains, β2-m and II-α2 as “ia” domains (MHC IgSF domain a), and I-α3 and II-β2 as “ib” domains, respectively ( Figures 1 , 2A ). The combination of a pa and a pb domain, commonly referred to as the peptide binding domain or PBD—which would be a confusing term in the present study—is denoted as “pab” domain. Residue numbering in the present study follows the sequence alignments in Figure 3 and Supplementary File 1A , and is based on the residue numbering by Saper et al. (49) for HLA-A2 α1 domain and human β2-m.
Sequence Comparison of MHC-I and MHC-II Molecules
Sequence similarities are readily found between MHC-I and MHC-II, and also between the pa and pb domains, and between the ia and ib domains ( Figure 3 and Supplementary File 1A ). Some sequence features are well conserved in both MHC-I and MHC-II, while others are characteristic for either class. A relatively large set of residues are characteristic for MHC-I.
The sequences of the representative pMHC-I and pMHC-II structures compared in Figure 2A are aligned per domain-type in Figure 3 with indications of their secondary structures. In Supplementary File 1A , additional representative MHC sequences throughout jawed vertebrates are compared. Most parts of the sequence alignments readily agree with structural superimpositions ( Supplementary File 1B ), but in some highly variable—mostly loop—regions, the best possible alignments are not certain.
The Figure 3 and Supplementary File 1A sequence alignments show that, arguably, the most dramatic consistent difference between the MHC-I and MHC-II sequences maps to the N-terminal part of the pa-domain helical region, where II-α1 domains exhibit deletions of a few residues compared to the MHC p-domain consensus situation (5). Moreover, compared to I-α2 and II-β1, the I-α1 sequences have insertions of one or few residues around position p55 plus, in most cases, deletions of a few residues around position p62b, which are coincident with the presence and absence of helix interruptions (kinks), respectively. Among the l-α3 domain sequences, length variation is observed in the S1S2-loop regions, and, in teleost fish, in the C-terminal regions [ Supplementary File 1A (b) and see also (51)].
The color shadings of residues in Figure 3 and Supplementary File 1A are based on, partially subjective, comparisons of sequences, and help to estimate when in evolution certain residues or “residue-types” were established or fixated. “Residue-types,” here, refers to the sets of residues indicated above the alignment, which were chosen based on observed MHC sequence conservation patterns [see also (12–14)], and on similarities between amino acids. In Figure 3 , residues or residue-types that, by deduction, were probably present at the respective position in the assumed ancestral homodimer, are shaded black. Residues or residue-types that probably were present in early members of, and are characteristic for, the a lineage (I-α1+β2-m/IIα) and the b lineage (I-α2+I-α3/IIβ), are shaded dark blue and dark purple, respectively. Residues or residue-types that are characteristic for the I-α1+β2-m, IIα, I-α2+I-α3, and IIβ lineages are shaded light blue, blue, pink, and purple, respectively.
In Supplementary File 1A (b), also some IgSF C1 set sequences of non-MHC molecules are compared, which shows that the IgSF residues shaded black in Figure 3 are common among IgSF C1 sequences. However, the tryptophan at position 95 is quite characteristic for MHC, and is shaded red.
Importantly, Figure 3A and Supplementary File 1A (a) show that the MHC-I pa and pb domain sequences show many more specifically conserved residues than found in MHC-II. This is reflected in the overall similarity levels per domain if comparing across wide species borders, as summarized for a set of representative sequences in Supplementary File 1C . For example, nurse shark UAA*01 and β2-m compared with human HLA-A2 and β2-m show the following percentages of aa identities per domain (alignment as in Figure 3 ): pa, 45%; pb, 38%; ia, 40%; ib, 27%, whereas between nurse shark MHC-II (the IIα and IIβ sequences shown in Supplementary File 1A ) and HLA-DR1 the identity percentages are: pa, 31%; pb, 21%; ia, 44%; ib, 44% ( Supplementary File 1C ).
As for variation among MHC-I sequences, the better conservation across wide species borders of I-α1, I-α2, and β2-m compared to I-α3 sequences ( Supplementary File 1C ) was already noticed by Kaufman et al. (52). Furthermore, the lower levels of similarity among I-α3 domain sequences compared to among β2-m, II-α2 and II-β2 domain sequences ( Supplementary File 1C ) was also reflected in published phylogenetic tree analysis results [e.g., (53)]. The relatively poor conservation of I-α3 sequences probably is related to this domain participating in only few interdomain interactions and an apparent absence of stringent evolutionary pressure to conserve MHC-IgSF-typical intradomain features (see below).
Comparison Between pMHC-I and pMHC-II Structures
Similarity in Overall Domain Organization but With Differences in ib Domain Orientations
Structures of pMHC-I and pMHC-II are very similar ( Figure 2A ) (5). Apart from the peptide lengths, the biggest differences in their overall structures are the orientations of the ib domains (I-α3 versus II-β2; Figure 4 ) (5). The orientations of ib domains do not only differ between pMHC-I and pMHC-II, but also among pMHC-Is (28, 31) and among pMHC-IIs (54) ( Figure 4B ). Compared to α3 domains in mammalian pMHC-Is, the orientations of α3 domains in pMHC-Is of primitive jawed vertebrates are more similar to the orientations of β2 domains in pMHC-IIs [ Figures 4A (c, d) and 4B ] and they overlap with the outer range of reported II-β2 orientations ( Figure 4B ), suggesting that they represent the more primitive orientation of I-α3 domains.
The pb Domain Helices Are Similarly Organized in pMHC-I and pMHC-II, and May Represent the Ancestral Helix Organization
To follow the curve of the β-sheet, both in pMHC-I and pMHC-II, and both in the pa and the pb domains, the α-helical regions are divided into parts separated by “kinks” (4, 5). The parts separated by kinks are commonly named H1 and H2 in the case of the pa domains, and H1, H2a, H2b, and H3 in the case of the pb domains ( Figures 3A , 5A, B ) (49, 50). The pb domain H1 helices are longer than those in the pa domains ( Figure 3A ), and run parallel to, and above, the pb domain β-strands S3 and S4 ( Figure 5A ). After a big kink (“elbow”), this is followed by a long pb H2 helix that because of a small kink can be divided into H2a and H2b parts [ Figures 5A, B (a)]. In pMHC-I pb, the H2b helix continues longer than in pMHC-II pb, and at the MHC-I-specific residue pbG85 the strand bends in a rather straight angle to continue as a small helix named H3 [ Figure 5B (a)]. In pMHC-II pb, in the absence of pbG85 and having a few residues less to span the same distance as in pMHC-I, the H2b helix structure stops earlier and continues in a more tightly wound H3 helix without α-helix characteristics [ Figure 5B (a)] (5, 50); consequently, in pMHC-II there is no sharp H2-H3 angle [ Figure 5B (a)]. Compared to pMHC-II, the pbC11-pbC74 cysteine bridge is pushed slightly upwards in pMHC-I, causing a slight rotational change in the pb H2b helix that results in a more upward orientation of pbW77 in pMHC-I compared to the pbN77 orientation in pMHC-II that presumably represents the ancestral situation [ Figure 5B (a), -(c), and -(d); see also below]. The higher position of pbC11-pbC74 in pMHC-I is accompanied by the evolutionary acquisition/fixation of pbG10 and a disruption of the regular β-sheet folding in the pb9-pb11 stretch, whereas regular folding is found at corresponding positions in pMHC-II pb and in the pMHC-I and pMHC-II pa domains [ Figure 5B (c) and Supplementary File 1B (a)]. The pbG10-dependent β-sheet irregularity resulting in different positions/orientations of the pbC11-pbC74 pair and the pb H2b helix region had been noted before in a classical MHC-I versus MIC-A (nonclassical MHC-I) comparison context (55), but, as far as we know, not in a I-versus-II context.
We assume that, in evolution, the ancestral MHC homodimer had a H1-H2a-H2b helix organization similar to that in MHC-II β1 domain, because: (i) a similar organization is found in MHC-I α2 domain [ Figures 5A, B (a)]; (ii) the conservative nature of the MHC-I and MHC-II pb helices is suggested by their conservation of the ancestral pC11-pC74 cysteine bridge [ Figure 5B (c) and Supplementary File 1A (a)]; and, (iii) MHC-II β1 helices can bind the peptide ligand orientated in either N-to-C or C-to-N direction using their ancestral pW60 and pN77 residues (see below) (56).
The pa Domain Helices of pMHC-I and pMHC-II Differ From Each Other and From the Presumed Ancestral Situation
If, as we assume, the helix organization in the ancestral MHC homodimer was similar to that in extant pMHC-II pb indeed, the shared evolution of MHC-I and MHC-II pa domains probably involved the reorientation of the H1 helix from being parallel as in the pb domains to being rather perpendicular to the β-sheet as in the pa domains ( Figure 5A ). However, the MHC-I and MHC-II pa domain helix organizations also differ from each other. In I-α1 domains, compared to MHC pb domain helix organization, seemingly, insertions of residues around position p55 and deletions around position p62b deleted the H2a-H2b kink and repositioned the H1-H2 kink, and herewith shortened the H1 helix and lengthened the H2 helix [ Figures 3A , 5B (d), Supplementary File 1A (a)]; this reorganization resulted in partial rotation of the helical stretch including residues p59 and p60, so that in pMHC-I paY59 now points to the bottom of the groove (see below) and pMHC-I paW60 no longer is part of the groove but only connects the helix to the β-sheet [ Figures 5B (b) and -(d), and see below]. Compared to I-α1, in II-α1 domains the H1-H2 elbow region was shortened by deletion of a few residues and the helices partially melted (5), while ancestral paW60 was lost [ Figures 3A , 5B (b) and -(d), Supplementary File 1A (a)]; given the diverged sequences of this MHC-II pa stretch, it can’t be determined with certainty at which precise positions the residues were deleted. Sequence comparisons suggest that the helix organization of II-α1 found in mammals and chicken is common throughout MHC-II in jawed vertebrates, although the ancestral paC11-paC74 cysteine pair that was independently lost in tetrapods and Elasmonbranchii (sharks/rays) has been retained in other clades of jawed vertebrates [ Supplementary File 1A (a)]. The losses in several clades of animals indicate that the paC11-paC74 cysteine pair is not very important for MHC-II, and the pair has also been lost throughout MHC-I [ Supplementary File 1A (a)]. In both pMHC-I and pMHC-II, the region including pa11 interacts with the below ia domain (β2-m or II-α2, respectively), and so may provide a regional stabilization causing redundancy of the stabilization by paC11-paC74; in contrast, in both pMHC-I and pMHC-II, the well-conserved pbC11-pbC74 cysteine bridge stabilizes a part of the pab domain that is free of IgSF interaction ( Supplementary File 3C ).
The lifting of the end of the C-terminal part of the pa domain helical region in pMHC-I compared to pMHC-II [already noted by Brown et al. (5)] may be explained by the pMHC-I pa domain strand bending backward for continuation as a pb domain strand, concurrently inserting the MHC-I-specific paQ87 sidechain into the pab domain bottom [ Figure 5B (e)].
The pab Domain β-Sheets Show Conserved β-Bulges in Both pa and pb Domains of Both pMHC-I and pMHC-II, and Unique Differences in the Aligning of β-Strands Between pMHC-I and pMHC-II
An unusual feature shared among all investigated pa and pb domains of both pMHC-I and pMHC-II is a β-bulge at positions p33 and p34 of β-strand S3 with both sidechains pointing upwards ( Figure 5C and Supplementary File 3D ) (5, 49). In the ancestral MHC homodimer molecule, these p33 and p34 residues probably were hydrophobic [ Supplementary File 1A (a)], and the best residue conservation is found among MHC-I α2 domains which tend to possess pb(F/Y)33 and pb(I/L)34 [ Figures 3A , 5C (a), and Supplementary File 1A (a)]. Together with the pb-lineage-specific aromatic residue pb(F/Y/W)45, of which the sidechain “leans over” from β-strand S4 to β-strand S3, in I-α2 and II-β1 domains the residues pb33+pb34+pb45 form an, in many instances hydrophobic, ridge on which the pb domain H1 helix rests ( Figure 5C ). The better conservation of the pb33 and pb34 residues among I-α2 than among II-β1 sequences [ Supplementary File 1A (a)] suggests that the ridge function is under stricter requirements in pMHC-I than in pMHC-II. In the pa domains of pMHC-I and pMHC-II the β-bulge does not have the same function as in the pb domains because the H1 helices have a different orientation ( Figures 5A, C ).
Figure 5C and Supplementary File 3D also show remarkably conserved features that, to our knowledge, have not been noted before, and which we named “pb S2-S3 cleft” in pMHC-I and “pa S2-S3 cleft” in pMHC-II. These features concern that in pMHC-I pab β-sheets the α2 domain strands S2 and S3 only show main chain pairing until residues pb24 and pb36, respectively, and that beyond this contact the S3+S4 strands bend away from the rest of the β-sheet; in investigated pMHC-II structures a similar situation is found in the MHC-II α1 domains, with pa24 and pa36 as border. Meanwhile, in pMHC-I α1 and pMHC-II β1 domains the S2-to-S3 main chain pairings extend beyond the p24-p36 interaction ( Figure 5C and Supplementary File 3D ). The S2-S3 clefts probably increase the regional flexibility, and their positions associate with structures that undergo pronounced positional changes upon peptide binding/editing, including the pMHC-I α2 domain H1 helix (57–60) and the border region of helices H1 and H2 in pMHC-II α1 domain (61, 62).
Loops Connecting β-Strands S2 and S3 Show Conserved Orientations in Both pa and pb Domains of Both pMHC-I and pMHC-II, Whereas the Other pab β-Sheet Loops Can Display Considerable Variation
The S2S3-loop orientations are very well conserved in both pa and pb domains of both pMHC-I and pMHC-II [ Figure 5D ; see also the superimpositions of many pMHC-Is and pMHC-IIs in Figures 5C (b) and –(d), respectively]. In the MHC homodimer ancestor, though not conserved in most MHC-II pb (II-β1) domain sequences [ Supplementary File 1A (a)], the S2S3-loop probably bound to the S1 strand of the same domain by a hydrogen bond between pH3 and p(D/E/N)29 as seen in extant pMHC-I pa (I-α1), pMHC-I pb (I-α2), and pMHC-II pa (II-α1) [ Figure 5D (c)] (49). In case of the pb domains, the S2S3-loop tends to possess an aromatic residue at position pb28 which also participates in an interaction with residue pb3 [ Figure 5D (c)]. Among MHC-II α1 domain sequences a variety of hydrophobic residues is found at position pa28 [ Supplementary File 1A (a)], which in higher tetrapods including mammals and birds tends to be phenylalanine which can interact with paH3 and paD28 reminiscent of the pMHC-I pb domain situation [ Figure 5D (c); for chicken pMHC-II see PDB 6KVM]. Meanwhile, most MHC-I α1 domain sequences possess paV28 which may affect the orientation of the important A pocket residue pbY81 with which it interacts (see below).
In contrast to the S2S3-loops, considerable variations are observed in pa and pb domain S1S2- and S3S4-loop orientations among pMHC-Is [for more details see (27)] and in pa and pb domain S1S2-loop orientations among pMHC-IIs [ Figures 5C (b) and –(d), and 5D]. The pa and pb domain S3S4 loop orientations of pMHC-II and pMHC-I in primitive animals are similar and probably reflect the orientation within the ancestral MHC homodimer; compared to this, the S3S4 loops of mammalian pMHC-I pa (I-α1) domains are two residues longer, and the S3S4 loops of mammalian pMHC-I pa (I-α2) domains have flipped upwards [ Figures 3A , 5C (b) and –(d), and 5D ] [for more details see (27)].
In pMHC-I the Peptide Ligand P1 and P9 Positions Are Closer Together Than in pMHC-II, and the P1 Sidechain Points Upwards Instead of Downwards.
In contrast to pMHC-Is, in pMHC-IIs the groove ends are open, and the peptide lays in the groove in an extended, polyproline-like manner (5, 63). Although in pMHC-IIs the peptides typically are >9 aa and extend beyond the groove ends, the core interaction within the pMHC-II groove concerns a 9 aa peptide fragment as in most pMHC-Is, and the start and end residues of such fragment are named P1 and P9 (the residues N-terminal of the P1 residue are named P-1, P-2, etc.). Whereas in pMHC-I the P1 sidechain points upwards and the PΩ sidechain points downwards, in pMHC-II both the P1 and P9 sidechains point downwards ( Figure 6A ) in a semi-symmetric structure. The distances between the P1 and P9 pockets (aka A and F pockets in pMHC-I) in the groove of pMHC-II are larger than in pMHC-I ( Figure 6A ), and the linear distances between the Cα atoms of pMHC-I peptide residues P1 and PΩ (P9 in 9 aa peptides) are similar to those between pMHC-II peptide residues P2 and P9 (see Figure 6A legend). The different organization explains why peptide ligands of 9 aa length bulge in pMHC-Is, whereas in pMHC-IIs the peptide ligand main chains are in a regular extended configuration ( Figure 6A ). Naturally, as has been shown in multiple studies but is not shown here, in pMHC-I structures with peptides >9 aa, the bulging is more extensive (e.g., 64, 65). At a different note, the closing of the pMHC-I P1 and P9 pockets for peptide extensions beyond the groove is not absolute, but for such discussion we refer to other articles (66–69).
Peptide Binding in pMHC-II Is Rather Symmetric, but in pMHC-I the P1 Pocket Very Much Changed in Comparison to pMHC-II and the Presumed Homodimer Ancestor
The peptide binding mode in pMHC-II is characterized by pa-pb semi-symmetry and presumably similar to a symmetric peptide binding mode of the ancestral MHC homodimer. In pMHC-IIs, three ancient residues that make hydrogen bonds with the peptide ligand main chain are paN77, pbW60, and pbN77 (5, 12, 39, 54), and they can bind peptides in either N-to-C or C-to-N direction ( Figure 6B ) (55). The ancestral MHC homodimer presumably possessed a symmetry of pW60 and p(D/N)77 in each p domain, as specific loss of pW60 in the MHC-II pa lineage is suggested by the presence of pW60 in MHC-II pb as well as in MHC-I pa and pb [ Figure 3A and Supplementary File 1A (a)]. In pMHC-Is, ancestral residues (or residue characteristics) for forming hydrogen bonds with the peptide ligand were conserved in the form of pa(D/N/S)77 and pbW60 (better known as “W147”), although their hydrogen bonding with the peptide ligand is not consistently conserved among pMHC-Is and may depend on the HC allele and the bulging of the particular peptide ligand (e.g., Figure 6C ) (27, 30, 70). In pMHC-Is, the paW60 residue, because of reorganization of the respective pa domain helical region, can no longer participate in peptide binding [ Figures 5B (b) and –(d), 6F, G ]. Furthermore, in pMHC-Is, the ancestral pb(D/N)77 was replaced by the large hydrophobic pbW77, which because of a rotation in the respective pb domain helical region has its sidechain pointing more upwards and blocking the groove at the P1 pocket side [ Figures 5B (a), -(c), and –(d), 6F, G ].
Besides the ancestral paN77, pbW60, and pbN77 residues, the MHC-II lineage rather consistently possesses the two residues paN70 and pb(H/N)76 for (presumably) making hydrogen bonds with the peptide ligand main chain ( Figure 6B ) (5, 12, 39, 54); residue pbN76 is only common in MHC-II of primitive vertebrates [ Supplementary File 1A (a)], and its possible hydrogen bonding with the peptide ligand remains to be determined. In pMHC-I, residues at positions pa70 and pb76 in pMHC-I do not possess homogeneous characteristics [ Figure 3A and Supplementary File 1A (a)] and do not usually bind the peptide ligand main chain ( Figure 6C ). Therefore, and also because they are not part of an ancient pa-pb symmetry, we assume that paN70 and pb(H/N)76 were specifically acquired/fixated in the MHC-II lineage. Conserved orientations of the here listed MHC-II residues are shown in the pMHC-II superposition figure in Figure 6D (note that mouse H2-Ag7 possesses pbY60 instead of the typical pbW60).
The P9 pockets (aka F pocket in case of pMHC-I) are quite similar between pMHC-I and pMHC-II, but in pMHC-I a higher number of residues for peptide ligand main chain binding were acquired/fixated including pa(R/Y)84, pbT56 (aka “T143”), and pbK59 (“T146”) [ Figures 3A , 6C and Supplementary File 1A (a)] (27, 71). Explanations for why in pMHC-Is the groove is closed, while in pMHC-IIs the groove is open, are, in the case of pMHC-I, the higher and lower positions of the sidechains of the pa84 and pb59 residues, respectively ( Figures 6F, G ) (5). A mechanical explanation for the lifting of the C-terminal part of the pa domain helical region in pMHC-I compared to pMHC-II, was discussed above [ Figure 5B (e)]. Even among pMHC-I P9 pockets there is some variation in the contribution of the conserved peptide binding residues (examples in Figures 6C, E ) (27, 70, 72), and pMHC-I P9 pockets may need some flexibility for engulfing various amino acid sidechains.
In contrast to the P9 pockets, the P1 pockets (aka A pocket in case of pMHC-I) are very different between pMHC-I and pMHC-II. A critical difference caused by the change and reorientation of the ancestral pbN77 into pMHC-I pbW77 was explained above, and this change forced the P1 residue of the peptide ligand to bind more towards the center of the groove ( Figures 6A, F, G ). This was accompanied by another critical change, namely the binding of the P1 main chain at the bottom of the P1 pocket instead of the P1 sidechain as seen in pMHC-II. For this purpose, in MHC-I sequences a number of new residues were acquired/fixated, including paY7, paY59, pa(D/E/Q)63, pbY70 (“Y159”), and pbY81 (“Y171”) for participating in a hydrogen-bond network with the main chain of peptide ligand residue P1 [ Figures 3A , 6C and Supplementary File 1A (a)]; the tyrosines among these residues show near-absolute conservation in orientation ( Figure 6E ), which is possible because the pMHC-I P1 pocket needs little flexibility since the P1 sidechain points out of the groove ( Figure 6A ) (27, 71). For the evolutionary creation of the P1 pocket in the MHC-I lineage it was also important that residue paG26 was acquired/fixated [ Figure 3A and Supplementary File 1A (a)], because the absence of a sidechain at this position allows paY7 to reach the P1 pocket [ Figure 5C (a-i)]. It is difficult to speculate which of the unique pMHC-I P1 pocket residues and features was acquired first in evolution. In some MHC-I, the pbW77 was lost with the consequences not well understood, such as in the elucidated frog UAA structure (30) which possesses pbG77 ( Figure 6E ).
Although the Overall Orientation of the ia Domain Is Similar Between pMHC-I and pMHC-II, only in pMHC-I the Domain Acquired a Hydrophobic Knob That Inserts Into the pab Domain β-Sheet
Supplementary Files 2A, B list all residues that according to PDBePISA software analysis are part of the ia to pab interface of several representative pMHC-I and pMHC-II structures; relevant pHLA-A2 and pHLA-DR1 residues are highlighted in the structural figures in Figures 7A (a), 7B (a), with their names colored according to conservation patterns as shown in Figure 3 and Supplementary File 1A . Among the, probably ancestral, interface residues shared between pMHC-I and pMHC-II are: pa(D/E/Q)32, paR48, pbQ6, pb(Q/R)25, iaP32, iaP33, and ia(F/Y)62 [also see Supplementary File 3A (b)]; among these residues, pa(D/E/Q)32 is better conserved in MHC-II whereas pbQ6, pb(Q/R)25 and iaP33 are better conserved in MHC-I ( Supplementary File 1A ). In contrast to pMHC-I, pMHC-II only has a few specific residues at the ia-to-pab interface [shaded non-dark blue in Figure 7B (a)], namely iaF54, pa(E/Q)23, pa(D/E/N)31, and paE32, the last two of which participate in the connection between the pab domain and the IgSF domains at the P1 side of the pab domain (see below). Compared to pMHC-II and, presumably, to the last common ancestor of MHC-I and MHC-II, the biggest change in the pMHC-I interdomain contacts was the acquisition of the iaF56+iaW60 hydrophobic knob and its insertion into the pab pa9 pleat [ Figure 7A and Supplementary File 3E ]; in pMHC-II structures, at the corresponding position, the pa9 pleat is closed by sidechains from the pleat ridge residues [ Figure 7B and Supplementary File 3E ]. The establishment of the iaF56+iaW60 hydrophobic knob in MHC-I was accompanied by acquisition/fixation of a large set of interacting residues that have been well conserved throughout extant MHC-I from shark to human [ Figure 3 and Supplementary Files 1A , 2A , and 3A (a)] [for more details see (27)].
The ib to pab Interdomain Interface in pMHC-II Acquired a Large Hydrophobic Tryptophan Residue
Whereas in pMHC-I an iW60 residue was acquired in the ia (β2-m) domain [ Figure 3B and Supplementary File 1A (b)], making a large interaction with the pab domain that involves a set of additionally selected residues [ Figure 7A (a)], in pMHC-II a reminiscent situation is observed for a specifically acquired iW60 residue in the ib (II-β2) domain ( Figure 8 ) (73, 74). Figure 8A shows how in pHLA-DR1 the ibW60 residue inserts into the pab domain and is surrounded by multiple conserved residues including ones specifically selected in MHC-II, namely pa(D/E/N)31, paE32, pa(I/L)49, and paF52. Whereas the iaW60 residue in pMHC-I structures binds closer to the P9 side of the pab domain [ Figure 7A (b)], the ibW60 residue in pMHC-II structures binds to the floor at the P1 end of the pab domain ( Figure 8B ) (73, 74). Figure 8C shows the dramatic difference in ib-pab interfaces caused by having a large ibW60 in pMHC-II β2 domains rather than a small ibG60 common among pMHC-I α3 domains. Considering the frequent finding of iG60 in non-MHC IgSF sequences [ Supplementary File 1A (b)], we speculate that glycine represents the ancestral MHC residue at this position.
At the end of the MHC-I pb domain, residue pbL90, which is specifically conserved in MHC-I but probably also ancestral to MHC-II, forms a conserved structure with MHC-I-specific residues paV28, paW51, and pbY81 ( Figure 8D ). The nearby pbR90b residue also appears to be ancestral to both MHC-I and MHC-II [ Supplementary File 1A (a)] and tends to strengthen the contact between the end of the pb domain and the ib domain by making one or more polar contacts with the main chain of ibG60 (in case of pMHC-I) or ibW60 (in case of pMHC-II) in most of the investigated structures ( Figure 8D and data not shown). Shark UAA is an exception, as a residue was lost in this region [ Supplementary File 1A (a)], and the sidechain of shark pUAA pbR90b points in a different direction ( Figure 8D ).
The ia to ib Interdomain Interface Is Different Between pMHC-I and pMHC-II
Supplementary Files 2C, D list all residues that according to PDBePISA software analysis are part of the ia to ib domain interface of representative pMHC-I and pMHC-II structures. In both pMHC-Is and pMHC-IIs, the interaction between the ia and ib domains basically involves their S1-S2-S5-S4 (ABED) sheets, with the ia domain participating more with its S1 and S2 strands, and the ib domain participating more with its S4 and S5 strands [ Supplementary File 3B (a)] (49). The iaY10-ibP56 bond, which is highly conserved in pMHC-I, appears to be absent in pMHC-II. Namely, (i) in MHC-II sequences ibP56 is very uncommon and iaY10 is only partially conserved and seems, judging from its conservation pattern [ Supplementary File 1A (b)], functionally equivalent to iaF10 (bearing a sidechain without polar groups), and (ii) in elucidated pMHC-II structures, in contrast to pMHC-I, the ib56 residue is not close to the ia10 residue ( Figure 9 ). On the other hand, in both chicken and mammalian pMHC-IIs, a tyrosine at position ia67 makes a hydrogen bond with the ibN57 main chain [ Supplementary File 3B (b)] and, given the presence of these residues in MHC-II of cartilaginous fish—though iaY67 is poorly conserved in ray-finned fish [ Supplementary File 1A (b)]—, iaY67-ibN57 may represent an ancestral pMHC-II interaction.
The pMHC IgSF Domains Inherited Some Unique Structural Features From the Presumed Homodimer Ancestor That Are Best Conserved in pMHC-II; For Unknown Reasons, Some Consensus IgSF Features Have Not Been Well Conserved in pMHC-I
The β-strands of C1 set IgSF domains of various MHC and non-MHC molecules superimpose well (75). However, considerable variation can be found in the relative orientations of their S1S2-loops and interacting S5S6-loops ( Figure 10A ), which in pMHCs are positioned distal to (away from) the pab domain. Although among the different pMHC IgSF domains the S5S6-loop orientations tend to be similar, the S1S2-loop orientation can differ a lot ( Figure 10A ). The pMHC ia domains (β2-m and II-α2) possess the MHC IgSF consensus orientation of the S1S2-loops ( Figure 10A ), which we assume to have been inherited from the MHC homodimer ancestor. This S1S2-loop consensus orientation is only found in approximately half of the investigated pMHC-II ib (II-β2) domain structures, while among the pMHC-I ib (I-α3) domains there is little consensus ( Figure 10A ); the variation among I-α3 S1S2-loops is also reflected in their variable lengths [ Figure 3B and Supplementary File 1A (b)]. When pMHC structures representative of the MHC IgSF S1S2-loop plus S5S-loop consensus organization are compared with several non-MHC C1 set IgSF domains, it is seen that this organization is uncommon ( Figure 10B ).
Figure 11 shows important structures in the internal organization of the pMHC IgSF domains. Interestingly, compared to pMHC-II and the deduced ancestral situation, in pMHC-I evolution there have been complete or partial losses of features that probably increase the rigidity of the IgSF domains. Namely: (i) in pMHC-II α2 and β2 domains, as common among IgSF domains (76), the conserved iW39 and iL68 residues interact and form part of a topohydrophobic inner core, whereas iW39 (77) and iL68 were lost in β2-m, iL68 was lost in I-α3, and iW39 has not been not stringently conserved in I-α3 [ Figure 11A and Supplementary File 1A (b)]; (ii) The iW95 residue and its orientation, possibly supporting the S1S2 and S5S6 loops region, are very well conserved among pMHC-II α2 and β2 domains, but not among pMHC-I α3 and β2-m domains [ Figure 11B and Supplementary Files 1A (b), 1B (b) last picture]. The iW95 residue is very uncommon in non-MHC IgSF molecules and probably was specifically established in the MHC homodimer ancestral molecule [ Supplementary File 1A (b)]. The iW95 orientation in pMHC-II IgSF domains [ Figure 11B ] probably represents the ancestral situation. Experimental replacement of β2-m iW95 by a glycine has been reported to induce protein instability, confirming a role of iW95 in β2-m stabilization (78). The lack of strict conservation of the I-α3 domain structures is highlighted by the single residue shift in the main chain pairing between β-strands S6 and S7 in frog pMHC-I [last two pictures in Supplementary File 1B (b)].
Taken together, among the different pMHC IgSF domains, the MHC-II α2 domain appears to be the most conservative and representative of the IgSF structure in the MHC homodimer ancestor regarding the combined features of S1S2-loop orientation, IgSF-typical topohydrophobic inner core, and iW95 orientation.
MHC-IIα Plus MHC-IIβ Genetic Fusions in Lungfishes Suggest Another Possible Route for the Creation of MHC-I Genes From MHC-II-Like Ancestral Genes
In four different lungfish species, we found transcripts encoding soluble proteins consisting of fusions of MHC-IIα and MHC-IIβ with a linker in between ( Figure 12A and Supplementary File 4 ), which we named MHC-IIabSol. For three of the species we assembled these sequences by hand from single read archive (SRA) NCBI databases, namely for slender lungfish (Protopterus dolloi), West African lungfish (Protopterus annectens), and Australian lungfish (Neoceratodus forsteri). For South American lungfish (Lepidosiren paradoxa) sequence, a similar sequence was retrieved from an NCBI transcriptome shotgun assembly (TSA) database. For slender lungfish, we confirmed the MHC-IIabSol sequence experimentally using RNA from kidney. Indications that the gene encoding MHC-IIabSol might also produce single α or β chains, or transmembrane proteins, were not observed; however, we dedicated little effort to potentially finding alternatively spliced transcripts, and we cannot deny their possible existence. Although recombinant fusions between IIα and IIβ have been created artificially as a tool for MHC research (e.g., 79), we are not aware of previous reports on their natural existence. Considering their conservation of peptide-binding residues ( Supplementary File 4B ), the MHC-IIabSol proteins are expected to bind proteins. How a secreted pMHC-II molecule can have a biological function is unclear at this moment. The Australian and other lungfishes separated >150 million years ago (80), concluding that MHC-IIabSol is ancient. Phylogenetic tree analysis is not conclusive on how to cluster MHC-IIabSol with classical MHC-II sequences ( Supplementary File 4C ), which is a common problem when comparing proteins with different functions over large evolutionary distances (14). However, the sequences ( Supplementary File 4B ) and trees for the different domains ( Supplementary File 4C ) collectively suggest that MHC-IIabSol was established from classical MHC-II genes only in Sarcopterygii (lobe-finned fish plus tetrapods), possibly only in Dipnomorpha (lungfishes). In the context of the present paper, we are predominantly interested in MHC-IIabSol because its MHC-IIα to MHC-IIβ fusion suggests an alternative possible route for II-I evolution. Namely, rather than that a translocation event directly created an MHC-I/β2-m ectodomain exon organization from an MHC-II-like organization (the Figure 1 model), an intermediate step in the creation of the MHC-I/β2-m system might have been an MHC-IIα/MHC-IIβ genetic fusion as speculated in Figure 12B .
Discussion
The present study is the first to extensively compare structures of pMHC-I with pMHC-II. After Pamela Bjorkman as part of the Don Wiley group was the first in 1987 to elucidate a pMHC-I structure (4), hitherto, the most comprehensive analysis of pMHC-I structural organization had been the seminal study in 1991 by Saper, Bjorkman, and Wiley (49); both studies only investigated pHLA-A2. After the first pMHC-II structure was reported in 1993 by Brown et al. (5), global comparisons between pMHC-I and pMHC-II structures were made [e.g., (5, 7)], but those studies did not achieve the comprehensiveness of the Saper et al. (49) analysis, and the majority of the pMHC structural studies that followed focused on peptide-binding groove properties.
To our knowledge, other than discussion of how genes and exons may have been duplicated, differentiated, and/or shuffled [e.g., (20–23)], the early evolution of pMHC structures has hardly been discussed before. In Supplementary File 5 , we propose a model of the major structural changes in the evolution of an ancestral MHC homodimer towards extant pMHC-I and pMHC-II. In the present study, we do not speculate on the type of homodimer molecules that may have been the evolutionary origin of the MHC homodimer molecules, and for such speculation we refer to our previous study (15).
The MHC homodimer ancestral state probably concerned a symmetric molecular structure ( Figure 13 , which corresponds to Stage 1 in Supplementary File 5 ). Probable features were a pab domain with an 8-stranded β-sheet, as common in extant pMHC-I and pMHC-II ( Figure 5A ), topped by two helical structures each similar to as found in extant pMHC-II pb (II-β1) domains [ Figures 4B (a), 5B (a)]. Characteristic features of the β-sheet were the p33+p34 β-bulge ( Figure 5C and Supplementary File 3D ) and the S2S3- and S3S4-loop orientations [ Figures 5C (b) and -(d), and 5D ]. The peptide probably was bound, as in extant pMHC-II, in an extended poly-proline-like fashion, and important residues for peptide main chain binding presumably were pW60 and p(D/N)77 ( Figure 6 ). Unique features of the MHC homodimer IgSF domains appear to have been their S1S2- plus S5S6-loop orientations ( Figure 10 ) in combination with residue iW95 ( Figure 11 ); these features locate distal from the pab domains, and, although this is speculation only, by increasing local stability they may have allowed modifications in other parts of the IgSF domains.
The homodimer ancestral gene must have duplicated and given rise to the early MHC a- and b-lineage genes ( Figure 1 ) (20, 21), which together encoded an MHC-II-like heterodimer structure. Several a- and b-lineage-specific residues are rather well conserved in both MHC-I and MHC-II ( Figure 3 and Supplementary File 1A ). The most dramatic change from the homodimer ancestral structure to the last common structure before the evolutionary separation of the MHC-I and MHC-II lineages (Stage 2 in Supplementary File 5 ) presumably was the repositioning of the IgSF domain ia domain (β2-m or II-α2 in extant pMHC). Bjorkman et al. (4) already noted that the asymmetrical organization of the pMHC-I IgSF domains was very distinct from a typical homodimer-like structure, and the observed, unusual, central position of the ia domain under the pab floor was later confirmed for pMHC-II (5). Given that there is no space for two IgSF domains at that central position, this cannot have been the IgSF orientation in a symmetrical ancestral MHC homodimer structure, and the IgSF domain orientations may have been more similar to those of ib domains (I-α3 and II-β2) in extant pMHC (as speculated in Figure 13A ). The central position of the ia domain orientation may have stabilized the pab floor and so have allowed modifications of the pa domain helix organization. However, it is unclear in how far the different pa domain helix organizations in extant pMHC-I α1 and pMHC-II α1 share a common history, although they do share common differences from the presumed ancestral helix organization found in extant pMHC-II β1 domains [ Figures 3A , 5B (b) and –(d)]. Possibly, an increased flexibility of the pa domain helical region near the P1 pocket may have contributed to peptide ligand selectivity, as is known for extant pMHC-II in mammals (62).
Dominant changes from the time of the last common MHC-I and MHC-II ancestor towards extant MHC-II (Stage 3a in Supplementary File 5 ) presumably were: (i) the melting of part of the pa domain (II-α1 domain) helical region at the border between the H1 and H2 helices [ Figure 5B (b)], probably in order to enhance peptide editing abilities (12, 56); (ii) acquisition of residue ibW60 in the MHC-II β2 domain [ Supplementary File 1A (b)] for insertion into the pab domain near the P1 pocket ( Figure 8 ) (73, 74), possibly for strengthening the local pab floor; (iii) the acquisition of peptide binding groove residues paN70 in the II-α1 domain and pb(H/N)76 in the II-β1 domain for binding the peptide ligand main chain ( Figure 6B ), possibly to compensate for the loss of the ancestral pW60 in the II-α1 domain ( Figures 3A , 6F and Supplementary File 1A ); and (iv) —although this remains to be determined for amphibians and fishes—an “S2-S3 cleft” in the pa part of the pab domain β-sheet, possibly for increasing the flexibility of the P1 pocket region ( Figure 5C and Supplementary File 3D ). Sequence comparisons indicate that the change of the P1 pocket pa helical region together with the probable strengthening of the P1 pocket floor by ibW60 is shared among MHC-II throughout jawed vertebrate species ( Supplementary File 1A ), despite that nonclassical MHC-II lineage DM, which binds to the classical pMHC-II P1 pocket pa helical wall and so participates in peptide editing (62), has only been found from the level of Rhipidistia (Dipnomorpha plus tetrapods) (15). It has been speculated that in more primitive jawed vertebrates, DM-like peptide editing function may be executed by classical MHC-II in higher order complexes (12, 15).
Pronounced changes did occur from the last common ancestor with MHC-II towards extant MHC-I (Stage 3b in Supplementary File 5 ). We propose that the major changes for creating extant MHC-I were: (i) Establishment of covalent pa+pb+ib+CP/TM/CY (HC) and free ia (β2-m) by a process involving exon shuffling events, possibly by a route as proposed in Figure 1 (20, 21) or as in Figure 5 ; (ii) Creation of a closed pab groove ( Figures 5B and 6F, G ) with a new P1 pocket closer to the pab center ( Figures 5A , 6 ) and the acquisition of new/additional P1 and P9 pocket residues [ Figures 3A , 6 and Supplementary File 1A (a)] (iii) Establishment of three unique interdomain interactions involving the ia domain residues iaY10, iaD53, and iaF56+iaW60 ( Figures 7 , 9 and Supplementary Files 1 , 3A, B, E ); and (iv), an “S2-S3 cleft” in the pb part of the pab domain β-sheet which may increase the flexibility of the P9 pocket region ( Figure 5C and Supplementary File 3D ). As for the third point, especially the insertion of the hydrophobic iaF56+iaW60 knob into the pa9 pleat of the pab domain made a big structural difference and, in evolution, was accompanied by the acquisition/fixation of a large set of interacting residues [ Figure 7A (a) and Supplementary Files 1A , 2A, B , 3A ]. In our joint study we provided experimental evidence by alanine exchanges showing that all four β2-m residues iaY10, iaD53, iaF56, and iaW60 are important for pMHC-I complex formation (27), whereas previous studies only showed this for iaD53 and iaW60 (78, 81).
Comparison of sequences and structures suggest that the ia and ib IgSF domains in pMHC-I lost some rigidity compared to these domains in pMHC-II and in the presumed ancestral MHC homodimer ( Figure 11 ). This may be related to a single β2-m molecule having to be able to bind a variety of different I-α3 domains of classical and nonclassical MHC-I (e.g., 17), or with an induced fit mechanism. However, as for the latter, the only consistent structural difference of free β2-m versus β2-m within pMHC-I complexes, analyzed for a variety of species, appears to concern that the orientations of the iaF56 and iaW60 residues in free β2-m are different from that in pMHC-I; however, although their orientations are similar among pMHC-Is, they differ between free β2-m of different species, making a more detailed discussion difficult (27).
MHC-II appear to represent the older MHC organization in regard to molecular structure, intron-exon organization, and peptide binding. Nevertheless, overall, the pa and pb domain sequences of MHC-I are better conserved in long term evolution, including ancestral residues that predate the separation of the MHC-I and MHC-II lineages ( Supplementary Files 1A, C ). This probably relates to the requirements for MHC-I function being more stringent than for MHC-II function.
A big question is why both MHC-I and MHC-II evolved and were both maintained in most jawed vertebrate species. Extant pMHC-II may have functions quite similar to the ancestral MHC homodimer, although with an improved capacity for peptide editing. On the other hand, in pMHC-I a substantially different peptide binding mode was created, with closed grooves, a shorter distance of P1-P9 binding, and a soluble ia (β2-m) domain. Probably, similar to the evolution of the MHC-II peptide editing system (12, 15, 56, 82, 83), the HC+β2-m system was developed to more strictly select only a subset of possible peptide ligands. Laws of thermodynamics conclude that synergistic binding of peptides to HC and its non-stably associated heterodimer partner β2-m (84, 85) should select for higher affinity peptides than in an imaginary situation with a similar but stable peptide-receptive MHC (e.g., imaginary covalent HC+β2-m) complex (three non-stably associated units have a stronger tendency to dissociate than two). Furthermore, closing of the grooves at the P1 and P9 pocket ends in MHC-I, compared to an open groove in the MHC-II-like ancestral situation, caused a major restriction to the lengths of peptides that could be bound. Additionally, the MHC-I groove acquired unfavorable properties for a stretched polyproline-type peptide conformation of most peptide ligands, resulting in the bulging of these peptides in individual ways ( Figure 6A ) (e.g., 70, 72, 86). Such individual bulging can structurally amplify peptide ligand sequence differences (87) and in case of tumor cells that probably makes it easier for T cells to distinguish between pMHC-I complexes with peptide ligands that differ in only single residues, even if the sidechains of those residue are not exposed at the pMHC-I surface. Considering these arguments, we speculate that the pMHC-I peptide binding mode has been developed to structurally maximize small sequence differences in peptide ligands so that it is better equipped to also recognize tumor antigens. In addition, the increased selectivity for peptide ends by having a closed groove such as in MHC-I, may have improved the ability of the immune system to distinguish between peptides generated by the different peptidase activities of proteasomes versus immunoproteasomes (88), and thus between homeostasis and inflammation conditions. After MHC-I was established, it has been superior compared to MHC-II as a source for creating a wide array of divergent nonclassical MHC molecules (e.g., 25, 89), which might be explained by the covalent association of the pa and pb domains providing a stabler situation for making functional changes. Given this superior plasticity of MHC-I, it is puzzling why most species still have MHC-II. We speculate that in unstable/variable endocytic/phagocytic compartments, where classical MHC-II molecules are loaded with peptides, the seemingly simple and robust classical MHC-II loading system (90) has advantages over the classical MHC-I loading system which—at least for the initial loading with peptide—requires free β2-m and multiple other factors (2). It probably would be difficult to deliver the different molecules of the MHC-I loading system in proper stoichiometry to the variable endocytic/phagocytic compartments, and the system might also be susceptible to the biochemical variations in those compartments. That the classical MHC-I loading system in mammals and fishes is similar is suggested by the conservation of relevant genes (91–93).
It has been argued that the presence of classical MHC-I genes on human Chr. 6 and of a nonclassical MHC-I gene (encoding FcRn) on human Chr. 19 suggest that MHC-I is older than MHC-II (26). This hypothesis is based on a model, from a time before whole genome sequences were known, that these human chromosomal regions are paralogues deriving from a whole genome duplication very early in vertebrate evolution (94). However, previously, we have shown that the corresponding genomic regions are linked in teleost fishes, so that many similarities between the regions may have derived from tandem gene duplications and intrachromosomal rearrangements occurring before the separation between Actinopterygii (ray-finned-fishes) and Sarcopterygii, which then in the tetrapod lineage were separated by an interchromosomal translocation event (95). Moreover, the FcRn lineage probably only separated from the classical MHC-I lineage within tetrapod evolution (14) and, in our opinion, FcRn is not a candidate for representing MHC molecules that predated the I-II divide.
With the finding of the natural MHC IIα-IIβ fusions in lungfish, which we designated MHC-IIabSol, the present study provides evidence that nature allows structures other than—complete or partial—consensus MHC-I or MHC-II organizations. Although the MHC-IIabSol sequences are very different from the classical MHC-II sequences in lungfishes, they probably were established from duplicates of classical MHC-II genes within Sarcopterygii or even within Dipnomorpha (lungfishes) ( Supplementary File 4 ). Thus, these fusions represent a principle of how MHC-II exons can be combined for encoding larger structures, although we do not propose lungfish MHC-IIabSol itself to belong to an MHC lineage that predated the I-II divide. Whether the exon shuffling events for the evolutionary creation of MHC-I genes from an MHC-II-like origin were similar to the original model proposed by Kaufman et al. (20) ( Figure 1 ) or involved a IIα-IIβ fusion as hypothesized in our Figure 12 model, can’t be determined.
In short, the present study provides structural and genetic support for the II-I model of MHC evolution, identified critical evolutionary events, and recognized a large set of characteristic features— some of which for the first time—in especially pMHC-I. Future studies should use these insights to further unravel MHC functions and mechanisms.
Data Availability Statement
The original contributions presented in the study are included in the article/ Supplementary Material . Further inquiries can be directed to the corresponding authors.
Author Contributions
YW and NZ contributed equally to the manuscript. They made figures and were involved in the generation of the concepts and checking the manuscript. KH and CX were involved in the generation of the concepts and checking the manuscript. CX was responsible for the generation of many of the primary data compared in this study. JD generated most of the concepts, made most of the figures, and wrote the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the 973 Project of the China Ministry of Science and Technology, Grant Number 2013CB835302, by the National Natural Science Foundation of China (NSFC) and by the Japan Society for the Promotion of Science Grants-in-Aid for Scientific Research, Grant Number JP26440201. The funders had no role in study design, data collection, or interpretation.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2021.621153/full#supplementary-material
References
- 1. Germain RN. MHC-Dependent Antigen Processing and Peptide Presentation: Providing Ligands for T Lymphocyte Activation. Cell (1994) 76(2):287–99. 10.1016/0092-8674(94)90336-0 [DOI] [PubMed] [Google Scholar]
- 2. Neefjes J, Jongsma ML, Paul P, Bakke O. Towards a Systems Understanding of MHC Class I and MHC Class II Antigen Presentation. Nat Rev Immunol (2011) 11(12):823–36. 10.1038/nri3084 [DOI] [PubMed] [Google Scholar]
- 3. Bjorkman PJ, Strominger JL, Wiley DC. Crystallization and X-Ray Diffraction Studies on the Histocompatibility Antigens HLA-A2 and HLA-A28 From Human Cell Membranes. J Mol Biol (1985) 186(1):205–10. 10.1016/0022-2836(85)90271-2 [DOI] [PubMed] [Google Scholar]
- 4. Bjorkman PJ, Saper MA, Samraoui B, Bennett WS, Strominger JL, Wiley DC. Structure of the Human Class I Histocompatibility Antigen, HLA-A2. Nature (1987) 329(6139):506–12. 10.1038/329506a0 [DOI] [PubMed] [Google Scholar]
- 5. Brown JH, Jardetzky TS, Gorga JC, Stern LJ, Urban RG, Strominger JL, et al. Three-Dimensional Structure of the Human Class II Histocompatibility Antigen HLA-DR1. Nature (1993) 364(6432):33–9. 10.1038/364033a0 [DOI] [PubMed] [Google Scholar]
- 6. Brown JH, Jardetzky T, Saper MA, Samraoui B, Bjorkman PJ, Wiley DC. A Hypothetical Model of the Foreign Antigen Binding Site of Class II Histocompatibility Molecules. Nature (1988) 332(6167):845–50. 10.1038/332845a0 [DOI] [PubMed] [Google Scholar]
- 7. Stern LJ, Wiley DC. Antigenic Peptide Binding by Class I and Class II Histocompatibility Proteins. Structure (1994) 2(4):245–51. 10.1016/s0969-2126(00)00026-5 [DOI] [PubMed] [Google Scholar]
- 8. Rammensee HG, Friede T, Stevanoviic S. MHC Ligands and Peptide Motifs: First Listing. Immunogenetics (1995) 41(4):178–228. 10.1007/BF00172063 [DOI] [PubMed] [Google Scholar]
- 9. Schellens IM, Hoof I, Meiring HD, Spijkers SN, Poelen MC, van Gaans-van den Brink JA, et al. Comprehensive Analysis of the Naturally Processed Peptide Repertoire: Differences Between HLA-A and B in the Immunopeptidome. PloS One (2015) 10(9):e0136417. 10.1371/journal.pone.0136417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Malmstrom M, Matschiner M, Torresen OK, Star B, Snipen LG, Hansen TF, et al. Evolution of the Immune System Influences Speciation Rates in Teleost Fishes. Nat Genet (2016) 48(10):1204–10. 10.1038/ng.3645 [DOI] [PubMed] [Google Scholar]
- 11. Dijkstra JM, Grimholt U. Major Histocompatibility Complex (MHC) Fragment Numbers Alone - In Atlantic Cod and in General - Do Not Represent Functional Variability. F1000Res (2018) 7:963. 10.12688/f1000research.15386.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Dijkstra JM, Grimholt U, Leong J, Koop BF, Hashimoto K. Comprehensive Analysis of MHC Class II Genes in Teleost Fish Genomes Reveals Dispensability of the Peptide-Loading DM System in a Large Part of Vertebrates. BMC Evol Biol (2013) 13:260. 10.1186/1471-2148-13-260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Grimholt U, Tsukamoto K, Azuma T, Leong J, Koop BF, Dijkstra JM. A Comprehensive Analysis of Teleost MHC Class I Sequences. BMC Evol Biol (2015) 15:32. 10.1186/s12862-015-0309-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Dijkstra JM, Yamaguchi T, Grimholt U. Conservation of Sequence Motifs Suggests That the Nonclassical MHC Class I Lineages CD1/PROCR and UT Were Established Before the Emergence of Tetrapod Species. Immunogenetics (2018) 70(7):459–76. 10.1007/s00251-017-1050-2 [DOI] [PubMed] [Google Scholar]
- 15. Dijkstra JM, Yamaguchi T. Ancient Features of the MHC Class II Presentation Pathway, and A Model for the Possible Origin of MHC Molecules. Immunogenetics (2019) 71(3):233–49. 10.1007/s00251-018-1090-2 [DOI] [PubMed] [Google Scholar]
- 16. Hughes AL, Nei M. Evolution of the Major Histocompatibility Complex: Independent Origin of Nonclassical Class I Genes in Different Groups of Mammals. Mol Biol Evol (1989) 6(6):559–79. 10.1093/oxfordjournals.molbev.a040573 [DOI] [PubMed] [Google Scholar]
- 17. Adams EJ, Luoma AM. The Adaptable Major Histocompatibility Complex (MHC) Fold: Structure and Function of Nonclassical and MHC Class I-Like Molecules. Annu Rev Immunol (2013) 31:529–61. 10.1146/annurev-immunol-032712-095912 [DOI] [PubMed] [Google Scholar]
- 18. Mellins ED, Stern LJ. HLA-DM and HLA-DO, Key Regulators of MHC-II Processing and Presentation. Curr Opin Immunol (2014) 26:115–22. 10.1016/j.coi.2013.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Flajnik MF, Kasahara M. Origin and Evolution of the Adaptive Immune System: Genetic Events and Selective Pressures. Nat Rev Genet (2010) 11(1):47–59. 10.1038/nrg2703 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Kaufman JF, Auffray C, Korman AJ, Shackelford DA, Strominger J. The Class II Molecules of the Human and Murine Major Histocompatibility Complex. Cell (1984) 36(1):1–13. 10.1016/0092-8674(84)90068-0 [DOI] [PubMed] [Google Scholar]
- 21. Kaufman J. Vertebrates and the Evolution of the Major Histocompatibility Complex (MHC) Class I and Class II Molecules. Verh Dtsch Zool Ges (1988) 81:131–44. [Google Scholar]
- 22. Hughes AL, Nei M. Evolutionary Relationships of the Classes of Major Histocompatibility Complex Genes. Immunogenetics (1993) 37(5):337–46. 10.1007/BF00216798 [DOI] [PubMed] [Google Scholar]
- 23. Hashimoto K. The Transformation Hypothesis for the Origin of the Major Histocompatibility Complex (MHC) Family Molecules and the Elucidation of the Fundamental Ancient Duplication Unit (FADU) in the Genome. Recent Res Dev Immun (2003) 1:55–80. [Google Scholar]
- 24. Ohta Y, Shiina T, Lohr RL, Hosomichi K, Pollin TI, Heist EJ, et al. Primordial Linkage of Beta2-Microglobulin to the MHC. J Immunol (2011) 186(6):3563–71. 10.4049/jimmunol.1003933 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Flajnik MF, Canel C, Kramer J, Kasahara M. Which Came First, MHC Class I or Class II? Immunogenetics (1991) 33(5-6):295–300. 10.1007/BF00216688 [DOI] [PubMed] [Google Scholar]
- 26. Ohta Y, Kasahara M, O’Connor TD, Flajnik MF. Inferring the “Primordial Immune Complex”: Origins of MHC Class I and Antigen Receptors Revealed by Comparative Genomics. J Immunol (2019) 203(7):1882–96. 10.4049/jimmunol.1900597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Wu Y, Zhang N, Wei X, Lu S, Li S, Hashimoto K, et al. The Structure of a Peptide-Loaded Shark MHC Class I Molecule Reveals Features of the Binding Between β2-Microglobulin and Heavy Chain Conserved in Evolution. J Immunol (Forthcoming 2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Chen Z, Zhang N, Qi J, Chen R, Dijkstra JM, Li X, et al. The Structure of the MHC Class I Molecule of Bony Fishes Provides Insights Into the Conserved Nature of the Antigen-Presenting System. J Immunol (2017) 199(10):3668–78. 10.4049/jimmunol.1600229 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Li Z, Zhang N, Ma L, Zhang L, Meng G, Xia C. The Mechanism of Beta2m Molecule-Induced Changes in the Peptide Presentation Profile in a Bony Fish. iScience (2020) 23(5):101119. 10.1016/j.isci.2020.101119 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Ma L, Zhang N, Qu Z, Liang R, Zhang L, Zhang B, et al. A Glimpse of the Peptide Profile Presentation by Xenopus Laevis MHC Class I: Crystal Structure of pXela-UAA Reveals a Distinct Peptide-Binding Groove. J Immunol (2020) 204(1):147–58. 10.4049/jimmunol.1900865 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Koch M, Camp S, Collen T, Avila D, Salomonsen J, Wallny HJ, et al. Structures of An MHC Class I Molecule From B21 Chickens Illustrate Promiscuous Peptide Binding. Immunity (2007) 27(6):885–99. 10.1016/j.immuni.2007.11.007 [DOI] [PubMed] [Google Scholar]
- 32. Zhang J, Chen Y, Qi J, Gao F, Liu Y, Liu J, et al. Narrow Groove and Restricted Anchors of MHC Class I Molecule BF2*0401 Plus Peptide Transporter Restriction can Explain Disease Susceptibility of B4 Chickens. J Immunol (2012) 189(9):4478–87. 10.4049/jimmunol.1200885 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Li X, Zhang L, Liu Y, Ma L, Zhang N, Xia C. Structures of the MHC-I Molecule BF2*1501 Disclose the Preferred Presentation of an H5N1 Virus-Derived Epitope. J Biol Chem (2020) 295(16):5292–306. 10.1074/jbc.RA120.012713 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Wu Y, Wang J, Fan S, Chen R, Liu Y, Zhang J, et al. Structural Definition of Duck Major Histocompatibility Complex Class I Molecules That Might Explain Efficient Cytotoxic T Lymphocyte Immunity to Influenza A Virus. J Virol (2017) 91(14):e02511–16. 10.1128/JVI.02511-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Fremont DH, Matsumura M, Stura EA, Peterson PA, Wilson IA. Crystal Structures of Two Viral Peptides in Complex With Murine MHC Class I H-2Kb. Science (1992) 257(5072):919–27. 10.1126/science.1323877 [DOI] [PubMed] [Google Scholar]
- 36. Fan S, Wu Y, Wang S, Wang Z, Jiang B, Liu Y, et al. Structural and Biochemical Analyses of Swine Major Histocompatibility Complex Class I Complexes and Prediction of the Epitope Map of Important Influenza A Virus Strains. J Virol (2016) 90(15):6625–41. 10.1128/JVI.00119-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Liang R, Sun Y, Liu Y, Wang J, Wu Y, Li Z, et al. Major Histocompatibility Complex Class I (FLA-E*01801) Molecular Structure in Domestic Cats Demonstrates Species-Specific Characteristics in Presenting Viral Antigen Peptides. J Virol (2018) 92(6):e01631–17. 10.1128/JVI.01631-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Qu Z, Li Z, Ma L, Wei X, Zhang L, Liang R, et al. Structure and Peptidome of the Bat MHC Class I Molecule Reveal a Novel Mechanism Leading to High-Affinity Peptide Binding. J Immunol (2019) 202(12):3493–506. 10.4049/jimmunol.1900001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Zhang L, Li X, Ma L, Zhang B, Meng G, Xia C. A Newly Recognized Pairing Mechanism of the Alpha- and Beta-Chains of the Chicken Peptide-MHC Class II Complex. J Immunol (2020) 204(6):1630–40. 10.4049/jimmunol.1901305 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Fremont DH, Hendrickson WA, Marrack P, Kappler J. Structures of an MHC Class II Molecule With Covalently Bound Single Peptides. Science (1996) 272(5264):1001–4. 10.1126/science.272.5264.1001 [DOI] [PubMed] [Google Scholar]
- 41. Hashimoto K, Okamura K, Yamaguchi H, Ototake M, Nakanishi T, Kurosawa Y. Conservation and Diversification of MHC Class I and Its Related Molecules in Vertebrates. Immunol Rev (1999) 167:81–100. 10.1111/j.1600-065x.1999.tb01384.x [DOI] [PubMed] [Google Scholar]
- 42. Kabsch W, Sander C. Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen-Bonded and Geometrical Features. Biopolymers (1983) 22(12):2577–637. 10.1002/bip.360221211 [DOI] [PubMed] [Google Scholar]
- 43. Nielsen H, Tsirigos KD, Brunak S, von Heijne G. A Brief History of Protein Sorting Prediction. Protein J (2019) 38(3):200–16. 10.1007/s10930-019-09838-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol (2016) 33(7):1870–4. 10.1093/molbev/msw054 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Collaborative Computational Project N . The CCP4 Suite: Programs for Protein Crystallography. Acta Crystallogr D Biol Crystallogr (1994) 50(Pt 5):760–3. 10.1107/S0907444994003112 [DOI] [PubMed] [Google Scholar]
- 46. Krissinel E, Henrick K. Inference of Macromolecular Assemblies From Crystalline State. J Mol Biol (2007) 372(3):774–97. 10.1016/j.jmb.2007.05.022 [DOI] [PubMed] [Google Scholar]
- 47. Donoghue PCJ, Keating JN, Smith A. Early Vertebrate Evolution. Palaeontology (2014) 57(5):879–93. 10.1111/pala.12125 [DOI] [Google Scholar]
- 48. Benton MJ, Donoghue PC. Paleontological Evidence to Date the Tree of Life. Mol Biol Evol (2007) 24(1):26–53. 10.1093/molbev/msl150 [DOI] [PubMed] [Google Scholar]
- 49. Saper MA, Bjorkman PJ, Wiley DC. Refined Structure of the Human Histocompatibility Antigen HLA-A2 at 2.6 A Resolution. J Mol Biol (1991) 219(2):277–319. 10.1016/0022-2836(91)90567-p [DOI] [PubMed] [Google Scholar]
- 50. Fremont DH, Monnaie D, Nelson CA, Hendrickson WA, Unanue ER. Crystal Structure of I-Ak in Complex With a Dominant Epitope of Lysozyme. Immunity (1998) 8(3):305–17. 10.1016/s1074-7613(00)80536-1 [DOI] [PubMed] [Google Scholar]
- 51. Aoyagi K, Dijkstra JM, Xia C, Denda I, Ototake M, Hashimoto K, et al. Classical MHC Class I Genes Composed of Highly Divergent Sequence Lineages Share a Single Locus in Rainbow Trout (Oncorhynchus Mykiss). J Immunol (2002) 168(1):260–73. 10.4049/jimmunol.168.1.260 [DOI] [PubMed] [Google Scholar]
- 52. Kaufman J, Andersen R, Avila D, Engberg J, Lambris J, Salomonsen J, et al. Different Features of the MHC Class I Heterodimer Have Evolved at Different Rates. Chicken B-F and Beta 2-Microglobulin Sequences Reveal Invariant Surface Residues. J Immunol (1992) 148(5):1532–46. [PubMed] [Google Scholar]
- 53. Shand R, Dixon B. Teleost Major Histocompatibility Genes: Diverse But Not Complex. Mod Asp Immunobiol (2001) 2:66–72. [Google Scholar]
- 54. Painter CA, Stern LJ. Conformational Variation in Structures of Classical and Non-Classical MHCII Proteins and Functional Implications. Immunol Rev (2012) 250(1):144–57. 10.1111/imr.12003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Li P, Willie ST, Bauer S, Morris DL, Spies T, Strong RK. Crystal Structure of the MHC Class I Homolog MIC-A, A Gammadelta T Cell Ligand. Immunity (1999) 10(5):577–84. 10.1016/s1074-7613(00)80057-6 [DOI] [PubMed] [Google Scholar]
- 56. Gunther S, Schlundt A, Sticht J, Roske Y, Heinemann U, Wiesmuller KH, et al. Bidirectional Binding of Invariant Chain Peptides to an MHC Class II Molecule. Proc Natl Acad Sci USA (2010) 107(51):22219–24. 10.1073/pnas.1014708107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Abualrous ET, Saini SK, Ramnarayan VR, Ilca FT, Zacharias M, Springer S. The Carboxy Terminus of the Ligand Peptide Determines the Stability of the MHC Class I Molecule H-2Kb: A Combined Molecular Dynamics and Experimental Study. PloS One (2015) 10(8):e0135421. 10.1371/journal.pone.0135421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Jiang J, Natarajan K, Boyd LF, Morozov GI, Mage MG, Margulies DH. Crystal Structure of a TAPBPR-MHC I Complex Reveals the Mechanism of Peptide Editing in Antigen Presentation. Science (2017) 358(6366):1064–8. 10.1126/science.aao5154 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Thomas C, Tampe R. Structure of the TAPBPR-MHC I Complex Defines the Mechanism of Peptide Loading and Editing. Science (2017) 358(6366):1060–4. 10.1126/science.aao6001 [DOI] [PubMed] [Google Scholar]
- 60. Ayres CM, Abualrous ET, Bailey A, Abraham C, Hellman LM, Corcelli SA, et al. Dynamically Driven Allostery in MHC Proteins: Peptide-Dependent Tuning of Class I MHC Global Flexibility. Front Immunol (2019) 10:966. 10.3389/fimmu.2019.00966 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Chou CL, Sadegh-Nasseri S. HLA-DM Recognizes the Flexible Conformation of Major Histocompatibility Complex Class II. J Exp Med (2000) 192(12):1697–706. 10.1084/jem.192.12.1697 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Pos W, Sethi DK, Call MJ, Schulze MS, Anders AK, Pyrdol J, et al. Crystal Structure of the HLA-DM-HLA-DR1 Complex Defines Mechanisms for Rapid Peptide Selection. Cell (2012) 151(7):1557–68. 10.1016/j.cell.2012.11.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Jardetzky TS, Brown JH, Gorga JC, Stern LJ, Urban RG, Strominger JL, et al. Crystallographic Analysis of Endogenous Peptides Associated With HLA-DR1 Suggests a Common, Polyproline II-Like Conformation for Bound Peptides. Proc Natl Acad Sci USA (1996) 93(2):734–8. 10.1073/pnas.93.2.734 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Guo HC, Jardetzky TS, Garrett TP, Lane WS, Strominger JL, Wiley DC. Different Length Peptides Bind to HLA-Aw68 Similarly at Their Ends But Bulge Out in the Middle. Nature (1992) 360(6402):364–6. 10.1038/360364a0 [DOI] [PubMed] [Google Scholar]
- 65. Ekeruche-Makinde J, Miles JJ, van den Berg HA, Skowera A, Cole DK, Dolton G, et al. Peptide Length Determines the Outcome of TCR/Peptide-MHCI Engagement. Blood (2013) 121(7):1112–23. 10.1182/blood-2012-06-437202 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Remesh SG, Andreatta M, Ying G, Kaever T, Nielsen M, McMurtrey C, et al. Unconventional Peptide Presentation by Major Histocompatibility Complex (MHC) Class I Allele HLA-A*02:01: Breaking Confinement. J Biol Chem (2017) 292(13):5262–70. 10.1074/jbc.M117.776542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Guillaume P, Picaud S, Baumgaertner P, Montandon N, Schmidt J, Speiser DE, et al. The C-terminal Extension Landscape of Naturally Presented HLA-I Ligands. Proc Natl Acad Sci USA (2018) 115(20):5083–8. 10.1073/pnas.1717277115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Xiao J, Xiang W, Zhang Y, Peng W, Zhao M, Niu L, et al. An Invariant Arginine in Common With MHC Class II Allows Extension at the C-Terminal End of Peptides Bound to Chicken MHC Class I. J Immunol (2018) 201(10):3084–95. 10.4049/jimmunol.1800611 [DOI] [PubMed] [Google Scholar]
- 69. Natarajan K, Margulies DH. Cutting Antigenic Peptides Down to Size. J Biol Chem (2019) 294(49):18545–6. 10.1074/jbc.H119.011803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Chappell P, Meziane el K, Harrison M, Magiera L, Hermann C, Mears L, et al. Expression Levels of MHC Class I Molecules are Inversely Correlated With Promiscuity of Peptide Binding. Elife (2015) 4:e05345. 10.7554/eLife.05345 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Matsumura M, Fremont DH, Peterson PA, Wilson IA. Emerging Principles for the Recognition of Peptide Antigens by MHC Class I Molecules. Science (1992) 257(5072):927–34. 10.1126/science.1323878 [DOI] [PubMed] [Google Scholar]
- 72. Madden DR, Garboczi DN, Wiley DC. The Antigenic Identity of peptide-MHC Complexes: A Comparison of the Conformations of Five Viral Peptides Presented by HLA-A2. Cell (1993) 75(4):693–708. 10.1016/0092-8674(93)90490-h [DOI] [PubMed] [Google Scholar]
- 73. Murthy VL, Stern LJ. The Class II MHC Protein HLA-DR1 in Complex With an Endogenous Peptide: Implications for the Structural Basis of the Specificity of Peptide Binding. Structure (1997) 5(10):1385–96. 10.1016/s0969-2126(97)00288-8 [DOI] [PubMed] [Google Scholar]
- 74. Li XL, Teng MK, Reinherz EL, Wang JH. Strict Major Histocompatibility Complex Molecule Class-Specific Binding by Co-Receptors Enforces MHC-Restricted Alphabeta TCR Recognition During T Lineage Subset Commitment. Front Immunol (2013) 4:383. 10.3389/fimmu.2013.00383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Hatherley D, Graham SC, Harlos K, Stuart DI, Barclay AN. Structure of Signal-Regulatory Protein Alpha: A Link to Antigen Receptor Evolution. J Biol Chem (2009) 284(39):26613–9. 10.1074/jbc.M109.017566 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Halaby DM, Poupon A, Mornon J. The Immunoglobulin Fold Family: Sequence Analysis and 3D Structure Comparisons. Protein Eng (1999) 12(7):563–71. 10.1093/protein/12.7.563 [DOI] [PubMed] [Google Scholar]
- 77. Becker JW, Reeke GN., Jr. Three-Dimensional Structure of Beta 2-Microglobulin. Proc Natl Acad Sci USA (1985) 82(12):4225–9. 10.1073/pnas.82.12.4225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Esposito G, Ricagno S, Corazza A, Rennella E, Gumral D, Mimmi MC, et al. The Controlling Roles of Trp60 and Trp95 in Beta2-Microglobulin Function, Folding and Amyloid Aggregation Properties. J Mol Biol (2008) 378(4):887–97. 10.1016/j.jmb.2008.03.002 [DOI] [PubMed] [Google Scholar]
- 79. Starwalt SE, Masteller EL, Bluestone JA, Kranz DM. Directed Evolution of a Single-Chain Class II MHC Product by Yeast Display. Protein Eng (2003) 16(2):147–56. 10.1093/proeng/gzg018 [DOI] [PubMed] [Google Scholar]
- 80. Tokita M, Okamoto T, Hikida T. Evolutionary History of African Lungfish: A Hypothesis From Molecular Phylogeny. Mol Phylogenet Evol (2005) 35(1):281–6. 10.1016/j.ympev.2004.11.025 [DOI] [PubMed] [Google Scholar]
- 81. Shields MJ, Assefi N, Hodgson W, Kim EJ, Ribaudo RK. Characterization of the Interactions Between MHC Class I Subunits: A Systematic Approach for the Engineering of Higher Affinity Variants of Beta 2-Microglobulin. J Immunol (1998) 160(5):2297–307. 10.1016/j.micinf.2003.12.009 [DOI] [PubMed] [Google Scholar]
- 82. Pos W, Sethi DK, Wucherpfennig KW. Mechanisms of Peptide Repertoire Selection by HLA-DM. Trends Immunol (2013) 34(10):495–501. 10.1016/j.it.2013.06.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Guce AI, Mortimer SE, Yoon T, Painter CA, Jiang W, Mellins ED, et al. HLA-DO Acts as a Substrate Mimic to Inhibit HLA-DM by a Competitive Mechanism. Nat Struct Mol Biol (2013) 20(1):90–8. 10.1038/nsmb.2460 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Gakamsky DM, Bjorkman PJ, Pecht I. Peptide Interaction With A Class I Major Histocompatibility Complex-Encoded Molecule: Allosteric Control of the Ternary Complex Stability. Biochemistry (1996) 35(47):14841–8. 10.1021/bi961707u [DOI] [PubMed] [Google Scholar]
- 85. Gakamsky DM, Boyd LF, Margulies DH, Davis DM, Strominger JL, Pecht I. An Allosteric Mechanism Controls Antigen Presentation by the H-2K(b) Complex. Biochemistry (1999) 38(37):12165–73. 10.1021/bi9905821 [DOI] [PubMed] [Google Scholar]
- 86. Chelvanayagam G, Jakobsen IB, Gao X, Easteal S. Structural Comparison of Major Histocompatibility Complex Class I Molecules and Homology Modelling of Five Distinct Human Leukocyte Antigen-A Alleles. Protein Eng (1996) 9(12):1151–64. 10.1093/protein/9.12.1151 [DOI] [PubMed] [Google Scholar]
- 87. Theodossis A, Guillonneau C, Welland A, Ely LK, Clements CS, Williamson NA, et al. Constraints Within Major Histocompatibility Complex Class I Restricted Peptides: Presentation and Consequences for T-Cell Recognition. Proc Natl Acad Sci USA (2010) 107(12):5534–9. 10.1073/pnas.1000032107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Murata S, Takahama Y, Kasahara M, Tanaka K. The Immunoproteasome and Thymoproteasome: Functions, Evolution and Human Disease. Nat Immunol (2018) 19(9):923–31. 10.1038/s41590-018-0186-z [DOI] [PubMed] [Google Scholar]
- 89. Grimholt U, Tsukamoto K, Hashimoto K, Dijkstra JM. Discovery of a Novel MHC Class I Lineage in Teleost Fish Which Shows Unprecedented Levels of Ectodomain Deterioration While Possessing an Impressive Cytoplasmic Tail Motif. Cells (2019) 8(9):1056. 10.3390/cells8091056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Unanue ER, Turk V, Neefjes J. Variations in MHC Class II Antigen Processing and Presentation in Health and Disease. Annu Rev Immunol (2016) 34:265–97. 10.1146/annurev-immunol-041015-055420 [DOI] [PubMed] [Google Scholar]
- 91. Ohta Y, McKinney EC, Criscitiello MF, Flajnik MF. Proteasome, Transporter Associated With Antigen Processing, and Class I Genes in the Nurse Shark Ginglymostoma Cirratum: Evidence for a Stable Class I Region and MHC Haplotype Lineages. J Immunol (2002) 168(2):771–81. 10.4049/jimmunol.168.2.771 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92. Venkatesh B, Lee AP, Ravi V, Maurya AK, Lian MM, Swann JB, et al. Elephant Shark Genome Provides Unique Insights Into Gnathostome Evolution. Nature (2014) 505(7482):174–9. 10.1038/nature12826 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93. Yamaguchi T, Dijkstra JM. Major Histocompatibility Complex (MHC) Genes and Disease Resistance in Fish. Cells (2019) 8(4):378. 10.3390/cells8040378 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Kasahara M. The Chromosomal Duplication Model of the Major Histocompatibility Complex. Immunol Rev (1999) 167:17–32. 10.1111/j.1600-065x.1999.tb01379.x [DOI] [PubMed] [Google Scholar]
- 95. Ohashi K, Takizawa F, Tokumaru N, Nakayasu C, Toda H, Fischer U, et al. A Molecule in Teleost Fish, Related With Human MHC-Encoded G6F, Has a Cytoplasmic Tail With ITAM and Marks the Surface of Thrombocytes and in Some Fishes Also of Erythrocytes. Immunogenetics (2010) 62(8):543–59. 10.1007/s00251-010-0460-1 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are included in the article/ Supplementary Material . Further inquiries can be directed to the corresponding authors.