Skip to main content
Springer logoLink to Springer
. 2016 Jul 28;68(8):515–523. doi: 10.1007/s00251-016-0926-x

Mammalian CD1 and MR1 genes

Peter Reinink 1,2, Ildiko Van Rhijn 1,2,
PMCID: PMC5002277  PMID: 27470004

Abstract

All higher vertebrates share the fundamental components of the adaptive immune system: the B cell receptor, the T cell receptor, and classical MHC proteins. At a more detailed level, their immune systems vary considerably, especially with respect to the non-polymorphic MHC class I-like proteins. In mammals, the CD1 family of lipid-presenting proteins is encoded by clusters of genes of widely divergent sizes and compositions. Another MHC class I-like protein, MR1, is typically encoded by a single gene that is highly conserved among species. Based on mammalian genomes and the available data on cellular expression profiles and protein structure, we review MR1 genes and families of CD1 genes in modern mammals from a genetic and functional perspective. Understanding the CD1 and MR1 systems across animal species provides insights into the specialized functions of the five types of CD1 proteins and facilitates careful consideration of animal models for human diseases in which immune responses to lipids and bacterial metabolites play a role.

Keywords: CD1, Lipid antigens, Mammals, MR1


Whereas MHC genes and the genes for T cell receptors appeared simultaneously during evolution and are present in all extant jawed vertebrates, the genes for CD1 and MR1 have a more limited distribution. Keeping the phylogenetic relationships among animal species in mind, we will describe the CD1 and MR1 genes in extant species. The evolution of these genes is beyond the scope of this review. CD1 and MR1 proteins bind lipids and vitamin B metabolites respectively, and present these to T cells, as opposed to the classical MHC proteins, which present peptide antigens to T cells. CD1 and MR1 genes have not been found in fish, while fish do contain MHC class I and II genes (Dascher 2007). Reptiles (Yang et al. 2015), birds (Miller et al. 2005; Salomonsen et al. 2005), and marsupials (Baker and Miller 2007; Cheng and Belov 2014) have CD1 genes that can clearly be distinguished from classical MHC genes and form an interspecies group with the mammalian CD1 genes. CD1a, CD1b, CD1c, CD1d, and CD1e proteins most likely arose in a common ancestor of placental mammals from a primordial form of CD1. Gene duplications, deletions, and gene inactivations shaped the composition of the CD1 family of genes, possibly under selective pressure associated with an immune function. MR1 genes are absent in fish and reptiles, but in marsupials and mammals, there is typically a single functional MR1 gene. Of all the MHC Class I-like proteins, MR1 shows the highest conservation among species.

Chromosomal location and gene structure of CD1 and MR1

In mammals, the CD1 and MR1 genes are not part of the MHC locus. In humans, the CD1 and MR1 genes are located on chromosome 1 and the MHC locus lies on chromosome 6. However, in chicken, the CD1 and MHC loci are linked (Miller et al. 2005; Salomonsen et al. 2005). The genes that encode the classical MHC molecules are highly polymorphic with hundreds to thousands allelic variants, and this is thought to be closely related to their function of presenting peptides to T cells. No allelic variants for CD1b, CD1c, and MR1 are known. Allelic variants in the form of non-synonymous single nucleotide polymorphisms in human CD1a and CD1d are known, but the affected amino acids are not located in the antigen-binding cleft or the TCR-binding surface (Han et al. 1999; Oteo et al. 2001). Together, this justifies the common description of CD1 and MR1 genes as non-polymorphic.

Because of its non-polymorphic nature, shared structural features, and the fact that it presents small metabolite antigens rather than peptides, MR1 is often compared to CD1. However, MR1 genes are distinct from CD1 genes, and, based on sequence alignments, form their own interspecies group. Even though in humans MR1 and CD1 are located in the same MHC paralogous region on chromosome 1, the MR1 locus is separate from the CD1 cluster of genes, or CD1 locus (Hashimoto et al. 1995; Shiina et al. 2001). The CD1 locus is located between the KIRREL and olfactory receptor genes and consists of multiple genes, often in both orientations. The MR1 gene is embedded between STX6 and IER5 genes. In mice, a chromosomal rearrangement caused the separation of the CD1 and MR1 genes, which are located on chromosome 3 and chromosome 1, respectively (Dascher and Brenner 2003). In the available assembled primate data, MR1 and CD1 are located on the same chromosome, but in other mammals that were studied, like cow and pig, they are not (Goldfinch et al. 2010; Reinink and Van Rhijn 2016).

The human genome encodes one MR1 protein and five different CD1 proteins, called CD1a through CD1e. These five CD1 proteins and their orthologs are called CD1 isoforms. Even though for other proteins this term is used to define RNA splice variants derived from the same gene, in the CD1 field, the word isoforms is used to indicate products of separate genes. The overall structure of CD1 and MR1 proteins resembles MHC class I molecules: a type I transmembrane protein, called the heavy chain, consisting of α1, α2, and α3 domains associated with β2 microglobulin. CD1 and MR1 genes have an intron-exon structure comparable to MHC class I genes: they contain 6 exons that encode 5′ UTR and leader signal peptide, α1 domain, α2 domain, α3 domain, transmembrane domain, and cytoplasmic tail and 3′ UTR combined (Yamaguchi et al. 1998). CD1 genes in mammals are named after the human CD1 isoform they group with, based on sequence comparison. Comparison, based on overall alignment of the full coding sequence or only the α1 and α2 domains that form the antigen binding cleft, gives identical results. This is caused by the fact that α3 domains are highly conserved among all CD1 isoforms within one species (Balk et al. 1989), and cytoplasmic domains, even though they show considerable differences among the isoforms, are very short. Thus, effectively, CD1 isoforms are grouped and named according to resemblance of the sequence encoding their antigen-binding cleft-forming α1 and α2 domains.

CD1 in humans and common research, farm, and companion animals

For nine mammalian species (human, rabbit, guinea pig, cow, pig, dog, horse, mouse, and rat), the CD1 genes have been studied extensively and, except for rabbit and guinea pig, their CD1 loci have been carefully mapped and curated. The functionality of many of these CD1 genes has been studied by cloning the transcripts from cDNA, sometimes followed by protein expression studies. Genomes from these nine mammalian species contain from one (rat (Katabami et al. 1998)) to thirteen (horse and dog (Dossa et al. 2014; Schjaerff et al. 2016)) CD1 genes. From rabbits, two CD1a, two CD1b, one CD1d, and one CD1e transcripts have been identified, but this study was not set up or intended to define the complete rabbit CD1 locus (Hayes and Knight 2001). Southern blots suggested the presence of at least eight CD1 genes in the rabbit (Calabi et al. 1989). Guinea pigs have been reported to contain four functional genes for CD1b, three for CD1c, one for CD1e gene, and at least five CD1 pseudogenes (Dascher et al. 1999). Later, one functional gene for CD1d was described (Looringh van Beeck et al. 2009), and one gene of unknown functionality for CD1a (Van Rhijn and Moody 2015). The guinea pig CD1 locus has not been mapped, but while the genome is being updated and assembled, even more CD1 genes have been identified in guinea pig (Reinink and Van Rhijn 2016). The first attempt to define the bovine CD1 locus was based on an early draft of the bovine genome and identified one CD1a gene, five CD1b genes, two CD1d genes, and a CD1e gene. Among these, the CD1a gene, three CD1b genes, and the CD1e were identified as functional genes (Van Rhijn et al. 2006). A later version of the genome brought the number of bovine CD1 genes to 12 (Nguyen et al. 2015). The porcine CD1 locus has been described based on BAC sequencing and contains six CD1 genes: two genes for CD1a, and one for each of the other isoforms (Eguchi-Ogawa et al. 2007). One of the CD1a genes is a pseudogene. Twelve CD1 genes were mapped in the canine locus based on the available genome at the time, and a 13th gene was not mapped in the locus and was thought to be an allele (Looringh van Beeck et al. 2008). Subsequently, BAC sequencing placed the 13th gene in the locus, which is now known to contain nine CD1a genes, and one gene or each of the other isoforms (Schjaerff et al. 2016). Four of the canine CD1a genes are thought to be functional. The horse genome contains 18 CD1 genes, among which five pseudogenes (Dossa et al. 2014). The functional equine CD1 genes are seven CD1a genes, two CD1b genes, one CD1c gene, one CD1d gene, and two CD1e genes. The mouse genome, with its small locus consisting of only two CD1d genes, has been shown to have undergone a rearrangement that caused the loss of the other CD1 genes (Bradbury et al. 1988; Dascher and Brenner 2003). Rats have only one CD1 gene, which encodes CD1d (Ichimiya et al. 1994).

CD1 loci in less well studied mammalian genomes

Only a very small number of CD1 loci of the total of almost 4000 extant species of placental mammals have been studied. For species other than human, rabbit, guinea pig, cow, pig, dog, horse, mouse, and rat, the study of CD1 genes is now facilitated by the availability of multiple genomes. It should be noted however that many genomes, especially from species other than primates and rodents, are not yet finalized and contain multiple gaps. Also, not all sequence materials may have been assigned to a chromosomal location yet, and artifacts like duplications may be present. Therefore, attempts to describe a CD1 locus based on genomic data available at a certain point in time may need to be adjusted later based on improved versions of the genome (Nguyen et al. 2015; Schjaerff et al. 2016). Because transcription and correct splicing of existing genes cannot reliably be predicted, mRNA- and cDNA-based sequences provide the most reliable data on functionality and the possible expression of individual CD1 proteins in vivo. However, RNA sequence-based data are unlikely to provide a complete overview of CD1 proteins in a species because expression of CD1 genes is often limited to a specific tissue and cell type. Therefore, to get an indication of the total number of CD1 genes in a species, searches using the basic local alignment search tool (BLAST) or collection of annotated genes in genomes provide more reliable information than expressed sequence tags or cDNA databases. Table 1 shows numbers of CD1 genes obtained from whole genomes of 15 mammals, including less well studied species like alpaca, dolphin, elephant, two bat species, panda, and sloth (Reinink and Van Rhijn 2016). Of note, BLAST-based searches reveal functional genes and pseudogenes. Therefore, differences between results of de novo BLAST searches, automated annotation of open reading frames, and published data with regard to the total number of CD1 genes per species are expected.

Table 1.

CD1 and MR1 gene numbers

Common name Genome Binomial species name CD1a CD1b CD1c CD1d CD1e Total CD1 MR1
Alpaca vicPac2 Vicugna pacos 1 1 1 1 1 5 1
Bonobo panPan1 Pan paniscus 1 1 1 1 1 5 2
Chimpanzee panTro4 Pan troglodytes 1 1 1 1 1 5 2
Dog CanFam3 Canis lupus 9 1 1 1 1 13 1
Dolphina turTru2 Tursiops truncatus 0 1 0 0 0 1 0
Elephant loxAfr3 Loxodonta africana 1 2 1 1 1 6 1
Horse equCab2 Equus caballus 9 2 2 1 2 16 1
Human hg38 Homo sapiens 1 1 1 1 1 5 2
Megabat pteVam1 Pteropus vampyrus 3 1 1 0 1 6 1
Microbat myoLuc2 Myotis lucifugus 17 2 0 5 2 26 1
Mouse mm10 Mus musculus 0 0 0 2 0 2 1
Panda ailMel1 Ailuropoda melanoleuca 8 1 1 1 1 12 1
Pig susScr3 Sus scrofa 2 1 1 1 2 7 1
Rabbit oryCun2 Oryctolagus cuniculus 5 2 0 1 2 10 0
Rhesus macaque rheMac3 Macaca mulatta 2 1 1 1 1 6 2
Slotha choHof1 Choloepus hoffmanni 1 0 0 1 0 2 1

For each of the indicated mammalian genomes, a list of CD1 and MR1 genes as determined by BLAST-based searches was merged with a list of Ensembl-annotated CD1 and MR1 genes when available (adapted from (Reinink and Van Rhijn 2016)). Redundancies (genes with identical genomic location) were removed

aThe dolphin and sloth genomes are not completely assembled and consist of relatively small contigs, which may have led to the fragmentation of CD1 or MR1 genes and subsequent failure of identification

CD1 genes have undergone multiple duplications in most mammals, leading to extended multigene families. The general picture that emerges is that CD1a has undergone the most extensive multiplications and CD1e the least or none. Automated annotations of CD1 isoforms of species other than mouse or human often indicate a certain degree of uncertainty, and are sometimes annotated as “CD1a-like,” for example. However, the golden standard for isoform assignment is sequence alignment of the combined α1 and α2 domains. All CD1 genes from 15 mammals that were recovered group with one of the five known CD1 isoforms, but were not always correctly named by automated annotation (Fig. 1). The clear grouping with the five isoforms suggests that the isoform nomenclature that was based on the human CD1 locus appropriately describes all currently known mammalian CD1 isoforms. With more than 10 CD1 genes, the little brown bat, Myotis lucifugus, can be considered a species with high numbers of CD1 genes. Intermediate numbers are found primates, alpaca, and elephant. These data confirm once more that mouse and rat are atypical with the lowest absolute number of CD1 genes (two and one, respectively). Though not identical, the gene numbers obtained by BLAST-based searches and automated annotation are largely consistent (Reinink and Van Rhijn 2016). Because misassemblies of repeated sequences and unmerged overlaps due to polymorphisms resulting in artificial duplications can occur in incompletely assembled genomes, we expect that in some species, the exact numbers of CD1 genes will be adjusted in the future. However, it is clear that humans should be considered to have an intermediate number of CD1 genes, while many other mammals have more extensive families of CD1 genes. Large CD1 families are often dominated by multiplied CD1a genes and to a lesser degree, CD1b, CD1c, and CD1d genes.

Fig. 1.

Fig. 1

CD1 and MR1 genes in mammals. From 16 mammals, known CD1 genes and predicted CD1 paralog open reading frames were obtained from Ensembl (www.ensembl.org). An alignment of these sequences and human MICA, MICB, HLA-A, HLA-B, and murine H2-M3 was generated by MUSCLE (Edgar 2004), clustered according to a neighbor joining algorithm, and shown as a radial cladogram. Groups were color-coded based on the clustering with human CD1 isoforms

Mammalian MR1 genes

Searches in whole genomes (Table 1), as well as other published data, show that primates have two MR1 genes. In humans and chimpanzees, one of these genes is known to be functional and the other one is a pseudogene (Parra-Cuadrado et al. 2001; Parra-Cuadrado et al. 2000). No MR1 gene could be identified in the rabbit genome, which is in line with recently published data on lagomorphs (rabbit and pika) (Boudinot et al. 2016). The panda and dog MR1 genes that resulted from BLAST-based searches (Table 1) are most likely pseudogenes because it was shown that members of the order of Carnivora, including cats, dogs, ferrets, and pandas, have a MR1 pseudogene and lack a functional MR1 gene (Boudinot et al. 2016). Other mammals (alpaca, elephant, horse, bat) also have one MR1 gene. While in dolphin no MR1 gene was found, this does not conclusively prove the absence of MR1 in this species because the dolphin genome is still incompletely assembled. The sloth MR1 gene has a gap in the current version of the sloth genome (choHof1).

Functional specialization of CD1 isoforms

Interspecies comparisons between individual CD1 proteins have been made based on sequence alignment, protein models (canine versus human CD1a (Looringh van Beeck et al. 2008)), or crystal structures (bovine versus human CD1b (Girardi et al. 2010), bovine versus human CD1d (Wang et al. 2012), and murine versus human CD1d (Koch et al. 2005)). CD1 proteins within one species differ from each other in ways that suggest functional specialization. One aspect determines the function of a CD1 protein is the size and shape of the antigen-binding cleft, which is formed by the α1 and α2 helices. Among the human CD1 proteins, the biggest size difference is observed between CD1a and CD1b. In addition, there are significant differences in shape with the CD1b cleft consisting of four pockets (Gadola et al. 2002) of which three are interconnected and the CD1a cleft consisting of one cavity (Zajonc et al. 2003). These differences translate into differences in size and shape of antigens that can be bound by the human CD1a and CD1b proteins. Because the CD1 isoforms form interspecies groups based on phylogenetic relationship based on the sequences of their α1 and α2 helices, functional specializations of CD1 molecules that relate to the antigen-binding cleft are generally conserved across species. Especially, the size of the cleft seems to be well conserved when human CD1 isoforms are compared with an ortholog. However, as seen in the bovine CD1d molecule, despite a very high homology between human and bovine CD1d, tryptophan 166 in the bovine protein, which is a cysteine in humans and mice, blocks part of the A’ pocket so that long fatty acyl chains will not fit (Wang et al. 2012). A comparable situation exists when human CD1b and bovine CD1b3 are compared: the tunnel that is present in the human CD1b protein is closed in bovine CD1b3 by valine 98, which prevents the binding of extremely long ligands (Girardi et al. 2010). Structural aspects of these CD1 molecules will be reviewed by Zajonc et al. in this special issue of Immunogenetics. Thus, despite very high sequence homology and overall structural resemblance between orthologs, a single amino acid difference can have profound functional impact.

One specific feature of CD1 molecules that is difficult to predict from crude genomic data is the cytoplasmic tail, which determines the transport and subcellular location of individual CD1 isoforms (Moody and Porcelli 2003). The cytoplasmic tail is encoded by the very small exon 5 that is located between two longer introns of variable length and cannot reliably be predicted based on the genomic sequence of the CD1 gene. However, the many cDNA sequences that are available have provided a number of cytoplasmic tails that allows for comparative analysis and predictions on intracellular trafficking of these CD1 proteins (Table 2).

Table 2.

Cytoplasmic tails of CD1 proteins in mammals

Gene name (alias) Cytoplasmic tail Motif Reference cDNA
boCD1a2 RKSWSTYMSDA (Nguyen et al. 2015)
boCD1a1 WKHWTHRESPSSVLPLE (Van Rhijn et al. 2006)
boCD1a
canCD1a2 KAHWRPQCMDFPSEREPSSPSSSTYLNPAQH (Schjaerff et al. 2016)
canCD1a6 KRWKTHNRPQCTDFPLK (Looringh van Beeck et al. 2008)
canCD1a9 KAHWRPQCTDFPSEQEPSSPGSSTYLNPAQH (Looringh van Beeck et al. 2008; Schjaerff et al. 2016)
canCD1a8.1
canCD1a8 KRWKAH (Looringh van Beeck et al. 2008; Schjaerff et al. 2016)
canCD1a8.2
eqCD1a1 THCEAPCTIVPLK (Dossa et al. 2014)
eqCD1a2 IRHQLQRTLLPLD Dileucine (Dossa et al. 2014)
eqCD1a3 IHSELPRTLLPLE Dileucine (Dossa et al. 2014)
eqCD1a4 VISISVSILVRKPCATPRTPLPSQ (Dossa et al. 2014)
eqCD1a5 RSCESASNLLWNEIPGAQDPGHI Dileucine (Dossa et al. 2014)
eqCD1a6 WLRKRWTRCEPPSNLISLE (Dossa et al. 2014)
eqCD1a7 WLRKRGTHCEFPRTCLPLE (Dossa et al. 2014)
huCD1a RKRCFC (Calabi and Milstein 1986; Martin et al. 1987)
pigCD1a1 WHRKHWKHCDPSSALHRLE (Chun et al. 1999; Eguchi-Ogawa et al. 2007)
pCD1.1
rabCD1a1 RKCWIHHGPLETLLPLQ Dileucine (Hayes and Knight 2001)
rabCD1a2 KKRWSHHGSPNSLLPLK Dileucine (Hayes and Knight 2001)
boCD1b1 RFMGSHRVGHD (Nguyen et al. 2015; Van Rhijn et al. 2006)
boCD1b3 RRWSYQNIL YXXZ, dileucine (Nguyen et al. 2015; Van Rhijn et al. 2006)
boCD1b5 RRWSYQTIL YXXZ, dileucine (Nguyen et al. 2015)
canCD1b RRWSYQSIS YXXZ (Looringh van Beeck et al. 2008)
eqCD1b1 SYQNIS YXXZ (Dossa et al. 2014)
eqCD1b2 SYLNIP YXXZ (Dossa et al. 2014)
gpCD1b1 RRWSYEDIL YXXZ, dileucine (Dascher et al. 1999; Hiromatsu et al. 2002)
gpCD1b2 KHWSYQDIL YXXZ, dileucine (Dascher et al. 1999; Dascher et al. 2002)
gpCD1b3 RRLRCEGIF (Dascher et al. 1999; Dascher et al. 2002)
gpCD1b4 RRWSYEDIF YXXZ (Dascher et al. 1999)
huCD1b RRRSYQNIP YXXZ (Martin et al. 1987)
ovCD1b RRWSYQNIL YXXZ, dileucine (Ferguson et al. 1996)
scd1b42
ovCD1b RRWSHRNIL Dileucine (Ferguson et al. 1996)
scd1b52
ovCD1b RRWSYLTIL YXXZ, dileucine (Ferguson et al. 1996)
scd1a25
pigCD1b RRWSYQSVL YXXZ (Eguchi-Ogawa et al. 2007)
rabCD1b RRRSYQNIL YXXZ, dileucine (Calabi et al. 1989; Hayes and Knight 2001)
canCD1c RKCCSYQDIP YXXZ (Looringh van Beeck et al. 2008)
eqCD1c SYQNIQRDSSPCFPHCNENCTAQEHRTTE YXXZ (Dossa et al. 2014)
gpCD1c1 KRCTYQGIQ YXXZ (Dascher et al. 1999)
gpCD1c2 KRCTYQGIP YXXZ (Dascher et al. 1999)
gpCD1c3 KKCCTYQGIPa YXXZ (Dascher et al. 1999)
huCD1c KKHCSYQDIL YXXZ, dileucine (Martin et al. 1987)
eqCD1d KKRSSYQDIL YXXZ, dileucine (Dossa et al. 2014; Looringh van Beeck et al. 2009)
gpcD1d RRGRSYQDIL YXXZ, dileucine (Looringh van Beeck et al. 2009)
huCD1d RFKQTSYQGVL YXXZ, dileucine (Balk et al. 1989)
lafCD1d KRHCS (Looringh van Beeck et al. 2009)
moCD1d1 RRRSAYQDIR YXXZ (Bradbury et al. 1988)
moCD1d2 RRRSAYQDIR YXXZ (Bradbury et al. 1988)
ovCD1d RKHRRYQDIS YXXZ, dileucine (Rhind et al. 1999)
scd1d
pigCD1d RRRVYQNIQ YXXZ (Eguchi-Ogawa et al. 2007)
rabCD1d RRRCSYQGIL YXXZ, dileucine (Calabi et al. 1989; Looringh van Beeck et al. 2009)
ratCD1d RRRSYQDIM YXXZ (Ichimiya et al. 1994)

The cytoplasmic tails of CD1 proteins, grouped by isoform. Only tail sequences that have been confirmed by cDNA sequences are included. Species from which the sequences were derived are indicated as bo: bovine; can: canine; eq: equine; gp: guinea pig; hu: human; laf: African elephant; mo: mouse; ov: ovine; rab: rabbit; dileucine: modified dileucine motif

aThe sequence shown is based on GenBank sequence NM_001172855, but has been published as KKCCTYQGFP (Dascher et al. 1999)

Intracellular trafficking is determined by the CD1 cytoplasmic tail

Cellular lipids are bound in the antigen binding cleft during synthesis of the CD1 protein. When the CD1 protein recycles and encounters other lipids in an endosomal compartment, these may replace the initial endoplasmic reticulum-derived lipid. The type of endosomal compartment a lipid travels to is thus a determining factor in the kind of antigenic lipids it presents. The cytoplasmic tails of CD1 molecules may or may not contain short consensus sequences for adaptor proteins. CD1b, CD1c, or CD1d molecules often carry a modified dileucine motif and/or a tyrosine-based YXXZ motif, where X is any amino acid, and Z is a bulky hydrophobic amino acid. YXXZ motifs interact with adaptor proteins and are responsible for recycling from the cell surface to intermediate and late endosomal compartments (Briken et al. 2002; Sugita et al. 2002). CD1e does not appear at the cell surface and does not recycle and has a function in lipid antigen loading and processing (Angenieux et al. 2005; de la Salle et al. 2005).

Comparing the cytoplasmic tails of all cell surface-expressed isoforms across species, the following general picture emerges: CD1a tails are highly variable in length. Most CD1a tails contain histidine residues, with unknown function, or cysteines, which can be palmitoylated. The human CD1a cytoplasmic tail contains a cysteine residue that is palmitoylated and involved in incorporation in lipid rafts (Barral et al. 2008; Sloma et al. 2008), but the mechanism of recycling to the sorting or early endosomal compartment, which is so typical for human CD1a, is unknown. None of the 17 known cytoplasmic tails of CD1a proteins has a modified dileucine- and/or a tyrosine-based YXXZ motif, consistent with the absence from intermediate and late endosomes of the human CD1a protein, and strongly suggesting that this is an evolutionary conserved feature (Table 2 ). CD1b, CD1c, and CD1d tails are of comparable length, typically contain a YXXZ motif, and sometimes an additional modified dileucine motif. In addition, all known CD1c cytoplasmic tails contain a cysteine of unknown function. There are exceptions to these general observations, like the bovine CD1b1 (Nguyen et al. 2015), guinea pig CD1b3 (Dascher et al. 1999), and African elephant CD1d (Looringh van Beeck et al. 2009) cytoplasmic tails, which lack any known motif. Indeed, guinea pig CD1b3 does not travel to late endosomes and rather has a CD1a-like subcellular distribution pattern (Dascher et al. 2002). However, the bovine CD1b3 and CD1b5 and guinea pig CD1b1, CD1b2, and CD1b4 have “typical” CD1b cytoplasmic tails. Thus, cows and guinea pigs have typical and atypical cytoplasmic tails among the proteins belonging to the CD1b isoform. For the African elephant, no other cytoplasmic tails have been sequenced from cDNA yet. Generally speaking, it seems that when there are multiple proteins of one CD1 isoform in a species, there is functional diversification, but when a species has just one gene for a certain CD1 isoform, this one gene usually shows the “typical” combination of cleft and tail.

Expression patterns of CD1 isoforms

Human cortical thymocytes express high levels of CD1a, CD1b, CD1c, and CD1d on their cell surface. Broad CD1 expression in the thymus was confirmed in all other species where CD1 expression was studied (Hiromatsu et al. 2002; Howard et al. 1993; Looringh van Beeck et al. 2008; Van Rhijn et al. 2006). Outside the thymus, the general picture that emerges is that CD1d is widely expressed at a low level, with some specific high CD1d-expressing cells in certain tissues, like hepatic stellate cells in the liver (Winau et al. 2007), and marginal zone B cells in the spleen (Makowska et al. 1999). In humans, CD1b expression is limited to dendritic cells; CD1a is expressed by dendritic cells and at high levels by Langerhans cells in the skin, and CD1c is expressed by dendritic cells and subsets of B cells. Like humans, dogs express CD1a in the skin and thymus (Looringh van Beeck et al. 2008). However, only two of the three studied canine CD1a genes (out of the nine CD1a genes in the dog CD1 locus) were preferentially expressed in the skin. In rabbits, one of the two CD1a genes is preferentially expressed in the skin, while the other one has a more general expression pattern (Hayes and Knight 2001). In guinea pigs, different CD1b genes have different expression patterns in peripheral blood and spleen (Hiromatsu et al. 2002). Mice that are transgenic for the part of the human locus that encodes CD1a, CD1b, CD1c, and CD1e with their endogenous promoters show a CD1 expression pattern that is surprisingly comparable to the human CD1 expression pattern in that all isoforms are expressed on thymocytes and lymph node dendritic cells, while CD1a stands out as highly expressed on Langerhans cells, and CD1c is expressed on B cells (Felio et al. 2009).

Closing remarks

A typical pattern of gene distribution has been described for immune function-related genes including genes that encode TCRs, immunoglobulins, classical MHC molecules, and NK receptors. This pattern is the result of gene family expansion, diversification, and contraction or “birth and death” evolution, and usually leads to families of genes that include a considerable number of pseudogenes (Nei and Rooney 2005). Furthermore, complete loss of certain family members and expansion of other family members are observed when contemporary animal species are compared. With its many duplications, deletions, and pseudogenes, the CD1 loci seem to follow this pattern. Although MR1 genes have been subject to inactivation in the carnivores, extensive gene family expansion and diversification were not observed in any of the species studied. The lack of functional diversification of MR1 genes is hard to explain but may be related to its seemingly exclusive interaction with the highly specialized MAIT cells.

Even though mammals vary widely in the numbers of CD1 genes they have, some general observations can be made and help understand the function of the different CD1 isoforms. The skin seems to be a preferred site for CD1a expression across species. While CD1a genes have undergone extensive multiplication in some species, even more so than CD1b and CD1c, none of the currently described CD1a proteins contains any known signal for endosomal location. Multiplication and diversification of genes may have occurred in response to changes in the environment, including pathogenic and non-pathogenic microbes, to which evolving mammals were exposed.

Acknowledgments

This work was supported by The Netherlands Organisation for Scientific Research (NWO) grant 824.02.002.

Footnotes

This article is published in the Special Issue CD1, MR1, NKT, and MAIT: Evolution and Origins of Non-peptidic Antigen Recognition by T lymphocytes with Guest Editor Dr. Dirk Zajonc

References

  1. Angenieux C, Fraisier V, Maitre B, Racine V, van der Wel N, Fricker D, Proamer F, Sachse M, Cazenave JP, Peters P, Goud B, Hanau D, Sibarita JB, Salamero J, de la Salle H. The cellular pathway of CD1e in immature and maturing dendritic cells. Traffic. 2005;6:286–302. doi: 10.1111/j.1600-0854.2005.00272.x. [DOI] [PubMed] [Google Scholar]
  2. Baker ML, Miller RD. Evolution of mammalian CD1: marsupial CD1 is not orthologous to the eutherian isoforms and is a pseudogene in the opossum Monodelphis domestica. Immunology. 2007;121:113–121. doi: 10.1111/j.1365-2567.2007.02545.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Balk SP, Bleicher PA, Terhorst C. Isolation and characterization of a cDNA and gene coding for a fourth CD1 molecule. Proc Natl Acad Sci U S A. 1989;86:252–256. doi: 10.1073/pnas.86.1.252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Barral DC, Cavallari M, McCormick PJ, Garg S, Magee AI, Bonifacino JS, De Libero G, Brenner MB. CD1a and MHC class I follow a similar endocytic recycling pathway. Traffic. 2008;9:1146–1157. doi: 10.1111/j.1600-0854.2008.00781.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boudinot P, Mondot S, Jouneau L, Teyton L, Lefranc MP, Lantz O (2016) Restricting nonclassical MHC genes coevolve with TRAV genes used by innate-like T cells in mammals. Proc Natl Acad Sci USA 113:E2983-E2992 [DOI] [PMC free article] [PubMed]
  6. Bradbury A, Belt KT, Neri TM, Milstein C, Calabi F. Mouse CD1 is distinct from and co-exists with TL in the same thymus. EMBO J. 1988;7:3081–3086. doi: 10.1002/j.1460-2075.1988.tb03173.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Briken V, Jackman RM, Dasgupta S, Hoening S, Porcelli SA. Intracellular trafficking pathway of newly synthesized CD1b molecules. EMBO J. 2002;21:825–834. doi: 10.1093/emboj/21.4.825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Calabi F, Belt KT, Yu CY, Bradbury A, Mandy WJ, Milstein C. The rabbit CD1 and the evolutionary conservation of the CD1 gene family. Immunogenetics. 1989;30:370–377. doi: 10.1007/BF02425277. [DOI] [PubMed] [Google Scholar]
  9. Calabi F, Milstein C. A novel family of human major histocompatibility complex-related genes not mapping to chromosome 6. Nature. 1986;323:540–543. doi: 10.1038/323540a0. [DOI] [PubMed] [Google Scholar]
  10. Cheng Y, Belov K. Characterisation of non-classical MHC class I genes in the Tasmanian devil (Sarcophilus harrisii) Immunogenetics. 2014;66:727–735. doi: 10.1007/s00251-014-0804-3. [DOI] [PubMed] [Google Scholar]
  11. Chun T, Wang K, Zuckermann FA, Gaskins HR. Molecular cloning and characterization of a novel CD1 gene from the pig. J Immunol. 1999;162:6562–6571. [PubMed] [Google Scholar]
  12. Dascher CC. Evolutionary biology of CD1. Curr Top Microbiol Immunol. 2007;314:3–26. doi: 10.1007/978-3-540-69511-0_1. [DOI] [PubMed] [Google Scholar]
  13. Dascher CC, Brenner MB. Evolutionary constraints on CD1 structure: insights from comparative genomic analysis. Trends Immunol. 2003;24:412–418. doi: 10.1016/S1471-4906(03)00179-0. [DOI] [PubMed] [Google Scholar]
  14. Dascher CC, Hiromatsu K, Naylor JW, Brauer PP, Brown KA, Storey JR, Behar SM, Kawasaki ES, Porcelli SA, Brenner MB, LeClair KP. Conservation of a CD1 multigene family in the guinea pig. J Immunol. 1999;163:5478–5488. [PubMed] [Google Scholar]
  15. Dascher CC, Hiromatsu K, Xiong X, Sugita M, Buhlmann JE, Dodge IL, Lee SY, Roura-Mir C, Watts GF, Roy CJ, Behar SM, Clemens DL, Porcelli SA, Brenner MB. Conservation of CD1 intracellular trafficking patterns between mammalian species. J Immunol. 2002;169:6951–6958. doi: 10.4049/jimmunol.169.12.6951. [DOI] [PubMed] [Google Scholar]
  16. de la Salle H, Mariotti S, Angenieux C, Gilleron M, Garcia-Alles LF, Malm D, Berg T, Paoletti S, Maitre B, Mourey L, Salamero J, Cazenave JP, Hanau D, Mori L, Puzo G, De Libero G. Assistance of microbial glycolipid antigen processing by CD1e. Science. 2005;310:1321–1324. doi: 10.1126/science.1115301. [DOI] [PubMed] [Google Scholar]
  17. Dossa RG, Alperin DC, Hines MT, Hines SA. The equine CD1 gene family is the largest and most diverse yet identified. Immunogenetics. 2014;66:33–42. doi: 10.1007/s00251-013-0741-6. [DOI] [PubMed] [Google Scholar]
  18. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Eguchi-Ogawa T, Morozumi T, Tanaka M, Shinkai H, Okumura N, Suzuki K, Awata T, Uenishi H. Analysis of the genomic structure of the porcine CD1 gene cluster. Genomics. 2007;89:248–261. doi: 10.1016/j.ygeno.2006.10.003. [DOI] [PubMed] [Google Scholar]
  20. Felio K, Nguyen H, Dascher CC, Choi HJ, Li S, Zimmer MI, Colmone A, Moody DB, Brenner MB, Wang CR. CD1-restricted adaptive immune responses to mycobacteria in human group 1 CD1 transgenic mice. J Exp Med. 2009;206:2497–2509. doi: 10.1084/jem.20090898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Ferguson ED, Dutia BM, Hein WR, Hopkins J. The sheep CD1 gene family contains at least four CD1B homologues. Immunogenetics. 1996;44:86–96. [PubMed] [Google Scholar]
  22. Gadola SD, Zaccai NR, Harlos K, Shepherd D, Castro-Palomino JC, Ritter G, Schmidt RR, Jones EY, Cerundolo V. Structure of human CD1b with bound ligands at 2.3A, a maze for alkyl chains. Nat Immunol. 2002;3:721–726. doi: 10.1038/nrm947. [DOI] [PubMed] [Google Scholar]
  23. Girardi E, Wang J, Mac TT, Versluis C, Bhowruth V, Besra G, Heck AJ, Van Rhijn I, Zajonc DM. Crystal structure of bovine CD1b3 with endogenously bound ligands. J Immunol. 2010;185:376–386. doi: 10.4049/jimmunol.1000042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Goldfinch N, Reinink P, Connelley T, Koets A, Morrison I, Van Rhijn I. Conservation of mucosal associated invariant T (MAIT) cells and the MR1 restriction element in ruminants, and abundance of MAIT cells in spleen. Vet Res. 2010;41:62. doi: 10.1051/vetres/2010034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Han M, Hannick LI, DiBrino M, Robinson MA. Polymorphism of human CD1 genes. Tissue Antigens. 1999;54:122–127. doi: 10.1034/j.1399-0039.1999.540202.x. [DOI] [PubMed] [Google Scholar]
  26. Hashimoto K, Hirai M, Kurosawa Y. A gene outside the human MHC related to classical HLA class I genes. Science. 1995;269:693–695. doi: 10.1126/science.7624800. [DOI] [PubMed] [Google Scholar]
  27. Hayes SM, Knight KL. Group 1 CD1 genes in rabbit. J Immunol. 2001;166:403–410. doi: 10.4049/jimmunol.166.1.403. [DOI] [PubMed] [Google Scholar]
  28. Hiromatsu K, Dascher CC, Sugita M, Gingrich-Baker C, Behar SM, LeClair KP, Brenner MB, Porcelli SA. Characterization of guinea-pig group 1 CD1 proteins. Immunology. 2002;106:159–172. doi: 10.1046/j.1365-2567.2002.01422.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Howard CJ, Sopp P, Bembridge G, Young J, Parsons KR. Comparison of CD1 monoclonal antibodies on bovine cells and tissues. Vet Immunol Immunopathol. 1993;39:77–83. doi: 10.1016/0165-2427(93)90166-2. [DOI] [PubMed] [Google Scholar]
  30. Ichimiya S, Kikuchi K, Matsuura A. Structural analysis of the rat homologue of CD1. Evidence for evolutionary conservation of the CD1D class and widespread transcription by rat cells. J Immunol. 1994;153:1112–1123. [PubMed] [Google Scholar]
  31. Katabami S, Matsuura A, Chen HZ, Imai K, Kikuchi K. Structural organization of rat CD1 typifies evolutionarily conserved CD1D class genes. Immunogenetics. 1998;48:22–31. doi: 10.1007/s002510050396. [DOI] [PubMed] [Google Scholar]
  32. Koch M, Stronge VS, Shepherd D, Gadola SD, Mathew B, Ritter G, Fersht AR, Besra GS, Schmidt RR, Jones EY, Cerundolo V. The crystal structure of human CD1d with and without alpha-galactosylceramide. Nat Immunol. 2005;6:819–826. doi: 10.1038/ni1225. [DOI] [PubMed] [Google Scholar]
  33. Looringh van Beeck FA, Reinink P, Hermsen R, Zajonc DM, Laven MJ, Fun A, Troskie M, Schoemaker NJ, Morar D, Lenstra JA, Vervelde L, Rutten VP, van Eden W, Van Rhijn I. Functional CD1d and/or NKT cell invariant chain transcript in horse, pig, African elephant and guinea pig, but not in ruminants. Mol Immunol. 2009;46:1424–1431. doi: 10.1016/j.molimm.2008.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Looringh van Beeck FA, Zajonc DM, Moore PF, Schlotter YM, Broere F, Rutten VP, Willemse T, Van Rhijn I. Two canine CD1a proteins are differentially expressed in skin. Immunogenetics. 2008;60:315–324. doi: 10.1007/s00251-008-0297-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Makowska A, Faizunnessa NN, Anderson P, Midtvedt T, Cardell S. CD1high B cells: a population of mixed origin. Eur J Immunol. 1999;29:3285–3294. doi: 10.1002/(SICI)1521-4141(199910)29:10<3285::AID-IMMU3285>3.0.CO;2-P. [DOI] [PubMed] [Google Scholar]
  36. Martin LH, Calabi F, Lefebvre FA, Bilsland CA, Milstein C. Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c. Proc Natl Acad Sci U S A. 1987;84:9189–9193. doi: 10.1073/pnas.84.24.9189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Miller MM, Wang C, Parisini E, Coletta RD, Goto RM, Lee SY, Barral DC, Townes M, Roura-Mir C, Ford HL, Brenner MB, Dascher CC. Characterization of two avian MHC-like genes reveals an ancient origin of the CD1 family. Proc Natl Acad Sci U S A. 2005;102:8674–8679. doi: 10.1073/pnas.0500105102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Moody DB, Porcelli SA. Intracellular pathways of CD1 antigen presentation. Nat Rev Immunol. 2003;3:11–22. doi: 10.1038/nri979. [DOI] [PubMed] [Google Scholar]
  39. Nei M, Rooney AP. Concerted and birth-and-death evolution of multigene families. Annu Rev Genet. 2005;39:121–152. doi: 10.1146/annurev.genet.39.073003.112240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Nguyen TK, Reinink P, El Messlaki C, Im JS, Ercan A, Porcelli SA, Van Rhijn I. Expression patterns of bovine CD1 in vivo and assessment of the specificities of the anti-bovine CD1 antibodies. PLoS One. 2015;10:e0121923. doi: 10.1371/journal.pone.0121923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Oteo M, Arribas P, Setien F, Parra JF, Mirones I, Gomez del Moral M, Martinez-Naves E. Structural characterization of two CD1A allelic variants. Hum Immunol. 2001;62:1137–1141. doi: 10.1016/S0198-8859(01)00314-7. [DOI] [PubMed] [Google Scholar]
  42. Parra-Cuadrado JF, Gomez del Moral M, Garcia-Pavia P, Setien F, Martinez-Naves E. Characterization of the MHC class I-related MR1 locus in nonhuman primates. Immunogenetics. 2001;53:643–648. doi: 10.1007/s00251-001-0380-1. [DOI] [PubMed] [Google Scholar]
  43. Parra-Cuadrado JF, Navarro P, Mirones I, Setien F, Oteo M, Martinez-Naves E. A study on the polymorphism of human MHC class I-related MR1 gene and identification of an MR1-like pseudogene. Tissue Antigens. 2000;56:170–172. doi: 10.1034/j.1399-0039.2000.560211.x. [DOI] [PubMed] [Google Scholar]
  44. Reinink P, Van Rhijn I. In silico method for identification of MHC class I-like molecules in whole genomes. BioRxiv. 2016 [Google Scholar]
  45. Rhind SM, Hopkins J, Dutia BM. Amino-terminal sequencing of sheep CD1 antigens and identification of a sheep CD1D gene. Immunogenetics. 1999;49:225–230. doi: 10.1007/s002510050483. [DOI] [PubMed] [Google Scholar]
  46. Salomonsen J, Sorensen MR, Marston DA, Rogers SL, Collen T, van Hateren A, Smith AL, Beal RK, Skjodt K, Kaufman J. Two CD1 genes map to the chicken MHC, indicating that CD1 genes are ancient and likely to have been present in the primordial MHC. Proc Natl Acad Sci U S A. 2005;102:8668–8673. doi: 10.1073/pnas.0409213102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Schjaerff M, Keller SM, Fass J, Froenicke L, Grahn RA, Lyons L, Affolter VK, Kristensen AT, Moore PF. Refinement of the canine CD1 locus topology and investigation of antibody binding to recombinant canine CD1 isoforms. Immunogenetics. 2016;68:191–204. doi: 10.1007/s00251-015-0889-3. [DOI] [PubMed] [Google Scholar]
  48. Shiina T, Ando A, Suto Y, Kasai F, Shigenari A, Takishima N, Kikkawa E, Iwata K, Kuwano Y, Kitamura Y, Matsuzawa Y, Sano K, Nogami M, Kawata H, Li S, Fukuzumi Y, Yamazaki M, Tashiro H, Tamiya G, Kohda A, Okumura K, Ikemura T, Soeda E, Mizuki N, Kimura M, Bahram S, Inoko H. Genomic anatomy of a premier major histocompatibility complex paralogous region on chromosome 1q21-q22. Genome Res. 2001;11:789–802. doi: 10.1101/gr.175801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Sloma I, Zilber MT, Vasselon T, Setterblad N, Cavallari M, Mori L, De Libero G, Charron D, Mooney N, Gelin C. Regulation of CD1a surface expression and antigen presentation by invariant chain and lipid rafts. J Immunol. 2008;180:980–987. doi: 10.4049/jimmunol.180.2.980. [DOI] [PubMed] [Google Scholar]
  50. Sugita M, Cao X, Watts GF, Rogers RA, Bonifacino JS, Brenner MB. Failure of trafficking and antigen presentation by CD1 in AP-3-deficient cells. Immunity. 2002;16:697–706. doi: 10.1016/S1074-7613(02)00311-4. [DOI] [PubMed] [Google Scholar]
  51. Van Rhijn I, Koets AP, Im JS, Piebes D, Reddington F, Besra GS, Porcelli SA, van Eden W, Rutten VP. The bovine CD1 family contains group 1 CD1 proteins, but no functional CD1d. J Immunol. 2006;176:4888–4893. doi: 10.4049/jimmunol.176.8.4888. [DOI] [PubMed] [Google Scholar]
  52. Van Rhijn I, Moody DB. CD1 and mycobacterial lipids activate human T cells. Immunol Rev. 2015;264:138–153. doi: 10.1111/imr.12253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Wang J, Guillaume J, Pauwels N, Van Calenbergh S, Van Rhijn I, Zajonc DM. Crystal structures of bovine CD1d reveal altered aGalCer presentation and a restricted A’ pocket unable to bind very long-chain lipids. PLoS One. 2012;7:1–10. doi: 10.1371/journal.pone.0047989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Winau F, Hegasy G, Weiskirchen R, Weber S, Cassan C, Sieling PA, Modlin RL, Liblau RS, Gressner AM, Kaufmann SH. Ito cells are liver-resident antigen-presenting cells for activating T cell responses. Immunity. 2007;26:117–129. doi: 10.1016/j.immuni.2006.11.011. [DOI] [PubMed] [Google Scholar]
  55. Yamaguchi H, Kurosawa Y, Hashimoto K. Expanded genomic organization of conserved mammalian MHC class I-related genes, human MR1 and its murine ortholog. Biochem Biophys Res Commun. 1998;250:558–564. doi: 10.1006/bbrc.1998.9353. [DOI] [PubMed] [Google Scholar]
  56. Yang Z, Wang C, Wang T, Bai J, Zhao Y, Liu X, Ma Q, Wu X, Guo Y, Ren L. Analysis of the reptile CD1 genes: evolutionary implications. Immunogenetics. 2015;67:337–346. doi: 10.1007/s00251-015-0837-2. [DOI] [PubMed] [Google Scholar]
  57. Zajonc DM, Elsliger MA, Teyton L, Wilson IA. Crystal structure of CD1a in complex with a sulfatide self antigen at a resolution of 2.15 A. Nat Immunol. 2003;4:808–815. doi: 10.1038/ni948. [DOI] [PubMed] [Google Scholar]

Articles from Immunogenetics are provided here courtesy of Springer

RESOURCES