Abstract
Killer cell Immunoglobulin-like Receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens and so considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing that lacks the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing PacBio circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probesets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES reads. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.
INTRODUCTION
Killer cell Immunoglobulin-like receptors (KIRs) are a complex family of receptors expressed on the surface of natural killer (NK) cells and subsets of T lymphocytes (Lodoen and Lanier 2006). KIRs modulate immune responses through interactions with major histocompatibility complex (MHC) class I molecules expressed on target cell surfaces (Lanier 2005; Sambrook et al. 2006). KIRs can possess either two Ig-like domains (KIR2D) or three Ig-like domains (KIR3D), a transmembrane domain and either long (L) or short (S) cytoplasmic tails capable of eliciting inhibitory or activating signaling cascades, respectively (Gardiner 2008). KIRs with one Ig-like domain and a truncated cytoplasmic tail (KIR1D) are present in some nonhuman primates, but are absent in humans (Bimber and Evans 2015; Hershberger et al. 2005).
Human KIR genes are located within the leukocyte receptor complex (LRC) on chromosome 19q13.4 along with other immunoglobulin superfamily receptor genes (Trowsdale et al. 2001; Wende et al. 2000). This genomic region contains anywhere from seven to twelve KIR genes. Fifteen genes (and two pseudogenes) have been characterized in humans to date (Carrington and Norman 2003; Uhrberg et al. 1997). Each chromosome contains a group of KIR genes that are inherited together; collectively, these comprise a KIR haplotype. The combined genes of both haplotypes define an individual’s KIR genotype. Human KIR haplotypes are defined by the presence or absence of genes other than KIR3DL3, KIR3DP1, KIR2DL4, and KIR3DL2, which are found on all haplotypes and are considered framework genes (Wilson et al. 2000). Haplotypes can be broadly categorized into group A haplotypes and more complex group B haplotypes. Group A haplotypes always contain seven loci which include the framework genes and a single activating receptor, KIR2DS4. Group B haplotypes contain variable numbers of genes that include multiple activating receptors other than KIR2DS4. Each KIR gene exhibits a high degree of nucleotide sequence polymorphism (Middleton and Gonzelez 2010). As of February 2015 (release 2.6.1), 753 unique, full-length human KIR alleles have been characterized and catalogued in the Immuno Polymorphism Database (Robinson et al. 2005).
Each KIR binds a specific set of ligands and so the genes encoded within an individual’s genome determine, in part, the specificity of potential immune responses (Carillo-Bustamante et al. 2016; Long and Rajagopalan 2000). NK cells only express a subset of an individual’s KIR genes. However, all functional KIR genes within an individual’s genome are expressed by at least one subpopulation of NK cells (Gardiner 2008; Valiante et al. 1997). Sequence polymorphisms of KIR genes are known to play an important role in levels of cell surface expression as well as binding affinities and NK cell effector function (Biassoni et al. 1995; Carr et al. 2005; Dunphy et al. 2015; Hilton et al. 2015). For instance, Yawata et al. (2006) showed that KIR3DL1*001, KIR3DL1*020, and KIR3DL1*01502 alleles exhibited higher antibody binding and were expressed by larger proportions of NK cells than KIR3DL1*005 and KIR3DL1*007 alleles. Furthermore, it was shown that the inhibitory capacity of NK cells was greater for high-expressing KIR3DL1 alleles except for KIR3DL1*005. Sequence polymorphisms between HLA genes can also influence the relative abundance of NK cell populations resulting in unequal representation of KIRs within the repertoire (Shilling et al. 2002; Yawata et al 2006). Taken together, compound KIR/HLA genotypes are crucial in the determination of an individual’s receptor repertoire and signaling capabilities.
Compound KIR/HLA genotypes have been associated with protection from a wide spectrum of pathogens including HIV, hepatitis C, malaria, and papilloma virus (Alter et al. 2007; Bonagura et al. 2010; Hansen et al. 2007; Khakoo et al. 2004; Knapp et al. 2010; Zipperlen et al. 2015). The role of KIR3DS1 and HLA-Bw4-80 Ile has been evaluated in the context of HIV infection. Multiple groups have observed an association between KIR3DS1+ and HLA-Bw4-80Ile+ genotypes and slower progression to AIDS (Jiang et al. 2013; Martin et al. 2002). Functionally, KIR3DS1+ NK cells control HIV replication within autologous CD4+ T cells in vitro (Alter et al. 2007; Zipperlen et al. 2015). KIR genotypes also affect transplantation outcomes. Donor/recipient KIR ligand mismatches reduce graft survival in hepatic and renal transplantation (Kuśnierczyk 2013; La Manna et al. 2013; Legaz et al. 2013; Van Bergen et al. 2011). For instance, Cirocco et al. (2007) and Kunert et al. (2007) independently found that acute rejection was significantly decreased in transplants involving KIR2DL2+ recipients matched with HLA-C1+ donors compared to HLA-C1- donors. Although our understanding of KIR signaling has significantly advanced in recent years, the mechanisms governing these differential immune responses remains elusive due to the complexity of KIR/HLA genetics.
Cynomolgus and rhesus macaques (Macaca fascicularis, Macaca mulatta) are widely used model organisms in the study of human infectious diseases and transplantation due to their genetic homology and ability to mount similar immune responses (Anderson and Kirk 2013; Antony and MacDonald 2015; Messaoudi et al. 2011; Schmitz and Korioth-Schmitz 2013). Significant effort has been put into characterizing the genetic diversity of KIRs and MHC in macaques, particularly in rhesus macaques of Indian origin (Blokhuis et al. 2010; Blokhuis et al. 2011; Hershberger et al. 2001; Kruse et al. 2010; Moreland et al. 2011; Sambrook et al. 2005; Wiseman et al. 2009). Twenty-two KIR genes (referred to as Mamu-KIR) are thought to exist in Indian rhesus macaques and a total of 149 full-length sequence variants of these genes are currently deposited in GenBank (Bimber and Evans 2015). The average number of expressed genes per animal is twelve (Blokhuis et al. 2011). Kruse et al. (2010) found five to eleven KIR genes/haplotype in their Indian rhesus families. In this study, they concluded, “rhesus macaque and human KIR haplotypes show a comparable level of diversity and complexity.” A study by Blokhuis et al. (2011) found 272 unique genotypes within a relatively small population of 378 rhesus macaques. For comparison, only 573 KIR genotypes have been characterized in humans despite screening 18,783 individuals from 155 different populations to date (González-Galarza et al. 2015). It is therefore likely that the diversity of KIR genotypes in macaques is at least comparable to, and may even exceed, that of humans. Because studying KIRs requires genetic control over both KIR and MHC genotypes, the extreme diversity of Mamu-KIR genotypes makes establishing a cohort of KIR/MHC identical rhesus macaques virtually impossible.
Mauritian cynomolgus macaques (MCM) provide a possible alternative due to their restricted genetic diversity. MCM are descended from a small founder population introduced on the island of Mauritius approximately 500 years ago (Sussman et al. 1986). This founder effect can be seen through MHC genetics in which seven haplotypes account for virtually all variation within the population (Wiseman et al. 2013). In a previous study we characterized the genetic diversity of KIRs in MCM (referred to as Mafa-KIR) using microsatellite analysis within a cohort of 274 MCM. Similar to MHC, eight KIR haplotypes were sufficient in describing essentially all genetic variation within the MCM population (Bimber et al. 2008). Cloning and Sanger-based sequencing identified a total of 40 alleles and 11 splice variants that segregated into one KIR1D, one KIR3DS, two KIR2DL, and four presumptive KIR3DL lineages.
Previous efforts to characterize KIRs in macaques have utilized cloning and Sanger sequencing, allele lineage-specific PCR, microsatellite analysis, and Roche/454 pyrosequencing (Bimber et al 2008; Blokhuis et al. 2010; Blokhuis et al. 2011; Kruse et al. 2010; Moreland et al. 2011; Sambrook et al. 2005). Although informative, these methods are time and resource intensive or cannot resolve allele-level genotypes unequivocally due to short products. Pacific Biosciences (PacBio) circular consensus sequencing (CCS) with single-molecule real-time (SMRT) technology produces long reads with high redundancy (Eid et al. 2009, Travers et al. 2010; Westbrook et al. 2015). Long reads eliminate the uncertainty of short read assembly required by previous massively parallel sequencing approaches to resolve full-length KIR transcripts. Here we present a novel full-length KIR allele discovery method utilizing the PacBio RSII platform. We increased the sensitivity of our assay by utilizing whole genome and macaque exome sequencing (WGS/MES) data to design primers in conserved regions of the genome. We then constructed a Mafa-KIR allele library based on PacBio amplicon sequences and demonstrated Mafa-KIR genotypes can be derived directly from genomic sequence reads. This study provides a new understanding of Mafa-KIR genetics and highlights the power of applying long-read deep sequencing techniques to KIRs and other complex immune loci.
MATERIALS AND METHODS
Animal Selection
Blood samples from 30 MCM were selected for KIR analysis (Supplementary Table 1). KIR genotypes for ten animals were determined through microsatellite analysis in a previous study representing at least one chromosome with a non-recombinant KIR region for each of the eight KIR haplotypes in the MCM population (Bimber et al. 2008).
cDNA amplicons and PacBio RS II Sequencing
RNA was isolated from peripheral blood mononuclear cells (PBMCs) or whole blood using the Promega Maxwell 16 MDx Instrument and Maxwell 16 LEV simplyRNA Cells or Maxwell 16 LEV simplyRNA Blood kits (Promega, Madison, WI, USA) according to manufacturer’s protocols. First strand complementary DNA (cDNA) was synthesized using SuperScript III First-Strand Synthesis System (Invitrogen, Carlsbad, CA, USA).
To amplify complete KIR open reading frames (ORFs), cDNA-PCR was performed to create amplicons ranging from 500-1600 bp using Phusion high-fidelity polymerase (New England Biolabs, Ipswich, MA, USA). The following primer sequences were used to generate templates for sequencing: KIR1D/3D_F (5′-CTKTCTGMACCGGCAGCACC-3′), KIR1D/3D_R (5′-GGGGTCAAGTGAAGTGGAGA-3′), KIR2DL4_F (5′-GAGTCACTGCATCCTGGCA-3′), KIR2DL4_R (5′-CGCAGACGTTGGTAAGCAAG-3′). Independent reactions were performed to generate amplicons for KIR1D/3D and KIR2DL4 products and the resulting products for each animal were pooled prior to PacBio library construction. A unique 16 bp barcode was appended to the 5′ end of each primer from a set of 384 barcodes designed for the PacBio system. Each sample could then be identified by its unique combination of barcodes. For all reactions, thermal cycling conditions were: denaturation at 98°C for 30 s; 30 cycles of 98°C for 5 s; 64°C (KIR3DL/KIR3DS/KIR1D) or 56°C (KIR2DL4) for 10 s, 72°C for 20 s; and a final extension of 72°C for 5 min. Successful amplification was confirmed using the FlashGel System (Lonza Group Ltd., Basel, Switzerland).
Amplified DNA was purified twice using Ampure XP SPRI beads (Agencourt Bioscience Corporation, Beverly, MA, USA) at a 0.6:1 bead to DNA volume ratio. All samples were quantified using the Qubit dsDNA BR assay kit and Qubit 2.0 Fluorometer (Thermo Fisher Scientific, Waltham, MA, USA). Amplicon size distributions were measured using the Agilent 2100 Bioanalyzer DNA12000 kit (Life Technologies, Madison, WI, USA). SMRTbell libraries were created using the PacBio Amplicon Template Preparation protocol for CCS (http://www.pacb.com). This protocol allows individual molecules to be sequenced multiple times in both orientations. Briefly, a pool of primary PCR products was end-repaired and hairpin adapters were incorporated using the PacBio DNA Template Prep Kit 2.0. After removal of failed ligation products using exonuclease III and VII, SMRTbells were purified using a 0.6:1 volume ratio of AMPure PB beads to DNA. Concentrations and sizes were obtained again for the final library prep using the Qubit dsDNA BR assay and Agilent 2100 Bioanalyzer DNA12000 kits. The volume of sequencing primer and polymerase was determined using a PacBio calculator by entering the concentrations of SMRTbells and around 1200 bp average insert size. Sequencing primer was annealed to the single-stranded loop region of the SMRTbell template and primer-annealed SMRTbells were bound to DNA polymerase P6. Polymerase-bound SMRTbells were MagBead loaded over zero-mode waveguides and immobilized. Sequencing was performed by the Great Lakes Genomics core at the University of Wisconsin-Milwaukee using a PacBio RS II instrument with C4 sequencing chemistry. PacBio CCS generates long read lengths (>10 - 15 kb) from a single molecule allowing unambiguous phasing of alleles.
Pacific Biosciences Data Analysis
The Pacific Biosciences command line tool pbccs from the smrttools package version 3.0.1 (https://github.com/PacificBiosciences/pbccs) was used to produce CCS from raw data files. This tool produced amplicon sequences in the form of BAM files, which were not demultiplexed. These BAM files were then filtered on the Read Quality field or “rq” tag and sequences with an “rq” value of 0.99 or higher were saved. This data was used for the purpose of novel allele discovery. Pacific Biosciences provided tools pbbarcode and bamCCSBarcodeDemultiplex.py were used to demultiplex the read data by barcode. A barcode score threshold was set at 25 to accurately match barcode sequences within reads. Demultiplexed fastq files were used for genotyping individual samples.
A series of analysis steps with open-source bioinformatics utilities were performed to discover putative novel allele sequences within pre-processed Pacific Biosciences sequencing data. As a first step after read pre-processing, chimera filtering was performed using the USEARCH (Edgar 2010) software package and the uchime_ref algorithm with a reference KIR allele sequence database. Following chimera filtering, primers were trimmed from read ends using the bbduk software tool as part of the bbtools (https://sourceforge.net/projects/bbmap/files/) software package. Primers were trimmed from the left and right ends of sequencing reads sequentially, using bbduk’s parameters “ktrim=l” and “ktrim=r” to specify which end to trim. bbduk’s parameters “restrictleft=50” and “restrictright=50” were used to limit the region of a sequencing read considered for primer trimming and a kmer size of 16 was specified with “k=16”. Next, open reading frames with sizes between 400 and 1300 base pairs were identified within read sequences using the software tool getorf as part of the EMBOSS software package (Rice et al. 2000). Only sequence reads containing an intact ORF within the specified size range were included in further processing steps. Reads identical to previously described allele sequences were then removed by mapping them to a database of known KIR allele sequences using mapPacBio.sh as part of the bbtools software package, and requiring reads match a reference allele sequence perfectly end to end using the parameter “perfectmode=t”. Unmapped reads were then clustered using the cluster_fast algorithm within the USEARCH software package, requiring 100% identity between sequences in a cluster. Cluster representative sequences with 3 or more reads in support of the cluster were saved for further analysis.
Putative novel allele sequences having been filtered by the previously mentioned analysis steps were then imported into the software tool Geneious Pro R9 (http://www.geneious.com, Kearse et al., 2012). Circular consensus sequences were mapped to the set of putative novel allele sequences within Geneious (requiring 100% identity in the overlap) using the bbmap plugin and parameters “semiperfectmode” and “PacBio mode”’. Putative novel sequences meeting the requirement of having greater than or equal to 3 reads whose mapping spanned the complete length of the sequence, and had at least 3 reads from a single specific sample were considered valid novel sequences and included in genotyping analyses.
To genotype the samples within the sequencing run, the set of valid novel sequences was merged with the set of previously described reference allele sequences with known haplotype associations. Barcode binned circular consensus sequences were then mapped to this merged set of allele sequences using Geneious Pro R9 (requiring 100% identity in the overlap). The mapping results were then exported from Geneious Pro R9 and imported into Microsoft Excel 2011 to generate a table of genotypes. Genotypes were called based on reads that mapped to previously described sequences. Haplotype associations for novel sequences were determined by their co-occurrence with previously described sequences.
Whole Genome Sequencing (WGS)
WGS datasets from 18 MCM were generated in a previous study (Supplementary Table 2) (Ericsen et al. 2014). Briefly, genomic DNA was isolated from PBMCs and Illumina paired-end libraries were constructed following standard protocol (complete description available at http://www.hgsc.bcm.edu). Library templates were prepared with TruSeq PE Cluster Generation Kits (Illumina, San Diego, CA, USA) using Illumina’s cBot cluster generation system. Libraries were loaded onto three HiSeq flow cell lanes and paired-end sequencing reads were generated using the Illumina HiSeq 2000 platform. Sequencing-by-synthesis reactions were extended for 101 cycles from each end using TruSeq SBS Kits (Illumina, San Diego, CA, USA). On average, 118 Gb (approximately 44Gb per lane) of sequence was generated per sample.
Rhexome1.0 target capture reagent
To improve overall coverage of the rhesus exome and successfully sequence the macaque MHC region, capture probes were designed targeting rhesus exons that were not covered sufficiently using the human based NimbleGen VCRome2.1 capture reagents. Sequencing reads from rhesus macaque DNA samples captured using VCRome2.1 were mapped to the rheMac2 genome assembly. Macaque exons that did not pass read coverage thresholds were identified (exons that showed less than 20× coverage or had excessive coverage were considered failures). In addition, all exons plus introns and UTR sequences for each class I and class II loci from the human MHC region (hg19) were included in this custom probeset to increase the number of MHC class I and II sequence reads recovered in each experiment.
The resulting set of novel exonic and MHC targets sequences were provided to NimbleGen scientists (NimbleGen, Madison, WI, USA), where they used proprietary software to design and manufacture novel sequence capture probes. This sequence capture probe set (macaque exonic regions: 1,544,913bp; human MHC region: 607,629bp, hg19) was used as a “spike-in” reagent added to the VCRome2.1 probe set in standard human exome capture procedures (Lupski et al. 2013). A series of titrations comparing the results for exome sequencing using the combined VCRome2.1 plus “spike-in” reagents against whole genome sequencing results from the same animals showed that a ratio of 1:2.5 for VCRome2.1-to-“spike-in” provided superior overall coverage results. We have designated the resulting VCRome2.1 plus “spike-in” reagents the Rhexome1.0.
Macaque Exome Sequencing (MES)
Eight MCM exomes using standard VCRome2.1 capture reagents and 36 MCM exomes using Rhexome1.0 capture reagents were prepared using standard human exome capture procedures (Supplementary Table 2) (Lupski et al. 2013). After standard DNA quality control tests on samples, we produced Illumina sequencing libraries with incorporated barcodes, following standard procedures (Lupski et al. 2013). Groups of six macaque bar-coded samples were pooled and captured together (6Plex). Each resulting pool of six DNA samples (enriched for the macaque exome by the capture process) was sequenced in one lane of Illumina HiSeq2000. This procedure results in an estimated sequence read depth of greater than 20X for 91% of on-target reads.
Genotyping from WGS and MES datasets
A reference allele sequence database was created consisting of discriminatory exon regions found within the complete set of full-length reference alleles having been discovered from PacBio sequencing. For KIR1D, KIR2DL4, KIR3DS and KIR3DL loci, exon sets were chosen based on having polymorphisms that enabled allelic resolution. For KIR1D, D1 exon sequence was used for each full-length allele; for KIR2DL4, D0 and D2 exon sequences were used for each full-length allele; and for KIR3DS/KIR3DL, D0, D1, and D2 exon sequences were used for each full-length allele. The genotyping rationale used was that mapping results that included specific exon sequences for a given KIR locus being completely covered 5′ to 3′ with reads that map with 100% identity represents the presence of the associated full length allele sequence in the sample being genotyped.
A custom software pipeline was written in Python and used to run sequential analysis steps for genotyping samples from WGS/MES datasets (https://bitbucket.org/dholab/mcm_kir/src). As a first step, read sorting and read pairing verification was done using the repair.sh software tool within the bbtools (https://sourceforge.net/projects/bbmap/files/) software package. Next, reads were mapped to the exon allele reference database that was created using bbmap as part of the bbtools (https://sourceforge.net/projects/bbmap/files/) software package. Reads were required to map with 100% identity in the overlap with the exon reference sequences. Mapping results were then translated to coverage statistics using coverage Bed within the bedtools (Quinlan and Hall 2010) software package. Mapping results were then filtered using custom Python code based on the coverage statistics to only include results where exon reference sequences had at least 1 read covering every base of the reference. Filtering was also done to require the correct number of discriminatory exons be covered for each KIR locus. Mapping results were then aggregated and read counts broken out by exon for each sample per KIR allele sequence. The results were reported as a genotyping table.
Phylogenetic Analysis
Codon based nucleotide alignments were generated in Geneious Pro R9 (Kearse et al. 2012) using the ClustalW2 plugin (Larkin et al. 2007) with default settings. Phylogenetic trees were constructed with MEGA7.0 (Kumar et al. 2016) based on the JTT amino acid sequence distance (Jones et al. 1992) using the neighbor-joining method (Saitou and Nei 1987). Clustering patterns were assessed by bootstrapping (Felsenstein et al. 1985) with 1000 bootstrap replicates.
RESULTS
Full-length KIR allele discovery
Previously, we designed a primer pair to amplify full-length KIR transcripts based on rhesus macaque cDNA transcripts deposited in NCBI Genbank as of 2008. We adapted this primer pair for PacBio sequencing in a pilot study with a cohort of twelve MCM. As expected KIR1D and KIR3D sequences were recovered that had been characterized previously but we were unable to detect any KIR2DL4 sequences. To investigate whether this reflected variation within the binding sites of primers for KIR2DL4 transcripts, we compared our original primer sequences against genomic sequences from 62 total WGS and MES MCM samples (Supplementary Table 2) (Ericsen et al. 2014). In order to distinguish KIR2DL4-specific reads from other KIRs, raw genomic sequence reads were mapped onto a complete genomic Mamu-KIR2DL4 sequence extracted from a bacterial artificial chromosome (bRH-242L13, accession# BX842591) sequenced by Sambrook et al. (2005). All reads that mapped to the 5′UTR and exon one region of KIR2DL4 contained two mismatches within exon one and three mismatches 3bp upstream of exon one relative to our previous 5′ UTR forward primer. To accommodate this variation, a KIR2DL4-specific primer was designed within a conserved region of the 5′ UTR, shown in Figure 1. We then wanted to investigate potential sequence variation within other KIR loci and so genomic reads were mapped against the Mamu-KIR3DL1 gene from BAC bRH-242L13 in a similar manner. The majority of animals contained reads with variation in the region targeted by the forward sense PCR primer (data not shown). We also noted the targeted region contained two in-frame start codons at amino acid positions one and four, located within the region targeted by the forward primer. Alternative translational start sites are commonly observed in signal sequences (Kochetov et al. 2008). We therefore designed a revised forward primer upstream of these two possible start sites to ensure sequencing of full open reading frames. The revised primer included two degenerate bases to account for observed variation within the 5′ UTRs of KIR1D and KIR3D transcripts.
Using revised primer pairs, we generated full-length cDNA amplicons from 30 MCM for PacBio SMRT-CCS analysis. Ten of these 30 individuals had known KIR genotypes determined using microsatellite analysis from our previous study (Supplementary Table 1). These ten animals were selected to represent all eight KIR haplotypes in the MCM population by one or more heterozygous animals; individuals homozygous for the three most common KIR haplotypes were available for this analysis. We successfully confirmed 37 of the 40 full-length Mafa-KIR transcripts and nine splice variants that were previously described in MCM (Table 1). Of the 37 confirmed full-length transcripts, we added 5′ extensions to 35 creating complete open reading frames. The three transcripts that were not detected (EU419100, EU419107, and EU419118) could have been missed due to a variety of reasons in this analysis. EU419107 is the only reported KIR3DL20 allele from our previous study and is presumably the most centromeric locus within the KIR region. Because of the differences in sequence motifs observed between centromeric and telomeric KIR genes, it is possible we were unable to resequence EU419107 due to mismatches within primer binding sites. EU419118 (KIR3DL7) and EU419100 (KIR3DL2) are associated with the less common haplotypes K4 and K5, respectively and may have been underrepresented within the pooled library. We suspect that sequencing more MCM may resolve some of the missed transcripts. In addition to the 37 confirmed transcripts, we discovered nine novel KIR transcript sequences plus five additional novel splice variants (Table 1).
Table 1. Accession numbers for PacBio-identified sequences.
Haplotype | Gene | Accession | Amino Acids | Representative animals | Prev Accession |
---|---|---|---|---|---|
K1 | KIR1D | KX892645 | 279 | CY0161,CY0327 | EU419110 |
K1K7 | KIR2DL4 | KX892646 | 375 | CY0161,CY0355 | EU419111 |
K1K7 | KIR2DL4-SV | KX892647 | 277 | CY0563 | EU419111 |
K1 | KIR3DS | KX892648 | 374 | CY0161,CY0327 | EU419113 |
K1 | KIR3DL7 | KX892649 | 447 | CY0161,CY0327 | EU419109 |
K1 | KIR3DL11 | KX892650 | 447 | CY0161,CY0327 | EU419101 |
K1 | KIR3DL11-SV | KX892651 | 349 | CY0161,CY0327 | EU419102 |
K2 | KIR1D | KX892652 | 279 | CY0166,CY0568 | EU419124 |
K2K4K5 | KIR2DL4 | KX892653 | 375 | CY0166,CY0570,CY0390 | EU419114 |
K2 | KIR3DL2 | KX892654 | 444 | CY0166,CY0353 | EU419090 |
K2 | KIR3DL2-SV | KX892655 | 394 | CY0425,CY0568 | EU419091 |
K2 | KIR3DL2 | KX892656 | 444 | CY0166,CY0353 | EU419083 |
K2 | KIR3DS | KX892657 | 343 | CY0166,CY0353 | |
K2 | KIR3DS | KX892658 | 386 | CY0166,CY0568 | EU419092 |
K2 | KIR3DS | KX892659 | 343 | CY0166,CY0568 | EU419079 |
K2 | KIR3DL1 | KX892660 | 447 | CY0166,CY0568 | EU419076 |
K2 | KIR3DL7 | KX892661 | 461 | CY0163 | EU419078 |
K2 | KIR3DL7 | KX892662 | 447 | CY0166,CY0568 | EU419089 |
K3K4 | KIR1D | KX892663 | 176 | CY0111,CY0570 | EU419120 |
K3K4 | KIR1D-SV | KX892664 | 210 | CY0111,CY0570 | |
K3 | KIR2DL4 | KX892665 | 375 | CY0111,CY0390 | |
K3 | KIR2DL4-SV | KX892666 | 280 | CY0111,CY0390 | |
K3 | KIR3DL2 | KX892667 | 444 | CY0111,CY0390 | EU419123 |
K3 | KIR3DS | KX892668 | 374 | CY0111,CY0390 | |
K3 | KIR3DL11 | KX892669 | 447 | CY0111,CY0390 | EU419119 |
K4 | KIR3DL2 | KX892670 | 444 | CY0570,CY0332 | EU688993 |
K4 | KIR3DL2-SV | KX892671 | 394 | CY0570,CY0332 | |
K4 | KIR3DL2-SV | KX892672 | 435 | CY0570,CY0332 | |
K4 | KIR3DS | KX892673 | 374 | CY0570,CY0332 | EU419121 |
K4 | KIR3DL11 | KX892674 | 447 | CY0570,CY0332 | EU419115 |
K4 | KIR3DL11-SV | KX892675 | 397 | CY0570,CY0342 | EU419116 |
K5 | KIR3DL2 | KX892676 | 444 | CY0327,CY0163 | EU419105 |
K5 | KIR3DL2-SV | KX892677 | 435 | CY0327,CY0163 | EU419106 |
K5 | KIR3DL2-SV | KX892678 | 394 | CY0327,CY0163 | |
K5 | KIR3DS | KX892679 | 374 | CY0327,CY0163 | EU419126 |
K5 | KIR3DS | KX892680 | 386 | CY0327,CY0163 | EU419097 |
K5 | KIR3DS | KX892681 | 386 | CY0327,CY0163 | EU419098 |
K5 | KIR3DL1 | KX892682 | 458 | CY0327,CY0163 | EU419075 |
K5 | KIR3DL7 | KX892683 | 447 | CY0327,CY0163 | EU419103 |
K5 | KIR3DL7 | KX892684 | 447 | CY0327,CY0354 | EU419099 |
K6 | KIR1D | KX892685 | 279 | CY0568 | |
K6 | KIR3DL2 | KX892686 | 444 | CY0568,CY0330 | |
K6 | KIR3DS | KX892687 | 386 | CY0330 | EU688994 |
K6 | KIR3DS | KX892688 | 374 | CY0568,CY0330 | |
K6 | KIR3DS | KX892689 | 374 | CY0568,CY0330 | |
K6 | KIR3DL11 | KX892690 | 447 | CY0568,CY0330 | EU688995 |
K7 | KIR3DL2 | KX892691 | 440 | CY0356,CY0336 | EU419080 |
K7 | KIR3DL2-SV | KX892692 | 348 | CY0356,CY0336 | EU419081 |
K7 | KIR3DS | KX892693 | 374 | CY0356,CY0336 | EU419086 |
K7 | KIR3DS | KX892694 | 374 | CY0356,CY0336 | |
K7 | KIR3DL7 | KX892695 | 447 | CY0356,CY0336 | EU419087 |
K7 | KIR3DL7-SV | KX892696 | 349 | CY0356,CY0336 | EU419088 |
K7 | KIR3DL11 | KX892697 | 447 | CY0356,CY0336 | EU419093 |
K8 | KIR1D | KX892698 | 312 | CY0322,CY0558 | |
K8 | KIR3DS | KX892699 | 447 | CY0332 | EU419122 |
K8 | KIR3DL7 | KX892700 | 397 | CY0322,CY0558 | EU419077 |
K8 | KIR3DL7 | KX892701 | 374 | CY0322,CY0558 | EU419084 |
K8 | KIR3DL7-SV | KX892702 | 447 | CY0322,CY0558 | EU419085 |
K8 | KIR3DL11 | KX892703 | 447 | CY0322,CY0558 | EU419094 |
K8 | KIR3D11-SV | KX892704 | 349 | CY0322,CY0558 | EU419095 |
KIR genotypes for each animal were determined from a total of 28,086 reads (avg 936 reads/animal) that mapped perfectly to a Mafa-KIR allele library consisting of known and novel sequences identified in the current study. Figure 2 displays the genotypes of a subset of sequenced individuals representative of the eight KIR haplotypes in the MCM population. All 37 previously described transcripts were detected on the expected KIR haplotypes with the exception of KX892670 (KIR3DL2), which was associated with K4 haplotypes rather than K6 as previously thought. Of the nine novel sequences we identified two KIR1D and one KIR2DL4 allele. To infer the lineage of the remaining novel sequences we constructed KIR3DL and KIR3DS phylogenetic trees using all of the sequences containing three immunoglobulin domains that were characterized in this study (Supplementary Fig. 1). Based on phylogenetic analysis the novel sequences were characterized as five KIR3DS and one KIR3DL2 transcript (Table 1). Among the newly identified transcripts was a K3-associated KIR3DS (KX892668). No KIR3DS alleles were detected for this haplotype in our previous study (Bimber et al. 2008). This establishes KIR3DS as the first gene present on all eight MCM haplotypes. In addition, we discovered new haplotype associations for two previously identified KIR2DL4 (KX892646, KX892653) and one KIR1D (KX892663) transcript. KX892646 (KIR2DL4) was associated with both K1 and K7 haplotypes; KX892653 (KIR2DL4) was associated with K2, K4 and K5 haplotypes; and KX892663 (KIR1D) was associated with K3 and K4 haplotypes. Similar observations have been reported in rhesus macaques where identical Mamu-KIR1D and Mamu-KIR2DL4 alleles are shared between haplotypes (Blokhuis et al. 2010). The combined novel transcripts and haplotype associations have added transcripts to seven of the eight haplotypes. The range of transcripts per haplotype increased to between five and ten.
Genotyping from WGS and MES datasets
In previous studies, we and others (Vallender 2011) have used human exome probe arrays to capture and sequence exons from the rhesus and cynomolgus macaques. However, using probes designed for the human genome (VCRome2.1, total length of capture probes 35.25 Mb) to capture the rhesus macaque exome failed to produce sufficient read coverage for 12% of exons in the rheMac2 annotated rhesus genome assembly (Bainbridge et al. 2011). Therefore, to improve overall coverage of macaque exomes and simultaneously determine macaque MHC class I and class II genotypes, we designed additional capture probes targeting rhesus exons that were not covered sufficiently using VCRome2.1 capture reagents (Supplementary Table 3). A series of tests comparing the results for macaque exome sequencing using the combined VCRome2.1 plus “spike-in” reagents against whole genome sequencing results from the same animals showed that a ratio of 1:2.5 for VCRome2.1-to-“spike-in” provided superior overall coverage results (Supplementary Tables 4 and 5).
With a full-length Mafa-KIR alleles database established, we evaluated the feasibility of genotyping KIRs directly from MES with Rhexome1.0 reads compared to reads generated using WGS. All 30 PacBio-sequenced animals were genotyped using their respective MES (n=23) and WGS (n=7) datasets. 100 bp paired end reads were mapped to a library of Mafa-KIR D0, D1, and D2 exon sequences. Genotypes for the 30 PacBio-sequenced animals were determined by requiring complete coverage of D0, D1, and D2 for KIR3DL/KIR3DS alleles; D0 and D2 for KIR2DL4 alleles; and D1 for KIR1D alleles. KIR alleles detected per haplotype for representative MCM are shown in Figure 3. In total, we were able to detect 39 of the 46 PacBio-identified, full-length transcripts. Sequence depths between WGS and MES were comparable. Interestingly, all three previously identified alleles that were absent within our PacBio sequencing experiment (EU419100, EU419107, and EU419118) were successfully mapped from genomic reads (Supplementary Table 6). Because these KIR alleles were originally identified by cDNA cloning as fully spliced transcripts with complete ORFs, it is unlikely that these alleles represent pseudogenes. We suspect that these alleles may include sequence variation within the PacBio primer binding sites or represent transcripts with low expression levels in the animals evaluated in this study. Figure 4 displays an updated genomic model for KIR haplotypes using the combined results of PacBio CCS and WGS/MES genotyping in conjunction with the results of our previous study.
DISCUSSION
Previously, we and others have developed PacBio CCS methods for full length MHC class I allele discovery and genotyping (Karl et al. 2017, Westbrook et al. 2015) Here we adapted this framework to full-length allele discovery and genotyping for KIRs. We were able to increase our sensitivity of detection by utilizing genomic sequence data from a cohort of 62 MCM to accommodate multiple sites of variation during primer design. We then used full-length Pacific Biosciences sequencing to interrogate 30 MCM from our genomic cohort and characterized nine novel transcripts, five novel splice variants, and four novel haplotype associations, demonstrating the increased sensitivity of our method.
In this study we also demonstrated an approach to KIR genotyping directly from WGS/MES reads using a reference library of D0, D1, and D2 exon sequences derived from full-length transcripts sequenced using PacBio. When this method was applied to the associated WGS/MES datasets for our PacBio CCS cohort, we successfully detected 42 alleles including three alleles that were absent from our PacBio experiment. However, we were unsuccessful in detecting seven PacBio-identified alleles (Supplementary Table 6). This is most likely due to the strict constraints of the WGS/MES genotyping algorithm. For example, CY0322 was identified as a K3/K8 heterozygote (Supplementary Table 1). One hundred and one reads for a K8-associated KIR3DL7, KX892700, were detected using PacBio amplicon sequencing (Figure 2). This allele wasn’t detected from the same animal using mapped WGS reads. Viewing the per-exon coverage for KX892700 reveals complete coverage for D2, but incomplete coverage for D0 and D1. Similarly, a K2-associated KIR3DL7, KX892661, appears to be absent in MES data for CY0166 who is a K2/K2 homozygous animal. Again, viewing the per exon coverage for CY0166 reveals that complete D0 and D2 coverage exists for KX892661, but D1 has incomplete coverage and thus the allele was not scored as positive. One solution to this problem may be to slightly decrease the coverage required to call an allele or shorten the length of reference contigs that reads are mapped to. However, this could potentially complicate genotyping in instances where alleles differ by only a few nucleotides. Improved sequencing platforms such as the Illumina HiSeq X, which generate greater amounts of data per lane at lower cost than earlier platforms, may also help by providing greater read depth across KIR genes at no additional cost. Another solution may be to spike-in KIR probes into the Rhexome1.0 just as we have with the MHC in order to increase read depth. Though the Rhexome1.0 was able to capture a significant amount of KIR sequences without spike-in probes, greater read depth may help increase confidence when determining genotypes. Because of the restricted genetic diversity of MCM, spike-in KIR probes may not be necessary as genotypes can be inferred based on the full-length sequencing studies we have performed to define the extent of KIR diversity within the population. Inference will most likely fall short once more genetically diverse populations are considered such as Indian rhesus macaques. Regardless, applying this genotyping approach to other species will require full-length allele libraries to be established for each population.
PacBio analyses revealed a novel KIR3DS transcript associated with K3 haplotypes; no KIR3DS alleles were detected for this haplotype in our previous study (Bimber et al. 2008). Thus, at least one KIR3DS gene is present on all eight KIR haplotypes in MCM. It is therefore possible that a Mafa-KIR3DS lineage represents a framework gene within MCM. KIR3DS (formerly KIR3DH) in macaques describes a class of receptors that are evolutionarily distinct from human KIR3DS, most likely arising from a crossover event between a KIR3DL and KIR2DL4 within an ancestral macaque (Hershberger et al. 2001). Like KIR2DL4, KIR3DS genes contain a charged arginine residue within their transmembrane domains. KIR3DS also contain a large deletion resulting in early termination before the immunoreceptor tyrosine-based inhibitory motifs (ITIMs) encoded by KIR2DL4 and KIR3DL cytoplasmic tails (Hershberger et al. 2001). Therefore, KIR3DS are thought to be the only activating receptors in macaques. Several haplotypes that lack KIR3DS have been reported in rhesus macaques including the single Mamu-KIR haplotype that was completely sequenced from two overlapping BACs (Blokhuis et al. 2011; Kruse et al. 2010; Moreland et al. 2011, Sambrook et al. 2005). However, Mamu-KIR genotyping studies from a variety of groups have shown that virtually all rhesus macaques encode at least one KIR3DS activating receptor (Blokhuis et al. 2009; Blokhuis et al. 2010; Hellmann et al. 2011; Hershberger et al. 2001; Kruse et al. 2010; Moreland et al. 2011). In all of the sequencing literature from the past decade we could find only a handful of rhesus macaques reported with no Mamu-KIR3DS (Blokhuis et al. 2011). While it is possible that rare macaque haplotypes lack KIR3DS it seems likely that KIR3DS variants may be present in these animals that were not detected by the various genotyping assays used in these earlier studies. Taken with our findings that activating KIRs are expressed in all MCM, it is tempting to speculate that activating KIR provide some selective advantage.
A growing amount of evidence suggesting the role of particular activating Mamu-KIR3DS alleles in influencing plasma viral loads in SIV-infected rhesus macaques has surfaced from associative studies (Albrect et al. 2014; Chaichompoo et al. 2010). Similarly, the copy number of Mamu-KIR3DS genes is speculated to influence peak viral loads following SIVmac251 infection in Indian rhesus macaques. Hellmann et al. (2011) showed that increased copies of Mamu-KIR3DS were inversely correlated with peak viral loads, though it should be noted that the association was seen only in the rhesus that lack protective Mamu-A*001 and were homozygous for restrictive TRIM5 alleles (Hellman et al. 2011). The methods described in this study could aid in untangling the biological mechanisms of activating KIRs in future studies by providing unambiguous genotyping down to the allelic identity. Furthermore, copy number variation of activating KIR3DS exists across MCM haplotypes. Investigating activating KIR3DS copy number variation using MHC/KIR identical cohorts may lead to novel insights into these complex associations.
KIR2DL4 is considered a framework gene in humans and is thought to play a role in recognition of fetal tissues by the maternal immune system through binding nonclassical MHC-G molecules (Ponte et al. 1999). Data by Yan et al. (2007) has shown that KIR2DL4 surface expression in uterine NK was significantly lower in women with recurrent spontaneous abortion than in normal early pregnancy woman. Our study found that the two previously identified KIR2DL4 sequences are shared among multiple haplotypes. In addition, a novel K3-associated KIR2DL4 was characterized making six of the eight haplotypes KIR2DL4 positive. When we genotyped animals using WGS/MES reads, all 30 samples had complete coverage of both KIR2DL4 exons for at least one of the three alleles (data not shown). It is possible that WGS/MES KIR2DL4 reads are from pseudogenes within these animals; however, the absence of PacBio reads from KIR2DL4 in five out of 30 probably reflects failed 2DL4 PCRs for these individuals. The latter explanation is interesting when considering that all 62 genomic datasets contained reads spanning the first exon of KIR2DL4 including an in-frame start codon when mapped against the Mamu-KIR2DL4 contig extracted from the genomic haplotype (Fig. 1). Furthermore, seven unique KIR2DL4 5′ UTR sequences were identified within the genomic datasets but only three unique KIR2DL4 transcripts were identified with PacBio sequencing. It’s known that the relative abundance of KIR transcripts fluctuates over the course of infection due to selective expansion of NK cell populations (reviewed in Maras et al. 2014). Any of these factors may have contributed to the varying levels of CCS read depth we observed across our PacBio cohort. Interestingly, six animals (e.g. CY0336, CY0351) were positive for all three KIR2DL4 respectively when genotyped using WGS/MES data. This could be a byproduct of ambiguous mapping of short reads onto larger exon contigs. However, this observation is consistent in rhesus macaques where Hellman et al. (2013) found Mamu-KIR2DL4 copy numbers varied between one and three per cell. Understanding KIR2DL4 presence/absence and copy number variation requires sequencing of more animals to identify homozygous individuals with less common KIR haplotypes.
In conclusion, the KIR genotyping approaches presented here have given us a framework for interpreting complex, duplicated families of genes. Full-length transcript sequencing using PacBio CCS can aid study design with unambiguous genotyping results that weren’t available previously. In addition, genotyping KIRs directly from genomic data is a task that has not been possible using reference guided alignment given the current genome assemblies for nonhuman primates. Currently, we are working to expand this methodology to other complex immune gene families such as MHC class I, class II and Fc gamma receptors. This would eventually allow genotyping of a variety of immune loci directly from a single MES experiment. The combined results of these methods have provided an improved perspective of the genetics of KIR within the MCM population.
Supplementary Material
Acknowledgments
This research was supported by contracts HHSN272201600007C and HHSN272201100013C from the National Institute of Allergy and Infectious Diseases of the National Institutes of Health, and was conducted at a facility constructed with support from the Research Facilities Improvement Program (RR15459-01, RR20141-01).
References
- Albrecht C, Malzahn D, Brameier M, Hermes M, Ansari AA, Walter L. Progression to AIDS in SIV-Infected Rhesus Macaques is Associated with Distinct KIR and MHC class I Polymorphisms and NK Cell Dysfunction. Front Immunol. 2014;5:600. doi: 10.3389/fimmu.2014.00600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alter G, Martin MP, Teigen N, Carr WH, Suscovich TJ, Schneidewind A, Streeck H, Waring M, Meier A, Brander C, Lifson JD, Allen TM, Carrington M, Altfeld M. Differential natural killer cell-mediated inhibition of HIV-1 replication based on distinct KIR/HLA subtypes. J Exp Med. 2007;204:3027–3036. doi: 10.1084/jem.20070695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anderson DJ, Kirk AD. Primate models in organ transplantation. Cold Spring Harb Perspect Med. 2013;3:a015503. doi: 10.1101/cshperspect.a015503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Antony JM, MacDonald KS. A critical analysis of the cynomolgus macaque, Macaca fascicularis, as a model to test HIV-1/SIV vaccine efficacy. Vaccine. 2015;33:3073–3083. doi: 10.1016/j.vaccine.2014.12.004. [DOI] [PubMed] [Google Scholar]
- Bainbridge MN, Wang M, Wu Y, Newsham I, Muzny DM, Jefferies JL, Albert TJ, Burgess DL, Gibbs RA. Targeted enrichment beyond the consensus coding DNA sequence exome reveals exons with higher variant densities. Genome Biol. 2011;12:R68. doi: 10.1186/gb-2011-12-7-r68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biassoni R, Falco M, Cambiaggi A, Costa P, Verdiani S, Pende D, Conte R, Di Donato C, Parham P, Moretta L. Amino acid substitutions can influence the natural killer (NK)-mediated recognition of HLA-C molecules. Role of serine-77 and lysine-80 in the target cell protection from lysis mediated by “group 2” or “group 1” NK clones. J Exp Med. 1995;182:605–609. doi: 10.1084/jem.182.2.605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bimber BN, Moreland AJ, Wiseman RW, Hughes AL, O’Connor DH. Complete characterization of killer Ig-like receptor (KIR) haplotypes in Mauritian cynomolgus macaques: novel insights into nonhuman primate KIR gene content and organization. J Immunol. 2008;181:6301–6308. doi: 10.4049/jimmunol.181.9.6301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bimber BN, Evans DT. The killer-cell immunoglobulin-like receptors of macaques. Immunol Rev. 2015;267:246–258. doi: 10.1111/imr.12329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blokhuis JH, Doxiadis GG, Bontrop RE. A splice site mutation converts an inhibitory killer cell Ig-like receptor into an activating one. Mol Immunol. 2009;46:640–648. doi: 10.1016/j.molimm.2008.08.270. [DOI] [PubMed] [Google Scholar]
- Blokhuis JH, van der Wiel MK, Doxiadis GG, Bontrop RE. The mosaic of KIR haplotypes in rhesus macaques. Immunogenetics. 2010;62:295–306. doi: 10.1007/s00251-010-0434-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blokhuis JH, van der Wiel MK, Doxiadis GG, Bontrop RE. The extreme plasticity of killer cell Ig-like receptor (KIR) haplotypes differentiates rhesus macaques from humans. Eur J Immunol. 2011;41:2719–2728. doi: 10.1002/eji.201141621. [DOI] [PubMed] [Google Scholar]
- Bonagura VR, Du Z, Ashouri E, Luo L, Hatam LJ, DeVoti JA, Rosenthal DW, Steinberg BM, Abramson AL, Gjertson DW, Reed EF, Rajalingam R. Activating killer cell immunoglobulin-like receptors 3DS1 and 2DS1 protect against developing the severe form of recurrent respiratory papillomatosis. Hum Immunol. 2010;71:212–219. doi: 10.1016/j.humimm.2009.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carr WH, Pando MJ, Parham P. KIR3DL1 polymorphisms that affect NK cell inhibition by HLA-Bw4 ligand. J Immunol. 2005;175:5222–5229. doi: 10.4049/jimmunol.175.8.5222. [DOI] [PubMed] [Google Scholar]
- Carrillo-Bustamante P, Keşmir C, de Boer RJ. The evolution of natural killer cell receptors. Immunogenetics. 2016;68:3–18. doi: 10.1007/s00251-015-0869-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carrington M, Norman P. The KIR Gene Cluster 2003 [Google Scholar]
- Chaichompoo P, Bostik P, Stephenson S, Udompunturuk S, Kobkitjaroen J, Pattanapanyasat K, Ansari AA. Multiple KIR gene polymorphisms are associated with plasma viral loads in SIV-infected rhesus macaques. Cell Immunol. 2010;263:176–187. doi: 10.1016/j.cellimm.2010.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cirocco RE, Mathew JM, Burke GW, Esquenazi V, Miller J. Killer cell immunoglobulin-like receptor polymorphisms in HLA-identical kidney transplant recipients: lack of 2DL2 and 2DS2 may be associated with poor graft function. Tissue Antigens. 2007;69(Suppl 1):123–124. doi: 10.1111/j.1399-0039.2006.76211.x. [DOI] [PubMed] [Google Scholar]
- Dunphy SE, Guinan KJ, Chorcora CN, Jayaraman J, Traherne JA, Trowsdale J, Pende D, Middleton D, Gardiner CM. 2DL1, 2DL2 and 2DL3 all contribute to KIR phenotype variability on human NK cells. Genes Immun. 2015;16:301–310. doi: 10.1038/gene.2015.15. [DOI] [PubMed] [Google Scholar]
- Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- Eid J, Fehr A, Gray J, Luong K, Lyle J, Otto G, Peluso P, Rank D, Baybayan P, Bettman B, Bibillo A, Bjornson K, Chaudhuri B, Christians F, Cicero R, Clark S, Dalal R, Dewinter A, Dixon J, Foquet M, Gaertner A, Hardenbol P, Heiner C, Hester K, Holden D, Kearns G, Kong X, Kuse R, Lacroix Y, Lin S, Lundquist P, Ma C, Marks P, Maxham M, Murphy D, Park I, Pham T, Phillips M, Roy J, Sebra R, Shen G, Sorenson J, Tomaney A, Travers K, Trulson M, Vieceli J, Wegener J, Wu D, Yang A, Zaccarin D, Zhao P, Zhong F, Korlach J, Turner S. Real-time DNA sequencing from single polymerase molecules. Science. 2009;323:133–138. doi: 10.1126/science.1162986. [DOI] [PubMed] [Google Scholar]
- Ericsen AJ, Starrett GJ, Greene JM, Lauck M, Raveendran M, Deiros DR, Mohns MS, Vince N, Cain BT, Pham NH, Weinfurter JT, Bailey AL, Budde ML, Wiseman RW, Gibbs R, Muzny D, Friedrich TC, Rogers J, O’Connor DH. Whole genome sequencing of SIV-infected macaques identifies candidate loci that may contribute to host control of virus replication. Genome Biol. 2014;15:478. doi: 10.1186/s13059-014-0478-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Felsenstein J. Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985;39:783–791. doi: 10.1111/j.1558-5646.1985.tb00420.x. [DOI] [PubMed] [Google Scholar]
- Gardiner CM. Killer cell immunoglobulin-like receptors on NK cells: the how, where and why. Int J Immunogenet. 2008;35:1–8. doi: 10.1111/j.1744-313X.2007.00739.x. [DOI] [PubMed] [Google Scholar]
- González-Galarza FF, Takeshita LY, Santos EJ, Kempson F, Maia MH, da Silva AL, Teles e Silva AL, Ghattaoraya GS, Alfirevic A, Jones AR, Middleton D. Allele frequency net 2015 update: new features for HLA epitopes, KIR and disease and HLA adverse drug reaction associations. Nucleic Acids Res. 2015;43:D784–8. doi: 10.1093/nar/gku1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hansen DS, D’Ombrain MC, Schofield L. The role of leukocytes bearing Natural Killer Complex receptors and Killer Immunoglobulin-like Receptors in the immunology of malaria. Curr Opin Immunol. 2007;19:416–423. doi: 10.1016/j.coi.2007.07.011. [DOI] [PubMed] [Google Scholar]
- Hellmann I, Letvin NL, Schmitz JE. KIR2DL4 copy number variation is associated with CD4+ T-cell depletion and function of cytokine-producing NK cell subsets in SIV-infected Mamu-A*01-negative rhesus macaques. J Virol. 2013;87:5305–5310. doi: 10.1128/JVI.02949-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hellmann I, Lim SY, Gelman RS, Letvin NL. Association of activating KIR copy number variation of NK cells with containment of SIV replication in rhesus monkeys. PLoS Pathog. 2011;7:e1002436. doi: 10.1371/journal.ppat.1002436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hershberger KL, Kurian J, Korber BT, Letvin NL. Killer cell immunoglobulin-like receptors (KIR) of the African-origin sabaeus monkey: evidence for recombination events in the evolution of KIR. Eur J Immunol. 2005;35:922–935. doi: 10.1002/eji.200425408. [DOI] [PubMed] [Google Scholar]
- Hershberger KL, Shyam R, Miura A, Letvin NL. Diversity of the killer cell Ig-like receptors of rhesus monkeys. J Immunol. 2001;166:4380–4390. doi: 10.4049/jimmunol.166.7.4380. [DOI] [PubMed] [Google Scholar]
- Hilton HG, Guethlein LA, Goyos A, Nemat-Gorgani N, Bushnell DA, Norman PJ, Parham P. Polymorphic HLA-C Receptors Balance the Functional Characteristics of KIR Haplotypes. J Immunol. 2015;195:3160–3170. doi: 10.4049/jimmunol.1501358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang Y, Chen O, Cui C, Zhao B, Han X, Zhang Z, Liu J, Xu J, Hu Q, Liao C, Shang H. KIR3DS1/L1 and HLA-Bw4-80I are associated with HIV disease progression among HIV typical progressors and long-term nonprogressors. BMC Infect Dis. 2013;13:405. doi: 10.1186/1471-2334-13-405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones DT, Taylor WR, Thornton JM. The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 1992;8:275–282. doi: 10.1093/bioinformatics/8.3.275. [DOI] [PubMed] [Google Scholar]
- Karl JA, Graham ME, Wiseman RW, Heimbruch KE, Gieger SM, Doxiadis GG, Bontrop RE, O’Connor DH. Major histocompatibility complex haplotyping and long-amplicon allele discovery in cynomolgus macaques from Chinese breeding facilities. Immunogenetics. 2017 doi: 10.1007/s00251-017-0969-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khakoo SI, Thio CL, Martin MP, Brooks CR, Gao X, Astemborski J, Cheng J, Goedert JJ, Vlahov D, Hilgartner M, Cox S, Little AM, Alexander GJ, Cramp ME, O’Brien SJ, Rosenberg WM, Thomas DL, Carrington M. HLA and NK cell inhibitory receptor genes in resolving hepatitis C virus infection. Science. 2004;305:872–874. doi: 10.1126/science.1097670. [DOI] [PubMed] [Google Scholar]
- Knapp S, Warshow U, Hegazy D, Brackenbury L, Guha IN, Fowell A, Little AM, Alexander GJ, Rosenberg WM, Cramp ME, Khakoo SI. Consistent beneficial effects of killer cell immunoglobulin-like receptor 2DL3 and group 1 human leukocyte antigen-C following exposure to hepatitis C virus. Hepatology. 2010;51:1168–1175. doi: 10.1002/hep.23477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kochetov AV. Alternative translation start sites and hidden coding potential of eukaryotic mRNAs. Bioessays. 2008;30:683–691. doi: 10.1002/bies.20771. [DOI] [PubMed] [Google Scholar]
- Kruse PH, Rosner C, Walter L. Characterization of rhesus macaque KIR genotypes and haplotypes. Immunogenetics. 2010;62:281–293. doi: 10.1007/s00251-010-0433-4. [DOI] [PubMed] [Google Scholar]
- Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol Biol Evol. 2016;33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kunert K, Seiler M, Mashreghi MF, Klippert K, Schönemann C, Neumann K, Pratschke J, Reinke P, Volk HD, Kotsch K. KIR/HLA ligand incompatibility in kidney transplantation. Transplantation. 2007;84:1527–1533. doi: 10.1097/01.tp.0000290681.41859.41. [DOI] [PubMed] [Google Scholar]
- Kuśnierczyk P. Are killer cell immunoglobulin-like receptor genes important for the prediction of kidney graft rejection. Arch Immunol Ther Exp (Warsz) 2013;61:321–325. doi: 10.1007/s00005-013-0225-2. [DOI] [PubMed] [Google Scholar]
- La Manna G, Corsini S, Iannelli S, Cappuccilli ML, Comai G, Iorio M, Todeschini P, Carretta E, Scolari MP, Bontadini A, Stefoni S. Influence of the immunogenetic KIR and HLA systems on long-term renal transplant outcome. Ann Transplant. 2013;18:611–621. doi: 10.12659/AOT.889157. [DOI] [PubMed] [Google Scholar]
- Lanier LL. NK cell recognition. Annu Rev Immunol. 2005;23:225–274. doi: 10.1146/annurev.immunol.23.021704.115526. [DOI] [PubMed] [Google Scholar]
- Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
- Legaz I, López-Álvarez MR, Campillo JA, Moya-Quiles MR, Bolarín JM, de la Peña J, Salgado G, Gimeno L, García-Alonso AM, Muro M, Miras M, Alonso C, Álvarez-López MR, Minguela A. KIR gene mismatching and KIR/C ligands in liver transplantation: consequences for short-term liver allograft injury. Transplantation. 2013;95:1037–1044. doi: 10.1097/TP.0b013e318286486c. [DOI] [PubMed] [Google Scholar]
- Lodoen MB, Lanier LL. Natural killer cells as an initial defense against pathogens. Curr Opin Immunol. 2006;18:391–398. doi: 10.1016/j.coi.2006.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Long EO, Rajagopalan S. HLA class I recognition by killer cell Ig-like receptors. Semin Immunol. 2000;12:101–108. doi: 10.1006/smim.2000.0212. [DOI] [PubMed] [Google Scholar]
- Lupski JR, Gonzaga-Jauregui C, Yang Y, Bainbridge MN, Jhangiani S, Buhay CJ, Kovar CL, Wang M, Hawes AC, Reid JG, Eng C, Muzny DM, Gibbs RA. Exome sequencing resolves apparent incidental findings and reveals further complexity of SH3TC2 variant alleles causing Charcot-Marie-Tooth neuropathy. Genome Med. 2013;5:57. doi: 10.1186/gm461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin MP, Gao X, Lee JH, Nelson GW, Detels R, Goedert JJ, Buchbinder S, Hoots K, Vlahov D, Trowsdale J, Wilson M, O’Brien SJ, Carrington M. Epistatic interaction between KIR3DS1 and HLA-B delays the progression to AIDS. Nat Genet. 2002;31:429–434. doi: 10.1038/ng934. [DOI] [PubMed] [Google Scholar]
- Messaoudi I, Estep R, Robinson B, Wong SW. Nonhuman primate models of human immunology. Antioxid Redox Signal. 2011;14:261–273. doi: 10.1089/ars.2010.3241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Middleton D, Gonzelez F. The extensive polymorphism of KIR genes. Immunology. 2010;129:8–19. doi: 10.1111/j.1365-2567.2009.03208.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moreland AJ, Guethlein LA, Reeves RK, Broman KW, Johnson RP, Parham P, O’Connor DH, Bimber BN. Characterization of killer immunoglobulin-like receptor genetics and comprehensive genotyping by pyrosequencing in rhesus macaques. BMC Genomics. 2011;12:295. doi: 10.1186/1471-2164-12-295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ponte M, Cantoni C, Biassoni R, Tradori-Cappai A, Bentivoglio G, Vitale C, Bertone S, Moretta A, Moretta L, Mingari MC. Inhibitory receptors sensing HLA-G1 molecules in pregnancy: decidua-associated natural killer cells express LIR-1 and CD94/NKG2A and acquire p49, an HLA-G1-specific receptor. Proc Natl Acad Sci U S A. 1999;96:5674–5679. doi: 10.1073/pnas.96.10.5674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rice P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. doi: 10.1016/s0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
- Robinson J, Waller MJ, Stoehr P, Marsh SG. IPD--the Immuno Polymorphism Database. Nucleic Acids Res. 2005;33:D523–6. doi: 10.1093/nar/gki032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987;4:406–425. doi: 10.1093/oxfordjournals.molbev.a040454. [DOI] [PubMed] [Google Scholar]
- Sambrook JG, Bashirova A, Andersen H, Piatak M, Vernikos GS, Coggill P, Lifson JD, Carrington M, Beck S. Identification of the ancestral killer immunoglobulin-like receptor gene in primates. BMC Genomics. 2006;7:209. doi: 10.1186/1471-2164-7-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sambrook JG, Bashirova A, Palmer S, Sims S, Trowsdale J, Abi-Rached L, Parham P, Carrington M, Beck S. Single haplotype analysis demonstrates rapid evolution of the killer immunoglobulin-like receptor (KIR) loci in primates. Genome Res. 2005;15:25–35. doi: 10.1101/gr.2381205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitz JE, Korioth-Schmitz B. Immunopathogenesis of simian immunodeficiency virus infection in nonhuman primates. Curr Opin HIV AIDS. 2013;8:273–279. doi: 10.1097/COH.0b013e328361cf5b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shilling HG, Young N, Guethlein LA, Cheng NW, Gardiner CM, Tyan D, Parham P. Genetic control of human NK cell repertoire. J Immunol. 2002;169:239–247. doi: 10.4049/jimmunol.169.1.239. [DOI] [PubMed] [Google Scholar]
- Sussman RW, Tattersall I. Distribution, abundance, and putative ecological strategy of Macaca fascicularis on the island of Mauritius, southwestern Indian Ocean. Folia primatologica. 1986;46:28–43. [Google Scholar]
- Travers KJ, Chin CS, Rank DR, Eid JS, Turner SW. A flexible and efficient template format for circular consensus sequencing and SNP detection. Nucleic Acids Res. 2010;38:e159. doi: 10.1093/nar/gkq543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trowsdale J, Barten R, Haude A, Stewart CA, Beck S, Wilson MJ. The genomic context of natural killer receptor extended gene families. Immunol Rev. 2001;181:20–38. doi: 10.1034/j.1600-065x.2001.1810102.x. [DOI] [PubMed] [Google Scholar]
- Uhrberg M, Valiante NM, Shum BP, Shilling HG, Lienert-Weidenbach K, Corliss B, Tyan D, Lanier LL, Parham P. Human diversity in killer cell inhibitory receptor genes. Immunity. 1997;7:753–763. doi: 10.1016/s1074-7613(00)80394-5. [DOI] [PubMed] [Google Scholar]
- Valiante NM, Uhrberg M, Shilling HG, Lienert-Weidenbach K, Arnett KL, D’Andrea A, Phillips JH, Lanier LL, Parham P. Functionally and structurally distinct NK cell receptor repertoires in the peripheral blood of two human donors. Immunity. 1997;7:739–751. doi: 10.1016/s1074-7613(00)80393-3. [DOI] [PubMed] [Google Scholar]
- Vallender EJ. Expanding whole exome resequencing into non-human primates. Genome Biol. 2011;12:R87. doi: 10.1186/gb-2011-12-9-r87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Bergen J, Thompson A, Haasnoot GW, Roodnat JI, de Fijter JW, Claas FH, Koning F, Doxiadis II. KIR-ligand mismatches are associated with reduced long-term graft survival in HLA-compatible kidney transplantation. Am J Transplant. 2011;11:1959–1964. doi: 10.1111/j.1600-6143.2011.03621.x. [DOI] [PubMed] [Google Scholar]
- Wende H, Volz A, Ziegler A. Extensive gene duplications and a large inversion characterize the human leukocyte receptor cluster. Immunogenetics. 2000;51:703–713. doi: 10.1007/s002510000187. [DOI] [PubMed] [Google Scholar]
- Westbrook CJ, Karl JA, Wiseman RW, Mate S, Koroleva G, Garcia K, Sanchez-Lockhart M, O’Connor DH, Palacios G. No assembly required: Full-length MHC class I allele discovery by PacBio circular consensus sequencing. Hum Immunol. 2015;76:891–896. doi: 10.1016/j.humimm.2015.03.022. [DOI] [PubMed] [Google Scholar]
- Wilson MJ, Torkar M, Haude A, Milne S, Jones T, Sheer D, Beck S, Trowsdale J. Plasticity in the organization and sequences of human KIR/ILT gene families. Proc Natl Acad Sci U S A. 2000;97:4778–4783. doi: 10.1073/pnas.080588597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiseman RW, Karl JA, Bimber BN, O’Leary CE, Lank SM, Tuscher JJ, Detmer AM, Bouffard P, Levenkova N, Turcotte CL, Szekeres E, Wright C, Harkins T, O’Connor DH. Major histocompatibility complex genotyping with massively parallel pyrosequencing. Nat Med. 2009;15:1322–1326. doi: 10.1038/nm.2038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiseman RW, Karl JA, Bohn PS, Nimityongskul FA, Starrett GJ, O’Connor DH. Haplessly hoping: macaque major histocompatibility complex made easy. ILAR J. 2013;54:196–210. doi: 10.1093/ilar/ilt036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yan WH, Lin A, Chen BG, Zhou MY, Dai MZ, Chen XJ, Gan LH, Zhu M, Shi WW, Li BL. Possible roles of KIR2DL4 expression on uNK cells in human pregnancy. Am J Reprod Immunol. 2007;57:233–242. doi: 10.1111/j.1600-0897.2007.00469.x. [DOI] [PubMed] [Google Scholar]
- Yawata M, Yawata N, Draghi M, Little AM, Partheniou F, Parham P. Roles for HLA and KIR polymorphisms in natural killer cell repertoire selection and modulation of effector function. J Exp Med. 2006;203:633–645. doi: 10.1084/jem.20051884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zipperlen K, Gallant M, Stapleton S, Heath J, Barrett L, Grant M. Protective genotypes in HIV infection reflect superior function of KIR3DS1+ over KIR3DL1+ CD8+ T cells. Immunol Cell Biol. 2015;93:67–76. doi: 10.1038/icb.2014.68. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.