Abstract
Immunogenetic data from wild primate populations have been difficult to obtain, due to logistic and methodological constraints. We applied a well-characterized deep sequencing method for MHC I typing, developed for macaques, to a population of wild red colobus to assess the feasibility of identifying MHC I-A/B haplotypes. Ten individuals produced sufficient data from blood and tissue samples to assign haplotypes. Eighty-two sequences were classified as red colobus MHC I alleles distributed across six MHC I loci. Individuals averaged ~13k reads across six MHC I loci, with 83% of all alleles representing either MHC I-A or MHC I-B loci. This study not only represents an important advance in the identification and genotyping of MHC in the endangered red colobus but also shows the potential for using this approach in other endangered wild primates.
Keywords: Major Histocompatibility Complex, next generation sequencing, primates, immunogenetics, red colobus
Technical Note
Red colobus monkeys (genus Procolobus; Grubb et al. 2003; Oates & Davies 1994) are among the most threatened African primates due to hunting and habitat modification/loss (Struhsaker 2005; Ting 2008). While habitat modification can drive disease susceptibility (Brearley et al. 2013), there remains a lack of immunogenetic data for red colobus, which precludes the advancement of disease association studies. Few wild, endangered primates have been characterized for immune genes, including Major Histocompatibility Complex (MHC). MHC genotyping of wild primates remains challenging principally because of logistic constraints and because wild primates such as red colobus are non-model organisms (Sommer et al. 2013).
We tested a well-characterized MHC deep sequencing pipeline, developed for macaques (Karl et al. 2014), in wild red colobus. We genotyped a highly polymorphic 156 base pair region of MHC Class I exon 2. Nine blood samples from Ugandan red colobus (Procolobus rufomitratus tephrosceles) from Kibale National Park were collected in EDTA in 2006 and 2010. One tissue sample from Temminick's red colobus (Procolobus badius temminckii) was collected from Abuko Nature Reserve in the Gambia (Figure 1). Genomic DNA was extracted using QIAamp DNA Mini Kits (Qiagen) following the manufacturers protocol and normalized to 10ng/μl. Primers used were (SBT195-F: GCCTCGCTCTGGTTG- TAGTAG, SBT195-R: GGGCTACGTGGACGACAC; Karl et al. 2014). PCR amplification in 25μl reactions contained 10ng of DNA, 12.5nM of each primer, 100nM of Fluidigm barcoded Illumina adapter primers, 12.5μl 2X Phusion High Fidelity PCR Master Mix (New England BioLabs), and 7.5μl of water. PCR conditions included an initial denaturation for 3 minutes (98°C), followed by 25 cycles of 5-second denaturation (98°C), 10-second annealing (60°C) and 20-second extension (72°C), followed by a final 5-minute extension (72°C). PCR products were purified using an Agilent AMPure XP (Beckman Coulter) bead cleanup. Purified products were quantified using PicoGreen chemistry (Thermo Fisher Scientific). We pooled together equimolar concentrations of each sample and carried out two additional AMPure XP cleaning steps to remove remaining dimers. The library was quantified using the Qubit™ High-Sensitivity dsDNA Assay (Invitrogen) and sequenced as part of a single, paired-end run on an Illumina MiSeq using the 500-cycle MiSeq Reagent Kit v2 (Illumina, San Diego, CA, USA).
Raw Illumina reads were demultiplexed and primer and adapter sequences removed using Flexbar (Dodt et al. 2012). Primers were trimmed in left trim mode with a minimum overlap equal to the length of the primer with no unknown bases allowed. Sequences were then trimmed from the right to a maximum length of 156 base pairs. Paired-end reads were merged using FLASH under default settings (Magoc & Salzberg 2011). Merged reads were mapped against a curated database of known macaque MHC class I alleles from the Wisconsin National Primate Research Center (WNPRC) using Bowtie, allowing zero mismatches (Langmead et al. 2009). Mapped reads were removed as contamination (<0.0023%), and unmapped reads were retained as red colobus sequences. Sequences were searched against the WNPRC macaque database using BLAST in Geneious Pro v. 6.1.2 (Biomatters) allowing a 1% mismatch (Figure 2). Sequences were assigned to a locus based on closest match in the macaque database. A minimum threshold of 100x coverage was used, and we retained only sequences that matched the expected sequence size (156) ± 3 base pairs in order to remove sequences in which a reading frame shift had occurred. We translated the sequences and searched for stop codons using the second base position as the start of the reading frame, and sequences containing a stop codon were designated as pseudogenes. Alleles with a Mean Per Amplicon Frequency (MPAF) <0.01 were visually checked for chimeras and sequencing errors (no chimeras identified, one allele removed as sequencing error) following Zagalska-Neubauer et al. (2010).
In total, 82 alleles (including two pseudogenes) were identified that mapped closely to six macaque (Macaca mulatta) MHC I loci (DRYAD DOI). Haplotypes were determined based on patterns of allele sharing across individuals (see Supplemental Material S1). The majority of haplotypes represented MHC I-A (6 haplotypes, 16 alleles) and MHC I-B (7 haplotypes, 41 alleles), suggesting that, like macaques, MHC I has undergone extensive duplication events in red colobus. Number of reads per individual ranged from 7,332-34,611 (average = 12,156). Six Ugandan red colobus were homozygous for the A001a, one was homozygous for A002, and three were heterozygotes. A005 and A006 were restricted to Temminick's red colobus. One Ugandan red colobus was homozygous for B003, and eight were heterozygous. B007 was restricted to Temminick's red colobus.
We have demonstrated that a protocol for genotyping MHC I in macaques can be applied successfully to a non-model primate, the red colobus. This finding shows that rapid and accurate MHC genotyping in endangered primates is currently feasible, given the availability of suitable biomaterials. Because our primers were designed for a distantly related Old World monkey, we predict that this approach will be successful across the family Cercopithecidae. Applying our method more broadly should facilitate studies of the genetic determinants of disease susceptibility as well as assessments of immunogenetic diversity in isolated primate populations.
Supplementary Material
References
- Brearley G, Rhodes J, Bradley A, Baxter G, Seabrook L, Lunney D, Liu Y, McAlpine C. Wildlife disease prevalence in human-modified landscapes. Biological Reviews. 2013;88:427–442. doi: 10.1111/brv.12009. [DOI] [PubMed] [Google Scholar]
- Dodt M, Roehr JT, Ahmed R, Dieterich C. FLEXBAR-flexible barcode and adapter processing for next-generation sequencing platforms. Biology. 2012;1(3):895–905. doi: 10.3390/biology1030895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grubb P, Butynski TM, Oates JF, Bearder SK, Disotell TR, Groves CP, Struhsaker TT. Assessment of the Diversity of African Primates. International Journal of Primatology. 2003;24:1301–1357. [Google Scholar]
- Karl JA, Heimbruch KE, Vriezen CE, Mironczuk CJ, Dudley DM, Wiseman RW, O'Connor DH. Survey of major histocompatibility complex class II diversity in pig-tailed macaques. Immunogenetics. 2014;66(11):613–623. doi: 10.1007/s00251-014-0797-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magoc T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27(21):2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oates J, Davies G. Behaviour and Evolution. Cambridge University Press; Cambridge: 1994. Colobine Monkeys: Their Ecology; p. 415. [Google Scholar]
- Sommer S, Courtiol A, Mazzoni CJ. MHC genotyping of non-model organisms using next-generation sequencing: a new methodology to deal with artefacts and allelic dropout. BMC genomics. 2013;14(1):542. doi: 10.1186/1471-2164-14-542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Struhsaker TT. Conservation of Red Colobus and Their Habitats. International Journal of Primatology. 2005;26:525–538. [Google Scholar]
- Ting N. Ph.D. Dissertation. City University of New York Graduate Center; 2008. Molecular Systematics of Red Colobus Monkeys (Procolobus [Piliocolobus]): Understanding the Evolution of an Endangered Primate. [Google Scholar]
- Zagalska-Neubauer M, Babik W, Stuglik M, Gustafsson L, Cichoń M, Radwan J. 454 sequencing reveals extreme complexity of the class II Major Histocompatibility Complex in the collared flycatcher. BMC evolutionary biology. 2010;10(1):395. doi: 10.1186/1471-2148-10-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.