Abstract
African bovine trypanosomiasis caused by Trypanosoma sp., is a major constraint on cattle productivity in sub-Saharan Africa. Some African Bos taurus breeds are highly tolerant of infection, but the potentially more productive Bos indicus zebu breeds are much more susceptible. Zebu cattle are well adapted for plowing and haulage, and increasing their tolerance of trypanosomiasis could have a major impact on crop cultivation as well as dairy and beef production. We used three strategies to obtain short lists of candidate genes within QTL that were previously shown to regulate response to infection. We analyzed the transcriptomes of trypanotolerant N'Dama and susceptible Boran cattle after infection with Trypanosoma congolense. We sequenced EST libraries from these two breeds to identify polymorphisms that might underlie previously identified quantitative trait loci (QTL), and we assessed QTL regions and candidate loci for evidence of selective sweeps. The scan of the EST sequences identified a previously undescribed polymorphism in ARHGAP15 in the Bta2 trypanotolerance QTL. The polymorphism affects gene function in vitro and could contribute to the observed differences in expression of the MAPK pathway in vivo. The expression data showed that TLR and MAPK pathways responded to infection, and the former contained TICAM1, which is within a QTL on Bta7. Genetic analyses showed that selective sweeps had occurred at TICAM1 and ARHGAP15 loci in African taurine cattle, making them strong candidates for the genes underlying the QTL. Candidate QTL genes were identified in other QTL by their expression profile and the pathways in which they participate.
Keywords: nagana, positional cloning, sustainable agriculture, trypanosomosis
African trypanosomes are extracellular protozoan parasites that cause severe diseases in humans and livestock, usually with fatal consequences unless treated. Though Trypanosoma brucei spp. cause significant morbidity and mortality in humans, infections with Trypanosoma congolense and Trypanosoma vivax are among the most significant constraints on cattle production in Africa, causing major economic losses, which in turn have serious consequences for human health and welfare (1). Anemia is the most prominent and consistent clinical sign of infection and is the main indicator for treatment rather than parasitemia, which is highly variable, particularly in indicine cattle.
Some African Bos taurus cattle breeds, such as N'Dama, are tolerant of infection with T. congolense, remaining apparently healthy despite the presence of parasites. This capacity to remain productive while harboring potentially lethal trypanosome infections is known as trypanotolerance (2) and is an evolutionary adaptation between the host and pathogen (3). N'Dama cattle are native to West Africa, where they were introduced about 3,000 y ago (4), whereas the introduction of zebu (Bos indicus) cattle to the region is recent, probably only intensifying after a rinderpest epidemic just over a century ago (5).
Although N'Dama can tolerate moderate levels of trypanosome challenge, their temperament and relatively small size makes them unsuited for draft purposes. The use of cattle for pulling plows and carts can dramatically increase crop production, as well as provide, milk, meat, and manure. This loss of draft power means that many poor African farmers are dependent on the heavy labor of hand tillage, and suffer from loss of meat and milk production. Development of breeds of trypanotolerant cattle that are better adapted to the diverse needs of African farmers could substantially improve productivity in the whole agricultural system.
We are seeking to identify the genetic determinants of trypanotolerance in N'Dama with a view to introgressing this trait into zebu breeds. Ten major quantitative trait loci (QTL) that control trypanotolerance in cattle have been genetically mapped by our group using an F2 cross between Boran (zebu) and N'Dama cattle (6). However, the positions of the loci have been resolved only to within ±20 cm and lie in gene-rich regions that lack obvious functional candidates, or contain numerous genes with similar potential. In the present study we have used transcriptome analyses to identify pathways that are responding to infection in the liver, spleen, and precrural lymph nodes. We have combined these data with three different genetic analyses to identify mutations and signatures of selective sweeps in genes in those pathways that might regulate the differences in response to infection. This data has been combined with prior knowledge from our linkage mapping studies and genetic and expression analyses of O'Gorman et al. (7) and Gautier et al. (8) to identify candidate genes in the five QTL of largest effect that have also been mapped to sufficiently small chromosomal regions to make the identification of candidate genes practicable.
Results
Candidate Quantitative Trait Genes.
Hanotte et al. (6) localized QTL for parasitemia, body weight, and packed cell volume after T. congolense infection in a Boran × N'Dama F2 cross. We used four strategies to identify candidate quantitative trait genes (QTG) within the large QTL regions. First, we screened an EST library for nonsynonymous SNP in genes within the QTL; second, we used gene expression data from spleen, liver, and lymph nodes at nine time points to identify pathways that responded to infection and genes within those pathways that were in QTL; third, we undertook a genetic screen of 17 loci to discover evidence of selection; and fourth, we screened two QTL using published SNP data from the bovine HapMap for signatures of selection.
EST Library Screen Identifies an ARHGAP15282H→P Mutation in the Bta2 QTL.
We obtained lists of genes that are in QTL regions and then screened EST libraries made from four tissues from N'Dama and Boran for SNP within those genes. This procedure identified two synonymous SNP and one nonsynonymous SNP (nsSNP) in the ARHGAP15 (Rho GTPase-activating protein 15) gene. No other nsSNPs were discovered in the other 30 genes in the Bta2 QTL. The nsSNP causes an H→P mutation at ARHGAP15282 and was predicted to be “probably damaging” by polymorphism phenotyping (PolyPhen) (9). Boran cattle carry the ancestral H allele at high frequency (0.71, n = 28), which is conserved in humans, pigs, chickens, and salmon, the P allele appears fixed in N'Dama from Guinea (n = 30). We have not found any records of the P allele in other species. ARHGAP15 is a RAC binding protein, and the mutation is at the proximal end of the RAC binding domain (PS50238) of the protein (10), suggesting a possible mechanism for any effect of the mutation on function. RAC1 is a GTPase that is active when binding GTP and inactive when binding GDP; the RhoGAP domain of ARHGAP15 accelerates the intrinsic rate of hydrolysis of GTP to GDP, thereby inactivating RAC1 (10). To determine whether the ARHGAP15282H→P polymorphism might cause a difference in the rate of GTP hydrolysis by RAC1, the two alleles were expressed in vitro, and the rate of GTP hydrolysis was measured by following the change in absorbance at 360 nm caused by an increase in γPi abundance (Fig. 1). Both alleles significantly increased the rate of GTP hydrolysis by RAC1 over the rate of the hydrolysis by RAC1 alone; however, the ARHGAP15282P allele increased the rate of hydrolysis by 21% more than the ARHGAP15282H allele over three replicate protein extractions and activity assays (F3, 25.0.9 = 56.40, P < 0.0001).
Fig. 1.
Absorbance profile for the intrinsic RAC1, ARHGAP15282H, and ARHGAP15282P stimulated GTPase reactions and their fitted profiles. The 95% confidence limits were very small and are shown as a fine dotted line close to the fitted line.
Response of the Transcriptomes of N'Dama and Boran to Infection.
Twenty-five animals of each breed were included in the expression analysis experiment. Five animals of each breed were killed before infection, and on days 21 and 35 postinfection (pi) for collection of liver, spleen, and precrural lymph node samples. Needle liver biopsies were taken from five animals of each breed at six additional time points (days 12, 15, 18, 26, 29, and 32 pi). Ten animals of each breed were only used for needle biopsies.
Analysis of the expression data focused on identification of KEGG pathways (11) that either responded to infection or differed between breeds. To identify candidate genes in each QTL we used a Taverna workflow to obtain a list of all genes that were in the KEGG pathways that contained an excess of genes (P < 0.05) that either changed over time in response to infection or were differentially expressed between breeds. The workflow then obtained a list of all genes within the QTL region from Ensembl (Assembly Btau3) irrespective of whether they were differentially expressed (12–14). These two lists were then scanned for genes that were in a responding pathway and in a QTL region; such genes were considered as candidate QTG. The most significant (P < 0.05) pathways and genes that were identified in this way for the five QTL of largest effect are shown in Table 1.
Table 1.
Pathways that responded to infection in liver, spleen, or lymph node (P < 0.05 for at least one condition), and genes within each pathway also within each of the QTL
| QTL location and trait affected (4) |
||||||
| KEGG pathway ID | KEGG pathway name | BTA2 anemia | BTA4 parasitemia | BTA7 parasitemia and anemia | BTA16 anemia | BTA27 anemia |
| 04660 | T-cell receptor signaling pathway | CD28 | VAV1 | IKBKB | ||
| CTLA4 | ||||||
| ICOS | ||||||
| 04620 | Toll-like receptor signaling pathway | CASP8 | IRAK1 | IKBKB | ||
| TRIF | ||||||
| 04670 | Leukocyte transendothelial migration | VAV1 | CLDN23 | |||
| 04514 | Cell adhesion molecules (CAMs) | CD28 | MADCAM1 | CLDN23 | ||
| CTLA4 | ICAM1 | |||||
| NLGN1 | ICAM3 | |||||
| ICOS | ||||||
| 04640 | Hematopoietic cell lineage | EPOR | ||||
| FCER2 | ||||||
| 04210 | Apoptosis | CASP8 | IRAK1 | CAPN2 | IKBKB | |
| PRKACA | ||||||
| 00561 | Glycerolipid metabolism | LCT | DGKI | AGPAT6 | ||
| 00564 | Glycerophospholipid | DGKI | ARD1A | AGPAT6 | ||
| 04110 | Cell cycle | ORC4L,MCM6 | CDKN2D | |||
| 04010 | MAPK signaling pathway | CASP8 | CASP2 | PRKACA | DUSP10 | BRAF |
| ECSIT | DUSP4 | |||||
| IKBKB | ||||||
| FGF20 | ||||||
| 03010 | Ribosome | RPL36 | ||||
| RPS28 | ||||||
| 04810 | Regulation of actin cytoskeleton | FN1 | CHRM2 | VAV1 | BRAF | |
| PIP5K3 | FGF20 | |||||
| 04540 | Gap junction | TUBB4 | GNAQ | |||
| PRKACA | ||||||
| 04510 | Focal adhesion | FN1 | ZYX | VAV1 | CAPN2 | BRAF |
| COL5A3 | ||||||
| 00480 | Glutathione metabolism | IDH1 | GSTK1 | |||
| 04520 | Adherens junction | INSR | FGFR1 | |||
| 01510 | Neurodegenerative disorders | CASP8 | ||||
| 04720 | Long-term potentiation | PRKACA | GNAQ | BRAF | ||
| 04310 | Wnt signaling pathway | PRKACA | DKK4 | |||
| SFRP1 | ||||||
| 04360 | Axon guidance | EFNB1 | EPHA1 | UNC5D | ||
| EPHB6 | ||||||
| 04020 | Calcium signaling pathway | CHRM2 | PTGER1 | GNAQ | ADRB3 | |
| PRKACA | GNA14 | VDAC3 | ||||
| ITPKB | ||||||
The genes within QTL that are listed were not necessarily differentially expressed, because the QTL effect on the pathway could be a consequence of a structural polymorphism that does not affect expression. Note that ARHGAP15 does not appear in this list only because KEGG have yet to include it in the MAPK pathway. A list of complete gene names is available in Table S1. [Reproduced with permission from ref. 31 (Copyright 2008, Karger, Basel).]
TICAM1 (TRIF), IKBKB (IKKB), ECSIT, VAV1, and CASP8 were under QTL and in pathways that responded to infection (Table 1). The first four genes are important in the NFKB pathway; ECSIT bridges TRAF6 to MAP3K1, and the MAPK pathway and TICAM1 participates in the MYD88-independent alternative pathway (15). We found evidence that TICAM1 has been under positive selection in West African taurine cattle (see below), suggesting that this gene may be an important regulator of response to infection by altering the balance between the MYD88 classical TLR response and the MYD88-independent alternative response (16). Differences in TICAM1 activity could be causing the differences we see in the expression of some or all of the other genes in this pathway.
VAV1 is an agonist for RAC1and is most highly expressed in CD14+ monocytes (17); in contrast, ARHGAP15 is an antagonist for RAC1. VAV1 was twofold more highly expressed in the liver of Boran than N'Dama, and if regulated by SNP within or close to the gene, then VAV1 is a candidate QTL gene; however, it may also be indirectly regulated as a consequence of the difference in activity of ARHGAP15. VAV1 was resequenced in Boran and N'Dama, but no structural differences were observed.
Responding KEGG Pathways.
We also obtained lists of responding pathways, irrespective of whether they contained genes within QTL, to obtain a more comprehensive view of the response to infection. Five immune-related categories were strong responders in the liver of both breeds: natural killer (NK) cell-mediated cytotoxicity, T-cell receptor signaling pathway, B-cell receptor signaling pathway, Fc epsilon RI signaling pathway, and leukocyte transendothelial migration (Dataset S1). Of the immune-related categories, NK cell-mediated cytotoxicity was by far the most significantly overrepresented class of genes, and showed two- to fourfold up-regulation of all genes that were differentially expressed (Fig. 2). This up-regulation is most likely a marker for increased NK cell abundance rather than increased gene expression by resident NK cells, and corresponds to a lymphocyte-rich inflammatory reaction that was apparent on histological examination of the liver. The MAPK pathway, which is highly expressed in NK cells, was particularly strongly up-regulated and contains VAV1 and RAC1. MAPK pathway gene expression increased strongly and consistently in both breeds, suggesting that it has an important role in the response to infection. ARHGAP15 acts as a negative regulator of RAC1, which is a key mediator in this pathway. The combined effect of higher expression of VAV1, a RAC1 agonist, in Boran and higher activity of ARHGAP15, a RAC1 anatagonist, in N'Dama might lead to a stronger MAPK-mediated inflammatory response in Boran than N'Dama.
Fig. 2.
KEGG NK cell pathway (hsa04650) in the liver showing up-regulation (P < 0.05, fold change > 2; black); no significant change (gray); not represented in array (white). No genes were down-regulated. Within each block, conditions are represented in the following order: (i) N'Dama change from day 0–21; (ii) N'Dama change from day 0–35; (iii) Boran change from day 0–21; (iv) Boran change from day 0–35. Multiple receptors are up-regulated, and the MAPK pathway from the receptors ITGAL/ITGB2 through VAV1, RAC, and PAK1 are all consistently up-regulated in both breeds.
Pathways Associated with Differentially Expressed Genes.
The above analysis was undertaken on genes that differed between breeds as well as those that that did not differ between breeds but changed in expression in response to infection. The genes that differed in expression between the breeds were tested separately to identify the KEGG pathways that differed in response. Cytokine-cytokine receptor interaction, MAPK signaling pathway, and neurodegenerative disease were the only pathways containing an excess of differentially expressed genes and were only differentially expressed in the lymph nodes (Dataset S1). The presence of the MAPK pathway in the list of pathways associated with differentially expressed genes as well as the pathways associated with responding genes emphasizes its potential importance and is consistent with observations from previous studies of PBMC from the same animals (7).
Signatures of Selection Around TICAM1 and ARHGAP15.
The trypanosomiasis challenge of West Africa has undoubtedly been a source of selective pressure on cattle as they spread from the largely trypanosome-free Sahel into intense challenge in West Africa within the last 3,000 y (18). Trypanotolerant populations might be expected to show signatures of selection around genetic variants responsible for that trait. Therefore, we used the data from the Bovine SNP consortium to test the QTL that contained ARHGAP15 and TICAM1 for evidence of a selective sweep. The data included the East African Sheko and West African N'Dama, but not Boran. Sheko is commonly considered a taurine breed, but the HapMap data indicates that the majority of alleles are of zebu origin (19). We separately tested for a signature of selection at TICAM1 in a panel of B. indicus and European and African B. taurus breeds.
Selective sweeps can influence wide areas and local evidence might sometimes originate from selection on linked loci. However, the cross-population extended haplotype homozygosity (XP-EHH) test (20) for extended chromosome segment variants that have reached high frequency suggested narrow selective sweeps in Sheko around ARHGAP15, but not around the TICAM1 gene (Fig. 3). For the 0.5-megabase (Mb) segment with ARHGAP15, the XP-EHH score was in top 2.3% in the genome, but TICAM1 was not an outlier, being only in the top 22%. The ARHGAP15 sweep area included only one other gene, glycosyltransferase-like domain containing 1 (GTDC1), which shared the same signal of selective sweep. GTDC1 is ubiquitously expressed and could conceivably affect the response to infection, but there was no other evidence to implicate it. Use of other reference breeds (Brahman, Gir, Holstein, and Jersey) resulted in similar signals for positive selection at these QTL, although the peak shapes demonstrated some variation. These results provide evidence for a rapid completed positive sweep at ARHGAP15 in West African B. taurus.
Fig. 3.
Standardized cross-population extended haplotype homozygosities (XP-EHH) >10 Mb around ARHGAP15 (A) and TICAM1 (B) whose location in the center is marked with an orange column. Extreme positive values imply positive selective sweep in N'Dama, and negative values suggest sweep in Sheko. Genome-wide first and 99th percentiles across all XP-EHH scores are indicated with dotted line; fifth and 95th percentiles are indicated with hashed line. Alternating pale green and blue bars indicate other genes in the region (excluding hypothetical unrecognized genes). ARHGAP15 is in a gene-poor region with a broad peak of positive selection, whereas TICAM1 is in a gene-rich region with a narrow peak of selection that does not reach the 95% confidence threshold. At ARHGAP15 and TICAM1, the median intermarker intervals were 107 and 87 kb, respectively.
We genotyped multiple animals of each breed to identify the frequencies of the resistant ARHGAP15282P allele and found Guinea N'Dama 1.0 (n = 30), Sheko 0.67 (n = 18), and Kenyan Boran 0.29 (n = 28). Although it is not easy to quantify the trypanotolerance of these animals, these frequencies do at least correlate with how these breeds are ranked for this trait, and in the case of Sheko and Boran are about twice the allele frequencies that would be expected from the percentage of taurine alleles in their genomes in the HapMap data (19).
We also investigated TICAM1 for a signature of selection by resequencing the gene plus neighboring noncoding regions in a panel of 10 African B. taurus, 18 European B. taurus, and 11 Indian B. indicus. This gene exhibited outlying behavior with 17 segregating nsSNPs, two of which were scored as “probably damaging” by PolyPhen (TICAM1268P→Land TICAM1294P→L) and are therefore likely to have an effect on function. Seven of the coding changes in TICAM1 define two highly divergent segregating haplotypes. One of these predominates in the African sample and shows a pronounced excess of derived alleles, reflected in a highly significant value when Fay and Wu's H test of neutrality is applied to the African population (H = −19.17, P = 0.001) but not to the European or Indian populations. The locus shows a high level of linkage disequilibrium throughout the 4 kbp sequenced. Interestingly, each of these behaviors was extreme when viewed within the context of results for a total of 18 loci that were wholly or partially resequenced in similar continental cohorts (21, 22) (Table S2). These included TNFAIP8L1, a second immunogene also targeted from under the chromosome 7 QTL peak, which gave no outlying behavior, plus several other loci of immunological consequence (e.g., several interleukins and Toll-like receptors), some of which also give indications of adaptive history. These population genetic values at TICAM1 provide evidence for a local selective sweep within African B. taurus.
Discussion and Conclusion
This combination of genetic, in vitro analysis and expression data has provided unique insights into the differences between susceptible Boran and tolerant N'Dama cattle in their response to infection with T. congolense. We see clear evidence of differential innate responses early in the infection, apparently mediated by sequence differences in previously identified QTL regions. The pathway analysis suggested an important role for NK cells in the response to infection in both breeds and differential expression of the MAPK pathway. We have also identified a copy number variation in the murine Cd244 NK cell receptor gene as a candidate QTL gene for response to T. congolense infection in mice, suggesting that NK cells may play an important role in the response to infection in mice as well as cattle (23). A map kinase (MAPK14) was differentially expressed in two-color spotted array study of PBMC obtained from the same animals at the same time points and may be indicative of the MAPK pathway being differentially expressed in PBMC as well (7).
There is compelling evidence that a nonsynonymous sequence difference in the ARHGAP15 gene is playing an important role in the response and that it is the sequence variant that gives rise to the QTL on Bta2 in the mapping population. The putatively tolerant allele has altered behavior in vitro relative to the typical B. taurus allele, and it would be expected to inhibit RAC1 activity in the MAPK pathway, which could lead to the observed differences in expression or amplify downstream expression differences caused by other factors. There is also evidence for a selective sweep at this locus that includes just one other gene.
Similarly, KEGG pathway analysis of expression data identified TICAM1 in the Toll-like receptor pathway as a candidate gene on Bta7, and genotyping identified TICAM1 as being under positive selection in African taurine cattle. TICAM1 contained two nsSNP that would probably significantly affect function. An independent whole genome scan with 36,000 SNP for signatures of selection in West African cattle recently identified TICAM1 as a major hub in a network of genes under selection (8). Therefore, one analysis presented here and an independent study both indicate that TICAM1 is under selection in at least some African cattle breeds, including N'Dama, although there was no evidence from the XPEHH test for a sweep at TICAM1 in Sheko. Trypanosomiasis is probably the largest disease constraint on cattle in sub-Saharan Africa, and this fact alone makes it likely that it is trypanosomiasis that has been driving this selection. In addition, TICAM1 is also under the peak of a QTL associated with lower parasitemia in N'Dama, and our systematic analysis of expression data flagged TICAM1 as being in a pathway responding to infection with T. congolense. These independent lines of evidence make it a strong candidate gene at this locus. In this case, two nonsynonymous differences between the resistant and tolerant alleles are predicted to significantly change the gene's characteristics.
Trypanotolerance is an economically important trait of great biological interest, and an understanding of its mechanisms has the potential to transform cattle-keeping in tsetse-affected areas of Africa. However, trypanotolerance is a complex quantitative trait, with strong interactions with environmental and other disease factors (24). Identification of quantitative trait genes is particularly difficult because trypanosomes are free-living bloodstream parasites that interact with all tissues, causing a generalized response with cachexia and anemia but no pathognomonic signs that could be related to particular pathways or cell types. We have used microarrays to give us objective signatures of infection, and we have used these to identify the major pathways that respond to infection. We have investigated two genes that are in these pathways and under QTL by additional methods and show that both ARHGAP15 and TICAM1 appear to have functional polymorphisms that could affect the response to infection and are associated with signatures of selection. This is persuasive evidence that these genes are QTL genes, but definitive proof of their role in determining trypanotolerance status must await either a transgenic approach or a breeding program, which exploits them and in the process identifies recombinants. Such a breeding program could also produce cattle that could significantly increase the productivity of arable as well as livestock-based farming systems by making draft power available to all African farmers.
Methods
Animals, Design, Infection, and Sample Collection.
All animal experiments were conducted in compliance with the International Livestock Research Institute (ILRI) ethical review process. Twenty-five susceptible (Boran) and 25 trypanotolerant (N'Dama) cattle were selected from the herd at the ILRI Kapiti Plains ranch, which is free from tsetse flies and trypanosomiasis. All animals were screened and confirmed to be negative for infection with tick-borne parasites before being transferred to the ILRI research facility at Kabete. On arrival, the cattle were treated with an anthelmintic (albendazole), sprayed with acaricide, and quarantined for 2 wk. They were allowed to acclimatize for at least 6 wk and maintained on a diet of hay supplemented with minerals and concentrates.
Cattle were infected by tsetse-mediated inoculation of trypanosomes with T. congolense IL1180 as previously described (25), and infection was confirmed by microscopy. Tissue specimens were collected by biopsy (liver) or at postmortem (liver, spleen, and precrural lymph nodes). For biopsy collection, animals were sedated with 0.5–0.7 mL of 20% xylaxine, and locally anesthetized with 10–15 mL of lignocaine. Liver biopsy samples (75 mg) were collected using Quick-Core biopsy needles through an incision made between the 10th and 11th ribs.
All tissues were collected into cryotubes and preserved in liquid nitrogen. The entire experiment was performed in four phases over 4 mo. Total RNA was extracted from tissues pulverized under liquid nitrogen using the TRIzol method (Gibco). Total RNA was treated with DNaseI, purified with RNAeasy (Qiagen), and checked for RNA integrity number >6.3 using an Agilent Bioanalyzer 2000.
Samples of liver and spleen collected into cryotubes for histopathology were postfixed in neutral buffered formalin, sectioned at 6 μm, and stained with H&E for light microscopy.
Expression data were acquired on Affymetrix Bovine Genome arrays, which have probesets for 23,000 transcripts. Samples were hybridized in random order. A total of 5 μg of RNA preparation was labeled using the One-Cycle Eukaryotic Target labeling protocol (Affymetrix), hybridized to Bovine Genome array chips, and scanned with an Affymetrix Gene Chip 3000 scanner. Any sample that failed the labeling or hybridization data quality control using dChip (26) or principal component analysis using SVD in the MAXD package (27) was discarded and the procedure repeated on a new RNA preparation from the original tissue sample. A total of 160 RNA samples passed quality control and the data were normalized using robust multichip average. Additionally, sets of 25-mer probes for each gene represented on the array were identified using AffyProbeMiner (28) and normalized in the R environment using multi-mgMOS (29).
The DAVID database (30) was used to identify responding KEGG pathways (Dataset S1). A Taverna workflow (12–14) was used to identify KEGG pathways that responded to infection and contained genes in QTL regions as previously described (31).
Comparisons between breeds were based on the change in expression over time for each breed as calculated by Eq. 1. This strategy favors detection of KEGG categories that differ in response to infection rather than having baseline differences in expression. Lists of genes for which |d| > 2 was submitted to DAVID to identify KEGG categories that contained an excess of differentially expressed genes. The threshold of |d| > 2 corresponds to 2.0 SDs of d over the entire dataset (Dataset S1).
In Eq. 1, letters indicate breeds (N, N'Dama; B, Boran) and numbers indicate days pi. Each alphanumeric symbol indicates the mean absolute log2 signal from the five replicates of a given gene for a given condition. Hence N21 – N0 is the log ratio of expression for a given gene at day 21 pi vs. that at day 0 pi in N'Dama.
Preparation of cDNA Libraries and EST Sequencing.
RNA was prepared from liver, lymph node, spleen, and bone marrow of three Boran cattle (BW064 uninfected, BW062 slaughtered at day 15 pi, and BW063 day 50 pi) and three N'Dama cattle (ND159 uninfected, ND161 day 15 pi, and ND160 day 50 pi). Tissues were submitted to GATC Biotech Ltd. for RNA extraction, cDNA synthesis and normalization, and cloning. Libraries were constructed for each tissue, and tissue was identified by linker sequences. Five thousand clones were picked from each library at the Roslin Institute and submitted to the Sanger Centre for bidirectional sequencing. Vector-trimmed and quality-clipped sequences were deposited at European Molecular Biology Laboratory under accession nos. AM003907–AM039430.
ARHGAP15 Cloning and Hydrolysis Rate.
ARHGAP15 RT-PCR products from N'Dama and Boran liver cDNA were cloned into the pGEX-4T1 expression vector. GST-tagged protein was purified, and the rate of hydrolysis of GTP in the presence of the Boran or N'Dama alleles was followed by measurement of absorbance at 360 nm. A semiparametric generalized linear mixed model was used to model the time series of absorbances to allow for possible nonlinear and disparate trends for the intrinsic RAC1, ARHGAP15 (H), and ARHGAP15 (P) stimulated GTPase reactions and to test for differences in expected absorbance across the three experiments. The model assumed a normal error distribution and the identity link function. The trend model included the experiment as a fixed effect and a B-spline smoothing covariance structure with three equally spaced interior knots (32). The model was fitted by restricted log pseudolikelihood within a mixed-model framework in the SAS GLIMMIX procedure (33).
Cross-Population Extended Haplotype Homozygosity Test for Positive Sweeps in N'Dama.
Published cattle HapMap SNP data for N'Dama and Sheko was used for exploring signatures of positive selection in N'Dama using the East African Sheko breed as the reference population. The cattle HapMap data consisted of 21,034 SNP after exclusion of poorly supported SNP, and included 25 N'Dama and 20 Sheko individuals (19). The median interval between markers was 65 kb, which increased to 85 kb if the three most densely genotyped chromosomes (6, 14, and 25) were excluded. At ARHGAP15 and TICAM1, the median intervals were 107 and 87 kb, respectively. Haplotyping of 35-Mb segments was done using Shape-IT (34). Standardized XP-EHH values (20) were calculated as in Pickrell et al. (34). Exclusion of related animals had no noticeable effect. The data were specifically tested for positive sweeps at ARHGAP15 and TICAM1, and a genome wide analysis was done only to provide the distribution of experimental P values; therefore, it was not necessary to correct P values for multiple testing.
Sequencing and Population Genetic Analysis of TICAM1.
A panel of 10 African (Ndama, Somba, Lagune) B. taurus, 18 European (Aberdeen Angus, Friesian, Norwegian Red, German Black, Highland, Alentejana, Mertolenga, Romagnola, Sikias, Anatolian Black) B. taurus, and 11 Indian (Hariana, Tharparkar, Sahiwal, Ongole) B. indicus were sequenced as previously described (21). Plains bison (Bison bison) was used as an outgroup. The data were analyzed using Fay and Wu's H test (35).
Additional Files and Resources.
All expression data are available from ArrayExpress under accession no. E-MEXP-1778. Plots of the expression from each probe set on the array are also available from the authors’ Web site at http://www.genomics.liv.ac.uk/tryps/resources.html.
Supplementary Material
Acknowledgments
We thank the staff of the International Livestock Research Institute Large Animal Unit for expert technical assistance, and the International Atomic Energy Agency for making the Sheko genotype data available. This work was supported by Wellcome Trust Grant GR066764MA (to S.J.K.), Science Foundation Ireland Grant 02-IN.1-B256 (to D.G.B., A.F., and C.M.), and Ondex Systems Approach to Biological Research (SABR) Grant BB/F006012/1 (to P.F.).
Footnotes
The authors declare no conflict of interest.
*This Direct Submission article had a prearranged editor.
Data deposition: Array data were deposited in ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) under accession no. E-MEXP-1778. SNP were deposited in dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/) under ssIDs 196313094–196313199 and 196003459–196003467. EST sequences are available from EMBL (http://www.ebi.ac.uk/embl/) under accession nos. AM003907–AM039430.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1013486108/-/DCSupplemental.
References
- 1.Kristjanson PM, Swallow BM, Rowlands GJ, Kruska RL, de Leeuw PN. Measuring the costs of African animal trypanosomosis, the potential benefits of control and returns to research. Agric Syst. 1999;59:79–98. [Google Scholar]
- 2.Murray M, Trail JC, Davis CE, Black SJ. Genetic resistance to African Trypanosomiasis. J Infect Dis. 1984;149:311–319. doi: 10.1093/infdis/149.3.311. [DOI] [PubMed] [Google Scholar]
- 3.Mattioli RC, Wilson RT. Trypanosomes, tsetse and trypanotolerance: Coevolution in tropical Africa. Parassitologia. 1996;38:531–535. [PubMed] [Google Scholar]
- 4.Hassan F. Climate and cattle in North Africa: A first approximation. In: Blech R, MacDonald K, editors. The Origins and Development of African Livestock: Archaeology, Genetics, Linguistics and Ethnography. London: UCL Press; 2000. [Google Scholar]
- 5.Epstein H. The Origin of the Domestic Animals of Africa. New York: Africana; 1971. [Google Scholar]
- 6.Hanotte O, et al. Mapping of quantitative trait loci controlling trypanotolerance in a cross of tolerant West African N'Dama and susceptible East African Boran cattle. Proc Natl Acad Sci USA. 2003;100:7443–7448. doi: 10.1073/pnas.1232392100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.O'Gorman GM, et al. Transcriptional profiling of cattle infected with Trypanosoma congolense highlights gene expression signatures underlying trypanotolerance and trypanosusceptibility. BMC Genomics. 2009;10:207. doi: 10.1186/1471-2164-10-207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gautier M, et al. A whole genome Bayesian scan for adaptive genetic divergence in West African cattle. BMC Genomics. 2009;10:550. doi: 10.1186/1471-2164-10-550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ramensky V, Bork P, Sunyaev S. Human non-synonymous SNPs: Server and survey. Nucleic Acids Res. 2002;30:3894–3900. doi: 10.1093/nar/gkf493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Seoh ML, Ng CH, Yong J, Lim L, Leung T. ArhGAP15, a novel human RacGAP protein with GTPase binding property. FEBS Lett. 2003;539:131–137. doi: 10.1016/s0014-5793(03)00213-8. [DOI] [PubMed] [Google Scholar]
- 11.Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32(Database issue):D277–D280. doi: 10.1093/nar/gkh063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Fisher P, et al. A systematic strategy for large-scale analysis of genotype phenotype correlations: Identification of candidate genes involved in African trypanosomiasis. Nucleic Acids Res. 2007;35:5625–5633. doi: 10.1093/nar/gkm623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Oinn T, et al. Taverna: Lessons in creating a workflow environment for the life sciences. Concurr Comput. 2006;18:1067–1100. [Google Scholar]
- 14.Hull D, et al. Taverna: A tool for building and running workflows of services. Nucl Acids Res. 2006;34(Suppl 2):W729–W732. doi: 10.1093/nar/gkl320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.O'Neill LA, Bowie AG. The family of five: TIR-domain-containing adaptors in Toll-like receptor signalling. Nat Rev Immunol. 2007;7:353–364. doi: 10.1038/nri2079. [DOI] [PubMed] [Google Scholar]
- 16.Lee MS, Kim Y-J. Signaling pathways downstream of pattern-recognition receptors and their cross talk. Annu Rev Biochem. 2007;76:447–480. doi: 10.1146/annurev.biochem.76.060605.122847. [DOI] [PubMed] [Google Scholar]
- 17.Su AI, et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc Natl Acad Sci USA. 2004;101:6062–6067. doi: 10.1073/pnas.0400782101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Jousse H. A new contribution to the history of pastoralism in West Africa. J African Archaeology. 2004;2(2):187–204. [Google Scholar]
- 19.Gibbs RA, et al. Bovine HapMap Consortium. Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science. 2009;324:528–532. doi: 10.1126/science.1167936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sabeti PC, et al. International HapMap Consortium. Genome-wide detection and characterization of positive selection in human populations. Nature. 2007;449:913–918. doi: 10.1038/nature06250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Murray C, Huerta-Sanchez E, Casey F, Bradley DG. Cattle demographic history modelled from autosomal sequence variation. Philos Trans R Soc Lond B Biol Sci. 2010;365:2531–2539. doi: 10.1098/rstb.2010.0103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Freeman AR, Lynn DJ, Murray C, Bradley DG. Detecting the effects of selection at the population level in six bovine immune genes. BMC Genet. 2008;9:62. doi: 10.1186/1471-2156-9-62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Goodhead I, et al. A comprehensive genetic analysis of candidate genes regulating response to Trypanosoma congolense infection in mice. PLoS Negl Trop Dis. 2010;4:e880. doi: 10.1371/journal.pntd.0000880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Courtin D, et al. Host genetics in African trypanosomiasis. Infect Genet Evol. 2008;8:229–238. doi: 10.1016/j.meegid.2008.02.007. [DOI] [PubMed] [Google Scholar]
- 25.Akol GW, Murray M. Trypanosoma congolense: Susceptibility of cattle to cyclical challenge. Exp Parasitol. 1983;55:386–393. doi: 10.1016/0014-4894(83)90036-x. [DOI] [PubMed] [Google Scholar]
- 26.Li C, Wong W. DNA-Chip Analyzer (dChip) In: Parmigiani G, Garrett E, Irizarry R, Zeger S, editors. The Analysis of Gene Expression Data: Methods and Software. New York: Springer; 2003. pp. 120–141. [Google Scholar]
- 27.Hancock D, et al. maxdLoad2 and maxdBrowse: Standards-compliant tools for microarray experimental annotation, data management and dissemination. BMC Bioinformatics. 2005;6:264. doi: 10.1186/1471-2105-6-264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Liu H, et al. AffyProbeMiner: A web resource for computing or retrieving accurately redefined Affymetrix probe sets. Bioinformatics. 2007;23:2385–2390. doi: 10.1093/bioinformatics/btm360. [DOI] [PubMed] [Google Scholar]
- 29.Liu X, Milo M, Lawrence ND, Rattray M. A tractable probabilistic model for Affymetrix probe-level analysis across multiple chips. Bioinformatics. 2005;21:3637–3644. doi: 10.1093/bioinformatics/bti583. [DOI] [PubMed] [Google Scholar]
- 30.Dennis G, Jr, et al. DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol. 2003;4:3. [PubMed] [Google Scholar]
- 31.Rennie C, et al. Developments in Biologicals. Vol 132. Basel: Karger; 2008. A systematic, data-driven approach to the combined analysis of microarray and QTL data; pp. 293–299. [DOI] [PubMed] [Google Scholar]
- 32.de Boor C. A Practical Guide to Splines. Rev. Ed. New York: Springer; 2001. [Google Scholar]
- 33.Ruppert D, Wand M, Carroll R. Semiparametric Regression. Cambridge, UK: Cambridge Univ Press; 2003. [Google Scholar]
- 34.Pickrell JK, et al. Signals of recent positive selection in a worldwide sample of human populations. Genome Res. 2009;19:826–837. doi: 10.1101/gr.087577.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Fay JC, Wu CI. Hitchhiking under positive Darwinian selection. Genetics. 2000;155:1405–1413. doi: 10.1093/genetics/155.3.1405. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



