Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Jan 21;17(1):e0262051. doi: 10.1371/journal.pone.0262051

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

Stephen D Larcombe 1,¤,#, Paul Capewell 1,#, Kirsty Jensen 2, William Weir 3, Jane Kinnaird 1, Elizabeth J Glass 2, Brian R Shiels 1,*
Editor: Gordon Langsley4
PMCID: PMC8782480  PMID: 35061738

Abstract

Background

Knowledge of factors that influence the outcome of infection are crucial for determining the risk of severe disease and requires the characterisation of pathogen-host interactions that have evolved to confer variable susceptibility to infection. Cattle infected by Theileria annulata show a wide range in disease severity. Native (Bos indicus) Sahiwal cattle are tolerant to infection, whereas exotic (Bos taurus) Holstein cattle are susceptible to acute disease.

Methodology/Principal findings

We used RNA-seq to assess whether Theileria infected cell lines from Sahiwal cattle display a different transcriptome profile compared to Holstein and screened for altered expression of parasite factors that could generate differences in host cell gene expression.

Significant differences (<0.1 FDR) in the expression level of a large number (2211) of bovine genes were identified, with enrichment of genes associated with Type I IFN, cholesterol biosynthesis, oncogenesis and parasite infection. A screen for parasite factors found limited evidence for differential expression. However, the number and location of DNA motifs bound by the TashAT2 factor (TA20095) were found to differ between the genomes of B. indicus vs. B. taurus, and divergent motif patterns were identified in infection-associated genes differentially expressed between Sahiwal and Holstein infected cells.

Conclusions/Significance

We conclude that divergent pathogen-host molecular interactions that influence chromatin architecture of the infected cell are a major determinant in the generation of gene expression differences linked to disease susceptibility.

Introduction

It is well established that the outcome of infection by a pathogen can vary between individuals from asymptomatic to fatal disease. How such variability is generated is of great importance, particularly for emerging viral disease such as COVID-19 and infectious disease of livestock. Adaptation of hosts that display no or mild clinical signs upon pathogen infection is thought to have arisen via competitive co-evolution [1,2]. In this process, mechanisms deployed by the pathogen to disrupt defence mechanisms are countered by the selection of genetic traits that attenuate this disruption. Over time this can lead to establishment of a general relationship where, upon infection, immunopathology is minimised to promote host survival and pathogen transmission. Hosts that have not co-evolved with a pathogen, however, are more susceptible to severe disease pathology when infected. Thus, host breeds that are generally tolerant of infection or susceptible to acute disease provide extremes of a susceptibility spectrum that can be exploited to investigate pathogen-host interactions that give rise to variable infection outcomes.

Tropical theileriosis, caused by the tick-borne apicomplexan parasite Theileria annulata, is a disease of great economic significance to livestock producers over large areas of the Old World [3,4]. It impacts both large and smallholding production systems with losses caused by mortality, reduced productivity and costs for treatment and control. B. taurus cattle show greater susceptibility to acute tropical theileriosis than B. indicus breeds native to regions endemic for T. annulata (reviewed in [5]). Susceptibility to the disease constrains the use of B. taurus in countries such as India, where a major crossbreeding programme aims to select for productive cattle tolerant to T. annulata infection [6]. While this has mitigated losses from acute disease, crossbred animals still show more pronounced signs of subclinical infection relative to native breeds [7,8]. Thus, strategies to improve disease tolerance of productive exotic breeds are needed but require greater understanding of how differential susceptibility to disease is conferred.

Previous work has investigated mechanisms of susceptibility to tropical theileriosis. Experimental challenge confirmed that infected Sahiwal (B. indicus) calves showed reduced measures of clinical pathology, relative to susceptible Holstein (B. taurus), including lymph node enlargement and fever [9]. A reduced pro-inflammatory response was observed for the Sahiwal calves, indicating that in a tolerant host immune pathology is limited, while in a susceptible one disease is caused by parasite-mediated dysregulation of the host immune response (reviewed in [2]).

During tropical theileriosis, sporozoites inoculated by a feeding tick rapidly invade leukocytes (of predominantly myeloid origin) and differentiate into the intracellular macroschizont stage [10]. Macroschizont formation is accompanied with transformation of the infected leukocyte into a cancer like cell. Infected cells proliferate in lymph nodes and metastasise throughout the lymphoid system to establish foci in the liver and lungs: death can occur via pneumonia like condition [11,12]. Based on the kinetics of disease it has been concluded that the major mechanism of tolerance is linked to the macroschizont infected leukocyte [9]. Moreover, five independent macroschizont infected cell lines derived from Sahiwal showed a reduced ability to transverse Matrigel compared to five Holstein lines, indicating that reduced invasiveness of the infected cell is linked to tolerance of infection [13].

Transformation of the bovine leukocyte by Theileria parasites occurs via constitutive activation of bovine transcription factors that act as inflammatory mediators and promote oncogenesis (for example, NF-κB and AP1). Activation is accompanied with an extensive, irreversible reorganisation of the infected cell transcriptome [14] that alters the expression levels of many genes encoding proteins with the potential to promote oncogenesis or disrupt the immune response [15]. Thus, parasite-host interactions that activate and/or moderate inflammatory mediators in the transformed, infected cell are likely to be associated with disease susceptibility.

Only a handful of Theileria-encoded factors that translocate to the host cell compartment and modulate the infected leukocyte have been identified. TaPIN is a peptidyl-prolyl isomerase that was reported as secreted into the host cell by transforming Theileria species. It contributes to AP1 activation by destabilizing the FBW7 ubiquitin ligase that targets c-Jun, a key component of the AP1 heterodimer, for degradation [16]. Likewise, the Ta9 protein is located in the host cytoplasm and has recently been reported as an activator of AP1 [17]. Lastly members of the TashAT gene cluster are only found in transforming Theileria genomes, they are translocated to the host nucleus, possess AT hook domains homologous to those of mammalian HMGA proteins, bind DNA and modulate bovine gene expression [1820]. It is unknown whether any of these candidate factors show altered expression in infected cells from tolerant breeds, but this could significantly influence the infected cell phenotype and potentially the severity of disease it generates.

To investigate whether differences in disease tolerance are mediated by pathogen-host interactions that modulate gene expression in the Theileria-transformed leukocyte, we have compared the transcriptome profile of the available set of ex vivo infected cell lines derived from Sahiwal vs. Holstein calves [13,21]. These low passage lines, although genetically diverse, offer the best infected cell lines available for comparative RNA-seq that relates to infection in vivo, since long term passage or cloning of infected cells results in attenuation of pathogenicity and a loss of parasite diversity [13,22]. The current work builds on an earlier microarray analysis, limited to 5000 immune related genes, that demonstrated differences in expression of genes encoding factors that regulate immune cell interaction [23]. The microarray study only covered the response to early infection events (first 72 h after invasion), and only a relatively small number of differentially expressed genes were identified (150 genes), reducing the power of pathway analysis. Moreover, a comparison of the host or parasite transcriptome of the Sahiwal and Holstein cell lines was not performed in the previous studies [13,21]. The results from our analysis show that infected cell lines from cattle breeds differentially susceptible to tropical theileriosis display significant differences in expression of bovine genes associated with parasite infection, innate immunity, cholesterol biosynthesis and oncogenesis. Furthermore, a screen for DNA motifs bound by the parasite encoded factor, TashAT2 identified differences in number and pattern between the genomes of B. indicus vs. B. taurus, which included infection-associated genes differentially expressed between Sahiwal and Holstein infected cell lines. These data provide a platform to characterise parasite-host interactions that have evolved to generate the wide range of disease outcomes that occur following infection.

Methods

Parasite infected cell lines and culture conditions

We used eleven infected cell lines, previously established ex vivo from the peripheral blood of cattle experimentally challenged with T. annulata. Six of these were derived from infected Sahiwal calves (B. indicus) and five were derived from infected Holstein calves (B. taurus), as reported [13,21]. Both Sahiwal and Holstein animals were infected with sporozoites from the same parasite stock, T. annulata Hissar and cultured in RPMI-1640 medium (Sigma-Aldrich) supplemented with 10% FCS, 4 mM L-glutamine and 50 μM β-mercaptoethanol at 37°C for a limited period (less than 10 passages). All cell lines were fully established based on the level of infection (>95% macroschizont infected cells). Clones were not generated in the original or current studies in order to retain the heterogeneity that naturally occurs upon infection in vivo, and because cloning and prolonged passage result in an attenuation of infected cell pathogenicity [13,22]. Thus, RNA was isolated from low passage cell lines, between passages 4–6, for Sahiwal and between passages 6–8, for Holstein. Cells were harvested by centrifugation (120 g for 10 min) after 48 h of culture. RNA was isolated using Trizol (Thermo Fisher Scientific) followed by purification using the RNeasy mini kit (Qiagen), including an on-column DNAse digestion step, and using the suppliers’ protocols. RNA quality and quantity were then assessed using a Nanodrop ND-1000 spectrophotometer (Thermo Fisher Scientific) and by gel electrophoresis.

RNA-seq

Between 160 and 260 μg DNAse-treated RNA was sequenced for each sample at the Centre for Genomic Research (CGR), Liverpool. Dual-indexed strand-specific RNA-seq libraries were made from total RNA, using NEBNext Poly(A) mRNA Isolation and Ultra Directional RNA Library Preparation kits (New England Biolabs). Sequencing was performed using one lane of Illumina HiSeq 4000 (Paired-end, 2×150 bp sequencing, generating data from >280 M clusters per lane). Following sequencing, paired end reads were QCed and prepared for analysis at CGR: FASTQ files were trimmed for Illumina adapter sequences using Cutadapt version 1.2.1 [24] using option -O, so the 3’ end of any reads which matched the adapter sequence for 3 bp or more were trimmed. The reads were further trimmed using Sickle version 1.200 [25], with a minimum window quality score of 20. Reads shorter than 20 bp after trimming were removed. Approximately 40 million paired end reads were generated from each sample.

Read mapping, differential expression analyses and pathway analyses

The trimmed FASTQ files for each sample were mapped to the B. taurus transcriptome: v UMD3.1 downloaded from Ensembl Biomart [26]. This strategy was used because of the greater fidelity of the B. taurus reference genome sequence relative to B. indicus, and on the premise that the two subspecies share a high level of sequence identity in mRNA coding regions of the genome. Bowtie2 [27] was used to map reads and generate SAM files, and the Python script “SAM2counts” [28] to generate reference sequence counts for each transcript. We collapsed reads mapping to multiple transcripts to the same Ensembl gene ID prior to analysis. Differential Expression analyses were performed on the count files using DESeq2 [29], to compare the five B. taurus samples to the six B. indicus samples. The B. taurus data values were used as the primary comparator, such that negative fold changes in gene expression are designated as lower expression in B. taurus vs. B. indicus, and positive fold changes are designated as higher in B. taurus vs. B. indicus. Genes with a false discovery rate (FDR) less than 0.1 (10%) were considered significantly differentially expressed between B. indicus relative to B. taurus infected cells lines. Following differential expression analyses, we used Ensembl Biomart to annotate the read files and Ingenuity Pathway Analysis (IPA) (Ingenuity Systems Inc., Redwood City, USA) to elucidate significantly modulated gene networks, disease pathways and potential effector molecules. We also mapped the RNA-seq reads to the T. annulata genome [30], to assess whether parasite encoded genes are differentially expressed between the established macroschizont infected B. taurus vs. B. indicus cell lines.

Comparison with TBL20 infection associated dataset

To elucidate mechanisms that might underpin differences in breed susceptibility to T. annulata infection, we matched our data set of breed-associated differentially expressed genes with a data set of host cell genes whose differential expression was identified as associated with infection by T. annulata. The infection-associated data set was obtained by comparison of the transcriptome (obtained by microarray analysis) of uninfected (BL20) vs. macroschizont infected (TBL20) bovine lymphosarcoma cell lines, treated and not treated with the anti-Theileria drug buparvaquone. Full details can be found in [14,15]. As the two datasets used different systematic ID codes (Ensembl vs. Entrezgene), we first used Ensembl Biomart to filter the microarray BL20/TBL20 dataset by finding common Gene stable IDs. Since some genes were not found in this way we also used DAVID [31] and PANTHER [32] to generate alternative gene symbols to screen for overlap between gene lists independent of common gene IDs. To check for statistical significance, the representation factor was calculated as the number of genes in common ÷ expected number of genes; where the expected number of genes is given by: (number of genes in group 1 × number of genes in group 2) ÷ number of genes in total. In our case we used 19,018 as the total number of genes as this is the number of unique identifiable bovine genes included on the microarray. The p value was then computed as a normal approximation of the exact hypergeometric probability [33].

Identification of infected host cell genes encoding predicted secreted or receptor proteins

Ensembl Biomart was used to generate the peptide sequences for all bovine genes classed as differentially expressed. A pipeline of publicly available resources was then used to perform in silico predictions. Signal P version 5.0 [34] generated a list of proteins with a predicted signal peptide (cutoff of >0.5 probability). Sequences for this new list were then screened for the presence of transmembrane domains or GPI anchors. This was performed using TMHMMserver v 2.0 [35] for transmembrane domains, and PredGPI [36] for GPI anchors. Any predicted proteins that scored positive for either motif was excluded from the list. The final list of proteins was obtained by screening against subcellular location prediction and gene ontology (GO) data available in the Genecards database version 4.14 [37]. Only proteins with a top confidence score as extracellular were retained in the final list. To generate a list of receptor proteins, we used gene ontology (GO) software to look for evidence of annotation as a receptor, or receptor-like domains in our data sets of differentially expressed genes. Thus, lists of annotations, protein family and classes were generated using Ensembl, DAVID, PANTHER and IPA. The data from each resource was collated, and we excluded any molecule that was not a confirmed or predicted receptor. Following receptor classification, the resulting gene list was curated by eye and nuclear receptors excluded, as we were primary interested in a cell surface location.

Validation of RNA-seq results by qRT-PCR

Validation of differential gene expression was carried out by qRT PCR, using RNA from the eleven T. annulata cell lines isolated on a different day than the samples used for RNA-seq. Primer pairs specific for 20 genes were designed (S1 Table) and qRT-PCR performed, as described previously [15]. Briefly, 500 ng of total RNA from each sample was used to synthesise cDNA, using the Affinity Script cDNA Synthesis Kit (Agilent Technologies) and Oligo-dT as primer. 1 μl cDNA for each sample was then used for qRT-PCR, using the Brilliant III Ultra-fast SYBR®Green qPCR Master mix (Agilent technologies) and the Stratagene Mx3005P system. Comparative quantitative analysis of gene expression across samples was performed using Stratagene MxPro Software. REPS1 (RALBP1 associated Eps domain containing 1) was utilised as a house keeping gene, based on previous identification of constitutive expression in Mycobacterium bovis infected macrophage cells [38] and initial validation using RNA from the eleven T. annulata infected cell lines. The ΔΔCT method was then used to calculate fold-change (vs. the lowest expressed sample). The data were transformed on the log10 scale to stabilize the variance and the mean group differences (Sahiwal vs. Holstein) were tested using Student’s t tests.

Bioinformatic screen for DNA motifs bound by TashAT2

A consensus of AT rich motifs preferentially bound by the AT hook domain of recombinant TashAT2 was generated using previously published data [19]. This was used to generate a PROSITE pattern (TAAAT(1)N(4,6)T(1)A(3,4)T.) to search the B. taurus (UCD1.2) and B. indicus (Bos_indicus_1.0) genomes using FUZZNUC [39]. Bedtools [40] was used to find the intersections of identified motifs and introns and exons in the bovine genomes. Proximal upstream and downstream sequences in validated mRNA sequences not characterized as exons or introns were defined as untranslated regions (UTRs) and the intersection of motifs assessed. Sites without intersection were characterised as non-coding. Pathway enrichment of genes with TashAT2 binding motifs was assessed using PANTHER [32] and summary figures generated using CIRCOS [41].

Immunoblotting and immunofluorescence

Three of the independent Sahiwal infected cell lines (SA-C) and three Holstein lines (HA-C) were randomly selected, passaged 1X, cultured for 48 h and harvested, as described above. Cells were then washed 3X in phosphate buffered saline (PBS), resuspended in 20 μL of PBS per 106 cells and an equal volume of 2X SDS-sample buffer added. Immunoblotting was performed using standard methodology, as described [42]. Antisera raised against the TA06460 fusion protein (the endoplasmic reticulum form of HSP90 (ER-HSP90)) was used at a dilution of 1:1500, as previously described [42]. For the Ta9 (TA15705) antigen, rat antisera were raised against a 35.7kDa fusion protein [43]. Antiserum production was carried out by the PTU/BS Unit at the Scottish National Blood Transfusion Service, Pentlands Science Park Edinburgh. The anti-Ta9 serum was used at a dilution of 1 in 1200.

Slide preparation, fixation and immunofluorescence were performed as described [15]. The cell lines used were the same as for immunoblotting and the EL24 antiserum specific for TashAT2 [44] was used at 1 in 250. Images were acquired with an Olympus BX60 microscope, SPOT camera and SPOTTM Advanced image software Version Mac: 4.6.1.26, using the matched exposure feature (Diagnostic Instruments).

Ethics statement

This study did not require ethical approval since it was conducted on in vitro cell lines established ex vivo from animals in a previous study where ethical approval was met (details available in [23]) and no further human or animal participation was needed.

Results

Early passage Holstein and Sahiwal infected cell lines display significant differences in their host transcriptome profile

Six Sahiwal and five Holstein lines infected with T. annulata (Hissar) were utilised for differential transcriptome analysis. As reported previously [9], under standard in vitro culture conditions, no significant difference in growth potential between the Sahiwal and Holstein lines was observed. RNA was isolated from each line and RNA-seq performed: the RNA-seq data are publicly available at the NCBI GEO online repository under accession numbers GSM4824553GSM4824563. The results obtained indicate pronounced differences in host gene expression levels between T. annulata infected cells derived from Holstein or Sahiwal cattle. These differences were marked in terms of the large number of genes differentially expressed: 2211 genes (without duplicates) at FDR < 0.1 (adjusted), subsequently referred to as Holstein/Sahiwal differentially expressed (DE) dataset (H/S-DE). The full list of H/S-DE genes can be viewed in S1 File. Of the 2211 DE genes there was an almost even split in genes displaying modulated expression associated with one breed relative to the other; thus, 1068 genes (48.3%) were expressed at lower levels in the Holstein infected lines compared to Sahiwal. Furthermore, analysis of the proportion of reads mapped to the B. taurus genome showed no statistical difference between the 6 Sahiwal relative to the 5 Holstein RNA-seq sample sets (S2 Table). These results support the premise that sequence identity between mRNA coding regions of the two genomes allow comparative analysis of gene expression using the B. taurus genome as the reference.

A demonstration that the H/S-DE data set represents differences in gene expression profiles between infected cells from Holstein vs. Sahiwal was provided by principal component analysis (PCA). Clustering of the samples with host type can clearly be seen in the plot shown in Fig 1, where PC1 explained 35% of variation in the data, although a sizeable degree of variability within each breed, that was more pronounced for the Holstein samples, was also detected (PC2 displaying 25% variance).

Fig 1. Clustering of samples by host breed.

Fig 1

A PCA plot showing clustering based on the H/S-DE data set derived from five samples of Holstein (blue) or six samples of Sahiwal (red) cell lines infected with the T. annulata macroschizont.

Indicative of the clear separation between the two breeds, a substantial proportion of genes in the data set showed a consistent difference between the breed types, with 439 of the 500 most statistically significant H/S-DE genes displaying differential expression across all Sahiwal vs. Holstein samples. Specific examples include the IFN stimulated gene (ISG) MX1 expressed at lower levels in all Holstein samples relative to Sahiwal cells (-4.62, log2); whereas the FAT1 cadherin encoding gene, associated with TGFB signalling, was consistently expressed at a higher level for Holstein cell lines (7.49, log2; S1 File).

Not unexpectedly, from the PCA, a sizeable number of genes indicated as differentially expressed between the breeds did not display a difference that was consistent across all six Sahiwal vs. five Holstein samples. A good example being the gene encoding pro-inflammatory cytokine IL-6, designated as significantly higher in Holstein (1.34 log2), where two Holstein samples showed lower expression values than the two highest values recorded for the Sahiwal cell lines (S1 File). Thus, genes of this type are indicated as generally differentially expressed between the two breeds but show a more substantial level of variable expression within a breed, and as reported previously [21] this appeared to be more marked for the Holstein sample set.

Table 1 shows the top 19 genes displaying greatest higher or lower expression (fold change log2) in Holstein (B. taurus) relative to Sahiwal (B. indicus) infected leukocytes. A number of these genes possess gene ontology (GO) annotation indicating they could play a role in determining phenotypic differences between Holstein and Sahiwal macroschizont infected cells. Thus, for genes with higher relative levels of expression in Holstein infected cells: Slit Guidance Ligand 2 (SLIT2), functions in cancer and leukocyte chemotaxis/infiltration [45,46]; Ubiquitin D (UBD/FAT10), operates in protein modification/degradation and activation of pro-inflammatory mediators (reviewed in [47]): GULP1, is involved in phagocytosis [48] and associated with emphysema [49], a clinical symptom of acute theileriosis [50]. For the genes designated as expressed lower in Holstein infected cells relative to Sahiwal, those encoding proteins associated with a tumour suppressor/inhibitor of metastasis function (e.g. ANXA8; PAX6; COL4A1 and ADAMTS18) or associated with the innate immune/ISG response were of particular interest. Of the 68 IFN type I response genes [51] identified in the H/S-DE data set, 56 displayed lower expression in Holstein derived infected cells relative to Sahiwal (S1 File).

Table 1. Top 19 (by Log2 fold change) down- and up-regulated genes in B. taurus vs. B. indicus T. annulata infected cell lines.

Gene log2 FC lfcSE Padj Gene symbol Gene description
ENSBTAG00000021565 -26.131 3.038 <0.001 PRSS2 serine protease 2
ENSBTAG00000002335 -11.108 1.021 <0.001 HAND1 heart and neural crest derivatives expressed 1
ENSBTAG00000012849 -9.151 1.622 <0.001 COL4A1 collagen type IV alpha 1 chain
ENSBTAG00000009842 -8.637 1.417 <0.001 CRYM crystallin mu
ENSBTAG00000031355 -8.014 0.780 <0.001 LOC529196 C-C chemokine receptor type 1-like
ENSBTAG00000046377 -7.793 0.992 <0.001 LOC100336807  
ENSBTAG00000001010 -7.450 1.462 <0.001 ADAMTS18 ADAM metallopeptidase with thrombospondin type 1 motif 18
ENSBTAG00000046412 -7.448 1.056 <0.001 LOC512617 acyl-CoA dehydrogenase family member 10
ENSBTAG00000004561 -6.888 1.114 <0.001 PAX6 paired box 6
ENSBTAG00000038064 -6.865 0.457 <0.001 LOC614531 ras-related GTP-binding protein A
ENSBTAG00000005857 -6.751 1.312 <0.001 SLC6A1 solute carrier family 6 member 1
ENSBTAG00000027782 -6.529 0.998 <0.001 OR5K1 olfactory receptor, family 5, subfamily K, member 1
ENSBTAG00000008129 -6.455 1.668 0.002 CLSTN3 calsyntenin 3
ENSBTAG00000047616 -6.311 1.719 0.005 ZNF114 zinc finger protein 114
ENSBTAG00000018499 -5.923 1.758 0.011 ANXA8L1 annexin A8-like 1
ENSBTAG00000035110 -5.918 2.049 0.038 DEUP1 deuterosome assembly protein 1
ENSBTAG00000019428 -5.765 1.473 0.002 CCR1 chemokine (C-C motif) receptor 1
ENSBTAG00000018367 -5.733 1.347 <0.001 CD6 CD6 molecule
ENSBTAG00000014628 -5.624 0.852 <0.001 OAS2 2’-5’-oligoadenylate synthetase 2
ENSBTAG00000018694 8.087 1.629 <0.001 ACSS3 acyl-CoA synthetase short chain family member 3
ENSBTAG00000044185 7.729 3.016 0.075 SOX6 SRY-box 6
ENSBTAG00000005108 7.513 1.576 <0.001 SLIT2 slit guidance ligand 2
ENSBTAG00000025803 7.506 1.809 <0.001 INSYN1 inhibitory synaptic factor 1
ENSBTAG00000020657 7.490 0.865 <0.001 FAT1 FAT atypical cadherin 1
ENSBTAG00000003345 7.149 1.769 0.001 FAT4 FAT atypical cadherin 4
ENSBTAG00000037649 7.111 1.212 <0.001 VIPR2 vasoactive intestinal peptide receptor 2
ENSBTAG00000015905 7.013 1.136 <0.001 ARHGAP32 Rho GTPase activating protein 32
ENSBTAG00000025398 6.927 1.707 0.001 LOC504548 ubiquitin D
ENSBTAG00000004510 6.777 1.310 <0.001 SARDH sarcosine dehydrogenase
ENSBTAG00000047336 6.726 2.294 0.034 LOC100847115 thyrotropin-releasing hormone receptor-like
ENSBTAG00000031802 6.597 1.335 <0.001 SPATA16 spermatogenesis associated 16
ENSBTAG00000014354 6.519 1.246 <0.001 FXYD6 FXYD domain containing ion transport regulator 6
ENSBTAG00000019340 6.436 1.438 <0.001 PCDH9 protocadherin 9
ENSBTAG00000021526 6.296 1.994 0.020 RPRM reprimo, TP53 dependent G2 arrest mediator homolog
ENSBTAG00000027246 6.247 1.388 <0.001 UBD ubiquitin D
ENSBTAG00000020385 6.216 1.408 <0.001 ALDH1B1 aldehyde dehydrogenase 1 family member B1
ENSBTAG00000016525 6.179 1.898 0.015 ITGA1 integrin subunit alpha 1
ENSBTAG00000007141 6.159 1.438 <0.001 GULP1 GULP PTB domain containing engulfment adaptor 1

IPA reveals breed-associated modulation of gene expression linked to innate immunity, cholesterol biosynthesis and oncogenesis

Differential susceptibility to theileriosis could arise via pre-existing breed associated differences in gene expression between infected (and uninfected) cells or only become manifest after infection of the bovine leukocyte. Therefore, we applied Ingenuity Pathway Analysis (IPA) on both the full data set of 2211 H/S-DE genes and a more limited data set of Infection Associated (IA) genes. To identify breed associated differences in gene expression that are induced by infection, we overlapped the H/S-DE data set with a set of bovine genes demonstrated to display altered expression in T. annulata, macroschizont infected B. taurus derived lymphosarcoma (TBL20) cells relative to uninfected (BL20) cells [14]. Of the 2211 H/S-DE genes, 517 overlapped and were indicated as the infection associated Holstein/Sahiwal data set (IA-H/S): see S2 File for the full data set. The representation factor for the 517 genes was 1.4 and the p value was < 4.2 e-15; indicating statistical significance for the obtained overlap (Fig 2). Strikingly the most significant canonical pathways identified by IPA for both the full (S3 File) and infection associated data set (S4 File) were virtually identical, with involvement in innate immunity, cholesterol biosynthesis and oncogenesis highlighted.

Fig 2. Overlap of genes modulated between Holstein and Sahiwal breeds of infected cell and Theileria infection of BL20 cells.

Fig 2

(A) Venn diagram of overlapping genes, (B) the top IPA canonical pathways and (C) the modulation of infected-associated genes found in enriched pathways linked to cholesterol biosynthesis or the IFN response.

Innate immunity pathways

Enriched pathways linked to innate immunity in the infection-associated data set were “Activation of Interferon by Cytosolic Pattern Recognition Receptors” and “Interferon signalling” and these pathways were also highly enriched by the analysis of the full H/S-DE data set. Other related significantly enriched pathways included “Role of RIG1 Like Receptors in Antiviral Innate Immunity” and, “NF-κB Activation by Viruses” (full data set only).

IPA indicates the activation state of canonical pathways via the calculated Z-score, a negative Z score predicting lower activation potential, while a positive score indicates higher potential. For pathways linked to innate immunity highlighted above, all except NF-κB showed lower activation status in Holstein relative to Sahiwal infected cells (Z score range > -1.3 to -2.8). This finding is illustrated for “Activation of Interferon by Cytosolic Pattern Recognition Receptors” (Fig 3) and “Interferon Signalling Pathways” (Fig 4) generated by IPA for the H/S-DE data set. As would be expected, these two pathways overlap, with lower transcript levels in Holstein infected cells for STAT1, STAT2, IRF9 and the ISG targets of their transcription factor complex. Furthermore, all identified infection-associated ISG genes (MX1, MX2, OAS1Y/Z, OAS2, ISG15, IFI6, IFI44 and RSAD2) showed lower expression values in Holstein infected cells relative to Sahiwal (Fig 2; S2 File).

Fig 3. IPA canonical pathway, “Activation of IRF by Cytosolic Pattern Recognition Receptors”.

Fig 3

The nodes represent genes/molecules/complexes in a pathway, and the lines and arrows between nodes indicate known relationships from the Ingenuity Knowledge Base. Nodes with purple outline indicate molecules that were measured as differentially expressed in our dataset, with the intensity of coloured infill indicating the level of up (red) or down (green) regulation of Holstein relative to Sahiwal. The blue- and orange-coloured molecules and lines are predicted activation states generated by the Molecular Activity Predictor function in Ingenuity Pathway Analysis (IPA). Blue colour indicates a predicted inhibition, and orange a predicted activation state in Holstein relative to Sahiwal. Yellow lines indicate relationships where our findings are inconsistent with the state of the downstream molecule. Broad lines with explanatory text beside the pathway indicate the cellular location of molecules in the pathway. The molecules in the pathway are given shapes that indicate their functional class (Nested Circle/Square = Group/Complex, Horizontal ellipse = Transcriptional Regulator, Vertical Ellipse = transmembrane receptor, Vertical Rhombus = enzyme, Square = Cytokine/Growth Factor, Triangle = Kinase, Vertical Ellipse = Transmembrane Receptor, Circle = other). The edges between molecules are also differentiated to indicate the type of relationship between them. Solid lines are direct relationships and dashed lines are indirect.

Fig 4. IPA canonical pathway, “Interferon signalling”.

Fig 4

Nodes, colours, shapes and lines in pathway are as described in legend to Fig 3.

IPA for disease or molecular function also predicted lower innate immune function for Holstein infected cells. As shown in Table 2, the three most significant disease or functions identified for the full H/S-DE data set that are predicted to be lower in Holstein compared to Sahiwal infected cells were “Antiviral response”; “Immune response of cells” and “Antimicrobial response”. Furthermore, the IPA function that predicts activation or inhibition of upstream regulators identified inhibition of cytokines, transcription factors and compounds associated with the interferon response in Holstein relative to Sahiwal infected cells (negative Z scores >2). Very similar results were obtained for both the full and IA data sets (Table 3 & S3 Table).

Table 2. IPA for disease pathways and functions enriched in data set of differentially expressed genes between Holstein and Sahiwal T. annulata infected cell lines.

Category Diseases or Functions Annotation p-value Predicted Activation State Activation z-score # Molecules
Antimicrobial Response, Inflammatory Response Antiviral response <0.001 Decreased -3.341 48
Inflammatory Response Immune response of cells <0.001 Decreased -2.725 84
Antimicrobial Response, Inflammatory Response Antimicrobial response <0.001 Decreased -3.107 56
Neurological Disease Progressive neurological disorder <0.001 Decreased -2.157 133
Cancer, Organismal Injury and Abnormalities, Renal and Urological Disease Urinary tract cancer <0.001 Decreased -2.236 240
Cancer, Organismal Injury and Abnormalities, Renal and Urological Disease Renal cancer <0.001 Decreased -2.236 150
Cell Death and Survival Cell death of lymphoma cell lines <0.001 Decreased -3.104 36
Lipid Metabolism, Small Molecule Biochemistry, Vitamin and Mineral Metabolism Synthesis of cholesterol <0.001 Decreased -2 18
Cell Death and Survival Apoptosis of lymphoma cell lines <0.001 Decreased -2.81 29
Cell Death and Survival Cell death of macrophage cancer cell lines <0.001 Decreased -2.735 11
Cell Death and Survival Cell death of connective tissue cells <0.001 Decreased -2.034 75
Cell-To-Cell Signalling and Interaction, Inflammatory Response Response of phagocytes 0.002 Decreased -2.498 32
Cell-To-Cell Signalling and Interaction, Embryonic Development Response of embryonic cell lines 0.002 Decreased -3.111 12
Cell-To-Cell Signalling and Interaction Response of myeloid cells 0.003 Decreased -2.466 31
Cell-To-Cell Signalling and Interaction, Inflammatory Response Immune response of phagocytes 0.003 Decreased -2.787 29
Embryonic Development, Organismal Development Development of body trunk <0.001 Increased 3.647 153
Infectious Diseases Viral Infection <0.001 Increased 3.15 220
Infectious Diseases Infection by RNA virus <0.001 Increased 3.075 114
Cancer, Organismal Injury and Abnormalities Development of malignant tumour <0.001 Increased 2.984 786
Infectious Diseases Infection of Mammalia 0.001 Increased 2.978 42
Cellular Movement Cell movement of melanoma cell lines 0.002 Increased 2.715 21
Infectious Diseases Replication of Herpesviridae <0.001 Increased 2.449 15
Cancer, Organismal Injury and Abnormalities Incidence of tumour <0.001 Increased 2.428 815
Infectious Diseases Infection of cells <0.001 Increased 2.331 97
Cancer, Organismal Injury and Abnormalities Malignant solid tumour <0.001 Increased 2.277 1292
Infectious Diseases Replication of Murine herpesvirus 4 0.003 Increased 2.236 5
Cardiovascular System Development and Function, Embryonic Development, Organ Development, Organismal Development, Tissue Development Cardiogenesis <0.001 Increased 2.198 79
Cancer, Organismal Injury and Abnormalities, Reproductive System Disease Tumorigenesis of reproductive tract <0.001 Increased 2.19 459
Cancer, Organismal Injury and Abnormalities, Reproductive System Disease Female genital neoplasm <0.001 Increased 2.19 459
Cancer, Endocrine System Disorders, Organismal Injury and Abnormalities, Reproductive System Disease Ovarian tumour <0.001 Increased 2.19 205
Cancer, Endocrine System Disorders, Organismal Injury and Abnormalities Endocrine gland tumour <0.001 Increased 2.183 1024
Cancer, Gastrointestinal Disease, Organismal Injury and Abnormalities Digestive organ tumour <0.001 Increased 2.161 1130
Cancer, Organismal Injury and Abnormalities Frequency of tumour <0.001 Increased 2.152 801
Cancer, Haematological Disease, Organismal Injury and Abnormalities Hematologic cancer 0.002 Increased 2.088 329
Cancer, Gastrointestinal Disease, Hepatic System Disease, Organismal Injury and Abnormalities Liver tumour <0.001 Increased 2.017 556

Table 3. Top activated or repressed upstream regulators predicted by IPA from gene targets within the H/S-DE data set.

Upstream Regulator Molecule Type Predicted Activation State Activation z-score p-value of overlap Genes in dataset (Number of regulators from data in network)
IFNA2 cytokine Inhibited -5.284 <0.001 108 (11)
IRF7 transcription regulator Inhibited -5.147 <0.001 137 (13)
Interferon alpha group Inhibited -5.024 <0.001 180 (13)
IRF3 transcription regulator Inhibited -4.863 <0.001 120 (12)
PRL cytokine Inhibited -4.788 <0.001 241 (10)
poly rI:rC-RNA biologic drug Inhibited -4.586 <0.001 292 (13)
IFNL1 cytokine Inhibited -4.577 <0.001 374 (15)
IFN Beta group Inhibited -4.506 <0.001 233 (11)
STAT1 transcription regulator Inhibited -4.412 <0.001 318 (12)
Ifnar group Inhibited -4.245 <0.001 181 (14)
IFNG cytokine Inhibited -4.078 <0.001 367 (14)
IRF1 transcription regulator Inhibited -3.994 <0.001 317 (13)
CpG ODN 2006 chemical reagent Inhibited -3.846 <0.001 244 (11)
IFNB1 cytokine Inhibited -3.806 <0.001 286 (13)
IRF5 transcription regulator Inhibited -3.763 <0.001 357 (17)
IFNA1/IFNA13 cytokine Inhibited -3.538 <0.001 103 (10)
TGM2 enzyme Inhibited -3.516 <0.001
SREBF2 transcription regulator Inhibited -3.482 <0.001 250 (5)
TRIM24 transcription regulator Activated 4.884 <0.001 68 (6)
MAPK1 kinase Activated 4.26 <0.001 106 (8)
PNPT1 enzyme Activated 4.123 <0.001 260 (7)
NKX2-3 transcription regulator Activated 3.788 <0.001
INSIG1 other Activated 3.628 <0.001 190 (5)
ACKR2 G-protein coupled receptor Activated 3.606 <0.001 233 (5)
PTGER4 G-protein coupled receptor Activated 3.286 <0.001 363 (10)
MYC transcription regulator Activated 3.202 <0.001 231 (2)
SIRT1 transcription regulator Activated 3.184 0.0582
SOCS1 other Activated 3.18 0.015
MFSD2A transporter Activated 3.148 <0.001
IL1RN cytokine Activated 3.048 <0.001 362 (11)
LEPR transmembrane receptor Activated 3 0.219
Mek group Activated 2.992 0.475
MMP3 peptidase Activated 2.828 0.351
IRF8 transcription regulator Activated 2.819 0.016
JQ1 chemical reagent Activated 2.779 0.055
USP18 peptidase Activated 2.772 <0.001 72 (9)
POR enzyme Activated 2.729 <0.001
IKZF3 transcription regulator Activated 2.72 0.005
HMGA1 transcription regulator Activated 2.718 0.337

Cholesterol biosynthesis

The top three scoring IPA canonical pathways for the infection-associated data set of 517 genes (Fig 2, S4 File) were all linked to cholesterol biosynthesis, with negative Z scores or reduced relative expression values for pathway genes in Holstein infected cells.

Similar results were obtained using the full H/S-DE data set with “Superpathway of Cholesterol Biosynthesis” ranked as the second top canonical pathway and the highest negative Z score recorded (S3 File). Inhibition of the SREBF2 transcription factor, an important positive regulator of genes involved in cholesterol biosynthesis [52], was also predicted (Z score -3.48, P value 6.66 E-06), and relative SREBF2 expression levels were lower for Holstein (B. taurus) infected cells (S1 File). Elevated expression of genes involved in cholesterol biosynthesis is associated with oncogenesis/metastasis via the PI3K/AKT pathway [53,54], which is activated in Theileria infected cells [55] and enriched by IPA in both the full H/S-DE and IA data sets (S3 and S4 Files). Therefore, prediction of elevated PI3K/AKT signalling together with lower expression of cholesterol biosynthesis pathway genes in Holstein infected cells was unexpected. However, in other studies decreased expression of cholesterol biosynthesis genes has been associated with activation of Ras/Erk signalling and a more malignant cancer cell phenotype [56].

Oncogenesis

IPA showed a clear association of the full Holstein vs. Sahiwal data set of DE genes with oncogenesis/neoplasia. Thus, the canonical pathways “Retinoic Acid Mediated Apoptosis Signalling”, “Death Receptor Signalling” and “Induction of Apoptosis by HIV1” showed negative Z scores of -2.8, -2.3 and -1, respectively (S3 File): predicting lower capacity for apoptosis/cell death in Holstein infected cells. In contrast, “FAT10 (UBD) Cancer Signalling Pathway” (Z score +2.6), “Small Cell Lung Carcinoma” (Z score +1.89), “Non-Small Cell Lung Cancer Signaling” (Z score +2.36), “PI3K Signalling in B lymphocytes” (Z score + 0.8) and “Wnt/Ca+ Pathway” (Z score +0.7) predicted elevated pathway activation for Holstein infected cells. “FAT10 cancer signalling” with the higher expression of UBD (FAT10) predicted to promote carcinogenesis via NF-κB and TGFB is illustrated (S1 Fig). In addition, four of the top fifteen entries with a negative Z score in the disease processes or molecular function analysis (Table 2) were placed in “Cell Death and Survival”, while the majority (twelve) of the top twenty activated (positive Z score) disease processes/functions were in “Cancer” or “Cellular movement”.

Canonical pathways associated with oncogenesis displaying a positive Z score were not identified in analysis of the infection associated-H/S data set. However, infection associated genes known to function in oncogenesis were identified as displaying higher expression in Holstein infected cells compared to Sahiwal. These genes include UBD (FAT10) and SLIT2 (see Table 1 and S1 Fig). PRUNE2 (B-cell CLL/lymphoma 2 and adenovirus E1B 19 kDa interacting family) and CLU, encoding Clusterin a secreted chaperone, were also identified and have been shown to operate in cell death, cell transformation, tumour progression and neurodegenerative disorders [5759].

Taken together the results of the IPA analysis indicate for Holstein infected cells relative to Sahiwal a higher oncogenic potential, but lower innate immunity and cholesterol biosynthesis potential. And for all three of these processes, a clear link to genes whose expression is altered by parasite infection of leukocytes was established

Differential expression of genes encoding predicted secreted/receptor proteins

In our dataset of H/S-DE genes, we found 83 predicted to encode secreted/extracellular proteins, with 28 of these identified as infection associated (S5 File). The most significantly elevated of these genes was SLIT2. Genes encoding CLU, SPARC and PLAU, were also higher more than 1fold, log2 in Holstein cells and modulated by parasite infection. SPARC (cysteine rich acidic matrix associated protein) promotes both metastasis [60] and degradation of p53 [61], which also occur in Theileria infected leukocytes [13,62]. PLAU encodes a secreted urokinase associated with cancer progression/metastasis via upregulation by the AT hook factor, HMGA1 [63]. Infection associated genes encoding secreted proteins that showed lower relative expression > 1fold, log2 in Holstein cells were ADMATS18, C1QTNF6 (complement C1q tumour necrosis related protein: anti-inflammatory), SPOCK2 (ERK signalling pathway, pro-cancer), TNFSF10 (cytokine that induces apoptosis in transformed cells) and CCL5 (chemokine involved in inflammation and immuno-regulation; chemo-attractant for blood monocytes and memory T helper cells). A number of non-infection associated genes that encode secreted proteins with potential to influence the phenotype of the infected cell and/or cells of the host immune system were also identified (e.g. IL9, CXCL2, CLEC3B and ADAMTS19). Therefore, differential stimulation of and interaction with cells of the haematopoietic system by factors secreted by Holstein relative to Sahiwal infected cells is likely.

Numerous genes encoding cellular receptors were also present in the H/S DE data set (see S6 File). Infection associated receptor encoding genes include CCR1 (encoding a receptor for CCL5), CRLF2 (cytokine receptor like factor 2), IL10RA, TLR2, CD2, CD48 and IL7R. Non-infection associated genes encoding receptors included ROBO2; encoding the receptor for the secreted SLIT2 factor (both receptor and factor expressed higher in infected Holstein cells), TLR7, CD4, TRAF1 (TNF receptor associated factor 1), IL27RA, IL17RD and IRAK1 (Interleukin-1 receptor-associated kinase 1). Therefore, Holstein infected cells are predicted to possess the potential to react differently than Sahiwal, in response to a large array of receptor ligands, including chemokines, cytokines and growth factors.

qRT-PCR validation of differential expression of candidate genes

To confirm the validity of the RNA-seq data, qRT-PCR analysis was performed. Seventeen genes were selected based on their representative RNA-seq profiles across the H/S-DE data set. The results are summarized in Table 4 and shown for a subset of the genes in Fig 5. SLIT2 and PRUNE2 represent infection-associated genes where expression was assessed as higher by RNA-seq in Holstein relative to Sahiwal infected cells. qRT-PCR confirmed this profile (Table 4), with statistically significant higher mRNA levels for both genes, with on average 62.1- and 171.6-fold higher values for SLIT2 and PRUNE2 from Holstein infected cell lines, respectively. IFI44, ISG15, MX1, RSAD2, SAMD9 and TNFSF10 were chosen as they are infection-associated genes that show lower expression levels in Holstein infected cells compared to Sahiwal cells. In all six cases the qRT-PCR supported this profile with a statistically significant lower mean expression level in Holstein cells of 8.9-fold (IFI44), 22.2-fold (ISG15), 17.5-fold (MX1), 6.1-fold (RSAD2), 5.9-fold (SAMD9) and 3.4-fold (TNFSF10), respectively (Table 4, Fig 5A & 5B). ADAMTS20, NECTIN3 and WLS were selected to represent non-infection associated genes with higher expression in Holstein cells. The qRT-PCR analysis confirmed this profile with statistically significant higher relative expression of 2.0 fold (ADAMTS20), 9.0-fold (NECTIN3) and 5.4-fold (WLS) (Table 4, Fig 5C & 5D) in Holstein cells compared to Sahiwal cells. The reciprocal profile was also validated with HERC5, IFI6, CGAS (MB21D1) and OAS1Y. Statistically significant lower expression of 3.3 -fold (HERC5), 6.0-fold (IFI6), 2.2-fold (CGAS) and 10.5-fold (OAS1Y) was found in Holstein infected cells compared to Sahiwal cells (Table 4). Thus, for the majority of genes in the RNA-seq data set, the trend of expression predicted is likely to be accurate. Two further genes encoding the pro-inflammatory cytokines, IL6 and IL23A, designated as significantly higher in the H/S DE data set but which showed non-consistent differential expression across the eleven samples, were tested. The results indicated that although the mean expression values were 1.6-fold (IL6) and 3.6-fold (IL23A) higher for Holstein infected cells relative to Sahiwal, variability within the data precluded validation of a significant difference (Table 4, Fig 5E & 5F).

Table 4. qRT-PCR validation of differential expression of a subset of H/S-DE genes.

Gene Gene Symbol Sahiwal Holstein P value
ADAM Metallopeptidase With Thrombospondin Type 1 Motif 20 ADAMTS20 1.3 ± 0.1 2.6 ± 0.4 0.011
Cyclic GMP-AMP Synthase CGAS 2.8 ± 0.2 1.2 ± 0.1 <0.001
HECT And RLD Domain Containing E3 Ubiquitin Protein Ligase 5 HERC5 3.7 ± 0.6 1.1 ± 0.1 <0.001
Interferon Alpha IFNA 1.7 ± 0.3 2.1 ± 0.4 n.s.
Interferon Alpha Inducible Protein 6 IFI6 11.8 ± 1.1 2.0 ± 0.4 0.001
Interferon Beta 1 IFNB1 2.3 ± 0.4 5.7 ± 1.1 0.006
Interferon Beta 3 IFNB3 5.1 ± 1.4 2.9 ± 0.7 n.s.
Interferon Induced Protein 44 IFI44 50.0 ± 8.4 5.6 ± 1.4 0.002
Interleukin 6 IL6 2.8 ± 0.6 4.5 ± 1.8 n.s.
Interleukin 23 Subunit Alpha IL23A 1.6 ± 0.2 5.8 ± 3.8 n.s.
ISG15 Ubiquitin Like Modifier ISG15 110.8 ± 26.3 5.0 ± 1.8 0.001
MX Dynamin Like GTPase 1 MX1 191.2 ± 14.4 10.9 ± 8.1 0.005
Nectin Cell Adhesion Molecule 3 NECTIN3 5.4 ± 2.0 48.3 ± 13.8 0.018
2’-5’-Oligoadenylate Synthetase 1 OAS1 231.0 ± 23.5 22.0 ± 17.9 0.001
Prune Homolog 2 With BCH Domain PRUNE2 2.0 ± 0.6 335.3 ± 172.9 0.018
Radical S-Adenosyl Methionine Domain Containing 2 RSAD2 153.4 ± 35.1 25.1 ± 19.2 0.013
Sterile Alpha Motif Domain Containing 9 SAMD9 8.5 ± 1.5 1.4 ± 0.1 0.001
Slit Guidance Ligand 2 SLIT2 2.1 ± 0.5 127.4 ± 65.0 0.007
TNF Superfamily Member 10 TNFSF10 12.5 ± 1.6 3.7 ± 0.9 0.009
Wnt Ligand Secretion Mediator WLS 1.2 ± 0.1 6.7 ± 1.7 0.008

Fig 5. qRT-PCR validation of candidate H/S-DE genes.

Fig 5

Panels show results for representative genes. (A) MX1; (B) TNSF10; (C) NECTIN3; (D) WLS; (E) IL6; (F) IL23A. Within each panel Y axis shows mRNA fold difference (calculated vs. the lowest expressed sample); X axis designates sample number and breed; Hol1-5, Holstein; Sah1-6, Sahiwal.

IPA analysis of our RNA-seq datasets predicted marked differences between Holstein and Sahiwal infected cells in pathways linked to activation of and response to type I IFN. However, no type I IFN genes were identified as differentially expressed in the H/S-DE dataset. In an attempt to elucidate whether a type I IFN could be associated with differential expression of ISGs, the expression of IFNB1, IFNB3 and IFNA was investigated by qRT-PCR. No significant difference in IFNA and IFNB3 mRNA levels were detected between Holstein and Sahiwal derived infected cells. However, there was a significant difference in IFNB1 expression. Unexpectedly, on average 2.5-fold higher expression of IFNB1 was detected in Holstein infected cells compared to Sahiwal cells (Table 4, Fig 6).

Fig 6. qRT-PCR analysis of INF genes: INFA, INFB1 and INFB3.

Fig 6

The Y-axis shows mRNA fold difference (calculated vs. the lowest expressed sample): Green infill designates, Sahiwal; blue infill, Holstein. Degree of significance denoted above bar.

Screen for differential expression of parasite transformation candidates and motifs bound by TashAT2 in B. indicus vs. B. taurus genomes

In total, 109 T. annulata encoded genes were identified as significantly differentially expressed between Holstein and Sahiwal infected cell lines (S7 File). Compared to host genes, however, the parasite genes showed markedly smaller magnitudes of change in expression level; a maximum log2 fold change of 2.6 compared to -26.1 for the bovine gene set. A large number of the parasite genes were annotated as hypothetical or encode proteins with a predicted function/location that cannot be linked to transformation of the infected cell or host immune response. Moreover, there was limited evidence for altered expression of genes encoding candidate modulators of host cell phenotype (TaPIN, Ta9 and TashATs). Thus, gene TA15705, encoding the Ta9 immuno-dominant antigen [17], was the only one identified as differentially expressed (0.84 log2 fold higher in Holstein, third most significant). Further analysis of Ta9 expression was performed by immunoblot. The result failed to indicate significant elevation in Ta9 protein levels across three of the Holstein lines relative to three Sahiwal (S2 Fig). A similar result was obtained for TashAT2 translocated to the host nucleus of the infected cell [44]. Immunofluorescence failed to detect a consistent difference in reactivity against the infected cell nucleus between all three Holstein lines relative to Sahiwal (S3 Fig).

Polymorphism of a secreted parasite factor, or diversity of the host target it interacts with, could differentially modulate host cell gene expression in the absence of differential expression, and this model has been predicted for TashAT proteins. Thus, for TashAT2/3, polymorphism has been identified in the number and pattern of their AT hook DNA binding domains [64]. The AT hook domain of TashAT2/3 shows similarity to that of mammalian HMGA proteins, suggesting potential overlap in target DNA motifs, and it is known that the TashAT2 domain recognises motifs rich in AT (Fig 7A) [19]. Our IPA upstream analysis predicted activation of HMGA1 (Table 3), and HMGA1 has been shown to lower expression of cholesterol biosynthesis genes [55], one of the most enriched pathways in our data sets. Therefore, we assessed whether the pattern of motifs bound by TashAT2 show diversity in the genomes of B. indicus relative to B. taurus and whether they might be enriched in our IA-H/S data set. Our initial screen identified 167,151 incidences of the target motifs in the B. taurus and 149,107 in the B. indicus genome (S8 File). The majority of the motif patterns in both genomes were located outside of gene coding regions (≈ 56% and ≈65% in B. taurus and B. indicus respectively), while ≈42% were within introns in B. taurus and ≈35% in B. indicus. Approximately 2% of patterns were in upstream or downstream UTRs and less than 1% were within exons in each breed (S8 File). As motif sites located within gene coding regions are more likely to be cis-acting elements and their associated targets more easily inferred, we used the cohort of motifs located in UTRs, introns and exons for a more detailed comparison (S9 File). In B. taurus 11,983 genes featured at least one TashAT2 binding motif compared to 9,516 in B. indicus and there were substantial differences in the pattern and number of TashAT2 binding motifs between breeds (Fig 7B; S9 File). There were 7,575 shared genes that possessed at least one TashAT2 binding motif and it was evident that for certain genes (including SLIT2, PRUNE2, LITAF and NECTIN3) the number of motifs differed between B. indicus and B. taurus genomes. Subsequent PANTHER analysis of the shared genes data set showed an enrichment in genes involved in “EGF (PI3K/AKT, ERK) receptor signalling”, “PGDF signalling”, “Wnt signalling pathway”, “FGF signalling”, “Gonadotropin-releasing hormone (GnRH)” and “Integrin signalling” (FDR adjusted p < 0.05; Fig 7C). Comparison with IPA on the H/S-DE gene set showed overlap with “ERK/MAPK signalling”, “PI3K/AKT signalling”, “Wnt/Ca+ pathway and “Integrin signalling” (S3 File). Additionally, of the 517 transcripts previously noted to be altered by parasite infection of BL20 cells and differ in expression between infected Holstein and Sahiwal cells, 232 (45%) feature TashAT2 binding motifs. This is significantly more than would be expected by chance under a binomial distribution in which ≈26% of annotated features in the bovine genome possess a binding motif (p < 0.001). SLIT2, LITAF, PRUNE2 and genes involved in “Integrin signalling” showed altered numbers of TashAT2 motifs (S10 File) and, together with “PI3K/AKT signalling”, integrin signalling genes were significantly enriched by IPA in the Infection Associated-H/S data set (S4 File).

Fig 7. Analysis of DNA motifs bound by the parasite encoded TashAT2 factor in the genomes of B. indicus vs. B. taurus.

Fig 7

(A) Consensus motif of the DNA region bound by parasite TashAT2 in the genome of B. indicus vs. B. taurus compared to the binding motifs of mammalian HGMA [19]. AT rich regions bound by AT hooks found in both TashAT2 and HGMA are shown in red. (B) A CIRCOS summary plot with automatic binning showing the number of TashAT2 binding motifs found in genes across the B. indicus (red) and B. taurus (blue) genomes. Genes with the same number of motifs are excluded to demonstrate the pattern of differences between each breed. Zoomed views are shown for chromosomes 2, 8 and 16 without automatic binning. The approximate regions of genes in the “EGF” and “Integrin signalling” pathways that possess different numbers of TashAT2 binding motifs and differ in expression between infected Holstein and Sahiwal breeds are indicated in green (ITGA4, MAPK10 and LAMC1). (C) Overrepresented PANTHER pathways in the cohort of bovine genes with FDR adjusted p value < 0.05 after a Fisher’s Exact test. Blue is the observed members of a pathway in the dataset, whereas red indicates the expected number if the distribution was random.

Discussion

Understanding how differential susceptibility to infectious disease is conferred is of great importance. Such understanding is needed to highlight at risk groups, inform therapeutic regimes and develop strategies for breeding productive but infection tolerant livestock. To gain insight on how variable infection responses are generated, we have utilised two cattle breeds representing the ends of a susceptibility spectrum to tropical theileriosis, caused by Theileria annulata. The Sahiwal breed (B. indicus) has evolved in tandem with the pathogen and in general, displays mild symptoms of disease when infected, whereas Holstein (B. taurus) are generally susceptible to acute disease. Based on the premise that tolerance has most likely evolved via competitive molecular interactions between pathogen and host that lead to pathology, we focused on identifying gene expression differences between low passage T. annulata macroschizont infected cell lines from tolerant Sahiwals and susceptible Holsteins, and screened for parasite factors that could generate these differences.

RNA-seq analysis identified a large number of genes displaying a significant difference in expression level between Holstein and Sahiwal infected cells (the H/S-DE data set). In order to replicate, as closely as possible, the situation in vivo and to prevent loss of virulence through cloning, this data set was generated from uncloned, low passage infected cell lines derived from 11 different animals. PCA analysis of the data set showed that although the two breed types clearly separated there was an inherent level of variability for samples within a breed. The result was not unexpected. It has been shown that gene expression levels, including cytokine encoding genes, vary within breed for both Holstein and Sahiwal infected cell lines [13,21,42], and gene expression markers associated with virulence show appreciable variability [65]. Such within breed variability could be generated by: a) genetic differences between individual cattle, with variance likely to be higher for the Holstein (genetically unrelated) relative to the Sahiwal cattle (full and 1/2 siblings) [9]; b) genetic differences in parasite genotype represented by infected cell lines, because although all cell lines are infected by Hissar, this strain contains multiple parasite genotypes [65]; c) epigenetic differences generated during establishment of cell lines in vivo or in vitro and d) variable host cell type composition between infected cell lines. We propose that the gene expression profile of infected cells may be unique for each individual animal and that this influences the wide variability in infection outcome between and within “tolerant” and “susceptible” breeds [9,66]. To validate this premise, analysis of multiple pairs of infected and uninfected cells from a range of “tolerant” and “susceptible” breeds is required.

Despite the expectation of variable expression profiles, a large number of genes in our H/S-DE data set displayed expression differences that were consistent across all Sahiwal vs. Holstein samples (see denoted genes, S2 File); while the remainder, as indicated above, showed a higher level of variability between individuals across the breeds (for example see IL-6). The results support the conclusion that there is a major influence of breed type on the phenotype of the macroschizont-infected leukocyte linked to disease susceptibility [2,5,13].

IPA was performed on both the full H/S-DE data set and the more limited set (IA-H/S) of genes whose expression is induced by parasite infection of the host leukocyte. Strikingly the most significantly enriched pathways for both data sets were the same. However, previous studies indicate that a number of infection-associated genes differentially expressed between breeds have been missed by our analysis, and there are at least several candidates. Thus, the TGFB2 gene, that is induced by infection and expressed at a higher level in Holstein infected cells [13], was present in our H/S-DE data set (elevated 3.5 log2 fold) but was not identified as infection associated. ICAM1 was reported as differentially expressed between Sahiwal and Holstein infected cells [23] but was not highlighted as significantly different in our study. The reasons for these false negatives are not conclusively known, but most likely relate to differences in bovine cells used to establish changes linked to infection and methodology for identification of differential gene expression.

The pathway showing the most consistent and significant modulation across our data sets was type 1 IFN. Modulation was most evident as lower relative expression of ISG in Holstein infected cells (82% of identified ISGs) but was also recorded for genes encoding pattern recognition receptors (DDX58 [RIG-1], ZBP1 [DAI], IFIH1 [MDA-5] and CGAS) and transcription factors (STAT1, STAT2, IRF3, IRF7 and IRF9). Notably, for many ISG genes differential expression was consistent across all Holstein vs. Sahiwal samples. Validation of gene expression differences was obtained by qRT-PCR for several ISG genes (MX1, OAS1Y, ISG15, RSAD2 [Viperin], IFI6, IFI44 and HERC5), and a number of ISG genes (HERC6, ISG15, IFI6, IFI44, IFIH1, IFI35, MX1, MX2, OAS2 and ZBP1) were identified as modulated by parasite infection, supporting the previous report on ISG15 [20]. The function of ISG genes in innate immunity has been studied predominantly using viral systems (reviewed in [67,68]). Limited information regarding the function of ISGs against intracellular protozoa is available. Nevertheless, it is reasonable to propose that, by acting as tumour suppressors or regulators of immune effector cells, ISGs could modulate the phenotype of the Theileria-infected leukocyte and influence infection outcome [6974]. Modulation of ISG expression between cattle breeds could also be of relevance to other pathogens. An ISG response is generated in infected cells against Foot and Mouth Disease Virus [75] and B. indicus cattle are known to be less susceptible to FMDV infection than B. taurus [76].

While IPA predicted modulation of type I IFN production associated with tolerance, there was no direct evidence for this in the RNA-seq data set. This may have been due to difficulty in mapping RNA-seq reads to specific IFN genes within the large type I IFN family [77]. However, contrary to the IPA network prediction, qRT-PCR indicated significantly higher levels of IFNB1 expression in the Holstein cell lines, supporting previous data demonstrating expression of IFNB in infected leukocytes [78]. How differential expression of IFNB1 is generated between the infected cell lines is currently unclear but could occur via lower expression of genes involved in cholesterol biosynthesis recorded for Holstein infected cells (see Fig 5). A reduction in cholesterol biosynthesis has been linked to induction of IFNB expression and inflammatory disease [52,79] and is implicated in the induction of an inflammatory cytokine storm in COVID-19 [80]. It appears, therefore, that tolerance to tropical theileriosis is associated with a higher relative ISG expression in Sahiwal infected cells; while disease susceptibility of Holstein may be linked to elevated levels of IFNB1 cytokine production and a stronger propensity to stimulate an inflammatory response. Whether these events are associated with the massive pulmonary oedema following metastasis of infected cells to the lungs [12] requires investigation. Parallels with mechanisms conferring susceptibility to other infectious diseases are of interest.

IPA also identified several processes linked to oncogenesis. This included a prediction of greater metastasis potential for Holstein infected cells (via, for example elevated FAT10 [UBD], TGFB2, ITGA1, PLAU and SPARC gene expression), supporting and validating the previous work demonstrating a role for TGFB2 in stimulating elevated metastasis of Holstein relative to Sahiwal infected cell lines [13]. Previous KEGG pathway analysis of microarray data led to postulation that disease susceptibility is linked to an aberrant interaction with cells of the immune response that promotes immune dysfunction [23]. Major differences in expression of genes associated with immune cell interaction (such as receptors, chemokines and cytokines), highlighted in our current analysis, supports this premise. Thus, reduced oncogenic potential and lower aberrant immune cell activation could account for the less pronounced lymph node enlargement observed in infected Sahiwal relative to Holstein cattle [9].

A major aim of this study was to investigate potential pathogen-host interactions that generate gene expression differences between infected cells from tolerant vs. susceptible breeds. One potential interaction previously considered is that the type of cell preferentially infected could differ between B. indicus and B. taurus breeds. Analysis with antibodies against cell type markers indicated that myeloid cells were the predominant infected cell type infected for both breeds [81], although evidence was presented for some macroschizont infected NK cells in Sahiwal lines. Our RNA-seq data support these findings. For genes encoding the myeloid lineage markers, CCR2, CCR3, CD13, CD11, CD14, CD34, CD36, CD9 and CD38, none were identified as differentially expressed. For genes encoding the CD8 and CD2 markers previously identified on some Sahiwal infected cells [81], CD8 was not identified as significant in our data set. CD2 showed higher expression levels associated with Sahiwal line, although in three Holstein lines the level of expression was close to that of one of the Sahiwal lines. In contrast, the gene encoding the CD4 marker found on T cells and myeloid cells showed significantly lower expression values in all Holstein cell lines (-4.6 log2). The results imply that either the profile of cell types infected can vary to a degree between B. indicus and B. taurus, or following transformation of predominantly myeloid cells, the expression of cell surface molecules is differentially modulated. Previous analysis of infected, purified myeloid cells [23] also highlighted lower expression of CD4 in Holstein relative to Sahiwal cells, and together with alteration of surface marker expression following infection of the BL20 line [14,15], indicates the latter possibility as most likely.

Interactions that modulate gene expression between Sahiwal and Holstein infected cells are likely to involve pathogen modulators of host cell gene expression. No conclusive evidence of this for known parasite candidates was detected. While the gene encoding Ta9 reported to activate AP1 [17] was scored as significantly expressed at a higher level (0.84 log2 fold) in Holstein infected leukocytes, this was not validated at the protein level by immunoblot. IPA, however, did identify HMGA1 as a potential regulator of differential gene expression between the two breeds. Mammalian HMGAs act as architectural transcription factors and bind patterns of AT rich motifs to alter chromatin structure (both locally and globally) [82]. HMGA factors regulate gene expression primarily during development [83], but also in neoplasia [84,85] and genes targeted by NF-κB [86]. Given that TashAT2 is known to encode a DNA binding domain with homology to HMGAs, binds a related AT rich motif pattern (Fig 7A) and was detected in the host nucleus of both Sahiwal and Holstein infected cells (S3 Fig), we screened the available B. indicus and B. taurus genomes for the presence of motifs bound by TashAT2. The results clearly demonstrated a difference in the predicted pattern of these motifs (Fig 7B). Thus, genes in both genomes uniquely display a predicted TashAT2 bound motif, while for genes that shared possession of motifs, differences in motif number were frequently detected. Within this cohort of genes there are many that could influence the infected cell phenotype. For example, there is enrichment for genes that function in “Epidermal Growth Factor signalling” and overlap with pathways highlighted by IPA of the H/S-DE data set (“ERK”, “PI3K/AKT”, “Integrin signalling”). PI3K/AKT signalling has previously been linked to mechanisms involved in transformation of the Theileria-infected leukocyte [55,87] and is modulated by mammalian HMGA2 proteins [84]. Integrin signalling is well established in oncogenesis and cellular interaction [88,89], and integrin genes are targets for modulation by HMGA [85]. It was also demonstrated that 45% of infection-associated genes differentially expressed between Sahiwal and Holstein infected cell lines feature at least one putative TashAT2 binding motif, which is more than expected by chance (p < 0.001). “PI3K/AKT” and “Integrin signalling” were significant in IPA of the IA-H/S data set (S4 File) and several infection-associated genes involved in integrin signalling, possess differences in the number of TashAT2 binding motifs between the two genomes (LAMC1, ACTA2, ITGA4 and ITGB5). The majority of integrin genes are expressed at a lower level in Sahiwal infected cells relative to Holstein. Reduced expression of ITGA4 via PI3K signalling has been linked previously to a loss of infected cell virulence [87].

Based on the above results, and on the premise that AT hook DNA binding proteins act as base composition readers to mould the structure of the epigenome [90], we propose the following model. Differences in the pattern of AT rich DNA motifs between the genomes of tolerant vs. susceptible cattle breeds (and individuals), when bound by pathogen TashAT factors located in the host cell nucleus, give rise to alternative chromatin architectures. These architectures then allow variable accessibility of host transcription factors activated by infection, resulting in differential modulation of gene expression in the macroschizont infected cell. Given the known association between the infected cell and pathology, and the polymorphism identified in the TashAT2/3 DNA binding domain [64], this model could explain the considerable variability in disease severity between and within breeds that occur upon infection [9,66]. Validation of differential TashAT2 factor binding to sites in host DNA by chromatin immunoprecipitation, and further investigation of divergent pathogen-host interactions that have evolved to modulate host cell gene expression will provide understanding of how differential susceptibility to disease has arisen, and inform strategies aiming to breed productive livestock tolerant to infection.

Supporting information

S1 Fig. IPA canonical pathway, “FAT10 cancer signalling”.

The nodes represent genes/molecules/complexes in a pathway, and the lines and arrows between nodes indicate known relationships from the Ingenuity Knowledge Base. Nodes with purple outline indicate molecules that were measured as differentially expressed in our dataset, with the intensity of coloured infill indicating the level of up–(red) or down-(green) regulation of Holstein relative to Sahiwal. The blue and orange coloured molecules and lines are predicted activation states generated by the Molecular Activity Predictor function in Ingenuity Pathway Analysis. Blue colour indicates a predicted inhibition, and orange a predicted activation state in Holstein relative to Sahiwal. Broad lines with explanatory text beside the pathway indicate the cellular location of molecules in the pathway. The molecules in the pathway are given shapes that indicate their functional class (Nested Circle/Square = Group/Complex, Horizontal ellipse = Transcriptional Regulator, Vertical Ellipse = transmembrane receptor, Vertical Rhombus = enzyme, Square = Cytokine/Growth Factor, Triangle = Kinase, Vertical Ellipse = Transmembrane Receptor, Circle = other). The edges between molecules are also differentiated to indicate the type of relationship between them. Solid lines are direct relationships and dashed lines are indirect.

(TIF)

S2 Fig. Immunoblot carried out with protein extracts prepared from 3 independent Sahiwal lines compared to 3 Holstein lines.

Sahiwal samples are denoted SA, SB, SC; Holstein denoted HA, HB, HC. Protein size markers are indicated on the right (kDa). A. Extracts probed with Rat anti-Ta9 (TA15705) at 1/1200 dilution. Ta9 was detected at a variable size of 43-46kDa, which is a similar to that described previously for the polymorphic Ta9 antigen [17,43]. B. As a control, the same blot was reprobed with Rabbit antiserum raised against constitutively expressed ER HSP90 (TA06470) at 1/1500 dilution. The Ta9 and HSP90 reactive proteins are denoted by arrow.

(TIF)

S3 Fig. Immunofluorescence assay carried out on cells prepared from 3 independent Sahiwal lines compared to 3 Holstein lines.

Sahiwal samples are denoted SA, SB, SC; Holstein denoted HA, HB, HC. Cells were reacted with antiserum specific for TashAT2 (EL24) and images obtained using matched exposures. Bar = 7 μm.

(TIFF)

S1 Table. Details of oligonucleotide primers used in qRT-PCR.

F and R denote forward and reverse primers respectively. *Primers designed based on Ensembl transcript.

(DOCX)

S2 Table. Number of RNA-seq reads mapped on to B. taurus genome are not statistically different.

Rows denote RNA-seq sample set derived from 6 Sahiwal (S1-6) or 5 Holstein (H1-5) infected cell lines. Columns denote the % of reads mapped to the B. taurus genome. Summary denotes mean % of reads mapped, standard deviation and no significant difference between means.

(DOCX)

S3 Table. Top upstream regulators with predicted activated or inhibited activation states in infection-associated H/S-DE genes.

In total 655 upstream regulators were found with a p-value of < 0.05; including 123 with a significantly predicted activation state (activated or inhibited, based on a z-score >2, or <-2). Only the top 20 activated and top 20 inhibited regulators based on z-score are presented in the table.

(DOCX)

S1 File. Excel spread sheet of 2211 H/S-DE genes (<0.1 FDR).

Columns designate: Ensemble Gene ID; base mean of RNA-seq counts across all samples; log2 fold change between mean of Holstein vs. Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); p value (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; DESeq2 normalized RNA-seq counts for each of the six Sahwal samples (Sah 1–6) and five Holstein (Hol 1–6); the gene name; an alternative gene name, when applicable; whether RNA-seq counts were consistent across all Sahiwal samples vs. Holstein; Interferon associated genes, as identified in the study of Liu et al. [50] or by GO [37]; description of protein/factor encoded by the gene. Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

(XLSX)

S2 File. Excel spread sheet of the 517 genes in H/S-DE that overlap with genes designated as infection associated (modulated in TBL20 vs. BL20).

Columns designate: Gene id; DESeq normalized base mean of counts in A (Sahiwal) samples; base mean counts in B (Holstein) samples; log2 fold change between mean of Holstein vs. Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene.

(XLSX)

S3 File. Excel spread sheet of the 100 most significant canonical pathways designated as enriched by IPA for H/S-DE data set.

Columns designate: The enriched canonical pathway; the–log p value; ratio of the number of genes from the list that maps to the pathway divided by the total number of genes that map to pathway; the (activation) Z-score; the gene symbol for molecules in H/S-DE data set present in the pathway. Blue highlight denotes pathways with a Z score>2, indicating repression in Holstein vs. Sahiwal infected cells; red highlight indicates activation (Z-score >1.5).

(XLSX)

S4 File. Excel spread sheet of the 100 most significant canonical pathways enriched by IPA from the data set of 517 overlapped infection-associated H/S-DE genes.

Columns designate: The enriched canonical pathway; the–log p value; the ratio of the number of genes from the list that maps to the pathway divided by the total number of genes that map to pathway; the (activation) Z-score; the gene symbol for molecules in H/S DE data set present in the pathway. Blue highlight denotes pathways with an activation score (Z score>2) indicating repression in Holstein vs. Sahiwal infected cells.

(XLSX)

S5 File. Excel spread sheet of genes in H/S-DE data set predicted to encode secreted proteins.

Columns designate: Ensemble Gene ID; DESeq normalized base mean of counts in all samples; log2 fold change between mean of Holstein vs.Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene; prediction of location based on gene ontology data in Genecards [35]; Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

(XLSX)

S6 File. Excel spread sheet of genes in H/S-DE data set predicted to encode receptors.

Columns designate: Ensemble Gene ID; Deseq2 normalized base mean for RNA-seq counts for all Sahiwal and Holstein samples; the log2 fold change between Holstein and Sahiwal samples for that gene; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene; Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

(XLSX)

S7 File. Excel spread sheet of 109 parasite (T. annulata) genes (<0.1 FDR).

Columns designate: TA number (T. annulata gene ID); DESeq normalized base mean of RNA-seq counts across all samples; designation of whether expression is higher (up) or lower (down) in Holstein (B. taurus) samples relative to Sahiwal (B. indicus); DESeq normalized base mean of counts in A (Sahiwal) samples; base mean counts in B (Holstein) samples; log2 fold change between mean of Holstein vs.Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; description of protein/factor predicted to be encoded by the gene; location of gene on T. annulata genome; genomic sequence ID.

(XLS)

S8 File. Excel spread sheet of nucleotide motifs bound by TashAT2 in genome of B. taurus and B. indicus.

Columns designate chromosome; motif start position; motif end position; genomic feature linked to motif position (non-coding, intron, exon, untranslated region (UTR); gene product associated with motif and genomic feature, if applicable.

(XLSX)

S9 File. Excel spread sheet of nucleotide motifs bound by TashAT2 in gene coding (RNA) regions of B. taurus and B. indicus genomes.

The data sheets list motifs identified in B. taurus genes, motifs identified in B. indicus genes and motifs in genes shared by B. taurus and B. indicus. Columns indicate: Gene identifier, number of motifs identified in that gene region; and for shared genes: B. number of motifs identified in B. taurus gene; C. number of motifs identified in B. indicus gene.

(XLSX)

S10 File. List of genes in integrin signalling pathway that bear different numbers of nucleotide motif bound by TashAT2 in B. taurus and B. indicus genomes.

Green highlight designates genes that display differential expression between Sahiwal and Holstein infected cells.

(DOCX)

Acknowledgments

We would like to thank the Centre for Virus Research, University of Glasgow for providing access to the IPA software.

Data Availability

RNA-seq data are publicly available at the NCBI GEO online repository under accession numbers GSM4824553–GSM4824563.

Funding Statement

This work was funded by BBSRC: grant BB/L004739/1 and a BBSRC Strategic Programme grant (Control of Infectious Diseases: [BB/P013740/1]). The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Betts A, Gray C, Zelek M, MacLean RC, King KC. High parasite diversity accelerates host adaptation and diversification. Science. 2018;360:907–911. doi: 10.1126/science.aam9974 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Glass EJ, Crutchley S, Jensen K. Living with the enemy or uninvited guests: functional genomics approaches to investigating host resistance or tolerance traits to a protozoan parasite, Theileria annulata, in cattle. Vet Immunol Immunopathol. 2012;148:178–89. doi: 10.1016/j.vetimm.2012.03.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Brown C, Hunter A, Luckins A. Diseases caused by protozoa. Handbook of Animal Diseases in the Tropics. 4th Ed. Cambridge UK: Cambridge University Press; 1990. [Google Scholar]
  • 4.Minjauw B, McLeod A. Tick-borne Diseases and Poverty. The Impact of Ticks and Tick-borne Diseases on the Livelihoods of Small Scale and Marginal Livestock Owners in India and Eastern and Southern Africa. Research Report, DFID Animal Health Programme. Centre for Tropical Veterinary Medicine, University of Edinburgh. 2003; 116. [Google Scholar]
  • 5.Glass EJ, Jensen K. Resistance and susceptibility to a protozoan parasite of cattle—gene expression differences in macrophages from different breeds of cattle. Vet Immunol Immunopathol. 2007;120(1–2):20–30. doi: 10.1016/j.vetimm.2007.07.013 [DOI] [PubMed] [Google Scholar]
  • 6.Singh CV. Cross-breeding in Cattle for Milk Production: Achievements, Challenges and Opportunities in India-A Review. Adv Dairy Res. 2016; 4:158. [Google Scholar]
  • 7.Kolte SW, Larcombe SD, Jadhao SG, Magar SP, Warthi G, Kurkure NV, et al. PCR Diagnosis of Tick-Borne Pathogens in Maharashtra State, India Indicates Fitness Cost Associated with Carrier Infections Is Greater for Crossbreed Than Native Cattle Breeds. PLoS One. 201;12(3):e0174595. doi: 10.1371/journal.pone.0174595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Larcombe SD, Kolte SW, Ponnudurai G, Kurkure N, Magar S, Velusamy R, et al. The impact of tick-borne pathogen infection in Indian bovines is determined by host type but not the genotype of Theileria annulata. Infect Genet Evol. 2019;75:103972. doi: 10.1016/j.meegid.2019.103972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Glass EJ, Preston PM, Springbett A, Craigmile S, Kirvar E, Wilkie G, et al. Bos taurus and Bos indicus (Sahiwal) calves respond differently to infection with Theileria annulata and produce markedly different levels of acute phase proteins. Int J Parasitol. 2005;35(3):337–47. doi: 10.1016/j.ijpara.2004.12.006 [DOI] [PubMed] [Google Scholar]
  • 10.Glass EJ, Innes EA, Spooner RL, Brown CG. Infection of bovine monocyte/macrophage populations with Theileria annulata and Theileria parva. Vet Immunol Immunopathol. 1989;22(4):355–68. doi: 10.1016/0165-2427(89)90171-2 [DOI] [PubMed] [Google Scholar]
  • 11.Hulliger L, Wilde KH, Brown CG, Turner L. Mode of multiplication of Theileria in cultures of bovine lymphocytic cells. Nature. 1964;203:728–30. doi: 10.1038/203728a0 [DOI] [PubMed] [Google Scholar]
  • 12.Irvin AD, Morrison WI. Immunopathology, immunology and immuno- prophylaxis of Theileria infections. In: Soulsby EJL, editor. Immune Responses in Parasitic Infections: Immunology, Immunopathology and Immunoprophylaxis vol III Protozoa Baton Rouge, Florida: CRC Press Inc; 1987. 223–274. [Google Scholar]
  • 13.Chaussepied M, Janski N, Baumgartner M, Lizundia R, Jensen K, Weir W, et al. TGF-b2 Induction Regulates Invasiveness of Theileria-Transformed Leukocytes and Disease Susceptibility. PLoS Pathog. 2010;6(11):e1001197. doi: 10.1371/journal.ppat.1001197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kinnaird JH, Weir W, Durrani Z, Pillai SS, Baird M, Shiels BR. A Bovine Lymphosarcoma Cell Line Infected with Theileria annulata Exhibits an Irreversible Reconfiguration of Host Cell Gene Expression. PLoS One. 2013;8(6):e66833. doi: 10.1371/journal.pone.0066833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Durrani Z, Weir W, Pillai S, Kinnaird J, Shiels B. Modulation of activation-associated host cell gene expression by the apicomplexan parasite Theileria annulata. Cell Microbiol. 2012;14(9):1434–54. doi: 10.1111/j.1462-5822.2012.01809.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Marsolier J, Perichon M, DeBarry JD, Villoutreix BO, Chluba J, Lopez T, et al. Theileria parasites secrete a prolyl isomerase to maintain host leukocyte transformation. Nature. 2015;520(7547):378–82. doi: 10.1038/nature14044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Unlu AH, Tajeri S, Bilgic HB, Eren H, Karagenc T, Langsley G. The secreted Theileria annulata Ta9 protein contributes to activation of the AP-1 transcription factor. PLoS One. 2018;13(5):e0196875. doi: 10.1371/journal.pone.0196875 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Hayashida K. et al. Comparative genome analysis of three eukaryotic parasites with differing abilities to transform leukocytes reveals key mediators of Theileria-induced leukocyte transformation. mBIO 2012;3(5),e00204–12. doi: 10.1128/mBio.00204-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Swan DG, Stern R, McKellar S, Phillips K, Oura CA, Karagenc TI, et al. Characterisation of a cluster of genes encoding Theileria annulata AT hook DNA-binding proteins and evidence for localisation to the host cell nucleus. J Cell Sci. 2001;114:2747–2754. [DOI] [PubMed] [Google Scholar]
  • 20.Oura CA, McKellar S, Swan DG, Okan E, Shiels BR. Infection of bovine cells by the protozoan parasite Theileria annulata modulates expression of the ISGylation system. Cell Microbiol. 2006;8(2):276–88. doi: 10.1111/j.1462-5822.2005.00620.x [DOI] [PubMed] [Google Scholar]
  • 21.McGuire K, Manuja A, Russell GC, Springbett A, Craigmile SC, Nichani AK, et al. Quantitative analysis of pro-inflammatory cytokine mRNA expression in Theileria annulata-infected cell lines derived from resistant and susceptible cattle. Vet Immunol Immunopathol. 2004;99:87–98. doi: 10.1016/j.vetimm.2004.01.003 [DOI] [PubMed] [Google Scholar]
  • 22.Graham SP, Brown DJ, Vatansever Z, Waddington D, Taylor LH, Nichani AK, et al. Proinflammatory cytokine expression by Theileria annulata infected cell lines correlates with the pathology they cause in vivo. Vaccine. 2001;19(20–22):2932–44. doi: 10.1016/s0264-410x(00)00529-6 [DOI] [PubMed] [Google Scholar]
  • 23.Jensen K, Paxton E, Waddington D, Talbot R, Darghouth MA, Glass EJ. Differences in the transcriptional responses induced by Theileria annulata infection in bovine monocytes derived from resistant and susceptible cattle breeds. Int J Parasitol. 2008;38(3–4):313–25. doi: 10.1016/j.ijpara.2007.08.007 [DOI] [PubMed] [Google Scholar]
  • 24.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. Embnet. J. 2011;17,10–12. [Google Scholar]
  • 25.Joshi NA, Fass JN. (2011). Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33). Available from: https://github.com/najoshi/sickle. [Google Scholar]
  • 26.Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018 Nucleic Acids Res. 2018;46(D1):D754–D761. doi: 10.1093/nar/gkx1098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Langmead B, Salzberg S. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Buffalo V. sam2counts.py—convert SAM mapping results to reference sequence counts. Available from: https://github.com/vsbuffalo/sam2counts. [Google Scholar]
  • 29.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Pain A, Renauld H, Berriman M, Murphy L, Yeats CA, Weir W, et al. : Genome of the host-cell transforming parasite Theileria annulata compared with T. parva. Science 2005;309:131–133. doi: 10.1126/science.1110418 [DOI] [PubMed] [Google Scholar]
  • 31.Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc. 2009;4(1):44–57. doi: 10.1038/nprot.2008.211 [DOI] [PubMed] [Google Scholar]
  • 32.Mi H, Huang X, Muruganujan A, Tang H, Mills C, Kang D, et al. PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. 2017;45(D1):D183–D189. doi: 10.1093/nar/gkw1138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Land J. Statistical significance of the overlap between two groups of genes. Available from: http://nemates.org/MA/progs/overlap_stats.html. [Google Scholar]
  • 34.Almagro Armenteros JJ, Tsirigos KD, Sønderby CK, Petersen TN, Winther O, Brunak S, et al. SignalP 5.0 Improves Signal Peptide Predictions Using Deep Neural Networks. Nat Biotechnol. 2019;37(4):420–423. doi: 10.1038/s41587-019-0036-z [DOI] [PubMed] [Google Scholar]
  • 35.Krogh A, Larsson B, von Heijne G, Sonnhammer EL. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305(3):567–580. doi: 10.1006/jmbi.2000.4315 [DOI] [PubMed] [Google Scholar]
  • 36.Pierleoni A, Martelli PL, Casadio R. PredGPI: A GPI-anchor Predictor. BMC Bioinformatics. 2008;9:392. doi: 10.1186/1471-2105-9-392 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Weizmann Institute of Science. GeneCards. Available from: https://www.genecards.org.
  • 38.Jensen K, Gallagher IJ, Johnston N, Welsh M, Skuce R, Williams JL, et al. Variation in the Early Host-Pathogen Interaction of Bovine Macrophages with Divergent Mycobacterium bovis Strains in the United Kingdom. Infect Immun. 2018;86(3):e00385–17. doi: 10.1128/IAI.00385-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Rice P., Longden I., Bleasby A. EMBOSS: the European Molecular Biology Open Software Suite Trends Genet. 2000;16:276–277. doi: 10.1016/s0168-9525(00)02024-2 [DOI] [PubMed] [Google Scholar]
  • 40.Quinlan Aaron R., and Hall Ira M. "BEDTools: a flexible suite of utilities for comparing genomic features." Bioinformatics 2010;26.6: 841–842. doi: 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Krzywinski Martin, et al. "Circos: an information aesthetic for comparative genomics." Genome research 2009;19.9:1639–1645. doi: 10.1101/gr.092759.109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kinnaird JH, Singh M, Gillan V, Weir W, Calder ED, Hostettler I, et al. Characterization of HSP90 isoforms in transformed bovine leukocytes infected with Theileria annulata. Cell Microbiol. 2017;19(3):e12669. doi: 10.1111/cmi.12669 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Bilgic HB, Karagenc T, Bakırcı S, Shiels B, Tait A, Kinnaird J, et al. Identification and Analysis of Immunodominant Antigens for ELISA-Based Detection of Theileria annulata. PLoS One. 2016;11(6):e0156645. doi: 10.1371/journal.pone.0156645 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Swan DG, Phillips K, Tait A, Shiels BR. Evidence for localisation of a Theileria parasite AT hook DNA-binding protein to the nucleus of immortalised bovine host cells. Mol Biochem Parasitol. 1999;101(1–2):117–29. doi: 10.1016/s0166-6851(99)00064-x [DOI] [PubMed] [Google Scholar]
  • 45.Chaturvedi S, Yuen DA, Bajwa A, Huang YW, Sokollik C, Huang L, et al. Slit2 prevents neutrophil recruitment and renal ischemia-reperfusion injury. J Am Soc Nephrol. 2013;24(8):1274–87. doi: 10.1681/ASN.2012090890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Zhang QQ, Zhou DL, Lei Y, Zheng L, Chen SX, Gou HJ, et al. Slit2/Robo1 signalling promotes intestinal tumorigenesis through Src-mediated activation of the Wnt/β-catenin pathway. Oncotarget. 2015;6(5):3123–35. doi: 10.18632/oncotarget.3060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Aichem A, Groettrup M. The ubiquitin-like modifier FAT10 in cancer development. Int J Biochem Cell Biol. 2016;79:451–461. doi: 10.1016/j.biocel.2016.07.001 [DOI] [PubMed] [Google Scholar]
  • 48.Faralli JA, Desikan H, Peotter J, Kanneganti N, Weinhaus B, Filla MS, et al. Genomic/proteomic Analyses of Dexamethasone-Treated Human Trabecular Meshwork Cells Reveal a Role for GULP1 and ABR in Phagocytosis. Mol Vis. 2019;25:237–254. [PMC free article] [PubMed] [Google Scholar]
  • 49.Datta S, Nam HS, Hayashi M, Maldonado L, Goldberg R, Brait M, et al. Expression of GULP1 in Bronchial Epithelium Is Associated with the Progression of Emphysema in Chronic Obstructive Pulmonary Disease. Respir Med. 2017;124:72–78. doi: 10.1016/j.rmed.2017.02.011 [DOI] [PubMed] [Google Scholar]
  • 50.Oryan A, Namazi F, Sharifiyazdi H, Razavi M, Shahriari R. Clinicopathological Findings of a Natural Outbreak of Theileria annulata in Cattle: An Emerging Disease in Southern Iran. Parasitol Res. 2013;112(1):123–7. doi: 10.1007/s00436-012-3114-4 [DOI] [PubMed] [Google Scholar]
  • 51.Liu SY, Sanchez DJ, Aliyari R, Lu S, Cheng G. Systematic identification of type I and type II interferon-induced antiviral factors. Proc Natl Acad Sci U S A. 2012;109(11):4239–44. doi: 10.1073/pnas.1114981109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Pombo JP, Sanyal S. Perturbation of Intracellular Cholesterol and Fatty Acid Homeostasis During Flavivirus Infections. Front Immunol. 2018;9:1276. doi: 10.3389/fimmu.2018.01276 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Baenke F, Peck B, Miess H, Schulze A. Hooked on fat: the role of lipid synthesis in cancer metabolism and tumour development. Dis Model Mech. 2013;6(6):1353–63. doi: 10.1242/dmm.011338 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ehmsen S, Pedersen MH, Wang G, Terp MG, Arslanagic A, Hood BL, et al. Increased Cholesterol Biosynthesis Is a Key Characteristic of Breast Cancer Stem Cells Influencing Patient Outcome. Cell Rep. 2019;27(13):3927–3938.e6. doi: 10.1016/j.celrep.2019.05.104 [DOI] [PubMed] [Google Scholar]
  • 55.Heussler VT, Küenzi P, Fraga F, Schwab RA, Hemmings BA, Dobbelaere DA. The Akt/PKB Pathway Is Constitutively Activated in Theileria-transformed Leucocytes, but Does Not Directly Control Constitutive NF-kappaB Activation. Cell Microbiol. 2001;3(8):537–50. doi: 10.1046/j.1462-5822.2001.00134.x [DOI] [PubMed] [Google Scholar]
  • 56.Treff NR, Derek Pouchnik D, Dement GA, Britt RL and Reeves R. High-mobility group A1a protein regulates Ras/ERK signalling in MCF-7 human breast cancer cells. Oncogene 2004;23:777–785. doi: 10.1038/sj.onc.1207167 [DOI] [PubMed] [Google Scholar]
  • 57.Salameh A, Lee AK, Cardó-Vila M, Nunes DN, Efstathiou E, Staquicini FI, et al. PRUNE2 is a human prostate cancer suppressor regulated by the intronic long noncoding RNA PCA3. Proc Natl Acad Sci U S A. 2015;112(27):8403–8. doi: 10.1073/pnas.1507882112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Koltai T. Clusterin: a key player in cancer chemoresistance and its inhibition. Onco Targets Ther. 2014;7:447–56. doi: 10.2147/OTT.S58622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Sansanwal P, Li L, Sarwal MM. Inhibition of intracellular clusterin attenuates cell death in nephropathic cystinosis. J Am Soc Nephrol. 2015;26(3):612–25. doi: 10.1681/ASN.2013060577 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chen J, Wang M, Xi B, Xue J, He D, Zhang J, et al. SPARC is a key regulator of proliferation, apoptosis and invasion in human ovarian cancer. PLoS One. 2012;7(8):e42413. doi: 10.1371/journal.pone.0042413 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Fenouille N, Puissant A, Tichet M, Zimniak G, Abbe P, Mallavialle A, et al. SPARC functions as an anti-stress factor by inactivating p53 through Akt-mediated MDM2 phosphorylation to promote melanoma cell survival. Oncogene. 2011;30(49):4887–900. doi: 10.1038/onc.2011.198 [DOI] [PubMed] [Google Scholar]
  • 62.Hayashida K, Hattori M, Nakao R, Tanaka Y, Kim JY, Inoue N, et al. MDM2 regulates a novel form of incomplete neoplastic transformation of Theileria parva infected lymphocytes. Mol Biochem Parasitol. 2010;174(1):8–17. doi: 10.1016/j.molbiopara.2010.06.005 [DOI] [PubMed] [Google Scholar]
  • 63.Resmini G, Rizzo S, Franchin C, Zanin R, Penzo C, Pegoraro S, et al. HMGA1 regulates the Plasminogen activation system in the secretome of breast cancer cells Sci Rep. 2017;7(1):11768. doi: 10.1038/s41598-017-11409-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Weir W., Karagenc T. Baird M., Tait A. and Shiels B.R. Evolution and diversity of secretome genes in the apicomplexan parasite Theileria annulata. BMC Genomics. 2010;11;42. doi: 10.1186/1471-2164-11-42 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Hall R, Ilhan T, Kirvar E, Wilkie G, Preston PM, Darghouth M, et al. Mechanism(s) of attenuation of Theileria annulata vaccine cell lines. Trop Med Int Health. 1999;4: A78–84. doi: 10.1046/j.1365-3156.1999.00454.x [DOI] [PubMed] [Google Scholar]
  • 66.Preston PM, Brown CG, Bell-Sakyi L, Richardson W, Sanderson A. Tropical theileriosis in Bos taurus and Bos taurus cross Bos indicus calves: response to infection with graded doses of sporozoites of Theileria annulata. Res Vet Sci. 1992;53(2):230–43. doi: 10.1016/0034-5288(92)90115-i [DOI] [PubMed] [Google Scholar]
  • 67.Schneider WM, Chevillotte MD, Rice CM. Interferon-stimulated genes: a complex web of host defenses. Annu Rev Immunol. 2014;32:513–45. doi: 10.1146/annurev-immunol-032713-120231 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Wong MT, Chen SS. Emerging roles of interferon-stimulated genes in the innate immune response to hepatitis C virus infection. Cell Mol Immunol. 2016;13(1):11–35. doi: 10.1038/cmi.2014.127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Bogunovic D, Boisson-Dupuis S, Casanova JL. ISG15: leading a double life as a secreted molecule. Exp Mol Med. 2013;45:e18. doi: 10.1038/emm.2013.36 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Helbig KJ, Beard MR. The role of viperin in the innate antiviral response. J Mol Biol. 2014;426(6):1210–9. doi: 10.1016/j.jmb.2013.10.019 [DOI] [PubMed] [Google Scholar]
  • 71.Kim JC, Ha YJ, Tak KH, Roh SA, Kwon YH, Kim CW, et al. Opposite functions of GSN and OAS2 on colorectal cancer metastasis, mediating perineural and lymphovascular invasion, respectively. PLoS One. 2018;13(8):e0202856. doi: 10.1371/journal.pone.0202856 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Kuriakose T, Kanneganti TD. ZBP1: Innate Sensor Regulating Cell Death and Inflammation. Trends Immunol. 2018;39(2):123–134. doi: 10.1016/j.it.2017.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Numajiri Haruki A, Naito T, Nishie T, Saito S, Nagata K. Interferon-inducible antiviral protein MxA enhances cell death triggered by endoplasmic reticulum stress. J Interferon Cytokine Res. 2011;31(11):847–56. doi: 10.1089/jir.2010.0132 [DOI] [PubMed] [Google Scholar]
  • 74.Xu D, Zhang T, Xiao J, Zhu K, Wei R, Wu Z, et al. Modification of BECN1 by ISG15 plays a crucial role in autophagy regulation by type I IFN/interferon. Autophagy. 2015;11(4):617–28. doi: 10.1080/15548627.2015.1023982 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Medina GN, Segundo FD, Stenfeldt C, Arzt J, de Los Santos T. The Different Tactics of Foot-and-Mouth Disease Virus to Evade Innate Immunity. Front Microbiol. 2018;9:2644. doi: 10.3389/fmicb.2018.02644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Singh R, Deb R, Singh U, Raja TV, Alex R, Kumar S, et al. Heterozygosity of the SNP (rs136500299) of ITGB6 receptor gene possibly influences susceptibility among crossbred bull to foot and mouth disease infection.Virus disease. 2015;26:48–54. doi: 10.1007/s13337-015-0249-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Walker AM, Roberts RM. Characterization of the bovine type I IFN locus: rearrangements, expansions, and novel subfamilies. BMC Genomics. 2009;10:187. doi: 10.1186/1471-2164-10-187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Sager H, Brunschwiler C, Jungi TW. Interferon production by Theileria annulata-transformed cell lines is restricted to the beta family. Parasite Immunol. 1998;20(4):175–82. doi: 10.1046/j.1365-3024.1998.00141.x [DOI] [PubMed] [Google Scholar]
  • 79.Perucha E, Melchiotti R, Bibby JA, Wu W, Frederiksen KS, Roberts CA, et al. The cholesterol biosynthesis pathway regulates IL-10 expression in human Th1 cells. Nat Commun. 2019;10(1):498. doi: 10.1038/s41467-019-08332-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Lee W, Ahn JH, Park HH, Kim HN, Kim H, Yoo Y, et al. COVID-19-activated SREBP2 disturbs cholesterol biosynthesis and leads to cytokine storm. Signal Transduct Target Ther. 2020;5(1):186. doi: 10.1038/s41392-020-00292-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Preston PM, Dargouth M, Boulter NR, Hall FR, Tall R, Kirvar E, et al. A dual role for immunosuppressor mechanisms in infection with Theileria annulata: well-regulated suppressor macrophages help in recovery from infection; profound immunosuppression promotes non-healing disease. Parasitol Res. 2002;88:522–534. doi: 10.1007/s00436-002-0613-8 [DOI] [PubMed] [Google Scholar]
  • 82.Ozturk N, Singh I, Mehta A, Braun T, Barreto G. HMGA proteins as modulators of chromatin structure during transcriptional activation. Front Cell Dev Biol. 2014;2:5. doi: 10.3389/fcell.2014.00005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Vignali R, Marracci S. HMGA Genes and Proteins in Development and Evolution. Int. J. Mol. Sci. 2020;21:654. doi: 10.3390/ijms21020654 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Wang X, Wang J Wu. Emerging roles for HMGA2 in colorectal cancer. Translational Oncology 2021;14:100894 doi: 10.1016/j.tranon.2020.100894 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Reeves R, Edberg DD, Li Y. Architectural transcription factor HMGI(Y) promotes tumour progression and mesenchymal transition of human epithelial cells. Mol Cell Biol. 2001;21(2):575–94. doi: 10.1128/MCB.21.2.575-594.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Jørn Henriksen1,2, Marianne Stabell1,2, Meza-Zepeda Leonardo A1,2, Lauvrak Silje AU1, Kassem Moustapha3 and Myklebost Ola. Identification of target genes for wild type and truncated HMGA2 in mesenchymal stem-like cells. BMC Cancer 2010; 10:329. doi: 10.1186/1471-2407-10-329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Haidar M, Whitworth J, Noé G, Liu WQ, Vidal M, Langsley G. TGF-β2 induces Grb2 to recruit PI3-K to TGF-RII that activates JNK/AP-1-signaling and augments invasiveness of Theileria-transformed macrophages. Sci Rep. 2015;5:15688. doi: 10.1038/srep15688 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Neri P, Ren L, Azab AK, Brentnall M, Gratton K, Klimowicz AC, et al. Integrin β7-mediated regulation of multiple myeloma cell adhesion, migration, and invasion. Blood. 2011;117(23):6202–13. doi: 10.1182/blood-2010-06-292243 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Lamb CA, O’Byrne S, Keir ME, Butcher EC. Gut-Selective Integrin-Targeted Therapies for Inflammatory Bowel Disease. J Crohns Colitis. 2018;12(2):S653–S668. doi: 10.1093/ecco-jcc/jjy060 [DOI] [PubMed] [Google Scholar]
  • 90.Quante T, Bird A. Do short, frequent DNA sequence motifs mould the epigenome? Nat. Rev. Mol. Cell Biol. 2016;17:257–262. doi: 10.1038/nrm.2015.31 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Gordon Langsley

28 Sep 2021

PONE-D-21-24458Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding proteinPLOS ONE

Dear Brian,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

 First, I'd like to apologize fort he time taken to come to a decision. The first two reviewers submitted their quite favorable reports quickly, but the third reviewer's report has only just came in and you'll see it's quite critical of the mapping and pointed out that if your mapping strategy was misunderstood it's likely that it was poorly described. As handling editor I found also the PCA analysis and particularly its interpretation wasn't sufficiently explained. For example, there was some obvious outliers that were not mentioned and it was not stated whether the PCA analysis was the basis used to decrease to 5 from 6 the number of lines derived from B. taurus ; meaning that you ended up comparing 6 lines derived from B. indicus  to 5 lines from B. taurus and how this imbalance might have impacted on the results and your interpretation. So, I've marked the submission down for major revision to give you the opportunity to clarify better the strategies, techniques used and their consequences on the choices you made.

Please submit your revised manuscript by Nov 12 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Gordon

Gordon Langsley

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match. 

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript by Larcombe et al. studies an important topic in Theileria research. Identification of host/parasite associated mechanisms of cattle resistance/tolerance to tropical theileriosis can provide novel control strategies for theileriosis. Both resistant and susceptible animals get infected by Theileria sporozoites but the disease pathology is mild in resistant Sahiwals and Holsteins die of infection. Therefore, comparisons between these two groups of animals could provide new insights into Theileria annulata infection biology and disease.

The authors approach was to perform dual RNA-seq on low passages of Theileria annulata transformed cell lines established ex vivo from theileriosis susceptible (Bos taurus, Holstein(Ho)) and resistant (Bos indicus, Sahiwal (Sa)) cattle and looked at host and parasite genes differentially expressed between individuals of each group. Although the paper is descriptive and fully based on bioinformatics analyses, I found the findings on differential binding of TashAT2 to DNA in Holstein and Sahiwal interesting and the results could be useful for further development of projects on TashAT family as candidate host cell manipulator Theileria annulata genes.

Here are my specific comments on the manuscript:

-In figure 1 some animal samples show great variance and does not cluster with others, I’m aware that these are from individual animals. It would be if the authors discuss the possible reasons behind this variance.

Moreover in the same figure we can observe 12 points representing 12 samples (6 Ho versus 6 Sa), but in lines 157-158 (in fig1 legend as well) it is mentioned that 11 cell lines (5 cell lines of Holstein and 6 Sahiwal cattle) was studied for the PCA plot. Either there is an error in the figure or in the number of cell lines studied.

-In figure 1 legend Taurus should be taurus.

-Concerning differential expression of parasite genes between Sa and Ho, I was wondering if the authors have checked by western blot the protein levels of TashAT2 in nuclear extracts of Sa and Ho cell lines? Although TashAT2 mRNA levels was found not to be differentially expressed, the protein levels could be different (due to stabilization and etc.) More availability of TashAT2 in the host cell nucleus could result in occupation of more sites on DNA possibly leading to changes in gene expression.

-In the list of DE parasite genes (S10_File) between Sa and Ho there are several other interesting candidates that are not discussed or mentioned in results section. For example, we know that some of these DE genes (such as SVSPs) have nuclear localization signals and are found to be trafficked to the nucleus and contribute to cell transformation. Specific examples from this list are TA09790 (up in taurus) and TA05560 (up in indicus). This notion gets more importance when we consider T. parva infection. We know that resistance to T. parva have been reported in endemic African breeds cattle breeds and we know that the TashAT family have orthologues in T. parva but Tpsh family in parva lack DNA binding domains, so this means that other DNA-binding parasite proteins (SVSPs?) could also be involved in differential gene expression observed in disease resistant and disease susceptible animals.

-I think in the discussion authors can add few sentences concerning future possibilities to study their findings in more details such as comparative ChIP-sep/ChIP-qPCR approaches using anti-TashAT2 antibodies in Ho and Sa derived Theileria transformed cell lines. They might have already tested this.

-Lines 277, 278, 279: 1X, 2X and 3X instead of X1, X2 and X3

-I think mentioning COVID-19 several times in the manuscript is not necessary.

-It’s better to mention gene IDs of TashATs where they are introduced in the text for the first time.

- In lines 227-246 it is not clear whether these analyses were done on host cell genes or on parasite genes. I can imagine it’s the host cell genes?

Reviewer #2: In this manuscript, the authors perform RNAseq of Theileria infected cell lines to investigate disease susceptibility of tropical theileriosis. The comparative transcriptome analysis between tolerant (B. indicus, Sahiwal) and susceptible (B. taurus, Holstein) to acute disease identified differential transcriptome signatures whose pathways linked to innate immunity, cholesterol biosynthesis, cancer, and parasite infection. Differentially expressed genes (DEGs) of this comparison partially overlapped previously reported DEGs induced by Theileria infection in the BL20 cell line. Although conclusive parasite factors that cause different disease susceptibilities were not newly identified, the authors found binding motifs of TashAT2 are divergent in the two breeds genomes. In summary, this manuscript is suitable for publication in the journal. However, I have a few requests for corrections before publication.

Minor comments

Some text is unreadable due to the low resolution, especially figure 3, 4, 7, could you improve it?

The sizes of Table 1, 2, 3 are large and I couldn’t read all parts. These tables require modification.

Correct “Table S8” to “Table S6” of the legend in S6 Table.

Line 61, 1182, 1187, 1196, 1248, 1270: Correct “Vs” to “vs”.

Line 672: Correct “susceptible breeds” to “susceptible” breeds.

Reviewer #3: Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognized by a pathogen DNA binding protein

Author: Shiels et al.

In this work, the authors aim to identify host factors associated with susceptibility to tropical theileriosis, a disease caused by Theileria annulata in livestock. To this end, the authors compare RNAseq data from T. annulata-infected cell lines from Bos taurus (a susceptible species/breed to T. annulata infection) with similar data from infected cell lines from Bos indicus (a less susceptible species/breed). If I understand correctly from the methods, both RNAseq datasets were mapped to the Bos taurus genome and gene expression was tehn quantified and compared.

The work, as is, has a few key issues which, until addressed, make it hard to determine the significance of the work. In particular:

1. Bos indicus and Bos taurus are referred to as different breeds but their names indicate different species; how divergent are these taxa and how does that translate into genomic and transcriptomic differences?

2. Following from (1), the authors do not account for inherent gene expression differences between species;

3. Following from (1), the authors do not account for differences in mapping efficiency between species to the B. taurus genome. If, in fact, both RNAseq datasets were mapped to the B. taurus genome, it is likely that the data from B. taurus infected cell lines have a mapping success rate to the B. taurus genome that is higher than the reads from B. indicus.

4. If the two species differ significantly in gene expression (#2) or if mapping success differs between species (#3), either of those can, by itself, explain the pattern in Fig 1, as well as many of the differences observed between species for individual genes.

Species-specific differences in gene expression may be addressed by subtracting gene expression between infected and uninfected cells of the same species.

The impact of genome-wide differences can be addressed by:

- Demonstrating that there is no species-specific impact on read mapping success;

- If genomic differences do impact read mapping rate, one possible option is to map RNAseq data to each species’ genome and then merge the two expression quantification datasets based on gene orthology. If this is what has been done, please explain.

A possibly simpler approach to the question may be to determine, for each host species, differential gene expression between uninfected and infected cell lines, and then compare the two sets of DEGs. The intersection will possibly be host response to infection, which the unique responses in each host may reflect differences in susceptibility.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Shahin Tajeri

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Jan 21;17(1):e0262051. doi: 10.1371/journal.pone.0262051.r002

Author response to Decision Letter 0


12 Nov 2021

PONE-D-21-24458

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

PLOS ONE

Response to Editor and reviewers

We address the points made by you (the editor) and reviewers below and highlight the position in the manuscript where revision has been made. Altered text in manuscript highlighted in yellow.

Editor - the PCA analysis and particularly its interpretation wasn't sufficiently explained. For example, there was some obvious outliers that were not mentioned and it was not stated whether the PCA analysis was the basis used to decrease to 5 from 6 the number of lines derived from B. taurus; meaning that you ended up comparing 6 lines derived from B. indicus to 5 lines from B. taurus and how this imbalance might have impacted on the results and your interpretation. So, I've marked the submission down for major revision to give you the opportunity to clarify better the strategies, techniques used and their consequences on the choices you made.

Response

As indicated by Reviewer 1, this is a misunderstanding of the data based on an error in Figure 1, for which we apologise. There were only 11, low passage archived Sahiwal and Holstein infected cell lines available at the Roslin Institute and so only 11 samples of RNA-seq data generated. The error occurred when generating a high-quality figure of PCA for publication, a double click on the mouse apparently, and the error was not noticed in the submitted figure. Thus, there was never removal of an outlier to improve data comparison. A new corrected Figure 1 has been generated. Regarding whether having an unequal number of samples impacted on the analysis - this is unlikely. All it would do is alter the potential level of significance by generating a smaller sample set for one condition vs the other. It is not necessary to have two sample sets of exactly the same size to make a comparison, and all the differences identified between the breed sample sets meet a recognized statistical cut-off.

Regarding outliers and within sample (breed) set variability (also raised by Reviewer 1). It is clear from the PCA that there is good separation between samples representing the breeds but that there is also variability of samples within breed. Looking at the expression values across samples (File S2) it is easy to see why this is detected by the PCA; for some genes there is quite a lot of variance within a breed. Also based on published data i.e. PCR results across Holstein and Sahiwal samples for cytokines and other genes tested, a high level of within breed variance has been detected, previously (see refs 13, 21, 42), so within breed variance was expected. I have added some discussion (line 688-704) on PCA result and how within breed variability might occur. We agree that it is relevant that the variance should be highlighted and propose that each individual animal may show a unique response to infection. Despite the variance, our study does show a considerable number of genes that are statistically significant between the breeds, and a large number where differences are consistent across all samples for each breed. Based on the known high degree of variability in gene expression between infected cell lines (see also ref 65), this result was a pleasant surprise.

Reviewers' comments:

Reviewer #1: The manuscript by Larcombe et al. studies an important topic in Theileria research. Identification of host/parasite associated mechanisms of cattle resistance/tolerance to tropical theileriosis can provide novel control strategies for theileriosis. Both resistant and susceptible animals get infected by Theileria sporozoites but the disease pathology is mild in resistant Sahiwals and Holsteins die of infection. Therefore, comparisons between these two groups of animals could provide new insights into Theileria annulata infection biology and disease.

The authors approach was to perform dual RNA-seq on low passages of Theileria annulata transformed cell lines established ex vivo from theileriosis susceptible (Bos taurus, Holstein(Ho)) and resistant (Bos indicus, Sahiwal (Sa)) cattle and looked at host and parasite genes differentially expressed between individuals of each group. Although the paper is descriptive and fully based on bioinformatics analyses, I found the findings on differential binding of TashAT2 to DNA in Holstein and Sahiwal interesting and the results could be useful for further development of projects on TashAT family as candidate host cell manipulator Theileria annulata genes.

Response: We thank Dr. Tajeri for his insightful comments on the manuscript.

Specific comments on the manuscript:

-In figure 1 some animal samples show great variance and does not cluster with others, I’m aware that these are from individual animals. It would be if the authors discuss the possible reasons behind this variance.

Moreover, in the same figure we can observe 12 points representing 12 samples (6 Ho versus 6 Sa), but in lines 157-158 (in fig1 legend as well) it is mentioned that 11 cell lines (5 cell lines of Holstein and 6 Sahiwal cattle) was studied for the PCA plot. Either there is an error in the figure or in the number of cell lines studied.

See response to editor above

-In figure 1 legend Taurus should be taurus. – changed to Holstein and Sahiwal

-Concerning differential expression of parasite genes between Sahiwal and Holstein, I was wondering if the authors have checked by western blot the protein levels of TashAT2 in nuclear extracts of Sa and Ho cell lines? Although TashAT2 mRNA levels was found not to be differentially expressed, the protein levels could be different (due to stabilization and etc.) More availability of TashAT2 in the host cell nucleus could result in occupation of more sites on DNA possibly leading to changes in gene expression.

Response: We did not follow up on levels of host nuclear TashAT2 initially, because it was not highlighted in the parasite RNA-seq data and previous work indicated protein levels follow RNA. Nevertheless, the point is relevant and of interest. To investigate this, we conducted immunofluorescence using an antiserum specific to TashAT2. The results demonstrate that a) all cell lines express TashAT2 and b) there is no clear difference in host nuclear expression level between the three Holstein and Sahiwal lines that were tested (same lines used as for TA9 blot). This data has been added to the manuscript: line 608 to 611 and File S13.

-In the list of DE parasite genes (S10_File) between Sa and Ho there are several other interesting candidates that are not discussed or mentioned in results section. For example, we know that some of these DE genes (such as SVSPs) have nuclear localization signals and are found to be trafficked to the nucleus and contribute to cell transformation. Specific examples from this list are TA09790 (up in taurus) and TA05560 (up in indicus). This notion gets more importance when we consider T. parva infection. We know that resistance to T. parva have been reported in endemic African breeds cattle breeds and we know that the TashAT family have orthologues in T. parva but Tpsh family in parva lack DNA binding domains, so this means that other DNA-binding parasite proteins (SVSPs?) could also be involved in differential gene expression observed in disease resistant and disease susceptible animals.

Response: The general result from our analysis is that there is not much evidence for large differences in parasite gene expression between the two breeds, and we analysed the best candidate (TA9) at the protein level. Re the two SVSPs: both show a less than 2-fold change (absolute). There are no reagents that have conclusively detected endogenous SVSPs by immunoblot, while the published IFAT (Schmuckli et al., Plos One 2009) only indicated a parasite location (nuclear localisation was by transfection into an uninfected cell line alone), there is no evidence SVSPs bind DNA and data suggests they may be highly diverse, rapidly degraded and are under no selective pressure (Weir et al., BMC genomics 2010). So personally, I do not think they are good candidates for follow up. Re the T. parva orthologues of TashAT family genes: It is unknown whether they lack DNA binding capability; from the genome sequences they lack a predicted canonical AT hook motif (RGRP core). However, several have predicted nuclear localisation sites with similarity to the T. annulata orthologue, and serum against TashAT2 detects the host nucleus in T. parva (Swan et al., Mol. Biol. Parasitol. 1999). The best current prediction is that T. parva orthologues go to the host nucleus and bind DNA but to a divergent sequence, probably linked to divergent evolution of the parasites in a different host cell lineage.

-I think in the discussion authors can add few sentences concerning future possibilities to study their findings in more details such as comparative ChIP-sep/ChIP-qPCR approaches using anti-TashAT2 antibodies in Ho and Sa derived Theileria transformed cell lines. They might have already tested this.

I have added a line on requirement for ChIp in future studies (line 854).

-Lines 277, 278, 279: 1X, 2X and 3X instead of X1, X2 and X3 – altered as requested.

-I think mentioning COVID-19 several times in the manuscript is not necessary.

Removed reference to Covid 19 at line 105, line 752 and 775.

-It’s better to mention gene IDs of TashATs where they are introduced in the text for the first time. Gene ID for TashAT2 given on line 42.

- In lines 227-246 it is not clear whether these analyses were done on host cell genes or on parasite genes. I can imagine it’s the host cell genes?

Response: Indeed, this was host cell genes – I have added infected host cell to the section title and bovine to the first sentence, for clarification. The rationale for this is that differences in what the infected cell secretes or transduces via receptors could have a major impact on differential interaction with the immune response.

Reviewer #2: In this manuscript, the authors perform RNAseq of Theileria infected cell lines to investigate disease susceptibility of tropical theileriosis. The comparative transcriptome analysis between tolerant (B. indicus, Sahiwal) and susceptible (B. taurus, Holstein) to acute disease identified differential transcriptome signatures whose pathways linked to innate immunity, cholesterol biosynthesis, cancer, and parasite infection. Differentially expressed genes (DEGs) of this comparison partially overlapped previously reported DEGs induced by Theileria infection in the BL20 cell line. Although conclusive parasite factors that cause different disease susceptibilities were not newly identified, the authors found binding motifs of TashAT2 are divergent in the two breeds genomes. In summary, this manuscript is suitable for publication in the journal. However, I have a few requests for corrections before publication.

Minor comments

Some text is unreadable due to the low resolution, especially figure 3, 4, 7, could you improve it?

Response: We hope that this is due to the build of the PDF on the web site, as we generated high quality images for each figure following journal guidelines. If this is not the case and figures need improved, we will do so prior to publication.

The sizes of Table 1, 2, 3 are large and I couldn’t read all parts. These tables require modification.

Response: We agree that Tables are large, and I have attempted to trim where possible. This however was minor, because if I had deleted further, the tables would not have matched the text and context of results would have been lost. Also, it might have looked like cherry picking most appropriate pathways. One option would be to move one or two Tables to supplementary (say Table 2), but I did not do this because it makes it harder for the reader to check what is said in the text. If required, Tables can be reduced with a change in text or moved to supplementary. Ditto if there is an issue with Tables being out with margins on manuscript – submission website indicated this was allowed.

Correct “Table S8” to “Table S6” of the legend in S6 Table.

Corrected

Line 61, 1182, 1187, 1196, 1248, 1270: Correct “Vs” to “vs”.

Corrected

Line 672: Correct “susceptible breeds” to “susceptible” breeds.

Corrected – now in edited section on PCA variability line 704 - 706

Reviewer #3: Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognized by a pathogen DNA binding protein

Author: Shiels et al.

In this work, the authors aim to identify host factors associated with susceptibility to tropical theileriosis, a disease caused by Theileria annulata in livestock. To this end, the authors compare RNAseq data from T. annulata-infected cell lines from Bos taurus (a susceptible species/breed to T. annulata infection) with similar data from infected cell lines from Bos indicus (a less susceptible species/breed). If I understand correctly from the methods, both RNAseq datasets were mapped to the Bos taurus genome and gene expression was tehn quantified and compared.

The work, as is, has a few key issues which, until addressed, make it hard to determine the significance of the work. In particular:

1. Bos indicus and Bos taurus are referred to as different breeds but their names indicate different species; how divergent are these taxa and how does that translate into genomic and transcriptomic differences?

2. Following from (1), the authors do not account for inherent gene expression differences between species;

3. Following from (1), the authors do not account for differences in mapping efficiency between species to the B. taurus genome. If, in fact, both RNAseq datasets were mapped to the B. taurus genome, it is likely that the data from B. taurus infected cell lines have a mapping success rate to the B. taurus genome that is higher than the reads from B. indicus.

4. If the two species differ significantly in gene expression (#2) or if mapping success differs between species (#3), either of those can, by itself, explain the pattern in Fig 1, as well as many of the differences observed between species for individual genes.

Species-specific differences in gene expression may be addressed by subtracting gene expression between infected and uninfected cells of the same species.

The impact of genome-wide differences can be addressed by:

- Demonstrating that there is no species-specific impact on read mapping success;

- If genomic differences do impact read mapping rate, one possible option is to map RNAseq data to each species’ genome and then merge the two expression quantification datasets based on gene orthology. If this is what has been done, please explain.

A possibly simpler approach to the question may be to determine, for each host species, differential gene expression between uninfected and infected cell lines, and then compare the two sets of DEGs. The intersection will possibly be host response to infection, which the unique responses in each host may reflect differences in susceptibility.

Response: This reviewer suggests that we should have mapped to each genome separately as the strategy of mapping RNA-seq reads from infected cells from both sub-species to Bos taurus may have resulted in a bias in detection of differential gene expression (i.e. difference in mapping success rate/higher for Bos taurus). We do not believe this has occurred or that it was necessary based on the following.

The common ancestor of indicine and taurine cattle is the auroch, an ancient species of wild cattle. Indicine and taurine cattle, being very closely related, can easily inter-breed; this ability is, in fact, the basis of a variety breed improvement to combat diseases such as theileriosis. The taxonomy of these cattle has been the subject of much debate; as they are so close, they have been classified as sister sub-species. The utility of mapping the RNA-seq data separately onto both the Bos indicus and Bos taurus genomes was considered but was rejected on the basis of the fact that the assembly and annotation of the Bos taurus reference genome is far superior. The quality of read mapping would not have been equivalent and had we undertaken this approach, we would have created two disparate datasets exhibiting inherent, unacceptable bias when combined. For this reason, it was decided to do a two-way comparison based on the better, quality reference genome, in the likelihood that there would be little differences in the read mapping of the two cattle types based on identity of mRNA coding regions, reflecting the approach of previous work in the field (ref 23).

The reviewer indicated that “The impact of genome-wide differences can be addressed by…demonstrating that there is no species-specific impact on read mapping success”.

Response: We performed this analysis, which confirms equitable mapping success and that there is no effect of cattle breed type. This analysis and data presented in the manuscript demonstrate that:

(a) The % reads mapped for each Holstein (Bos taurus) sample was equivalent to that for each Sahiwal (Bos indicus) sample and there is no evidence of any group-related difference. We thank the reviewer for this suggestion and add the information to the manuscript as Table S3.

(b) Nearly half the differentially expressed genes in the dataset display elevated expression in the Sahiwal group.

(c) Every gene that was selected for qRT-PCR including, those lower in Sahiwal group, reflected the RNA seq result, and confirmed that within-breed variability in expression (even across half sibs) was genuine. These results essentially repeat the above-mentioned microarray study where qPCR was used to validate differential hybridisation results. So, to date, qPCR has not highlighted any erroneous results in high-throughput gene expression analysis based on our approach.

(d) Comparison of the TashAT2 motif dataset across the two genomes with the subset of differentially expressed genes showed considerable overlap in pathways known to be altered by infection e.g. PI3K and ILK signalling and included genes expressed at a lower level in Sahiwal (e.g. SLIT, PRUNE, ITGs) and altered by infection. Taken together, our results support a lack of bias in the DE results and identification of infection-associated DE between the breeds linked to TashAT2 motifs.

Re the suggestion of generating a DE dataset for both Hosltein and Sahiwal lines, we agree that in the future this could/should be performed for multiple infected and uninfected lines/cells to further investigate within- and between-breed variability and so have stated this in the discussion: line 702.

2. Following from (1), the authors do not account for inherent gene expression differences between species;

Response: We present a significant subset of genes that are altered by infection in a Bos taurus cell line (Fig. 2), with a model that has been utilised to detect infection associated DE in most studies over the last 30 years. We highlighted that this was performed to identify DE linked to infection and distinguish these genes from those displaying possible inherent expression differences between the two breeds. Our data shows remarkable overlap between the top pathways enriched in both the full and the infection-associated DE datasets, many of which have been linked to infection/transformation of bovine leukocytes by the Theileria parasite. Moreover, we do not think that inherent differences between breeds should be totally discounted because, at the end of the day, it is the difference between the gene expression profile of the infected cell that will determine its phenotype/virulence, particularly genes with links to the immune response (e.g. inducible ISGs) and cancer highlighted in the paper. These could arise from a change in gene expression caused by infection of cells in one breed but not the other, DE linked to infection in both breeds, or a difference not altered by infection (inherently different). Thus, it is logical, reasonable and informative to directly compare infected cells from both types of cattle for all differentially expressed genes, in addition to those generated by parasite infection of the cell.

In summary, we believe finding overlaps between genes differentially expressed between the two breeds, infected and uninfected cells and bearing TashAT2 motifs will be of interest to PLOS One readers. We hope you agree that our revisions and response to reviewers now meet the requirements of the Journal.

Attachment

Submitted filename: response to reviewers.10.11.21_PONE.docx

Decision Letter 1

Gordon Langsley

2 Dec 2021

PONE-D-21-24458R1

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

PLOS ONE

Dear Brian,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

As you know there is no proof reading at PLoS One, and as referee #3 requested a limited number of small corrections to the text, I have marked your submission down for minor revision to give you the opportunity to make the minor text changes.

Please submit your revised manuscript by Jan 16 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Gordon

Gordon Langsley

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

 

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

********** 

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: The authors have responded all requested correction and the manuscript has been improved. I think this manuscript is acceptable now.

Reviewer #3: Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognized by a pathogen DNA binding protein

Author: Shiels et al.

This is a resubmission of a paper in which authors compare patterns of gene expression in ex vivo Theileria annulata-infected bovine cells lines from cattle breeds susceptible or tolerant to T. annulata infection to identify host and parasite factors that impact the susceptibility phenotype. The authors found plausible links between differences in tolerance and interferon-stimulated gene pathways, production of INFB1, inflammatory responses and oncogenesis. Critically, they identify a motif bound by a parasite gene that is differently distributed in the genome of both breeds. The authors find that genes with differences between breeds in the presence/absence/number of these motifs were enriched for functions potentially connected to infection.

This is a well-written paper with several interesting findings, leading to new hypotheses related to interactions between host and parasite, as well as infection susceptibility.

This resubmission addresses my previous concerns adequately.

Some minor issues:

- Pls add a first line to all supplemental word and xls tables with the title of the table

- Lack of consistency throughout text in use of versus, vs, vs. (best to use an italicized form, as vs.)

- L136, word missing in “offer the best available for“

- L191: change “mapped to the bovine” to “mapped to the Bos taurus”

- L319: change “number of reads” to “proportion of reads” (also, see typo in table S3)

- L629-630” “Subsequent PANTHER analysis showed”. Which genes were fed to this analysis? All 11,983 from B. taurus? Only the 7,575 shared? Those not shared? ...? The interpretation of the enrichment analysis will be affected.

- Fig 7C. I suggest using in Panel C different colors from those in legend in panel B

- L707-708: “This result may indicate that a number of infection-associated genes differentially expressed between breeds have been missed by our analysis “ I am not sure I follow. Couldn’t there be constitutively expressed genes, differentially expressed between breeds, regardless of infection status?

********** 

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: Shiels-etal.resubmission.docx

PLoS One. 2022 Jan 21;17(1):e0262051. doi: 10.1371/journal.pone.0262051.r004

Author response to Decision Letter 1


10 Dec 2021

PONE-D-21-24458R1

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

PLOS ONE

Response to reviewers,

Thanks to the reviewers for making the effort to look at our manuscript for a second time. We were pleased that the three reviewers indicated that we had adequately answered all previous concerns.

In response to minor issues raised by Reviewer 3, we have made the following amendments/corrections

Reviewer 3.

This resubmission addresses my previous concerns adequately.

Some minor issues:

- Pls add a first line to all supplemental word and xls tables with the title of the table

- Response – edited as requested

- Lack of consistency throughout text in use of versus, vs, vs. (best to use an italicized form, as vs.)

- Response - corrected to vs. throughout

- L136, word missing in “offer the best available for“

- Response – edited to “offer the best infected cell lines available….”

- L191: change “mapped to the bovine” to “mapped to the Bos taurus”

- Response – corrected to “mapped to the B. taurus

- L319: change “number of reads” to “proportion of reads” (also, see typo in table S3)

- Response – change as requested

- L629-630” “Subsequent PANTHER analysis showed”. Which genes were fed to this analysis? All 11,983 from B. taurus? Only the 7,575 shared? Those not shared? ...? The interpretation of the enrichment analysis will be affected.

- Response – edited to “Subsequent PANTHER analysis of the shared genes data set showed an enrichment”

- Fig 7C. I suggest using in Panel C different colors from those in legend in panel B

- Response - colours in panel C changed to Yellow and Grey

- L707-708: “This result may indicate that a number of infection-associated genes differentially expressed between breeds have been missed by our analysis “ I am not sure I follow. Couldn’t there be constitutively expressed genes, differentially expressed between breeds, regardless of infection status?

- We agree the logic of the text preceding this statement (i.e. that the pathways identified for both data sets are more or less the same) could be confusing, and the reviewer is correct that non-infection associated genes could be/are differentially expressed between breeds. Nevertheless, published data indicates we have missed candidates, so the text has been edited with “This result may indicate” deleted. The text now reads – “However, previous studies indicate that a number of infection-associated genes differentially expressed between breeds have been missed by our analysis, and there are at least several candidates.”

Attachment

Submitted filename: Shiels Response to reviewers_2.docx

Decision Letter 2

Gordon Langsley

16 Dec 2021

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

PONE-D-21-24458R2

Dear Brian,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Gordon

Gordon Langsley

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Gordon Langsley

20 Dec 2021

PONE-D-21-24458R2

Susceptibility to disease (tropical theileriosis) is associated with differential expression of host genes that possess motifs recognised by a pathogen DNA binding protein

Dear Dr. Shiels:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Gordon Langsley

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. IPA canonical pathway, “FAT10 cancer signalling”.

    The nodes represent genes/molecules/complexes in a pathway, and the lines and arrows between nodes indicate known relationships from the Ingenuity Knowledge Base. Nodes with purple outline indicate molecules that were measured as differentially expressed in our dataset, with the intensity of coloured infill indicating the level of up–(red) or down-(green) regulation of Holstein relative to Sahiwal. The blue and orange coloured molecules and lines are predicted activation states generated by the Molecular Activity Predictor function in Ingenuity Pathway Analysis. Blue colour indicates a predicted inhibition, and orange a predicted activation state in Holstein relative to Sahiwal. Broad lines with explanatory text beside the pathway indicate the cellular location of molecules in the pathway. The molecules in the pathway are given shapes that indicate their functional class (Nested Circle/Square = Group/Complex, Horizontal ellipse = Transcriptional Regulator, Vertical Ellipse = transmembrane receptor, Vertical Rhombus = enzyme, Square = Cytokine/Growth Factor, Triangle = Kinase, Vertical Ellipse = Transmembrane Receptor, Circle = other). The edges between molecules are also differentiated to indicate the type of relationship between them. Solid lines are direct relationships and dashed lines are indirect.

    (TIF)

    S2 Fig. Immunoblot carried out with protein extracts prepared from 3 independent Sahiwal lines compared to 3 Holstein lines.

    Sahiwal samples are denoted SA, SB, SC; Holstein denoted HA, HB, HC. Protein size markers are indicated on the right (kDa). A. Extracts probed with Rat anti-Ta9 (TA15705) at 1/1200 dilution. Ta9 was detected at a variable size of 43-46kDa, which is a similar to that described previously for the polymorphic Ta9 antigen [17,43]. B. As a control, the same blot was reprobed with Rabbit antiserum raised against constitutively expressed ER HSP90 (TA06470) at 1/1500 dilution. The Ta9 and HSP90 reactive proteins are denoted by arrow.

    (TIF)

    S3 Fig. Immunofluorescence assay carried out on cells prepared from 3 independent Sahiwal lines compared to 3 Holstein lines.

    Sahiwal samples are denoted SA, SB, SC; Holstein denoted HA, HB, HC. Cells were reacted with antiserum specific for TashAT2 (EL24) and images obtained using matched exposures. Bar = 7 μm.

    (TIFF)

    S1 Table. Details of oligonucleotide primers used in qRT-PCR.

    F and R denote forward and reverse primers respectively. *Primers designed based on Ensembl transcript.

    (DOCX)

    S2 Table. Number of RNA-seq reads mapped on to B. taurus genome are not statistically different.

    Rows denote RNA-seq sample set derived from 6 Sahiwal (S1-6) or 5 Holstein (H1-5) infected cell lines. Columns denote the % of reads mapped to the B. taurus genome. Summary denotes mean % of reads mapped, standard deviation and no significant difference between means.

    (DOCX)

    S3 Table. Top upstream regulators with predicted activated or inhibited activation states in infection-associated H/S-DE genes.

    In total 655 upstream regulators were found with a p-value of < 0.05; including 123 with a significantly predicted activation state (activated or inhibited, based on a z-score >2, or <-2). Only the top 20 activated and top 20 inhibited regulators based on z-score are presented in the table.

    (DOCX)

    S1 File. Excel spread sheet of 2211 H/S-DE genes (<0.1 FDR).

    Columns designate: Ensemble Gene ID; base mean of RNA-seq counts across all samples; log2 fold change between mean of Holstein vs. Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); p value (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; DESeq2 normalized RNA-seq counts for each of the six Sahwal samples (Sah 1–6) and five Holstein (Hol 1–6); the gene name; an alternative gene name, when applicable; whether RNA-seq counts were consistent across all Sahiwal samples vs. Holstein; Interferon associated genes, as identified in the study of Liu et al. [50] or by GO [37]; description of protein/factor encoded by the gene. Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

    (XLSX)

    S2 File. Excel spread sheet of the 517 genes in H/S-DE that overlap with genes designated as infection associated (modulated in TBL20 vs. BL20).

    Columns designate: Gene id; DESeq normalized base mean of counts in A (Sahiwal) samples; base mean counts in B (Holstein) samples; log2 fold change between mean of Holstein vs. Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene.

    (XLSX)

    S3 File. Excel spread sheet of the 100 most significant canonical pathways designated as enriched by IPA for H/S-DE data set.

    Columns designate: The enriched canonical pathway; the–log p value; ratio of the number of genes from the list that maps to the pathway divided by the total number of genes that map to pathway; the (activation) Z-score; the gene symbol for molecules in H/S-DE data set present in the pathway. Blue highlight denotes pathways with a Z score>2, indicating repression in Holstein vs. Sahiwal infected cells; red highlight indicates activation (Z-score >1.5).

    (XLSX)

    S4 File. Excel spread sheet of the 100 most significant canonical pathways enriched by IPA from the data set of 517 overlapped infection-associated H/S-DE genes.

    Columns designate: The enriched canonical pathway; the–log p value; the ratio of the number of genes from the list that maps to the pathway divided by the total number of genes that map to pathway; the (activation) Z-score; the gene symbol for molecules in H/S DE data set present in the pathway. Blue highlight denotes pathways with an activation score (Z score>2) indicating repression in Holstein vs. Sahiwal infected cells.

    (XLSX)

    S5 File. Excel spread sheet of genes in H/S-DE data set predicted to encode secreted proteins.

    Columns designate: Ensemble Gene ID; DESeq normalized base mean of counts in all samples; log2 fold change between mean of Holstein vs.Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene; prediction of location based on gene ontology data in Genecards [35]; Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

    (XLSX)

    S6 File. Excel spread sheet of genes in H/S-DE data set predicted to encode receptors.

    Columns designate: Ensemble Gene ID; Deseq2 normalized base mean for RNA-seq counts for all Sahiwal and Holstein samples; the log2 fold change between Holstein and Sahiwal samples for that gene; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; the gene name; description of protein/factor encoded by the gene; Yellow highlight designate genes also modulated in Theileria infected (TBL20) cells.

    (XLSX)

    S7 File. Excel spread sheet of 109 parasite (T. annulata) genes (<0.1 FDR).

    Columns designate: TA number (T. annulata gene ID); DESeq normalized base mean of RNA-seq counts across all samples; designation of whether expression is higher (up) or lower (down) in Holstein (B. taurus) samples relative to Sahiwal (B. indicus); DESeq normalized base mean of counts in A (Sahiwal) samples; base mean counts in B (Holstein) samples; log2 fold change between mean of Holstein vs.Sahiwal sample counts; lfcSE (Standard Error of the log fold change); stat (Wald statistic); pvalue (unadjusted p value for the Wald test); padj (Benjamini-Hochberg adjusted p-value for significance of the Wald test; description of protein/factor predicted to be encoded by the gene; location of gene on T. annulata genome; genomic sequence ID.

    (XLS)

    S8 File. Excel spread sheet of nucleotide motifs bound by TashAT2 in genome of B. taurus and B. indicus.

    Columns designate chromosome; motif start position; motif end position; genomic feature linked to motif position (non-coding, intron, exon, untranslated region (UTR); gene product associated with motif and genomic feature, if applicable.

    (XLSX)

    S9 File. Excel spread sheet of nucleotide motifs bound by TashAT2 in gene coding (RNA) regions of B. taurus and B. indicus genomes.

    The data sheets list motifs identified in B. taurus genes, motifs identified in B. indicus genes and motifs in genes shared by B. taurus and B. indicus. Columns indicate: Gene identifier, number of motifs identified in that gene region; and for shared genes: B. number of motifs identified in B. taurus gene; C. number of motifs identified in B. indicus gene.

    (XLSX)

    S10 File. List of genes in integrin signalling pathway that bear different numbers of nucleotide motif bound by TashAT2 in B. taurus and B. indicus genomes.

    Green highlight designates genes that display differential expression between Sahiwal and Holstein infected cells.

    (DOCX)

    Attachment

    Submitted filename: response to reviewers.10.11.21_PONE.docx

    Attachment

    Submitted filename: Shiels-etal.resubmission.docx

    Attachment

    Submitted filename: Shiels Response to reviewers_2.docx

    Data Availability Statement

    RNA-seq data are publicly available at the NCBI GEO online repository under accession numbers GSM4824553–GSM4824563.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES