Abstract
Background
Despite decades of extensive research, bovine respiratory disease (BRD) remains the most devastating disease in beef cattle production. Establishing a clinical diagnosis often relies upon visual detection of non-specific signs, leading to low diagnostic accuracy. Thus, post-weaned beef cattle are often metaphylactically administered antimicrobials at facility arrival, which poses concerns regarding antimicrobial stewardship and resistance. Additionally, there is a lack of high-quality research that addresses the gene-by-environment interactions that underlie why some cattle that develop BRD die while others survive. Therefore, it is necessary to decipher the underlying host genomic factors associated with BRD mortality versus survival to help determine BRD risk and severity. Using transcriptomic analysis of at-arrival whole blood samples from cattle that died of BRD, as compared to those that developed signs of BRD but lived (n = 3 DEAD, n = 3 ALIVE), we identified differentially expressed genes (DEGs) and associated pathways in cattle that died of BRD. Additionally, we evaluated unmapped reads, which are often overlooked within transcriptomic experiments.
Results
69 DEGs (FDR<0.10) were identified between ALIVE and DEAD cohorts. Several DEGs possess immunological and proinflammatory function and associations with TLR4 and IL6. Biological processes, pathways, and disease phenotype associations related to type-I interferon production and antiviral defense were enriched in DEAD cattle at arrival. Unmapped reads aligned primarily to various ungulate assemblies, but failed to align to viral assemblies.
Conclusion
This study further revealed increased proinflammatory immunological mechanisms in cattle that develop BRD. DEGs upregulated in DEAD cattle were predominantly involved in innate immune pathways typically associated with antiviral defense, although no viral genes were identified within unmapped reads. Our findings provide genomic targets for further analysis in cattle at highest risk of BRD, suggesting that mechanisms related to type I interferons and antiviral defense may be indicative of viral respiratory disease at arrival and contribute to eventual BRD mortality.
Introduction
Although extensively researched, bovine respiratory disease (BRD) continues to be the most significant disease in post-weaned beef cattle in North America. BRD is a multifactorial disease complex, with contributing causative factors including primary viral infection, bacterial colonization of the upper and lower respiratory tract, and stressful events related to abrupt weaning, co-mingling with recently transported cattle, and novel feeding or housing environments [1–5]. These factors result in host-pathogen interactions that are exceedingly complex and definitive diagnosis of the inciting etiological agent(s) is not usually made. BRD diagnosis will typically rely on non-specific clinical signs including elevated rectal temperature, depressed demeanor, increased respiratory rate and effort, and anorexia [5–8]. However, this clinically based diagnosis has been shown to lack sensitivity and specificity [9, 10]. Therefore, post-weaned beef cattle at high risk of developing BRD are often mass medicated with antimicrobials at facility arrival (i.e. antimicrobial metaphylaxis) [11–13]. With growing public concerns regarding the relationship between the use of metaphylaxis in beef cattle and antimicrobial resistance, there is a need to recognize cattle at increased risk of developing BRD in order to implement more targeted therapeutic regimens.
In order to identify new methods of accurate BRD diagnosis, our previous research contrasted the whole blood transcriptomes of cattle that naturally acquired or resisted BRD [14]. Specifically, we identified upregulation of inflammatory-mitigating molecules and pathways at arrival in cattle that failed to develop naturally occurring clinical BRD. This prior research did not examine differentially expressed genes or pathways that segregate with disease severity. Therefore, we hypothesize that whole blood transcriptome profiles of cattle at arrival can identify biological functions that influence BRD severity; specifically, functions which distinguish cattle that are likely to die versus cattle that survive.
In the present study, we analyzed the at-arrival whole blood transcriptomes of post-weaned beef cattle that developed BRD. Specifically, we compared at-arrival whole blood transcriptomes from cattle that naturally acquired BRD within the first 28 days following arrival, stratifying cattle into severity groups defined by BRD-associated mortality. Our objectives were to uncover differentially expressed genes and associated processes and pathways that segregate with cattle at highest risk of BRD-associated mortality and to determine if reads from diseased cattle align to pathogens that promote inflammatory processes. Given the lack of sensitivity in clinical screening for BRD and the need to improve understanding of gene-by-environment interactions involved in BRD, the identification of gene products whose expression correlates with the risk of BRD-associated mortality would advance management and diagnostic strategies for improving the outcome of post-weaned beef cattle in high-risk situations.
Results
Differential gene expression analysis
Alignment of reads from the six biological replicates to the ARS-UCD1.2 bovine reference assembly identified 32,976 unique genes. Following filtering for low expression, 15,755 genes were used for differential expression analysis between DEAD and ALIVE groups. Multidimensional scaling (MDS; Fig 1), which depicts the similarity of expression profiles between each animal in the analysis, demonstrated clustering of the three DEAD cattle (red; IDs 33, 52, 76). This indicates that the expression patterns of DEAD cattle are highly similar and are distinguishable from the cattle within the ALIVE group (blue; IDs 51, 75, 85). In contrast, the gene expression patterns of the three animals within the ALIVE group are clearly more dissimilar than the expression patterns of cattle in the DEAD group.
The three animals within the DEAD cohort are highly similar in gene expression and more distinct than the three animals within the ALIVE cohort, with leading fold-change of approximately 2-fold between the furthest points within the DEAD cohort. One animal within the ALIVE cohort (S_51) is the most dissimilar animal in terms of gene expression. Points represent each sample and their transformed Euclidean distance in two dimensions, discerned as leading log2-fold change between the pairwise distances of the top 500 genes that best differentiate each animal.
A total of 69 genes were differentially expressed (FDR < 0.10) between DEAD and ALIVE groups; 37 genes were upregulated and 32 downregulated in DEAD cattle compared to ALIVE cattle. A heatmap was generated from these 69 DEGs using z-scores calculated from Trimmed Mean of M-values (TMM) normalized counts (Fig 2). The resulting hierarchical clustering of DEG expression patterns for each individual segregates the six individuals into two groups according to their respective ALIVE and DEAD status and also provided a dendrogram of expression similarities. The complete list of DEGs with accompanying statistics are provided in S1 File.
Heatmap depicting gene expression directionality and hierarchical clustering of DEGs in each sample. Red or blue color intensities, respectively, correspond to increasing or decreasing gene expression. Dendrograms in the rows identify gene expression clusters. Note that clustering of samples (columns) based on gene expression similarity segregates the samples into ALIVE and DEAD cohorts.
Gene ontology, pathway, and disease phenotype enrichment
Gene ontology (GO) term enrichment of DEGs identified 34 significantly overrepresented biological processes (FDR < 0.05; Table 1). The top biological processes were primarily related to type I interferon signaling and response, viral defense mechanisms, and innate immune regulation involving cytokine signaling. These biological processes are chiefly composed of genes with higher expression in DEAD cattle, particularly those of the IFIT, ISG, HERC, and OAS mRNA families. Utilizing Reactome [15, 16], six pathways were identified as significantly overrepresented (FDR < 0.05; Table 2). These identified biological pathways are involved primarily in type I interferon signaling and antiviral mechanisms, represented predominantly by higher IFIT, ISG, HERC, BST, and OAS mRNA family expression. Similar to the biological processes, these pathways are largely represented by genes higher in expression in DEAD cattle. The predominant disease phenotypes identified by GLAD4U [17] consisted of viral-induced diseases, which were heavily influenced by certain genes increased in expression in DEAD cattle, including BST2, HERC5, IFIT1, ISG15, MX2, and OAS2 (Table 3). In summary, these analyses of biological processes, pathways, and disease phenotypes represented by DEG information indicate that cattle within the DEAD cohort have increased expression of genes involved in type I interferon production and viral-associated responses at arrival.
Table 1. Top enriched GO-BP terms (FDR < 0.05).
Gene Set | Description | Size | P Value | FDR |
---|---|---|---|---|
GO:0060337 | Type I interferon signaling pathway | 84 | 0 | 0 |
GO:0071357 | Cellular response to type I interferon | 84 | 0 | 0 |
GO:0034340 | Response to type I interferon | 89 | 1.11E-16 | 4.81E-13 |
GO:0051607 | Defense response to virus | 235 | 2.70E-12 | 8.76E-09 |
GO:0009615 | Response to virus | 319 | 5.55E-11 | 1.44E-07 |
GO:0098542 | Defense response to other organism | 473 | 1.19E-10 | 2.58E-07 |
GO:0019221 | Cytokine-mediated signaling pathway | 705 | 4.88E-10 | 9.06E-07 |
GO:0045087 | Innate immune response | 827 | 3.00E-09 | 4.88E-06 |
GO:0043207 | Response to external biotic stimulus | 899 | 7.70E-09 | 1.00E-05 |
GO:0051707 | Response to other organism | 897 | 7.51E-09 | 1.00E-05 |
GO:0002252 | Immune effector process | 1141 | 9.64E-09 | 1.14E-05 |
GO:0009607 | Response to biotic stimulus | 926 | 1.07E-08 | 1.16E-05 |
GO:0045071 | Negative regulation of viral genome replication | 50 | 1.85E-08 | 1.85E-05 |
GO:0035455 | Response to interferon-alpha | 20 | 3.03E-08 | 2.63E-05 |
GO:0071345 | Cellular response to cytokine stimulus | 1015 | 2.99E-08 | 2.63E-05 |
GO:0034097 | Response to cytokine | 1100 | 7.29E-08 | 5.92E-05 |
GO:1903901 | Negative regulation of viral life cycle | 74 | 1.37E-07 | 1.04E-04 |
GO:0006952 | Defense response | 1518 | 2.86E-07 | 2.07E-04 |
GO:0045069 | Regulation of viral genome replication | 87 | 3.09E-07 | 2.11E-04 |
GO:0048525 | Negative regulation of viral process | 88 | 3.27E-07 | 2.12E-04 |
GO:0006955 | Immune response | 1919 | 5.73E-07 | 3.55E-04 |
GO:0019079 | Viral genome replication | 114 | 1.19E-06 | 7.02E-04 |
GO:1903900 | Regulation of viral life cycle | 135 | 2.74E-06 | 1.55E-03 |
GO:0043901 | Negative regulation of multi-organism process | 165 | 7.32E-06 | 3.96E-03 |
GO:0002376 | Immune system process | 2778 | 8.67E-06 | 4.35E-03 |
GO:0050792 | Regulation of viral process | 171 | 8.71E-06 | 4.35E-03 |
GO:0043903 | Regulation of symbiosis, encompassing mutualism through parasitism | 198 | 1.77E-05 | 8.52E-03 |
GO:0035456 | Response to interferon-beta | 33 | 2.24E-05 | 1.04E-02 |
GO:0043900 | Regulation of multi-organism process | 367 | 2.74E-05 | 1.23E-02 |
GO:0009605 | Response to external stimulus | 2282 | 2.87E-05 | 1.25E-02 |
GO:0002697 | Regulation of immune effector process | 381 | 3.38E-05 | 1.41E-02 |
GO:0032020 | ISG15-protein conjugation | 6 | 4.07E-05 | 1.65E-02 |
GO:0019058 | Viral life cycle | 285 | 1.01E-04 | 3.96E-02 |
GO:0035457 | Cellular response to interferon-alpha | 10 | 1.21E-04 | 4.64E-02 |
Table 2. Top enriched signaling pathways (FDR < 0.05).
Gene Set | Description | Size | P Value | FDR |
---|---|---|---|---|
R-HSA-909733 | Interferon alpha/beta signaling | 69 | 0 | 0 |
R-HSA-913531 | Interferon Signaling | 197 | 4.89E-15 | 4.87E-12 |
R-HSA-1280215 | Cytokine Signaling in Immune system | 688 | 1.77E-10 | 1.18E-07 |
R-HSA-168256 | Immune System | 1997 | 2.44E-08 | 1.22E-05 |
R-HSA-1169410 | Antiviral mechanism by IFN-stimulated genes | 78 | 2.08E-07 | 8.28E-05 |
R-HSA-1169408 | ISG15 antiviral mechanism | 71 | 6.75E-06 | 2.24E-03 |
Table 3. Top enriched disease phenotypes (FDR < 0.05).
Gene Set | Description | Size | P Value | FDR | Genes |
---|---|---|---|---|---|
PA444621 | Influenza, Human | 144 | 2.07E-11 | 4.56E-08 | BST2, HERC5, IFIT1, ISG15, ISG20, MX2, RSAD2, TIMD4 |
PA444614 | Infection | 643 | 3.08E-11 | 4.56E-08 | APOBEC3A, BST2, HERC5, IFIT1, IFIT2, IFIT3, ISG15, ISG20, MX2, OAS2, RSAD2, TIMD4 |
PA446038 | Virus Diseases | 580 | 2.15E-10 | 2.12E-07 | APOBEC3A, BST2, HERC5, IFIT1, IFIT2, IFIT3, ISG15, ISG20, MX2, OAS2, RSAD2 |
PA444435 | Hepatitis | 253 | 1.85E-09 | 1.37E-06 | APOBEC3A, BST2, IFIT1, ISG15, ISG20, OAS2, RSAD2, TIMD4 |
PA166170066 | Virological response | 282 | 4.34E-09 | 2.57E-06 | BST2, IFIT1, IFIT2, IFIT3, ISG15, ISG20, MX2, RSAD2 |
PA447230 | HIV | 862 | 1.37E-08 | 6.73E-06 | APOBEC3A, BST2, HERC5, HNRNPA2B1, IFIT2, IFIT3, ISG15, ISG20, MX2, OAS2, RSAD2 |
PA445746 | Stomatitis | 126 | 2.20E-08 | 9.31E-06 | BST2, IFIT1, IFIT2, IFIT3, MX2, RSAD2 |
PA444020 | Encephalitis, Tick-Borne | 26 | 5.00E-08 | 1.85E-05 | IFIT1, IFIT2, OAS2, RSAD2 |
PA444014 | Encephalitis | 89 | 7.68E-06 | 2.31E-03 | IFIT2, MX2, OAS2, RSAD2 |
PA444445 | Hepatitis C | 195 | 7.82E-06 | 2.31E-03 | BST2, IFIT1, ISG15, OAS2, RSAD2 |
PA445546 | Retroviridae Infections | 494 | 6.10E-05 | 1.29E-02 | APOBEC3A, BST2, HERC5, ISG15, MX2, RSAD2 |
PA446213 | HIV Infections | 495 | 6.17E-05 | 1.29E-02 | APOBEC3A, BST2, HERC5, ISG15, MX2, RSAD2 |
PA445640 | Sexually Transmitted Diseases | 496 | 6.24E-05 | 1.29E-02 | APOBEC3A, BST2, HERC5, ISG15, MX2, RSAD2 |
PA446295 | Lentivirus Infections | 498 | 6.38E-05 | 1.29E-02 | APOBEC3A, BST2, HERC5, ISG15, MX2, RSAD2 |
PA444601 | Immunologic Deficiency Syndromes | 500 | 6.52E-05 | 1.29E-02 | APOBEC3A, BST2, HERC5, ISG15, MX2, RSAD2 |
PA446768 | Encephalitis, Viral | 66 | 1.14E-04 | 2.12E-02 | IFIT2, OAS2, RSAD2 |
PA443855 | Dengue | 76 | 1.74E-04 | 3.03E-02 | IFIT3, OAS2, RSAD2 |
Protein-protein interactions and co-expression of DEGs
The 69 DEGs identified between DEAD and ALIVE were used to predict protein-protein interactions and gene product co-expression in STRING v11.0 (Fig 3) [18]. This analysis identified interactions consisting of 16 DEGs (nodes) connected by 57 interactions (edges), in which 13 of 16 DEGs were increased in expression in DEAD (Fig 3A). A strong pattern of co-expression between those 16 gene products was also identified as depicted in Fig 3B. The highest co-expression scores (> 0.75) were identified for IFIT1-3/5, SPARC, COL1A1, RSAD2, ISG15, HERC5/6, OAS2, and MX2. In summary, 12 DEGs that were increased at arrival in cattle who died from BRD are known to code for proteins that possess strong interactions and co-expression. Additionally, two genes increased in expression in ALIVE (COL1A1, SPARC) demonstrated strong predicted co-expression and interaction. All co-expression interactions and associated scores may be found in S2 File.
De novo assembly and analysis of unmapped reads
From 6,968,239 unmapped reads contributed from all 6 samples, 6,953,629 survived quality trimming (99.79%) and were used to assemble a de novo transcriptome. The resulting 65,516 constructed contigs (S3 File) were analyzed against the NCBI non-redundant nucleotide (nt) database. Over 90% of the assembled contigs mapped to the Bovidae family, followed by alignments to various mammalian, bacterial (many part of the bovine microbiota), parasitic (Onchocerca ochengi), and fungal species (Basidiomycota) (Figs 4 and 5). Notably, the de novo assembly failed to map to any viral contigs within the NCBI nt database. Homology to viral DNA was not identified.
Alignment of unmapped reads to viral genome sequences
Trimmed unmapped reads from each calf were analyzed in two parts: 1) against all known virus sequences and 2) against all known bovine viral pathogen sequences. Top hits from alignments against all viral assemblies demonstrated a sparse number of reads across all 6 animals (231–535 reads; total: 2,257, average: 376.2; S4 File). The majority of reads aligned to non-mammalian viruses, namely the Choristoneura fumiferana granulovirus and Diolcogaster facetosa bracovirus, in addition to BeAn 58058 virus. However, independent analysis of these reads to the NCBI nt database indicated that these reads aligned to well conserved mammalian genes, such as U6 splicesomal ncRNA. Due to the homologous nature of these reads to mammalian genes, any hits were considered irrelevant. Using the top hits from the alignments against known bovine viruses, all 6 animals possessed a relatively small number of reads (5082–7330 reads; total: 35,432, average: 5905.3; S5 File) that only aligned to BVDV1. These reads were extracted and realigned to both the NCBI nt database and the complete BVDV1 genome (NC_001461.1). When realigned to the NCBI nt database, the extracted reads aligned to the BVDV1 sequences U86599.1 (Pestivirus type 1 cytopathic genomic RNA, complete genome) and L13783.1 (Bovine viral diarrhea virus p125 protein gene, partial CDS). However, the top hits were consistently to ubiquitin C (UBC) mRNA sequences within Bovidae assemblies. When realigned to the NC_001461.1 complete BVDV1 genome, no alignment hits were detected.
Discussion
This study builds upon our previous analysis of transcriptomes at arrival that were derived from post-weaned beef cattle that ultimately developed BRD versus cattle that remained healthy [14]. The present investigation was conducted with the intent to identify potential at-arrival biomarkers and pathways that indicate risk of BRD-associated mortality in post-weaned beef cattle. Our overarching goal with these studies is to identify gene expression profiles and biological pathways in blood samples at arrival that segregate with later BRD morbidity or mortality. By analyzing the transcriptomes of post-weaned cattle before they exhibit clinical signs of BRD, our approach will also improve understanding of the mechanistic basis of both susceptibility and resistance to BRD in this cohort. Previous research to determine early antemortem indications of BRD and risk of severity has yielded varied results [19–22]. At present, the diagnosis and classification of BRD is primarily assessed using clinical factors that have proven to be imprecise, effectively limiting BRD management [6, 10, 23]. Nonetheless, cattle diagnosed with BRD in this investigation using these same clinical factors (including DART scoring, treatment records, and average daily weight gain) exhibited differences in gene product and molecular pathway expression at arrival that ultimately segregated with their BRD-associated mortality [14].
In DEAD cattle, the expression of genes related to type I interferon production/signaling and viral defense were increased. These viral defense genes included IFIT1/2/3, IRF4, HERC5/6, OAS2, MX2, and ISG15/20. Several investigations have similarly identified these genes as relevant in cattle with prolonged inflammation and ongoing viral infection [24–27]. These findings, when considered with the increased expression of viral defense genes at arrival in cattle that ultimately died of BRD, suggest that cattle in the DEAD cohort were combating a viral agent at arrival. Though cattle that died of BRD in this investigation did not show clinical evidence of BRD at arrival, viral BRD is often subclinical and may initially present as an upper airway disease. Subclinical viral respiratory infection at arrival would not only account for the observed viral defense pathways in the cattle that died of BRD, but would facilitate secondary infectious processes in the lung that contribute to the observed BRD mortality [5, 25, 28, 29]. One challenge with our study is that differences in gene expression were characterized in peripheral blood. It has been suggested that the blood transcriptome represents an amalgamation of gene expression patterns and pathways in distinct physiological sites, such as airway epithelium, lymph nodes, and splenic tissue [30, 31]. Necropsy demonstrated that disease was limited to the lung in cattle that died. Overt disease was not evident at arrival, but subclinical disease cannot be ruled out. As these DEGs may indicate viral-induced disease at arrival, focused investigation of these genes is warranted for larger future studies. The ability to delineate underlying subclinical viral disease mechanisms in beef cattle at arrival could allow for the development precision management techniques to more effectively and specifically treat or prevent disease, leading to more precise antimicrobial usage and decreased dissemination of antimicrobial resistance.
In this study, we identified increased expression of gene products that interact with toll-like receptor 4 (TLR4) and interleukin 6 (IL6) in both live and dead cohorts. While there was no difference in the expression of these specific genes between DEAD vs ALIVE cattle at arrival, we have previously described increased BGN, MARCO and POMC expression (known to be involved in TLR4-dependent pro-inflammatory pathways) in arrival blood transcriptomes from cattle that ultimately developed BRD when compared to cattle that remained clinically healthy [14]. IL6 and several other type I interferon-associated genes have been reported to be differentially expressed within lymph node samples of virus-challenged cattle [25, 26]. TLR4 possesses high avidity for lipopolysaccharide (LPS) and some viral structural proteins, and is capable of inducing type I interferons production and increased levels of IL6 [32–35]. Additionally, elevated levels of IL6 may reciprocatively induce type I interferon production, enhancing natural killer cell cytotoxic activity, M1 macrophage maturation, and interleukin 12 (IL12) production [36–39]. It is important to note that IL6 and TLR4 are not differentially expressed between DEAD and ALIVE groups but are predicted to be active based on associations with DEG products increased within each cohort. It is possible that TLR4 and IL6, relatively non-specific markers of inflammation, are initiated in both DEAD and ALIVE groups albeit through differing mechanisms. Several studies have demonstrated that TLR4 expression is increased in active respiratory disease and is responsible for proinflammatory cytokine production, in both viral and bacterial induced infections [40–43]. Furthermore, in ALIVE cattle, the increased expression of several proinflammatory genes was identified: CD300LG, COL1A1, CX3CR1, KIR2DL5A, LOC104968634 (NK2B), OGN, LOC782922 (PRXL2B), TARP. These gene products, largely involved in natural killer cell activation, leukocyte adhesion, prostaglandin synthesis, and initiation of the acquired immune system, possess known interactions or promotion of TLR4 and IL6 activity. In conjunction with TLR4 interactions, it is possible that the ALIVE cattle were actively combating extracellular antigens or etiological agents. The commonality between ALIVE and DEAD cattle is antigenic and immunogenic signaling without inflammatory mitigation. Notably, our research did not ascertain the order of TLR4 and IL6 association, therefore further research is necessary to define mechanistic characteristics and signaling order.
Some of the limitations of this study are the lack of antemortem pathogen identification, particularly viral isolation at arrival, and the relatively small number of biological replicates within each cohort. Due to technical and fiscal challenges associated with high-throughput sequencing, obtaining the optimal number of biological replicates remains a controversial aspect of RNA-Seq experimentation. However, it is generally accepted that three clean biological replicates per group is the minimum sample size necessary for meaningful inferential analysis [44, 45]. Modeling genes that were differentially expressed between live and dead cohorts identified increased antiviral pathways in the DEAD cohort. Accordingly, we utilized de novo alignment and BLAST toolkits to mine unmapped reads for viral sequences that would account for the gene expression changes and pathways identified in our study. Reads that fail to map to the host reference assembly have been previously used to identify pathogens within RNA-Seq datasets [46–50]. The de novo assembly reads aligned predominantly to annotated ungulate sequences (Figs 4 and 5). This is an expected occurrence in which the unmapped reads representing gene products in the tested cattle were not identified with the Bos taurus ARS-UCD1.2 reference assembly. This is not an uncommon occurrence with reference assemblies from non-model organisms that reflects errors in the assembly’s structural annotation and has been otherwise reported in transcriptomic experiments using cattle [46, 50].
A notable finding in this investigation is the lack of alignments to pathogenic organisms associated with BRD. This finding was consistent in three instances 1) when assembled contigs of the de novo transcriptome were mapped to all sequences in the NCBI non-redundant nt database; 2) when unmapped reads were mapped against all known viral sequences; and 3) when unmapped reads were mapped against all known bovine viral pathogen sequences. One key limitation of this experiment is the poly-A tail dependent capture of reads for library preparation. It is possible that pathogen genes within each whole blood sample were never captured prior to sequencing. Therefore, while we were unable to identify pathogen genes in these instances, it does not rule out the possibility of these cattle harboring etiological agents at arrival. All biological replicates possessed reads that matched only to BVDV1 sequences from the NCBI nt database. However, these reads matched solely to ubiquitin C (UBC) mRNA within Bovidae assemblies and no alignment was detectible when realigned to the NC_001461.1 complete BVDV1 genome. This demonstrates that the unmapped reads failed to align to viruses related to BRD, but rather aligned to bovine genome sequences that have been incorporated into BVDV1. It has been shown that several BVDV1 sequences possess Bovidae genomic sequence contamination, specifically to UBC mRNA. This finding agrees with the alignment discovery reported by Usman and colleagues [46]. Despite the absence of viral sequences, the identified DEGs and pathways provide evidence that the anti-viral mechanisms were activated at arrival in cattle within the DEAD cohort.
Conclusions
This study explored the at-arrival whole blood transcriptomes and differentially expressed genes of diseased cattle, identifying significant gene products and pathways that differentiate cattle that die from naturally acquired respiratory disease and those that develop BRD but survive. Our results demonstrate that cattle developing clinical BRD possess increased expression of genes involved in proinflammation and immune responses at arrival and share TLR4 and IL6 activity. Cattle that died from BRD demonstrated increased type I interferon and antiviral-associated gene product expression at arrival. Although the unmapped reads did not align to viral genomes, our at-arrival findings highlight candidate gene expression profiles that herald viral respiratory infections, prior to the identification of overt BRD. These findings may support the development of predictive viral disease assays and thus warrant further investigations comparing current pathogen detection techniques with the identification of these candidate biomarkers.
Materials and methods
Study design
All animal use and procedures were approved by the Mississippi State University Institutional Animal Care and Use Committee (IACUC protocol #17–120). This study examined whole blood transcriptomes at arrival from crossbred bulls (n = 5) and a steer (n = 1) that went on to develop clinical BRD within 28 days following facility arrival. These six animals were categorized into two groups based on BRD-attributed mortality. Group 1 (DEAD; n = 3) cattle died of naturally occurring BRD despite antimicrobial and supportive treatment; all animals within the DEAD cohort succumbed to BRD prior to administrated euthanasia. Group 2 (ALIVE; n = 3) cattle were treated for naturally occurring BRD, but subsequently recovered after one or more therapeutic courses of treatment. Animals in this investigation were a subset of a randomized experiment pertaining to the effect of vaccination and deworming on post-weaned beef cattle health and growth factors [51]. Animals enrolled in this study were purchased from local commercial livestock auctions within Mississippi and housed at the H. H. Leveck Animal Research Center at Mississippi State University. Further information involving animal management and enrollment selection is addressed in S6 File and in detail in our previous studies [14, 51]. At arrival, jugular blood samples were collected into Tempus Blood RNA tubes (Applied Biosystems), and then frozen and stored at -80° C until analysis. RNA extraction, quality assessment, cDNA library preparation, and RNA sequencing (80 million reads/sample) was performed by the UCLA Technology Center for Genomics and Bioinformatics (UCLA TCGB, Los Angeles, CA, USA) as previously described [14]. The whole blood transcriptomes of the six individuals examined in this investigation have been previously contrasted against at arrival whole blood transcriptomes from cattle that failed to develop clinical signs of BRD [14]. RNA sequence reads are available in the NCBI sequence read archive (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE136176). In contrast to our prior work, this investigation contrasts gene expression at arrival in cattle that went on to develop more clinically severe BRD versus less clinically severe BRD.
Differential gene expression analysis
Program parameters and alignment statistics for reference-based gene count matrix construction were as previously described [14]. All sequence alignment map (SAM) files produced by HISAT2 were converted to binary alignment map (BAM) files, sorted, and indexed with SAMtools v1.9 [52]. All unmapped reads were extracted using SAMtools for further exploration. Reference-guided assembly and assessment were performed with StringTie v2.0 and GffCompare v0.11.4, respectively [53–55]. Gene-level read counts for each sample were calculated in Python v2.7.17 with the program prepDE.py [56].
Filtering, normalization, and analysis of gene counts was performed in R, utilizing the Bioconductor [57] software package edgeR v3.26.8 [58, 59]. Data for all six biological replicates were categorized into two groups based on BRD-associated mortality (n = 3 DEAD; n = 3 ALIVE). Filtering of low gene counts was performed as described by Chen and colleagues using a total count per million minimum of 0.2 across at least three samples [60]. Library sizes were normalized using the trimmed mean of M-values method [61] in edgeR. Unsupervised clustering of the aligned reads was performed using multidimensional scaling (MDS) in order to plot differences in expression profiles between the 6 animals [62]. Distances between samples on the MDS plot represent ‘leading fold change’, defined as the root-mean-square average of the log-fold-changes for the genes best distinguishing each pair of samples. Differentially expressed genes (DEGs) were identified in edgeR using likelihood ratio testing (glmLRT function) to improve the ability to analyze samples with large gene count dispersions and low abundance counts [59]. Differential gene expression was considered significant with a false discovery rate (FDR) of ≤ 0.10 [63].
Biological interpretation of gene expression data
A heat map of the DEGs was created using the R package pheatmap v1.0.12 [64]. Gene Ontology (GO) analysis, biological pathways, and disease associations of the DEGs identified between DEAD and ALIVE groups were analyzed with the WEB-based Gene SeT AnaLysis Toolkit (WebGestalt 2019; http://www.webgestalt.org/), using the human orthologs of all bovine DEGs [65]. Overrepresented biological pathway analysis was performed utilizing the pathway database Reactome [15, 16]. Disease association analysis with the list of DEGs was performed using the GLAD4U functional database [17]. Analysis parameters within WebGestalt 2019 included between 5 and 3000 genes per category and an FDR cutoff of < 0.05 for significance. Protein-protein interaction networks and protein co-expression analysis was conducted with the Search Tool for the Retrieval of Interacting Genes (STRING; https://string-db.org/) database v11.0 [18] using human orthologs of the bovine DEGs. All interactions required a minimum interaction (confidence) score of 0.900, defined as the highest confidence score in STRING v11.0.
Analysis of unmapped reads
All reads in this study were originally aligned to the Bos taurus reference assembly ARS-UCD1.2. The unmapped reads which failed to align were extracted using SAMtools view -b option. The subsequent BAM files were converted into unique paired end fastq files with BEDtools v2.26.0 bamtofastq option [66]. Unmapped fastq files were retrimmed and quality assessed with Trimmomatic v0.38 and FastQC v0.11.9, respectively, in order to eliminate the potential of poor quality or inadequate length of sequences leading to misalignment. Trimming parameters were: 1) leading and trailing bases of each read were removed if their base quality score was below 3, 2) each read was scanned with a 3-base pair sliding window, removing read segments below a minimum base quality score of 15, and 3) sequences below a read length of 40 bases were removed. Read samples were concatenated based on directionality, and then assembled de novo into contigs using Trinity v2.8.5 by employing the program’s default protocols [67]. Unmapped read trimming and de novo alignment statistics are provided in S3 File.
Nucleotide sequence homology of the de novo transcriptome was explored against the NCBI non-redundant nucleotide database (ftp://ftp.ncbi.nlm.nih.gov/blast/db/; accessed September 30, 2019) with NCBI-blast v2.9.0+ (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/; blastn parameter, default settings) [68]. The top alignment hit for each contig was identified and used to create a blastn file that was further analyzed for phylogenetic grouping and characterization using MEGAN Community Edition v6.18.3 (https://github.com/danielhuson/megan-ce) using the program’s default protocols for taxonomic identification [69, 70].
Due to likely inaccuracies in the de novo assembly that trace to false chimeric contigs [71, 72], and persistence of reads that were not incorporated into the de novo transcriptome, we therefore analyzed all trimmed reads against bovine viral pathogen sequences downloaded from NCBI (https://www.ncbi.nlm.nih.gov/labs/virus/vssi; accessed January 6, 2020). A total of 1657 nucleotide files were downloaded and utilized as subject sequences by selecting only complete nucleotide sequence types found from Bos taurus (taxid: 9913) hosts. In addition, we also aligned reads from cattle that were not incorporated into the de novo transcriptome to all known viral sequences at NCBI (release 200, https://ftp.ncbi.nlm.nih.gov/refseq/release/complete/). Local alignment was performed with the NCBI-blast v2.9.0+ blastn option, using the same settings parameters as previously mentioned. The resulting blastn file from each sample was explored with MEGAN Community Edition v6.18.3. Top hits were scrutinized for potential genomic DNA contamination in the reference subjects by re-aligning the respective cDNA of the sample read sequence to both the official genome assembly that it annotated as and to the NCBI nucleotide database.
Supporting information
Acknowledgments
The authors would like to thank the students and staff of the Mississippi Agricultural and Forestry Experiment Station (MAFES), Mississippi State University College of Veterinary Medicine, and Mississippi State University Animal and Diary Science Department for their assistance in animal care and sample collection. Additionally, we would like to thank Merrilee Thoresen, Daniele Doyle, and Kathleen Barton for their technical assistance and insight throughout this experiment.
Data Availability
The datasets supporting the conclusions of this article are publicly available in the Gene Expression Omnibus (GEO) repository, https://www.ncbi.nlm.nih.gov/geo, under the accession number GSE136176, and the Sequence Read Archive (SRA) under the accession number SRP219429. All remaining data generated or analyzed during this study are included in this published article and its additional information files.
Funding Statement
This project was funded and supported by the Mississippi State University Department of Pathobiology and Population Medicine, Mississippi State University Department of Animal and Dairy Sciences, and the Mississippi Agricultural and Forestry Experiment Station. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Grissett G, White B, Larson R. Structured literature review of responses of cattle to viral and bacterial pathogens causing bovine respiratory disease complex. J Vet Intern Med. 2015. May-Jun; 29(3): 770–780. 10.1111/jvim.12597 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Holman D, Timsit E, Alexander T. The Nasopharyngeal Microbiota of Feedlot Cattle. Sci Rep. 2015. October; 5: 15557. 10.1038/srep15557 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Holman D, Timsit E, Amat S, Abbot D, Buret A, Alexander T. The Nasopharyngeal Microbiota of Beef Cattle Before and After Transport to a Feedlot. BMC Microbiol. 2017. March; 17(1): 70. 10.1186/s12866-017-0978-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kishimoto M, Tsuchiaka S, Rahpaya S, Hasebe A, Otsu K, Sugumura S, et al. Development of a one-run real-time PCR detection system for pathogens associated with bovine respiratory disease complex. J Vet Med Sci. 2017. March; 79(3): 517–523. 10.1292/jvms.16-0489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Taylor J, Fulton R, Lehenbauer T, Step D, Confer A. The epidemiology of bovine respiratory disease: What is the evidence for preventive measures? Can Vet J. 2010. October; 51(10): 1095–1102. [PMC free article] [PubMed] [Google Scholar]
- 6.Babcock A, White B, Dritz S, Tomson D, Renter D. Feedlot health and performance effects associated with the timing. J Anim Sci. 2009. January; 87(1): 314–327. 10.2527/jas.2008-1201 [DOI] [PubMed] [Google Scholar]
- 7.Murray G, More S, Sammin D, Casey M, McElroy M, O’Neil R, et al. Pathogens, patterns of pneumonia, and environmental risk factors associated with respiratory disease in recently weaned cattle in Ireland. J Vet Diagn Invest. 2017. January; 29(1): 20–34. 10.1177/1040638716674757 [DOI] [PubMed] [Google Scholar]
- 8.Griffin D, Chengappa M, Kuszak J, McVey D. Bacterial Pathogens of the Bovine Respiratory Disease Complex. Vet Clin North Am Food Anim Pract. 2010. July; 26(2): 381–394. 10.1016/j.cvfa.2010.04.004 [DOI] [PubMed] [Google Scholar]
- 9.White B, Renter D. Bayesian estimation of the performance of using clinical observations and harvest lung lesions for diagnosing bovine respiratory disease in post-weaned beef calves. J Vet Diagn Invest. 2009. July; 21(4): 446–453. 10.1177/104063870902100405 [DOI] [PubMed] [Google Scholar]
- 10.Timsit E, Dendukuri N, Schiller I, Buczinski S. Diagnostic accuracy of clinical illness for bovine respiratory disease (BRD) diagnosis in beef cattle placed in feedlots: A systematic literature review and hierarchical Bayesian latent-class meta-analysis. Prev Vet Med. 2016. December; 135: 67–73. 10.1016/j.prevetmed.2016.11.006 [DOI] [PubMed] [Google Scholar]
- 11.Ives S, Richeson J. Use of Antimicrobial Metaphylaxis for the Control of Bovine Respiratory Disease in High-Risk Cattle. Vet Clin North Am Food Anim Pract. 2015. November; 31(3): 341–350. 10.1016/j.cvfa.2015.05.008 [DOI] [PubMed] [Google Scholar]
- 12.Abell K, Theurer M, Larson R, White B, Apley M. A Mixed Treatment Comparison Meta-Analysis of Metaphylaxis Treatments for Bovine Respiratory Disease in Beef Cattle. J Anim Sci. 2017. February; 95(2): 626–635. 10.2527/jas.2016.1062 [DOI] [PubMed] [Google Scholar]
- 13.Baptiste K, Kyvsgaard N. Do antimicrobial mass medications work? A systematic review and meta-analysis of randomised clinical trials investigating antimicrobial prophylaxis or metaphylaxis against naturally occurring bovine respiratory disease. Pathog Dis. 2017. September; 75(5): 10.1093/femspd/ftx083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Scott M, Woolums A, Swiderski C, Perkins A, Nanduri B, Smith D, et al. Whole blood transcriptomic analysis of beef cattle at arrival identifies potential predictive molecules and mechanisms that indicate animals that naturally resist bovine respiratory disease. PloS One. 2020. January; 15(1): e0227507. 10.1371/journal.pone.0227507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Jassal B, Matthews L, Viteri G, Gong C, Lorente P, Fabregat A, et al. The reactome pathway knowledgebase. Nucleic Acids Res. 2020. January; 48(D1): D498–D503. 10.1093/nar/gkz1031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, et al. The Reactome Pathway Knowledgebase. Nucleic Acids Res. 2018. January; 46(D1): D649–D655. 10.1093/nar/gkx1132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Jourquin J, Duncan D, Shi Z, Zhang B. GLAD4U: deriving and prioritizing gene lists from PubMed literature. BMC Genomics. 2012. December; 13(Suppl 8): S20. 10.1186/1471-2164-13-S8-S20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Szklarczyk D, Gable A, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019. January; 47(D1): D607–D613. 10.1093/nar/gky1131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Step D, Krehbiel C, DePra H, Cranston J, Fulton R, Kirkpatrick J, et al. Effects of commingling beef calves from different sources and weaning protocols during a forty-two-day receiving period on performance and bovine respiratory disease. J Anim Sci. 2008. November; 86(11): 3146–3158. 10.2527/jas.2008-0883 [DOI] [PubMed] [Google Scholar]
- 20.Cramer M, Proudfoot K, Ollivett T. Short communication: Behavioral attitude scores associated with bovine respiratory disease identified using calf lung ultrasound and clinical respiratory scoring. J Dairy Sci. 2019. July; 102(7): 6540–6544. 10.3168/jds.2018-15550 [DOI] [PubMed] [Google Scholar]
- 21.Abutarbush S, Pollock C, Wildman B, Perrett T, Schunicht O, Fenton R, et al. Evaluation of the diagnostic and prognostic utility of ultrasonography at first diagnosis of presumptive bovine respiratory disease. Can J Vet Res. 2012. January; 76(1): 23–32. [PMC free article] [PubMed] [Google Scholar]
- 22.Amrine D, White B, Larson R. Comparison of classification algorithms to predict outcomes of feedlot cattle identified and treated for bovine respiratory disease. Comput Electron Agric. 2014. July; 105: 9–19. [Google Scholar]
- 23.Terrell S, Thomson D, Wileman B, Apley M. A survey to describe current feeder cattle health and well-being program recommendations made by feedlot veterinary consultants in the United States and Canada. Bovine Pract. 2011; 45: 140–148. [Google Scholar]
- 24.Barreto D, Barros G, Santos L, Soares R, Batista M. Comparative transcriptomic analysis of bovine papillomatosis. BMC Genomics. 2018. December; 19(1): 949. 10.1186/s12864-018-5361-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Tizioto P, Kim J, Seabury C, Schnabel R, Gershwin L, Van Eenennaam A, et al. Immunological Response to Single Pathogen Challenge with Agents of the Bovine Respiratory Disease Complex: An RNA-Sequence Analysis of the Bronchial Lymph Node Transcriptome. PLoS One. 2015. June; 10(6): e0131459. 10.1371/journal.pone.0131459 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Johnston D, Earley B, McCabe M, Lemon K, Duffy C, McMenamy M, et al. Experimental Challenge With Bovine Respiratory Syncytial Virus in Dairy Calves: Bronchial Lymph Node Transcriptome Response. Sci Rep. 2019. October; 9(1): 14736. 10.1038/s41598-019-51094-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Foley C, Chapwanya A, Creevey C, Narciandi F, Morris D, Kenny E, et al. Global endometrial transcriptomic profiling: transient immune activation precedes tissue proliferation and repair in healthy beef cows. BMC Genomics. 2012. September; 13(1): 489. 10.1186/1471-2164-13-489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hodgins D, Conlon J, Shewen P. Respiratory Viruses and Bacteria in Cattle. In Brogden K, Guthmiller J, editors. Polymicrobial Diseases. Washington DC: ASM Press; 2002. p. Chapter 12. [Google Scholar]
- 29.Ellis J. Update on Viral Pathogenesis in BRD. Anim Health Res Rev. 2009. December; 10(2): 149–153. 10.1017/S146625230999020X [DOI] [PubMed] [Google Scholar]
- 30.Liew C, Ma J, Tang H, Zheng R, Dempsey A. The Peripheral Blood Transcriptome Dynamically Reflects System Wide Biology: A Potential Diagnostic Tool. J Lab Clin Med. 2006. March; 147(3): 126–132. 10.1016/j.lab.2005.10.005 [DOI] [PubMed] [Google Scholar]
- 31.Morrow J, Chase R, Parker M, Glass K, Seo M, Divo M, et al. RNA-sequencing Across Three Matched Tissues Reveals Shared and Tissue-Specific Gene Expression and Pathway Signatures of COPD. Respir Res. 2019. April; 20(1): 65. 10.1186/s12931-019-1032-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kawai T, Akira S. Toll-like Receptors and Their Crosstalk With Other Innate Receptors in Infection and Immunity. Immunity. 2011. May; 34(5): 637–650. 10.1016/j.immuni.2011.05.006 [DOI] [PubMed] [Google Scholar]
- 33.Uematsu S, Akira S. Toll-like Receptors and Type I Interferons. J Biol Chem. May 2007; 282(21): 15319–15323. 10.1074/jbc.R700009200 [DOI] [PubMed] [Google Scholar]
- 34.Fujisawa H, Wang B, Sauder D, Kondo S. Effects of Interferons on the Production of interleukin-6 and interleukin-8 in Human Keratinocytes. J Interferon Cytokine Res. 1997. June; 17(6): 347–353. 10.1089/jir.1997.17.347 [DOI] [PubMed] [Google Scholar]
- 35.Velazquez-Salinas L, Pauszek S, Stenfeldt C, O’Hearn E, Pacheco J, Borca M, et al. Increased Virulence of an Epidemic Strain of Vesicular Stomatitis Virus Is Associated With Interference of the Innate Response in Pigs. Front Microbiol. 2018. August; 9: 1981. 10.3389/fmicb.2018.01981 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhoe D, Mattner J, Cantu C3, Schrantz N, Yin N, Gao Y, et al. Lysosomal Glycosphingolipid Recognition by NKT Cells. Science. 2004. December; 306(5702): 1786–1789. 10.1126/science.1103440 [DOI] [PubMed] [Google Scholar]
- 37.Lopez-Collazo E, Hortelano S, Rojas A, Bosca L. Triggering of Peritoneal Macrophages with IFN-α/β Attenuates the Expression of Inducible Nitric Oxide Synthase Through a Decrease in NF-κB Activation. J Immunol. 1998. March; 160(6): 2889–2895. [PubMed] [Google Scholar]
- 38.Nguyen K, Watford W, Salomon R, Hofmann S, Pien G, Morinobu A, et al. Critical Role for STAT4 Activation by Type 1 Interferons in the Interferon-Gamma Response to Viral Infection. Science. 2002. September; 297(5589): 2063–2066. 10.1126/science.1074900 [DOI] [PubMed] [Google Scholar]
- 39.Stockinger S, Materna T, Stoiber D, Bayr L, Steinborn R, Kolbe T, et al. Production of Type I IFN Sensitizes Macrophages to Cell Death Induced by Listeria Monocytogenes. J Immunol. 2002. December; 169(11): 6522–6529. 10.4049/jimmunol.169.11.6522 [DOI] [PubMed] [Google Scholar]
- 40.Akira S, Takeda K. Toll-like receptor signalling. Nat Rev Immunol. 2004. July; 4(7): 499–511. 10.1038/nri1391 [DOI] [PubMed] [Google Scholar]
- 41.Openshaw P, Tregoning J. Immune Responses and Disease Enhancement during Respiratory Syncytial Virus Infection. Clin Microbiol Rev. 2005. July; 18(3): 541–555. 10.1128/CMR.18.3.541-555.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Grütz G. New Insights Into the Molecular Mechanism of interleukin-10-mediated Immunosuppression. J Leukoc Biol. 2005. January; 77(1): 3–15. 10.1189/jlb.0904484 [DOI] [PubMed] [Google Scholar]
- 43.Hodgson P, Aich P, Manuja A, Hokamp K, Roche F, Brinkman F, et al. Effect of stress on viral–bacterial synergy in bovine respiratory disease: novel mechanisms to regulate inflammation. Comp Funct Genomics. 2005; 6(4): 244–250. 10.1002/cfg.474 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Conesa A, Madrigal P, Tarazona S, Gomez-Cabrero D, Cervera A, McPherson A, et al. A survey of best practices for RNA-seq data analysis. Genomic Biol. 2016. January; 17(13). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Schurch N, Schofield P, Gierliński M, Cole C, Sherstnev A, Singh V, et al. Erratum: How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use? RNA. 2016. October; 22:10: 1641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Usman T, Hadlich F, Demasius W, Weikard R, Kuhn C. Unmapped reads from cattle RNAseq data: A source for missing and misassembled sequences in the reference assemblies and for detection of pathogens in the host. Genomics. 2017. January; 109(1): 36–42. 10.1016/j.ygeno.2016.11.009 [DOI] [PubMed] [Google Scholar]
- 47.Merchant S, Wood D, Salzberg S. Unexpected cross-species contamination in genome sequencing projects. Peer J. 2014. November; 2: e675. 10.7717/peerj.675 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Isakov O, Modai S, Shomron N. Pathogen detection using short-RNA deep sequencing subtraction and assembly. Bioinformatics. 2011. August; 27(15): 2027–2030. 10.1093/bioinformatics/btr349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Bhaduri A, Qu K, Lee C, Ungewickell A, Khavari P. Rapid identification of non-human sequences in high-throughput sequencing datasets. Bioinformatics. 2012. April; 28(8): 1174–1175. 10.1093/bioinformatics/bts100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Whitacre L, Tizioto P, Kim J, Sonstegard T, Schroeder S, Schroeder S, et al. What’s in your next-generation sequence data? An exploration of unmapped DNA and RNA sequence reads from the bovine reference individual. MBC Genomics. 2015. December; 16: 1114. 10.1186/s12864-015-2313-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Griffin C, Scott J, Karisch B, Woolums A, Blanton JR, et al. A randomized controlled trial to test the effect of on-arrival vaccination and deworming on stocker cattle health and growth performance. Bov Pract (Stillwater). 2018. Spring; 52(1): 26–33. [PMC free article] [PubMed] [Google Scholar]
- 52.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer NMG, et al. The Sequence Alignment/Map Format and SAMtools. Bioinformatics. 2009. August; 25(16): 2078–2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Pertea M, Pertea G, Antonescu C, Chang T, Mendell J, Salzberg S. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015. March; 33(3): 290–295. 10.1038/nbt.3122 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pertea M, Kim D, Pertea G, Leek J, Salzberg S. Transcript-level Expression Analysis of RNA-seq Experiments With HISAT, StringTie and Ballgown. Nat Protoc. 2016. September; 11(9): 1650–1667. 10.1038/nprot.2016.095 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Pertea G, Kirchner R. Johns Hopkins University Center for Computational Biology (CCB). [Online].; 2019 [cited 2019. Available from: https://ccb.jhu.edu/software/stringtie/gffcompare.shtml.
- 56.Pertea G. stringtie/prepDE.py. [Online].; 2019 [cited 2019. Available from: https://github.com/gpertea/stringtie/blob/master/prepDE.py.
- 57.Gentleman R, Carey V, Bates D, Bolstad B, Dettling M, Dudoit S, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004. September; 5(10): R80. 10.1186/gb-2004-5-10-r80 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Robinson M, McCarthy D, Smyth G. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26(1): 139–140. 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.McCarthy J, Chen Y, Smyth K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012; 40(10): 4288–4297. 10.1093/nar/gks042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Chen Y, Lun A, Smyth G. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Research. 2016. August; 5: 1438. 10.12688/f1000research.8987.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Robinson M, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010; 11: R25. 10.1186/gb-2010-11-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ritchie M, Phipson B, Wu D, Hu Y, Law C, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015. April; 43(7): e47. 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B. 1995; 57(1): 289–300. [Google Scholar]
- 64.Kolde R. The Comprehensive R Archive Network (CRAN). [Online].; 2019 [cited 2019. Available from: https://CRAN.R-project.org/package=pheatmap.
- 65.Liao Y, Wang J, Jaehnig E, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Research. 2019. July; 47(W1): W199–W205. 10.1093/nar/gkz401 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Quinlan A, Hall I. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010. March; 26(6): 841–842. 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Grabherr M, Haas B, Yassour M, Levin J, Thompson D, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011. May; 29(7): 644–652. 10.1038/nbt.1883 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: architecture and applications. BMC Bioinformatics. 2009. December; 10: 421. 10.1186/1471-2105-10-421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Huson D, Auch A, Qi J, Schuster S. MEGAN analysis of metagenomic data. Genome Res. 2007. March; 17(3): 377–386. 10.1101/gr.5969107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Huson D, Beier S, Flade I, Górska A, El-Hadidi M, Mitra S, et al. MEGAN Community Edition—Interactive Exploration and Analysis of Large-Scale Microbiome Sequencing Data. PLoS Comput Biol. 2016. June; 12(6): e1004957. 10.1371/journal.pcbi.1004957 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Hsieh P, Oyang Y, Chen C. Effect of de novo transcriptome assembly on transcript quantification. Sci Rep. 2019. June; 9(1): 8304. 10.1038/s41598-019-44499-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Freedman A, Clamp M, Sackton T. Error, noise and bias in de novo transcriptome assemblies. bioRxiv. 2020. January; 585745: 10.1111/1755-0998.13156 [DOI] [PubMed] [Google Scholar]