PeerJ. 2017 Sep 26;5:e3810. doi: 10.7717/peerj.3810

Gene expression of benthic amphipods (genus: Diporeia) in relation to a circular ssDNA virus across two Laurentian Great Lakes

Kalia SI Bistolas 1, Lars G Rudstam 2, Ian Hewson 1
Editor: Peter Prentis
PMCID: PMC5621510  PMID: 28966890

Abstract

Circular rep-encoding ssDNA (CRESS-DNA) viruses are common constituents of invertebrate viral consortia. Despite their ubiquity and sequence diversity, the effects of CRESS-DNA viruses on invertebrate biology and ecology remain largely unknown. This study assessed the relationship between the transcriptional profile of benthic amphipods of genus Diporeia and the presence of the CRESS-DNA virus, LM29173, in the Laurentian Great Lakes to provide potential insight into the influence of these viruses on invertebrate gene expression. Twelve transcriptomes derived from Diporeia were compared, representing organisms from two amphipod haplotype clades (Great Lakes Michigan and Superior, defined by COI barcode sequencing) with varying viral loads (up to 3 × 10⁶ genome copies organism⁻¹). Read recruitment to de novo assembled transcripts revealed 2,208 significantly over- or underexpressed contigs in transcriptomes with above average LM29173 load. Of these contigs, 31.5% were assigned a putative function. The greatest proportion of annotated, differentially expressed transcripts were associated with functions including: (1) replication, recombination, and repair, (2) cell structure/biogenesis, and (3) post-translational modification, protein turnover, and chaperones. Contigs putatively associated with innate immunity displayed no consistent pattern of expression, though several transcripts were significantly overexpressed in amphipods with high viral load. Quantitation (RT-qPCR) of target transcripts, non-muscular myosin heavy chain, β-actin, and ubiquitin-conjugating enzyme E2, corroborated transcriptome analysis and indicated that Lake Michigan and Lake Superior amphipods with high LM29173 load exhibit lake-specific trends in gene expression. While this investigation provides the first comparative survey of the transcriptional profile of invertebrates of variable CRESS-DNA viral load, additional inquiry is required to define the scope of host-specific responses to potential infection.

Keywords: Diporeia, CRESS-DNA, Laurentian Great Lakes, Transcriptomics, ssDNA virus

Introduction

Circular rep-encoding ssDNA (CRESS-DNA) virus genomes are small (∼1.7–4 kb), circular molecules that encode, at minimum, major open reading frames rep (replication initiator protein) and cap (structural capsid protein; Rosario, Duffy & Breitbart, 2012; Rosario et al., 2015). Eukaryotic CRESS-DNA viruses broadly encompass ssDNA viruses that infect plants (Geminiviridae, Nanoviridae) and metazoans (Circoviridae, Anelloviridae; Dunlap et al., 2013; Rosario et al., 2017; Rosario, Duffy & Breitbart, 2012), and include common and important pathogens of ecologically or commercially relevant vertebrates. For example, beak and feather disease virus (BFDV, Circoviridae) is responsible for persistent immunosuppression in avian hosts (Eastwood et al., 2014) and porcine circoviruses infect domestic swine, manifesting sub-clinically (PCV1) or eliciting postweaning multisystemic wasting syndrome (PMWS, PCV2; Allan & Ellis, 2000). The use of culture-independent (metaviromic) approaches has led to the discovery and characterization of an extraordinary diversity of novel ssDNA viruses in environmental reservoirs and non-model invertebrates (Labonté & Suttle, 2013; Rosario & Breitbart, 2011; Rosario et al., 2017; Rosario, Duffy & Breitbart, 2012; Roux et al., 2016). To date, the etiology and pathology of ssDNA viruses in invertebrates, and the nature of their association with any invertebrate host, remain wholly unknown. This study utilized whole transcriptome sequencing to investigate the relationship between a CRESS-DNA virus and benthic amphipods of genus Diporeia from the Laurentian Great Lakes.

Circular rep-encoding ssDNA viruses have been identified in association with several major aquatic invertebrate phyla, including the Annelida, Arthropoda, Chaetognatha, Cnidaria, Ctenophora, Echinodermata, and Mollusca, among others (Breitbart et al., 2015; Dayaram et al., 2016; Dunlap et al., 2013; Eaglesham & Hewson, 2013; Fahsbender et al., 2015; Kibenge & Godoy, 2016; Hewson et al., 2013a, 2013b; Jackson et al., 2016; Rosario et al., 2015; Soffer et al., 2013). These viruses appear to be biogeographically widespread, taxonomically diverse, and common constituents of crustacean nanobiomes (Dunlap et al., 2013; Hewson et al., 2013a, 2013b; Labonté & Suttle, 2013; Rosario et al., 2015, 2017; Rosario, Duffy & Breitbart, 2012). However, little is known about the role of CRESS-DNA viruses in mediating crustacean ecology, physiology, and mortality. Because no immortal crustacean cell line currently exists, propagation of crustacean-associated CRESS-DNA viruses in vitro remains intractable, and the unknown nature of CRESS-DNA virus tropism and infection dynamics in these systems impedes targeted sequencing of virus-infected cells. Furthermore, many microcrustaceans cannot be reared or maintained effectively in aquaria without significant physiological stress and high incidence of mortality, hindering in vivo infection experiments. Therefore, we implemented a whole-organism comparative transcriptome sequencing (transcriptomics) approach to evaluate the relationship between the presence of CRESS-DNA viral genotype, LM29173, and benthic crustaceans (genus: Diporeia) in Great Lakes ecosystems.

Diporeia are historically abundant benthic meiofauna in the Laurentian Great Lakes (Auer et al., 2013; Barbiero et al., 2011; Birkett, Lozano & Rudstam, 2015; Guiguer & Barton, 2002). These amphipods influence lake-wide biogeochemistry and mediate relationships between spring diatom blooms and upper trophic level consumers through detritivory and sediment bioturbation (Gardner et al., 1985; Guiguer & Barton, 2002; Halfon, Schito & Ulanowicz, 1996; Wells, 1980). Localized and precipitous declines in several Diporeia populations have prompted exploration of their viral consortia (Bistolas et al., 2017; Hewson et al., 2013a). Metavirome sequencing has documented a common and recurrent CRESS-DNA virus genotype, LM29173, frequently detected in impacted Diporeia populations in Lakes Michigan and Huron, but rare among specimens from stable Lake Superior populations (Bistolas et al., 2017; Hewson et al., 2013a). It is also prevalent among amphipods from the deep, glacial Finger Lakes of Central New York (Seneca, Cayuga, and Owasco Lakes). Previous DNA barcoding of maternally inherited cytochrome c oxidase I (COI) sequences (Pilgrim et al., 2009; Bistolas et al., 2017) has revealed sub-species genetic variation between impacted and stable populations, with Diporeia from Lakes Michigan, Huron, Ontario, Erie, and the Finger Lakes comprising a southern lake haplotype clade, and amphipods from Lake Superior comprising a northern lake haplotype clade. While LM29173 is more abundant in Diporeia from declining southern populations than stable northern populations, the impact of this CRESS-DNA virus on amphipod biology has not yet been described. This study offers preliminary insight into the relationship between LM29173 and gene expression in amphipods from both haplotype clades, and provides transcriptional targets for further investigation.
Specific objectives of this study were to (1) investigate the association between LM29173 presence/load and the transcriptional profile of Diporeia, (2) determine if detected changes in gene expression are specific to distinct Diporeia haplotypes, and (3) explore the effect of LM29173 presence on amphipod transcription of innate immunity regulators/effectors.

Materials and Methods

Sample collection and transcriptome preparation

Diporeia were collected in August–September 2014 via Ponar benthic sampler from Great Lakes Michigan and Superior at EPA-designated stations (Fig. 1, Table S1; United States Environmental Protection Agency, 2012). Organisms were sieved to remove sediment (500 μm), rinsed, and immediately individually frozen at −80 °C.

Figure 1. Amphipod collection sites in the Laurentian Great Lakes (August–September, 2014).

Collection locations are congruent with EPA-Great Lakes National Program Office (GLNPO) designated stations (United States Environmental Protection Agency, 2012). Specimens were collected on the R/V Lake Guardian via Ponar benthic sampler. Bathymetry data was provided by NOAA National Geophysical Data Center’s Marine Geology & Geophysics Division (NGDC/MGG) and the NOAA Great Lakes Environmental Research Laboratory (GLERL). Map service published and hosted by Esri Canada© 2012 under Attribution-NonCommercial 2.5 Canada (CC BY-NC 2.5 CA) license https://creativecommons.org/licenses/by-nc/2.5/ca/.

Nucleic acids were extracted from individual amphipods via ZR-Duet™ DNA/RNA MiniPrep kit (Zymo Research, Irvine, CA, USA). Presence and genome load (copy number) of LM29173 were determined via qPCR per Hewson et al. (2013a) using SsoAdvanced™ Universal Probes Supermix (Bio-Rad Laboratories, Hercules, CA, USA), corrected for total extraction volume, and standardized by organism wet weight (mg). Two samples with the highest and two samples with the lowest copy numbers organism⁻¹ of LM29173 from each of three stations (Lake Michigan 27 and 40, Lake Superior 066; Fig. 2, Fig. S1; United States Environmental Protection Agency, 2012) were selected for transcriptome preparation (n = 12 total transcriptomes, n = 4 per station). For selected samples, RNA fractions were further enzymatically digested with TurboDNAse (Thermo Fisher Scientific, Waltham, MA, USA) for 15 min to reduce co-extracted DNA. Ribosomal RNA was depleted via mRNA-ONLY™ mRNA Isolation Kit (Epicentre, Madison, WI, USA), and remaining RNA was reverse transcribed and amplified via the TransPlex® Complete Whole Transcriptome Amplification Kit (WTA2; Sigma-Aldrich, Saint Louis, MO, USA) per manufacturer instructions. Resulting cDNA libraries were quantified via PicoGreen fluorescence and prepared for sequencing using a Nextera XT DNA Library Preparation Kit (Illumina, San Diego, CA, USA). Resulting libraries were subjected to 2 × 250 bp paired-end sequencing on an Illumina MiSeq at the Cornell University Core Laboratories Center (Ithaca, NY, USA). Libraries were deposited in GenBank (accession: PRJNA379017; SRR5341776–SRR5341788).
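The load correction and sample selection described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function and parameter names (`copies_per_mg`, `template_ul`, `extraction_ul`, `pick_extremes`) are hypothetical, and the volume scaling shown is one conventional way to correct a per-reaction copy number for total extraction volume, not necessarily the exact calculation used in the study.

```python
from collections import defaultdict


def copies_per_mg(copies_per_reaction, template_ul, extraction_ul, wet_weight_mg):
    """Scale a per-reaction qPCR copy number to the whole extract,
    then standardize by organism wet weight (mg).
    All parameter names are hypothetical."""
    copies_per_organism = copies_per_reaction * (extraction_ul / template_ul)
    return copies_per_organism / wet_weight_mg


def pick_extremes(samples, n=2):
    """Pick the n lowest- and n highest-load samples at each station.

    samples: iterable of (sample_id, station, load) tuples.
    Returns {station: {"low": [ids], "high": [ids]}}, with ids ordered
    by increasing load.
    """
    by_station = defaultdict(list)
    for sample_id, station, load in samples:
        by_station[station].append((load, sample_id))

    picked = {}
    for station, rows in by_station.items():
        rows.sort()  # ascending by load
        picked[station] = {
            "low": [sid for _, sid in rows[:n]],
            "high": [sid for _, sid in rows[-n:]],
        }
    return picked
```

With n = 2 and three stations, this selection yields the 12 libraries (four per station) described in the text, provided each station has at least four quantified amphipods.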

Figure 2. Quantitative detection of LM29173.

(A) Prevalence and average load (log₁₀-transformed copy number mg⁻¹ of tissue ± 1 SE) of CRESS-DNA virus genotype LM29173 in amphipods from Great Lakes Michigan, Huron, and Superior. Viral load was significantly greater in Lake Michigan than Lakes Huron (Games–Howell post hoc t = 7.30, p = 3.1 × 10⁻⁹) or Superior (Games–Howell post hoc t = 7.30, p = 3.0 × 10⁻⁹); (B) Load of LM29173 (copy number organism⁻¹) in amphipods selected for transcriptome sequencing. Four samples (two with the highest and two with the lowest viral load) were selected from each of three stations: Lake Michigan 27 (Mi27), Lake Michigan 40 (Mi40), and Lake Superior 066 (Su066). Individual transcriptomes are denoted by sample ID (#71, 72, 75, 77, 121, 128, 139, 130, 358, 359, 361, and 362).

Transcriptome assembly and comparison of transcript expression

Reads were trimmed for quality (quality score < 0.05, modified-Mott trimming algorithm), ambiguous nucleotides (n = 0), length (50 nt ≤ length ≤ 251 nt), and Illumina adapters via CLC Genomics Workbench (v.8.5.1; Qiagen, Hilden, Germany). Reads mapped to SILVA rRNA databases (90% identity, 50% coverage via CLC Genomics Workbench; http://www.arb-silva.de/) were excluded from assembly. Remaining reads were then assembled de novo using Trinity on the Galaxy bioinformatics platform per default parameters (National Center for Genome Analysis Support, Indiana University Pervasive Technology Institute; Table S2). Resulting contigs were further clustered via CD-HIT-EST to reduce isoform redundancy (sequence identity cutoff = 0.98). Reads were aligned to contigs via the Bioconductor package EdgeR (Robinson & Smyth, 2007) in CLC Genomics Workbench (v.8.5.1; Qiagen, Hilden, Germany) to calculate relative read recruitment (reads per kilobase of transcript per million mapped reads; RPKM) and significance (corrected for multiple comparisons via false discovery rate methods; FDR). Contigs that exhibited >10-fold change (EdgeR) in read recruitment, ΔRPKM > 100, and FDR-adjusted p < 0.05 between the six low LM29173 load libraries and six high LM29173 load libraries were considered significantly differentially expressed genes (DEGs). DEGs were then annotated using Blast2Go (v.4.0.7; BLASTx, e < 1 × 10⁻⁵) and functionally classified by EuKaryotic Orthologous Group, or “KOG” (Joint Genome Institute).
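The RPKM definition and the three DEG thresholds stated above can be expressed as a minimal Python sketch. The function names are hypothetical, and two details are assumptions rather than statements from the text: ΔRPKM is taken here as the absolute difference in mean RPKM between the high- and low-load groups, and the fold change is treated as a signed value whose magnitude is compared with the threshold. The fold change and FDR-adjusted p-value are taken as inputs (for example, as exported from the differential expression software) rather than recomputed.

```python
def rpkm(reads_mapped_to_contig, contig_length_nt, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads."""
    return reads_mapped_to_contig * 1e9 / (contig_length_nt * total_mapped_reads)


def is_deg(mean_rpkm_low, mean_rpkm_high, fold_change, fdr_p,
           min_fold=10.0, min_delta_rpkm=100.0, alpha=0.05):
    """Apply the three DEG criteria from the text.

    fold_change: signed fold change reported by the statistics software
                 (assumed negative for underexpression).
    fdr_p:       FDR-adjusted p-value from the same software.
    """
    # Assumption: delta RPKM is the absolute difference of group means.
    delta_rpkm = abs(mean_rpkm_high - mean_rpkm_low)
    return (abs(fold_change) > min_fold
            and delta_rpkm > min_delta_rpkm
            and fdr_p < alpha)
```

For instance, 50 reads recruited to a 500 nt contig in a library of 1,000,000 mapped reads gives an RPKM of 100.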

Eight Lake Michigan libraries were grouped by station and viral load to identify DEGs common between both stations in Lake Michigan, minimizing the effect of between-lake genetic and environmental variance. DEGs shared between libraries were defined per the following criteria: >2-fold change in expression, ΔRPKM > 10 between libraries, significantly differentially expressed with an FDR-adjusted p < 0.05, and consistently over or underexpressed in both Lake Michigan stations. Contigs fulfilling these criteria were annotated via BLASTx against the non-redundant (nr) database and assessed for relevance to viral infection (Altschul et al., 1990).
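The shared-DEG criteria above (relaxed thresholds plus a consistent direction of change at both Lake Michigan stations) could be applied as in the following Python sketch. The data layout (a dictionary per station mapping a contig ID to fold change, ΔRPKM, and FDR-adjusted p) and the function names are illustrative assumptions, as is the convention that a negative fold change denotes underexpression.

```python
def direction(fold_change, delta_rpkm, fdr_p,
              min_fold=2.0, min_delta=10.0, alpha=0.05):
    """Return +1 (overexpressed), -1 (underexpressed), or 0 (fails a threshold)."""
    if abs(fold_change) <= min_fold:
        return 0
    if abs(delta_rpkm) <= min_delta:
        return 0
    if fdr_p >= alpha:
        return 0
    return 1 if fold_change > 0 else -1


def shared_degs(station_a, station_b):
    """Contigs that pass all thresholds and change in the same direction
    at both stations.

    station_a, station_b: {contig_id: (fold_change, delta_rpkm, fdr_p)}
    Returns {contig_id: +1 or -1}.
    """
    shared = {}
    for contig in station_a.keys() & station_b.keys():
        dir_a = direction(*station_a[contig])
        dir_b = direction(*station_b[contig])
        if dir_a != 0 and dir_a == dir_b:
            shared[contig] = dir_a
    return shared
```

A contig that is overexpressed at one station but underexpressed at the other is dropped, which is the "consistently over or underexpressed" requirement in the text.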

To identify contigs affiliated with putative immune functions, reference sequences associated with invertebrate innate immunity were collected from the Insect Innate Immunity Database (Brucker et al., 2012) or curated from NCBI protein database queries of keywords in Table 1 of McTaggart et al. (2009) (keywords listed in Table S3). Contigs homologous to these genes were identified via BLASTx (e < 1 × 10⁻⁵; Altschul et al., 1990), and the RPKM values of those that were >2-fold over- or underexpressed in both Lake Michigan stations (Mi27 and Mi40) were standardized to total 18S rRNA RPKM per library and depicted via the web-based visualization tool Morpheus (Broad Institute, Cambridge, MA, USA).

Table 1. Total number of over- and underexpressed contigs in transcriptomes with above average LM29173 load.

Library    Overexpressed    Underexpressed    Total
SU066      169              65                234
MI40       129              161               290
MI27       1,497            187               1,684

Notes:

Contigs that exhibited >10-fold change (EdgeR), ΔRPKM > 100, and FDR-adjusted p < 0.05 were considered significantly differentially expressed genes (DEGs).

Quantification (RT-qPCR) of differentially expressed target genes

Whole amphipods were collected in August–September, 2014 at EPA-designated stations (United States Environmental Protection Agency, 2012, Table S1) in Lakes Michigan, Huron, and Superior and extracted via ZR-Duet™ DNA/RNA MiniPrep kit (Zymo Research, Irvine, CA, USA). Load of LM29173 was quantified per Hewson et al. (2013a). RNA was reverse transcribed (RT) via Superscript III (Invitrogen, Carlsbad, CA, USA) per manufacturer instructions. Parallel no-RT controls were generated using identical reaction parameters and no reverse transcriptase. cDNA was subjected to duplex RT-qPCR (quantifying both a gene of interest and a reference gene to control for organism variability) using SsoAdvanced™ Universal Probes Supermix (Bio-Rad Laboratories, Hercules, CA, USA). Amplicons were gel-purified (Zymoclean™ Gel DNA Recovery Kit; Zymo Research, Irvine, CA, USA) and cloned (pGEM®-T Easy Vector; Promega, Madison, WI, USA) using JM109 competent E. coli (Invitrogen, Carlsbad, CA, USA). Plasmids were extracted per Zyppy™ Plasmid Miniprep Kit instructions (Zymo Research, Irvine, CA, USA) and Sanger sequenced (Cornell University Core Laboratories Center, Ithaca, NY, USA) to confirm primer/probe specificity. Reaction parameters and primer, probe, and standard sequences are detailed in Table S4.

Samples were run in duplicate with congruent duplicate no-RT controls and quantified using duplicate eight-fold standard dilutions (limits of detection described in Table S4). Ct values, quantity, and standard deviation between technical replicates were determined via StepOnePlus software v.2.3 (Foster City, CA, USA). Valid runs were defined by reaction efficiency >94% and standard regression linearity (R²) > 0.98. Samples were excluded if Ct standard deviation between replicates was >0.5. Quantities were corrected for total extraction and reverse transcription dilutions. Quantities of targets β-actin (ACT), ubiquitin-conjugating enzyme E2 (UBQ), and non-muscular myosin heavy chain (NMHC) were standardized by copy number of elongation factor-1α (EF1A) per reaction.
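The run-validity and replicate-exclusion rules above can be illustrated with a short Python sketch. The text does not say how the instrument software derives efficiency, so the conventional standard-curve relationship, efficiency = 10^(−1/slope) − 1 from a linear regression of Ct on log₁₀(quantity), is used here as an assumption. All function names are hypothetical.

```python
import math
import statistics


def standard_curve(quantities, cts):
    """Least-squares fit of Ct against log10(quantity).

    Returns (slope, r_squared, efficiency), where efficiency follows the
    conventional qPCR formula 10**(-1/slope) - 1 (an assumption here).
    """
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in cts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    r_squared = sxy ** 2 / (sxx * syy)
    efficiency = 10 ** (-1.0 / slope) - 1.0
    return slope, r_squared, efficiency


def run_is_valid(quantities, cts, min_efficiency=0.94, min_r_squared=0.98):
    """Valid runs: efficiency > 94% and R^2 > 0.98, as stated in the text."""
    _, r_squared, efficiency = standard_curve(quantities, cts)
    return efficiency > min_efficiency and r_squared > min_r_squared


def replicates_ok(ct_values, max_sd=0.5):
    """Keep a sample only if the Ct standard deviation between
    technical replicates does not exceed max_sd."""
    return statistics.stdev(ct_values) <= max_sd
```

For duplicate reactions the sample standard deviation equals the Ct difference divided by √2, so a cutoff of 0.5 corresponds to duplicates differing by roughly 0.7 cycles.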

Results and Discussion

Investigation of amphipod transcriptomes revealed differential expression of DNA replication/repair pathways, cytoskeletal architecture, and post-translational modification associated genes in correlation with CRESS-DNA virus load. However, the degree of variability between transcriptomes limited the ability to identify over or underexpression of specific molecular pathways. It is unknown whether vertebrate and invertebrate CRESS-DNA viruses utilize similar pathways of infection, particularly in light of the considerable divergence in sequence homology and genome architecture between groups. Despite this, DEGs in Diporeia transcriptomes were often homologous to DEGs in porcine circoviral infections, or were associated with putative innate immune functions. Expression of these genes varied between amphipod haplotype clades, suggesting that the transcriptional relationship with LM29173 may have a heritable component. It remains unclear if CRESS-DNA viral load corresponds significantly with ecologically relevant changes in invertebrate physiology.

Detection of LM29173

Prevalence and load of LM29173 were significantly greater in Lake Michigan (100%) than Lakes Huron (66.7%; Games–Howell, p < 1 × 10⁻⁸; Ruxton & Beauchamp, 2008) and Superior (30.8%; Games–Howell, p < 1 × 10⁻⁸; Ruxton & Beauchamp, 2008; Welch’s ANOVA, F(2, 26.24) = 26.4, p = 5.21 × 10⁻⁷), congruent with previous observations of the distribution of this genotype (Bistolas et al., 2017). Pilgrim et al. (2009) utilized mitochondrial COI sequences to identify sub-species genetic variation between Diporeia populations among Great Lakes ecosystems, ultimately delineating two clades with distinct haplotype signatures. qPCR results corroborate previous observations that LM29173 is detected in greater abundance in southern lakes haplotype clade populations (Lakes Michigan, Huron, Ontario, Erie, and the Finger Lakes), relative to northern lakes haplotype clade populations (Lake Superior; Bistolas et al., 2017). Because LM29173 was positively detected in all Lake Michigan amphipods, samples with the highest and lowest respective load of LM29173 were utilized for transcriptome preparation (Fig. 2; Fig. S1).

Transcriptome assembly and annotation of DEGs

Sequence reads from twelve Diporeia transcriptomes were collated (n = 14,702,859 after trimming and exclusion of rRNA-like sequences) to de novo assemble 82,074 contigs with a mean length of 310 nt and N50 value of 290 nt (Fig. S2; Tables S2 and S5). Despite rRNA depletion prior to sequencing, computational subtraction of rRNA reads was considerable (0.016–36.95%), but comparable to previously observed proportions in other studies (Schmieder, Lim & Edwards, 2012; Stewart, Ottesen & DeLong, 2010). Less than 1.25% of all rRNA-mapped reads (90% identity, 50% coverage; SILVA rRNA database) were putatively bacterial in origin, indicating that co-infecting microbes may contribute to variation in Diporeia transcriptional profiles. No transcripts of non-target CRESS-DNA viruses were identified. However, 19 unique contigs (376–6,914 nt) shared sequence similarity to putative metazoan-associated RNA viruses when compared to a manually curated database of viral RNA-dependent RNA polymerase sequences (GenBank) or the non-redundant database (BLASTx; e < 1 × 10⁻⁵). These contigs were homologous to members of the Nodaviridae (n = 4), Nyamiviridae (n = 1), Orthomyxoviridae (n = 1), Peribunyaviridae (n = 6), Phenuiviridae (n = 2), and Rhabdoviridae (n = 5), yet it remains unclear if these sequences represent transient/nonpathogenic viruses or specific pathogens of Diporeia. Furthermore, despite methodological biases favoring amplification of encapsidated RNA viruses, read recruitment to these contigs was negligible (5.74 × 10⁻⁵–0.24% of mapped, non-rRNA reads), indicating that these genotypes may have minimal relative impact on overall amphipod transcription.

Due to the lack of a reference Diporeia genome, transcripts were conservatively assembled using an isoform-sensitive algorithm, resulting in a fragmented assembly with multiple isoforms per gene. To reduce redundant read mapping, contigs were grouped into 59,317 isoform clusters prior to recruitment analysis. Library D130 (Lake Michigan site Mi40; Table S5) contained fewer total reads relative to other libraries, but was retained, as relative read recruitment was standardized by sequencing depth per library. The statistical package, EdgeR, detected 2,208 significant DEGs between libraries with high and low LM29173 load among three Great Lakes stations (Table 1). Correlative multidimensional scaling (MDS) analyses indicated that transcriptomes do not cluster by viral presence, viral load, station, or haplotype, likely as a result of high variability in ontogeny and life history between organisms (Fig. S3). Libraries from Mi27 (Lake Michigan) contained roughly six- to seven-fold more DEGs than Mi40 (Lake Michigan) or Su066 (Lake Superior) libraries (Table 1). Of these Mi27 transcripts, 89% were overexpressed in libraries with high LM29173 load (Table 1; Fig. 3) but were small, unannotated, contained no ORFs, and were therefore removed from downstream analyses. Conversely, volcano plots (Fig. 3) illustrate a roughly symmetrical distribution of significantly over- and underexpressed genes in libraries from Mi40 and Su066 relative to viral load.

Figure 3. Volcano plots depicting the distribution of differentially expressed contigs.

Distribution of differentially expressed contigs was determined by EdgeR (Robinson & Smyth, 2007) for libraries from each of three stations: Superior 066 (Su066), Michigan 40 (Mi40), and Michigan 27 (Mi27). Orange points indicate >10-fold differentially expressed contigs (x-axis, as determined via EdgeR); red points indicate significantly differentially expressed contigs (y-axis, FDR-adjusted p < 0.05).

Due to its evolutionary distance from sequenced model organisms, the Diporeia transcriptome remains incompletely annotated. Therefore, DEGs were broadly annotated by putative function using BLASTx via Blast2Go (v.4.0.7). Successfully identified contigs were further assigned to a EuKaryotic Orthologous Group, or “KOG” classification (Joint Genome Institute, Walnut Creek, CA, USA). Contigs that received designations of “general function prediction only” (KOG designation “R”) or “function unknown” (KOG designation “S”) were excluded from analysis. Among remaining functionally annotated contigs (n = 696), the most frequent assignments were replication, recombination and repair (KOG designation “L”, n = 61), cell wall/membrane/envelope biogenesis (KOG designation “M”, n = 32), and post-translational modification, protein turnover, and chaperones (KOG designation “O”, n = 26; Fig. 4). These three functions were further investigated for potential relevance to viral infection.
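The exclusion-and-tally step described above amounts to a filtered count, sketched here in Python. The input format (pairs of contig ID and single-letter KOG designation) and the function name are illustrative assumptions; the contig IDs in any usage would come from the annotation output.

```python
from collections import Counter

# KOG designations excluded from analysis in the text:
# "R" = general function prediction only, "S" = function unknown.
EXCLUDED_KOG = {"R", "S"}


def tally_kog(assignments):
    """Count contigs per KOG designation, skipping excluded categories.

    assignments: iterable of (contig_id, kog_letter) pairs.
    """
    return Counter(kog for _, kog in assignments if kog not in EXCLUDED_KOG)


def top_categories(assignments, k=3):
    """Return the k most frequent KOG designations as (letter, count) pairs."""
    return tally_kog(assignments).most_common(k)
```

As an arithmetic cross-check on the reported figures, 696 annotated contigs out of 2,208 DEGs is 31.5%, the proportion given in the abstract.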

Figure 4. Average amphipod transcript expression in relation to LM29173 load.

Average expression (Log10(RPKM+1)) of contigs in transcriptomes associated with above (grey) and below (white) average LM29173 load. Arrows indicate greater (↑) or reduced (↓) average transformed RPKM in transcriptomes with high LM29173 load relative to transcriptomes with low LM29173 load. Contigs are grouped by putative functional annotation (KOG, EuKaryotic Orthologous Groups), and abbreviations correspond to the following functions: (B) chromatin structure and dynamics, (C) energy production and conversion, (D) cell cycle control, cell division, chromosome partitioning, (F) nucleotide transport and metabolism, (I) lipid transport and metabolism, (J) translation, ribosomal structure and biogenesis, (K) transcription, (L) replication, recombination and repair, (M) cell wall/membrane/envelope biogenesis, (N) cell motility, (O) posttranslational modification, protein turnover, chaperones, (T) signal transduction mechanisms, (U) intracellular trafficking, secretion, and vesicular transport, (V) defense mechanisms, (W) extracellular structures.

DEGs involved in replication, recombination, and repair—KOG “L”

The proportion of DEGs involved in modulating DNA synthesis and stability may indicate that CRESS-DNA viruses alter or manipulate cellular replication pathways. This is congruent with the dynamics of circoviral infections in vertebrates, which exploit cellular DNA damage responses through a complex kinase cascade, triggering apoptosis and ultimately facilitating viral replication (Wei et al., 2016). Contigs homologous to unclassified DNA binding proteins, DNA modification enzymes, nucleases, histone structural components, and mobile elements/DNA translocases were differentially expressed. Several DEGs were responsible for chromatin remodeling, indicating a potential correlation between states of nucleosome packaging and viral load. However, many of these transcripts were associated with opposing functions (e.g., DNA methylases and demethylases) and may target different chromatin residues, rendering it difficult to determine if the presence of LM29173 leads to differential transcription.

DEGs involved in cell wall/membrane/envelope biogenesis—KOG “M”

Several homologs of cell-surface receptors and transmembrane transporters including cubilin, calsyntenin, choline transporters, and G-protein-coupled receptors were differentially expressed in transcriptomes with high LM29173 load. Contigs putatively involved in carapace biogenesis and the production of other structural/connective tissues (keratin, collagen, and elastin), as well as those involved in cell movement and intracellular transport (actin and myosin) were also significantly differentially expressed. These proteins play central roles in cell growth and replication, and differences in their transcription may be an artifact of natural variability between organisms. However, mis-regulation of these proteins is a well-documented response to many metazoan virus infections (Döhner & Sodeik, 2005; Luftig, 1982; Yan, Zhu & Yang, 2014). For example, cellular entry and trafficking of porcine circoviruses is actin and small GTPase-mediated (Misinzo et al., 2009; Yan, Zhu & Yang, 2014). Myosin is also differentially expressed in subclinical PCV-2 infections and may aid in ATP-dependent intracellular transport of viral particles to the nucleus (Arii et al., 2010; Tomás et al., 2009; Vicente-Manzanares et al., 2009; Xiong et al., 2015).

DEGs involved in post-translational modification, protein turnover, and chaperones—KOG “O”

Intracellular transporters are commonly exploited by vertebrate-associated CRESS-DNA viruses to facilitate entry into the nucleus (Cao et al., 2014; Misinzo et al., 2009). A transcript homologous to Ran (Ras-family related GTP-binding nuclear protein) was overexpressed in transcriptomes with moderate and high LM29173 load, and may be implicated in nucleocytoplasmic transport and regulation of cell cycle progression (Avis & Clarke, 1996; Sazer & Dasso, 2000). Likewise, ubiquitin-conjugating enzyme E2 (UBQ) was overexpressed in libraries with high viral load. This enzyme facilitates covalent attachment of ubiquitin to protein substrates (Liu et al., 2007), and may be exploited by viruses to mis-regulate proteolytic degradation, modify chromatin structure, activate NF-κB and other innate immune mechanisms, or advance G2/M-phase cells into S-phase (Cheng et al., 2014; Gao & Luo, 2006). For example, PCV2 encodes a protein (ORF3) that co-localizes and interacts with E3 ubiquitin ligase, resulting in upregulation of p53 and induction of apoptotic programs, presumably benefiting viral egress (Liu et al., 2007). Knockdown of ubiquitin-conjugating enzymes also stalls cells in the G2/M phase, prohibiting PCV2 from accessing S-phase DNA polymerase necessary for viral propagation (Cheng et al., 2014; Liu et al., 2007).

RT-qPCR supports a haplotype-specific relationship between LM29173 load and amphipod gene expression

Viral load correlated with opposite trends in gene expression (average log-transformed RPKM) between Lake Superior and Lake Michigan transcriptomes in all KOG categories with the exception of “chromatin structure and dynamics” (B), “translation, ribosomal structure and biogenesis” (J), and “energy production and conversion” (C; Fig. 4). Unlike Lake Superior libraries, Lake Michigan libraries with high viral load were associated with elevated average RPKM (Fig. 4). Because gene expression in organisms with high viral load may be predicated on population-specific characteristics, we identified 29 common genes differentially expressed in both Lake Michigan stations Mi27 and Mi40 (shared DEGs). However, only one contig (NMHC) was both successfully annotated and potentially affiliated with viral infection (Figs. 5C and 5F). Lake-specific transcriptional profiles confound bulk comparison of gene expression in relation to LM29173 load, and density estimation distributions of individual transcripts indicate that intermediate viral load correlates with increased expression in most KOG classes (Fig. S4). These patterns could indicate that CRESS-DNA virus presence has no appreciable impact on gene expression. Alternatively, because amphipods from Lakes Michigan and Superior belong to potentially phenotypically distinct clades (Pilgrim et al., 2009), these results may indicate that response to environmental and microbial stressors is haplotype-specific.

Figure 5. Relative expression of target genes ACT, UBQ, and NMHC in relation to LM29173 load.

(A–C) Relative expression of target genes β-actin (A; ACT), ubiquitin-conjugating enzyme E2 (B; UBQ), and non-muscular myosin heavy chain (C; NMHC) in relation to expression of reference gene elongation factor-1α (EF1A) in specimens from two haplotype clusters (northern and southern) with above (grey) and below (white) average LM29173 copy number (±1 SE). (D–F) Correlation between viral load and relative expression of ACT (D), UBQ (E), and NMHC (F) in relation to EF1A reference gene expression. Quantities of target amplicons were standardized by reference gene EF1A using the following equation: $(\mathrm{Target}_{RT} - \mathrm{Target}_{NRT}) / (\mathrm{EF1A}_{RT} - \mathrm{EF1A}_{NRT})$, where RT and NRT indicate samples that have been reverse transcribed via Superscript III (Invitrogen, Carlsbad, CA, USA), or not reverse transcribed (no-RT control), respectively. (G) Gene expression (reads per kilobase of transcript per million mapped reads; RPKM) of ACT, NMHC, and UBQ per transcriptome library in each of three stations: Lake Superior station 066 (SU066), Lake Michigan station 40 (Mi40) and Lake Michigan station 27 (Mi27). Libraries are ranked from left to right by increasing LM29173 load.

RT-qPCR quantification of ACT, NMHC, and UBQ confirmed opposite trends in contig expression in correlation with above average viral load among amphipods from Lakes Michigan and Superior (Fig. 5). Contigs DN12114c1g3i8 (230 nt), DN12352c3g10i1 (1,229 nt), and DN135c0g1i1 (309 nt) exhibited sequence similarity to ACT from penaeid blue shrimp (Litopenaeus stylirostris; BLASTx, e-value 6 × 10⁻⁵⁰), NMHC from freshwater amphipods (Hyalella azteca; e-value 3.0 × 10⁻⁸), and UBQ from freshwater amphipods (H. azteca; e-value 4 × 10⁻⁵²), respectively. Relative expression of ACT, NMHC, and UBQ did not significantly correlate with viral load, suggesting either that LM29173 likely does not specifically alter transcription of these genes, or that the practice of whole-organism RNA extraction obscures cell-specific response(s) to viral presence (Fig. 5). However, relative expression of target genes varied in relation to amphipod population. Organisms associated with the southern haplotype clade exhibited greater average NMHC and UBQ expression, but diminished average ACT expression in concurrence with high viral copy number, relative to amphipods associated with the northern haplotype clade (p > 0.05, Welch’s t-test for all pairwise comparisons).

Expression of target genes ACT, NMHC, and UBQ was standardized to expression of contig DN11198c0g1i1 (723 nt), a homolog of elongation factor-1α (EF1A) from H. azteca (BLASTx, e-value 2 × 10⁻¹³⁹). This constitutively expressed gene has been validated as an invariant internal RT-qPCR control under experimental conditions in decapods (Leelatanawit et al., 2012), and provided adequate reference to the baseline transcriptional activity of Diporeia, as expression did not correlate with amphipod wet weight, lake, or LM29173 load (Fig. S5). Variability in ACT, NMHC, and UBQ expression may be a result of nonspecific RNA extraction, which confounds assessments of specific impact(s) of viral presence on single tissue types or cells. Additionally, RT-qPCR cannot detect changes in the intracellular localization of myosin subunits or the state of polymerization of actin subunits, and additional investigation via microscopy and proteomics may be warranted.
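The standardization to EF1A can be illustrated with a short sketch of the equation given in the Fig. 5 legend, (Target_RT − Target_NRT)/(EF1A_RT − EF1A_NRT). The function name, the guard against a non-positive reference signal, and the example quantities are assumptions made for illustration and do not come from the study.

```python
def relative_expression(target_rt, target_nrt, ref_rt, ref_nrt):
    """Compute (Target_RT - Target_NRT) / (EF1A_RT - EF1A_NRT).

    RT  = quantity measured in the reverse-transcribed sample.
    NRT = quantity measured in the matching no-RT control.
    """
    ref_signal = ref_rt - ref_nrt
    # Assumed safeguard: a reference signal at or below its no-RT control
    # cannot be used as a denominator.
    if ref_signal <= 0:
        raise ValueError("reference signal must exceed its no-RT control")
    return (target_rt - target_nrt) / ref_signal


# Hypothetical amplicon quantities (copies per reaction), for illustration only.
ratio = relative_expression(target_rt=5200, target_nrt=200,
                            ref_rt=10500, ref_nrt=500)
print(ratio)  # 0.5
```

Subtracting the no-RT control from each quantity removes signal that does not derive from reverse-transcribed RNA before the ratio is taken.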

Expression of amphipod innate immunity regulators and effectors

Diporeia transcriptomes were surveyed for homologs of genes involved in crustacean innate immunity to determine if LM29173 presence correlates with immune-specific gene expression. About 148 homologs (BLASTx e < 1 × 10⁻⁵) were identified and exhibited >2-fold differential expression in both Lake Michigan stations, Mi27 and Mi40. Genes involved in stress response (heat shock or oxidative stress response), immune-specific signaling and post-translational modification, and immune-associated cell structure, mobility, and intracellular trafficking mechanisms were consistently overexpressed in Lake Michigan libraries with high viral load (Fig. S6). Correlative evidence that these immune-related genes are overexpressed in association with high viral load does not preclude the possibility of other co-occurring immune demands. Therefore, it is unclear whether overexpression of these genes is a product of environmental stress or a specific response to viral infection.
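The selection criterion described above (>2-fold differential expression in both Lake Michigan stations) can be sketched as a simple filter. This is an illustration under stated assumptions, not the pipeline used in the study: the function names, the pseudocount of 1 RPKM, the requirement that the direction of change agree between stations, and the contig names and values are all hypothetical.

```python
from math import log2


def log2_fold_change(high_load, low_load, pseudocount=1.0):
    """log2 ratio of RPKM in high- vs. low-load libraries (assumed pseudocount)."""
    return log2((high_load + pseudocount) / (low_load + pseudocount))


def consistent_de(rpkm, stations=("Mi27", "Mi40"), threshold=1.0):
    """Keep contigs changed >2-fold (|log2 FC| > 1) in the same direction
    at every listed station.

    rpkm maps contig -> station -> (high_load_rpkm, low_load_rpkm).
    """
    selected = {}
    for contig, by_station in rpkm.items():
        lfcs = [log2_fold_change(*by_station[s]) for s in stations]
        if all(lfc > threshold for lfc in lfcs):
            selected[contig] = "over"
        elif all(lfc < -threshold for lfc in lfcs):
            selected[contig] = "under"
    return selected


# Hypothetical RPKM values, for illustration only.
example = {
    "contig_a": {"Mi27": (30.0, 5.0), "Mi40": (12.0, 2.0)},  # up at both stations
    "contig_b": {"Mi27": (4.0, 20.0), "Mi40": (1.0, 9.0)},   # down at both stations
    "contig_c": {"Mi27": (30.0, 5.0), "Mi40": (6.0, 5.0)},   # up at Mi27 only
}
print(consistent_de(example))
```

Requiring agreement across both stations is a conservative choice that discards contigs whose change is specific to a single sampling site.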

It remains unclear to what extent LM29173 impacts lake-wide Diporeia population dynamics. However, the presence of LM29173 among stable amphipod populations and negligible changes in expression of specific amphipod disease pathways in relation to this viral genotype likely indicate that LM29173 is not solely responsible for Diporeia decline in the Laurentian Great Lakes. We posit that CRESS-DNA viruses associated with Diporeia may play a subtle role, if any, in altering amphipod physiology. This observation corroborates data from well-characterized mammalian CRESS-DNA viruses (PCV1; Allan & Ellis, 2000; TTV; Okamoto, 2009), which often manifest asymptomatically in healthy host tissue. We speculate that LM29173, like other CRESS-DNA viruses, may evade host clearance, attenuate innate immune responses, or elicit host tolerance through post-transcriptional or translational gene regulation, ultimately establishing persistent and asymptomatic infections (Brajão de Oliveira, 2015; Okamoto, 2009). This hypothesis may explain the universal prevalence and diversity of these viruses in aquatic ecosystems, as observed by metaviromic sequencing.

In summary, while LM29173 load does not correlate with significant differential expression of specific gene pathways, transcriptional changes in genes involved in several physiological functions, including innate immunity, are detectable and specific to distinct haplotype clades. To our knowledge, this study reports the first investigation of the transcriptional relationship between invertebrates and associated CRESS-DNA viruses in natural ecosystems. This study also provides several potential transcriptional targets for further gene- and pathway-specific investigation to determine whether the bulk of these novel viruses have little effect on metazoan gene expression or physiology.

Supplemental Information

Supplemental Information 1. Quantitation of LM29173 prevalence and load.

(A–C) Load of LM29173 copy number animal⁻¹ or mg⁻¹ (±1 SE between quadruplicate technical replicates) in organisms from stations Superior 066 (A, Su066), Michigan 40 (B, Mi40), and Michigan 27 (C, Mi27). (D) Boxplot illustrating distribution of viral load among amphipods from three stations. Outliers (< average and > average LM29173 genome copies animal⁻¹) were selected for transcriptome preparation.

DOI: 10.7717/peerj.3810/supp-1
Supplemental Information 2. Ranked abundance of reads (log10 transformed) mapped to individual contigs.
DOI: 10.7717/peerj.3810/supp-2
Supplemental Information 3. Correlation matrix (multi-dimensional scaling; MDS) exhibiting the normalized degree of variation between libraries grouped by lake (A) and station (B).

Plots were generated via CLC workbench (v. 8.5.1, Qiagen, Hilden, Germany) with default parameters.

DOI: 10.7717/peerj.3810/supp-3
Supplemental Information 4. Density estimation distributions of contigs in relation to LM29173 load.

(A) Density estimation distributions depicting expression (log10(RPKM + 1)) of contigs associated with 15 KOG classes at a range of viral loads (LM29173 load organism⁻¹; n = 12 transcriptomes). (B) Density estimation distributions of RT-qPCR target contigs UBQ and ACT depicting expression (log10(RPKM + 1)) at a range of viral loads (LM29173 load organism⁻¹; n = 12 transcriptomes).

DOI: 10.7717/peerj.3810/supp-4
Supplemental Information 5. Expression of reference gene elongation factor-1α (EF1A) relative to organism wet weight and LM29173 load mg⁻¹ (log10(x + 1) transformed).

EF1A expression is reported as the quantitated difference between reverse transcribed and non-reverse transcribed RNA extractions, i.e., EF1A_RT − EF1A_NRT, where RT and NRT indicate samples that have been reverse transcribed via Superscript III (Invitrogen, Carlsbad, CA, USA) or not reverse transcribed (no-RT control), respectively.

DOI: 10.7717/peerj.3810/supp-5
Supplemental Information 6. Heat map depicting expression (RPKM) of differentially expressed contigs (>2-fold over or underexpressed in both Lake Michigan stations) homologous to invertebrate innate immunity effectors (BLASTx e-value < 1 × 10⁻⁵) standardized by total RPKM.

Contigs are grouped by putative function: (A) immune response regulation, (B) antimicrobial peptides, (C) phagocytosis, (D) protease/chitinase, (E) JAK/STAT pathway, JNK pathway, (F) Toll pathway, (G) prophenoloxidase system, (H) stress/oxidative damage response, (I) transcription factors, (J) signaling, post-translational modification, (K) cell structure, mobility, and intracellular trafficking, (L) receptors, and (M) cell cycle. Image generated by web-based visualization tool, Morpheus (Broad Institute, Cambridge, MA, USA).

DOI: 10.7717/peerj.3810/supp-6
Supplemental Information 7. Amphipod collection site details (congruent with EPA-Great Lakes National Program Office designated stations).

Organisms were acquired via Ponar benthic sampler from the R/V Lake Guardian between August–September, 2014 (n = 98). RT-qPCR (n) refers to the number of samples per station allocated to RT-qPCR. HTS (high throughput sequencing) refers to stations where amphipods were collected for transcriptome preparation and sequencing. Haplotype was determined via cytochrome c oxidase I (COI) sequencing (Pilgrim et al., 2009).

DOI: 10.7717/peerj.3810/supp-7
Supplemental Information 8. Transcriptome assembly statistics.

Read libraries were pooled and assembled de novo using de Bruijn graphs integrated into Trinity v.2.4.0, a tripartite assembly program (software modules: Inchworm, Chrysalis, and Butterfly) implemented on the Galaxy bioinformatics platform per default parameters (National Center for Genome Analysis Support, Indiana University Pervasive Technology Institute; Trinity --max_memory 240G --CPU 8 --normalize_reads --monitoring --seqType seq_type --single singlefile or --left left_file --right right_file).

DOI: 10.7717/peerj.3810/supp-8
Supplemental Information 9. NCBI protein database keyword queries.

Reference sequences fitting both keyword 1 and 2 (e.g. “Crustacea” + “Toll like receptor”) were collected and collated from the NCBI protein repository as a BLAST database to identify transcriptome contigs affiliated with putative immune functions.

DOI: 10.7717/peerj.3810/supp-9
Supplemental Information 10. qPCR and RT-qPCR primer/probe sequences and reaction parameters.

All reactions were 25 μL and included SsoAdvanced™ Universal Probes Supermix (Bio-Rad Laboratories, Hercules, CA, USA) with 2 μM primer/probe oligo (Eurofins Scientific, Luxembourg City, Luxembourg) per reaction. Reaction efficiencies of duplex reactions were comparable to those when reactions were run independently. Quantities of target amplicons were standardized by reference gene EF1A using the following equation: (Target_RT − Target_NRT)/(EF1A_RT − EF1A_NRT), where RT and NRT indicate samples that have been reverse transcribed via Superscript III (Invitrogen, Carlsbad, CA, USA), or not reverse transcribed (no-RT control), respectively. LLOD specifies average lower limit of detection (Ct) across all runs containing the indicated primer/probe set and the corresponding amplicon copy number. Samples with Ct values > LLOD were designated “no detection” (negative). Average threshold Ct indicates ΔRn where quantity was determined (per StepOnePlus software v. 2.3; Foster City, CA, USA).

DOI: 10.7717/peerj.3810/supp-10
Supplemental Information 11. Summary of Diporeia transcriptomes.

Reads were trimmed using CLC workbench (v. 8.5.1, Qiagen, Hilden, Germany: quality limit 0.05, no ambiguous nucleotides, maximum read length 251 nt, discard reads <50 nt), and assembled de novo using Trinity on the Galaxy bioinformatics platform per default parameters (National Center for Genome Analysis Support, Indiana University Pervasive Technology Institute, USA).

DOI: 10.7717/peerj.3810/supp-11
Supplemental Information 12. Raw quantitation data (Ct values) for transcript-specific RT-qPCR.
DOI: 10.7717/peerj.3810/supp-12

Acknowledgments

We would like to thank Dr. Gary Blissard and Elliot Jackson for indispensable insight into transcriptome analysis and manuscript revision, and Dr. Jim Watkins and the crew of the R/V Guardian (2014) for assistance with sample collection and processing. We also worked closely with the Core Genomics Facility (BRC) at Cornell for sequencing support.

Funding Statement

This research is supported by the National Science Foundation (NSF-135696 and DGE-1144153) with additional support from the U.S. Environmental Protection Agency, Cooperative Agreement GL 00E01184-0 to Cornell University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Additional Information and Declarations

Competing Interests

The authors declare that they have no competing interests.

Author Contributions

Kalia S.I. Bistolas conceived and designed the experiments, performed the experiments, analyzed the data, wrote the paper, and prepared figures and/or tables.

Lars G. Rudstam conceived and designed the experiments, contributed reagents/materials/analysis tools, and reviewed drafts of the paper.

Ian Hewson conceived and designed the experiments, analyzed the data, contributed reagents/materials/analysis tools, and reviewed drafts of the paper.

DNA Deposition

The following information was supplied regarding the deposition of DNA sequences:

Transcriptome sequence data were deposited in GenBank (https://www.ncbi.nlm.nih.gov/genbank/) under accession numbers: PRJNA379017; SRR5341776–SRR5341788.

Data Availability

The following information was supplied regarding data availability:

The raw quantitation data (Ct values) for transcript-specific RT-qPCR has been uploaded as Supplemental Dataset Files.

References

  • Allan & Ellis (2000). Allan GM, Ellis JA. Porcine circoviruses: a review. Journal of Veterinary Diagnostic Investigation. 2000;12(1):3–14. doi: 10.1177/104063870001200102.
  • Altschul et al. (1990). Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of Molecular Biology. 1990;215(3):403–410. doi: 10.1016/s0022-2836(05)80360-2.
  • Arii et al. (2010). Arii J, Goto H, Suenaga T, Oyama M, Kozuka-Hata H, Imai T, Minowa A, Akashi H, Arase H, Kawaoka Y, Kawaguchi Y. Non-muscle myosin IIA is a functional entry receptor for herpes simplex virus-1. Nature. 2010;467(7317):859–862. doi: 10.1038/nature09420.
  • Auer et al. (2013). Auer MT, Auer NA, Urban NR, Auer T. Distribution of the amphipod Diporeia in Lake Superior: the ring of fire. Journal of Great Lakes Research. 2013;39(1):33–46. doi: 10.1016/j.jglr.2012.12.020.
  • Avis & Clarke (1996). Avis JM, Clarke PR. Ran, a GTPase involved in nuclear processes: its regulators and effectors. Journal of Cell Science. 1996;109:2423–2427. doi: 10.1242/jcs.109.10.2423.
  • Barbiero et al. (2011). Barbiero RP, Schmude K, Lesht BM, Riseng CM, Warren GJ, Tuchman ML. Trends in Diporeia populations across the Laurentian Great Lakes, 1997–2009. Journal of Great Lakes Research. 2011;37(1):9–17. doi: 10.1016/j.jglr.2010.11.009.
  • Birkett, Lozano & Rudstam (2015). Birkett K, Lozano SJ, Rudstam LG. Long-term trends in Lake Ontario’s benthic macroinvertebrate community from 1994–2008. Aquatic Ecosystem Health & Management. 2015;18:76–88. doi: 10.1080/14634988.2014.965122.
  • Bistolas et al. (2017). Bistolas KSI, Jackson EW, Watkins JM, Rudstam LG, Hewson I. Distribution of circular single-stranded DNA viruses associated with benthic amphipods of genus Diporeia in the Laurentian Great Lakes. Freshwater Biology. 2017;62(7):1220–1231. doi: 10.1111/fwb.12938.
  • Brajão de Oliveira (2015). Brajão de Oliveira K. Torque teno virus: a ubiquitous virus. Revista Brasileira de Hematologia e Hemoterapia. 2015;37(6):357–358. doi: 10.1016/j.bjhh.2015.07.009.
  • Breitbart et al. (2015). Breitbart M, Benner BE, Jernigan PE, Rosario K, Birsa LM, Harbeitner RC, Fulford S, Graham C, Walters A, Goldsmith DB, Berger SA, Nejstgaard JC. Discovery, prevalence, and persistence of novel circular single-stranded DNA viruses in the ctenophores Mnemiopsis leidyi and Beroe ovata. Frontiers in Microbiology. 2015;6:1427. doi: 10.3389/fmicb.2015.01427.
  • Brucker et al. (2012). Brucker RM, Funkhouser LJ, Setia S, Pauly R, Bordenstein SR. Insect innate immunity database (IIID): an annotation tool for identifying immune genes in insect genomes. PLOS ONE. 2012;7:e45125. doi: 10.1371/journal.pone.0045125.
  • Cao et al. (2014). Cao J, Lin C, Wang H, Wang L, Zhou N, Jin Y, Liao M, Zhou J. Circovirus transport proceeds via direct interaction of the cytoplasmic dynein IC1 subunit with the viral capsid protein. Journal of Virology. 2014;89(5):2777–2791. doi: 10.1128/jvi.03117-14.
  • Cheng et al. (2014). Cheng S, Yan W, Gu W, He Q. The ubiquitin-proteasome system is required for the early stages of porcine circovirus type 2 replication. Virology. 2014;456–457:198–204. doi: 10.1016/j.virol.2014.03.028.
  • Dayaram et al. (2016). Dayaram A, Galatowitsch ML, Argüello-Astorga GR, van Bysterveldt K, Kraberger S, Stainton D, Harding JS, Roumagnac P, Martin DP, Lefeuvre P, Varsani A. Diverse circular replication-associated protein encoding viruses circulating in invertebrates within a lake ecosystem. Infection, Genetics and Evolution. 2016;39:304–316. doi: 10.1016/j.meegid.2016.02.011.
  • Döhner & Sodeik (2005). Döhner K, Sodeik B. The role of the cytoskeleton during viral infection. Current Topics in Microbiology and Immunology. 2005;285:67–108. doi: 10.1007/3-540-26764-6_3.
  • Dunlap et al. (2013). Dunlap DS, Ng TFF, Rosario K, Barbosa JG, Greco AM, Breitbart M, Hewson I. Molecular and microscopic evidence of viruses in marine copepods. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(4):1375–1380. doi: 10.1073/pnas.1216595110.
  • Eaglesham & Hewson (2013). Eaglesham JB, Hewson I. Widespread detection of circular replication initiator protein (rep)-encoding ssDNA viral genomes in estuarine, coastal and open ocean net plankton. Marine Ecology Progress Series. 2013;494:65–72. doi: 10.3354/meps10575.
  • Eastwood et al. (2014). Eastwood JR, Berg ML, Ribot RFH, Raidal SR, Buchanan KL, Walder KR, Bennett ATD. Phylogenetic analysis of beak and feather disease virus across a host ring-species complex. Proceedings of the National Academy of Sciences of the United States of America. 2014;111(39):14153–14158. doi: 10.1073/pnas.1403255111.
  • Fahsbender et al. (2015). Fahsbender E, Hewson I, Rosario K, Tuttle AD, Varsani A, Breitbart M. Discovery of a novel circular DNA virus in the Forbes sea star, Asterias forbesi. Archives of Virology. 2015;160(9):2349–2351. doi: 10.1007/s00705-015-2503-2.
  • Kibenge & Godoy (2016). Kibenge FSB, Godoy MG. Aquaculture Virology. London: Academic Press, Elsevier; 2016.
  • Gao & Luo (2006). Gao G, Luo H. The ubiquitin–proteasome pathway in viral infections. Canadian Journal of Physiology and Pharmacology. 2006;84:5–14. doi: 10.1139/y05-144.
  • Gardner et al. (1985). Gardner WS, Nalepa TF, Frez WA, Cichocki EA, Landrum PF. Seasonal patterns in lipid content of Lake Michigan macroinvertebrates. Canadian Journal of Fisheries and Aquatic Sciences. 1985;42(11):1827–1832. doi: 10.1139/f85-229.
  • Guiguer & Barton (2002). Guiguer KRRA, Barton DR. The trophic role of Diporeia (Amphipoda) in Colpoys Bay (Georgian Bay) benthic food web: a stable isotope approach. Journal of Great Lakes Research. 2002;28(2):228–239. doi: 10.1016/s0380-1330(02)70579-0.
  • Halfon, Schito & Ulanowicz (1996). Halfon E, Schito N, Ulanowicz RE. Energy flow through the Lake Ontario food web: conceptual model and an attempt at mass balance. Ecological Modelling. 1996;86(1):1–36. doi: 10.1016/0304-3800(94)00195-2.
  • Hewson et al. (2013a). Hewson I, Eaglesham JB, Höök TO, LaBarre BA, Sepúlveda MS, Thompson PD, Watkins JM, Rudstam LG. Investigation of viruses in Diporeia spp. from the Laurentian Great Lakes and Owasco Lake as potential stressors of declining populations. Journal of Great Lakes Research. 2013a;39(3):499–506. doi: 10.1016/j.jglr.2013.06.006.
  • Hewson et al. (2013b). Hewson I, Ng G, Li W, LaBarre BA, Aguirre I, Barbosa JG, Breitbart M, Greco AW, Kearns CM, Looi A, Schaffner LR, Thompson PD, Hairston NG. Metagenomic identification, seasonal dynamics, and potential transmission mechanisms of a Daphnia-associated single-stranded DNA virus in two temperate lakes. Limnology and Oceanography. 2013b;58(5):1605–1620. doi: 10.4319/lo.2013.58.5.1605.
  • Jackson et al. (2016). Jackson EW, Bistolas KS, Button JB, Hewson I. Novel circular single-stranded DNA viruses among an asteroid, echinoid and holothurian (Phylum: Echinodermata). PLOS ONE. 2016;11(11):e0166093. doi: 10.1371/journal.pone.0166093.
  • Labonté & Suttle (2013). Labonté JM, Suttle CA. Previously unknown and highly divergent ssDNA viruses populate the oceans. ISME Journal. 2013;7(11):2169–2177. doi: 10.1038/ismej.2013.110.
  • Leelatanawit et al. (2012). Leelatanawit R, Klanchui A, Uawisetwathana U, Karoonuthaisiri N. Validation of reference genes for real time PCR of reproductive system in the black tiger shrimp. PLOS ONE. 2012;7(12):e52677. doi: 10.1371/journal.pone.0052677.
  • Liu et al. (2007). Liu J, Zhu Y, Chen I, Lau J, He F, Lau A, Wang Z, Karuppannan AK, Kwang J. The ORF3 protein of porcine circovirus type 2 interacts with porcine ubiquitin E3 ligase Pirh2 and facilitates p53 expression in viral infection. Journal of Virology. 2007;81(17):9560–9567. doi: 10.1128/jvi.00681-07.
  • Luftig (1982). Luftig RB. Does the cytoskeleton play a significant role in animal virus replication? Journal of Theoretical Biology. 1982;99(1):173–191. doi: 10.1016/0022-5193(82)90397-6.
  • McTaggart et al. (2009). McTaggart SJ, Conlon C, Colbourne JK, Blaxter ML, Little TJ. The components of the Daphnia pulex immune system as revealed by complete genome sequencing. BMC Genomics. 2009;10:175. doi: 10.1186/1471-2164-10-175.
  • Misinzo et al. (2009). Misinzo G, Delputte PL, Lefebvre DJ, Nauwynck HJ. Porcine circovirus 2 infection of epithelial cells is clathrin-, caveolae- and dynamin-independent, actin and Rho-GTPase-mediated, and enhanced by cholesterol depletion. Virus Research. 2009;139(1):1–9. doi: 10.1016/j.virusres.2008.09.005.
  • Okamoto (2009). Okamoto H. History of discoveries and pathogenicity of TT viruses. Current Topics in Microbiology and Immunology. 2009;331:1–20. doi: 10.1007/978-3-540-70972-5_1.
  • Pilgrim et al. (2009). Pilgrim EM, Scharold JV, Darling JA, Kelly JR. Genetic structure of the benthic amphipod Diporeia (Amphipoda: Pontoporeiidae) and its relationship to abundance in Lake Superior. Canadian Journal of Fisheries and Aquatic Sciences. 2009;66(8):1318–1327. doi: 10.1139/f09-086.
  • Robinson & Smyth (2007). Robinson MD, Smyth GK. Small-sample estimation of negative binomial dispersion, with applications to SAGE data. Biostatistics. 2007;9(2):321–332. doi: 10.1093/biostatistics/kxm030.
  • Rosario & Breitbart (2011). Rosario K, Breitbart M. Exploring the viral world through metagenomics. Current Opinion in Virology. 2011;1(4):289–297. doi: 10.1016/j.coviro.2011.06.004.
  • Rosario et al. (2017). Rosario K, Breitbart M, Harrach B, Segalés J, Delwart E, Biagini P, Varsani A. Revisiting the taxonomy of the family Circoviridae: establishment of the genus Cyclovirus and removal of the genus Gyrovirus. Archives of Virology. 2017;162(5):1447–1463. doi: 10.1007/s00705-017-3247-y.
  • Rosario, Duffy & Breitbart (2012). Rosario K, Duffy S, Breitbart M. A field guide to eukaryotic circular single-stranded DNA viruses: insights gained from metagenomics. Archives of Virology. 2012;157:1851–1871. doi: 10.1007/s00705-012-1391-y.
  • Rosario et al. (2015). Rosario K, Schenck RO, Harbeitner RC, Lawler SN, Breitbart M. Novel circular single-stranded DNA viruses identified in marine invertebrates reveal high sequence diversity and consistent predicted intrinsic disorder patterns within putative structural proteins. Frontiers in Microbiology. 2015;6:696. doi: 10.3389/fmicb.2015.00696.
  • Roux et al. (2016). Roux S, Solonenko NE, Dang VT, Poulos BT, Schwenck SM, Goldsmith DB, Coleman ML, Breitbart M, Sullivan MB. Towards quantitative viromics for both double-stranded and single-stranded DNA viruses. PeerJ. 2016;4:e2777. doi: 10.7717/peerj.2777.
  • Ruxton & Beauchamp (2008). Ruxton GD, Beauchamp G. Time for some a priori thinking about post hoc testing. Behavioral Ecology. 2008;19(3):690–693. doi: 10.1093/beheco/arn020.
  • Sazer & Dasso (2000). Sazer S, Dasso M. The ran decathlon: multiple roles of ran. Journal of Cell Science. 2000;113:1111–1118. doi: 10.1242/jcs.113.7.1111.
  • Schmieder, Lim & Edwards (2012). Schmieder R, Lim YW, Edwards R. Identification and removal of ribosomal RNA sequences from metatranscriptomes. Bioinformatics. 2012;28(3):433–435. doi: 10.1093/bioinformatics/btr669.
  • Soffer et al. (2013). Soffer N, Brandt ME, Correa AMS, Smith TB, Thurber RV. Potential role of viruses in white plague coral disease. ISME Journal. 2013;8(2):271–283. doi: 10.1038/ismej.2013.137.
  • Stewart, Ottesen & DeLong (2010). Stewart FJ, Ottesen EA, DeLong EF. Development and quantitative analyses of a universal rRNA-subtraction protocol for microbial metatranscriptomics. ISME Journal. 2010;4(7):896–907. doi: 10.1038/ismej.2010.18.
  • Tomás et al. (2009). Tomás A, Fernandes LT, Sánchez A, Segalés J. Time course differential gene expression in response to porcine circovirus type 2 subclinical infection. Veterinary Research. 2009;41(1):12. doi: 10.1051/vetres/2009060.
  • United States Environmental Protection Agency (2012). United States Environmental Protection Agency. Quality Assurance Project Plan for the Great Lakes Water Quality Surveys 2008–2012. Washington, D.C.: United States Environmental Protection Agency, Office of Water, Office of Wetlands, Oceans and Watersheds. Appendix B; 2012.
  • Vicente-Manzanares et al. (2009). Vicente-Manzanares M, Ma X, Adelstein RS, Horwitz AR. Non-muscle myosin II takes centre stage in cell adhesion and migration. Nature Reviews Molecular Cell Biology. 2009;10(11):778–790. doi: 10.1038/nrm2786.
  • Wei et al. (2016). Wei L, Zhu S, Wang J, Quan R, Yan X, Li Z, Hou L, Wang N, Yang Y, Jiang H, Liu J. Induction of a cellular DNA damage response by porcine circovirus type 2 facilitates viral replication and mediates apoptotic responses. Scientific Reports. 2016;6(1):39444. doi: 10.1038/srep39444.
  • Wells (1980). Wells L. Food of alewives, yellow perch, spottail shiners, troutperch, and slimy and fourhorn sculpins in southeastern Lake Michigan. Technical Paper no. 98. Washington, D.C.: U.S. Fish and Wildlife Service; 1980.
  • Xiong et al. (2015). Xiong D, Du Y, Wang HB, Zhao B, Zhang H, Li Y, Hu LJ, Cao J-Y, Zhong Q, Liu WL, Li MZ, Zhu XF, Tsao SW, Hutt-Fletcher LM, Song E, Zeng YX, Kieff E, Zeng MS. Nonmuscle myosin heavy chain IIA mediates Epstein–Barr virus infection of nasopharyngeal epithelial cells. Proceedings of the National Academy of Sciences of the United States of America. 2015;112:11036–11041. doi: 10.1073/pnas.1513359112.
  • Yan, Zhu & Yang (2014). Yan M, Zhu L, Yang Q. Infection of porcine circovirus 2 (PCV2) in intestinal porcine epithelial cell line (IPEC-J2) and interaction between PCV2 and IPEC-J2 microfilaments. Virology Journal. 2014;11(1):193. doi: 10.1186/s12985-014-0193-0.


Articles from PeerJ are provided here courtesy of PeerJ, Inc
