Skip to main content
Journal of Virology logoLink to Journal of Virology
. 2018 Jul 31;92(16):e00145-18. doi: 10.1128/JVI.00145-18

Insights into Circovirus Host Range from the Genomic Fossil Record

Tristan P W Dennis a,#, Peter J Flynn b,c,#, William Marciel de Souza a,d, Joshua B Singer a, Corrie S Moreau b, Sam J Wilson a, Robert J Gifford a,
Editor: Frank Kirchhoffe
PMCID: PMC6069186  PMID: 29875243

Advances in DNA sequencing have dramatically increased the rate at which new viruses are being identified. However, the host species associations of most virus sequences identified in metagenomic samples are difficult to determine. Our analysis indicates that viruses proposed to infect vertebrates (in some cases being linked to human disease) may in fact be restricted to arthropod hosts. The detection of these sequences in vertebrate samples may reflect their widespread presence in the environment as viruses of parasitic arthropods.

KEYWORDS: EVE, circovirus, cyclovirus, diversity, endogenous, evolution, metagenomics

ABSTRACT

A diverse range of DNA sequences derived from circoviruses (family Circoviridae) has been identified in samples obtained from humans and domestic animals, often in association with pathological conditions. In the majority of cases, however, little is known about the natural biology of the viruses from which these sequences are derived. Endogenous circoviral elements (CVe) are DNA sequences derived from circoviruses that occur in animal genomes and provide a useful source of information about circovirus-host relationships. In this study, we screened genome assemblies of 675 animal species and identified numerous circovirus-related sequences, including the first examples of CVe derived from cycloviruses. We confirmed the presence of these CVe in the germ line of the elongate twig ant (Pseudomyrmex gracilis), thereby establishing that cycloviruses infect insects. We examined the evolutionary relationships between CVe and contemporary circoviruses, showing that CVe from ants and mites group relatively closely with cycloviruses in phylogenies. Furthermore, the relatively random interspersion of CVe from insect genomes with cyclovirus sequences recovered from vertebrate samples suggested that contamination might be an important consideration in studies reporting these viruses. Our study demonstrates how endogenous viral sequences can inform metagenomics-based virus discovery. In addition, it raises doubts about the role of cycloviruses as pathogens of humans and other vertebrates.

IMPORTANCE Advances in DNA sequencing have dramatically increased the rate at which new viruses are being identified. However, the host species associations of most virus sequences identified in metagenomic samples are difficult to determine. Our analysis indicates that viruses proposed to infect vertebrates (in some cases being linked to human disease) may in fact be restricted to arthropod hosts. The detection of these sequences in vertebrate samples may reflect their widespread presence in the environment as viruses of parasitic arthropods.

INTRODUCTION

Circoviruses (family Circoviridae) are small, nonenveloped viruses with circular, single-stranded DNA (ssDNA) genomes ∼1.8 to ∼2.1 kb in length. Circovirus genomes encode two major proteins: replication-associated protein (Rep) and capsid (Cap), responsible for genome replication and particle formation, respectively. Transcription is bidirectional, with the rep gene being encoded in the forward sense and the cap gene being encoded in the complementary sense (1, 2).

The family Circoviridae contains two recognized genera: Circovirus and Cyclovirus (1). The genus Circovirus includes pathogenic viruses of vertebrates, such as porcine circovirus 2 (PCV-2), which causes postweaning multisystemic wasting syndrome in swine. The genus Cyclovirus, in contrast, is comprised entirely of viruses that have been identified only via sequencing and for which host species associations are less clear. Nevertheless, cycloviruses have frequently been associated with pathogenic conditions in humans and domestic mammals. For example, cyclovirus sequences have been detected in the cerebrospinal fluid of humans suffering from acute central nervous system disease in Vietnam and Malawi (3, 4). Cyclovirus sequences have also been reported in association with numerous other outbreaks of disease in humans and domestic mammals (57).

Sequences derived from circoviruses have been shown to be present in the genomes of many eukaryotic species (8, 9). These endogenous circoviral elements (CVe) are thought to be derived from the genomes of ancient circoviruses that were, by one means or another, ancestrally integrated into the nuclear genome of germ line cells (10). CVe can provide unique information about the long-term coevolutionary relationships between viruses and hosts; for example, the identification of ancient CVe in vertebrate genomes shows that viruses in the genus Circovirus have been coevolving with vertebrate hosts for millions of years (11).

We recently reported the results of a study in which we systematically screened vertebrate whole-genome sequence (WGS) data for CVe (11). Here, we expanded this screen to include a total of 675 animal genomes, including 307 invertebrate species. Via screening, we identified novel examples of sequences derived from circoviruses, cycloviruses, and the more divergent circular Rep-encoding single-stranded DNA (CRESS-DNA) group. We examine the phylogenetic relationships between these sequences, well-studied circovirus isolates, and circovirus-related sequences recovered via metagenomic sequencing of environmental samples or animal tissues. Our analysis raises important questions about the origins of cyclovirus sequences in samples derived from humans and other mammals and about their role in causing disease in these hosts.

(This article was submitted to an online preprint archive [12].)

RESULTS

Identification of CVe in animal genomes.

We screened WGS data of 675 animal species (see Table S1 in the supplemental material) in silico to identify sequences related to circoviruses. We identified 300 circovirus-related sequences in total, 76 of which have not been reported previously (Tables 1 and S3). To investigate the novel sequences identified in our screen, each sequence was virtually translated and incorporated into a multiple-sequence alignment that included a representative set of previously reported circoviruses and CVe (Table S2). Incorporation of CVe sequences into an alignment provided a basis for determining their genetic structures and investigating their phylogenetic relationships to circoviruses (Fig. 1).

TABLE 1.

Novel CVe identified in this study

Sequence source and common name Scientific name Class Order No. of sequencesa Statusb Intactc
Circovirus
    Tomato clownfish Amphiprion frenatus Vertebrata Perciformes 1 CVe No
    Elephant fish Paramormyrops kingsleyae Vertebrata Osteoglossiformes 1 CVe No
Cyclovirus
    Asian bee mite Tropilaelaps mercedesae Arthropoda Arachnida 7 CVe No
    Varroa mite Varroa destructor Arthropoda Arachnida 19 CVe Yes
    Elongate twig ant Pseudomyrmex gracilis Arthropoda Insecta 1 CVe Yes
CRESS-DNA group
    Myxosporean parasite Thelohanellus kitauei Cnidaria Myxosporea 1 CVe Yes
    Philippine horse mussel Modiolus philippinarum Mollusca Bivalvia 4 CVe Yes
    Mediterranean mussel Mytilus galloprovincialis Mollusca Bivalvia 4 Yes
    Freshwater snail Biomphalaria glabrata Mollusca Gastropoda 1 Yes
    Tribble's cone Conus tribblei Mollusca Gastropoda 3 Yes
    Western predatory mite Galendromus occidentalis Arthropoda Arachnida 1 Yes
    Phytoseiid predatory mite Metaseiulus occidentalis Arthropoda Arachnida 1 Yes
    Brown recluse spider Loxosceles reclusa Arthropoda Arachnida 19 CVe Yes
    Scarab beetle Oryctes borbonicus Arthropoda Insecta 1 CVe Yes
    Drifting brine fly Ephydra gracilis Arthropoda Insecta 10 Yes
    Alkali fly Ephydra hians Arthropoda Insecta 8 Yes
    Amphipod crustacean Parhyale hawaiensis Arthropoda Malacostraca 3 CVe No
    Sea louse Caligus rogercresseyi Arthropoda Maxillopoda 4 Yes
    Tadpole shrimp Triops cancriformis Arthropoda Branchiopoda 1 Yes
    Pork tapeworm Taenia solium Cestoda Cyclophyllidea 3 CVe Yes
a

Number of distinct sequences disclosing similarity to circovirus proteins that were identified in species WGS data.

b

Species genomes that were confirmed as containing CVe are indicated, based on the presence in WGS assemblies of at least one contig containing regions of circovirus homology flanked by >3 kb of genomic sequence.

c

Data indicate which species contained circovirus-derived sequences in which protein-coding potential was maintained across the entire length of the detected region of circovirus homology, and this region was at least 200 nucleotides in length.

FIG 1.

FIG 1

Phylogeny of exogenous and endogenous circovirus Rep sequences. Maximum-likelihood phylogeny reconstructed from an alignment of replication-associated protein (Rep) sequences. The tree is midpoint rooted; asterisks indicate nodes with >70% bootstrap support. The scale bar indicates evolutionary distance in the number of substitutions per site. Sequences derived from metagenomic samples are indicated by colored circles. Taxon names are shown for sequences derived from viruses and CVe. All taxa are colored to indicate associations with host species groups, as shown in the key. Stars indicate viral taxa that have been linked to human disease. See Fig. S1 in the supplemental material for accession numbers of all taxa shown here. The arrow indicates an age calibration inferred for a clade within the Circovirus genus. Mya, million years ago; CaCV, canary circovirus; RaCV, raven circovirus; FiCV, finch circovirus; StCV, starling circovirus; CoCV, columbid circovirus; BFDV, beak and feather disease virus; MiCV, mink circovirus; PCV-2, porcine circovirus 2; CfCV, canine circovirus 1; DuCV, duck circovirus; SwCV, swan circovirus; GoCV, goose circoviruses; SgCV, wels catfish circovirus; BarbCV, barbel circovirus.

All of the newly identified sequences were derived from rep; no novel sequences derived from circovirus cap genes were detected. We identified two novel CVe derived from viruses in the genus Circovirus in fish genomes (Table 1). One of these, identified in the tomato clownfish (Amphiprion frenatus), appeared to be an ortholog of a CVe locus previously identified in other perciform fish (11). The other, identified in a mormyrid fish, was clearly related to CVe previously identified in ray-finned fish (11, 13), but as it comprised a relatively short fragment of the rep gene, its more precise phylogenetic relationship to these CVe could not be determined with confidence.

We identified 93 circovirus-related sequences in invertebrate genome assemblies, 71 of which have not been reported previously (Tables 1 and S3). Of these, a relatively high proportion exhibited coding potential. Some occurred on short contigs and could potentially have been derived from contaminating virus. However, we found that, in many cases, at least one of the circovirus-related sequences identified in a WGS assembly was incorporated into a contig that was easily large enough to contain an entire circovirus genome and was flanked by >3 kb of genomic sequence and thus likely to represent CVe. On this basis, we estimate that 60 of the 93 sequences we identified in invertebrate genomes are likely to be derived from CVe. Sequences that occurred on short contigs, particularly those that lacked any in-frame stop codons or frameshifts (Table 1), might instead be derived from contaminating virus.

Maximum likelihood (ML) phylogenies were reconstructed using an alignment of Rep proteins and disclosed two robustly supported, monophyletic clades corresponding to the Circovirus and Cyclovirus genera (1). In line with our previous investigations (11), we found that all Rep-related sequences from vertebrate WGSs grouped with circoviruses, with the exception of a highly divergent sequence identified in the genome of a jawless vertebrate (the hagfish Eptatretus burgeri). All sequences derived from invertebrate WGSs grouped with cycloviruses or with divergent CRESS-DNA viruses (e.g., Avon-Heathcote Estuary-associated circular virus 24 [14]). CRESS-DNA virus-like sequences from distinct species tended to emerge on relatively long branches, and bootstrap support for branching patterns in this region of the phylogeny were generally quite low. The low resolution in this part of the phylogeny likely reflects the lack of adequate sampling of viruses from invertebrate species.

Some sequences from invertebrate WGSs were observed to cluster with cycloviruses in phylogenies, including CVe detected in two parasitic mite species (Varroa destructor and Tropilaelaps mercedesae) and a third detected in the genome of the elongate twig ant (Pseudomyrmex gracilis). The last element, here referred to as CVe-Pseudomyrmex, appeared to be no more distantly related to contemporary cycloviruses than many of them are to one another, including some that are associated with vertebrates (at least superficially) (Fig. 1). Because this seemed a little surprising, we sought to confirm the presence of CVe-Pseudomyrmex in the twig ant germ line. We obtained genomic DNA from four species of ant within the Pseudomyrmex genus (P. gracilis, Pseudomyrmex elongatus, Pseudomyrmex spinicola, and Pseudomyrmex oculatus), including three distinct populations of P. gracilis. We then used PCR to amplify a region encompassing part of the CVe and part of the genomic flanking sequence. We obtained an amplicon of the expected size in all three DNA samples of P. gracilis; all other samples were negative (Fig. 2). Sequencing of the amplicon confirmed that it was derived the genomic locus containing the CVe and contained both a portion of the CVe and a region of genomic flanking sequence.

FIG 2.

FIG 2

PCR confirmation of CVe-Pseudomyrmex presence in three populations of Pseudomyrmex gracilis. (a) Results of amplification using primer pair 1 (694-bp product). (b) Results of amplification using primer pair 2 (286-bp product). Lane 1, negative control; lane 2, Pseudomyrmex gracilis from the Florida Keys; lane 3, P. gracilis from mainland Florida; lane 4, P. gracilis from Texas; lane 5, P. elongatus from the Florida Keys; lane 6, P. spinicola from Guanacaste Province, Costa Rica; lane 7, P. oculatus from Cusco, Peru; lane 8, Cephalotes atratus from Cusco, Peru; lane 9, negative control; lane 10, ladder.

Mapping the host associations of circoviruses and cycloviruses.

In phylogenies based on Rep, clades corresponding to the Circovirus and Cyclovirus genera contained a mixture of (i) CVe from WGS assemblies, (ii) sequences obtained from virus isolates, and (iii) sequences obtained from metagenomic samples (Fig. 1 and Table S1). Among circoviruses (genus Circovirus), host associations at the level of class appear relatively stable. For example, beak and feather disease virus (BFDV) groups robustly with a CVe that entered the germ line of birds of the order Passeriformes ∼38 million years ago (mya) (11), while barbel circovirus (BarbCV) groups robustly with CVe from the genome of the golden-line barbel in a well-supported clade containing numerous CVe from ray-finned fish. The only sequence that superficially seems to contradict this pattern is “chimpanzee” circovirus, which groups robustly with avian viruses. However, the name of this sequence is misleading: it was recovered from chimpanzee feces, but no host association is known. Indeed, the possibility that it might derive from an avian circovirus was noted at the time it was reported (15).

We observed three well-supported sublineages in the clade corresponding to the Cyclovirus genus, here termed cycloviruses 1 to 3 (Fig. 1). For cycloviruses, the only confirmed host associations come from the CVe-Pseudomyrmex sequence reported above. Many of the cyclovirus sequences that have been identified via metagenomic sequencing are associated with arthropod species, such as dragonflies (16). However, others are associated with vertebrates and have names such as bat cyclovirus. The cyclovirus 1 group is exclusively comprised of viruses from vertebrate samples. In cyclovirus groups 2 and 3, however, sequences from vertebrate and invertebrate samples are extensively intermingled (Fig. 1), and clade structure does not reflect these host associations in any obvious way. Sequences from each host group appear to be dispersed randomly, and the branch lengths separating vertebrate from invertebrate viruses (and CVe) are relatively short in many cases.

DISCUSSION

In this study, we screened in silico 675 animal genomes and identified numerous sequences related to circoviruses, including many that have not been reported previously. We examined the phylogenetic relationships between these sequences, well-studied circovirus isolates, and circovirus sequences recovered via metagenomic sequencing of environmental samples or animal tissues.

Most of the novel circovirus sequences reported here were identified in invertebrate genome assemblies. Many were highly divergent and are likely derived from uncharacterized CRESS-DNA virus lineages that infect invertebrate species. All of the newly identified sequences identified in our study were derived from rep genes; we did not detect any novel CVe with homology to circovirus cap genes. The preponderance of CVe derived from rep versus those derived from cap might reflect the greater heterogeneity of capsid sequences in general, which might lead to these sequences being generally harder to detect. Certainly, in the case of the more divergent invertebrate viruses, it is possible that the cap genes found in some lineages might not share any sequence homology with those sequenced previously. However, we note that even among CVe derived from viruses in the genus Circovirus, within which capsid sequences are comparatively conserved, sequences derived from rep are approximately twice as common as those derived from cap.

Among the factors that may have influenced the structures and types of CVe that we observe in animal germ lines are selection pressures that have led to these sequences being co-opted or exapted by host species. Interestingly, several of the confirmed CVe in our study lacked frameshifting mutations or in-frame stop codons (Table 1), indicating that they have been evolving under purifying selection relatively recently. We confirmed the presence of one such CVe (CVe-Pseudomyrmex) in three populations of Pseudomyrmex gracilis (Fig. 2). The fact that we did not detect the CVe-Pseudomyrmex sequence in other members of the genus suggests that it was incorporated into the P. gracilis germ line after this species diverged from P. elongatus, P. spinicola, and P. oculatus in the Miocene epoch (17). However, we cannot completely rule out that Pseudomyrmex is actually older since the failure to obtain an amplicon in other Pseudomyrmex species could be accounted for in other ways (e.g., sequence divergence in the regions targeted by PCR primers). Nevertheless, the occurrence of an apparently fixed, intact, and expressed circovirus rep gene in an ant genome provides further evidence that these genes have been co-opted or exapted by host species for as yet unknown functions. Functional genomic studies in insects indicate that endogenous viral element (EVE) sequences have been co-opted into RNA-based systems of antiviral immunity (18). Thus, one possible explanation accounting for the conservation of this sequence in CVe-Pseudomyrmex is that it is involved in immune defense although this would not necessarily require maintenance of an intact coding sequence.

Our study allowed the host associations of circoviruses and CVe to be examined in the context of their evolutionary relationships. With respect to this, the grouping of sequences for which the host associations are well established (i.e., CVe and viruses that have been investigated using methods in addition to sequencing) relative to sequences recovered from metagenomic samples was revealing. Prior to this study, the only host associations that had been robustly demonstrated were within the genus Circovirus. Circoviruses have been isolated from vertebrates, and in phylogenies based on Rep proteins, these isolates group together with vertebrate CVe in a well-supported clade. Furthermore, the host associations of circoviruses appear quite stable, with ancient CVe from particular host groups (e.g., orders or classes) sometimes seen grouping together with contemporary viruses from the same host groups (Fig. 1). Within the Circovirus clade there is only one sequence that seems to contradict this pattern. This sequence was recovered from a stool sample and, thus, as was observed when it was first reported (15), is likely to reflect environmental contamination.

The limited evidence available regarding the zoonotic potential of circoviruses suggests that they lack the capacity to be transmitted between relatively distantly related hosts (i.e., hosts in distinct classes or orders). For example, during the 1990s and early 2000s, porcine circovirus 1 (PCV-1) was inadvertently introduced into batches of live attenuated rotavirus vaccine as an adventitious agent. These vaccines were administered to millions of people (19), yet PCV-1 is not thought to have infected any humans as a result, indicating that powerful barriers to cross-species transmission are probably in effect.

Nevertheless, recent studies have identified some surprising cases wherein phylogenetic trees indicate apparent transmission of viruses between vertebrate classes (20). We see evidence for potential interclass transmission within one Circovirus subclade that contains sequences obtained from waterfowl and mammals (including mink, bats, dogs, and pigs) as well as CVe from reptile genomes (Fig. 1). Within this subclade, viruses of mink are robustly separated from porcine, canine, and waterfowl viruses by a CVe that was incorporated into the serpentine germ line >72 million years ago (11). Thus, the phylogeny indicates that at least one interclass transmission event is likely to have occurred within this clade. However, it should be noted that while the clustering patterns observed in this clade do suggest potential transmission of circoviruses between vertebrate classes, they also indicate that such events have occurred relatively infrequently during evolution since the CVe in snake genomes provide a minimum for the entire clade (assuming the root of this clade is as depicted in Fig. 1). Moreover, since clustering patterns that superficially appear to indicate host switches can be accounted for by multiple alternative evolutionary scenarios (21, 22), caution is advisable when phylogenetic approaches are used to infer the relationships between parasites and their hosts, especially when sampling is limited.

If cross-species transmission of circoviruses between distinct mammalian orders does not occur readily, then transmission between arthropod and vertebrate hosts appears extremely unlikely. However, if we take the reported host species associations of cycloviruses at face value, we might conclude that transmission between distantly related species groups occurs frequently, particularly within the cyclovirus 3′ lineage (Fig. 1). Importantly, CVe-Pseudomyrmex groups robustly within this lineage, and since all other taxa within the Cyclovirus clade have been recovered via metagenomic screening, this provides the first unambiguous evidence of a host association for cycloviruses, establishing with a high degree of confidence that they do indeed infect arthropods or, at the very least, have done so in the past. Furthermore, since the only proven associations for cycloviruses so far are with arthropods, contamination of vertebrate samples with viruses derived from arthropods is perhaps the most parsimonious explanation for the host associations observed here. Contamination from arthropod sources such as dust mites can presumably occur fairly easily, given their ubiquity (23). Intriguingly, with respect to this we identified putative cyclovirus CVe in the genomes of two distinct mite species (Fig. 1 and Table 1).

Since there is always a risk of being misled by contamination when viruses are identified via sequencing-based approaches, we propose that host associations of circoviruses identified via sequencing should be viewed with caution where they are found to strongly contradict established host associations within well-defined clades, particularly at higher taxonomic levels (e.g., phylum, class, or order). Whereas the weight of evidence may favor cyclovirus groups 2 and 3 being exclusively arthropod viruses that frequently contaminate vertebrate samples, the status of the cyclovirus 1 lineage is perhaps more equivocal. This basal lineage is comprised exclusively of sequences obtained from mammalian samples and includes cycloviruses proposed to cause disease in humans (cyclovirus VN and human cyclovirus VS5700009). Conceivably, these sequences could represent a mammal- or vertebrate-specific lineage of cycloviruses that is distinct from arthropod-infecting lineages. Notably, however, false-positive detection of human cyclovirus VS5700009 has been reported (24).

Virus sequences recovered from metagenomic samples can be investigated by examining their phylogenetic relationships to other viruses for which host associations have been established. The work performed here demonstrates the utility of endogenous virus sequences in this process. This approach can be generalized to inform metagenomics-based virus discovery and diversity mapping efforts for any virus group that has generated endogenous sequences.

MATERIALS AND METHODS

Sequence data.

Whole-genome sequence (WGS) assemblies of 675 species (see Table S1 in the supplemental material) were downloaded from the National Center for Biotechnology Information (NCBI) website. We obtained a representative set of sequences for the genus Circovirus and a nonredundant set of vertebrate CVe sequences from an openly accessible data set we compiled in our previous work (11). This data set was expanded to include a broader range of sequences in the family Circoviridae, including representative species in the Cyclovirus genus and the more distantly related CRESS-DNA viruses (Table S2). We used GLUE, an open, data-centric software environment specialized in capturing and processing virus genome sequence data sets (25), to collate the sequences, alignments, and associated data used in this investigation. These data are available in the publicly accessible DIGS-for-EVEs (database-integrated genome screening for EVEs) online repository (https://github.com/giffordlabcvr/DIGS-for-EVEs).

Genome screening in silico.

Genome screening in silico was performed using the DIGS tool (26). The DIGS procedure comprises two steps. In the first step, the basic local alignment search tool (BLAST) program (27) is used to search a genome assembly file for similar to a particular probe (i.e., a circovirus Rep or Cap polypeptide sequence). In the second, sequences that produce statistically significant matches to the probe are extracted and classified by BLAST-based comparison to a set of virus reference genomes (Table S2). Results are captured in a MySQL database.

Newly identified CVe identified in this study were assigned a unique identifier (ID) according to a convention we established previously (11). The first component of the ID is the classifier CVe. The second is a composite of two distinct subcomponents separated by a period: the name of the CVe group (usually derived from the host group in which the element occurs, e.g., Carnivora) and a numeric ID that uniquely identifies the insertion. Orthologous copies in different species are given the same number but are differentiated using the third component of the ID that uniquely identifies the species from which the sequence was obtained. Unique numeric IDs were assigned to novel CVe with reference to those used in the previously assembled data set (11).

Alignments and phylogenetic analysis.

Multiple sequence alignments were constructed using MUSCLE (28), RevTrans, version 2.0 (29), MACSE (30), and PAL2NAL (31). Manual inspection and adjustment of alignments were performed in Se-Al (32) and AliView (33). Phylogenies were reconstructed using maximum likelihood as implemented in IQ-TREE (34) and the VT+ G4 protein substitution model (35) as selected using ProTest (36), with support assessed using 1,000 nonparametric bootstrap replicates.

Amplification and sequencing.

Genomic DNA was extracted from ant tissue samples following the Moreau protocol (37) and a DNeasy blood and tissue kit (Qiagen). PCR amplification of CVe-Pseudomyrmex was performed using two sets of primer pairs designed with Primer3 (http://bioinfo.ut.ee/primer3-0.4.0/), each comprising one primer anchored in the CVe sequence and another anchored in the genomic flanking sequence. Primer pair 1 amplified a sequence that was 694 bp long, and primer pair 2 amplified a sequence that was 286 bp long. Primers were tested using illustra PuReTaq Ready-To-Go PCR beads (GE Healthcare). A temperature gradient PCR was performed to assess the optimum annealing temperature for the specific primer pairs. PCR was then performed using the genomic DNA ant extractions. The PCR conditions for this run were the following: an initial denaturation stage of 5 min at 95°C, 30 cycles of 30 s of denaturing at 95°C, 30 s of annealing at 49.7°C for primer pair 1 and at 62°C for primer pair 2, and extension at 72°C for 1 min, with a final extension after this program at 72°C for 5 min. Each run included a negative control. Amplification products (800 to 1,000 bp) for each PCR were excised and run on agarose gels. Bands of the expected size were excised, purified, and sequenced via Sanger sequencing (38).

Supplementary Material

Supplemental material

ACKNOWLEDGMENTS

T.P.W.D., J.B.S., S.J.W., and R.J.G. were supported by a grant from the UK Medical Research Council (grant MC_UU_12014/10). W.M.D.S. was supported by the Fundação de Amparo à Pesquisa do Estado de São Paulo, Brazil (scholarships No. 12/24150-9, 15/05778-5, and 17/13981-0). C.S.M. and P.J.F. were supported by a grant from the National Science Foundation (NSF DEB 1442316).

Footnotes

Supplemental material for this article may be found at https://doi.org/10.1128/JVI.00145-18.

REFERENCES

  • 1.Rosario K, Breitbart M, Harrach B, Segales J, Delwart E, Biagini P, Varsani A. 2017. Revisiting the taxonomy of the family Circoviridae: establishment of the genus Cyclovirus and removal of the genus Gyrovirus. Arch Virol 162:1447–1463. doi: 10.1007/s00705-017-3247-y. [DOI] [PubMed] [Google Scholar]
  • 2.Breitbart M, Delwart E, Rosario K, Segales J, Varsani A, ICTV Report Consortium. 2017. ICTV virus taxonomy profile: Circoviridae. J Gen Virol 98:1997–1998. doi: 10.1099/jgv.0.000871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Smits SL, Zijlstra EE, van Hellemond JJ, Schapendonk CM, Bodewes R, Schurch AC, Haagmans BL, Osterhaus AD. 2013. Novel cyclovirus in human cerebrospinal fluid, Malawi, 2010–2011. Emerg Infect Dis 19:1511–1513. doi: 10.3201/eid1909.130404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Tan LV, van Doorn HR, Nghia HDT, Chau TT, Tu LTP, de Vries M, Canuti M, Deijs M, Jebbink MF, Baker S, Bryant JE, Tham NT, BKrong NTT, Boni MF, Loi TQ, Phuong LT, Verhoeven JT, Crusat M, Jeeninga RE, Schultsz C, Chau NV, Hien TT, van der Hoek L, Farrar J, de Jong MD. 2013. Identification of a new cyclovirus in cerebrospinal fluid of patients with acute central nervous system infections. mBio 4:e00231-13. doi: 10.1128/mBio.00231-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Phan TG, Luchsinger V, Avendano LF, Deng X, Delwart E. 2014. Cyclovirus in nasopharyngeal aspirates of Chilean children with respiratory infections. J Gen Virol 95:922–927. doi: 10.1099/vir.0.061143-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Phan TG, Mori D, Deng X, Rajindrajith S, Ranawaka U, Fan Ng TF, Bucardo-Rivera F, Orlandi P, Ahmed K, Delwart E. 2015. Small circular single stranded DNA viral genomes in unexplained cases of human encephalitis, diarrhea, and in untreated sewage. Virology 482:98–104. doi: 10.1016/j.virol.2015.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Phan TG, Giannitti F, Rossow S, Marthaler D, Knutson TP, Li L, Deng X, Resende T, Vannucci F, Delwart E. 2016. Detection of a novel circovirus PCV3 in pigs with cardiac and multi-systemic inflammation. Virol J 13:184. doi: 10.1186/s12985-016-0642-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Katzourakis A, Gifford RJ. 2010. Endogenous viral elements in animal genomes. PLoS Genet 6:e1001191. doi: 10.1371/journal.pgen.1001191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liu H, Fu Y, Li B, Yu X, Xie J, Cheng J, Ghabrial SA, Li G, Yi X, Jiang D. 2011. Widespread horizontal gene transfer from circular single-stranded DNA viruses to eukaryotic genomes. BMC Evol Biol 11:276. doi: 10.1186/1471-2148-11-276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Holmes EC. 2011. The evolution of endogenous viral elements. Cell Host Microbe 10:368–377. doi: 10.1016/j.chom.2011.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Dennis TPW, de Souza WM, Marsile-Medun S, Singer JB, Wilson SJ, Gifford RJ. 2018. The evolution, distribution and diversity of endogenous circoviral elements in vertebrate genomes. Virus Res doi: 10.1016/j.virusres.2018.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dennis TPW, Flynn PJ, de Souza WM, Singer JB, Moreau CS, Wilson SJ, Gifford RJ. 2018. Insights into circovirus host range from the genomic fossil record. bioRxiv doi: 10.1101/246777. [DOI] [PMC free article] [PubMed]
  • 13.Feher E, Szekely C, Lorincz M, Cech G, Tuboly T, Singh HS, Banyai K, Farkas SL. 2013. Integrated circoviral rep-like sequences in the genome of cyprinid fish. Virus Genes 47:374–377. doi: 10.1007/s11262-013-0928-9. [DOI] [PubMed] [Google Scholar]
  • 14.Dayaram A, Goldstien S, Arguello-Astorga GR, Zawar-Reza P, Gomez C, Harding JS, Varsani A. 2015. Diverse small circular DNA viruses circulating amongst estuarine molluscs. Infect Genet Evol 31:284–295. doi: 10.1016/j.meegid.2015.02.010. [DOI] [PubMed] [Google Scholar]
  • 15.Li L, Kapoor A, Slikas B, Bamidele OS, Wang C, Shaukat S, Masroor MA, Wilson ML, Ndjango JB, Peeters M, Gross-Camp ND, Muller MN, Hahn BH, Wolfe ND, Triki H, Bartkus J, Zaidi SZ, Delwart E. 2010. Multiple diverse circoviruses infect farm animals and are commonly found in human and chimpanzee feces. J Virol 84:1674–1682. doi: 10.1128/JVI.02109-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rosario K, Dayaram A, Marinov M, Ware J, Kraberger S, Stainton D, Breitbart M, Varsani A. 2012. Diverse circular ssDNA viruses discovered in dragonflies (Odonata: Epiprocta). J Gen Virol 93:2668–2681. doi: 10.1099/vir.0.045948-0. [DOI] [PubMed] [Google Scholar]
  • 17.Gomez-Acevedo S, Rico-Arce L, Delgado-Salinas A, Magallon S, Eguiarte LE. 2010. Neotropical mutualism between Acacia and Pseudomyrmex: phylogeny and divergence times. Mol Phylogenet Evol 56:393–408. doi: 10.1016/j.ympev.2010.03.018. [DOI] [PubMed] [Google Scholar]
  • 18.Whitfield ZJ, Dolan PT, Kunitomi M, Tassetto M, Seetin MG, Oh S, Heiner C, Paxinos E, Andino R. 2017. The diversity, structure, and function of heritable adaptive immunity sequences in the Aedes aegypti genome. Curr Biol 27:3511–3519.e7. doi: 10.1016/j.cub.2017.09.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Victoria JG, Wang C, Jones MS, Jaing C, McLoughlin K, Gardner S, Delwart EL. 2010. Viral Nucleic acids in live-attenuated vaccines: detection of minority variants and an adventitious virus. J Virol 84:6033–6040. doi: 10.1128/JVI.02690-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Dill JA, Camus AC, Leary JH, Di Giallonardo F, Holmes EC, Ng TF. 2016. Distinct viral lineages from fish and amphibians reveal the complex evolutionary history of hepadnaviruses. J Virol 90:7920–7933. doi: 10.1128/JVI.00832-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Charleston MA. 1998. Jungles: a new solution to the host/parasite phylogeny reconciliation problem. Math Biosci 149:191–223. doi: 10.1016/S0025-5564(97)10012-8. [DOI] [PubMed] [Google Scholar]
  • 22.Paterson AM, Banks J. 2001. Analytical approaches to measuring cospeciation of host and parasites: through a glass, darkly. Int J Parasitol 31:1012–1022. doi: 10.1016/S0020-7519(01)00199-0. [DOI] [PubMed] [Google Scholar]
  • 23.Luczynska CM. 1998. Identification and quantification of mite allergens. Allergy 53(48 Suppl):54–57. [DOI] [PubMed] [Google Scholar]
  • 24.Chan MC, Kwok SW, Chan PK. 2015. False-positive PCR detection of cyclovirus Malawi strain VS5700009 in human cerebrospinal fluid. J Clin Virol 68:76–78. doi: 10.1016/j.jcv.2015.05.007. [DOI] [PubMed] [Google Scholar]
  • 25.Singer JB, Thomson EC, McLauchlan J, Hughes J, Gifford RJ. 2018. GLUE: a flexible software system for virus sequence data. bioRxiv doi: 10.1101/269274. [DOI] [PMC free article] [PubMed]
  • 26.Zhu H, Dennis T, Hughes J, Gifford RJ. 2018. Database-integrated genome screening (DIGS): exploring genomes heuristically using sequence similarity search tools and a relational database. bioRxiv doi: 10.1101/246835. [DOI]
  • 27.Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DL. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402. doi: 10.1093/nar/25.17.3389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Edgar RC. 2004. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wernersson R, Pedersen AG. 2003. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences. Nucleic Acids Res 31:3537–3539. doi: 10.1093/nar/gkg609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ranwez V, Harispe S, Delsuc F, Douzery EJ. 2011. MACSE: Multiple Alignment of Coding SEquences accounting for frameshifts and stop codons. PLoS One 6:e22594. doi: 10.1371/journal.pone.0022594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Suyama M, Torrents D, Bork P. 2006. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res 34:W609–W612. doi: 10.1093/nar/gkl315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Rambaut A. 2002. Se-Al sequence alignment editor, v2.0a11 University of Oxford, Oxford, United Kingdom. [Google Scholar]
  • 33.Larsson A. 2014. AliView: a fast and lightweight alignment viewer and editor for large datasets. Bioinformatics 30:3276–3278. doi: 10.1093/bioinformatics/btu531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
  • 35.Muller T, Vingron M. 2000. Modeling amino acid replacement. J Comput Biol 7:761–776. doi: 10.1089/10665270050514918. [DOI] [PubMed] [Google Scholar]
  • 36.Darriba D, Taboada GL, Doallo R, Posada D. 2011. ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27:1164–1165. doi: 10.1093/bioinformatics/btr088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Moreau CS. 2014. A practical guide to DNA extraction, PCR, and gene-based DNA sequencing in insects. Halteres 5:32–42. [Google Scholar]
  • 38.Sanger F, Nicklen S, Coulson AR. 1977. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 74:5463–5467. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental material

Articles from Journal of Virology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES