Abstract
The COVID-19 outbreak has reminded us of the importance of viral evolutionary studies as regards comprehending complex viral evolution and preventing future pandemics. A unique approach to understanding viral evolution is the use of ancient viral genomes. Ancient viruses are detectable in various archaeological remains, including ancient people’s skeletons and mummified tissues. Those specimens have preserved ancient viral DNA and RNA, which have been vigorously analyzed in the last few decades thanks to the development of sequencing technologies. Reconstructed ancient pathogenic viral genomes have been utilized to estimate the past pandemics of pathogenic viruses within the ancient human population and long-term evolutionary events. Recent studies revealed the existence of non-pathogenic viral genomes in ancient people’s bodies. These ancient non-pathogenic viruses might be informative for inferring their relationships with ancient people’s diets and lifestyles. Here, we reviewed the past and ongoing studies on ancient pathogenic and non-pathogenic viruses and the usage of ancient viral genomes to understand their long-term viral evolution.
Keywords: ancient DNA, ancient RNA, PCR, NGS, ancient virus, ancient virome, bioinformatics, viral evolution, TDRP
1. Introduction
Ancient DNA and RNA are nucleic acids derived from archaeological or historical remains, including bones, teeth, mummified tissues, and coprolites. The first ancient DNA study was published in 1984. It demonstrated the 229 bp mitochondrial DNA fragments from the dried muscle of an Equus quagga specimen preserved in a museum by cloning into the λ gt10 vector [1]. Many similar studies have been reported using recent technological developments, including the polymerase chain reaction (PCR) and next-generation sequencing (NGS) techniques. Specifically, NGS has allowed us to obtain a massive quantity of ancient genomic data, mostly targeting the genomes of archaic humans. Therefore, we can estimate past events, including gene flow between different human species, migration history, and population structures [2,3,4,5,6,7]. According to ancient human genomic studies, several research groups have discovered that bacteria and viruses co-existed in ancient people’s bodies as they do in modern ones [8,9,10,11]. Most studies have focused on the pathogenic bacteria or viruses that previously caused pandemics or epidemics. For instance, the first ancient viral study was published in 1997 on the influenza virus that caused the Spanish influenza pandemic in 1918 [12]. Subsequently, several pathogenic bacterial and viral genomic sequences have been discovered from ancient samples, including Yersinia pestis and Hepatitis B virus [13,14,15]. The pathogenic data from ancient times are informative in inferring past pandemics and epidemics [16,17]. Essentially, we can directly examine the differences between modern and ancient viral genomes using ancient viral sequences. It would be helpful to understand viral evolutionary history across thousands of years. Although evolutionary estimation with only modern samples can detect short-term evolutionary processes with population-level processes, ancient samples can provide information on long-term viral evolution, reflecting the actual fixation ratio [18]. The expansion of the number of ancient viral genomes allows for the elucidation of the viral evolution of various viral species. Understanding long-term viral evolution might be beneficial for reconstructing epidemic history and possibly predicting future trends. Furthermore, the analyses of viral components or viromes in ancient people’s bodies might be informative for inferring the lifestyle of the ancient individuals. A virome is defined as all of the viral assemblages existing in a given environment, including the human gut, oral environment, and skin [19]. The human virome consists of the viruses infecting the eukaryotic cells, archaea, and bacteria in the human body as well as transient viruses in food [20,21]. It includes the commensal, opportunistic, and pathogenic viruses which cause a broad range of immune responses, some of which cause chronic infections [22].
Here, we review the ancient viral studies that made use of ancient materials over decades based on the following topics: history, samples, data analyses, ancient virome, specific ancient viral examples, and long-term viral evolution.
2. History of Ancient Viral Studies
Paleovirology is the study of ancient viruses derived from ancient samples including extinct viruses [23,24]. Ancient DNA and RNA sequence data have been used in studies of ancient viruses from historical samples approximately two decades old to tens of thousands of years old. The first ancient viral study, which analyzed the 1918 Spanish influenza virus, was reported in 1997 [12]. Since then, various ancient viruses have been discovered, as shown in Figure 1. In the early stages in the 2000s, ancient viral studies were conducted using the PCR technique [12,25,26]. The PCR technique is convenient but cannot be applied to unknown sequences or highly diverged sequences. It amplifies only specific targeted regions, which can lead to the invalid estimation of viral evolution, such as with respect to root ages. The NGS technique appeared in 2005, allowing us to obtain a vast quantity of genomic data [27]. An ancient DNA study using the NGS technique was published in 2006 showing a wooly mammoth’s genomic sequences [28]. Since then, it has been applied to ancient viral studies and resulted in the accumulation of viral genomic information. NGS data, such as whole genomic sequencing (WGS) data and the capture-based sequencing of particular cases, can provide us with a global picture of ancient viral genomic sequences, which means that viral evolution can be estimated more precisely. Thus far, ancient viral studies have mostly focused on specific pathogenic viruses because of their public health importance and the availability of reference genomes. Additionally, there have been several studies on non-pathogenic ancient viruses including non-human viruses [29,30]. Thanks to developments in metagenomics, viral genomic data are accumulating using materials in a specific environment with NGS techniques, leading to virome research [9].
Figure 1.
History of ancient viral studies. Yellow and pink dots indicate studies or events related to polymerase chain reaction (PCR) and next-generation sequencing (NGS) techniques, respectively.
3. Samples for Ancient Virus Discoveries and How to Analyze Them
3.1. Historical Samples
Ancient viruses exist in the remains of ancient organisms in which viral hosts co-existed. Mineralized tissues including bones, coprolites, and teeth are useful in the recovering of viral sequences preserved for thousands of years. Mineralized tissues are more abundant and highly preserved, allowing large-scale population sampling [31]. They are suitable for specific viral sequence analyses and commensal viruses. For example, 14th-century coprolites were used for virome analyses characterizing the viral community of the gut environment [9]. Conversely, soft tissues cannot be preserved longer than mineralized tissues and have serious limitations when it comes to their sample ages [32]. There are advantages when using soft tissues, such as formalin-embedded soft tissue, which can be preserved in museums or institutes for a period ranging from a number of decades to 200 years, and mummified tissue, where pathological lesions can be visible and from which pathological viruses can be extracted [12,33]. Overall, both historical soft and mineralized tissue provide various kinds of ancient DNA and RNA viruses.
For plant viral studies, barley grains, corn cobs, potato, and herbarium have been used, and RNA and DNA preservation was displayed [29,34,35,36,37,38,39,40,41,42]. Non-organismal samples such as ice cores have been effectively used to reconstruct ancient viromes and obtain replication-competent DNA viruses [43,44,45,46]. Cultural artifacts such as birch pitches may also contain viral sequences related to humans. For example, a 5700-year-old birch pitch reflecting an ancient human oral environment indicated the existence of the Epstein–Barr virus within the microbiome [47]. Vaccinia virus, one of the orthopoxviruses, has also been discovered on vaccination kits from the 1860s, indicating vaccination materials at the early stage of vaccination development [48,49,50].
Due to hydrolytic depurination and subsequent β elimination, ancient DNA has typically experienced single-strand breaks and fragmentation [51,52]. Moreover, the other prominent event is cytosine deamination, which involves cytosine turning into uracil or thymine when cytosine is methylated. Since it occurs at the overhanging ends of ancient DNA fragments, the C to T substitution ratio increases towards the ends of sequencing reads when mapped against reference sequences [51,52,53]. The postmortem degradation rate depends on the temperature and humidity. One report of an in situ DNA decay assay with different temperatures demonstrated that DNA fragmentation can occur rapidly at high temperatures [54]. Regarding RNA, cytosine can also be deaminated like DNA [29]. Nevertheless, different damage susceptibilities are invulnerable to depurination and depyrimidination and vulnerable to hydrolysis [55]. Weaker phosphodiester links and stronger N-glycosidic bonds cause these susceptibilities [55]. Either way, samples preserved in excellent conditions such as at a low temperature and low humidity together with the preservation of external morphology are preferred in the context of obtaining high-quality sequencing data.
3.2. DNA and RNA Extraction, Amplification, and Sequencing
Contaminated exogenous nucleic acids and co-extracted small molecules should be avoided to obtain sufficient DNA or RNA. Since ancient endogenous DNA is highly fragmented and exhibits low abundance, contaminations dramatically affect the DNA yields and inhibit the downstream enzymatic reactions [52,56]. To avoid these contaminations, several ancient DNA extraction methods have been developed for various kinds of historical samples, such as bone and teeth [56,57,58]. Modern contamination can also be distinguished based on the presence or absence of postmortem damage patterns with bioinformatic analyses as described in the next section. Several methods introduced USER reagent (New England Biolabs, Ipswich, MA, USA), which is a commercialized enzymatic mix of uracil-DNA-glycosylase (UDG) and endonuclease VIII (Endo VIII) and can remove uracil residues and repair the resulting abasic sites [52,59]. USER has been utilized to eliminate damage patterns and avoid damage-induced sequence errors. It has been used for ancient DNA authentication. In the case of mammalian nuclear DNA, the damage pattern can be still detected by examining CpG dinucleotides [52]. When we use non-mammalian DNA, we can build both USER-treated (full-UDG) and non-treated (non-UDG) libraries for sequencing or use a modified method (UDG-half) that restricts the ancient DNA damage at the terminal nucleotide [52,60]. The usage of USER treatment depends on the study, including the study design and research question.
In early ancient viral studies, PCR techniques were applied to amplify ancient DNA sequences derived from the target viral species using the specific primer sets followed by Sanger sequencing. However, those techniques detect specific or limited genomic regions and cannot be applied to detect unknown viral sequences or metagenomic analysis. For example, recombination and different gene contents might hinder fragment amplification by PCR. If the DNA experience massive postmortem damage, such as fragmentation and substitution, it might be challenging to detect ancient viral genomes using PCR with specific primers.
WGS and capture-based sequencing using NGS techniques are more informative solutions to overcoming the above problems. NGS platforms are helpful when studying viral genomes due to their capacity to obtain sequence data across the whole genome and a representative fraction of the overall population [61,62,63]. Metagenomic analyses with untargeted WGS data are a powerful means of detecting ancient viruses more comprehensively. For example, several ancient viral sequences, including the hepatitis B virus (HBV) and Siphovirus contig89, have been discovered from WGS data generated for genomic analyses of ancient people [14,30]. Moreover, WGS data has the potential to provide various kinds of viral genomic information because of a vast quantity of sequence data. However, a low copy number of DNA or poor preservation quality might cause difficulties in reconstructing complete ancient viral genomes despite the use of WGS [64,65]. It has been shown that capture-based sequencing enrichment of the target viral sequences, followed by NGS data analyses, possibly recovers the complete genomes of ancient viruses [14,66,67].
3.3. Bioinformatic Analysis
Multiple bioinformatic tools are publicly available for the analysis of sequencing data. These tools provide basic protocols for quality control, searching, and the genome assembly of the subject sequencing data. Figure 2 shows the overall procedure of the analyses for ancient viral sequences. For WGS data, the preprocessing of the output reads data is essential. For instance, adapters, contaminations, low quality reads, and duplications need to be removed using trimming tools, including AdapterRemoval, Trimmomatic, Picard Tools, and BBTools [68,69,70,71]. leeHom is designed for ancient DNA data and can be utilized in adapter trimming and merge reads based on the Bayesian maximum a posteriori probability approach [72]. Reads aligned to the human genome can also be removed to lower the computational burden, as shown in several ancient viral studies [30,73]. However, these human decontamination steps might remove several viral sequences with high homology with a part of the human reference genome such as the herpesvirus sequence. Therefore, an appropriate analytical design should be chosen based on the researcher’s purpose, and the caveat of the decision can be described in the method. The remaining reads could be used for the downstream analysis, as shown in Figure 2. If the ancient DNA has been highly preserved and experienced only a small number of fragmentation events, then de novo assembly could be an option for obtaining the longer sequences or contigs using assembling tools, such as SPAdes and MEGAHIT [74,75,76]. Assembled contigs might be helpful in detecting highly diverged ancient viral genomes which might not exist in the present reference genomes.
Figure 2.
Overview of the experiments and bioinformatic analyses of ancient viral genomes. Ancient DNA can be extracted from historical specimens such as bones and teeth. The extracted DNA is derived from human, microbial, and viral genomes. Those mixed sequences can be determined by Sanger sequencing, whole genome sequencing (WGS), or capture-based sequencing based on next-generation sequencing (NGS) platforms. WGS can sequence untargeted DNA from humans, microbes, and viruses, and capture-based methods use biotinylated specific bait libraries and magnetic beads to enrich the target sequences. Following the preprocessing steps, contigs can be constructed by de novo assembly. Then, those contigs and preprocessed reads can be utilized for sequence binning to cluster the sequences into individual groups and obtain ancient viral sequences. Simultaneously, all contigs, preprocessed reads, and polymerase chain reaction (PCR) amplicons can be aligned to known viral sequences to detect candidate ancient viral sequences. Finally, the ancient viral sequences can be applied for downstream analyses: metagenomic profiling, the reconstruction of ancient viral genomes, DNA authenticity testing, and phylogenetic analyses.
Many ancient samples have suffered from the contamination of modern sequences, resulting in metagenomic datasets mixed with ancient and modern sequences [77]. Borry et al. created a robust and automated approach for ancient DNA damage estimation and authentication of de novo assembled ancient DNA, PyDamage [77]. Because of the high degradation and fragmentation of ancient DNA, efficient de novo assembly is challenging. In such a case, sequence binning, a sequence clustering procedure that distributes the sequences into individual groups, can be performed as shown in Figure 2. The binning can be either taxonomy-dependent binning (taxonomy binning) or taxonomy-independent binning (genome binning) [78]. Taxonomy binning is a supervised method based on known genomic sequences such as DIAMOND and MetaPhlAn2 (see the primer on tools for taxonomic classification written by Ye et al.) [79,80,81,82]. Genome binning is an unsupervised method which uses machine-learning methods according to the feature patterns of sequences and linkage patterns such as CONCOCT and MetaBAT2 (see the review about tools for taxonomy-independent binning written by Sedlar et al.) [83,84,85]. Binning tools for viral genomes have been developed, such as phages from metagenomics binning (PHAMB) [86]. Arizmendi Cárdenas et al. reported that Centrifuge would be the most suitable tool for the identification of human DNA virus from ancient samples based on a simulation, with it demonstrating the highest sensitivity and precision when compared with other taxonomic binning tools such as Kraken2, DIAMOND, and MetaPhlAn2 [87,88,89]. Such tools have already been applied in several ancient microbiome studies and might be also useful for ancient viral genome classification and reconstruction [11,90]. In addition to sequence binning, preprocessed reads, PCR amplified fragments, and assembled contigs can be aligned to viral reference sequences using several alignment tools that detect ancient viral sequences, including the Basic Local Alignment Search Tool (BLAST), Burrows–Wheeler aligner (BWA), and MEGAN Alignment Tool (MALT) [91,92,93,94]. Viral genomes on databases such as RefSeq and IMG/VR can be used as reference viral genomes [95,96]. Pratas et al. established the first NGS pipeline, TRACESPipe, for identifying, analyzing, and assembling viral DNA by combining data from multiple organs [97]. TRACESPipe utilizes hybrid methodological approaches with reference-based and de novo assembly and has the potential to reconstruct and analyze ancient viral genomes from various kinds of ancient samples. Pipelines for ancient DNA analyses such as PALEOMIX, FALCON-Meta, and nf-core/EAGER might also prove helpful in the discovery of ancient viral genomes [98,99,100]. Such pipelines integrate several steps such as read preprocessing, genome alignment, ancient DNA authenticity, genotyping, and metagenomic profiling. PALEOMIX was utilized for an ancient oral microbiome study carried out by Jensen et al., and they detected fragments of the Epstein–Barr virus [47]. FALCON-Meta was applied for metagenomic analyses using a 110,000- to 130,000-year-old tooth of a polar bear from Svalbard, but Patas et al. themselves claimed that the observed viral genomes could be contaminated sequences such as Parvovirus [99,101].
The obtained ancient viral genomes can be utilized for further analyses such as genome characterization and phylogenetic analyses. Ancient viral genomes can be characterized by several tools for gene annotation and recombination analysis such as BLAST, Prodigal, and RDP4 [102,103]. For phylogenetic analyses, we use multisequence alignment data generated by software such as MAFFT. The data can be applied for the construction of neighbor joining, maximum likelihood (ML), and Bayesian phylogenetic trees by using the following tools: MEGA, PhyML, RAxML, MrBayes, BEAST, and BEAST2 [104,105,106,107,108,109,110].
It should be confirmed whether the detected ancient viral sequences exhibited postmortem damage, including fragmentation and cytosine deamination. Since modern contamination from a soiled laboratory environment is a serious threat to ancient viral studies, and the experimental environment could be a severe threat to ancient viral studies, ancient sequences can be distinguished from modern ones using damage patterns. Several analytical methods such as mapDamage, DamageProfiler, and PMDtools have been established for detecting damage [111,112,113,114]. These tools plot ancient DNA damage patterns and visualize the frequency of particular base misincorporations. The visualization of C to T substitution frequencies at the 5′ end is helpful in ancient DNA authentication. PMDtools can calculate the postmortem damage (PMD) score to distinguish the genuinely ancient sequence from modern contamination using a likelihood model incorporating PMD, base quality scores, and biological polymorphism [113].
4. Metagenomic Data to Comprehend the Ancient Human Virome
Viromes have been analyzed from the vast quantity of shotgun sequencing data to identify (ideally) complete but (most commonly) partial viral genomes. Several studies revealed a highly diverged viral population across the human body and individuals in modern samples as well as those associated with several factors, including diet, age, geographic location, and disease status [115,116,117,118,119,120,121,122,123]. The distinct characteristics of viromes can be observed among modern and ancient human bodies. Metagenomic sequencing data derived from ancient samples can help to detect such differences. The first ancient virome study was published in 2014 and it analyzed 14th-century coprolites and paleofeces in Belgium [9]. The viral particles were first isolated from the samples through filtration, and high-throughput pyrosequencing was conducted using the 454 Life Science Genome FLX sequencer, one of the earliest NGS platforms. Eukaryotic, archaeal, and bacterial viruses were detected from the metagenomic data, and Siphoviridae phages dominated within the detected viral reads, with approximately 60% abundance within known viral reads. Differences existed between the coprolite virome and modern human stool virome at the taxonomic level, but they were functionally more similar based on principal component analysis using taxonomical or functional annotation [9]. This suggested that the functional roles of viromes might be conserved between ancient and modern gut environments, which was consistent with a previous report demonstrating the significant conservation of gene contents within viromes of the same ecological niche despite individual taxonomic variability [124]. Viral communities might play a significant role in maintaining gut environments.
Another virome study in which ancient DNA was extracted from three pre-Columbian Andean mummies was published in 2015, and the viral components were analyzed [125]. Metagenomic sequencing was conducted using the Illumina MiSeq platform, and viral sequences were identified through homology analyses. Bacteriophages dominated within the reads as follows: 50.4% (mummy FI3), 1.0% (mummy FI9), and 84.4% (mummy FI12). Those phages derived from the Siphoviridae, Myoviridae, Podoviridae, and Microviridae families, respectively, and their probable host bacteria were predicted. The results suggested that the natural mummification of the human gut preserved viral DNA, which was utilized to infer the ancient gut virome.
Wibowo et al. analyzed eight 1000- to 2000-year-old human coprolites and reconstructed high-quality ancient microbial genomes [126]. The recovered microbial components were compared with present-day human gut microbiomes, demonstrating that the palaeofaeces microbiomes were more similar to non-industrialized human gut microbiomes than industrialized ones. Differences in dietary habits might be a factor in determining such differences. Rampelli et al. illustrated that modern virome data obtained from urban societies, hunter-gatherer communities, and pre-agricultural tribes demonstrate different viral contents in each cultural group [127].
So far, we have obtained only limited amounts of ancient virome information because this field is still in its infancy. The accumulation of ancient virome data might elucidate viral evolution in a more precise way.
5. Specific Ancient Viral Studies Inferring Past Pandemic and Evolutionary History
The benefits of studying ancient viruses include detecting the predation of the human pathogens that caused pandemics in ancient times and learning how viruses have evolved over thousands of years. Several studies have discovered ancient viral sequences from historical samples, as shown in Table 1. Most of them are human pathogenic viruses because they are related to past pandemics and are epidemiologically important. However, focusing on only human pathogenic viruses in ancient viral studies could be biased in terms of the evolution of viruses because their evolution has been affected by immune responses in the human body and adaptation to humans. Non-pathogenic and non-human viruses might reflect co-evolution with different host species and show diverged viral evolution. Therefore, ancient viral studies have been conducted on non-human and non-pathogenic viruses.
Table 1.
Overview of ancient viruses detected from historical samples.
Species Name | Type | Host | Reference Accession ID | Reference Length (kb) | Detected Length (kb) 1 | Method | Sample | Sample Age (ya) 2 | Region | Accession ID of Ancient Viral Genomes | Accession ID of Raw Reads | References |
---|---|---|---|---|---|---|---|---|---|---|---|---|
African cassava mosaic virus (ACMV) | ssDNA 3 | Manihot glaziovii | NC_001467, NC_001468 | 5.5 | 5.5 | PCR 10, WGS 11 | Leaf of Manihot glaziovii specimen | 94 | Bambari, Central African Republic | MW788219, MW788220 | PRJNA698751 | Rieux et al., 2021 [41] |
Anelloviridae | ssDNA | Homo sapiens | AB303563 | 3.2 | 0.078 | PCR | Dental pulp | 200 | Kaliningrad, Russia | NA | NA | Bédarida et al., 2011 [128] |
Ancient caribou feces-associated virus (aCFV) | DNA | Plant | NA | NA | 2.2 | PCR | Coprolite | 700 | Northwest Territories, Canada | KJ938716 | NA | Ng et al., 2014 [43], Holmes, 2014 [129] |
Ancient Northwest Territories cripavirus (aNCV) | +ssRNA 4 | Insect | NA | NA | 1.8 | PCR | Coprolite | 700 | Northwest Territories, Canada | KJ938718 | NA | Ng et al., 2014 [43], Holmes, 2014 [129] |
Barely stripe mosaic virus (BSMV) | +ssRNA | Horedum vulgare | NC_003469, NC_003481, NC_003478 | 10.2 | 10.2 | WGS | Barely grain | 750 | Upper Nubia, Egypt | NA | NA | Smith et al., 2014 [29] |
Barely yellow dwarf virus (BYDV) | +ssRNA | Avena fatua, Danthonia californica, Glyceria elata, Koeleria macrantha, Phalaris coerulescens | NC_004750 | 5.7 | 3.6 | RT-PCR 12 | Herbarium specimens | 82–105 | US | DQ115532–DQ115534, DQ118372, DQ631844–DQ631846, DQ631856, DQ631857 | NA | Malmstrom et al., 2007 [39] |
Citrus leprosis virus (CiLV) | +ssRNA | Citrus aurantium, Citrus sinensis | NC_008169, NC_008170 | 13.7 | 12.7 | WGS | Herbarium specimens | 55–90 | US, Mexico, Argentina, Brazil | KT187687–KT187693 | NA | Hartung et al., 2015 [40] |
Epstein–Barr virus (EBV) | dsDNA | Homo sapiens | NC_007605 | 171.8 | 22.9 | WGS | Chewed birch pitch | 5700 | Island of Lolland, Denmark | NA | PRJEB30280 | Jensen et al., 2019 [47] |
Hepatitis B virus (HBV) | dsDNA-RT 5 | Homo sapiens | NC_001611 | 3.2 | 3.2 | PCR | Liver of a mummy | 500 | Yangju, Korea | JN315779 | NA | Kahila Bar-Gal et al., 2012 [33] |
3.2 | Capture 13 | Distal femur, skin, muscle of a mummy | 500 | Naples, Italy | MG585269 | NA | Patterson et al., 2018 [130] | |||||
3.2 | WGS, Capture | Tooth cementum, petrous bones | 822–4488 | Central to western Eurasia | ERS2295383–ERS2295394 | PRJEB9021, PRJEB20658 | Mühlemann et al., 2018 [14] | |||||
3.2 | WGS | Teeth | 340–5000 | Germany | NA | PRJEB24921 | Krause-Kyora et al., 2018 [15] | |||||
2.9 | WGS, Capture | Tooth | 396–569 | Mexico City, Mexico | NA | PRJEB37490 | Barquera, et al., 2020 [131] | |||||
3.1 | WGS | Soft tissue and bone of a mummy | 2000 | Abusir el-Meleq, Egypt | NA | PRJEB33848 | Neukamm et al., 2020 [132] | |||||
3.2 | WGS, Capture | Tooth root | 500 | Mexico City, Mexico | MT108214 | Available at Dryad 14 | Guzmán-Solís et al., 2021 [67] | |||||
3.2 | WGS, Capture | Teeth, bones, petrous bones | 400–10,500 | Eurasia and US | ERS6597748–ERS6597884 | PRJEB45699 | Kocher et al., 2021 [133] | |||||
Hepatitis C virus (HCV) | +ssRNA | Homo sapiens | NC_004102 | 9.7 | 0.336 | RT-PCR | Archived blood samples | 69 | US | KF261594, KF261595 | NA | Gray et al., 2013 [134] |
Human immunodeficiency virus type 1 (HIV-1) | ssRNA-RT 6 | Homo sapiens | NC_001802 | 9.2 | ~0.3 | RT-PCR | Plasma samples | 63 | Kinshasa, Democratic Republic of Congo | NA | NA | Zhu et al., 1998 [26] |
8.6 | RT-PCR | Frozen serum samples | 50 | New York City, US | KJ704787–KJ704797 | NA | Worobey et al., 2016 [135] | |||||
8.3 | RT-PCR, amplicon sequence | Formalin-fixed paraffin-embedded tissues | 56 | Kinshasa, Democratic Republic of Congo | MN082768 | NA | Gryseels et al., 2020 [136] | |||||
Human papillomavirus (HPV) | dsDNA 7 | Homo sapiens | NC_027779 | 7.3 | 0.141 | PCR | Mummy of a Renaissance noble woman | 454 | Naples, Italy | NA | NA | Fornaciar et al., 2003 [137] |
Human parvovirus B19 (B19V) | ssDNA | Homo sapiens | NC_000883 | 5.6 | 0.275 | PCR | Long bones | 92 | Karelia district, Finland | NA | NA | Toppinen et al., 2015 [138] |
5.9 | WGS | Dental, skeletal remains | 500–6900 | Eurasia, Southeast Asia, Greenland | NA | PRJEB26712 15 | Mühlemann et al., 2018 [73] | |||||
4.4 | WGS, Capture | Tooth roots | 500 | Mexico City, Mexico | MT108215–MT108217 | Available at Dryad 14 | Guzmán-Solís et al., 2021 [67] | |||||
Human T-cell leukemia virus type 1 (HTLV-1) | ssRNA-RT | Homo sapiens | NC_001436 | 8.5 | 0.316 | PCR | Mummy | 500 | Andean, US | NA | NA | Li et al., 1999 [139], Gessain et al., 2000 [140], Vandamme et al., 2000 [141] |
Influenza A virus | -ssRNA 8 | Homo sapiens | NC_026431-NC_026438 | 13.2 | 12.7 | RT-PCR | Formalin-fixed paraffin-embedded lung tissues | 104 | US | AF116575, AF250356, AF333238, AY130766, AY744935, DQ208309–DQ208311 | NA | Taubenberger et al., 1997 [12], Reid et al., 1999 [142], Reid et al., 2000 [143], Basler et al., 2001 [144], Reid et al., 2002 [145], Reid et al., 2004 [146], Taubenberger et al., 2005 [147] |
12.7 | RT-PCR, WGS | Formalin-fixed paraffin-embedded lung tissues | 104 | New York City, US | NA | PRJNA178740 | Xiao et al., 2013 [148] | |||||
Measles morbillivirus (MeV) | -ssRNA | Homo sapiens | NC_001498 | 15.8 | 15.8 | WGS | Formalin-fixed paraffin-embedded lung tissues | 110 | Berlin, Germany | NA | PRJEB36265 | Düx et al., 2020 [149] |
Mollivirus sibericum | dsDNA | Acanthamoeba castellanii | NA | NA | 651 | WGS | Permafrost layer | 30,000 | Northeast Siberia, Russia | KR921745 | NA | Legendre et al., 2015 [45] |
Papillomavirus | dsDNA | Neuroma cinera | MF416381 | 7.4 | 0.677 | PCR | Unwashed midden materials | 27,000 | Arizona, US | MH136586, MH136587 | NA | Larsen et al., 2018 [150] |
Pithovirus sibericum | dsDNA | Acanthamoeba castellanii | NA | NA | 610 | WGS | Permafrost layer | 30,000 | Northeast Siberia, Russia | KF740664 | NA | Legendre et al., 2014 [44] |
Potato virus X (PVX) | +ssRNA | Solanum tuberosum | NC_011620 | 6.4 | 0.75 | RT-PCR | Freeze dried leaves | 38–52 | Australia, England | GU384732–GU384734, GU384737–GU384738 | NA | Cox and Jones et al., 2010 [35] |
Potato virus Y (PVY) | +ssRNA | Solanum tuberosum | NC_001616 | 9.7 | 9.7 | RT-PCR | Potato | 84 | Netherlands | EU563512 | NA | Dullemans et al., 2011 [36] |
9.7 | WGS | Freeze-died PVY cultures | 38–79 | UK | KP691317–KP691330, MT200665–MT200668 | NA | Kehoe and Jones, 2016 [37], Green et al., 2020 [38] | |||||
Simian T-lymphotropic virus type 1 (STLV-1) | ssRNA-RNA | Cercopithecus aethiops | MF622054 | 8.4 | 0.467 | PCR | Skeletons | 122 | Central Africa | NA | NA | Calvignac et al., 2008 [151] |
Siphovirus contig89 (CT89) | dsDNA | Schalia meyeri | KF594184 | 2.4 | 4.2 | WGS | Dental pulp | 3800 | Hokkaido, Japan | LC585292 | PRJDB7235 | Nishimura et al., 2021 [30] |
Tomato mosaic tobamovirus (ToMV) | +ssRNA | Dicotyledonous, monocotyledonous | NC_002692 | 6.4 | 0.347 | RT-PCR | Ice cores | <500–140,000 | Greenland | NA | NA | Castello et al., 1999 [152] |
Variola virus (VARV) | dsDNA | Homo sapiens | NC_001611 | 185.6 | 0.718 | PCR | Pulmonary tissue of a mummy | 300 | Siberia, Russia | JX080525–JX080527 | NA | Biagini et al., 2012 [153] |
0.43 | Skeleton | 300 | Marseille city, France | NA | NA | Meffray et al., 2021 [154] | ||||||
166.8 | Capture | Soft tissue of a mummy | 367–379 | Vilnius, Lithuania | KY358055, BK010317 | PRJNA348754 | Duggan et al., 2016 [155], Smithson et al., 2017 [156] | |||||
185.4 | WGS | Forefoot and piece of skin | 100 | Prague, Czech | LT706528, LT706529 | PRJEB18730 | Pajer et al., 2017 [157], Porter et al., 2017 [158] | |||||
158.1 | Ethanol-fixed infant leg | 229–262 | London, England | NA | PRJEB35140 | Ferrari et al., 2020 [159] | ||||||
192.3 | WGS, Capture | Skeletons | 970–1400 | Northern Europe | LR800244–LR800247 | PRJEB38129 | Mühlemann et al., 2020 [66], Babkin et al., 2022 [160] | |||||
Vaccinia virus (VACV) | dsDNA | Homo sapiens | M35027 | 191.7 | 184.7 | WGS, Capture | Vaccination kits | 156 | Philadelphia, US | MN369532 | PRJNA561155 | Duggan et al., 2020 [48], Brinkmann, et al., 2020 [49], Duggan et al., 2020 [50] |
Zea may chrysovirus 1 (ZMCV1) | dsRNA 9 | Zea mays | NA | NA | 11.3 | WGS, RT-PCR | Maize cobs | 1000 | Antelope house, US | MH931189–MH931208, MH936006, MH936007, MH936014–MH936017 | NA | Peyambari et al., 2019 [34] |
1 the longest length within reconstructed viral sequences, 2 years ago (ya), 3 single-strand DNA virus (ssDNA), 4 positive-strand RNA virus (+ssRNA), 5 double-strand DNA virus (dsDNA), 6 retro-transcribing DNA virus (dsDNA-RT), 7 retro-transcribing RNA virus (ssRNA-RT), 8 negative-strand RNA virus (-ssRNA), 9 double-strand RNA virus (dsRNA), 10 polymerase chain reaction (PCR), 11 whole genome sequencing (WGS), 12 reverse transcription polymerase chain reaction (RT-PCR), 13 capture-based sequencing (capture), 14 NGS reads are available at Dryad Digital Repository: https://dx.doi.org/10.5061/dryad.5x69p8d2s accessed on 18 June 2022, 15 No public data are linked to this project (2 June 2022). The alignments and XML files used to perform the analysis presented in this paper are available at https://github.com/acorg/parvo-2018 accessed on 18 June 2022. NA stands for not applicable.
Various epidemics and pandemics have been recorded or assumed to have occurred throughout human history [16]. Ancient viral genomes have given insights into the pandemics and epidemics of the past and virus–human interactions on an evolutionary timescale. For example, the hepatitis C virus (HCV), papillomaviruses, anellovirus, tomato mosaic tobamovirus (ToMV), and T-lymphotropic virus have been detected through PCR amplification of targeted regions, as shown in Table 1 [128,134,137,139,150,151,152].
Several studies successfully obtained complete ancient viral sequences using the PCR technique such as influenza viruses in 1918 and human immunodeficiency virus type 1 (HIV-1) in the 1970s. First, the 1918 influenza viral RNA was amplified using reverse transcription polymerase chain reaction (RT-PCR) from formalin-fixed paraffin-embedded or frozen lung tissue specimens and sequenced. The complete coding sequences were determined by several studies. The 1918 virus might have originated from an avian source and adapted to mammals with several mutations [12,142,143,144,145,146,147,148]. The complete coding sequence of the 1918 influenza virus was cloned and applied for infectious experiments to characterize its virulence, with it displaying a high-proliferation phenotype in human bronchial epithelial cells [161]. In the case of HIV-1, an almost complete genome was obtained from the serum of 1970s samples in the US and formalin-fixed paraffin-embedded tissues from Kinshasa, Democratic Republic of Congo, in 1966 [135,136]. Worobey et al. illustrated that the US HIV-1 epidemic showed extensive genomic diversity and emerged from the pre-existing Caribbean epidemic [135]. They also suggested that HIV-1 entered the US around 1970, and New York City was the critical center of early US HIV diversification based on their phylogenetic analyses. Gryseels et al. reconstructed the oldest HIV-1 genome. The new phylogenetic tree, including the oldest genome, revealed that the origin of the pandemic lineage of HIV-1 dated to approximately the turn of the 20th century [136].
A 2.2 kb complete ancient caribou feces-associated virus (aCFV) was reconstructed using PCR and characterized from 700-year-old caribou feces frozen in a permanent ice patch [43]. The aCFV was estimated to be distantly related to the plant-infecting geminiviruses and the fungi-infecting Sclerotinia sclerotiorum hypovirulence-associated DNA virus 1 and derived from plant material ingested by caribou. The infectivity of the reconstructed viral genome was confirmed with the model plant Nicotiana benthamiana. Older permafrost layers also revealed the existence of replication-competent viruses such as 30,000-year-old Pithovirus sibericum and Mollivirus sibericum, also known as a giant virus [44,45]. Both of the viruses in the caribou feces and the 30,000-year-old virus demonstrated their infectivity, and they did not show postmortem degradation patterns. Frozen encapsidated viruses might have escaped from ancient DNA damage [129].
In contrast to the above successes, short and limited PCR products might cause inaccurate estimation results in relation to viral evolution. For example, the most recent common ancestor (MRCA) of human parvovirus B19 (B19V) was estimated to be from the early 1800s based on 121 and 154 bp PCR amplified fragments from the 70-year-old bones of Finnish soldiers [138]. However, 500- to 6900-year-old B19V viruses were detected in a subsequent study, and the estimated MRCA was inconsistent [73]. The new result suggested that the MRCA was placed ~12,600 years ago, with a substitution rate of approximately 1.22 × 10−5 substitutions/site/year, which is lower than the previous estimate of 2.1–2.2 × 10−4 substitutions/site/year calculated with 70-year-old B19V fragments [73,138]. Another study using ancient B19V sequences detected from dental samples between the 16th and 18th centuries supported this estimation [67]. Ancient human T-cell leukemia virus type 1 (HTLV-1) fragments with a length of about 160 bp were detected from a 1500-year-old Andean mummy, leading to the conclusion that HTLV-1 was carried by ancient Mongoloids to the Andes before the Colonial era [139]. However, it has been suspected that the sequence was derived from modern contamination based on the current prevalence of HTLV-1 in American Indians and molecular clock calculation [140,141]. Since DNA and RNA fragmentation likely results in fragments of fewer than 100 bp, it is difficult to amplify longer sequences.
Since the appearance of NGS techniques, several ancient viral genomes have been detected and analyzed by NGS techniques, as described in Table 1. The oldest ancient RNA virus discovered from WGS data is the measles morbillivirus (MeV), which was extracted from a formalin-fixed lung specimen collected in 1912 in Berlin [149]. The quasi-complete MeV genome was constructed with a mean coverage of 54x derived from Illumina platforms. The Bayesian and maximum likelihood phylogenetic trees suggested that the MeV and rinderpest viruses potentially diverged around 2500 years ago and was followed by cattle-to-human host transmission, with the subsequent evolution of two distinct lineages occurred [149]. The estimation seemed to be consistent with demographic changes in the past. The viral divergence event coincided with an increase in the population size, which became larger than the MeV critical community size (CCS) of 250,000, to 500,000 individuals, supporting the continuous transmission of MeV [162,163,164].
Concerning the reverse transcribing of viruses, the hepatitis B virus (HBV) has been well-studied using WGS data. The HBV genome is relatively small, approximately 3 kb; thus, its genome can be easily reconstructed. The first ancient HBV study in 2012 confirmed the existence of ancient HBV in a Korean mummy around the 16th century [33]. The constructed genotype C2 HBV sequence, commonly spread in Southeast Asia, suggested that the Korean HBV sequence dated back at least 3000 to 100,000 years based on its genetic diversity when compared with modern samples. Subsequently, several groups have discovered ancient HBV genomes from ancient samples: a 500-year-old Italian mummy, a 2000-year-old Egyptian mummy, 500-year-old Mexican teeth, and 400- to 105,000-year-old Eurasian and American teeth and bones, as listed in Table 1 [14,15,67,130,131,132,133]. These samples were analyzed with WGS or the capture-based sequencing data of the samples. As HBV has experienced recombination events and exhibited a complicated evolution, several hypotheses have been proposed on the evolution and origins of HBV that have not yet been fully demonstrated [165]. For example, the evolutionary ratio of HBV was estimated to be approximately 10−4 substitutions/site/year with heterochronous samples and 10−6 substitutions/site/year using calibration based on HBV-human co-expansion [166,167,168]. The evolution of HBV has been estimated using ancient HBV genomes with a temporal signal [14,169]. Mühlemann et al. suggested that the root of the HBV tree was placed between 8600 and 20,900 years ago, and the substitution rate was estimated to be 8.04 × 10−6 to 1.51 × 10−5 substitutions/site/year [14]. Kocher et al. also estimated the MRCA to be ~20,000 to 12,000 years ago, and the substitution rate was estimated to be 8.7 × 10−6 to 1.45 × 10−5 substitutions/site/year using 137 ancient HBV genomes detected from Eurasian and American individuals who lived 400 to 105,000 years ago [133]. The existence of Ancient HBV reflected several known human migrations and demographic events such as the expansion of first American populations and the Neolithic transition in Europe. Several genetic events could not be expected from the human genetic and archaeological data, such as the near complete renewal of western Eurasian HBV diversity and the existence of the extinct genotype [133]. Mühlemann et al. demonstrated the evidence that genotype A, typical in modern Africa and derived from recombination, emerged outside Africa. Such a recombination event was also detected in ancient B19V genomes [73]. Ancient HBV genomes were discovered from Mexican samples derived from transatlantic slaves and might explain the introduction and dissemination of pathogens from Africa to the Americas [67,131]. In addition to the evolution of HBV within humans, reciprocal cross-species transmission might have occurred because several ancient viral lineages seem to be distinct from modern HBV lineages and show a closer relationship with non-human primate HBV strains than those of other humans [14,15].
Ancient variola virus (VARV) sequences have been discovered in several samples: a 300-year-old Siberian mummy, a 300-year-old French skeletal specimen, a 367- to 379-year-old Lithuanian child mummy, two specimens from the Czech National Museum, a 229- to 262-year-old ethanol-fixed infant leg from England, and thirteen 970- to 1400-year-old Northern European individuals [66,153,154,157,159]. Almost complete VARV genomes were obtained from samples except for in the case of the Siberian mummy and the French skeleton specimen, even though the VARV genome is relatively large, approximately 186 kb. The reconstructed genomes were subjected to phylogenetic analyses. The specimens from the Czech National Museum were first estimated to be 160-year-old samples, but they were later re-estimated as samples from the 1920s [157,158]. The MRCA was estimated to have originated roughly 371 to 508 years ago using VARV sequences derived from Lithuanian, Czech, and British samples [155,156,158,159]. Ferrari et al. estimated the evolutionary ratio as 8.5 × 10−6 substitutions/site/year using Lithuanian samples, and Ferrari et al. estimated it as 1.067 × 10−5 substitutions/site/year using Lithuanian, Czech, and British samples [155,159]. However, ancient VARV was sequenced using older samples of Northern European specimens approximately 970 to 1400 years old, indicating that the MRCA of VARV was ~1700 years ago, and the accumulation rate of nucleotide substitutions was 3.7 × 10−6 to 6.5 × 10−6 substitutions/site/year [66,160]. These estimation results differed from the ratio estimated from time-structured samples, 1 × 10−5 substitutions/site/year, isolated in the 20th century [170]. Since a limited number of ancient samples tends to cause incorrect estimation results, efforts to discover more ancient viral samples are necessary. In addition to the evolutionary rate estimation, Mühlemann et al. demonstrated the reduction in gene contents during VARV evolution based on the following evidence: three genes are active in modern VARV which were inactive in some or all ancient VARV, 10 inactive genes in modern and ancient VARV have different mutations suggesting parallel evolution, and the inactivation of 14 genes in modern VARV are active in some or all of the ancient VARV and eight of them encode virulence factors or immunomodulators [66]. It was suggested that the orthopoxvirus species originated from a common ancestor containing all genes present in current orthopoxviruses and that the long-term adaptation within host species caused the reduction in active genes. Babkin also revealed the differences in terms of gene contents between ancient VARV and its modern equivalent and suggested that the ancestral species contained all of the genes present in orthopoxviruses today and that long-term adaptation to within-host species occurred through a reduction of active genes [160].
Several complete sequences of ancient plant DNA and RNA viruses have been discovered despite RNA instability, as listed in Table 1 [29,34,35,36,37,38,39,40,41]. For instance, ancient barley stripe mosaic virus (BSMV) genomes were obtained from ~750-year-old barley grains [29]. The phylogenetic relationships between modern BSMV and reconstructed ancient BSMV viral genomes suggested that the divergence of BSMV most likely occurred roughly 2000 years ago. The divergence age was much older than the results based on recent serial and heterochronous sampling data. The viral lineage appears to have originated from the Near East or North Africa and spread to North America and East Asia via hosts.
Finally, we introduce a complete sequence of an ancient bacteriophage. Ancient bacteriophages were less examined in ancient specimens than viruses that infect other hosts. One study successfully reconstructed an almost complete ancient 42 kb phage genome, Siphovirus contig89 (CT89), from approximately 3800-year-old dental pulp using de novo assembly [30]. Interestingly, the reference sequence of modern CT89 was registered as only a partial sequence, around 24 kb. This suggested that the de novo assembly of ancient sequences is beneficial in the context of the reconstructing of complete ancient viral genomes, although it requires the preservation of ancient DNA. CT89 is a dsDNA phage that infects Schalia meyeri and is known as an oral commensal bacterium. The results of phylogenetic analysis indicated that the ancient CT89 sequences were different from the modern CT89 sequences and might reflect ancestral states.
Overall, NGS data have enabled us to obtain a vast amount of information about ancient viral genomes. When suitable viral sequences were obtained, viral genetic variations and evolution with information regarding dates could be accurately examined. In reality, limited numbers of sequences can result in inadequate estimation results, as shown in the case of VARV. Therefore, we should make more effort to increase the number of ancient samples. Recently, the number of ancient human WGS data containing viral genomes registered on the open database has increased; thus, we have a greater chance to detect ancient viral genomes from WGS data.
6. Long-Term Viral Evolution Reflecting Time-Dependent Rate Phenomenon (TDRP) Elucidated by Ancient Viral Sequences
Viral evolution has been inferred from the differences among current viral genomic sequencing data. Based on these differences, several evolutionary processes, including phylogenetic relationships, evolutionary rates, and divergent periods, were estimated. Recent ancient viral studies have shed light on long-term viral evolution based on age-related information regarding the nucleotide diversity between ancient and modern viruses.
Mutations cause genomic differences and genetic diversity in viruses and are maintained through several processes such as natural selection, random genetic drift, and recombination, contributing to viral evolution [171]. Therefore, analyses of viral mutations are essential for elucidating viral evolutionary history, past viral population dynamics, and viral evolutionary rates. Since several mutations are related to viral adaptation, such as when a virus acquires drug resistance and pathogenicity, it is critical to understand the evolution of viruses in terms of both its medical and epidemiological aspects in order to prevent and predict pandemics [18,172,173,174]. Several genomic and phylogenetic analyses of viral genomes suggest that viral evolution appears to be faster than that of cellular organisms because of high nucleotide substitution rates [175,176]. It is possible to observe the evolution of viral genomes using samples collected within timescales of years or by using abundant data concerning experimentally determined mutation rates [170,177]. Such results indicate that the viral evolutionary rate is determined by diverse factors including genomic architecture, replication speed, and polymerase fidelity [178]. Additionally, viral evolutionary rates vary depending on the timescale according to historical data accumulation. Both viruses and cellular organisms have shown discrepancies in the evolutionary ratio between short and long timescales, and their evolutionary rates appear to decrease with the measurement of long timescales. This phenomenon is known as the time-dependent rate phenomenon (TDRP). The TDRP has been explained by several factors such as the sequence site saturation, purifying selection, short-term changes in selection pressure, and inadequate estimation of substitution rates [179,180,181,182,183]. Furthermore, the evolutionary rates are consistent with a power-law relationship between the substitution rate and the observational period [18,180]. Ghafari et al. established a model that explains the rate decay phenomenon over a wide timescale and reproduced the ubiquitous power-law rate decay [183]. Short-term rates might reflect population-level processes such as transient deleterious mutations and short-sighted adaptations within the current host species [18]. Conversely, long-term rates reflect the actual fixation rate of mutations over historical timescales of more than thousands of years [18,184]. Therefore, ancient DNA and RNA viruses can provide valuable information about long-term viral evolution over macroevolutionary timescales. Many ancient viral studies have conducted phylogenetic analyses and estimated each virus’s evolutionary ratio and divergent events. The short-term evolutionary rates tend to be relatively fast because the divergent age might be incorrectly estimated when only modern samples are applied [181,182]. The evolutionary ratio of HBV was estimated to be roughly 10−4 substitutions/site/year using modern heterochronous samples, and the most recent common ancestor (MRCA) of human HBV was estimated as <~1500 years ago [165,166,168,185]. In the case of using calibration based on HBV-human co-expansion, the evolutionary ratio and MRCA was estimated at roughly 10−6 substitutions/site/year and 34,100 years ago, respectively [167]. As described in the previous section, ancient HBV sequences were recently discovered from human specimens that were thousands of years old [14,15]. The evolutionary ratio estimate ranged from 8.7 × 10−6 to 1.45 × 10−5 substitutions/site/year, and the root of the HBV tree was estimated to be between 8600 and 20,900 years ago [14]. Different estimation results were also obtained for VARV, as described in the previous section; the evolutionary ratio was estimated to be 1 × 10−5 substitutions/site/year using time-structured samples isolated in the 20th century and 3.7 × 10−6 to 6.5 × 10−6 substitutions/site/year when about 970- to 1400-year-old Eurasian samples were used [66,160,170].
Lythgoe et al. proposed that chronic viral infection with long transmission intervals requires strategies to avoid short-sighted evolution, which could be deleterious for chronic viral infections and transmission within the host population [184]. HBV has life history traits which show a slower within-host evolutionary rate compared to during replication. Lin et al. mentioned that the colonization–adaptation trade-off (CAT) model can explain the high short-term and low long-term HBV evolutionary rates [186]. When chronic HBV limited host immunity at an early stage, it favored viruses with a high replication ability. It might generate escape or adaptive mutants against the enhanced immunity infection. Then, HBV shifts to a new tolerance phase in a new host. The results mentioned that the trade-offs occurred during transmission and colonization and their effects were concentrated on nonsynonymous rather than synonymous sites. On the other hand, HIV-1 had higher within-host evolutionary rates compared to between-host [184]. The rate mismatch might be caused by temporal changes in selection pressure during infection, the frequent reversion of adaptive mutations after transmission, and the storage of viruses in the body following preferential viral transmission: in other words, “stage-specific selection”, “adapt and revert”, and “store and retrieve” [187]. Concerning the three processes, the “store and retrieve” process might be the primary factor contributing to the rate of divergence. Lythgoe et al. mentioned the importance of acute infection, and only viruses stored after acute infection are transmitted and can cause slow evolution at the population level [187]. Accumulating evidence has also suggested that minority viral populations that initiate infections have a lower evolutionary ratio within a host [184]. It might be helpful to avoid evolutionary short-sightedness through the maintenance and preferential transmission of the subpopulation of viruses that initiated the infection.
Extrapolating evolutionary rate estimation across large timescales can seriously bias analysis [182,188]. There are several arguments concerning the discrepancy and its several possible causes, such as the inappropriate use of molecular clock dating without a temporal structure, incorrect calibration points, and different replication rates [189,190]. Among them, the “lack of temporal structure” is a well-argued practical problem. Evolutionary estimation requires a detectable temporal signal demonstrated by a positive correlation among sampling times and genetic distances [191,192]. When we utilize tip dates for calibration, the population must be measurably evolving, and the sampling window must be wide enough to capture the adequate amount of genetic change [193,194]. If there is a strong disparity in the number of modern and ancient sequences, the standard test for time requires a root-to-tip linear regression for genetic distance, as a function of time might not be reliable [192,195,196]. Several methods have been established to avoid these problems caused by biased or erroneous data and evaluate the extent of the temporal structure in the datasets to be used. Firstly, the degree of the temporal signal should be calculated on time-structured datasets. It can be performed using the simple regression of the root-to-tip distance against the sampling time [169,191]. Rambaut et al. established a tool, TempEst, to visualize and analyze temporally sampled sequence data [191]. It evaluates the existence of sufficient temporal signals in the data to perform molecular clock analysis and identifies sequences with inconsistent genetic divergence and sampling dates [191]. Secondly, the date-randomization test helps to determine whether a time-structured dataset has sufficient temporal structure [169,197]. Duchêne et al. assessed a date-randomization test to investigate whether time-structured datasets had adequate temporal signals [197]. The estimated substitution rate can pass the test when the mean does not fall within the 95% credible intervals of rate estimates obtained using replicating datasets with random sampling times. One of the HBV studies used these two methods to estimate the presence of temporal signals in ancient HBV samples to increase the credibility of the estimation [14]. Tong et al. compared the different methods for estimating substitution rates using ancient DNA sequence data, regression of root-to-tip distances, least-squares dating, and Bayesian inference [193]. They recommended applying multiple methods of inference and testing for the presence of temporal signals.
7. Future Perspective: Detecting Highly Diverged or Extinctic Viral Genomes
Viral sequences can rapidly diverge from ancestral sequences and have little homology due to the fast evolutionary rates of viruses. Accordingly, ancient viral sequences might be difficult to detect based on homology searches because of the lack of similarities. Therefore, we need alternative methods to detect ancient viral sequences. Several viral detection methods were proposed for assembled contigs in modern viral metagenomic studies. An example is detecting viral sequences from a combination of viral gene contents and genomic structural features, including VirSorter, VirSorter2, and MARVEL [198,199,200]. Another method is using frequencies of nucleic acids or kmer-based machine-learning methods with known viral sequences, such as VirFinder [201]. The clustered regularly interspaced short palindromic repeats (CRISPR) system and prokaryotic adaptive immunological memory are also employed as nonreference-based approaches [202,203]. Bacteria can memorize the partial genomes of previously infected phages, and there are almost identical sequences between bacterial CRISPR spacers and phage protospacers [204,205]. Therefore, we can identify viral sequences utilizing bacterial CRISPR spacer sequences. Those non-homology methods might aid in detecting highly diverged or extinct ancient viral sequences. One paper indicated that viral sequences were detected from 15,000-year-old glacier ice by VirSorter [46]. Due to the low preservation quality of ancient DNA and RNA in general, there is a high hurdle for assembling contigs and searching candidate ancient viral genomes. However, if we could obtain longer contigs, the above methods would enable us to detect highly diverged or extinct viral genomes.
8. Conclusions
We reviewed studies on ancient viruses discovered from archaeological samples ranging from a few decades to more than thousands of years old. Thanks to advancements in sequencing technologies, several ancient viral genomes have been discovered from historical samples and utilized for evolutionary analyses. The reconstructed genomes are beneficial for obtaining epidemiological pieces of evidence from ancient times and for estimating long-term viral evolution and temporal signals. Moreover, ancient viral detection, including non-pathogenic viruses, helps to elucidate ancient viromes possibly related to ancient people’s lifestyles. Currently, the number of identified ancient viral genomes is limited; thus, efforts to detect more ancient viruses will provide more insights into viral evolution and transition from ancient times.
Acknowledgments
We thank the members of the Human Genetics Laboratory at the National Institute of Genetics for providing many valuable suggestions. This project was supported by Deciphering Origin and Establishment of Yaponesians, based on a genome sequence data project (Yaponesia Genome Project) supported by JSPS KAKENHI, grant number JP18H05506. This work was supported in part by The Graduate University for Advanced Studies, SOKENDAI.
Author Contributions
Investigation, L.N.; writing—original draft preparation, L.N. and I.I.; writing—review and editing, L.N., N.F., R.S. and I.I.; supervision, I.I.; project administration, I.I.; funding acquisition, L.N., R.S. and I.I. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this manuscript are available in Table 1.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research was funded by JSPS KAKENHI, grant numbers JP18H05506 (I.I.), JP20K21405 (I.I.), and JP21J22509 (L.N.). R.S. is funded by the Research Organization of Information and Systems (Investment program for futuristic research projects).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Higuchi R., Bowman B., Freiberger M., Ryder O.A., Wilson A.C. DNA Sequences from the Quagga, an Extinct Member of the Horse Family. Nature. 1984;312:282–284. doi: 10.1038/312282a0. [DOI] [PubMed] [Google Scholar]
- 2.Prüfer K., Racimo F., Patterson N., Jay F., Sankararaman S., Sawyer S., Heinze A., Renaud G., Sudmant P.H., De Filippo C., et al. The Complete Genome Sequence of a Neanderthal from the Altai Mountains. Nature. 2014;505:43–49. doi: 10.1038/nature12886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sawyer S., Renaud G., Viola B., Hublin J.J., Gansauge M.T., Shunkov M.V., Derevianko A.P., Prüfer K., Kelso J., Pääbo S. Nuclear and Mitochondrial DNA Sequences from Two Denisovan Individuals. Proc. Natl. Acad. Sci. USA. 2015;112:15696–15700. doi: 10.1073/pnas.1519905112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Meyer M., Kircher M., Gansauge M.T., Li H., Racimo F., Mallick S., Schraiber J.G., Jay F., Prüfer K., De Filippo C., et al. A High-Coverage Genome Sequence from an Archaic Denisovan Individual. Science. 2012;338:222–226. doi: 10.1126/science.1224344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Browning S.R., Browning B.L., Zhou Y., Tucci S., Akey J.M. Analysis of Human Sequence Data Reveals Two Pulses of Archaic Denisovan Admixture. Cell. 2018;173:53–61.e9. doi: 10.1016/j.cell.2018.02.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Slon V., Mafessoni F., Vernot B., de Filippo C., Grote S., Viola B., Hajdinjak M., Peyrégne S., Nagel S., Brown S., et al. The Genome of the Offspring of a Neanderthal Mother and a Denisovan Father. Nature. 2018;561:113–116. doi: 10.1038/s41586-018-0455-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kanzawa-Kiriyama H., Jinam T.A., Kawai Y., Sato T., Hosomichi K., Tajima A., Adachi N., Matsumura H., Kryukov K., Saitou N., et al. Late Jomon Male and Female Genome Sequences from the Funadomari Site in Hokkaido, Japan. Anthropol. Sci. 2019;127:83–108. doi: 10.1537/ase.190415. [DOI] [Google Scholar]
- 8.Warinner C., Rodrigues J.F.M., Vyas R., Trachsel C., Shved N., Grossmann J., Radini A., Hancock Y., Tito R.Y., Fiddyment S., et al. Pathogens and Host Immunity in the Ancient Human Oral Cavity. Nat. Genet. 2014;46:336–344. doi: 10.1038/ng.2906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Appelt S., Fancello L., Le Bailly M., Raoult D., Drancourt M., Desnues C. Viruses in a 14th-Century Coprolite. Appl. Environ. Microbiol. 2014;80:2648–2655. doi: 10.1128/AEM.03242-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Warinner C., Speller C., Collins M.J. A New Era in Palaeomicrobiology: Prospects for Ancient Dental Calculus as a Long-Term Record of the Human Oral Microbiome. Philos. Trans. R. Soc. B Biol. Sci. 2015;370:20130376. doi: 10.1098/rstb.2013.0376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Weyrich L.S., Duchene S., Soubrier J., Arriola L., Llamas B., Breen J., Morris A.G., Alt K.W., Caramelli D., Dresely V., et al. Neanderthal Behaviour, Diet, and Disease Inferred from Ancient DNA in Dental Calculus. Nature. 2017;544:357–361. doi: 10.1038/nature21674. [DOI] [PubMed] [Google Scholar]
- 12.Taubenberger J.K., Reid A.H., Krafft A.E., Bijwaard K.E., Fanning T.G. Initial Genetic Characterization of the 1918 “Spanish” Influenza Virus. Science. 1997;275:1793–1796. doi: 10.1126/science.275.5307.1793. [DOI] [PubMed] [Google Scholar]
- 13.Bos K.I., Schuenemann V.J., Golding G.B., Burbano H.A., Waglechner N., Coombes B.K., McPhee J.B., Dewitte S.N., Meyer M., Schmedes S., et al. A Draft Genome of Yersinia Pestis from Victims of the Black Death. Nature. 2011;478:506–510. doi: 10.1038/nature10549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mühlemann B., Jones T.C., De Barros Damgaard P., Allentoft M.E., Shevnina I., Logvin A., Usmanova E., Panyushkina I.P., Boldgiv B., Bazartseren T., et al. Ancient Hepatitis B Viruses from the Bronze Age to the Medieval Period. Nature. 2018;557:418–423. doi: 10.1038/s41586-018-0097-z. [DOI] [PubMed] [Google Scholar]
- 15.Krause-Kyora B., Susat J., Key F.M., Kühnert D., Bosse E., Immel A., Rinne C., Kornell S.C., Yepes D., Franzenburg S., et al. Neolithic and Medieval Virus Genomes Reveal Complex Evolution of Hepatitis B. eLife. 2018;7:e36666. doi: 10.7554/eLife.36666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Spyrou M.A., Bos K.I., Herbig A., Krause J. Ancient Pathogen Genomics as an Emerging Tool for Infectious Disease Research. Nat. Rev. Genet. 2019;20:323–340. doi: 10.1038/s41576-019-0119-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Duchêne S., Ho S.Y.W., Carmichael A.G., Holmes E.C., Poinar H. The Recovery, Interpretation and Use of Ancient Pathogen Genomes. Curr. Biol. 2020;30:R1215–R1231. doi: 10.1016/j.cub.2020.08.081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Simmonds P., Aiewsakun P., Katzourakis A. Prisoners of War—Host Adaptation and Its Constraints on Virus Evolution. Nat. Rev. Microbiol. 2019;17:321–328. doi: 10.1038/s41579-018-0120-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Harvey E., Holmes E.C. Diversity and Evolution of the Animal Virome. Nat. Rev. Microbiol. 2022;1993:30–33. doi: 10.1038/s41579-021-00665-x. [DOI] [PubMed] [Google Scholar]
- 20.Virgin H.W. The Virome in Mammalian Physiology and Disease. Cell. 2014;157:142–150. doi: 10.1016/j.cell.2014.02.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Minot S., Sinha R., Chen J., Li H., Keilbaugh S.A., Wu G.D., Lewis J.D., Bushman F.D. The Human Gut Virome: Inter-Individual Variation and Dynamic Response to Diet. Genome Res. 2011;21:1616–1625. doi: 10.1101/gr.122705.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cadwell K. The Virome in Host Health and Disease. Immunity. 2015;42:805–813. doi: 10.1016/j.immuni.2015.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tsangaras K., Greenwood A.D. Paleovirology: Viral Sequences from Historical and Ancient DNA. In: Lindqvist C., Rajora O.P., editors. Paleogenomics: Genome-Scale Analysis of Ancient DNA. Springer International Publishing; Cham, The Netherlands: 2018. pp. 139–162. [Google Scholar]
- 24.Emerman M., Malik H.S. Paleovirology—Modern Consequences of Ancient Viruses. PLoS Biol. 2010;8:e1000301. doi: 10.1371/journal.pbio.1000301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mullis K., Faloona F., Scharf S., Saiki R., Horn G., Erlich H. Specific Enzymatic Amplification of DNA In Vitro: The Polymerase Chain Reaction. Cold Spring Harb. Symp. Quant. Biol. 1986;51:263–273. doi: 10.1101/SQB.1986.051.01.032. [DOI] [PubMed] [Google Scholar]
- 26.Zhu T., Korber B.T., Nahmias A.J., Hooper E., Sharp P.M., Ho D.D. An African HIV-1 Sequence from 1959 and Implications for the of the Epidemic. Nature. 1998;391:594–597. doi: 10.1038/35400. [DOI] [PubMed] [Google Scholar]
- 27.Goodwin S., McPherson J.D., McCombie W.R. Coming of Age: Ten Years of next-Generation Sequencing Technologies. Nat. Rev. Genet. 2016;17:333–351. doi: 10.1038/nrg.2016.49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Poinar H.N., Schwarz C., Qi J., Shapiro B., MacPhee R.D.E., Buigues B., Tikhonov A., Huson D.H., Tomsho L.P., Auch A., et al. Metagenomics to Paleogenomics: Large-Scale Sequencing of Mammoth DNA. Science. 2006;311:392–394. doi: 10.1126/science.1123360. [DOI] [PubMed] [Google Scholar]
- 29.Smith O., Clapham A., Rose P., Liu Y., Wang J., Allaby R.G. A Complete Ancient RNA Genome: Identification, Reconstruction and Evolutionary History of Archaeological Barley Stripe Mosaic Virus. Sci. Rep. 2014;4:4003. doi: 10.1038/srep04003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Nishimura L., Sugimoto R., Inoue J., Nakaoka H., Kanzawa-Kiriyama H., Shinoda K., Inoue I. Identification of Ancient Viruses from Metagenomic Data of the Jomon People. J. Hum. Genet. 2021;66:287–296. doi: 10.1038/s10038-020-00841-6. [DOI] [PubMed] [Google Scholar]
- 31.Hagelberg E., Sykes B., Hedges R. Ancient Bone DNA Amplified. Nature. 1989;342:485. doi: 10.1038/342485a0. [DOI] [PubMed] [Google Scholar]
- 32.Seguin-Orlando A., Gamba C., der Sarkissian C., Ermini L., Louvel G., Boulygina E., Sokolov A., Nedoluzhko A., Lorenzen E.D., Lopez P., et al. Pros and Cons of Methylation-Based Enrichment Methods for Ancient DNA. Sci. Rep. 2015;5:11826. doi: 10.1038/srep11826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kahila Bar-Gal G., Kim M.J., Klein A., Shin D.H., Oh C.S., Kim J.W., Kim T.H., Kim S.B., Grant P.R., Pappo O., et al. Tracing Hepatitis B Virus to the 16th Century in a Korean Mummy. Hepatology. 2012;56:1671–1680. doi: 10.1002/hep.25852. [DOI] [PubMed] [Google Scholar]
- 34.Peyambari M., Warner S., Stoler N., Rainer D., Roossinck M.J. A 1000-Year-Old RNA Virus. J. Virol. 2019;93:e01188-18. doi: 10.1128/jvi.01188-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cox B.A., Jones R.A.C. Genetic Variability in the Coat Protein Gene of Potato Virus X and the Current Relationship between Phylogenetic Placement and Resistance Groupings. Arch. Virol. 2010;155:1349–1356. doi: 10.1007/s00705-010-0711-3. [DOI] [PubMed] [Google Scholar]
- 36.Dullemans A.M., Cuperus C., Verbeek M., van der Vlugt R.A.A. Complete Nucleotide Sequence of a Potato Isolate of Strain Group C of Potato Virus Y from 1938. Arch. Virol. 2011;156:473–477. doi: 10.1007/s00705-010-0853-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kehoe M.A., Jones R.A.C. Improving Potato Virus Y Strain Nomenclature: Lessons from Comparing Isolates Obtained over a 73-Year Period. Plant Pathol. 2016;65:322–333. doi: 10.1111/ppa.12404. [DOI] [Google Scholar]
- 38.Green K.J., Quintero-Ferrer A., Chikh-Ali M., Jones R.A.C., Karasev A.V. Genetic Diversity of Nine Non-Recombinant Potato Virus y Isolates from Three Biological Strain Groups: Historical and Geographical Insights. Plant Dis. 2020;104:2317–2323. doi: 10.1094/PDIS-02-20-0294-SC. [DOI] [PubMed] [Google Scholar]
- 39.Malmstrom C.M., Shu R., Linton E.W., Newton L.A., Cook M.A. Barley Yellow Dwarf Viruses (BYDVs) Preserved in Herbarium Specimens Illuminate Historical Disease Ecology of Invasive and Native Grasses. J. Ecol. 2007;95:1153–1166. doi: 10.1111/j.1365-2745.2007.01307.x. [DOI] [Google Scholar]
- 40.Hartung J.S., Roy A., Fu S., Shao J., Schneider W.L., Brlansky R.H. History and Diversity of Citrus Leprosis Virus Recorded in Herbarium Specimens. Phytopathology. 2015;105:1277–1284. doi: 10.1094/PHYTO-03-15-0064-R. [DOI] [PubMed] [Google Scholar]
- 41.Rieux A., Campos P., Duvermy A., Scussel S., Martin D., Gaudeul M., Lefeuvre P., Becker N., Lett J.M. Contribution of Historical Herbarium Small RNAs to the Reconstruction of a Cassava Mosaic Geminivirus Evolutionary History. Sci. Rep. 2021;11:21280. doi: 10.1038/s41598-021-00518-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jones R.A.C., Boonham N., Adams I.P., Fox A. Historical Virus Isolate Collections: An Invaluable Resource Connecting Plant Virology’s Pre-Sequencing and Post-Sequencing Eras. Plant Pathol. 2021;70:235–248. doi: 10.1111/ppa.13313. [DOI] [Google Scholar]
- 43.Ng T.F.F., Chen L.F., Zhou Y., Shapiro B., Stiller M., Heintzman P.D., Varsani A., Kondov N.O., Wong W., Deng X., et al. Preservation of Viral Genomes in 700-y-Old Caribou Feces from a Subarctic Ice Patch. Proc. Natl. Acad. Sci. USA. 2014;111:16842–16847. doi: 10.1073/pnas.1410429111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Legendre M., Bartoli J., Shmakova L., Jeudy S., Labadie K., Adrait A., Lescot M., Poirot O., Bertaux L., Bruley C., et al. Thirty-Thousand-Year-Old Distant Relative of Giant Icosahedral DNA Viruses with a Pandoravirus Morphology. Proc. Natl. Acad. Sci. USA. 2014;111:4274–4279. doi: 10.1073/pnas.1320670111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Legendre M., Lartigue A., Bertaux L., Jeudy S., Bartoli J., Lescot M., Alempic J.M., Ramus C., Bruley C., Labadie K., et al. In-Depth Study of Mollivirus Sibericum, a New 30,000-Yold Giant Virus Infecting Acanthamoeba. Proc. Natl. Acad. Sci. USA. 2015;112:E5327–E5335. doi: 10.1073/pnas.1510795112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zhong Z.P., Tian F., Roux S., Gazitúa M.C., Solonenko N.E., Li Y.F., Davis M.E., Van Etten J.L., Mosley-Thompson E., Rich V.I., et al. Glacier Ice Archives Nearly 15,000-Year-Old Microbes and Phages. Microbiome. 2021;9:160. doi: 10.1186/s40168-021-01106-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Jensen T.Z.T., Niemann J., Iversen K.H., Fotakis A.K., Gopalakrishnan S., Vågene Å.J., Pedersen M.W., Sinding M.H.S., Ellegaard M.R., Allentoft M.E., et al. A 5700 Year-Old Human Genome and Oral Microbiome from Chewed Birch Pitch. Nat. Commun. 2019;10:5520. doi: 10.1038/s41467-019-13549-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Duggan A.T., Klunk J., Porter A.F., Dhody A.N., Hicks R., Smith G.L., Humphreys M., McCollum A.M., Davidson W.B., Wilkins K., et al. The Origins and Genomic Diversity of American Civil War Era Smallpox Vaccine Strains. Genome Biol. 2020;21:175. doi: 10.1186/s13059-020-02079-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Brinkmann A., Souza A.R.V., Esparza J., Nitsche A., Damaso C.R. Re-Assembly of Nineteenth-Century Smallpox Vaccine Genomes Reveals the Contemporaneous Use of Horsepox and Horsepox-Related Viruses in the USA. Genome Biol. 2020;21:4–9. doi: 10.1186/s13059-020-02202-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Duggan A.T., Holmes E.C., Poinar H.N. Response to Brinkmann et al. Re-Assembly of 19th Century Smallpox Vaccine Genomes Reveals the Contemporaneous Use of Horsepox and Horsepox-Related Viruses in the United States. Genome Biol. 2020;21:20–21. doi: 10.1186/s13059-020-02203-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Dabney J., Meyer M., Pääbo S. Ancient DNA Damage. Cold Spring Harb. Perspect. Biol. 2013;5:a012567. doi: 10.1101/cshperspect.a012567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Orlando L., Allaby R., Skoglund P., Der Sarkissian C., Stockhammer P.W., Ávila-Arcos M.C., Fu Q., Krause J., Willerslev E., Stone A.C., et al. Ancient DNA Analysis. Nat. Rev. Methods Prim. 2021;1:14. doi: 10.1038/s43586-020-00011-0. [DOI] [Google Scholar]
- 53.Briggs A.W., Stenzel U., Johnson P.L.F., Green R.E., Kelso J., Prüfer K., Meyer M., Krause J., Ronan M.T., Lachmann M., et al. Patterns of Damage in Genomic DNA Sequences from a Neandertal. Proc. Natl. Acad. Sci. USA. 2007;104:14616–14621. doi: 10.1073/pnas.0704665104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Allentoft M.E., Collins M., Harker D., Haile J., Oskam C.L., Hale M.L., Campos P.F., Samaniego J.A., Gilbert T.P.M., Willerslev E., et al. The Half-Life of DNA in Bone: Measuring Decay Kinetics in 158 Dated Fossils. Proc. R. Soc. B Biol. Sci. 2012;279:4724–4733. doi: 10.1098/rspb.2012.1745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Fordyce S.L., Kampmann M.L., van Doorn N.L., Gilbert M.T.P. Long-Term RNA Persistence in Postmortem Contexts. Investig. Genet. 2013;4:7. doi: 10.1186/2041-2223-4-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Höss M., Pääbo S. DNA Extraction from Pleistocene Bones by a Silica-Based Purification Method. Nucleic Acids Res. 1993;21:3913–3914. doi: 10.1093/nar/21.16.3913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sirak K.A., Fernandes D.M., Cheronet O., Novak M., Gamarra B., Balassa T., Bernert Z., Cséki A., Dani J., Gallina J.Z., et al. A Minimally-Invasive Method for Sampling Human Petrous Bones from the Cranial Base for Ancient DNA Analysis. Biotechniques. 2017;62:283–289. doi: 10.2144/000114558. [DOI] [PubMed] [Google Scholar]
- 58.Rohland N., Hofreiter M. Ancient Dna Extraction from Bones and Teeth. Nat. Protoc. 2007;2:1756–1762. doi: 10.1038/nprot.2007.247. [DOI] [PubMed] [Google Scholar]
- 59.Briggs A.W., Stenzel U., Meyer M., Krause J., Kircher M., Pääbo S. Removal of Deaminated Cytosines and Detection of in Vivo Methylation in Ancient DNA. Nucleic Acids Res. 2009;38:e87. doi: 10.1093/nar/gkp1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rohland N., Harney E., Mallick S., Nordenfelt S., Reich D. Partial Uracil—DNA—Glycosylase Treatment for Screening of Ancient DNA. Philos. Trans. R. Soc. B Biol. Sci. 2015;370:20130624. doi: 10.1098/rstb.2013.0624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Beerenwinkel N., Zagordi O. Ultra-Deep Sequencing for the Analysis of Viral Populations. Curr. Opin. Virol. 2011;1:413–418. doi: 10.1016/j.coviro.2011.07.008. [DOI] [PubMed] [Google Scholar]
- 62.Lauring A.S. Within-Host Viral Diversity: A Window into Viral Evolution. Annu. Rev. Virol. 2020;7:63–81. doi: 10.1146/annurev-virology-010320-061642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Beerenwinkel N., Günthard H.F., Roth V., Metzner K.J. Challenges and Opportunities in Estimating Viral Genetic Diversity from Next-Generation Sequencing Data. Front. Microbiol. 2012;3:329. doi: 10.3389/fmicb.2012.00329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Knapp M., Hofreiter M. Next Generation Sequencing of Ancient DNA: Requirements, Strategies and Perspectives. Genes. 2010;1:227–243. doi: 10.3390/genes1020227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Quince C., Walker A.W., Simpson J.T., Loman N.J., Segata N. Shotgun Metagenomics, from Sampling to Analysis. Nat. Biotechnol. 2017;35:833–844. doi: 10.1038/nbt.3935. [DOI] [PubMed] [Google Scholar]
- 66.Mühlemann B., Vinner L., Margaryan A., Wilhelmson H., de la Fuente Castro C., Allentoft M.E., de Barros Damgaard P., Hansen A.J., Holtsmark Nielsen S., Strand L.M., et al. Diverse Variola Virus (Smallpox) Strains Were Widespread in Northern Europe in the Viking Age. Science. 2020;369:eaaw8977. doi: 10.1126/science.aaw8977. [DOI] [PubMed] [Google Scholar]
- 67.Guzmán-Solís A.A., Villa-Islas V., Bravo-López M.J., Sandoval-Velasco M., Wesp J.K., Gómez-Valdés J.A., de la Luz Moreno-Cabrera M., Meraz-Moreno A., Solís-Pichardo G., Schaaf P., et al. Ancient Viral Genomes Reveal Introduction of Human Pathogenic Viruses into Mexico during the Transatlantic Slave Trade. eLife. 2021;10:e68612. doi: 10.7554/eLife.68612. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Schubert M., Lindgreen S., Orlando L. AdapterRemoval v2: Rapid Adapter Trimming, Identification, and Read Merging. BMC Res. Notes. 2016;9:88. doi: 10.1186/s13104-016-1900-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Picard Toolkit. [(accessed on 19 April 2022)]. Available online: https://broadinstitute.github.io/picard/
- 71.Bushnell B. BBMap. [(accessed on 19 April 2022)]. Available online: http://sourceforge.net/projects/bbmap/
- 72.Renaud G., Stenzel U., Kelso J. LeeHom: Adaptor Trimming and Merging for Illumina Sequencing Reads. Nucleic Acids Res. 2014;42:e141. doi: 10.1093/nar/gku699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Mühlemann B., Margaryan A., De Barros Damgaard P., Allentoft M.E., Vinner L., Hansen A.J., Weber A., Bazaliiskii V.I., Molak M., Arneborg J., et al. Ancient Human Parvovirus B19 in Eurasia Reveals Its Long-Term Association with Humans. Proc. Natl. Acad. Sci. USA. 2018;115:7557–7562. doi: 10.1073/pnas.1804921115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Prjibelski A., Antipov D., Meleshko D., Lapidus A., Korobeynikov A. Using SPAdes De Novo Assembler. Curr. Protoc. Bioinform. 2020;70:e102. doi: 10.1002/cpbi.102. [DOI] [PubMed] [Google Scholar]
- 75.Nurk S., Meleshko D., Korobeynikov A., Pevzner P.A. MetaSPAdes: A New Versatile Metagenomic Assembler. Genome Res. 2017;27:824–834. doi: 10.1101/gr.213959.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Li D., Liu C.M., Luo R., Sadakane K., Lam T.W. MEGAHIT: An Ultra-Fast Single-Node Solution for Large and Complex Metagenomics Assembly via Succinct de Bruijn Graph. Bioinformatics. 2015;31:1674–1676. doi: 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
- 77.Borry M., Hübner A., Rohrlach A.B., Warinner C. PyDamage: Automated Ancient Damage Identification and Estimation for Contigs in Ancient DNA de Novo Assembly. PeerJ. 2021;9:e11845. doi: 10.7717/peerj.11845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Yue Y., Huang H., Qi Z., Dou H.M., Liu X.Y., Han T.F., Chen Y., Song X.J., Zhang Y.H., Tu J. Evaluating Metagenomics Tools for Genome Binning with Real Metagenomic Datasets and CAMI Datasets. BMC Bioinform. 2020;21:334. doi: 10.1186/s12859-020-03667-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Buchfink B., Xie C., Huson D.H. Fast and Sensitive Protein Alignment Using DIAMOND. Nat. Methods. 2014;12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
- 80.Buchfink B., Reuter K., Drost H.G. Sensitive Protein Alignments at Tree-of-Life Scale Using DIAMOND. Nat. Methods. 2021;18:366–368. doi: 10.1038/s41592-021-01101-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Truong D.T., Franzosa E.A., Tickle T.L., Scholz M., Weingart G., Pasolli E., Tett A., Huttenhower C., Segata N. MetaPhlAn2 for Enhanced Metagenomic Taxonomic Profiling. Nat. Methods. 2015;12:902–903. doi: 10.1038/nmeth.3589. [DOI] [PubMed] [Google Scholar]
- 82.Ye S.H., Siddle K.J., Park D.J., Sabeti P.C. Benchmarking Metagenomics Tools for Taxonomic Classification. Cell. 2019;178:779–794. doi: 10.1016/j.cell.2019.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Alneberg J., Bjarnason B.S., De Bruijn I., Schirmer M., Quick J., Ijaz U.Z., Lahti L., Loman N.J., Andersson A.F., Quince C. Binning Metagenomic Contigs by Coverage and Composition. Nat. Methods. 2014;11:1144–1146. doi: 10.1038/nmeth.3103. [DOI] [PubMed] [Google Scholar]
- 84.Kang D.D., Li F., Kirton E., Thomas A., Egan R., An H., Wang Z. MetaBAT 2: An Adaptive Binning Algorithm for Robust and Efficient Genome Reconstruction from Metagenome Assemblies. PeerJ. 2019;7:e7359. doi: 10.7717/peerj.7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Sedlar K., Kupkova K., Provaznik I. Bioinformatics Strategies for Taxonomy Independent Binning and Visualization of Sequences in Shotgun Metagenomics. Comput. Struct. Biotechnol. J. 2017;15:48–55. doi: 10.1016/j.csbj.2016.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Johansen J., Plichta D.R., Nissen J.N., Jespersen M.L., Shah S.A., Deng L., Stokholm J., Bisgaard H., Nielsen D.S., Sørensen S.J., et al. Genome Binning of Viral Entities from Bulk Metagenomics Data. Nat. Commun. 2022;13:965. doi: 10.1038/s41467-022-28581-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Arizmendi Cárdenas Y.O., Neuenschwander S., Malaspinas A.-S. Benchmarking Metagenomics Classifiers on Ancient Viral DNA: A Simulation Study. PeerJ. 2022;10:e12784. doi: 10.7717/peerj.12784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Kim D., Song L., Breitwieser F.P., Salzberg S.L. Centrifuge: Rapid and Sensitive Classification of Metagenomic Sequences. Genome Res. 2016;26:1721–1729. doi: 10.1101/gr.210641.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Wood D.E., Lu J., Langmead B. Improved Metagenomic Analysis with Kraken 2. Genome Biol. 2019;20:257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Granehäll L., Huang K.D., Tett A., Manghi P., Paladin A., O’Sullivan N., Rota-Stabelli O., Segata N., Zink A., Maixner F. Metagenomic Analysis of Ancient Dental Calculus Reveals Unexplored Diversity of Oral Archaeal Methanobrevibacter. Microbiome. 2021;9:197. doi: 10.1186/s40168-021-01132-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST+: Architecture and Applications. BMC Bioinform. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Li H., Durbin R. Fast and Accurate Short Read Alignment with Burrows-Wheeler Transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Herbig A., Maixner F., Bos K.I., Zink A., Krause J., Huson D.H. MALT: Fast Alignment and Analysis of Metagenomic DNA Sequence Data Applied to the Tyrolean Iceman. bioRxiv. 2016:050559. doi: 10.1101/050559. [DOI] [Google Scholar]
- 94.Huson D.H., Auch A.F., Qi J., Schuster S.C. MEGAN Analysis of Metagenomic Data. Genome Res. 2007;17:377–386. doi: 10.1101/gr.5969107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Brister J.R., Ako-Adjei D., Bao Y., Blinkova O. NCBI Viral Genomes Resource. Nucleic Acids Res. 2015;43:D571–D577. doi: 10.1093/nar/gku1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Roux S., Páez-Espino D., Chen I.M.A., Palaniappan K., Ratner A., Chu K., Reddy T., Nayfach S., Schulz F., Call L., et al. IMG/VR v3: An Integrated Ecological and Evolutionary Framework for Interrogating Genomes of Uncultivated Viruses. Nucleic Acids Res. 2021;49:D764–D775. doi: 10.1093/nar/gkaa946. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Pratas D., Toppinen M., Pyoria L., Hedman K., Sajantila A., Perdomo M.F. A Hybrid Pipeline for Reconstruction and Analysis of Viral Genomes at Multi-Organ Level. Gigascience. 2020;9:giaa086. doi: 10.1093/gigascience/giaa086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Schubert M., Ermini L., Der Sarkissian C., Jónsson H., Ginolhac A., Schaefer R., Martin M.D., Fernández R., Kircher M., McCue M., et al. Characterization of Ancient and Modern Genomes by SNP Detection and Phylogenomic and Metagenomic Analysis Using PALEOMIX. Nat. Protoc. 2014;9:1056–1082. doi: 10.1038/nprot.2014.063. [DOI] [PubMed] [Google Scholar]
- 99.Pratas D., Hosseini M., Grilo G., Pinho A.J., Silva R.M., Caetano T., Carneiro J., Pereira F. Metagenomic Composition Analysis of an Ancient Sequenced Polar Bear Jawbone from Svalbard. Genes. 2018;9:445. doi: 10.3390/genes9090445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Fellows Yates J.A., Lamnidis T.C., Borry M., Valtueña A.A., Fagernäs Z., Clayton S., Garcia M.U., Neukamm J., Peltzer A. Reproducible, Portable, and Efficient Ancient Genome Reconstruction with Nfcore/Eager. PeerJ. 2021;9:e10947. doi: 10.7717/peerj.10947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Ingólfsson Ó., Wiig Ø. Late Pleistocene Fossil Find in Svalbard: The Oldest Remains of a Polar Bear (Ursus Maritimus Phipps, 1744) Ever Discovered. Polar Res. 2009;28:455–462. doi: 10.1111/j.1751-8369.2008.00087.x. [DOI] [Google Scholar]
- 102.Hyatt D., Chen G.L., LoCascio P.F., Land M.L., Larimer F.W., Hauser L.J. Prodigal: Prokaryotic Gene Recognition and Translation Initiation Site Identification. BMC Bioinform. 2010;11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Martin D.P., Murrell B., Golden M., Khoosal A., Muhire B. RDP4: Detection and Analysis of Recombination Patterns in Virus Genomes. Virus Evol. 2015;1:vev003. doi: 10.1093/ve/vev003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Saitou N., Nei M. The Neighbor-Joining Method: A New Method for Reconstructing Phylogenetic Trees. Mol. Biol. Evol. 1987;4:406–425. doi: 10.1093/oxfordjournals.molbev.a040454. [DOI] [PubMed] [Google Scholar]
- 105.Kumar S., Stecher G., Li M., Knyaz C., Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018;35:1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Guindon S., Dufayard J.F., Lefort V., Anisimova M., Hordijk W., Gascuel O. New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0. Syst. Biol. 2010;59:307–321. doi: 10.1093/sysbio/syq010. [DOI] [PubMed] [Google Scholar]
- 107.Stamatakis A. RAxML Version 8: A Tool for Phylogenetic Analysis and Post-Analysis of Large Phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Ronquist F., Teslenko M., Van Der Mark P., Ayres D.L., Darling A., Höhna S., Larget B., Liu L., Suchard M.A., Huelsenbeck J.P. Mrbayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice across a Large Model Space. Syst. Biol. 2012;61:539–542. doi: 10.1093/sysbio/sys029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Suchard M.A., Lemey P., Baele G., Ayres D.L., Drummond A.J., Rambaut A. Bayesian Phylogenetic and Phylodynamic Data Integration Using BEAST 1.10. Virus Evol. 2018;4:vey016. doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Bouckaert R., Vaughan T.G., Barido-Sottani J., Duchêne S., Fourment M., Gavryushkina A., Heled J., Jones G., Kühnert D., De Maio N., et al. BEAST 2.5: An Advanced Software Platform for Bayesian Evolutionary Analysis. PLoS Comput. Biol. 2019;15:e1006650. doi: 10.1371/journal.pcbi.1006650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Ginolhac A., Rasmussen M., Gilbert M.T.P., Willerslev E., Orlando L. MapDamage: Testing for Damage Patterns in Ancient DNA Sequences. Bioinformatics. 2011;27:2153–2155. doi: 10.1093/bioinformatics/btr347. [DOI] [PubMed] [Google Scholar]
- 112.Jónsson H., Ginolhac A., Schubert M., Johnson P.L.F., Orlando L. MapDamage2.0: Fast Approximate Bayesian Estimates of Ancient DNA Damage Parameters. Bioinformatics. 2013;29:1682–1684. doi: 10.1093/bioinformatics/btt193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Skoglund P., Northoff B.H., Shunkov M.V., Derevianko A.P., Pääbo S., Krause J., Jakobsson M. Separating Endogenous Ancient DNA from Modern Day Contamination in a Siberian Neandertal. Proc. Natl. Acad. Sci. USA. 2014;111:2229–2234. doi: 10.1073/pnas.1318934111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Neukamm J., Peltzer A., Nieselt K. DamageProfiler: Fast Damage Pattern Calculation for Ancient DNA. Bioinformatics. 2021;37:3652–3653. doi: 10.1093/bioinformatics/btab190. [DOI] [PubMed] [Google Scholar]
- 115.Dinsdale E.A., Edwards R.A., Hall D., Angly F., Breitbart M., Brulc J.M., Furlan M., Desnues C., Haynes M., Li L., et al. Functional Metagenomic Profiling of Nine Biomes. Nature. 2008;452:629–632. doi: 10.1038/nature06810. [DOI] [PubMed] [Google Scholar]
- 116.Eric Wommack K., Bhavsar J., Polson S.W., Chen J., Dumas M., Srinivasiah S., Furman M., Jamindar S., Nasko D.J. VIROME: A Standard Operating Procedure for Analysis of Viral Metagenome Sequences. Stand. Genomic Sci. 2012;6:427–439. doi: 10.4056/sigs.2945050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Lecuit M., Eloit M. The Human Virome: New Tools and Concepts. Trends Microbiol. 2013;21:510–515. doi: 10.1016/j.tim.2013.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Liang G., Bushman F.D. The Human Virome: Assembly, Composition and Host Interactions. Nat. Rev. Microbiol. 2021;19:514–527. doi: 10.1038/s41579-021-00536-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Abeles S.R., Robles-Sikisaka R., Ly M., Lum A.G., Salzman J., Boehm T.K., Pride D.T. Human Oral Viruses Are Personal, Persistent and Gender-Consistent. ISME J. 2014;8:1753–1767. doi: 10.1038/ismej.2014.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Cadwell K. Expanding the Role of the Virome: Commensalism in the Gut. J. Virol. 2015;89:1951–1953. doi: 10.1128/JVI.02966-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Zhao G., Vatanen T., Droit L., Park A., Kostic A.D., Poon T.W., Vlamakis H., Siljander H., Härkönen T., Hämäläinen A.M., et al. Intestinal Virome Changes Precede Autoimmunity in Type I Diabetes-Susceptible Children. Proc. Natl. Acad. Sci. USA. 2017;114:E6166–E6175. doi: 10.1073/pnas.1706359114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Nakatsu G., Zhou H., Wu W.K.K., Wong S.H., Coker O.O., Dai Z., Li X., Szeto C.H., Sugimura N., Lam T.Y.T., et al. Alterations in Enteric Virome Are Associated With Colorectal Cancer and Survival Outcomes. Gastroenterology. 2018;155:529–541. doi: 10.1053/j.gastro.2018.04.018. [DOI] [PubMed] [Google Scholar]
- 123.Norman J.M., Handley S.A., Baldridge M.T., Droit L., Liu C.Y., Keller B.C., Kambal A., Monaco C.L., Zhao G., Fleshner P., et al. Disease-Specific Alterations in the Enteric Virome in Inflammatory Bowel Disease. Cell. 2015;160:447–460. doi: 10.1016/j.cell.2015.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Willner D., Furlan M., Haynes M., Schmieder R., Angly F.E., Silva J., Tammadoni S., Nosrat B., Conrad D., Rohwer F. Metagenomic Analysis of Respiratory Tract DNA Viral Communities in Cystic Fibrosis and Non-Cystic Fibrosis Individuals. PLoS ONE. 2009;4:e7370. doi: 10.1371/journal.pone.0007370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Santiago-Rodriguez T.M., Fornaciari G., Luciani S., Dowd S.E., Toranzos G.A., Marota I., Cano R.J. Natural Mummification of the Human Gut Preserves Bacteriophage DNA. FEMS Microbiol. Lett. 2015;363:fnv219. doi: 10.1093/femsle/fnv219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Wibowo M.C., Yang Z., Borry M., Hübner A., Huang K.D., Tierney B.T., Zimmerman S., Barajas-Olmos F., Contreras-Cubas C., García-Ortiz H., et al. Reconstruction of Ancient Microbial Genomes from the Human Gut. Nature. 2021;594:234–239. doi: 10.1038/s41586-021-03532-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Rampelli S., Turroni S., Schnorr S.L., Soverini M., Quercia S., Barone M., Castagnetti A., Biagi E., Gallinella G., Brigidi P., et al. Characterization of the Human DNA Gut Virome across Populations with Different Subsistence Strategies and Geographical Origin. Environ. Microbiol. 2017;19:4728–4735. doi: 10.1111/1462-2920.13938. [DOI] [PubMed] [Google Scholar]
- 128.Bédarida S., Dutour O., Buzhilova A.P., de Micco P., Biagini P. Identification of Viral DNA (Anelloviridae) in a 200-Year-Old Dental Pulp Sample (Napoleon’s Great Army, Kaliningrad, 1812) Infect. Genet. Evol. 2011;11:358–362. doi: 10.1016/j.meegid.2010.11.007. [DOI] [PubMed] [Google Scholar]
- 129.Holmes E.C. Freezing Viruses in Time. Proc. Natl. Acad. Sci. USA. 2014;111:16643–16644. doi: 10.1073/pnas.1419827111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Patterson Ross Z., Klunk J., Fornaciari G., Giuffra V., Duchêne S., Duggan A.T., Poinar D., Douglas M.W., Eden J.-S., Holmes E.C., et al. The Paradox of HBV Evolution as Revealed from a 16th Century Mummy. PLoS Pathog. 2018;14:e1006750. doi: 10.1371/journal.ppat.1006750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Barquera R., Lamnidis T.C., Lankapalli A.K., Kocher A., Hernández-Zaragoza D.I., Nelson E.A., Zamora-Herrera A.C., Ramallo P., Bernal-Felipe N., Immel A., et al. Origin and Health Status of First-Generation Africans from Early Colonial Mexico. Curr. Biol. 2020;30:2078–2091.e11. doi: 10.1016/j.cub.2020.04.002. [DOI] [PubMed] [Google Scholar]
- 132.Neukamm J., Pfrengle S., Molak M., Seitz A., Francken M., Eppenberger P., Avanzi C., Reiter E., Urban C., Welte B., et al. 2000-Year-Old Pathogen Genomes Reconstructed from Metagenomic Analysis of Egyptian Mummified Individuals. BMC Biol. 2020;18:108. doi: 10.1186/s12915-020-00839-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Kocher A., Papac L., Barquera R., Key F.M., Spyrou M.A., Hübler R., Rohrlach A.B., Aron F., Stahl R., Wissgott A., et al. Ten Millennia of Hepatitis B Virus Evolution. Science. 2021;374:182–188. doi: 10.1126/science.abi5658. [DOI] [PubMed] [Google Scholar]
- 134.Gray R.R., Tanaka Y., Takebe Y., Magiorkinis G., Buskell Z., Seeff L., Alter H.J., Pybus O.G. Evolutionary Analysis of Hepatitis C Virus Gene Sequences from 1953. Philos. Trans. R. Soc. B Biol. Sci. 2013;368:20130168. doi: 10.1098/rstb.2013.0168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Worobey M., Watts T.D., McKay R.A., Suchard M.A., Granade T., Teuwen D.E., Koblin B.A., Heneine W., Lemey P., Jaffe H.W. 1970s and “Patient 0” HIV-1 Genomes Illuminate Early HIV/AIDS History in North America. Nature. 2016;539:98–101. doi: 10.1038/nature19827. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Gryseels S., Watts T.D., Mpolesha J.M.K., Larsen B.B., Lemey P., Muyembe-Tamfum J.J., Teuwen D.E., Worobey M. A near Full-Length HIV-1 Genome from 1966 Recovered from Formalin-Fixed Paraffin-Embedded Tissue. Proc. Natl. Acad. Sci. USA. 2020;117:12222–12229. doi: 10.1073/pnas.1913682117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Fornaciari G., Zavaglia K., Giusti L., Vultaggio C., Ciranni R. Human Papillomavirus in a 16th Century Mummy. Lancet. 2003;362:1160. doi: 10.1016/S0140-6736(03)14487-X. [DOI] [PubMed] [Google Scholar]
- 138.Toppinen M., Perdomo M.F., Palo J.U., Simmonds P., Lycett S.J., Söderlund-Venermo M., Sajantila A., Hedman K. Bones Hold the Key to DNA Virus History and Epidemiology. Sci. Rep. 2015;5:17226. doi: 10.1038/srep17226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Li H.C., Fujiyoshi T., Lou H., Yashiki S., Sonoda S., Cartier L., Nunez L., Munoz I., Horai S., Tajima K. The Presence of Ancient Human T-Cell Lymphotropic Virus Type I Provirus DNA in an Andean Mummy. Nat. Med. 1999;5:1428–1432. doi: 10.1038/71006. [DOI] [PubMed] [Google Scholar]
- 140.Gessain A., Pecon-Slattery J., Meertens L., Mahieux R. Origins of HTLV-1 in South America (Letter 1) Nat. Med. 2000;6:232. doi: 10.1038/73020. [DOI] [PubMed] [Google Scholar]
- 141.Vandamme A.-M., Hall W.W., Lewis M.J., Goubau P., Salemi M. Origins of HTLV-1 in South America (Letter 2) Nat. Med. 2000;6:232–233. doi: 10.1038/73023. [DOI] [PubMed] [Google Scholar]
- 142.Reid A.H., Fanning T.G., Hultin J.V., Taubenberger J.K. Origin and Evolution of the 1918 “Spanish” Influenza Virus Hemagglutinin Gene. Proc. Natl. Acad. Sci. USA. 1999;96:1651–1656. doi: 10.1073/pnas.96.4.1651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Reid A.H., Fanning T.G., Janczewski T.A., Taubenberger J.K. Characterization of the 1918 “Spanish” Influenza Virus Neuraminidase Gene. Proc. Natl. Acad. Sci. USA. 2000;97:6785–6790. doi: 10.1073/pnas.100140097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Basler C.F., Reid A.H., Dybing J.K., Janczewski T.A., Fanning T.G., Zheng H., Salvatore M., Perdue M.L., Swayne D.E., García-Sastre A., et al. Sequence of the 1918 Pandemic Influenza Virus Nonstructural Gene (NS) Segment and Characterization of Recombinant Viruses Bearing the 1918 NS Genes. Proc. Natl. Acad. Sci. USA. 2001;98:2746–2751. doi: 10.1073/pnas.031575198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Reid A.H., Fanning T.G., Janczewski T.A., McCall S., Taubenberger J.K. Characterization of the 1918 “Spanish” Influenza Virus Matrix Gene Segment. J. Virol. 2002;76:10717–10723. doi: 10.1128/JVI.76.21.10717-10723.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Reid A.H., Fanning T.G., Janczewski T.A., Lourens R.M., Taubenberger J.K. Novel Origin of the 1918 Pandemic Influenza Virus Nucleoprotein Gene. J. Virol. 2004;78:12462–12470. doi: 10.1128/JVI.78.22.12462-12470.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 147.Taubenberger J.K., Reid A.H., Lourens R.M., Wang R., Jin G., Fanning T.G. Characterization of the 1918 Influenza Virus Polymerase Genes. Nature. 2005;437:889–893. doi: 10.1038/nature04230. [DOI] [PubMed] [Google Scholar]
- 148.Xiao Y.L., Kash J.C., Beres S.B., Sheng Z.M., Musser J.M., Taubenberger J.K. High-Throughput RNA Sequencing of a Formalin-Fixed, Paraffin-Embedded Autopsy Lung Tissue Sample from the 1918 Influenza Pandemic. J. Pathol. 2013;229:535–545. doi: 10.1002/path.4145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Düx A., Lequime S., Patrono L.V., Vrancken B., Boral S., Gogarten J.F., Hilbig A., Horst D., Merkel K., Prepoint B., et al. Measles Virus and Rinderpest Virus Divergence Dated to the Sixth Century BCE. Science. 2020;368:1367–1370. doi: 10.1126/science.aba9411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Larsen B.B., Cole K.L., Worobey M. Ancient DNA Provides Evidence of 27,000-Year-Old Papillomavirus Infection and Long-Term Codivergence with Rodents. Virus Evol. 2018;4:vey014. doi: 10.1093/ve/vey014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Calvignac S., Terme J.M., Hensley S.M., Jalinot P., Greenwood A.D., Hänni C. Ancient DNA Identification of Early 20th Century Simian T-Cell Leukemia Virus Type 1. Mol. Biol. Evol. 2008;25:1093–1098. doi: 10.1093/molbev/msn054. [DOI] [PubMed] [Google Scholar]
- 152.Castello J.D., Rogers S.O., Starmer W.T., Catranis C.M., Ma L., Bachand G.D., Zhao Y., Smith J.E. Detection of Tomato Mosaic Tobamovirus RNA in Ancient Glacial Ice. Polar Biol. 1999;22:207–212. doi: 10.1007/s003000050411. [DOI] [Google Scholar]
- 153.Biagini P., Thèves C., Balaresque P., Géraut A., Cannet C., Keyser C., Nikolaeva D., Gérard P., Duchesne S., Orlando L., et al. Variola Virus in a 300-Year-Old Siberian Mummy. N. Engl. J. Med. 2012;367:2057–2059. doi: 10.1056/NEJMc1208124. [DOI] [PubMed] [Google Scholar]
- 154.Meffray A., Ardagna Y., Sillano B., Parmentier S., Pouget B., Signoli M., Biagini P. Variola Virus DNA in Skeletal Remains, 17th to 18th Centuries, Southeastern France. Clin. Microbiol. Infect. 2021;27:1871–1872. doi: 10.1016/j.cmi.2021.09.015. [DOI] [PubMed] [Google Scholar]
- 155.Duggan A.T., Perdomo M.F., Piombino-Mascali D., Marciniak S., Poinar D., Emery M.V., Buchmann J.P., Duchêne S., Jankauskas R., Humphreys M., et al. 17th Century Variola Virus Reveals the Recent History of Smallpox. Curr. Biol. 2016;26:3407–3412. doi: 10.1016/j.cub.2016.10.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Smithson C., Imbery J., Upton C. Re-Assembly and Analysis of an Ancient Variola Virus Genome. Viruses. 2017;9:253. doi: 10.3390/v9090253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 157.Pajer P., Dresler J., Kabíckova H., Písa L., Aganov P., Fucik K., Elleder D., Hron T., Kuzelka V., Velemínsky P., et al. Characterization of Two Historic Smallpox Specimens from a Czech Museum. Viruses. 2017;9:200. doi: 10.3390/v9080200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Porter A.F., Duggan A.T., Poinar H.N., Holmes E.C. Comment: Characterization of Two Historic Smallpox Specimens from a Czech Museum. Viruses. 2017;9:276. doi: 10.3390/v9100276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 159.Ferrari G., Neukamm J., Baalsrud H.T., Breidenstein A.M., Ravinet M., Phillips C., Rühli F., Bouwman A., Schuenemann V.J. Variola Virus Genome Sequenced from an Eighteenth-Century Museum Specimen Supports the Recent Origin of Smallpox: 18th Century Variola Virus Genome. Philos. Trans. R. Soc. B Biol. Sci. 2020;375:20190572. doi: 10.1098/rstb.2019.0572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 160.Babkin I.V., Babkina I.N., Tikunova N.V. An Update of Orthopoxvirus Molecular Evolution. Viruses. 2022;14:388. doi: 10.3390/v14020388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 161.Tumpey T.M., Basler C.F., Aguilar V.P., Basler C.F., Aguilar P.V., Zeng H., Solórzano A., Swayne D.E., Cox N.J., Katz J.M., et al. Characterization of the Reconstructed 1918 Spanish Influenza Pandemic Virus. Science. 2005;310:77–80. doi: 10.1126/science.1119392. [DOI] [PubMed] [Google Scholar]
- 162.Bartlett M.S. Measles Periodicity and Community Size. J. R. Stat. Soc. Ser. A. 1957;120:48. doi: 10.2307/2342553. [DOI] [Google Scholar]
- 163.Black F.L. Measles Endemicity in Insular Populations: Critical Community Size and Its Evolutionary Implication. J. Theor. Biol. 1966;11:207–211. doi: 10.1016/0022-5193(66)90161-5. [DOI] [PubMed] [Google Scholar]
- 164.Keeling M.J., Grenfell B.T. Disease Extinction and Community Size: Modeling the Persistence of Measles. Science. 1997;275:65–67. doi: 10.1126/science.275.5296.65. [DOI] [PubMed] [Google Scholar]
- 165.Littlejohn M., Locarnini S., Yuen L. Origins and Evolution of Hepatitis B Virus and Hepatitis D Virus. Cold Spring Harb. Perspect. Med. 2016;6:a021360. doi: 10.1101/cshperspect.a021360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 166.Zhou Y., Holmes E.C. Bayesian Estimates of the Evolutionary Rate and Age of Hepatitis B Virus. J. Mol. Evol. 2007;65:197–205. doi: 10.1007/s00239-007-0054-1. [DOI] [PubMed] [Google Scholar]
- 167.Paraskevis D., Angelis K., Magiorkinis G., Kostaki E., Ho S.Y.W., Hatzakis A. Dating the Origin of Hepatitis B Virus Reveals Higher Substitution Rate and Adaptation on the Branch Leading to F/H Genotypes. Mol. Phylogenet. Evol. 2015;93:44–54. doi: 10.1016/j.ympev.2015.07.010. [DOI] [PubMed] [Google Scholar]
- 168.Zehender G., Ebranati E., Gabanelli E., Sorrentino C., Presti A.L., Tanzi E., Ciccozzi M., Galli M. Enigmatic Origin of Hepatitis B Virus: An Ancient Travelling Companion or a Recent Encounter? World J. Gastroenterol. 2014;20:7622–7634. doi: 10.3748/wjg.v20.i24.7622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 169.Ho S.Y.W., Duchêne S., Molak M., Shapiro B. Time-Dependent Estimates of Molecular Evolutionary Rates: Evidence and Causes. Mol. Ecol. 2015;24:6007–6012. doi: 10.1111/mec.13450. [DOI] [PubMed] [Google Scholar]
- 170.Firth C., Kitchen A., Shapiro B., Suchard M.A., Holmes E.C., Rambaut A. Using Time-Structured Data to Estimate Evolutionary Rates of Double-Stranded DNA Viruses. Mol. Biol. Evol. 2010;27:2038–2051. doi: 10.1093/molbev/msq088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 171.Sanjuán R., Domingo-Calap P. Mechanisms of Viral Mutation. Cell. Mol. Life Sci. 2016;73:4433–4448. doi: 10.1007/s00018-016-2299-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 172.Perelson A.S. Modelling Viral and Immune System Dynamics. Nat. Rev. Immunol. 2002;2:28–36. doi: 10.1038/nri700. [DOI] [PubMed] [Google Scholar]
- 173.Irwin K.K., Renzette N., Kowalik T.F., Jensen J.D. Antiviral Drug Resistance as an Adaptive Process. Virus Evol. 2016;2:vew014. doi: 10.1093/ve/vew014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 174.Volz E., Hill V., McCrone J.T., Price A., Jorgensen D., O’Toole Á., Southgate J., Johnson R., Jackson B., Nascimento F.F., et al. Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity. Cell. 2021;184:64–75. doi: 10.1016/j.cell.2020.11.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 175.Gojobori T., Moriyama E.N., Kimura M. Molecular Clock of Viral Evolution, and the Neutral Theory. Proc. Natl. Acad. Sci. USA. 1990;87:10015–10018. doi: 10.1073/pnas.87.24.10015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 176.Jenkins G.M., Rambaut A., Pybus O.G., Holmes E.C. Rates of Molecular Evolution in RNA Viruses: A Quantitative Phylogenetic Analysis. J. Mol. Evol. 2002;54:156–165. doi: 10.1007/s00239-001-0064-3. [DOI] [PubMed] [Google Scholar]
- 177.Sanjuán R. From Molecular Genetics to Phylodynamics: Evolutionary Relevance of Mutation Rates across Viruses. PLoS Pathog. 2012;8:e1002685. doi: 10.1371/journal.ppat.1002685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 178.Duffy S., Shackelton L.A., Holmes E.C. Rates of Evolutionary Change in Viruses: Patterns and Determinants. Nat. Rev. Genet. 2008;9:267–276. doi: 10.1038/nrg2323. [DOI] [PubMed] [Google Scholar]
- 179.Wertheim J.O., Kosakovsky Pond S.L. Purifying Selection Can Obscure the Ancient Age of Viral Lineages. Mol. Biol. Evol. 2011;28:3355–3365. doi: 10.1093/molbev/msr170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 180.Ho S.Y.W., Lanfear R., Bromham L., Phillips M.J., Soubrier J., Rodrigo A.G., Cooper A. Time-Dependent Rates of Molecular Evolution. Mol. Ecol. 2011;20:3087–3101. doi: 10.1111/j.1365-294X.2011.05178.x. [DOI] [PubMed] [Google Scholar]
- 181.Duchêne S., Holmes E.C., Ho S.Y.W. Analyses of Evolutionary Dynamics in Viruses Are Hindered by a Time-Dependent Bias in Rate Estimates. Proc. R. Soc. B Biol. Sci. 2014;281:20140732. doi: 10.1098/rspb.2014.0732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 182.Aiewsakun P., Katzourakis A. Time-Dependent Rate Phenomenon in Viruses. J. Virol. 2016;90:7184–7195. doi: 10.1128/JVI.00593-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 183.Ghafari M., Simmonds P., Pybus O.G., Katzourakis A. A Mechanistic Evolutionary Model Explains the Time-Dependent Pattern of Substitution Rates in Viruses. Curr. Biol. 2021;31:4689–4696. doi: 10.1016/j.cub.2021.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 184.Lythgoe K.A., Gardner A., Pybus O.G., Grove J. Short-Sighted Virus Evolution and a Germline Hypothesis for Chronic Viral Infections. Trends Microbiol. 2017;25:336–348. doi: 10.1016/j.tim.2017.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 185.Harrison A., Lemey P., Hurles M., Moyes C., Horn S., Pryor J., Malani J., Supuri M., Masta A., Teriboriki B., et al. Genomic Analysis of Hepatitis B Virus Reveals Antigen State and Genotype as Sources of Evolutionary Rate Variation. Viruses. 2011;3:83–101. doi: 10.3390/v3020083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 186.Lin Y.-Y., Liu C., Chien W.-H., Wu L.-L., Tao Y., Wu D., Lu X., Hsieh C.-H., Chen P.-J., Wang H.-Y., et al. New Insights into the Evolutionary Rate of Hepatitis B Virus at Different Biological Scales. J. Virol. 2015;89:3512–3522. doi: 10.1128/JVI.03131-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 187.Lythgoe K.A., Fraser C. New Insights into the Evolutionary Rate of HIV-1 at the within-Host and Epidemiological Levels. Proc. R. Soc. B Biol. Sci. 2012;279:3367–3375. doi: 10.1098/rspb.2012.0595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 188.Aiewsakun P., Katzourakis A. Time Dependency of Foamy Virus Evolutionary Rate Estimates. BMC Evol. Biol. 2015;15:119. doi: 10.1186/s12862-015-0408-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 189.Holmes E.C., Duchêne S. Evolutionary Stasis of Viruses? Nat. Rev. Microbiol. 2019;17:329. doi: 10.1038/s41579-019-0168-7. [DOI] [PubMed] [Google Scholar]
- 190.Emerson B.C., Hickerson M.J. Lack of Support for the Time-Dependent Molecular Evolution Hypothesis. Mol. Ecol. 2015;24:702–709. doi: 10.1111/mec.13070. [DOI] [PubMed] [Google Scholar]
- 191.Rambaut A., Lam T.T., Carvalho L.M., Pybus O.G. Exploring the Temporal Structure of Heterochronous Sequences Using TempEst (Formerly Path-O-Gen) Virus Evol. 2016;2:vew007. doi: 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 192.Bos K.I., Kühnert D., Herbig A., Esquivel-Gomez L.R., Andrades Valtueña A., Barquera R., Giffin K., Kumar Lankapalli A., Nelson E.A., Sabin S., et al. Paleomicrobiology: Diagnosis and Evolution of Ancient Pathogens. Annu. Rev. Microbiol. 2019;73:639–666. doi: 10.1146/annurev-micro-090817-062436. [DOI] [PubMed] [Google Scholar]
- 193.Tong K.J., Duchêne D.A., Duchêne S., Geoghegan J.L., Ho S.Y.W. A Comparison of Methods for Estimating Substitution Rates from Ancient DNA Sequence Data. BMC Evol. Biol. 2018;18:70. doi: 10.1186/s12862-018-1192-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 194.Drummond A.J., Pybus O.G., Rambaut A., Forsberg R., Rodrigo A.G. Measurably Evolving Populations. Trends Ecol. Evol. 2003;18:481–488. doi: 10.1016/S0169-5347(03)00216-7. [DOI] [Google Scholar]
- 195.Korber B., Muldoon M., Theiler J., Gao F., Gupta R., Lapedes A., Hahn B.H., Wolinsky S., Bhattacharya T. Timing the Ancestor of the HIV-1 Pandemic Strains. Science. 2000;288:1789–1796. doi: 10.1126/science.288.5472.1789. [DOI] [PubMed] [Google Scholar]
- 196.Rieux A., Balloux F. Inferences from Tip-Calibrated Phylogenies: A Review and a Practical Guide. Mol. Ecol. 2016;25:1911–1924. doi: 10.1111/mec.13586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 197.Duchêne S., Duchêne D., Holmes E.C., Ho S.Y.W. The Performance of the Date-Randomization Test in Phylogenetic Analyses of Time-Structured Virus Data. Mol. Biol. Evol. 2015;32:1895–1906. doi: 10.1093/molbev/msv056. [DOI] [PubMed] [Google Scholar]
- 198.Roux S., Enault F., Hurwitz B.L., Sullivan M.B. VirSorter: Mining Viral Signal from Microbial Genomic Data. PeerJ. 2015;2015:e985. doi: 10.7717/peerj.985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 199.Guo J., Bolduc B., Zayed A.A., Varsani A., Dominguez-Huerta G., Delmont T.O., Pratama A.A., Gazitúa M.C., Vik D., Sullivan M.B., et al. VirSorter2: A Multi-Classifier, Expert-Guided Approach to Detect Diverse DNA and RNA Viruses. Microbiome. 2021;9:37. doi: 10.1186/s40168-020-00990-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 200.Amgarten D., Braga L.P.P., da Silva A.M., Setubal J.C. MARVEL, a Tool for Prediction of Bacteriophage Sequences in Metagenomic Bins. Front. Genet. 2018;9:304. doi: 10.3389/fgene.2018.00304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 201.Ren J., Ahlgren N.A., Lu Y.Y., Fuhrman J.A., Sun F. VirFinder: A Novel k-Mer Based Tool for Identifying Viral Sequences from Assembled Metagenomic Data. Microbiome. 2017;5:69. doi: 10.1186/s40168-017-0283-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 202.Al-Shayeb B., Sachdeva R., Chen L.X., Ward F., Munk P., Devoto A., Castelle C.J., Olm M.R., Bouma-Gregson K., Amano Y., et al. Clades of Huge Phages from across Earth’s Ecosystems. Nature. 2020;578:425–431. doi: 10.1038/s41586-020-2007-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 203.Sugimoto R., Nishimura L., Nguyen P.T., Ito J., Parrish N.F., Mori H., Kurokawa K., Nakaoka H., Inoue I. Comprehensive Discovery of CRISPR-Targeted Terminally Redundant Sequences in the Human Gut Metagenome: Viruses, Plasmids, and More. PLoS Comput. Biol. 2021;17:e1009428. doi: 10.1371/journal.pcbi.1009428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 204.Mojica F.J.M., Díez-Villaseñor C., García-Martínez J., Almendros C. Short Motif Sequences Determine the Targets of the Prokaryotic CRISPR Defence System. Microbiology. 2009;155:733–740. doi: 10.1099/mic.0.023960-0. [DOI] [PubMed] [Google Scholar]
- 205.Barrangou R., Fremaux C., Deveau H., Richards M., Boyaval P., Moineau S., Romero D.A., Horvath P. CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes. Science. 2007;315:1709–1712. doi: 10.1126/science.1138140. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data presented in this manuscript are available in Table 1.