Skip to main content
eLife logoLink to eLife
. 2017 Jan 10;6:e20983. doi: 10.7554/eLife.20983

A molecular portrait of maternal sepsis from Byzantine Troy

Alison M Devault 1,2,, Tatum D Mortimer 3,4,, Andrew Kitchen 5, Henrike Kiesewetter 6, Jacob M Enk 1,2, G Brian Golding 7, John Southon 8, Melanie Kuch 1, Ana T Duggan 1, William Aylward 9,10, Shea N Gardner 11, Jonathan E Allen 11, Andrew M King 12, Gerard Wright 12, Makoto Kuroda 13, Kengo Kato 13, Derek EG Briggs 14, Gino Fornaciari 15, Edward C Holmes 16, Hendrik N Poinar 1,7,12,17,*, Caitlin S Pepperell 3,9,18,*
Editor: George H Perry19
PMCID: PMC5224923  PMID: 28072390

Abstract

Pregnancy complications are poorly represented in the archeological record, despite their importance in contemporary and ancient societies. While excavating a Byzantine cemetery in Troy, we discovered calcified abscesses among a woman’s remains. Scanning electron microscopy of the tissue revealed ‘ghost cells’, resulting from dystrophic calcification, which preserved ancient maternal, fetal and bacterial DNA of a severe infection, likely chorioamnionitis. Gardnerella vaginalis and Staphylococcus saprophyticus dominated the abscesses. Phylogenomic analyses of ancient, historical, and contemporary data showed that G. vaginalis Troy fell within contemporary genetic diversity, whereas S. saprophyticus Troy belongs to a lineage that does not appear to be commonly associated with human disease today. We speculate that the ecology of S. saprophyticus infection may have differed in the ancient world as a result of close contacts between humans and domesticated animals. These results highlight the complex and dynamic interactions with our microbial milieu that underlie severe maternal infections.

DOI: http://dx.doi.org/10.7554/eLife.20983.001

Research Organism: Human, Other

eLife digest

Why and how have some bacteria evolved to cause illness in humans? One way to study bacterial evolution is to search for ancient samples of bacteria and use DNA sequencing technology to investigate how modern bacteria have changed from their ancestors. Understanding the evolution process may help researchers to understand how some bacteria become resistant to the antibiotics designed to kill them.

Complications that occur during pregnancy, including bacterial infections, have long been a major cause of death for women. Now, Devault, Mortimer et al. have been able to sequence the DNA of bacteria found in tissue collected from a woman buried 800 years ago in a cemetery in Troy. Some of the woman’s tissues had been well preserved because they had calcified (probably as the result of infection), which preserved their structure in a mineralized layer. Two mineralized “nodules” in the body appear to be the remains of abscesses. Some of the human DNA in the nodules came from a male, suggesting that the woman was pregnant with a boy and that the abscesses formed in placental tissue.

Sequencing the DNA of the bacteria in the abscess allowed Devault, Mortimer et al. to diagnose the woman’s infection, which was caused by two types of bacteria. One species, called Gardnerella vaginalis, is found in modern pregnancy-related infections. The DNA of the ancient samples was similar to that of modern bacteria. The other bacteria species was an ancient form of Staphylococcus saprophyticus, a type of bacteria that causes urinary tract infections. However, the DNA of the ancient S. saprophyticus bacteria is quite different to that of the bacteria found in modern humans. Instead, their DNA sequence appears more similar to forms of the bacteria that infect currently livestock. As humans lived closely with their livestock at the time the woman lived, her infection may be due to a type of bacteria that passed easily between humans and animals.

Overall, the results suggest that the disease-causing properties of bacteria can arise from a wide range of sources. In addition, Devault, Mortimer et al. have demonstrated that certain types of tissue found in archeological remains are a potential gold mine of information about the evolution of bacteria and other microbes found in the human body.

DOI: http://dx.doi.org/10.7554/eLife.20983.002

Introduction

During excavations of a Late Byzantine era cemetery at the periphery of the ancient city of Troy, Anatolia (in present day Turkey) (Figure 1—figure supplement 1), we discovered two calcified nodules among a woman’s remains. The woman was estimated to be 30 (±5y) at the time of death (Appendix). She was found alone in a stone-lined grave (Figure 1A) within the graveyard of a farming community (Kiesewetter, 2014). The nodules, which are 2–3 cm in diameter and composed of concentric layers (Figure 1B), were discovered at the base of the ribs. Radiocarbon dating of the decedent’s ulna yielded 790-860y BP (Supplementary file 1A), in agreement with the archeological assessment of the age of the cemetery (early 13th century AD, Appendix).

Figure 1. Calcified nodule found among the skeletal remains at Troy.

(A) Burial x24.177 (grave 14, cemetery in quadrat x24). Photo credit Gebhard Bieg, 2005. (B) Cross-section of nodule (sample no x24.177), photo credit: Pathologie Nordhessen 2009. Scale represents 1 cm. (C) Location of Troy. Modern day Turkey is shaded in gray.

DOI: http://dx.doi.org/10.7554/eLife.20983.003

Figure 1.

Figure 1—figure supplement 1. Map of Troy showing the cemetery in Grid Square x24 and areas of excavation 1988–2012.

Figure 1—figure supplement 1.

Areas of excavation are in gray and the cemetery is marked with a red square. North is at the top of the plan.

Figure 1—figure supplement 2. Metagenomic profiles of shotgun DNA libraries from nodules, based on BLAST analysis of all reads >35 bp length.

Figure 1—figure supplement 2.

(A) Nodule one (Nod1_1h-UDG), 28,713,282 reads total (B) Nodule two (Nod2-UDG), 6,038,994 reads total.

Figure 1—figure supplement 3. Fragment length distributions for non-UDG treated human mitochondrial assemblies.

Figure 1—figure supplement 3.

These FLDs were generated from the Ulna enriched libraries (A + B) as well as the non enriched nodule (C) using mapDamage2 (Jonsson et al., 2013) from merged nonUDG data sets assembled to the human mitochondrial reference genome (Andrews et al., 1999), NCBI accession NC_012920.

Figure 1—figure supplement 4. Ancient DNA damage assessment of human mitochondrial reads.

Figure 1—figure supplement 4.

Damage profiles of non-UDG treated (‘nonU’) as well as UDG treated merged reads assembled to the human mitochondrial rCRS reference genome (NC_012920) for (A) Ulna_Enr1-nonU round one human mitochondrial enrichment, (B) Ulna_Enr2-nonU round 2, and (C) a Nod1_1h-UDG reads (which have been UDG treated). Damage profiles were generated using mapDamage2.(Jonsson et al., 2013).

Figure 1—figure supplement 5. Ancient DNA damage assessment of reads mapped to hg38 chrX, chrY and autosomes.

Figure 1—figure supplement 5.

Damage profiles generated using mapDamage2 (Jonsson et al., 2013) of non-UDG treated (‘nonU’) reads from the NOD1_nonU and NOD2_nonU data set (total of 1,468,381 trimmed and merged reads) with minimum 35 bp length and map quality 30, mapping to (A) hg38 chrX, and (B) hg38 chrY and C) hg38 autosomes.

Figure 1—figure supplement 6. Haplogroup U3 Bayesian Maximum Clade Credibility tree.

Figure 1—figure supplement 6.

Complete human mtDNA genomes assigned to haplogroup U3 (n = 137) were collected from GenBank and aligned with the Troy consensus sequence (highlighted in red). Tree was generated using BEAST v 1.856 and TreeAnnotator.57 Posterior probabilities are shown at nodes.

Figure 1—figure supplement 7. Heatmap of most common taxa in metagenomic samples.

Figure 1—figure supplement 7.

The heatmap gives the log of the frequency of the most common taxa in each sample along the diagonal (if the most frequent is already shown, then second most frequent is added for that sample; Nares and Ear, Nod1_1h-UDG and Nod2-UDG). The taxa in order are - 76775, Malassezia restricta; 76773, Malassezia globosa; 729, Haemophilus parainfluenzae; 60133, Prevotella pallens; 28117, Alistipes putredinis; 47770, Lactobacillus crispatus; 562, Escherichia coli; 487, Neisseria meningitidis; 2001, Streptosporangium roseum; 29385, Staphylococcus saprophyticus; 2702, Gardnerella vaginalis.

Figure 1—figure supplement 8. PCA of Human Microbiome Project and ancient metagenomic taxa.

Figure 1—figure supplement 8.

Taxa were identified using LMAT. PCA performed using prcomp function in R. Legend indicates the origin of the category and the number of samples combined into each category. The first principal component axis separates the placental and ancient samples from the remaining samples. The second principal component axis separates the Sediment-UDG and Ulna-UDG data sets, which likely contain soil contamination, from the remaining samples.

Figure 1—figure supplement 9. Sketch of skeletal preservation.

Figure 1—figure supplement 9.

Nodule one (Figure 2—figure supplement 1, Supplementary file 1B) is primarily composed of two phosphate phases, hydroxylapatite (bioapatite) and whitlockite (as well as small amounts of calcite), both of which have been found in human calcified pathological concretions (Lagier and Baud, 2003). Based on their size and concentric layered structure, the nodules could be urinary stones. However, struvite (magnesium ammonium phosphate) and calcium oxalate, common constituents of urinary stones, were absent in both XRD and SEM-EDS analyses (Supplementary file 1B-D). SEM of the nodules (Figure 2, Figure 2—figure supplement 2) revealed aggregates of spherical structures with dimensions typical of bacterial cells, as well as extracellular polymeric substances (EPS – a glycocalyx secreted by the cells during biofilm formation [Decho and Thiel, 2011]).

Figure 2. SEM image of nodule at (A) 2000x and (B) 20,000x magnification.

Bacterial cells indicated with red arrow are between ~1 µm and 2 µm (within range expected for Staphylococcus). Extracellular polymeric substances (EPS) are indicated by yellow arrows.

DOI: http://dx.doi.org/10.7554/eLife.20983.013

Figure 2.

Figure 2—figure supplement 1. XRD analysis of nodule.

Figure 2—figure supplement 1.

A) Video alignment and XRD frames. Left; crosshairs indicate the center of the region examined. Right; the four frames collected to obtain a 2θ range of 8–103°. At a low angle, air scatter from the main beam is evident. B) Background subtracted powder pattern. C) Search/match results.

Figure 2—figure supplement 2. SEM image of nodule at 10,000x magnification.

Figure 2—figure supplement 2.

Possible inflammatory (neutrophils) cells indicated by blue arrows.

We extracted DNA from both nodules and made Uracil DNA Glycosylase (UDG) and non-UDG treated dsDNA libraries. Shotgun sequences from all libraries yielded astonishingly high proportions of endogenous human and bacterial DNA: 24–48% human, 37–66% S. saprophyticus, and 5–7% G. vaginalis (Figure 1—figure supplements 25).

From these data, we reconstructed a human mitochondrial genome at 30.1x unique read depth, the consensus of which belongs to haplotype U3b3. In phylogenetic analyses of the mitogenome from Troy and modern mitogenomes, the Troy sample groups most closely with those from the Caucasus and Middle East, both of which were within the eastern limits of Late Byzantine influence (Figure 1—figure supplement 6).

To investigate whether the nodules belonged to the associated female individual, we extracted DNA from her ulna, constructed a dsDNA library, enriched for, sequenced, and reconstructed the mitogenome to an average unique coverage depth of 30.8x. The nodule and the ulna share the identical mitochondrial haplotype (Supplementary file 1E), indicating that they stem from the same individual or a maternal relative.

The metagenomic profile of the nodules suggests they derive from an amalgam of human and bacterial cells, as in an abscess. The high concentration of S. saprophyticus and G. vaginalis DNA suggests an origin in genitourinary tissue. To exclude an exogenous environmental source of the bacterial DNA and to further investigate the tissue of origin, we performed metagenomic profiling of the nodules, ulna and sediment from the gravesite. The similarity in abundance of G. vaginalis in the nodules and modern Human Microbiome Panel (HMP) vaginal samples (Figure 1—figure supplement 7) points to a likely origin for the nodules in the female reproductive tract. The metagenomic profile of the nodules (minus their associated blanks) is distinct from the sediment, whereas the reads from the ulna group closely with the sediment sample (Figure 1—figure supplement 8). These results indicate that the nodules were less prone to leaching of environmental DNA. Our SEM-EDS and XRD findings suggest that bacterial and inflammatory cells were replicated in calcium phosphate minerals (‘ghost cells’); it is likely that this mineralization provided a remarkable degree of protection from DNA degradation and environmental leaching as seen in the bones. The ectopic, inflammation-related calcification observed here is an apparently highly effective mechanism of bacterial fossilization that rivals mineralization occurring at much slower rates in the environment.

Sexing analyses of the remains (and associated blanks) using the method of Skoglund et al. (2013) assign the nodules as female -XX (Supplementary file 1F). More thorough analyses of the human DNA present in the nodules yielded an intriguing finding that helps pinpoint their tissue of origin. Shotgun sequencing data from the nodules contained a small number of reads (884) conservatively mapping to the Y chromosome (Supplementary file 1G). The length distributions of the reads overlapped with those mapping to the X chromosome and autosomes, suggesting an endogenous, ancient origin (Supplementary file 1G, Figure 1—figure supplement 5); we searched for but did not find similar bona fide ancient Y chromosome reads in sequence data from the ulna, the sediment, or any negative control (Supplementary file 1G). The presence of Y chromosome reads in the nodule but not in the ulna could be explained by a placental origin of the mineralized abscesses, indicating chorioamnionitis in the decedent while pregnant with a male fetus. Chorioamnionitis – inflammation and infection of the placenta and fetal membranes – involves an inflammatory response on the part of the fetus as well as the mother (Kraus et al., 2004), which would explain a female (maternal) origin of the nodular tissue with a minority male (fetal) component.

Chorioamnionitis is a mixed infection in which vaginal bacteria reach the upper reproductive tract, placenta, and fetal membranes; G. vaginalis is often identified in infected tissues (Hillier et al., 1988). S. saprophyticus can be found among the genitourinary and gastrointestinal flora of healthy women (Ringertz and Torssander, 1986; Rupp et al., 1992; Schneider and Riley, 1996). It is a common cause of urinary tract infection (UTI) in reproductive aged women (Kahlmeter, 2003) and has also been known to cause puerperal infections (Arianpour et al., 2009).

To gain further insights into the pathogens associated with this historical genitourinary infection, we pooled reads from all nodule DNA libraries, mapped and reconstructed the ancient S. saprophyticus and G. vaginalis genomes and analyzed them in conjunction with existing and newly acquired genomic data from extant and historical organisms (Supplementary file 1H,I).

We used a combination of paired-end reference guided assembly and iterative assembly to reconstruct a nearly complete genome of S. saprophyticus Troy, including >100 Kb of novel sequence compared to reference strain ATCC 15305. The genome is 2,471,881 bp long, with an average unique coverage depth of 298.6x (Figure 4—figure supplement 3), which represents an unprecedented, detailed and complete picture of an ancient pathogen genome from shotgun sequencing data. We also reconstructed a 22.6 Kb plasmid, pSST1.

We were unable to reconstruct a contiguous G. vaginalis genome due to high variability in coverage and lack of synteny in both ancient and modern genomic data (Ahmed et al., 2012). Instead, we used a de novo approach to reconstruct G. vaginalis Troy gene content using reads that mapped to the annotated coding regions of all available G. vaginalis genomes. This enabled us to assess the gene content of our ancient genome compared to the modern strains. Using this method, we recovered 1187 unique contigs (total length 1,435,761 bp) corresponding to 972 annotated genes and an average unique coverage depth of 57.0x (Figure 3—figure supplement 3).

Our sample of 35 isolates of G. vaginalis was grouped into four previously defined clades (Figure 3, Figure 3—figure supplement 4), which have been proposed to represent distinct species (Ahmed et al., 2012). G. vaginalis Troy sits within Clade 1, among vaginal and endometrial isolates collected from both healthy women and patients with bacterial vaginosis. Interestingly, the 800-year-old sample from Troy (Turkey) falls within contemporary genetic diversity (Supplementary file 1I).

Figure 3. Phylogenetic analysis of Gardnerella vaginalis.

A maximum likelihood tree estimated using RAxML(Stamatakis, 2014) (Figure 3—source data 3) from a core alignment of G. vaginalis genomes (Figure 3—source data 1, Figure 3—source data 2). Branches are colored based on clades originally identified in Ahmed et al.(Ahmed et al., 2012) (green = clade 1, blue = clade 2, red = clade 3, purple = clade 4). Tips from modern G. vaginalis isolates are labeled based on sample source (H = healthy vagina, BV = bacterial vaginosis, HE = healthy endometrium, E = endometrium, U = unknown). Lighter colored branches have bootstrap values less than 100. Clinical phenotypes are interspersed throughout the phylogeny, and the Troy genome is not associated with a consistently pathogenic lineage of G. vaginalis. Inset: Recombinant fragments in G. vaginalis core genome identified by BratNextGen (Figure 3—source data 4) (Marttinen, 2012). Each circle represents one genome. Colored blocks represent recombinant fragments, and colors correspond to the clade designations in the phylogenetic tree. Plot made with Circos (Krzywinski et al., 2009).

DOI: http://dx.doi.org/10.7554/eLife.20983.016

Figure 3—source data 1. Concatenated alignment of core genes in G. vaginalis.
DOI: 10.7554/eLife.20983.017
Figure 3—source data 2. G. vaginalis core genome alignment trimmed with Gblocks.
DOI: 10.7554/eLife.20983.018
Figure 3—source data 3. Maximum likelihood phylogenetic analysis of trimmed G. vaginalis alignment with RAxML.
DOI: 10.7554/eLife.20983.019
Figure 3—source data 4. Recombinant fragments detected with BratNextGen in trimmed G. vaginalis alignment.
DOI: 10.7554/eLife.20983.020

Figure 3.

Figure 3—figure supplement 1. Ancient DNA damage assessment of G. vaginalis.

Figure 3—figure supplement 1.

Damage profiles of non-UDG treated (‘nonU’) reads from a pooled NOD1_nonU and NOD2_nonU data set (total of 1,565,548 trimmed reads >24 bp) mapping to G. vaginalis strain ATCC 14019. Paired end reads were mapped using bwa (Li and Durbin, 2009) with default settings and duplicates were removed with samtools rmdup (Li et al., 2009). Damage profiles were generated using mapDamage2 (Jonsson et al., 2013).

Figure 3—figure supplement 2. Fragment length distribution (FLD) for G. vaginalis ATCC 14019.

Figure 3—figure supplement 2.

All nodule shotgun libraries (Nod1_1h-UDG, Nod1_1h-nonU, Nod2-UDG, Nod2-nonU) were pooled, reads were restricted to a minimum length of 35 bp and mapping quality of 30 and all duplicates removed both within and between libraries. The fragment length distribution of the remaining 1,658,978 reads was visualized using mapDamage2 (Jonsson et al., 2013).

Figure 3—figure supplement 3. Genome coverage plots for pooled nodule shotgun libraries.

Figure 3—figure supplement 3.

G. vaginalis (NC_014644), average coverage 57.0X. All reads were restricted to minimum length of 35 bp and minimum map quality 30 with all duplicates removed. Figures depict coverage of the genome in 100 bp blocks across references. Concentric grey circles demarcate increments of 50X coverage in both plots.

Figure 3—figure supplement 4. Neighbor net network of core genomes.

Figure 3—figure supplement 4.

The network created in SplitsTree v 4 (Huson and Bryant, 2006) of Gardnerella vaginalis. The networks recapitulate the structure of maximum likelihood tree (Figure 3).

Consistent with prior reports (Ahmed et al., 2012), we identified extensive impacts of lateral gene transfer (LGT) on G. vaginalis diversity (Figure 3). Even in the core genome alignment, which contains just 44% of per-isolate gene content, we estimate that 20% of sites have been affected by recombination. This high rate of recombination may help to explain the remarkable preservation of genetic diversity in G. vaginalis. A recent study of Helicobacter pylori, which has similarly high rates of LGT, found that genetic diversity within the species has been preserved for more than five thousand years (Maixner et al., 2016).

We discovered two distinct clades of S. saprophyticus (Figure 4, Figure 4—figure supplement 4), one of which (Clade P) appears to be more strongly associated with pathogenicity and includes our ancient S. saprophyticus Troy. Nineteen of twenty veterinary and human clinical isolates belong to Clade P, an association that was statistically significant (Appendix). A second clade (Clade E) is made up of food and environmental isolates of S. saprophyticus, as well as a human UTI isolate from Japan.

Figure 4. Phylogenetic analysis of Staphylococcus saprophyticus.

(A) Maximum likelihood tree estimated using RAxML (Stamatakis, 2014) (Figure 4—source data 3) from an alignment of S. saprophyticus genomes (Figure 4—source data 1, Figure 4—source data 2). Bootstrap values less than 100 are labeled. Silhouettes indicate bacterial sample source. Isolates without silhouettes are from human clinical samples isolated from urine. Color corresponds to country of isolation as seen on the map. Full sample descriptions are in Supplementary file 1H. (B) Source countries of bacterial samples. (C) Neighbor-net network of S. saprophyticus plasmid sequences (Figure 4—source data 4) related to pSST1 created in SplitsTree4 (Huson and Bryant, 2006). The boxed inset is an enlarged version of the portion of the network from Clade P isolates. Some S. saprophyticus isolates do not encode pSST1-like plasmids, and therefore, they are not included in the network. Starts and stops of recombinant regions of the alignment can be found in Figure 4—source data 5.

DOI: http://dx.doi.org/10.7554/eLife.20983.025

Figure 4—source data 1. S. saprophyticus whole genome alignment.
DOI: 10.7554/eLife.20983.026
Figure 4—source data 2. S. saprophyticus whole genome alignment trimmed with trimal.
DOI: 10.7554/eLife.20983.027
Figure 4—source data 3. Maximum likelihood phylogenetic analysis of trimmed S. saprophyticus alignment with RAxML.
DOI: 10.7554/eLife.20983.028
Figure 4—source data 4. S. saprophyticus plasmid alignment trimmed with trimal.
elife-20983-fig4-data4.fasta (208.8KB, fasta)
DOI: 10.7554/eLife.20983.029
Figure 4—source data 5. Recombinant fragments detected with BratNextGen in S. saprophyticus alignment.
DOI: 10.7554/eLife.20983.030

Figure 4.

Figure 4—figure supplement 1. Ancient DNA damage assessment of S. saprophyticus.

Figure 4—figure supplement 1.

Damage profiles of non-UDG treated (‘nonU’) reads from a pooled NOD1_nonU and NOD2_nonU data set (total of 1,565,548 trimmed reads >24 bp) mapping to S. saprophyticus strain ATCC 15305. Paired end reads were mapped using bwa (Li and Durbin, 2009) with default settings and duplicates were removed with samtools rmdup (Li et al., 2009). Damage profiles were generated using mapDamage2 (Jonsson et al., 2013).

Figure 4—figure supplement 2. Fragment length distribution (FLD) for S. saprophyticus ATCC 15305.

Figure 4—figure supplement 2.

All nodule shotgun libraries (Nod1_1h-UDG, Nod1_1h-nonU, Nod2-UDG, Nod2-nonU) were pooled, reads were restricted to a minimum length of 35 bp and mapping quality of 30 and all duplicates removed both within and between libraries. The fragment length distribution of the remaining 3,904,552 reads was visualized using mapDamage2 (Jonsson et al., 2013).

Figure 4—figure supplement 3. Genome coverage plots for pooled nodule shotgun libraries.

Figure 4—figure supplement 3.

S. saprophyticus (NC_007350), average coverage 298.6X. All reads were restricted to minimum length of 35 bp and minimum map quality 30 with all duplicates removed. Figures depict coverage of the genome in 100 bp blocks across references. Concentric grey circles demarcate increments of 50X coverage in both plots.

Figure 4—figure supplement 4. Neighbor net network of core genomes.

Figure 4—figure supplement 4.

Networks created in SplitsTree v 4 (Huson and Bryant, 2006) of S. saprophyticus. The networks recapitulate the structure of maximum likelihood trees (Figure 4).

Figure 4—figure supplement 5. Presence of mobile genetic elements, virulence genes, and antibiotic resistance in S. saprophyticus.

Figure 4—figure supplement 5.

Novobiocin resistance is conferred by a glycine at position 85 and lysine at position 140 (Vickers et al., 2007), which is present in all S. saprophyticus genomes examined here. SSP1924 and fosB confer streptomycin and fosfomycin resistance, respectively, and are encoded in vSs15305 in the ATCC 15305 reference genome (Kuroda et al., 2005). While none of the other isolates encode the entire genomic island, fosB and SSP1924 are found in isolates from both Clade P and Clade E. The canine isolate (K) harbors SCCmec containing mecA conferring methicillin resistance that has been identified in human clinical isolates of S. saprophyticus (Higashide et al., 2008).

Figure 4—figure supplement 6. Recombinant regions detected by BratNextGen in S. saprophyticus.

Figure 4—figure supplement 6.

Each circle in the figure represents one isolate. Regions with significant evidence for recombination are shown as black or colored blocks. Black ticks mark intervals of 20 kb, and positions are in reference to ATCC15305. 17.9% of the alignment is recombinant in at least one strain. After removing fragments associated with known MGEs, 15.0% of sites are recombinant in the core genome. Isolates are colored according to clade (purple- Clade E, green- bovine, blue- Troy, black- Clade P).

Plasmids similar to S. saprophyticus Troy pSST1 were present in isolates from both clades. The relationships among plasmid sequences from S. saprophyticus Troy and other members of Clade P were distinct from those of the core genome; we also found evidence of recombination among pSST1-like plasmids (Figure 4C, Appendix).

A long branch separates Clade P pSST1 from those of Clade E (Figure 4C), recapitulating their relationship on the core genome phylogeny. This was also true of pSSP2, the only other plasmid we identified in both Clades P and E (but not S. saprophyticus Troy; Figure 4—figure supplement 5). These observations suggest plasmids are more readily exchanged within Clades P and E than between them. This could indicate that Clades P and E are spatially segregated, that there are mechanistic barriers to plasmid exchange between clades, or that epistatic interactions reinforce clade separation of these mobile elements.

The human clinical isolates in Clades P and E are nested within the phylogeny with more divergent lineages associated with other animals. This suggests that the most recent common ancestor (MRCA) of S. saprophyticus may not have been human-associated. This is in stark contrast to the major pathogen in the genus, Staphylococcus aureus, where phylogenetic studies suggest that the MRCA of human and livestock-associated lineages had a human host (Fitzgerald, 2012; Weinert et al., 2012; Shepheard et al., 2013). S. aureus is strongly associated with its niche on the human body and is transmitted primarily from person-to-person. S. saprophyticus, by contrast, appears to be a generalist that colonizes a range of environments.

Several lines of evidence also indicate that humans acquire S. saprophyticus infection from the environment. In northern climates, there is marked seasonal variation in the incidence of S. saprophyticus UTI (Rupp et al., 1992; Hovelius and Mårdh, 1984; Ringertz and Torssander, 1986; Hedman et al., 1993; Widerström et al., 2007), whereas there is no evidence of seasonality in Mediterranean climates (Schneider and Riley, 1996). S. saprophyticus can be identified in environmental samples, with a strong seasonal peak that occurs just before peak rates of S. saprophyticus UTI in northern climates (Hedman et al., 1993; Soge et al., 2009). Molecular epidemiological surveys also suggest S. saprophyticus is primarily acquired from an environmental reservoir, rather than as a result of person-to-person transmission (Widerström et al., 2007; Widerström et al., 2012). These observations suggest that the bacteria cycle between host-associated and environmental stages, with seasonal climatic effects on their abundance in the environment.

The length of the branch leading to S. saprophyticus Troy is similar to those leading to the other tips (Figure 4A), suggesting there is little temporal signal in the phylogeny. Calibrated phylogenetic analyses (Appendix) confirmed the absence of temporal signal, which precludes reliable estimation of the rate of substitution or divergence times for S. saprophyticus.

A mixed environmental, commensal and pathogenic niche may in part explain the absence of temporal structure in our sample of S. saprophyticus. Selection pressures and generation times are likely to differ between free-living and host-associated stages, which can obscure temporal signals in genetic data (Bromham, 2009). In addition to producing rate variability, periods of dormancy in the environment – e.g. during the winter in northern climates, as suggested by seasonal patterns in cultivability – would be predicted to lower the overall rate at which S. saprophyticus evolves (Bromham, 2009; Weinert et al., 2015). The 800 year interval between S. saprophyticus Troy and the other tips may simply be too short relative to the overall depth of the tree to allow reliable rate inference.

Notably, all human-associated isolates of S. saprophyticus in Clade P form a monophyletic group, to which the bovine mastitis strain falls basally; there are no modern human pathogenic representatives of the S. saprophyticus Troy lineage. This may mean that the ecology of S. saprophyticus differed in the Byzantine world, with human infections arising from a different reservoir of bacteria than they do today. S. saprophyticus is readily cultured from the environment around livestock (Hedman et al., 1993; Cherif-Antar et al., 2016), and Byzantine era peasants in Anatolia typically shared their households with cattle (Lefort, 2007). This and other historical settings are likely to have facilitated spillover events and, perhaps, the circulation of bacteria that were adapted to both livestock and humans.

Based on the available data, it is not possible to determine whether the human clinical isolate nested among environmental and food-associated bacteria in Clade E represents a spillover or a second emergence into humans. In either event, it appears that S. saprophyticus can transition to a human pathogenic niche with relative ease. We did not identify any gene content uniquely shared (or absent) among the pathogenic strains in our sample, which suggests that pleiotropy underlies S. saprophyticus’ flexible association with diverse niches. For many bacterial genera, genetic distances between free-living organisms and pathogens are larger than observed here, and pathogen emergence is a singular event characterized by genomic decay and loss of functions required outside the pathogenic niche (Parkhill et al., 2001; Larsson et al., 2009; Reuter et al., 2014). More studies and wider sampling are needed to fully characterize the niche of S. saprophyticus, but our observations reinforce the notion that the adaptive paths to bacterial virulence are more diverse than has previously been appreciated.

Complications of pregnancy and childbirth are major causes of morbidity and mortality worldwide and new threats to maternal health continue to emerge (WHO et al., 2014; Mlakar et al., 2016). Our analyses of the remains of a woman who died in Late Byzantine Troy connect her to this broad historical and epidemiological phenomenon. Her infection was associated with exuberant calcification of the placenta, which replicated maternal, fetal, and bacterial cells in calcium phosphate minerals and preserved a high resolution molecular portrait of their contents. S. saprophyticus, an organism the decedent is likely to have acquired from her environment, and G. vaginalis, a member of the native human biota, are the dominant bacterial species of the infection. S. saprophyticus Troy belongs to a lineage that appears to be uncommonly associated with human disease in the modern world, whereas G. vaginalis Troy nests among modern commensal and pathogenic strains on its phylogeny. This highlights the complexity of virulence as a bacterial trait and a potential role of interactions among bacterial species in shaping pathologic outcomes of infection.

Materials and methods

Samples

Ethics approval for the study of the remains of the individual excavated in 2005 from grave 14 (Troy project, University of Tübingen, bone-sample x24.177) in quadrat x24 at Troy was obtained from Hamilton Health Sciences and McMaster University (REB# 13–146 T). Samples of extant bacteria were provided to investigators without patient identifiers or protected health information; the members of the study team did not have access to any identifiers or protected health information associated with the bacterial isolates. Sediment from the Troy site was imported to and studied at McMaster University in accordance with Canadian Food Inspection Agency guidelines, under permit number P-2012–04220.

Radiocarbon dating

Subsamples of both nodules and the ulna bone were sent to the Keck Carbon Cycle AMS Facility (Earth System Science Department, UC Irvine, CA) for radiocarbon dating (Appendix, Supplementary file 1A). In addition to 14C dating ultrafiltered collagen from the ulna, we also attempted to measure 14C in carbonate from nodule one (with 10–30% leaching) and three organic fractions from nodule two: raw nodule (including carbonate), residue from demineralization with room temperature 1N HCl, and residue from demineralization plus gelatinization with 60°C 0.01N HCl.

X-ray diffraction (XRD)

A subsample of nodule two was subjected to mineralogical analysis using XRD at the Brockhouse Institute for Materials Research (McMaster University) using the Bruker D8 DISCOVER with DAVINCI.DESIGN diffractometer (Figure 2—figure supplement 1). Sample flakes were piled on top of a single crystal silicon wafer, and aligned to the center of the diffractometer using the laser-video alignment system. The detector to sample distance was calibrated with corundum to 20 cm. Four frames were collected to obtain the 2θ range of 8–103°. The frames were integrated into intensity plots using DIFFRAC.EVA V.3.0 (software package from Bruker AXS). A pattern search/match was executed using the integrated ICDD PDF-2 2011 powder database. Slight mismatch in the peak positions are likely due to variation of elemental stoichiometry in the identified phase.

Scanning electron microscopy (SEM) with energy dispersive X-ray spectroscopy (EDS)

A subsample of nodule two was viewed via SEM and subjected to elemental analysis using SEM-EDS at the MAX Diffraction Facility (McMaster University). Sample pieces were attached to an aluminum stub with double-sided carbon tape and sputter-coated with a thin layer of Au. The sample was viewed in a Tescan Vega II LSU operating at 20kV. Energy dispersive spectroscopy (EDS) was carried out with an IMAX detector (Oxford Instruments) and INCA software (Supplementary file 1B).

Ancient DNA extractions

We made multiple DNA extractions from subsamples of two nodules, an ulna, sediment taken from the site and relevant associated blanks/controls. The details of these can all be found in Supplementary file 1J. As the elemental analyses of the nodules suggested a highly mineralized constituent, we extracted them in a similar fashion to bone and they are labelled as such in Supplementary file 1J hyphenated (‘bone’). ‘Bone’ (nodules) DNA extractions, consisted of demineralization (DM), removal and freezing of DM supernatant, incubation of non-demineralized tissue with a custom digestion buffer (DB), removal and freezing of DB supernatant, organic extraction of one/both supernatants, and concentration via filtration. They were performed as follows. For Set A (Supplementary file 1J), multiple consecutive rounds of DM (with 0.5M EDTA) and digestion were performed on a shaker (1000 rpm), with the supernatant(s) from each round subjected to organic extraction and filtration. DB consisted of 20 mM Tris pH 8.0, 0.5% sarcosyl, 250 μg/ml Proteinase K, 5 mM CaCl2, 50 mM DTT, 1% PVP, and 2.5 mM PTB. The breakdown of DM/DB rounds is as follows: round 1 = 1 st 0.5 mL overnight (ON) DM at room temperature (RT), second 0.5 mL DM at RT for 24 hr, and 7 hr digestion at 55°C, rounds 2–5 = ON 1 mL DM + 7 hr digestion at 55°C, and round 6 = ON 1 mL DM only. For subsequent extraction, all round one supernatants were combined and all round 2–6 supernatants were treated separately: supernatants were subjected to organic extraction using half-volume of phenol-chloroform-isopropanol (centrifuged at 16,000 x g for 5m), the aqueous phase of which was extracted with 750 μl chloroform (centrifuged as before). The final aqueous phase was filtered using Amicon Ultra 0.5 mL 10 kDa columns (EMD Millipore Corp., Billerica, MA, USA): columns primed with 450 μl 0.1xTE, followed by sample filtration, washed twice with 450 μl 0.1xTE, and eluted in 50 μl 0.1xTE. For Set B, nodule two, 1 mL DM was performed ON rotating at RT and the supernatant was frozen. Digestion was performed for 7 hr rotating at 55°C using 0.5 mL of DB (same recipe as Set A) and the supernatant was frozen. These supernatants were combined and subjected to organic extraction and filtration as in Set A. Set D, which was the ulna, followed the same protocol as Set B, except they were subjected to an additional initial 30 min demineralization with 500 μl 0.5M EDTA that was not extracted, and the DB did not contain DTT, PVP, or PTB and was performed ON.

Set C, ‘sediment’, DNA extractions were performed using the Mo Bio PowerSoil DNA Isolation Kit (MO BIO Laboratories, Inc., Carlsbad, CA) following the manufacturer’s protocol, with a final elution of 100 μl in 0.1xTE. For each set of samples associated blanks were treated in an identical fashion. Please refer to Supplementary file 1J for details for each of these four sets of extractions.

Library preparation and indexing

Ancient DNA extracts were converted to double-indexed libraries for sequencing on the Illumina platform (list in Supplementary file 1A). Prior to library preparation, all ulna DNA extracts (E5-1, 2, 3, 4, 7, 8, 9, and 10) were pooled to homogenize library input into multiple UDG and non-UDG libraries, as were the two set D extraction blanks (E5-6 and 12). Libraries were prepared as in (Wagner et al., 2014) using either regular (‘non-UDG’) or deaminated cytosine removal (damage repair; ‘UDG’) protocols, and subsequently amplified using a double-indexing protocol (Kircher et al., 2012; Meyer and Kircher, 2010). Each library set included at least one blank no-template (water) control reaction. Extract input volumes into library preparation were 10 μl (L25-L13), 20 μl (L01-L20), or 25 μl (L28-L38, 1 a-1j, and 1a-blk to 1j-blk). In non-UDG libraries L25-L38, the blunt-ending step was modified with an extended (3 hr) T4 PNK incubation prior to adding the T4 polymerase, in the same manner as the UDG protocol. Double-indexing amplification was performed as in (Wagner et al., 2014) for 10 cycles each, with 20 μl non-diluted library template input (except L25 and L13 which were used at 0.1x dilution) and included at least one no-template negative control reaction. All reactions were purified to 15 ul EB with the MinElute PCR Purification Kit (Qiagen, Hilden, Germany).

Enrichment

Two rounds of human mitochondrion targeted enrichment were performed on the non-UDG treated ulna specimen for comparison to the nodule shotgun reads. Prior to enrichment, the 7 Ulna-non-D libraries (L31-L38) were pooled to homogenize input, and 9 ul of this pool was used as input into four enrichment reactions (Ulna-D E07-E10) alongside the extraction blank (E11) and a negative control reaction (enrichment blank). The enrichment reactions were performed as for the human mtDNA enrichments in (Wagner et al., 2014), using the same parameters and custom MYbaits baitset (MYcroarray, Ann Arbor, MI) designed from the rCRS sequence (http://www.ncbi.nlm.nih.gov/nucleotide/113200490 NCBI GenBank accession no. J01415.2), but with the following modifications according to updated manufacturer recommendations: post-hybridization bead-library binding was performed rotating at high temp (55°C), Wash Buffer 1 was eliminated, and the post-washed beads were suspended in 20 μl EB and used directly in the post-enrichment amplification. For the adapter-specific blocking oligos, 2 μM of four P5/P7 adapter sequence custom blocking oligos (corresponding to one strand of each molecule) were used for enrichment round 1, and the manufacturer-supplied Block #3 was used for enrichment round 2.

Amplification after each enrichment round was performed as in (Wagner et al., 2014), with additional re-amplifications as required due to low output molarities (all amplification reactions were purified over MinElute columns to 15 μl EB). Post-round 1 amplification used 15 μl bead mixture directly as input into each 40 μl reaction (15 cycles). 6.5 μl of this purified reaction was used as input into enrichment round 2 (E17-E21), and to increase molarity prior to sequencing, 6.5 μl was used as a template for subsequent re-amplification reactions (12 cycles). Post-round 2 amplification used 10 μl bead mixture as input into two 40 μl reactions per enrichment (15 cycles), and the supernatants from each amplification (two per sample) were purified together. Prior to sequencing, 14 μl of this purified reaction was used as the template for a subsequent re-amplification reaction (16 cycles).

Sequencing and read preparation

All relevant ancient samples (nodules, bone and sediment) and their associated extraction blanks were sequenced. Details on the final data set names, their associated samples/libraries/enrichments, raw reads passing filter, and pre-sequencing pooling schemes can be found in Supplementary file 1K. Prior to sequencing, all additional indexed libraries (shotgun and enrichment) were quantified via a qPCR assay targeting indexed molecules and pooled according to desired sequencing ratio. All pools except Nod1_all (pool ‘K’) were size-selected using an electrophoresis gel size selection procedure (retaining molecules ~125/150/150 to 500 bp in length) in order to exclude as much no-insert adaptimer (and other short adapter artifacts) as possible. Pool ‘F’ also contained 10 additional samples not considered in this paper (pooled at a ratio of ‘1’). Pools were sequenced across three paired-end runs on the HiSeq 1500, all alongside other, unrelated samples: Pool ‘K’ (80 bp final read length), Pool ‘F’ (85 bp final read length), and pools ‘G'-'J’ (90 bp final read length). On the last run, the enrichment round 1 and 2 samples (pools ‘H’ and ‘J’) were separated on two different lanes, since they have the same index sequences.

Metagenome/Microbiome analyses

For the metagenomic analyses and the ancient pathogen genome reconstructions, all data sets were trimmed of library adapter using cutadapt (Martin, 2011) with settings -e 0.16, -O 1, -a AGATCGGAAGAGC (70) and reads <24 bp were removed retaining read order. To obtain metagenomic profiles from our shotgun data sets we used LMAT version 1.2.3 (Ames et al., 2013) to properly identify shotgun reads from the nodules (Nod1_1h-UDG, Nod2-UDG), sediment (Sediment-UDG), ulna (Ulna-UDG), and all metagenomic data sets available from the Human Microbiome Project (HMP, RRID:SCR_012956) database, housed at Lawrence Livermore National Laboratory (June 2015). Reads that could be identified at the sequence level or consistently at the species/strain level from all blank extracts were removed from final files used in the PCA analysis.

The PCAs were performed using prcomp (RRID:SCR_014676) in (R Core Team, 2015). The taxa identifications from the HMP were combined according to the origin of the sample. The number of samples combined into each category is indicated in the legend to Figure 1—figure supplement 8. A small number (1×10−7) was added to those entries with zero reads assigned and then natural logs of the numbers were taken. The PCA was centered and scaled.

Staphylococcus saprophyticus Troy genome reconstruction

Reads were initially processed as described in the previous section. The S. saprophyticus Troy draft genome was reconstructed using iterative assembly to span gaps between contigs that were created from assembly to the S. saprophyticus reference genome (NC_007350).

Trimmed reads from Nod1_all-UDG were paired-end assembled to the S. saprophyticus reference (NC_007350) using BWA (RRID:SCR_010910) with default settings (Li and Durbin, 2009), and duplicates were removed using samtools (RRID:SCR_002105) rmdup (Li et al., 2009). The resulting assembly was imported into Geneious (v.6.1.6, Biomatters, Ltd, RRID:SCR_010519) and a strict (50%) consensus sequence was generated. From this consensus, 65 large contigs (880 bp – 170,993 bp) that corresponded to regions of average coverage (and that did not span rRNA/tRNA regions) were manually extracted, which represented the non-gap regions of the assembly. As gap regions could represent indel regions (e.g., lateral gene transfer events), rearrangements, or divergence, these contigs were subjected to an iterative assembly process using a set of unmapped reads in order to attempt to span gaps and connect the contigs. The primary set of reads used for iterative assembly was generated by trimming 100 bp from each end of the contigs, assembling all original reads using the above settings to this set of trimmed contigs, and removing these assembled reads from consideration. A subset of the unmapped reads (20–100% as required, depending on assembly success) along with the full set of contigs were then subject to iterative assembly using Geneious (v.6.1.6), seeded with the first or last 50 bp of a contig (settings: maximum gaps per read 10%, maximum gap size 2, word length 20, index word length 14, ignore words repeated >8x, maximum mismatches per read 1%, maximum ambiguity 4, map multiple best matches randomly). All non-rRNA-adjacent gaps were successfully spanned, except for the region that was discovered to belong on the plasmid rather than the chromosome.

Gardnerella vaginalis Troy gene reconstruction

Ancient gene sequences were reconstructed de novo, via assembly of a pool of reads that mapped to annotated G. vaginalis. First, CDS annotations were extracted from 34 modern G. vaginalis assemblies (Supplementary file 1I; except for strains 41V and 101) and concatenated into one reference/genome (100 N’s between each CDS). Trimmed paired end reads from Nod1_1h-UDG reads were mapped to the concatenated reference. All paired and unpaired reads that mapped were extracted and subjected to de novo assembly using Velvet 1.2.1 (Zerbino et al., 2008) with settings kmer 23, insert length 51, expected coverage 75, minimum contig length 24, and coverage cutoff auto (parameters for expected coverage were chosen based on previous observation of assembly to strain ATCC 14019). This generated 1207 contigs that were confirmed using blastn (default settings, RRID:SCR_004870) to the nr database (April 2014) to detect any non-G. vaginalis sequences or chimeras generated from low level bacterial species also present in the nodules. Twenty contigs were excluded due to the top hit being S. saprophyticus, leaving a final set of 1187 G. vaginalis contigs (total length 1,435,761 bp). The final set of genomic contigs was annotated using Prokka 1.7 (Seemann, 2014) (RRID:SCR_014732) producing 972 genes. Paired-end assembly of Nod1_1h-UDG reads to the final set of contigs with bowtie2 (Langmead and Salzberg, 2012) and removal of duplicates with samtools rmdup (Li et al., 2009) consists of 2,034,514 readpairs. Total reads mapping to G. vaginalis from paired end-assemblies are listed in Supplementary file 1L.

Bacterial genome coverage and DNA damage estimations

To most conservatively assess the abundance, coverage, and authenticity of our ancient reads, we ran a subset of analyses using slightly more stringent criteria. CASAVA (RRID:SCR_001802) processed reads were trimmed and merged with leeHom (Renaud et al., 2014) (RRID:SCR_002710) using its ancient DNA parameter (--ancientdna). We restricted reads from the UDG-treated shotgun nodule libraries (Nod1_1h-UDG and Nod2-UDG) to those of minimum 35 bp length and blasted all reads against the GenBank nucleotide database retaining only the top hit. For all libraries, we additionally mapped to the S. saprophyticus ATCC 15305 (NC_007350) and G. vaginalis ATCC 14019 (NC_014644) with a customized version of the Burrows-Wheeler Aligner (Li and Durbin, 2009) obtained from https://github.com/mpieva/network-aware-bwa) with a maximum edit distance of 0.01 (-n 0.01), allowing for no more than two gaps (-o 2) and with seeding effectively disabled (-l 16500), retaining only those mapped reads which were merged or properly paired [https://github.com/grenaud/libbam/retrieveMapped_single_and_ProperlyPair.cpp]. Molecules that were less than 35 bp in length, had a mapping quality score of less than 30, or were marked as duplicates based on both 5’ and 3’ coordinates were removed [https://bitbucket.org/ustenzel/biohazard.git]. We then pooled all nodule reads DNA originating from the shotgun DNA libraries (Nod1_1h-UDG, Nod1_1h-nonU, Nod2-UDG, Nod2-nonU) and the S. saprophyticus and G. vaginalis mappings and further removed any duplicated molecules found between libraries (Supplementary file 1L). For both mapping assemblies, the average coverage at each reference position was calculated using the bedtools (Quinlan et al., 2010) (RRID:SCR_006646) genomecov function and then averaged over 100 bp blocks and visualized with Circos (Krzywinski, 2009) (RRID:SCR_011798, Figure 3—figure supplement 3, Figure 4—figure supplement 3). Fragment length distributions for all pooled libraries and damage plots for the non-UDG treated libraries (Nod1_1h-nonU and Nod2-nonU) were calculated through mapDamage2 (Jónsson et al., 2013) (Figure 3—figure supplement 1, Figure 4—figure supplement 1).

Human mitochondrial genome analyses

CASAVA processed reads (see directly above), from enriched ulna extractions (Ulna_Enr1-nonU, Ulna_Enr2-nonU), shotgun reads from the UDG treated nodule extraction (Nod1_1h-UDG), and corresponding extraction blanks (EblkD_Enr1-nonU, EblkD_Enr2-nonU and EblkA-UDG) were processed as described above, but mapped to the rCRS mitochondrial genome (NC_012920) (Andrews et al., 1999). Consensus sequences were called and contamination was estimated using Schmutzi, which implements iterative probability models to infer the endogenous bases given read length and deamination patterns (Renaud et al., 2015). Contamination was estimated at 12% and 13%, respectively, for the round 1 and round 2 enriched ulna libraries. These estimates are consistent with estimates from other aDNA studies (Posth et al., 2016). Contamination could not be confidently assessed from the shotgun nodule library as it had been UDG treated, which obfuscates deamination patterns and thereby lessens the differentiation between endogenous and contaminant molecules. mtDNA consensus sequences were uploaded to the HaploGrep webserver [http://haplogrep.uibk.ac.at/] and haplogroups were determined in reference to Phylotree Build 16 (Kloss-Brandstätter et al., 2011; van Oven and Kayser, 2009) (RRID:SCR_012948) and found to be U3b and U3b3. Haplogroup U3b was assigned to the consensus sequence generated from the first round enrichment of the ulna because there was missing data for polymorphisms diagnostic to the haplogroup U3b3 (Supplementary file 1E). All three consensus sequences shared an additional five private polymorphisms not diagnostic to haplogroup U3b3. 137 sequences assigned to haplogroup U3 were collected from all human complete mtDNA genomes in GenBank (18 June 2015), and aligned along with the Troy consensus sequence generated from the nodule shotgun (Nod1_1h-UDG) library with MUSCLE v3.8 (Edgar, 2004) (RRID:SCR_011812). It was determined that the best model of nucleotide substitute for this group of 138 sequences was HKY+I+Γ using the program jModelTest2 (Darriba et al., 2012) and the built-in Akaike Information Criterion (Akaike, 1974). A Bayesian Maximum Clade Credibility tree was calculated using BEAST v1.8 (Drummond et al., 2012) (RRID:SCR_010228) and TreeAnnotator (Drummond et al., 2007) with the nucleotide data partitioned between coding and non-coding and a strict molecular clock with evolutionary rates of 1.708×10−8 and 9.88 3×10−8 nucleotide substitutions/site/year following Soares et al. (Soares et al., 2009) (Figure 1—figure supplement 6). Damage patterns and fragment length distribution of ancient DNA mapped to mitochondrial genome can be found in Figure 1—figure supplements 34.

Human nuclear genome analyses

Reads from four shotgun libraries (three from the nodules, Nod1_1 hr_UDG, Nod2-UDG, Nod2-nonU; one from the ulna, Ulna-UDG) were mapped and processed as described for the mitochondrial genome above. Additionally, the reads originating from the four nodule libraries were pooled together for comparison (‘Nodule Pooled’). We mapped the libraries to a hard-masked hg38 reference genome downloaded from the UCSC genome browser [http://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/] and recorded the number of reads mapping to chrX, chrY, mitochondrial genome and all autosomes. We first filtered for mapped merged or properly paired reads with a minimum length of 35 bp and a minimum mapping quality filter of 30. Percent coverage was calculated by tallying the number of positions covered by at least one read and dividing by the total genome length with masked regions subtracted. We calculated the coverage depth by summing coverage of all positions and dividing the total by this same masked genome length (Supplementary file 1G).

Extraction and sequencing of modern S. saprophyticus

Fourteen new S. saprophyticus isolates (North America: eight human, one bovine, and one canine; Australia: two human; Japan: two human) were sequenced for this study to provide a broader comparative genomic data set (Supplementary file 1H).

Extraction and sequencing of North American S. saprophyticus

DNA was extracted at the University of Wisconsin-Madison. Isolates were inoculated in TSB and grown overnight at 37°C in a shaking incubator. Cultures were pelleted, resuspended in 140 μL TE, and incubated overnight with 50 units of mutanolysin. DNA was extracted using the MasterPure Gram Positive DNA Purification Kit (EpiCentre). Extracts were prepared for sequencing with the Illumina Nextera XT library preparation kit and sequenced in two different batches: samples 13–31 were pooled in equal ratio with 24 additional unrelated samples and sequenced on a MiSeq platform using a 2×250 kit; samples K, X, and M were sequenced at a desired ratio alongside pools ‘G'-'J’ (described above) as part of the HiSeq 1500 paired-end run (85 bp read length). Average insert sizes (estimated from Agilent Bioanalyzer analysis) are 849 bp (13–31, pooled average), 775 bp for M, 716 bp for X, and 769 bp for K.

Extraction and sequencing of Australian S. saprophyticus

DNA extractions were performed as described above. DNA was submitted to the University of Wisconsin-Madison Biotechnology Center. DNA concentration was verified using the Qubit dsDNA HS Assay Kit (Life Technologies, Grand Island, NY). Samples were prepared according the TruSeq Nano DNA LT Library Prep Kit (Illumina Inc., San Diego, California, USA) with minor modifications. A maximum of 200 ng of each sample was sheared using a Covaris M220 Ultrasonicator (Covaris Inc, Woburn, MA, USA). Sheared samples were size selected for an average insert size of 550 bp using Spri bead based size exclusion. Quality and quantity of the finished libraries were assessed using an Agilent DNA High Sensitivity chip (Agilent Technologies, Santa Clara, CA) and Qubit dsDNA HS Assay Kit, respectively. Libraries were standardized to 2 μM. Paired-end, 150 bp sequencing was performed using v2 SBS chemistry on an Illumina MiSeq sequencer. Images were analyzed using the Illumina Pipeline, version 1.8.2.

Extraction and sequencing of Japanese S. saprophyticus

S. saprophyticus cells were lysed by achromopeptidase (WAKO, Kyoto, Japan), and the genomic DNA was prepared with conventional phenol/chloroform extraction and ethanol precipitation, followed by further purification with QIAGEN genome DNA preparation kit. Library preparation was completed using the Nextera XT DNA Sample Prep Kit (Illumina, San Diego, CA, USA), followed by insert size selection using 1% TAE agarose electrophoresis to obtain an insert of approximately 400 bp. Sequencing was performed on NextSeq 500 (Illumina, San Diego, CA, USA) using the NextSeq 500 v1 kit (300 cycle) with paired-end 150 bp sequencing.

Reference-guided assembly and alignment of S. saprophyticus

Reads for the new modern samples were processed with reference-guided assembly via a pipeline [https://github.com/tracysmith/RGAPepPipe]. For reference guided assembly, read quality was assessed and trimmed with TrimGalore! v 0.4.0 [www.bioinformatics.babraham.ac.uk/projects/trim_galore], a wrapper script for FastQC [www.bioinformatics.babraham.ac.uk/projects/fastqc, RRID:SCR_005539] and cutadapt (Martin, 2011) (RRID:SCR_011841). Reads were mapped to the ATCC 15305 reference genome using BWA-MEM v 0.7.12 (Li, 2013) (RRID:SCR_010910) and bam files sorted using Samtools v 1.2 (Li et al., 2009) (RRID:SCR_002105). Read group information was edited and duplicates removed using Picard v 1.138 [picard.sourceforge.net, RRID:SCR_006525]. Reads were locally realigned using GATK v 2.8.1 (DePristo et al., 2011) (RRID:SCR_001876). Variants were called using Pilon v 1.16 (Walker et al., 2014) (RRID:SCR_014731) with a minimum read depth of 10, minimum mapping quality of 40 and minimum base quality of 20. Whole genome alignment of the Troy strain and de novo assemblies to ATCC 15305 was performed using Mugsy 2.3 (Angiuoli et al., 2011) (RRID:SCR_001414).

De novo assembly and annotation of S. saprophyticus

The draft genome sequences of Japanese isolates were obtained by de novo assembly using CLC genome workbench v8.02 (RRID:SCR_011853). For North American and Australian S. saprophyticus genomes, de novo assembly was performed using the iMetAMOS pipeline (Koren et al., 2014; Treangen et al., 2013) (RRID:SCR_011914). We compared assemblies from SPAdes (Bankevich et al., 2012), MaSurCA (Zimin et al., 2013), and Velvet (Zerbino et al., 2008). KmerGenie (Chikhi and Medvedev, 2014) was used to select kmer sizes for assembly. iMetAMOS uses FastQC [www.bioinformatics.babraham.ac.uk/projects/fastqc] to check read data quality. Assemblies were evaluated using QUAST (Gurevich et al., 2013), REAPR (Hunt et al., 2013), LAP (Ghodsi et al., 2013), ALE (Clark et al., 2013), FreeBayes (Garrison and Marth, 2012), and CGAL (Rahman et al., 2013). Additionally, Kraken (Wood et al., 2014) was run to detect potential contamination in sequence data. The SPAdes assembly was identified as best for isolates 13, 16, 19, 41, 42, 43, K, M, X, and 129. The MaSurCA assembly was identified as best for isolates 14, 15, 18, and 31. Genomes were annotated using Prokka 1.7 (Seemann, 2014) (RRID:SCR_014732). OrthoMCL v2.0.9 (Li et al., 2003) (RRID:SCR_007839) was used to find orthologous genes in these genomes.

Core genome alignment of G. vaginalis

Genomes were annotated with Prokka 1.7 (Seemann, 2014) (RRID:SCR_014732). OrthoMCL v2.0.9 (RRID:SCR_007839) grouped genes into orthologous groups (Li et al., 2003). Genes were filtered to include only genes present in one copy in every genome. Individual genes (n = 537) were aligned with TranslatorX (RRID:SCR_014733) and MAFFT v7.130b (Abascal et al., 2010; Katoh and Standley, 2014) (RRID:SCR_011811) and concatenated [https://github.com/tatumdmortimer/core-genome-alignment].

Phylogenetic analyses

Maximum likelihood phylogenetic trees were inferred using RAxML 8.0.6 (Stamatakis, 2014) (RRID:SCR_006086). Bootstrap replicates (number determined by autoMR convergence criteria) were applied to the tree with the highest likelihood of twenty using the GTRGAMMA substitution model. We used SplitsTree4 (Huson and Bryant, 2006) (RRID:SCR_014734) to create networks of pSST1, the chromosome in S. saprophyticus, and the core genome of G. vaginalis.

Recombination

Recombination in a whole genome alignment of S. saprophyticus isolates and a core genome alignment of modern G. vaginalis isolates was assessed using BratNextGen (Marttinen, 2012). For both S. saprophyticus and G. vaginalis analyses, one hundred permutations were performed to calculate the significance (p<0.05) of recombinant fragments (plots created with Circos [Krzywinski, 2009]). Recombination was also measured for the pSST1 plasmid alignment using Phi (Bruen et al., 2006), Max χ2 (Smith, 1992), and NSS (Jakobsen and Easteal, 1996) implemented in PhiPack; results of these tests were all significant with p-values of 5×10−15, 0, and 0, respectively.

Variant annotation

We used SNP-sites (Page et al., 2016) to convert the alignment of S. saprophyticus isolates to a multi-sample VCF. SnpEff (Cingolani, 2012) (RRID:SCR_005191) was used to annotate variants in this VCF as synonymous, non-synonymous, or intergenic.

Analysis of temporal structure

To determine whether there was sufficient temporal structure in the S. saprophyticus phylogeny to estimate evolutionary rates, we performed a regression of root-to-tip genetic distances against year of sampling using TempEst v 1.4 (Rambaut et al., 2016). We also attempted to estimate evolutionary rates using BEAST v1.8 (Drummond et al., 2012) (RRID:SCR_010228). Results of both these analyses (Appendix) revealed a lack of temporal structure such that rate (and date) estimates are unreliable.

Analysis of population structure

We used a Bayesian tree sampling method implemented in BaTS (vBETA2) (Parker et al., 2008) to determine the significance of phylogenetic clustering and population structure in our S. saprophyticus data. A distribution of phylogenies was generated using BEAST v1.8 (Drummond et al., 2012) (RRID:SCR_010228) with GTR+Γ substitution model, a strict molecular clock, and a constant population size. Markov chains were run in duplicate for 10 million generations each with sampling every 1000 generations, and the first 1 million generations were discarded as burn-in.

Acknowledgements

We thank the Troia Project archaeological team, especially Gebhard Bieg, for providing insights and images. We thank Dr. Ernst Pernicka, who authorized release of the material for study, and the General Directorate of Museums and Monuments in the Ministry of Culture and Tourism of the Republic of Turkey for permissions to conduct research in Turkey and take samples. We also thank Kurt Reed, Pam Ruegg, Scott Weese, Vitali Sintchenko, and Amanda Harrington for providing S. saprophyticus isolates. The authors would like to acknowledge the efforts of Tracy Smith and Mary O’Neill at University of Wisconsin-Madison for growth, extraction, and library preparation of S. saprophyticus isolates. We also thank the University of Wisconsin Biotechnology Center DNA Sequencing Facility for providing library preparation and sequencing facilities and services. Victoria Jarvis of the Brockhouse Institute for Materials Research and Henry Schwarz of the Department of Geology (McMaster University) helped with XRD, and Marcia Reid of the Electron Microscopy Facility (McMaster University) provided assistance with TEM/SEM-EDS analyses. We acknowledge helpful feedback from the members of the McMaster Ancient DNA Centre and the Pepperell Lab in our ongoing studies of ancient and extant pathogen genomes.

Appendix 1

Ancient data generation and analyses

Archaeological context

Of approximately 200 total burials found in controlled excavations at Troy between 1988 and 2012, 102 were found in three separate Late Byzantine cemeteries around the periphery of the ancient city (Kiesewetter, 2014; Jablonka, 2006). One of these (Figure 1—figure supplement 1), in quadrat x24, excavated in 2005, yielded 30 individuals from sixteen graves, including the woman described in the main text (Grave 14, context x24.177). The cemetery was dated to the early thirteenth century A.D., based on stratigraphy, topographical situation and relationship to other monuments at Troy, grave construction techniques and mortuary practices, pottery found in and around the graves, and comparison to other Late Byzantine graveyards at Troy (Kiesewetter, 2014; Jablonka, 2006).

Description of osteological materials Grave 14, Ind. I, Beh.177

In grave 14 (Troy, area x24) bones from one single individual (Beh.177) were unearthed. According to the overall appearance and morphology, the bones were fairly well preserved with minor signs of degradation. The skeleton was found in situ when excavated. The deceased was lying in a supine position with crossed arms, skull in the northwest facing southeast (Figure 1A). From the skull only the calvaria (cranial vault) was well preserved, the facial part including the maxilla was missing, and from the mandible only the left portion was present. No teeth were found, which is also due to ante-mortem tooth loss observed on the mandible. Both scapulae were present but fragmented. The right clavicle was missing; the left clavicle was preserved, but distal and proximal ends were missing. All long bones were represented; the proximal half of the right humerus was missing, right radius and ulna were complete, the left arm was almost complete, both femora and tibiae were almost complete, and from the fibulae only fragments remained (Figure 1—figure supplement 9). Ribs and vertebrae were fragmented, and some thoracic vertebrae were missing. The left pelvis was well preserved, while the right pelvis was very brittle and fragmentary. Metacarpals and metatarsals were almost completely preserved while other hand and foot bones were present but fragmented.

The recording of sex differences and age changes as well as the sex and age estimation followed the scheme recommended by the Standards for Data Collection from Human Skeletal Remains (Buikstra and Ubelaker, 1994).

Sex estimation

Sex estimation using the pelvic bones

The ventral arc and the shape of the greater sciatic notch of the pelvis demonstrated female features as did the subpubic concavity, the ischiopubic ramus ridge, the preauricular sulcus, and the composite arch of the pelvis (Buikstra and Ubelaker, 1994; Bruzek, 2002).

Sex estimation according to cranial morphology

The skull revealed slight supra-orbital ridges, sharp orbital borders, minimal expression of the rugosity of the nuchal crest, and a small mastoid process. All these features indicate a female sex of the skeleton. Only the glabella demonstrated an ambiguous feature (Buikstra and Ubelaker, 1994).

Age estimation

Age estimation for adult crania

Age estimation was made using the degree of cranial suture closure (Meindl and Lovejoy, 1985). The spheno-occipital synchondrosis was completely fused. The sagittal suture at the part of the obelion demonstrated significant closure, other cranial sutures showed no or minimal closure suggesting that the skeleton belonged to a 'young adult' no older than 35 years.

Age estimation of the postcranial skeleton

Age estimation was made using the degree of epiphyseal fusion (Scheuer, 2000). The epiphyses of all long bones were completely fused indicating an adult age over 20 years. The complete epiphyseal fusion of all parts of the spinal column suggests an age over 25 years (Scheuer, 2000).

The cranial and postcranial examination together indicate that the skeletal remains are most likely from a female individual whose age-at-death would have been between 25 and 35 years. The appearance of the surfaces of the joints and the pubic symphyseal face support this age estimation.

Pathological findings

The skull showed signs of slight orbital pitting (cribra orbitalia). Only the left mandible was preserved and all teeth were lost ante-mortem. Lumbar vertebrae 3 to 5 show slight to medium vertebral degeneration with osteophytic lipping of approximately 2–5 cm. The left ulna showed signs of a well-healed fracture of the styloid process. The left ulna and radius also revealed signs of severe osteomyelitis.

Estimated height

159 cm

Max femur length

397 cm

Max tibia length

ca. 315 cm

Max ulna length

231 cm

Radiocarbon dating

High molecular weight (>30 kDa) ultrafiltered collagen extracted from the ulna (sample no x24.177) following (Beaumont et al., 2010) and graphitized and measured using standard AMS techniques returned a 14C age of 860 ± 20 BP (UCIAMS# 131407). The corresponding calibrated date is 1154–1224 AD (0.98 probability distribution at two sigma) which is in agreement with the archaeologically established date for the quadrat x24 Troy burials in the 13th century AD (Kiesewetter, 2014Jablonka, 2006). The collagen yield of 4.5% and the carbon and nitrogen content (16.0% N, 46.1% C, atomic C/N ratio 3.3) are typical of results from well preserved bone, and the stable isotope values (13C = −19.6‰, 15N = + 8.2‰) suggest an individual whose dietary protein was largely derived from terrestrial plant sources. The diversity of 14C dates (460–1130 yrs BP, Supplementary file 1A) from the carbonate and organic fractions of the nodules (for example, the extremely young date on the 10% leached carbonate sample) is suggestive of possible ground water leaching and exchange. We made an additional attempt to date organic residues from demineralized nodule one, using base treatment (0.1N NaOH) to remove possible soil carbon contamination. This resulted in a very old date of 4680 yr BP, possibly due to the presence of residual refractory soil carbon (humin). Given these discrepant ages, the results for the nodules must be viewed with caution, in contrast to that of the bone.

Bacterial mineralization and biofilm formation

Soft tissue mineralization, enabling the long-term morphological preservation of cellular details, is not uncommon in the fossil record and often involves the replication of organic constituents in calcium phosphate (apatite) (Briggs and Kear, 1993; Briggs, 2003a, 2003b).

This process is controlled by the concentration of available phosphate and by pH, which must be at levels that inhibit the formation of calcium carbonate (Briggs and Wilby, 1996; Briggs and Kear, 1994). The role of commensal and environmental bacteria in the decomposition of soft tissues is becoming increasingly clear (Briggs, 2003a; Metcalf et al., 2016). Despite their ubiquity, however, bacteria themselves decay rapidly and are only occasionally mineralized in fossils, where they may be preserved in association with soft tissues (within-host environments) (Briggs et al., 1997, 2005). Melanosomes, which are commonly fossilized in association with colored structures such as feathers (Vinther et al., 2008), were originally mistaken for fossilized bacteria (Wuttke, 1983), an interpretation that persists in some cases (Moyer et al., 2014), but melanin, in contrast to bacteria, is very resistant to degradation (Vinther, 2016). Soft tissue fossilization relies on microbial activity to release mineral-forming ions and establish geochemical gradients, and microbial films (biofilms) may control diffusion (Briggs, 2003a). Biofilms, which are composed of spatially organized bacterial cells attached to a common surface and embedded within a matrix of extracellular polymeric substances (EPS - a glycocalyx secreted by the cells), rarely preserve cellular details, but individual microbial morphologies, such as coccoids, may become mineralized (Briggs, 2003b; Briggs et al., 1997).

Scanning Electron Micrograph (SEM) images of the Troy nodules taken at various magnifications (Figure 2) appear to show bacterial cells (and likely biofilms) within a labyrinthine structure reminiscent of the complex arrangement of overlapping villi visualized in SEM of the placenta (Hafez and Kenemans, 2012). We interpret the coccoid structures evident in the images (Figure 2, red arrows) as the superficially mineralized remains of bacteria. The rounded shape and clustering (without chain formation) are typical of Staphylococci, which measure between 0.5 and 1.0 microns in diameter. The size of the cocci in the electron micrographs (Figure 2) vary between 0.5 to 3.5 microns; the range in size may represent cells with varying degrees of biomineralization (i.e. precipitation of apatite and carbonate on the cell wall) thereby increasing their overall dimensions. The structures in Figure 2 (yellow arrows) have an appearance typical of EPS, suggesting that the Staphylococci were organized into a biofilm during active infection. Given the likely origin of the nodules studied here (see below), the presence of biofilms is not surprising. Biofilms are commonly associated with chronic infections and have been found in many tissues, including within the amniotic cavity (Romero et al., 2008). Larger irregular structures (Figure 2—figure supplement 2, blue arrows) more than five microns in size, are likely to represent neutrophils, which would be expected at the site of an active infection. The nodule appears to capture a ‘Kodak’ moment, with the cellular combatants of severe infection frozen in time by replication in minerals.

Calcified placental abscesses vs urinary calculi

The XRD analysis of ‘Troy’ nodule one (Figure 2—figure supplement 1) shows it to be predominantly composed of two phosphate phases, hydroxlyapatite (bioapatite - Ca5(PO4)3(OH)) and whitlockite (Ca3(PO4)2) as well as small amounts of calcium carbonate (calcite- CaCO3), both of which have been found in pathological calcified concretions (Lagier and Baud, 2003). Early morphological assessment of the nodules (multistratified concentric layering, overall shape and size) suggested possible renal or bladder calculi. Urinary calculi (kidney/bladder stones) are mineral aggregates formed under a variety of different physiological conditions (Balaji and Menon, 1997) and composed predominantly of whewellite, weddellite (two forms of calcium oxalate (CaC2O4H2O)), struvite (magnesium ammonium phosphate - Mg(NH4)(PO4)6H2O) and hydroxylapatite (bioapatite - Ca5(PO4)3(OH)) with other, minor mineral components (Supplementary file 1C,D) (Pak, 1987). Supplementary file 1D compares the relative frequency of these mineral components (individually and in combination) found within kidney and bladder stones in modern populations sampled from the US, UK and Germany, as well as material identified as possible stones from archaeological excavations. Most modern and ancient kidney stones (Supplementary file 1D) are composed predominantly of calcium oxalate, struvite and apatite, and in a few cases small amounts of ammonium acid urate and calcium carbonate (Steinbock, 1989; Giuffra et al., 2008; D’alessio, 2005; Giuffra, 2011). Ancient bladder stones have also been shown to be comprised of calcium oxalate, struvite and calcium carbonate. Interestingly a few large stones identified in archaeological material (Brothwell and Sandison, 1967; Piperno, 1976; Szalai and Jávor, 1987; Hawass and Brock, 2003; Anderson, 2003; Özdemir et al., 2015) have a predominantly apatite or carbonate-phosphate signature similar to the Troy nodules (Supplementary file 1D). While most earlier works (1967–1987) lack specific elemental details, precluding a more thorough comparison, the recent analysis of an ancient bladder stone from Oluz Höyük showed it to be composed almost exclusively of calcium phosphate (Özdemir et al., 2015). However, without further molecular/metagenomic characterization, it is difficult to pinpoint the tissue of origin of these ancient stones, or to compare them with the nodules studied herein. Notwithstanding the small sample size, the absence of calcium oxalate and struvite – as seen in XRD and SEM-EDS analyses of the Troy nodules (average 42.3% oxygen, 42.1% calcium, 7.53% phosphate, and 3.18% magnesium, by weight; Supplementary file 1B) – argues against a diagnosis of kidney or bladder stones.

The retrieval of DNA stemming predominantly from S. saprophyticus and G. vaginalis within the nodules suggests a urogenital origin for the calculi, although these bacteria do not pinpoint the tissue of origin (i.e. tubo-ovarian abscess, endometrium, cervix, vagina, or renal tract). While both S. saprophyticus and G. vaginalis can be found in the vaginal flora (Higashide et al., 2008; Harwich et al., 2010), S. saprophyticus is primarily associated with urinary disease (UTI) and both genera may occur in the female urinary microbiome (Lewis et al., 2013; Fouts et al., 2012). S. saprophyticus infection has been directly associated with urinary calculi in humans (Fowler, 1985) and rats (Gatermann et al., 1989) although these calculi are comprised predominantly (~52%) of struvite (magnesium ammonium phosphate) and apatite (~38%) in humans, and almost exclusively struvite in rats. Rates of stone disease are typically higher in men than in women (2.7:1, 2006) except in cases of infection-associated calculi, where they are higher in women (3:1) (although again these stones are composed almost entirely of struvite, magnesium ammonium urate, newberyite and ammonium hydrogen urate and not apatite) (Knoll et al., 2011).

Dystrophic or ectopic calcifications of various tissues (resulting in calculi, similar in shape and size to urinary stones) can occur as a result of infectious disease, injury or aging (Giachelli, 1999; Ronchetti et al., 2013). Calcification of the placenta in particular is common, and is associated with both normal pregnancies and a range of pathological conditions. Placental calcifications occur in most pregnancies after week 33 (>50%) and are universal in post-date pregnancies (Tindall and Scott, 1965; Spirt et al., 1982). Chorioamnionitis, a potentially fatal infection in which urogenital bacteria gain access to the amniotic membranes and placenta (Redline and Frcpa, 2007), can also result in marked tissue calcification. When chorioamnionitis progresses to involve the umbilical vessels (funisitis), for example, concentric rings of calcification are commonly visible to the naked eye and umbilical cord mineralization may be so pronounced that the cord cannot be clamped (Kraus et al., 2004; Benirschke et al., 2006). Based on the propensity of placental tissues to undergo dystrophic calcification, the elemental composition of the Troy nodules, their appearance under SEM, the urogenital bacteria recovered, as well as the age and sex of the decedent, chorioamnionitis is a probable underlying diagnosis. Reproductive-associated infections can cause premature labor and maternal and fetal sepsis (Redline and Frcpa, 2007). These and other obstetric complications were undoubtedly major factors in the mortality of women throughout history (Sayer and Dickinson, 2013), as they remain today (WHO et al., 2014). Archaeological palaeodemographic findings from Late Byzantine Troy reveal low overall life expectancies – 32 for females, 39 for males – and a higher mortality rate for females of reproductive age (Kiesewetter, 2014), emphasizing the importance of pregnancy complications as a cause of death among women in this setting.

Urolithiasis is an alternative diagnosis for the nodules. We favor chorioamnionitis for the following reasons. First, while urolithiasis can develop at any age, young women have the lowest rates of disease across a range of settings (Romero et al., 2010). By contrast, paleodemographic data implicate pregnancy complications as the leading cause of death among women in the Late Byzantine world (Bourbou, 2010). Urinary stones commonly harbor viable bacteria (Tavichakorntrakool et al., 2012), and urinary tract infection with S. saprophyticus and other urease-producing bacteria may lead to stone formation (Fowler and Jackson, 1985). Elemental analyses of the nodules were not, however, typical of either infection-related (i.e. struvite) or other types of urinary stone. The abundance of human DNA in the nodules also argues for their origin in human tissue, as opposed to inorganic stone material with entrapped human cells. Lastly, a diagnosis of urinary stones does not explain the endogenous Y chromosome in the nodules.

Authenticity of ancient chrY reads and attempts to sex the remains

To ensure to the greatest extent possible that reads mapping to Y chromosome were endogenous, we looked at the FLD mean and median of the hard masked, pooled, mapped fragments as well as their damage profiles (from all four nodule libraries). The median for reads (>35 bp, Qm = 30) mapping to the X, Y and autosomes were 45, 44 and 45 bp respectively and thus nearly identical suggesting that they stem from the same pool of damaged human endogenous DNAs found within the abscess. The damage profiles for these same reads, restricted to the two non-UDG treated shotgun libraries) clearly shows damage in those mapping to the X and the autosomes. While there is more noise in the Y, due to overall fewer reads present, they still have a decided increase in C to T and G to A transitions at terminal positions, again suggestive of their authenticity (Figure 1—figure supplement 5). We used a Fisher’s exact test to assess whether the number of unique reads mapping to the Y chromosome was significantly greater in the nodule than the ulna or sediment samples, despite unequal sequencing depths (893/33262468 nodule vs 3/4558127 bone). We found the nodule to have significantly more Y-mapped reads than either the ulna and sediment samples (p<2.2E-16 for both tests). There was no significant difference in the proportion of Y-mapped reads between the ulna and sediment samples. We attempted to sex the remains (i.e calcified nodules, ulna and their associated blanks as well as the surrounding sediments and blanks) using the method developed and published by Skoglund et al. (2013). Our input consisted of all reads >35 bp, mapping to the hard masked version of hg38 with map quality >30. As can be seen (Supplementary file 1F) all libraries derived from nodule DNA extractions are consistently assigned as female (XX). It is worth noting that in addition to this assignment (XX), all libraries also show the consistent presence of Y reads. The sediment has no assigned sex and the ulna is ‘consistent with XX’ but read numbers are low (n = 22). Importantly, blanks, also with low read counts (n = 1–35) are labelled as ‘consistent with XX’ with 95% confidence intervals (at those levels) of 0. Considering (a) that the skeleton is biologically female and of child-bearing age, (b) that the nodule and ulna share the same mtDNA haplotype, and (c) the relative abundance of X chromosome reads in both nodules, we interpret the significantly higher Y-mapped reads in the nodule over the ulna as evidence to support that the human DNA recovered from the nodule originated from that of two individuals, a female and her male fetus.

Staphylococcus saprophyticus Troy genome

Resolving duplicated/dual-motif regions

SSP0354-0356. There is an apparent divergent paralogous/duplicated region of NC_007350 genes SSP0354 (ispD; 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase), SSP0355 (zinc-binding dehydrogenase) and SSP0356 (glycosyl glycerophosphate transferase involved in teichoic acid biosynthesis) inside of the novel GI. The duplicated region precluded straightforward iterative or de novo assembly due to challenges in resolving which version was found in the same location as the reference and which version was found in the novel region. The challenges arose from the partially conserved, partially highly divergent nature of the duplicated region (which is problematic for both reference-guided and de novo assembly) as well as the short reads precluding the spanning of SNPs (haplotypes) by long paired-end reads. Proper gene sequences were established via a combination of (a) manually working inwards from the shared flanking regions with liberal assembly to manually determine correct sequence and (b) iterative assembly to de novo reconstruct highly divergent sections. For the majority of positions, the ancient versions of SSP0354-56 do match the reference genome and the novel copies are the 'unmatching' positions; however, in several instances, the reads support that the 'reference matching' positions are in fact on the paralogous copies instead of the homologous ancient SSP0354-56 genes. SSP0117, SSP0118, and SSP0788. As for SSP0354-0356 and their paralogs discussed above, the sequences of the Troy versions of the SSP0117, 0118, and 0788 paralogous hypothetical proteins were obtained through a combination of manually working inwards from the shared flanking regions with liberal assembly to manually determine correct sequence and iterative assembly to de novo reconstruct highly divergent sections.

S. saprophyticus Troy genome verification

All potential reads were assembled to a penultimate draft version of the genome (a combined product of reference-guided assembly consensus contigs, iterative de novo assembly, and manual curation as noted). Ambiguous positions (from the contig and iterative assembly consensus sequences) or positions heterogeneous in the reads (>40%) were resolved with the reference base (if that motif was present in the reads >40%). All Nod1_1h-UDG reads were assembled to the final genome draft using paired-end bowtie2 assembly (Langmead and Salzberg, 2012) with default settings and samtools rmdup (Li et al., 2009), and all positions were confirmed with at least 35x unique coverage. The genome was annotated using Prokka v 1.7 (Seemann, 2014). Total reads mapping to S. saprophyticus from paired end-assemblies are listed in Supplementary file 1L.

S. saprophyticus Troy unresolved regions

There were three repetitive regions of the genome that were unable to be reliably resolved using either reference guided or iterative assembly due to the length and sequence structure of the region, and were replaced by 100 N’s in the final genome sequence. These regions correspond to strain ATCC15305 positions 1810158–1810757 in SSP1741, 153694–158013 in SSP0135, and 725232–725331 between SSP0690 and SSP0691.

Regarding rRNA regions, no attempt was made to reconstruct or span these regions with iterative assembly, as the multiple copies could not be assembled accurately with short reads. All rRNA regions (genes and any flanking tRNAs) plus 200 bp of flanking sequence from the ends of the rRNA genes have been replaced by 100 N's in the final ancient genome sequence (corresponding to strain 15035 positions 743525–749373, 841186–847217, 959438–967218, 2349930–2355284, and 2301048–2312686). Any tRNAs that were not flanked by rRNA genes were included in the draft genome. The iterative assembly process established the presence of at least the same number of bordering RNA regions as in the reference sequence, although it cannot be determined whether there is any rearrangement of the intervening regions in the Troy strain versus the reference.

S. saprophyticus Troy plasmid, pSST1

A partially-novel plasmid was discovered during the assembly process and closed using iterative assembly. Large portions of this pSST1 plasmid have homology to the pSSAP1 (strain MS1146) and pSSP1 (strain ATCC 15305) plasmids, and a portion of the ATCC 15305 genome (similar to pSSAP1) containing repA, alcohol dehydrogenase, and dehydrogenase genes, which is missing in the Troy chromosomal genome (but located on the plasmid). The plasmid was annotated using Prokka v 1.7 (Seemann, 2014).

The S. saprophyticus Troy blaPC1 gene encodes a functional β-lactamase

The blaPC1 gene was submitted to SignalP (Petersen et al., 2011) software, which predicted an N-terminal signal peptide with a cleavage site between A32 and K33. The DNA sequence encoding mature PC1 enzyme was optimized and synthesized by IDT (Coralville, IA) before cloning into the pET-28b vector for overexpression. Escherichia coli BL21 chemically competent cells were transformed with pET-28b(PC1) and inoculated into LB medium containing 50 µg/mL kanamycin and grown at 37°C. Protein expression was induced with 1 mM IPTG at OD600 0.7 and cultures were incubated overnight at 16°C. Cells were harvested by centrifugation and cell paste from 1 L of culture expressing β-lactamase was washed with 8 mL 0.85% NaCl, resuspended in buffer containing 50 mM HEPES pH 7.5, 350 mM NaCl, and 20 mM imidazole, and then lysed by cell disruption at 20,000 PSI. Lysate was centrifuged using a Beckman JA 25.50 rotor at 20 000 RPM (48 254 x g) for 45 min at 4°C. The supernatant was applied to a 5 mL HiTrap Ni-NTA column (GE Lifesciences) at a constant flow rate of 3 mL/min. The column was washed with five column volumes of the same buffer and step gradients of increasing imidazole were used for wash and elution steps. Fractions containing purified β-lactamase, based on SDS-PAGE, were pooled and dialyzed overnight at 4°C in buffer containing 50 mM HEPES pH 7.5, 150 mM NaCl, and 20% glycerol. PC1 was determined to be >90% pure as assessed by SDS-PAGE and stored at −20°C.

Purified PC1 enzyme was used to determine kinetic parameters for nitrocefin hydrolysis (Supplementary file 1M). Nitrocefin was synthesized as previously reported (Lee et al., 2005). Enzyme (final concentration 1 nM) was added to nitrocefin in serial half dilutions (highest final concentration 320 µM) in 50 mM HEPES pH 7.5 at 30°C after a 5 s mix time. Nitrocefin hydrolysis was monitored at 490 nm and rates used to determine the kinetic parameters, Km and kcat. All enzyme dilutions were done in 100 ng/µL bovine serum albumin and nitrocefin dilutions in 0.01% Tween 20.

Modern comparative data generation and analyses

Patterns of gene content and variant sharing among S. saprophyticus associated with distinct niches

We determined the presence and absence of known mobile genetic elements, virulence genes, and genes conferring antibiotic resistance (Figure 4—figure supplement 5). To gain further insights into emergence of pathogenicity, we searched for genes and genetic variants that are unique to pathogenic strains of S. saprophyticus. Non-synonymous variants shared between Clade P and the human UTI isolate S. saprophyticus 55 are listed in Supplementary file 1O.

Shared gene content was explored using annotated, de novo assemblies of S. saprophyticus isolates. Scripts used to automate OrthoMCL (Li et al., 2003) analyses and compare gene content are available at https://github.com/tatumdmortimer/core-genome-alignment. There are 13 genes common to Clade P isolates that are absent from Clade E (Supplementary file 1N). However, these genes are not present in the human UTI isolate S. saprophyticus 55. Further studies are needed to investigate a possible role of these genes and variants in adaptation to the pathogenic niche.

Analysis of temporal structure

No significant correlation was observed (R2 = 0.037) in the regression of root-to-tip genetic distances against year of sampling using TempEst v 1.4 (Rambaut et al., 2016), thereby indicating that there is no temporal signal in these data, which precludes molecular clock dating analysis. Similarly, rate estimates using BEAST v 1.8 (Drummond et al., 2012) revealed extensive variation (95% highest posterior density: 1.4×10−10–2.9×10−7 substitutions/site/year with General Time Reversible with gamma distributed site variation substitution model, lognormal relaxed clock with an exponential rate distribution, and Bayesian Skyline Plot demographic model) such that all rate estimates are necessarily unreliable.

S. saprophyticus population structure

BaTS implements three statistics to test phylogenetic-trait correlation: the Fitch parsimony score (PS) (Fitch, 1971), the Association Index (AI) (Wang et al., 2001), and the maximum clade size (MC) (Parker et al., 2008). BaTS randomizes the taxon-trait values and uses the distribution of trees to determine null distributions of these statistics. The significance of the observed distribution of these statistics, calculated across the distribution of trees, is then determined in comparison to the null distributions. For our data, samples were coded as having either ‘pathogenic’ (n = 20) or non-‘pathogenic’ (n = 6) traits. We found that the observed distributions were significantly different from null expectations for all tests (PS: observed mean = 2.0, null mean = 5.51, p<0.001; AI: observed mean = 0.265, null mean = 1.127, p=0.007; MC of pathogenic trait: observed mean = 19.0, null mean = 5.1, p<0.001). These results indicate that pathogenic and non-pathogenic isolates in our sample are extremely structured by lineage.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • Canada Research Chairs to Hendrik N Poinar.

  • Natural Sciences and Engineering Research Council of Canada to Hendrik N Poinar.

  • National Institutes of Health National Research Service Award, T32 GM07215 to Tatum D Mortimer.

  • National Science Foundation Graduate Research Fellowship Program, DGE-1256259 to Tatum D Mortimer.

  • National Institutes of Health R01AI113287 to Caitlin S Pepperell.

  • McMaster University Michael G. DeGroote Institute for Infectious Disease Research (IIDR) to Hendrik N Poinar.

  • University of Wisconsin-Madison Graduate School to William Aylward, Caitlin S Pepperell.

  • Wisconsin Alumni Research Foundation to William Aylward, Caitlin S. Pepperell.

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

AMD, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

TDM, Conception and design, Analysis and interpretation of data, Drafting or revising the article.

AK, Analysis and interpretation of data, Drafting or revising the article.

HK, Acquisition of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

JME, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

GBG, Analysis and interpretation of data, Drafting or revising the article.

JS, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

MKuc, Acquisition of data, Drafting or revising the article.

ATD, Analysis and interpretation of data, Drafting or revising the article.

WA, Acquisition of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

SNG, Analysis and interpretation of data, Drafting or revising the article.

JEA, Analysis and interpretation of data, Drafting or revising the article.

AMK, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

GW, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

MKur, Acquisition of data, Drafting or revising the article.

KK, Acquisition of data, Drafting or revising the article.

DEGB, Analysis and interpretation of data, Drafting or revising the article.

GF, Analysis and interpretation of data, Drafting or revising the article.

ECH, Analysis and interpretation of data, Drafting or revising the article.

HNP, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

CSP, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

Ethics

Human subjects: Ethics approval for the study of the remains of the individual excavated in 2005 from grave 14 (Troy project, University of Tubingen, bone-sample x24.177) in quadrat x24 at Troy was obtained from Hamilton Health Sciences and McMaster University (REB# 13-146-T). Samples of extant bacteria were provided to investigators without patient identifiers or protected health information; the members of the study team did not have access to any identifiers or protected health information associated with the bacterial isolates.

Additional files

Supplementary file 1.

(A) Troy sample details (B) SEM-EDS results from nodule. For each replicate, upper value is weight %, lower value is atomic %. (C) Common chemical constituents of renal and bladder calculi (kidney and bladder stones) and Troy nodules. + - presence, ND- not detected, Unk- unknown, RF- Relative Frequency in modern populations (C.Y.C Pak (ed.) Pak [1987], Martinus Nijhoff Publishing, Boston). (D) Relative frequency of admixed calculi (kidney and bladder stones) in modern (grey shading) populations and archaeological findings (modified from Pak, 1987). Acronyms correspond to Table C. References cited herein. *1 Main constituent of bladder stone listed as ammonium acid urate and oxalate. *2 Majority of calculus determined to be calcium carbonate (calcite). Where not specifically given, relative proportions of all mineral components were estimated. *3 Minor constituents of calcium carbonate. *4 No other (minor) elements were provided thus composition was assumed to be 100%. ND – not determined/detected. (E) Mitochondrial results summary; unique reads mapped to the revised Cambridge Reference Sequence with minimum length of 35 bp and minimum mapping quality of 30. (F) Sex identification of all libraries after Skoglund et al. (2013). Libraries were mapped to a hard masked version of hg38 and restricted to reads of minimum length 35 bp and minimum mapping quality of 30. (G) Summary of unique shotgun reads (from nodules, ulna, sediment and associated blanks) of minimum length 35 bp and minimum mapping quality of 30, mapping to chromosome X, Y, autosomes and mitochondrion of the hard masked hg38. Nodule pooled =Nod2-UDG + Nod1. (H) Staphylococcus saprophyticus modern strains. (I) Gardnerella vaginalis modern strains. * F = full, S = scaffold; ** BWH = Brigham and Women’s Hospital, HMP = Human Microbiome Project, MWH = Magee-Womens Hospital, VCU = VCU Women’s Health Clinic (J) Troy DNA extraction details. *Demin. = demineralization; digest = digestion buffer (K) HTS data sets. (L) Summary of unique shotgun reads (from nodules, ulna, sediment and associated blanks) of minimum length 35 bp and minimum mapping quality of 30, mapping to Staphylococcus saprophyticus and Gardnerella vaginalis. Nodule pooled =Nod2-UDG + Nod1_1h-nonU + Nod2-nonU + Nod1_1h-UDG). (M) Kinetic analysis of ancient PC1 β-lactamase (N) S. saprophyticus Clade P specific genes. Locus tags refer to annotation of ATCC 15305 available at NCBI. (O) Non-synonymous variants shared between S. saprophyticus Clade P and isolate 55. REF is reference allele in ATCC15305, Clade P and isolate 55. ALT is allele in the remaining Clade E isolates. Locus tags refer to annotation of ATCC 15305 available at NCBI.

DOI: http://dx.doi.org/10.7554/eLife.20983.037

elife-20983-supp1.xlsx (45.3KB, xlsx)
DOI: 10.7554/eLife.20983.037

Major datasets

The following datasets were generated:

Henrike Kiesewetter,William Aylward,Caitlin S Pepperell,Tatum Mortimer,Alison M. Devault,Ana T Duggan,Hendrik N Poinar,Jacob M Enk,Melanie Kuch,2016,Ancient DNA sequencing of calcified nodules from Late Byzantine Troy,https://www.ncbi.nlm.nih.gov/bioproject/PRJNA352376,Publicly available at the NCBI (accession no: PRJNA352376)

Caitlin S Pepperell,Tatum Mortimer,Makoto Kuroda,Kengo Kato,2016,Whole genome sequencing of human and animal associated Staphylococcus saprophyticus,https://www.ncbi.nlm.nih.gov/bioproject/PRJNA352403,Publicly available at the NCBI (accession no: PRJNA352403)

References

  1. Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Research. 2010;38:W7–W13. doi: 10.1093/nar/gkq291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ahmed A, Earl J, Retchless A, Hillier SL, Rabe LK, Cherpes TL, Powell E, Janto B, Eutsey R, Hiller NL, Boissy R, Dahlgren ME, Hall BG, Costerton JW, Post JC, Hu FZ, Ehrlich GD. Comparative genomic analyses of 17 clinical isolates of Gardnerella vaginalis provide evidence of multiple genetically isolated clades consistent with subspeciation into genovars. Journal of Bacteriology. 2012;194:3922–3937. doi: 10.1128/JB.00056-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control. 1974;19:716–723. doi: 10.1109/TAC.1974.1100705. [DOI] [Google Scholar]
  4. Ames SK, Hysom DA, Gardner SN, Lloyd GS, Gokhale MB, Allen JE. Scalable metagenomic taxonomy classification using a reference genome database. Bioinformatics. 2013;29:2253–2260. doi: 10.1093/bioinformatics/btt389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Anderson T. A medieval bladder stone from Norwich, Norfolk. International Journal of Osteoarchaeology. 2003;13:165–167. doi: 10.1002/oa.641. [DOI] [Google Scholar]
  6. Andrews RM, Kubacka I, Chinnery PF, Lightowlers RN, Turnbull DM, Howell N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nature Genetics. 1999;23:147. doi: 10.1038/13779. [DOI] [PubMed] [Google Scholar]
  7. Angiuoli SV, Salzberg SL. Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics. 2011;27:334–342. doi: 10.1093/bioinformatics/btq665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Arianpour N, Safari A, Hatami F. Bacteria Isolated from Post-Partum Infections. Journal of Family and Reproductive Health. 2009;3:63–66. [Google Scholar]
  9. Balaji KC, Menon M. Mechanism of stone formation. The Urologic Clinics of North America. 1997;24:1–11. doi: 10.1016/s0094-0143(05)70350-5. [DOI] [PubMed] [Google Scholar]
  10. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology: A Journal of Computational Molecular Cell Biology. 2012;19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Beaumont W, Beverly R, Southon J, Taylor RE. Bone preparation at the KCCAMS laboratory. Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms. 2010;268:906–909. doi: 10.1016/j.nimb.2009.10.061. [DOI] [Google Scholar]
  12. Benirschke K, Kaufmann P, Baergen RN. Pathology of the human placenta. Springer; 2006. [DOI] [Google Scholar]
  13. Bourbou C. Health and Disease in Byzantine Crete (7th-12th Centuries AD) Ashgate Pub; 2010. [Google Scholar]
  14. Briggs DEG. The role of decay and mineralization in the preservation of soft-bodied fossils. Annual Review of Earth and Planetary Sciences. 2003a;31:275–301. doi: 10.1146/annurev.earth.31.100901.144746. [DOI] [Google Scholar]
  15. Briggs DEG. The role of biofilms in the fossilization of non-biomineralized tissues [Internet] In: Krumbein W. E, Paterson D. M, Zavarzin G. A, editors. Fossil and Recent Biofilms. The Netherlands: Springer; 2003b. pp. 281–290. [DOI] [Google Scholar]
  16. Briggs DEG, Kear AJ. Fossilization of soft tissue in the laboratory. Science. 1993;259:1439–1442. doi: 10.1126/science.259.5100.1439. [DOI] [PubMed] [Google Scholar]
  17. Briggs DEG, Kear AJ. Decay and mineralization of Shrimps. Palaios. 1994;9:431–456. doi: 10.2307/3515135. [DOI] [Google Scholar]
  18. Briggs DEG, Moore RA, Shultz JW, Schweigert G. Mineralization of soft-part anatomy and invading microbes in the horseshoe crab Mesolimulus from the Upper Jurassic Lagerstätte of Nusplingen, Germany. Proceedings of the Royal Society B: Biological Sciences. 2005;272:627–632. doi: 10.1098/rspb.2004.3006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Briggs DEG, Wilby PR, Pérez-Moreno BP, Sanz JL, Fregenal-Martínez M. The mineralization of dinosaur soft tissue in the Lower Cretaceous of Las Hoyas, Spain. Journal of the Geological Society. 1997;154:587–588. doi: 10.1144/gsjgs.154.4.0587. [DOI] [Google Scholar]
  20. Briggs DEG, Wilby PR. The role of the calcium carbonate-calcium phosphate switch in the mineralization of soft-bodied fossils. Journal of the Geological Society. 1996;153:665–668. doi: 10.1144/gsjgs.153.5.0665. [DOI] [Google Scholar]
  21. Bromham L. Why do species vary in their rate of molecular evolution? Biology Letters. 2009;5:401–404. doi: 10.1098/rsbl.2009.0136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Brothwell DR, Sandison A. Diseases in Antiquity. Science. 1967;161:875–876. doi: 10.1126/science.161.3844.875-a. [DOI] [Google Scholar]
  23. Bruen TC, Philippe H, Bryant D. A simple and robust statistical test for detecting the presence of recombination. Genetics. 2006;172:2665–2681. doi: 10.1534/genetics.105.048975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Bruzek J. A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology. 2002;117:157–168. doi: 10.1002/ajpa.10012. [DOI] [PubMed] [Google Scholar]
  25. Buikstra JE, Ubelaker DH. Standards for Data Collection From Human Skeletal Remains. 1994. [Google Scholar]
  26. Cherif-Antar A, Moussa–Boudjemâa B, Didouh N, Medjahdi K, Mayo B, Flórez AB. Diversity and biofilm-forming capability of bacteria recovered from stainless steel pipes of a milk-processing dairy plant. Dairy Science & Technology. 2016;96:27–38. doi: 10.1007/s13594-015-0235-4. [DOI] [Google Scholar]
  27. Chikhi R, Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics. 2014;30:31–37. doi: 10.1093/bioinformatics/btt310. [DOI] [PubMed] [Google Scholar]
  28. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly. 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Clark SC, Egan R, Frazier PI, Wang Z. ALE: a generic assembly likelihood evaluation framework for assessing the accuracy of genome and metagenome assemblies. Bioinformatics. 2013;29:435–443. doi: 10.1093/bioinformatics/bts723. [DOI] [PubMed] [Google Scholar]
  30. Core Team R. R Foundation for Statistical Computing; 2015. R: A Language and Environment for Statistical Computing.http://www.gbif.org/resource/81287 [Google Scholar]
  31. Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nature Methods. 2012;9:772. doi: 10.1038/nmeth.2109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Decho AW. In: Encyclopedia of Geobiology. Reitner J, Thiel V, editors. The Netherland: Springer; 2011. pp. 359–362. [DOI] [Google Scholar]
  33. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, McKenna A, Fennell TJ, Kernytsky AM, Sivachenko AY, Cibulskis K, Gabriel SB, Altshuler D, Daly MJ. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. 2011;43:491–498. doi: 10.1038/ng.806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evolutionary Biology. 2007;7:214. doi: 10.1186/1471-2148-7-214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with beauti and the beast 1.7. Molecular Biology and Evolution. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. D’alessio A, Bramanti E, Piperno M, Naccarato G, Vergamini P, Fornaciari G. An 8500-year-old bladder stone from Uzzo Cave (trapani): Fourier Transform–Infrared Spectroscopy Analysis*. Archaeometry. 2005;47:127–136. doi: 10.1111/j.1475-4754.2005.00192.x. [DOI] [Google Scholar]
  37. Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Fitch WM. Toward defining the course of evolution: minimum change for a specific tree topology. Systematic Biology. 1971;20:406–416. doi: 10.1093/sysbio/20.4.406. [DOI] [Google Scholar]
  39. Fitzgerald JR. Livestock-associated Staphylococcus aureus: origin, evolution and public health threat. Trends in Microbiology. 2012;20:192–198. doi: 10.1016/j.tim.2012.01.006. [DOI] [PubMed] [Google Scholar]
  40. Fouts DE, Pieper R, Szpakowski S, Pohl H, Knoblach S, Suh MJ, Huang ST, Ljungberg I, Sprague BM, Lucas SK, Torralba M, Nelson KE, Groah SL. Integrated next-generation sequencing of 16S rDNA and metaproteomics differentiate the healthy urine microbiome from asymptomatic bacteriuria in neuropathic bladder associated with spinal cord injury. Journal of Translational Medicine. 2012;10:174. doi: 10.1186/1479-5876-10-174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Fowler JE. Staphylococcus saprophyticus as the cause of infected urinary calculus. Annals of Internal Medicine. 1985;102:342–343. doi: 10.7326/0003-4819-102-3-342. [DOI] [PubMed] [Google Scholar]
  42. Garrison E, Marth G. Haplotype-based variant detection from short-read sequencing.  Quantitative Biology. 2012 arXiv
  43. Gatermann S, John J, Marre R. Staphylococcus saprophyticus urease: characterization and contribution to uropathogenicity in unobstructed urinary tract infection of rats. Infection and Immunity. 1989;57:110–116. doi: 10.1128/iai.57.1.110-116.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Ghodsi M, Hill CM, Astrovskaya I, Lin H, Sommer DD, Koren S, Pop M. De novo likelihood-based measures for comparing genome assemblies. BMC Research Notes. 2013;6:334. doi: 10.1186/1756-0500-6-334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Giachelli CM. Ectopic calcification. The American Journal of Pathology. 1999;154:671–675. doi: 10.1016/S0002-9440(10)65313-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Giuffra V, Costantini L, Costantini Biasini L, Caramella D, Fornaciari G. Giant bladder stone in a natural mummy of the early 19th century. Urology. 2008;72:780–781. doi: 10.1016/j.urology.2008.04.031. [DOI] [PubMed] [Google Scholar]
  47. Giuffra V, Ventura L, Minozzi S, Lunardini A, Quaresima R, Arrizza L, Fornaciari G. Renal calculosis of Pandolfo III Malatesta (1370-1427) The American Journal of Medicine. 2011;124:1186–1187. doi: 10.1016/j.amjmed.2011.04.036. [DOI] [PubMed] [Google Scholar]
  48. Gurevich A, Saveliev V, Vyahhi N, Tesler G. QUAST: quality assessment tool for genome assemblies. Bioinformatics. 2013;29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Hafez ES, Kenemans P. Atlas of Human Reproduction: By Scanning Electron Microscopy. Springer Science & Business Media; 2012. [Google Scholar]
  50. Harwich MD, Alves JM, Buck GA, Strauss JF, Patterson JL, Oki AT, Girerd PH, Jefferson KK. Drawing the line between commensal and pathogenic Gardnerella vaginalis through genome analysis and virulence studies. BMC Genomics. 2010;11:375. doi: 10.1186/1471-2164-11-375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Hawass ZA, Brock LP. Egyptology at the Dawn of the Twenty-First Century: Language, Conservation, Museology. American Univ in Cairo Press; 2003. [Google Scholar]
  52. Hedman P, Ringertz O, Lindström M, Olsson K. The origin of Staphylococcus saprophyticus from cattle and pigs. Scandinavian Journal of Infectious Diseases. 1993;25:57–60. doi: 10.1080/00365549309169670. [DOI] [PubMed] [Google Scholar]
  53. Higashide M, Kuroda M, Omura CT, Kumano M, Ohkawa S, Ichimura S, Ohta T. Methicillin-resistant Staphylococcus saprophyticus isolates carrying staphylococcal cassette chromosome mec have emerged in urogenital tract infections. Antimicrobial Agents and Chemotherapy. 2008;52:2061–2068. doi: 10.1128/AAC.01150-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Hillier SL, Martius J, Krohn M, Kiviat N, Holmes KK, Eschenbach DA. A case-control study of chorioamnionic infection and histologic chorioamnionitis in prematurity. The New England Journal of Medicine. 1988;319:972–978. doi: 10.1056/NEJM198810133191503. [DOI] [PubMed] [Google Scholar]
  55. Hovelius B, Mårdh PA. Staphylococcus saprophyticus as a common cause of urinary tract infections. Clinical Infectious Diseases. 1984;6:328–337. doi: 10.1093/clinids/6.3.328. [DOI] [PubMed] [Google Scholar]
  56. Hunt M, Kikuchi T, Sanders M, Newbold C, Berriman M, Otto TD. REAPR: a universal tool for genome assembly evaluation. Genome Biology. 2013;14:R47. doi: 10.1186/gb-2013-14-5-r47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Huson DH, Bryant D. Application of phylogenetic networks in evolutionary studies. Molecular Biology and Evolution. 2006;23:254–267. doi: 10.1093/molbev/msj030. [DOI] [PubMed] [Google Scholar]
  58. Jablonka P. Preliminary report on work at Troia 2005. Studia Troica . 2006:3–26. [Google Scholar]
  59. Jakobsen IB, Easteal S. A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences. Bioinformatics. 1996;12:291–295. doi: 10.1093/bioinformatics/12.4.291. [DOI] [PubMed] [Google Scholar]
  60. Jonsson H, Ginolhac A, Schubert M, Johnson PLF, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics. 2013;29:1682–1684. doi: 10.1093/bioinformatics/btt193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kahlmeter G, ECO.SENS An international survey of the antimicrobial susceptibility of pathogens from uncomplicated urinary tract infections: the ECO.SENS Project. Journal of Antimicrobial Chemotherapy. 2003;51:69–76. doi: 10.1093/jac/dkg028. [DOI] [PubMed] [Google Scholar]
  62. Katoh K, Standley DM. MAFFT: iterative refinement and additional methods. Methods in Molecular Biology. 2014;1079:131–146. doi: 10.1007/978-1-62703-646-7_8. [DOI] [PubMed] [Google Scholar]
  63. Kiesewetter H. Troia 1987-2012 Grabungen und Forschungen I. Bonn: Dr. Rudolf Habelt GmbH; 2014. Paläoanthropologische Untersuchungen in Troia; pp. 627–631. [Google Scholar]
  64. Kircher M, Sawyer S, Meyer M. Double indexing overcomes inaccuracies in multiplex sequencing on the Illumina platform. Nucleic Acids Research. 2012;40:e3. doi: 10.1093/nar/gkr771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Kloss-Brandstätter A, Pacher D, Schönherr S, Weissensteiner H, Binna R, Specht G, Kronenberg F. Haplogrep: a fast and reliable algorithm for automatic classification of mitochondrial dna haplogroups. Human Mutation. 2011;32:25–32. doi: 10.1002/humu.21382. [DOI] [PubMed] [Google Scholar]
  66. Knoll T, Schubert AB, Fahlenkamp D, Leusmann DB, Wendt-Nordahl G, Schubert G. Urolithiasis through the ages: data on more than 200,000 urinary stone analyses. The Journal of Urology. 2011;185:1304–1311. doi: 10.1016/j.juro.2010.11.073. [DOI] [PubMed] [Google Scholar]
  67. Koren S, Treangen TJ, Hill CM, Pop M, Phillippy AM. Automated ensemble assembly and validation of microbial genomes. BMC Bioinformatics. 2014;15:126. doi: 10.1186/1471-2105-15-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Kraus F, Redline RW, Gersell DJ, Nelson DM, Dicke JM. Placental Pathology. American Registry Of Pathology; 2004. [Google Scholar]
  69. Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Research. 2009;19:1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Kuroda M, Yamashita A, Hirakawa H, Kumano M, Morikawa K, Higashide M, Maruyama A, Inose Y, Matoba K, Toh H, Kuhara S, Hattori M, Ohta T. Whole genome sequence of Staphylococcus saprophyticus reveals the pathogenesis of uncomplicated urinary tract infection. PNAS. 2005;102:13272–13277. doi: 10.1073/pnas.0502950102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Lagier R, Baud CA. Magnesium whitlockite, a calcium phosphate crystal of special interest in pathology. Pathology - Research and Practice. 2003;199:329–335. doi: 10.1078/0344-0338-00425. [DOI] [PubMed] [Google Scholar]
  72. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Larsson P, Elfsmark D, Svensson K, Wikström P, Forsman M, Brettin T, Keim P, Johansson A. Molecular evolutionary consequences of niche restriction in Francisella tularensis, a facultative intracellular pathogen. PLoS Pathogens. 2009;5:e1000472. doi: 10.1371/journal.ppat.1000472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Lee M, Hesek D, Mobashery S. A practical synthesis of nitrocefin. The Journal of Organic Chemistry. 2005;70:367–369. doi: 10.1021/jo0487395. [DOI] [PubMed] [Google Scholar]
  75. Lefort J. The Economic History of Byzantium. Vol 1. Dumbarton Oaks; 2007. The Rural Economy, Seventh--Twelfth Centuries; pp. 231–310. [Google Scholar]
  76. Lewis DA, Brown R, Williams J, White P, Jacobson SK, Marchesi JR, Drake MJ. The human urinary microbiome; bacterial DNA in voided urine of asymptomatic adults. Frontiers in Cellular and Infection Microbiology. 2013;3:41. doi: 10.3389/fcimb.2013.00041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. Quantitative Biology. 2013 arXiv:1303.3997
  78. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Li L, Stoeckert CJ, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Research. 2003;13:2178–2189. doi: 10.1101/gr.1224503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Maixner F, Krause-Kyora B, Turaev D, Herbig A, Hoopmann MR, Hallows JL, Kusebauch U, Vigl EE, Malfertheiner P, Megraud F, O'Sullivan N, Cipollini G, Coia V, Samadelli M, Engstrand L, Linz B, Moritz RL, Grimm R, Krause J, Nebel A, Moodley Y, Rattei T, Zink A. The 5300-year-old Helicobacter pylori genome of the Iceman. Science. 2016;351:162–165. doi: 10.1126/science.aad2545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet Journal. 2011;17:10–12. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  83. Marttinen P, Hanage WP, Croucher NJ, Connor TR, Harris SR, Bentley SD, Corander J. Detection of recombination events in bacterial genomes from large population samples. Nucleic Acids Research. 2012;40:e6. doi: 10.1093/nar/gkr928. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Meindl RS, Lovejoy CO. Ectocranial suture closure: a revised method for the determination of skeletal age at death based on the lateral-anterior sutures. American Journal of Physical Anthropology. 1985;68:57–66. doi: 10.1002/ajpa.1330680106. [DOI] [PubMed] [Google Scholar]
  85. Metcalf JL, Xu ZZ, Weiss S, Lax S, Van Treuren W, Hyde ER, Song SJ, Amir A, Larsen P, Sangwan N, Haarmann D, Humphrey GC, Ackermann G, Thompson LR, Lauber C, Bibat A, Nicholas C, Gebert MJ, Petrosino JF, Reed SC, Gilbert JA, Lynne AM, Bucheli SR, Carter DO, Knight R. Microbial community assembly and metabolic function during mammalian corpse decomposition. Science. 2016;351:158–162. doi: 10.1126/science.aad2646. [DOI] [PubMed] [Google Scholar]
  86. Meyer M, Kircher M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harbor Protocols. 2010;2010:pdb.prot5448. doi: 10.1101/pdb.prot5448. [DOI] [PubMed] [Google Scholar]
  87. Mlakar J, Korva M, Tul N, Popović M, Poljšak-Prijatelj M, Mraz J, Kolenc M, Resman Rus K, Vesnaver Vipotnik T, Fabjan Vodušek V, Vizjak A, Pižem J, Petrovec M, Avšič Županc T. Zika virus associated with microcephaly. The New England Journal of Medicine. 2016;374:951–958. doi: 10.1056/NEJMoa1600651. [DOI] [PubMed] [Google Scholar]
  88. Moyer AE, Zheng W, Johnson EA, Lamanna MC, Li DQ, Lacovara KJ, Schweitzer MH. Melanosomes or microbes: testing an alternative hypothesis for the origin of microbodies in fossil feathers. Scientific Reports. 2014;4:4233. doi: 10.1038/srep04233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Özdemir K, Akyol AA, Erdal YS. A case of ancient bladder stones from Oluz Höyük, Amasya, Turkey. International Journal of Osteoarchaeology. 2015;25:827–837. doi: 10.1002/oa.2349. [DOI] [Google Scholar]
  90. Page AJ, Keane JA, Delaney AJ, Taylor B, Seemann T, Harris SR, Soares J. SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments. Microbial Genomics. 2016;2 doi: 10.1099/mgen.0.000056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Pak CYC. Renal Stone Disease: Pathogenesis, Prevention, and Treatment. Springer Science & Business Media; 1987. [Google Scholar]
  92. Parker J, Rambaut A, Pybus OG. Correlating viral phenotypes with phylogeny: Accounting for phylogenetic uncertainty. Infection, Genetics and Evolution. 2008;8:239–246. doi: 10.1016/j.meegid.2007.08.001. [DOI] [PubMed] [Google Scholar]
  93. Parkhill J, Dougan G, James KD, Thomson NR, Pickard D, Wain J, Churcher C, Mungall KL, Bentley SD, Holden MT, Sebaihia M, Baker S, Basham D, Brooks K, Chillingworth T, Connerton P, Cronin A, Davis P, Davies RM, Dowd L, White N, Farrar J, Feltwell T, Hamlin N, Haque A, Hien TT, Holroyd S, Jagels K, Krogh A, Larsen TS, Leather S, Moule S, O'Gaora P, Parry C, Quail M, Rutherford K, Simmonds M, Skelton J, Stevens K, Whitehead S, Barrell BG. Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature. 2001;413:848–852. doi: 10.1038/35101607. [DOI] [PubMed] [Google Scholar]
  94. Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal peptides from transmembrane regions. Nature Methods. 2011;8:785–786. doi: 10.1038/nmeth.1701. [DOI] [PubMed] [Google Scholar]
  95. Piperno M. Scoperta di una sepoltura doppia epigravettiana nella grotta dell’Uzzo (Trapani) Kokalos Palermo. 1976;2:734–760. [Google Scholar]
  96. Posth C, Renaud G, Mittnik A, Drucker DG, Rougier H, Cupillard C, Valentin F, Thevenet C, Furtwängler A, Wißing C, Francken M, Malina M, Bolus M, Lari M, Gigli E, Capecchi G, Crevecoeur I, Beauval C, Flas D, Germonpré M, van der Plicht J, Cottiaux R, Gély B, Ronchitelli A, Wehrberger K, Grigorescu D, Svoboda J, Semal P, Caramelli D, Bocherens H, Harvati K, Conard NJ, Haak W, Powell A, Krause J. Pleistocene mitochondrial genomes suggest a single major dispersal of non-Africans and a Late Glacial population turnover in Europe. Current Biology. 2016;26:827–833. doi: 10.1016/j.cub.2016.01.037. [DOI] [PubMed] [Google Scholar]
  97. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Rahman A, Pachter L. CGAL: computing genome assembly likelihoods. Genome Biology. 2013;14:R8. doi: 10.1186/gb-2013-14-1-r8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen) Virus Evolution. 2016;2:vew007. doi: 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Redline RW. In: Fetal and Neonatal Pathology. Jean W. K, Khong T. Y, editors. London: Springer; 2007. pp. 90–101. [Google Scholar]
  101. Renaud G, Slon V, Duggan AT, Kelso J. Schmutzi: estimation of contamination and endogenous mitochondrial consensus calling for ancient DNA. Genome Biology. 2015;16:224. doi: 10.1186/s13059-015-0776-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Renaud G, Stenzel U, Kelso J. leeHom: adaptor trimming and merging for Illumina sequencing reads. Nucleic Acids Research. 2014;42:e141. doi: 10.1093/nar/gku699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Reuter S, Connor TR, Barquist L, Walker D, Feltwell T, Harris SR, Fookes M, Hall ME, Petty NK, Fuchs TM, Corander J, Dufour M, Ringwood T, Savin C, Bouchier C, Martin L, Miettinen M, Shubin M, Riehm JM, Laukkanen-Ninios R, Sihvonen LM, Siitonen A, Skurnik M, Falcão JP, Fukushima H, Scholz HC, Prentice MB, Wren BW, Parkhill J, Carniel E, Achtman M, McNally A, Thomson NR. Parallel independent evolution of pathogenicity within the genus Yersinia. PNAS. 2014;111:6768–6773. doi: 10.1073/pnas.1317161111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Ringertz O, Torssander J. Prevalence of Staphylococcus saprophyticus in patients in a venereal disease clinic. European Journal of Clinical Microbiology. 1986;5:358–361. doi: 10.1007/BF02017800. [DOI] [PubMed] [Google Scholar]
  105. Romero R, Schaudinn C, Kusanovic JP, Gorur A, Gotsch F, Webster P, Nhan-Chang CL, Erez O, Kim CJ, Espinoza J, Gonçalves LF, Vaisbuch E, Mazaki-Tovi S, Hassan SS, Costerton JW. Detection of a microbial biofilm in intraamniotic infection. American Journal of Obstetrics and Gynecology. 2008;198:135.e1-5. doi: 10.1016/j.ajog.2007.11.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Romero V, Akpinar H, Assimos DG. Kidney stones: a global picture of prevalence, incidence, and associated risk factors. Reviews in Urology. 2010;12:e86–e96. [PMC free article] [PubMed] [Google Scholar]
  107. Ronchetti I, Boraldi F, Annovi G, Cianciulli P, Quaglino D. Fibroblast involvement in soft connective tissue calcification. Frontiers in Genetics. 2013;4:1–16. doi: 10.3389/fgene.2013.00022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Rupp ME, Soper DE, Archer GL. Colonization of the female genital tract with Staphylococcus saprophyticus. Journal of Clinical Microbiology. 1992;30:2975–2979. doi: 10.1128/jcm.30.11.2975-2979.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Sayer D, Dickinson SD. Reconsidering obstetric death and female fertility in Anglo-Saxon England. World Archaeology. 2013;45:285–297. doi: 10.1080/00438243.2013.799044. [DOI] [Google Scholar]
  110. Scheuer B, Scheuer L, Black S. Development Juvenile Osteology. Londres: Academic Press; 2000. [DOI] [Google Scholar]
  111. Schneider PF, Riley TV. Staphylococcus saprophyticus urinary tract infections: epidemiological data from Western Australia. European Journal of Epidemiology. 1996;12:51–54. doi: 10.1007/BF00144428. [DOI] [PubMed] [Google Scholar]
  112. Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:btu153. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  113. Shepheard MA, Fleming VM, Connor TR, Corander J, Feil EJ, Fraser C, Hanage WP. Historical zoonoses and other changes in host tropism of Staphylococcus aureus, identified by phylogenetic analysis of a population dataset. PloS One. 2013;8:e62369. doi: 10.1371/journal.pone.0062369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Skoglund P, Storå J, Götherström A, Jakobsson M. Accurate sex identification of ancient human remains using DNA shotgun sequencing. Journal of Archaeological Science. 2013;40:4477–4482. doi: 10.1016/j.jas.2013.07.004. [DOI] [Google Scholar]
  115. Smith JM. Analyzing the mosaic structure of genes. Journal of Molecular Evolution. 1992;34:126–129. doi: 10.1007/BF00182389. [DOI] [PubMed] [Google Scholar]
  116. Soares P, Ermini L, Thomson N, Mormina M, Rito T, Röhl A, Salas A, Oppenheimer S, Macaulay V, Richards MB. Correcting for purifying selection: an improved human mitochondrial molecular clock. American Journal of Human Genetics. 2009;84:740–759. doi: 10.1016/j.ajhg.2009.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Soge OO, Meschke JS, No DB, Roberts MC. Characterization of methicillin-resistant Staphylococcus aureus and methicillin-resistant coagulase-negative Staphylococcus spp. isolated from US West Coast public marine beaches. The Journal of Antimicrobial Chemotherapy. 2009;64:1148–1155. doi: 10.1093/jac/dkp368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Spirt BA, Cohen WN, Weinstein HM. The incidence of placental calcification in normal pregnancies. Radiology. 1982;142:707–711. doi: 10.1148/radiology.142.3.7063688. [DOI] [PubMed] [Google Scholar]
  119. Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. Steinbock R. Studies in ancient calcified soft tissues and organic concretions. II.Urolithiasis (renal and urinary bladder stone disease) Journal of Paleopathology. 1989;3:39–59. [Google Scholar]
  121. Szalai F, Jávor E. Finding of a bladder stone from the Avar period in southeast Hungary. International Urology and Nephrology. 1987;19:151–157. doi: 10.1007/BF02550466. [DOI] [PubMed] [Google Scholar]
  122. Tavichakorntrakool R, Prasongwattana V, Sungkeeree S, Saisud P, Sribenjalux P, Pimratana C, Bovornpadungkitti S, Sriboonlue P, Thongboonkerd V. Extensive characterizations of bacteria isolated from catheterized urine and stone matrices in patients with nephrolithiasis. Nephrology, Dialysis, Transplantation. 2012;27:4125–4130. doi: 10.1093/ndt/gfs057. [DOI] [PubMed] [Google Scholar]
  123. Tindall VR, Scott JS. Placental calcification a study of 3,025 Singleton and multiple pregnancies. International Journal of Obstetrics & Gynaecology. 1965;72:356–373. doi: 10.1111/j.1471-0528.1965.tb01470.x. [DOI] [PubMed] [Google Scholar]
  124. Treangen TJ, Koren S, Sommer DD, Liu B, Astrovskaya I, Ondov B, Darling AE, Phillippy AM, Pop M. MetAMOS: a modular and open source metagenomic assembly and analysis pipeline. Genome Biology. 2013;14:R2. doi: 10.1186/gb-2013-14-1-r2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Human Mutation. 2009;30:E386–E394. doi: 10.1002/humu.20921. [DOI] [PubMed] [Google Scholar]
  126. Vickers AA, Chopra I, O'Neill AJ. Intrinsic novobiocin resistance in Staphylococcus saprophyticus. Antimicrobial Agents and Chemotherapy. 2007;51:4484–4485. doi: 10.1128/AAC.00708-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Vinther J, Briggs DE, Prum RO, Saranathan V. The colour of fossil feathers. Biology Letters. 2008;4:522–525. doi: 10.1098/rsbl.2008.0302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Vinther J. Fossil melanosomes or bacteria? A wealth of findings favours melanosomes. BioEssays. 2016;38:220–225. doi: 10.1002/bies.201500168. [DOI] [PubMed] [Google Scholar]
  129. Wagner DM, Klunk J, Harbeck M, Devault A, Waglechner N, Sahl JW, Enk J, Birdsell DN, Kuch M, Lumibao C, Poinar D, Pearson T, Fourment M, Golding B, Riehm JM, Earn DJ, Dewitte S, Rouillard JM, Grupe G, Wiechmann I, Bliska JB, Keim PS, Scholz HC, Holmes EC, Poinar H. Yersinia pestis and the plague of Justinian 541-543 AD: a genomic analysis. The Lancet Infectious Diseases. 2014;14:319–326. doi: 10.1016/S1473-3099(13)70323-2. [DOI] [PubMed] [Google Scholar]
  130. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PloS One. 2014;9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  131. Wang TH, Donaldson YK, Brettle RP, Bell JE, Simmonds P. Identification of shared populations of human immunodeficiency virus type 1 infecting microglia and tissue macrophages outside the central nervous system. Journal of Virology. 2001;75:11686–11699. doi: 10.1128/JVI.75.23.11686-11699.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Weinert LA, Depledge DP, Kundu S, Gershon AA, Nichols RA, Balloux F, Welch JJ, Breuer J. Rates of vaccine evolution show strong effects of latency: implications for varicella zoster virus epidemiology. Molecular Biology and Evolution. 2015;32:1020–1028. doi: 10.1093/molbev/msu406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  133. Weinert LA, Welch JJ, Suchard MA, Lemey P, Rambaut A, Fitzgerald JR. Molecular dating of human-to-bovid host jumps by Staphylococcus aureus reveals an association with the spread of domestication. Biology Letters. 2012;8:829–832. doi: 10.1098/rsbl.2012.0290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  134. WHO. UNICEF. UNFPA. The World Bank United Nations Population Division . Trends in Maternal Mortality: 1990 to 2013. Estimates by WHO, UNICEF. (UNFPA, the World Bank and the United Nations Population Division) Geneva: World Health Organization; 2014. [Google Scholar]
  135. Widerström M, Wiström J, Ferry S, Karlsson C, Monsen T. Molecular epidemiology of Staphylococcus saprophyticus isolated from women with uncomplicated community-acquired urinary tract infection. Journal of Clinical Microbiology. 2007;45:1561–1564. doi: 10.1128/JCM.02071-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Widerström M, Wiström J, Sjöstedt A, Monsen T. Coagulase-negative staphylococci: update on the molecular epidemiology and clinical presentation, with a focus on Staphylococcus epidermidis and Staphylococcus saprophyticus. European Journal of Clinical Microbiology & Infectious Diseases. 2012;31:7–20. doi: 10.1007/s10096-011-1270-6. [DOI] [PubMed] [Google Scholar]
  137. Wood DE, Salzberg SL. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biology. 2014;15:R46. doi: 10.1186/gb-2014-15-3-r46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  138. Wuttke M. Weichteil-Erhaltung’durch lithifizierte Mikroorganismen bei mittel-eozänen Vertebraten aus den Ölschiefern der ‘Grube Messel’bei Darmstadt. Senckenbergiana Lethaea. 1983;64:509–527. [Google Scholar]
  139. Zerbino DR, Birney E. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Research. 2008;18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  140. Zimin AV, Marçais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. The masurca genome assembler. Bioinformatics. 2013;29:btt476. doi: 10.1093/bioinformatics/btt476. [DOI] [PMC free article] [PubMed] [Google Scholar]
eLife. 2017 Jan 10;6:e20983. doi: 10.7554/eLife.20983.042

Decision letter

Editor: George H Perry1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "An ancient emerging infection as a cause of maternal sepsis in Late Byzantine Troy" for consideration by eLife. Your article has been favorably evaluated by Richard Losick as the Senior Editor and three reviewers, including George H Perry (Reviewer #1) - who is a member of our Board of Reviewing Editors. – and Laura Weyrich (Reviewer #2).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

Devault et al. use DNA sequencing to describe, for the first time, urogenital infections in a woman buried ~800 years ago in Troy. The authors compare DNA present within calcified nodules (which are also analyzed morphologically) that were discovered in association with human skeletal remains to environmental, skeletal, and modern data to determine the source of the nodules and identify two bacterial species preserved within. The authors hypothesize that the infection may have been linked with pregnancy, due to some Y chromosome reads preserved within the nodule. The preservation of the Staphylococcus saprophyticus DNA, especially, inside the calcified nodule is astounding, providing opportunity for reconstructing a complete, high-quality genome sequence.

Essential revisions:

The reviewers have raised and discussed several major concerns that must be adequately addressed before your paper can be considered for acceptance. These necessary revisions all involve re-analysis of existing data and associated modifications to the presentation and interpretation of the results in the text, rather than generation of new data. Thus, we believe that these revisions can reasonably be completed within two months.

1) There is insufficient evidence presented in the manuscript to support the authors' conclusions about catching the "emergence" of a pathogenic human S. saprophyticus strain, or to determine whether this pathogen was passed from domestic animals to humans or vice versa, or even to exclude other potential scenarios in which both populations originally acquired the pathogen from the same source. This claim is made with a single cattle isolate that falls basal to the strain identified in Troy; however, other animal isolates fall within the tree. In the absence of additional cattle strains and wider sampling in general, the results should be treated with appropriate caution and interpretations tempered. Divergence date estimates generated with the field's current gold standard (BEAST) should be provided in the main text. If these estimates cannot be obtained for a technical reason or limitation then this needs to be clarified in the main text, not only the supplementary information.

1A) Note that an associated title change will likely be necessary.

2) The hypothesis of a fetal origin for the calcified nodule is intriguing, but needs further exploration. The genetic sex analyses in the paper are not rigorous to current standards, and leave more uncertainty than necessary. The authors should apply the Skoglund et al. 2013 J. Arch. Sci. (Accurate sex identification of ancient human remains using DNA shotgun sequencing) analysis to the data from both the nodule and the ulna, and then illustrate this result with a main figure, since this differential diagnosis is a key finding of the paper. The Y chromosome damage patterns presented in the supplementary information are not sufficiently robust (due to the limited number of reads) to determine if the DNA is ancient. The number of nuclear genome SNP sites covered by one or more reads in both the ulna and the nodule is likely insufficient for an assessment of relatedness. Please check and either perform the analysis or expand on the discussion of limitations in the manuscript.

3) The authors mention that extraction blank control samples were processed for enrichment, but the metagenome data for these samples are not shown. These data should be provided to further demonstrate authenticity of the results. For example, the authors could strengthen the argument that the Y chromosome and S. saprophyticus reads are endogenous to the sample and not the lab environment. With the high proportions of DNA mapping to S. saprophyticus, it is unlikely that these data reflect contaminants; however, the laboratory extraction procedure was extensive. Showing and discussing the extraction metagenome throughout the text would help to alleviate this concern.

4) The quality control assessment of whether the nodules belonged to the female whose skeletal remains they were associated has a logical flaw (easily corrected). Human mtDNA genome sequences obtained from both the nodule and from aDNA extracted from the individual's ulna were identical. The authors interpret this result as evidence that the nodule and the ulna come from the same individual. Of course, the authors elsewhere in the manuscript suggest a placental origin for the nodules, via chorioamnionitis with a male fetus. This underscores the inability for mtDNA to facilitate confident individual identifications, since it is inherited maternally without recombination. Thus, the authors should be more circumspect in their identification based on this evidence. If this is DNA from a male fetus, then all evidence is indeed pointing to an association with the female whose skeletal remains were recovered, but please ensure precision when discussing this evidence.

5) There is currently no description provided of the osteological materials and the methods and results leading to the conclusion that the skeletal remains are from a likely female individual, of ~30 years of age. This needs to be remedied, as these designations are critical for the interpretation.

6) All data must be provided via appropriate repositories, with clear descriptions of accession numbers for each dataset. Sequence reads from all experiments (and ideally intermediate processing files, and alignments, e.g., through Dryad), including the metagenomic datasets and the raw data from the bone (i.e., not simply the genome sequences), the SEM data, XRD data, etc., associated with this work all need to be thoroughly curated and made available.

eLife. 2017 Jan 10;6:e20983. doi: 10.7554/eLife.20983.043

Author response


Essential revisions:

The reviewers have raised and discussed several major concerns that must be adequately addressed before your paper can be considered for acceptance. These necessary revisions all involve re-analysis of existing data and associated modifications to the presentation and interpretation of the results in the text, rather than generation of new data. Thus, we believe that these revisions can reasonably be completed within two months.

1) There is insufficient evidence presented in the manuscript to support the authors' conclusions about catching the "emergence" of a pathogenic human S. saprophyticus strain, or to determine whether this pathogen was passed from domestic animals to humans or vice versa, or even to exclude other potential scenarios in which both populations originally acquired the pathogen from the same source. This claim is made with a single cattle isolate that falls basal to the strain identified in Troy; however, other animal isolates fall within the tree. In the absence of additional cattle strains and wider sampling in general, the results should be treated with appropriate caution and interpretations tempered. Divergence date estimates generated with the field's current gold standard (BEAST) should be provided in the main text. If these estimates cannot be obtained for a technical reason or limitation then this needs to be clarified in the main text, not only the supplementary information.

1A) Note that an associated title change will likely be necessary.

We appreciate the reviewers’ comments that we have insufficient evidence to identify the emergence of a human pathogenic lineage of S. saprophyticus. We have therefore removed all references to this concept and changed the title to “A molecular portrait of maternal sepsis from Byzantine Troy”.

We attempted to estimate the nucleotide substitution rate for S. saprophyticus but were unable to generate precise estimates due to a striking lack of temporal signal in the data. We have added a section to the main text that describes these attempts and our hypotheses about potential reasons we failed to observe any temporal signal.

Upon re-review of the supplementary material, we decided that the rate analyses for Gardnerella vaginalis were based on a sample of insufficient size and we have removed this paragraph.

2) The hypothesis of a fetal origin for the calcified nodule is intriguing, but needs further exploration. The genetic sex analyses in the paper are not rigorous to current standards, and leave more uncertainty than necessary. The authors should apply the Skoglund et al. 2013 J. Arch. Sci. (Accurate sex identification of ancient human remains using DNA shotgun sequencing) analysis to the data from both the nodule and the ulna, and then illustrate this result with a main figure, since this differential diagnosis is a key finding of the paper.

Our hypothesis about the origin of the nodules was not well described in the original manuscript: we don’t posit a fetal origin for the nodules, rather we hypothesize that they are of placental origin. Placental tissue is of both fetal and maternal origin, and the inflammatory response to infection of the placenta and fetal membranes involves cells from mother and fetus. Our observations of the nodules are consistent with their having an origin as placental abscesses consisting of bacterial cells, maternal inflammatory cells with a minority component of fetal inflammatory cells. We have amended the description in the main text to make this logic more explicit and clear to the reader.

To address comments regarding appropriate sexing of the remains we have applied the methodology of Skoglund et al. This assigns a female sex to the nodules, with high certainty, reflecting a majority maternal origin/signal for the inflammatory cells in the abscess. We have added this detail to the main text and the supplementary material with an associated table. We don’t however think that a figure is necessary to convey this information and hope the reviewer concurs.

We also agree with the reviewer that there are insufficient data from nuclear SNPs for an assessment of relatedness and thus feel that a figure would unduly draw attention to this aspect of the paper. We have modified the text accordingly (re: sexing).

“We attempted to sex the remains (i.e. calcified nodules, ulna and their associated blanks as well as the surrounding sediments and blanks) using the method developed and published by Skoglund et al. (2013). […] Importantly, blanks, also with low read counts (between 1 and 35) are labelled as ‘consistent with XX’ however with 95% confidence intervals (at those levels) of 0.”

The Y chromosome damage patterns presented in the supplementary information are not sufficiently robust (due to the limited number of reads) to determine if the DNA is ancient. The number of nuclear genome SNP sites covered by one or more reads in both the ulna and the nodule is likely insufficient for an assessment of relatedness. Please check and either perform the analysis or expand on the discussion of limitations in the manuscript.

The number of reads mapping to the Y in our nodule libraries is low (884) but not (as we have shown) insignificant, when compared to all necessary controls (sediment, blanks, etc.).

While the signal of deamination (as C —> T and G—> A) is present at the termini (-1 to – 14bp), it is indeed low. This is due to the fact that the majority of Y reads come from the more deeply sequenced, UDG treated libraries.

In summary, we feel that the Y- chromosomal FLDs’ low but none-the-less present damage signal, their independent identification in the Skoglund et al. 2013 method (at the reviewers’ suggestion), and our Fisher’s exact test, suggest their authenticity more so than their identification as spurious low level contamination not found in equally sequenced bone or sedimentary remains. We hope the reviewers agree.

3) The authors mention that extraction blank control samples were processed for enrichment, but the metagenome data for these samples are not shown. These data should be provided to further demonstrate authenticity of the results. For example, the authors could strengthen the argument that the Y chromosome and S. saprophyticus reads are endogenous to the sample and not the lab environment. With the high proportions of DNA mapping to S. saprophyticus, it is unlikely that these data reflect contaminants; however, the laboratory extraction procedure was extensive. Showing and discussing the extraction metagenome throughout the text would help to alleviate this concern.

We apologize if this was not clear, there are a lot of data spread across various tables. The results the reviewer is requesting are indeed listed in Supplementary file 1G. This table is (as quoted in the supplementary file) a “Summary of unique shotgun reads (from nodules, ulna, sediment and associated blanks) of minimum length 35bp and minimum mapping quality of 30 (or greater), mapping to Staphylococcus saprophyticus and Gardnerella vaginalis. Nodule pooled =Nod2-UDG + Nod1_1h-nonU + Nod2-nonU + Nod1_1h-UDG).”

As the reviewers will see, there are low level reads in the blanks that do map to S. saprophyticus (0-53 reads, the highest being the sediment) and G. vaginalis (0-9). In both cases, when compared with the nodules these represent minute fractions of what’s detected in the nodules and thus unlikely to be contamination. We did, in addition, perform metagenomic analysis on all reads found in the blanks. Reads (>100) that could be identified at the sequence level or the species/strain level from all blank extracts were removed from final files used in the PCA analysis.

4) The quality control assessment of whether the nodules belonged to the female whose skeletal remains they were associated has a logical flaw (easily corrected). Human mtDNA genome sequences obtained from both the nodule and from aDNA extracted from the individual's ulna were identical. The authors interpret this result as evidence that the nodule and the ulna come from the same individual. Of course, the authors elsewhere in the manuscript suggest a placental origin for the nodules, via chorioamnionitis with a male fetus. This underscores the inability for mtDNA to facilitate confident individual identifications, since it is inherited maternally without recombination. Thus, the authors should be more circumspect in their identification based on this evidence. If this is DNA from a male fetus, then all evidence is indeed pointing to an association with the female whose skeletal remains were recovered, but please ensure precision when discussing this evidence.

As noted in the response to question 2 above, we believe that the nodules are of both maternal and fetal origin. As the reviewers point out, mitochondrial sequences of maternal inflammatory cells, fetal inflammatory cells and maternal bone cells should be identical. Our finding of identical mitochondrial haplotypes in the ulna and nodules is consistent with this scenario.

We feel that the most parsimonious explanation of our observation of identical haplotypes in the nodules and ulna is that we have enriched and sequenced the identical woman’s mitogenome, with a possible minority component from her fetus. However, there is no precise way, without autosomal SNPs, to be 100% certain. For this reason, we have modified the following sentence in the main text to read:

“The nodule and the ulna share the identical mitochondrial haplotype (Supplementary file 5), suggesting that they likely stem from the same individual or a maternal relative.”

5) There is currently no description provided of the osteological materials and the methods and results leading to the conclusion that the skeletal remains are from a likely female individual, of ~30 years of age. This needs to be remedied, as these designations are critical for the interpretation.

Thank you for pointing this out. We have added a detailed description of the osteological materials and methods used for sexing and age estimation of the woman at Troy’s remains (1st section of the supplement).

6) All data must be provided via appropriate repositories, with clear descriptions of accession numbers for each dataset. Sequence reads from all experiments (and ideally intermediate processing files, and alignments, e.g., through Dryad), including the metagenomic datasets and the raw data from the bone (i.e., not simply the genome sequences), the SEM data, XRD data, etc., associated with this work all need to be thoroughly curated and made available.

The sequencing data have been deposited under the BioProject IDs PRJNA352403 and PRJNA352376. Other data supporting the paper are included as supplementary material.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 3—source data 1. Concatenated alignment of core genes in G. vaginalis.

    DOI: http://dx.doi.org/10.7554/eLife.20983.017

    DOI: 10.7554/eLife.20983.017
    Figure 3—source data 2. G. vaginalis core genome alignment trimmed with Gblocks.

    DOI: http://dx.doi.org/10.7554/eLife.20983.018

    DOI: 10.7554/eLife.20983.018
    Figure 3—source data 3. Maximum likelihood phylogenetic analysis of trimmed G. vaginalis alignment with RAxML.

    DOI: http://dx.doi.org/10.7554/eLife.20983.019

    DOI: 10.7554/eLife.20983.019
    Figure 3—source data 4. Recombinant fragments detected with BratNextGen in trimmed G. vaginalis alignment.

    DOI: http://dx.doi.org/10.7554/eLife.20983.020

    DOI: 10.7554/eLife.20983.020
    Figure 4—source data 1. S. saprophyticus whole genome alignment.

    DOI: http://dx.doi.org/10.7554/eLife.20983.026

    DOI: 10.7554/eLife.20983.026
    Figure 4—source data 2. S. saprophyticus whole genome alignment trimmed with trimal.

    DOI: http://dx.doi.org/10.7554/eLife.20983.027

    DOI: 10.7554/eLife.20983.027
    Figure 4—source data 3. Maximum likelihood phylogenetic analysis of trimmed S. saprophyticus alignment with RAxML.

    DOI: http://dx.doi.org/10.7554/eLife.20983.028

    DOI: 10.7554/eLife.20983.028
    Figure 4—source data 4. S. saprophyticus plasmid alignment trimmed with trimal.

    DOI: http://dx.doi.org/10.7554/eLife.20983.029

    elife-20983-fig4-data4.fasta (208.8KB, fasta)
    DOI: 10.7554/eLife.20983.029
    Figure 4—source data 5. Recombinant fragments detected with BratNextGen in S. saprophyticus alignment.

    DOI: http://dx.doi.org/10.7554/eLife.20983.030

    DOI: 10.7554/eLife.20983.030
    Supplementary file 1.

    (A) Troy sample details (B) SEM-EDS results from nodule. For each replicate, upper value is weight %, lower value is atomic %. (C) Common chemical constituents of renal and bladder calculi (kidney and bladder stones) and Troy nodules. + - presence, ND- not detected, Unk- unknown, RF- Relative Frequency in modern populations (C.Y.C Pak (ed.) Pak [1987], Martinus Nijhoff Publishing, Boston). (D) Relative frequency of admixed calculi (kidney and bladder stones) in modern (grey shading) populations and archaeological findings (modified from Pak, 1987). Acronyms correspond to Table C. References cited herein. *1 Main constituent of bladder stone listed as ammonium acid urate and oxalate. *2 Majority of calculus determined to be calcium carbonate (calcite). Where not specifically given, relative proportions of all mineral components were estimated. *3 Minor constituents of calcium carbonate. *4 No other (minor) elements were provided thus composition was assumed to be 100%. ND – not determined/detected. (E) Mitochondrial results summary; unique reads mapped to the revised Cambridge Reference Sequence with minimum length of 35 bp and minimum mapping quality of 30. (F) Sex identification of all libraries after Skoglund et al. (2013). Libraries were mapped to a hard masked version of hg38 and restricted to reads of minimum length 35 bp and minimum mapping quality of 30. (G) Summary of unique shotgun reads (from nodules, ulna, sediment and associated blanks) of minimum length 35 bp and minimum mapping quality of 30, mapping to chromosome X, Y, autosomes and mitochondrion of the hard masked hg38. Nodule pooled =Nod2-UDG + Nod1. (H) Staphylococcus saprophyticus modern strains. (I) Gardnerella vaginalis modern strains. * F = full, S = scaffold; ** BWH = Brigham and Women’s Hospital, HMP = Human Microbiome Project, MWH = Magee-Womens Hospital, VCU = VCU Women’s Health Clinic (J) Troy DNA extraction details. *Demin. = demineralization; digest = digestion buffer (K) HTS data sets. (L) Summary of unique shotgun reads (from nodules, ulna, sediment and associated blanks) of minimum length 35 bp and minimum mapping quality of 30, mapping to Staphylococcus saprophyticus and Gardnerella vaginalis. Nodule pooled =Nod2-UDG + Nod1_1h-nonU + Nod2-nonU + Nod1_1h-UDG). (M) Kinetic analysis of ancient PC1 β-lactamase (N) S. saprophyticus Clade P specific genes. Locus tags refer to annotation of ATCC 15305 available at NCBI. (O) Non-synonymous variants shared between S. saprophyticus Clade P and isolate 55. REF is reference allele in ATCC15305, Clade P and isolate 55. ALT is allele in the remaining Clade E isolates. Locus tags refer to annotation of ATCC 15305 available at NCBI.

    DOI: http://dx.doi.org/10.7554/eLife.20983.037

    elife-20983-supp1.xlsx (45.3KB, xlsx)
    DOI: 10.7554/eLife.20983.037

    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES