Abstract
Stool contains DNA shed from cells of the gastrointestinal (GI) tract and has great potential as a bio-specimen for non-invasive, nucleic acid-based detection of GI diseases. Whereas methods for studying faecal microbiome DNA are plentiful, there is a lack of well-characterised procedures for stabilisation, isolation, and quantitative analysis of faecal host DNA. We report an optimised pipeline for faecal host DNA analysis from the point-of-collection to droplet digital PCR (ddPCR) absolute quantification of host-specific gene targets. We evaluated multiple methods for preservation and isolation of host DNA from stool to identify the highest performing methods. To quantify host DNA even if present in partially degraded form, we developed sensitive, human-specific short-amplicon ddPCR assays targeting repetitive nuclear genomic elements (LINE-1) and mitochondrial genes. We validated the ability of these optimised methods to perform absolute quantification of host DNA in 200 stool DNA extracts from samples that were serially collected from three healthy individuals and three hospitalised patients. These specimens allowed assessment of host DNA day-to-day variability in stool specimens with widely varying physical characteristics (i.e., Bristol scores). We further extended this approach to mouse stool analysis, to enable faecal host DNA studies in animal disease models as well.
Introduction
Analysis of DNA in stool has attracted great interest, which has largely focused on the gut microbiota and its relationship to health and disease. Aside from microbes, stool also contains exfoliated cells from the lining of the gastrointestinal (GI) tract1. Given that both genetic2,3 and epigenetic4 changes in DNA of somatic cells underlie many diseases, stool DNA tests offer great opportunities for non-invasive sampling and study of the GI tract in health and disease, as shown by commercial success of Cologuard (Exact Sciences, Inc.), a stool tumour DNA-based test for early detection of colorectal cancer. However, unlike the microbiome field where methods for preservation, isolation, and quantitative analysis of stool microbial DNA are well-established, comparable well-characterised methods for the study of host DNA in stool are lacking in the public domain. Challenges to the successful analysis of host DNA in stool include the lack of:
Sample preservation at the point of collection for host DNA stabilisation: for the microbiome, immediate freezing of stool samples or storage in specific preservative solutions before DNA extraction significantly improves the stability of microbial community compositions compared to no preservation5. For human DNA preservation in stool, there have been a few studies that assessed preservation in EDTA-based buffers and commercial solutions6–8. However, it was unclear whether DNA stabilisation solutions reported earlier are effective in preserving a range of DNA fragment lengths, including short fragments of host DNA that may be derived from normal apoptotic colonocytes or neoplastic cells (i.e. ~100 bp)9.
High efficiency host DNA extraction: most commercial methods for stool DNA extraction are optimised for long microbial genomic DNA and not for host DNA, which also includes the shorter host DNA fragments expected from apoptotic epithelial cells shed into the stool. In addition, some of the earlier work has been done with proprietary commercial reagents that are not easily accessible for research use8. Thus, there is a need for detailed studies of the efficiency of various stool DNA isolation methods for recovering host DNA, in the public domain.
Assays for absolute quantification of host DNA targets: the low abundance of host DNA (typically < 1% total stool DNA10,11) and the presence of PCR inhibitors of dietary and metabolic origin12 can present challenges for high sensitivity absolute quantification of host DNA using traditional quantitative PCR (qPCR) methods.
To address these challenges, we studied three preservation solutions for human DNA stabilisation during stool collection and transportation, evaluated three commercially available DNA isolation kits for their ability to efficiently recover DNA without size bias, and developed sensitive nuclear and mitochondrial DNA element assays to quantify human DNA in stool using droplet digital PCR (ddPCR). ddPCR enables single DNA molecule detection by partitioning PCR reactions into many thousands of oil-capsulated nanolitre-sized droplets and performing PCR amplification in individual droplets. ddPCR is well-suited for host DNA quantification, as it is an absolute quantification technique that is more robust to PCR inhibitors than qPCR, and offers greater precision and improved day-to-day reproducibility than qPCR without requiring a standard curve13,14. Here we report an optimised pipeline using 0.5 M EDTA (pH 8) for stool preservation, specialised reagents for DNA extraction (Norgen Biotek Corp.), and ddPCR of LINE-1 and mitochondrial DNA targets to perform absolute quantification of host DNA in stool. We report data from not only healthy individuals, but also hospitalised patients (i.e., recipients of allogeneic hematopoietic cell transplantation (HCT)) who often experience GI disturbances (e.g., diarrhoea) that result in stools of a range of physical characteristics (i.e., Bristol scores). Finally, we developed and validated assays for host DNA analysis in stools from mice, to enable study of host DNA in stool samples from a commonly used animal model system.
Results
Design and Optimisation of ddPCR Assays for High Specificity Detection of Human DNA in Stool
Human DNA is present in relatively low abundance in stool10,11 and is expected to have varying lengths due to naturally occurring apoptosis of colonic epithelial cells15 and potential degradation by nucleases16,17 present in stool. Therefore, when selecting human gene targets and designing ddPCR primers for our assays, we considered: i) the gene targets should ideally exist as a large number of copies per cell for enhanced sensitivity; ii) the primers need to be highly specific for human DNA relative to microbial, plant, or animal genomes that may be present in stool; and iii) the amplicons need to be as short as possible to allow for efficient capture of even partially degraded DNA. Therefore, we focused on two types of targets that are present at multiple copies per cell: repetitive sequences in the nuclear genome and mitochondrial genes. Long interspersed nuclear elements (LINEs), including LINE-1 repeats, are transposable elements that comprise 17% of the human genome18. LINE-1 repeats in plasma have been used to quantify the human tumour xenograft load in mice19. We postulated that the extremely high copies per genome of LINE-1 elements would substantially increase the likelihood of detecting human DNA using these targets, even in a high background of microbial and dietary DNA. Mitochondrial (mt) DNA is another desirable target because mitochondrial DNA sequences can be species-specific, and could therefore provide a human-specific DNA target. In fact, mtDNA markers have been used to track faecal contamination in water environments20. Furthermore, there are tens to thousands of mitochondria per human cell, depending on the cell type21, which could enhance sensitivity. We therefore selected two LINE-1 sequences, 55-bp and 60-bp long, respectively, as well as two mitochondrial gene sequences, 83-bp (located in the NADH dehydrogenase subunit 5 gene, abbreviated as ND5) and 77-bp (located in the cytochrome c oxidase subunit II gene, abbreviated as CO2) long, respectively, as targets for initial ddPCR assay development.
We first determined optimal annealing/extension temperatures for the four primer sets (described in Materials and Sequences) using extracted human stool DNA as a template in ddPCR reactions. For the LINE-1 primer sets, we observed decreasing amplification as annealing/extension temperatures were increased from 53 to 61 °C (Fig. 1a). We selected 60 °C as a conservative temperature to minimize false positive signals. For the mt primers, on the other hand, absolute copy number (ACN) quantifications remained relatively constant between 53–61 °C (Fig. 1a). We decided to use the same annealing/extension temperature of 60 °C for ddPCR for both primer sets so these two assays can be run together on the same plate. At 60 °C, LINE-1 assays yielded 6–20 times higher ACN than mt assays, with the LINE-1 60-bp amplicon demonstrating the highest ACN of all (Fig. 1a), suggesting that LINE-1 assays may be more sensitive for low DNA input samples than the mt assays.
In order to assess specificity of the assays for human DNA, relative to other types of genomes that may be present in stool, we ran the four assays on 80 pg extracted human stool DNA as well as 80 pg of genomic DNA (gDNA) purified from a range of animal, plant, and bacterial sources. For each non-human genome, we determined a signal to noise (S/N) ratio calculated as ACN measured in human stool DNA divided by ACN measured from the non-human gDNA. As shown in Fig. 1b, for LINE-1 assays, aside from monkey gDNA (where cross-reactivity was expected given sequence conservation), the assays showed human-specificity in excess of 100-fold across most genomes, and even the lowest S/N ratio, seen with one of the assays on corn gDNA, showed in excess of 25-fold specificity for human gDNA. The mitochondrial DNA assays were even more human-specific, yielding S/N ratios of > 100 in all cases and > 1000 for some of the non-human genomes (Fig. 1b). Whereas the mtDNA assays show higher specificity, the LINE-1 assays may be more sensitive, because of the potentially higher number of copies per cell of LINE-1 DNA targets. Based on the results in Fig. 1b, we selected the 60-bp LINE-1 amplicon and 83-bp mt amplicon for subsequent assay development and characterisation.
ddPCR linearity, accuracy, and reproducibility
To test the linearity, accuracy, and reproducibility of the selected LINE-1 and mt assays, we used purified human gDNA and synthetic gBlock fragments as templates to generate triplicate standard curves for each assay on three different days. Since the manufacturer (Bio-Rad, Inc.) recommends DNA fragmentation of genomic DNA with restriction enzymes for ACN analysis, we digested purified human gDNA with the restriction enzyme HaeIII, which yields a theoretical average fragment size of 347 bp22.
Using synthetic gBlocks as templates, we found that our assays were linear in the range from 193 to 120,440 copies of input DNA target molecules, with a median % yield (i.e., (# of copies measured/# of copies expected)*100) of 50.6% for the LINE-1 assay and 44.1% for the mt assay (Fig. 2a,b and Supplementary Table S1). It is worth noting that the gBlock working solutions used for this analysis were stored as highly dilute solutions (10 pM) to reduce the risk of cross contamination in the lab. This may have contributed to reduced % yield, via DNA adsorption to the plastic surface of the storage tubes that can occur with dilute solutions. Using digested gDNA as templates, we found that our LINE-1 assay is linear in a range from 36 to 22,247 in silico predicted copies input, with a median % yield of 395% (Fig. 2c and Supplementary Table S1). Our experimental results indicate that the in silico predictions are an underestimate. The expected (i.e., theoretical) yield value for the LINE-1 PCR used was ~5,800 copies per genome, as determined using an in silico PCR simulation tool available on the UCSC Genome Browser23. However, with highly abundant repeat elements containing somewhat degenerate sequences throughout the genome, it is difficult to know the precise accuracy of in silico tools. Regardless, the data indicate that the assay detection limit lies well below the amount of DNA corresponding to a single cell (i.e., less than one genome equivalent (GE)). The standard curves demonstrated excellent linearity for both the LINE-1 primer set (Fig. 2a,c) and mt primer set (Fig. 2b).
To assess day-to-day reproducibility, we calculated percentage coefficients of variation (% CV) for the triplicate standard curves performed on three different days as shown in Fig. 2d–f. The % CVs were 5–18% when concentrated synthetic gBlocks were used as templates for LINE-1 (Fig. 2d) and mt assays (Fig. 2e), and < 3% at each of the five tested concentrations of human gDNA for the LINE-1 assay (Fig. 2f). We attribute the higher % CV seen with the gBlocks to the fact that these are prepared as high-concentration stock solutions requiring more serial dilution steps than the gDNA templates, which likely incurred greater pipetting variations.
Evaluating stool DNA purification methods for high efficiency recovery of host DNA
Maximum DNA recovery during DNA purification is critical for accurate DNA quantification. As it is highly likely that in vivo (i.e., inside intestines) as well as ex vivo (i.e., during specimen collection, transportation, and storage) degradation and fragmentation of host DNA might occur in stool, we searched for DNA isolation methods likely to recover short and long DNA fragments equally well. We tested three commercial stool DNA purification kits (the Zymo Quick-DNA faecal/Soil Microbe Miniprep Kit, the Norgen Stool DNA Isolation Kit, and the Qiagen QIAamp DNA Stool Mini Kit (human DNA protocol)) for recovery of a DNA ladder (Invitrogen 1 kb Plus), ranging from 100 to 15,000 bp. By loading DNA ladder either directly (D) onto the TapeStation, or after one of the three purification protocols (P) (Fig. 3a), we visualised DNA losses by looking at the differences in band intensity of the DNA ladder components before and after purification (Fig. 3b). We chose 10 μg and 2 μg as total input DNA amounts for purification because these are similar in range to the amount of stool DNA we expect to be present in the aliquots of stool specimens that we would typically analyze, taking into account historical stool DNA abundance data from prior work24.
At the 10 μg DNA input level, the Norgen reagents showed the most uniform and highest recovery of DNA fragments, Zymo was comparable in recovery across the DNA ladder range except it showed diminished recovery of the 100 bp fragment (see red arrow in Fig. 3b), and the Qiagen kit showed the lowest recovery of DNA across the entire size range. At the 2 μg DNA input level, the Norgen reagents showed the highest recovery (Fig. 3b). Based on these results, we selected the Norgen DNA isolation method for further analyses because of its high efficiency, uniform DNA recovery of the full range of sizes examined.
We then assessed DNA isolation efficiency of the Norgen kit using healthy donor human stools as a background and HaeIII-digested human gDNA as spike-ins. Human stools were lysed using Norgen reagents, centrifuged, and supernatants aliquoted in replicates. Serially diluted human gDNA or a buffer control was mixed with the lysate aliquots and carried through the rest of the isolation protocol per the manufacturer’s instructions. The LINE-1 assay was used to quantify the ACN of the 60-bp amplicon in each sample, with and without the spike-ins. The percentage of DNA spike-in recovery, R, was calculated as:
As shown in Fig. 4, in the background of total stool DNA, 800 ng, 80 ng, and 8 ng human gDNA spike-ins corresponding to 232,000 GE, 23,200 GE, and 2,320 GE resulted in an average of 57% ± 5%, 60% ± 11%, and 75% ± 18% recovery, respectively, through the DNA isolation process. These values give us confidence that the majority of human DNA in stool can be recovered for downstream analysis.
Selecting a host DNA preservative buffer for stool collection
Considering that stool transport after collection most conveniently occurs at room temperature, we sought to evaluate preservative solutions for stool host DNA stabilisation. We aimed to assess time-dependent DNA degradation of homogenised, buffer-preserved stool at room temperature to simulate a typical specimen transport temperature. Buffers selected for this study are: (i) a proprietary buffer OMNIgene (referred to as OMNI hereafter), which comes with the OMNIgene Gut Kit and has been optimised for microbial DNA25, (ii) a buffer referred to as TEN2 which contains Tris, EDTA, and NaCl, which represent core ingredients of a previously described stool DNA preservative solution26,27 and (iii) a simple solution of 0.5 M EDTA at pH 8.0 (referred to as EDTA hereafter) designed to inactivate DNases by chelating divalent cations.
We collected stools from two healthy individuals (D-159x and D-145x), who scooped freshly defecated stools into collection devices containing OMNI, TEN2, or EDTA solutions. Stool specimens were brought to the laboratory within an hour. Stools were subsequently weighed, the buffer volume adjusted (for TEN2 only, see Materials and Methods), homogenised, and then aliquoted into five portions. Each aliquot was frozen at −80 °C after incubation at room temperature (22 °C) for one of five different time durations (0, 4, 24, 72, and 96 hours). See Fig. 5a for a schematic diagram of the workflow. We then extracted faecal DNA from each of the time point samples using Norgen reagents and used ddPCR to measure LINE-1, mt, and bacterial DNA targets as per methods described above.
As seen in Fig. 5b (relative change in ACN per μl extract from baseline, plotted against storage time) and Supplementary Fig. S2 (ACN per μl extract plotted against storage time), there are differences in the stability of DNA in stools preserved in different buffers over time. In TEN2, ACN of both human and microbial DNA targets decreased over the course of four days storage at room temperature, indicating progressive DNA degradation. In EDTA, we found that the ACN of human genes tended to stay constant over time, and that of bacterial genes rose slightly over the four days, indicating modest growth of faecal bacteria. And in some samples preserved in OMNI, levels of host DNA decreased whereas those of bacterial DNA increased over the incubation period. Based on these results, we selected EDTA as the stool preservation solution for subsequent experiments.
Quantifying host DNA in serially collected stool specimens from healthy individuals
To demonstrate the feasibility of using the optimised procedures developed here for assaying human DNA in stool, we applied them to analyze stool specimens in duplicate from three healthy control individuals (D-145x, D-165x, and D-166x), collected on multiple days (31 samples and 62 DNA isolations in total). These individuals had no known GI disorders. Using our DNA isolation protocol, the median stool input per ~200 μl stool homogenate for DNA extractions was 49.5 mg (Fig. 6a), and the median recovered total DNA per extraction was 707 ng (Fig. 6b). Figure 6c,d show the ACN per μl extract and normalized as per mg stool, respectively, of human-specific LINE-1 targets recovered from these samples. Figure 6e,f show the ACN per μl extract and normalized as per mg stool, respectively, of human-specific mtDNA targets recovered from these samples. Both LINE-1 and mtDNA targets were detected in all samples, with measured mtDNA levels being 9-fold lower than LINE-1, on average. Longitudinally, we observed a several-fold intra-individual day-to-day variation in human DNA levels in stool in the healthy individuals. This was not affected by normalisation to stool mass (i.e., ACN per mg stool input) (Fig. 6c–f), indicating that the observed day-to-day variation represents true biological variation.
Based on our success in detecting host DNA in human stool, we next applied the approach to mouse stool DNA quantification. Two sets of primers targeting LINE-1 elements in the mouse genome (mLINE-1) were designed. These mLINE-1 primers target 58-bp and 62-bp amplicons, respectively. We used 0.2 pg HaeIII-digested purified mouse gDNA as templates in a gradient PCR experiment to determine the optimal annealing and extension temperatures for the primers. Similar to the human LINE-1 primers, the mLINE-1 primers showed inverse relationships between copy number yields and annealing and extension temperatures (Supplementary Fig. S3a). To minimise non-specific amplification and ensure proper primer-template annealing, we chose 59 °C for annealing and extension during ddPCR, a temperature at which both primer sets exhibit high copy number yields (~10,000 copies/GE for the 58-bp amplicon and ~16,300/GE for the 62-bp amplicon).
To test the specificity of the two mLINE-1 primer sets, we included DNA isolated from mouse chow in addition to the genomes we tested for the human primer sets. Signal to noise ratios for each non-mouse DNA source were calculated as ACN per pg DNA from the mouse stool sample, divided by ACN per pg DNA from each of the non-mouse genomic DNA. As shown in Supplementary Fig. S3b, both primer sets exhibit signal to noise ratios of > 1000 against all non-mouse DNA. Therefore, both mLINE-1 assays are highly specific to mice.
Stool pellets were collected from three healthy BALB/c mice twice a week for five weeks. We collected fresh pellets directly from the anus of the mice into tubes containing EDTA. Using the primers targeting the 58-bp mLINE-1, we plotted the un-normalised stool mLINE-1 ACN for each mouse (Fig. 6g, left y axes) as well as ACN normalised to mg of input stools (Fig. 6g, right y axes) and observed a several-fold day-to-day variation in host DNA levels, similar to the variations seen in healthy human LINE-1 measurements (Fig. 6c,d).
Quantifying host DNA in stool specimens from hospitalised patients
In contrast to stool from healthy individuals, stool from hospitalised patients experiencing acute and chronic GI tissue damage or inflammation can vary greatly with respect to physical consistency (e.g., ranging from firm and dense to entirely liquid), the presence of potential PCR inhibitors28, and microbiome mass11 and composition29. All of these factors could collectively alter host DNA content and accessibility for analysis in the stool. To determine whether our optimised methods could be used to measure host DNA levels in stool from hospitalised patients, we used our methods to quantify human DNA in three allogeneic hematopoietic cell transplantation (HCT) patients (P07, P12, and P13) from duplicate DNA isolations at multiple times during the first 100 days after transplant (69 samples and 138 DNA isolations total). These patients represent a sick population since they all received radiation and/or chemotherapy before transplantation, which generally cause organ and tissue damage, especially in the GI tract. Two of the three HCT patients (P07 and P12) also developed graft-versus-host disease (GVHD) in the GI tract and experienced moderate to severe diarrhoea during periods of hospitalisation. Thus collectively across the stool specimens, we were able to cover almost the entire range of the Bristol Stool Scale (types 2–7) with respect to stool consistency.
The median recovery of total DNA (i.e., including both microbial and host DNA) per ~200 μl stool homogenate input was 20 ng with a wide distribution (Fig. 7a). P07 and P12, the subjects who developed GI GVHD, generally showed lower values for total stool DNA than P13, who did not develop GI GVHD (Fig. 7a). We were able to detect both LINE-1 and mt DNA in all stool samples from all three patients. Both the LINE-1 and mtDNA levels in the HCT patients (Fig. 7b,c) showed wider day-to-day variation compared to the healthy individuals (Fig. 6c,f).
For patient P07, for whom Bristol scores were available for every stool, we compared Bristol score to total DNA yield (Supplementary Fig. S4a) as well as to ddPCR-assayed host (Supplementary Fig. S4b) and 16S (Supplementary Fig. S4a) bacterial DNA. The amount of total extracted DNA showed a sustained drop at Day 17 post HCT and coincided with an increase of the Bristol score to 7 (“entirely liquid”). This corresponded to a sharp sustained drop in 16S bacterial DNA as well. In contrast, host DNA levels were maintained throughout the time course, indicating that host DNA can be recovered from liquid stools at comparable levels as seen from low Bristol score stools (Supplementary Fig. S4b).
Discussion
The gut microbiome is implicated in the maintenance of proper health30 as well as in nearly every major disease31 from obesity, intestinal disorders, to Parkinson’s32,33. The host, as a symbiotic partner, interacts intimately with and shapes the microbiota through multiple mechanisms34,35. Yet the lack of well-characterised methods for accessing and analyzing faecal host DNA has limited study of host intestinal effects on the microbiome. It has also limited progress in using stool-based approaches for the study of GI cancer and other diseases affecting the intestinal tract, for example by analysis of methylated DNA that may be recovered in stool.
Here we have developed a pipeline of methods to collect and isolate DNA in stool, and quantify host DNA within stool samples using ddPCR. For sample collection, we identified 0.5 M EDTA (pH 8) for use as a host DNA preservative solution for stool samples, which can stabilize DNA in stool for at least 4 days at room temperature. Since EDTA is nontoxic, readily available and relatively inexpensive, it offers an economical solution for stool DNA preservation at the point of collection, until DNA isolation can be carried out. It is worth noting that our DNA stability analyses were carried out using stool that had been homogenized within an hour of collection, in order to produce material that could be uniformly sampled over multiple time points. In real-world practice, we expect that stools would be collected in EDTA without prompt homogenization. Thus a limitation of our study is that we do not know whether such delays in homogenization would impact the DNA stabilisation effect of EDTA. Additionally, we found glass beads facilitated homogenisation of stool in a relative large volume of solution (i.e. 40 ml) and therefore recommend having them in the stool collectors.
For DNA isolation, we determined that Norgen Stool DNA isolation reagents provided the highest efficiency, non-size-biased recovery of DNA among the methods we evaluated. For host DNA quantification, we developed four ddPCR assays for quantification of host nuclear and mitochondrial genes in human and mouse stools. The choice of ddPCR as an analytic approach has advantages over real time PCR in this setting. These include obtaining absolute quantification without a standard curve, higher precision13, and less sensitivity to PCR inhibitors36, which may be present in stool and co-purify with stool DNA12. In addition, we chose targets that are present in high copy numbers per cell, and validated low cross-reactivity against other genomes that may be expected in stool. As a result, we achieved high sensitivity (lower detection limit well below a single human nuclear genome), reproducibility, linearity, and specificity with our assays. Ideally, DNA samples should be fragmented into shorter pieces for high CN target analysis (e.g. LINE-1 elements) using ddPCR to avoid target overcrowding in the droplets. However due to low DNA concentration in our patient specimens, we did not perform DNA fragmentation, as incorporating fragmentation may lead to sample loss and/or dilution. Therefore we expect the detection limit to be even lower for the LINE-1 assay for samples that have higher DNA concentrations and are hence suitable for pre-ddPCR DNA fragmentation. When reporting faecal host DNA levels, we found that normalisation of ACN to stool input (wet weight) did not visibly alter the longitudinal trends within an individual, regardless of the individual’s physiological status (healthy vs. hospitalised) and stool consistency (Bristol scores 2 through 7). We infer that this result indicates that the biological variability is much greater than the variability introduced by not normalising to stool weight. However stool wet weight has the limitation that it can be confounded by variations in water content. In future studies, it would be worthwhile to assess whether normalisation to stool dry weight (which was not available for our specimens) could better account for variations in stool input, especially for watery stools that may contain very little organic material.
Using our optimised pipeline, we further demonstrated the feasibility of quantifying host DNA in stools of a wide range of physical properties (as measured by Bristol Stool Scale) from not only healthy populations but also hospitalised HCT patients who commonly have GI tract pathology. In the 31 serially collected stool specimens from three healthy donors, the LINE-1 elements ranged from 5,600,000 to 184,000,000 copies per ~200 μl stool homogenate and 52,000 to 4,050,000 copies per mg of stool collected. Based on the empirically measured number of LINE-1 elements per haploid genome in a commercially available reference human genomic DNA sample, we were able to estimate that these values correspond to approximately 2 to 176 cells per mg of stool collected from the healthy donors. We consider these to be only estimates because the number of copies of LINE-1 per genome is polymorphic in the human population. Thus, there could be differences in the number of LINE-1 copies per genome in the individuals whose stool samples we studied, as compared to the reference human genomic DNA sample. In the healthy individuals studied, the mitochondrial gene ND5 ranged from 1,140,000 to 7,050,000 copies per ~200 μl stool homogenate, corresponding to 9,300 to 182,000 copies per mg of stool collected. In the 69 serially collected stool specimens from the three HCT patients, the LINE-1 elements ranged from 130,000 to 150,600,000 copies (corresponding to approximately 5 to 6544 cells) per ~200 μl stool homogenate. For HCT patients, we could not estimate the number of cells/mg stool because stool weights were not available. The mitochondrial gene ND5 ranged from 39,400 to 21,610,000 copies per ~200 μl stool homogenate in the HCT patient specimens.
Even though many of the stool specimens from HCT patients were watery and had low bacterial counts (see Supplementary Fig. S4a for patient P07 as an example), we were able to detect significant amounts of human DNA. We observed day-to-day variations in faecal human DNA in all six human subjects, which were several-fold in healthy controls and up to ~1000-fold in HCT patients. Although technical variation in DNA recovery efficiency does occur among replicate DNA isolations (Fig. 4), this is small relative to the observed day-to-day variations. However in future studies, variation in DNA recovery efficiency could be measured and corrected for using synthetic DNA of artificial sequence spiked into the samples prior to DNA extraction37. Taken together, our data suggest that the majority of observed day-to-day variation is likely biological in origin, perhaps related to variation in diet, GI tract environment, GI inflammation or other factors. We speculate that the greater variations seen in the HCT patients may be a manifestation of GI tract damage that is common amongst HCT patients during their treatment course.
Beyond stool, our assays provide a quantitative and reproducible tool that may be applied to quantifying host DNA sequences in other types of human specimens, which may be relevant to a wide range of diseases. Mitochondria are responsible for ATP (energy) generation, reactive oxygen species generation and detoxification, and other essential functions in the cell38. Mitochondrial copy number alterations have been linked to autism39,40 and multiple types of cancer including breast cancer41,42, bladder cancer42, kidney cancer42, and colon cancer43,44. Thus for example, our mitochondrial DNA quantification assays could be adapted to measure such mt copy number alterations in biofluids (e.g., urine, blood, cyst fluid, and cerebrospinal fluid) in cancer patients as a biomarker approach. In addition, our methods could be adapted to other DNA targets, such as specific mutated or methylated DNA sequences in the nuclear genome, for example, which may be associated with cancer and other diseases of the GI tract.
Lastly, as proof-of-concept, we validated that the same approaches for collecting and quantifying human DNA from stool are also effective for quantifying mouse DNA from murine stool. We detected between 253 and 8655 copies of mouse LINE-1 elements in stools from three healthy mice, further demonstrating the functionality of the pipeline. We envision this will facilitate disease studies in a broad range of mouse models of GI diseases. In our experiments, one to three pellets of healthy mouse stools per sample yielded DNA extracts that had to be diluted over 3000-fold for our ddPCR assay. This high sensitivity indicates that mouse DNA will be quantifiable even in sick animals which might produce liquid stools, for example. Taken together, the pipeline of methods for host DNA preservation and detection in stools we have described provides a convenient and highly sensitive tool for quantifying host cell DNA in the GI tract and can be applied broadly in studies of GI tract physiology and disease monitoring.
Methods
Stool collection kits and stool preservatives
Two types of stool collection devices were used for human stool collection in this study. One consists of PrecisionTM Stool Collectors (Covidien, Dublin, Ireland) that were each preloaded with one of two stabilisation buffers: 40 ml of “TEN2” or 50 ml of “EDTA”. TEN2 was made in-house and is composed of 500 mM Tris, 16 mM EDTA and 10 mM NaCl at pH 9.0. EDTA (0.5 M; pH 8.0) was purchased and used without dilution. For specimen collection in the PrecisionTM Stool Collectors, the donors/patients/caregivers were instructed to fill the provided shovel with stool and close the lid with the shovel attached to the lid of the container. The other stool collection device is the OMNIgene Gut kit (DNA Genotek, Kanata, ON, Canada), which has a tube that contains ~2 ml of a proprietary buffer and a large stainless steel bead. The stool collection using the OMNIgene kit was conducted per the manufacturer’s instructions.
Informed consent and institutional review board (IRB) supervision of human subjects research
All human subjects in the study (healthy controls as well as patients) provided informed consent and the study was approved by the University of Michigan IRB and carried out in accordance with the relevant guidelines and regulations.
Healthy stool collection and processing for DNA stability testing at ambient temperature
Stools from healthy individuals were collected in a hat that sits on the toilet seat. Stool was scooped into an OMNIgene kit and two PrecisionTM Stool Collectors that we preloaded with TEN2 or EDTA as well as six, 6 mm solid-glass beads. The specimens were then delivered to the laboratory within an hour of bowel movements.
Stool kits were weighed before and after stool collection and stool weight was recorded. Stools were homogenised into slurries as described below and divided into five aliquots. Aliquots were kept at room temperature for a defined amount of time after homogenisation (0, 4, 24, 72, and 96 hours) before freezing at −80 °C.
Homogenisation for OMNIgene preserved stool
Collection tubes were vortexed vigorously on a Fisherbrand™ Analog Vortex Mixer (Catalog No. 02–215–365) at setting 9 for 30 seconds.
Homogenisation for TEN2 preserved stool
Stool weight to buffer volume ratios were adjusted to 1:4 by adding additional TEN2 buffer if stool weight was > 10 g. Buffer was not added if the stool weight was < 10 g. Stools were homogenised in PrecisionTM Stool Collectors by vortexing on a Fisherbrand™ Analog Vortex Mixer (Catalog No. 02-215-365) on setting 9 until the sample appeared homogenous to visual inspection.
Homogenisation of EDTA preserved stool
Stool weight to buffer volume ratios were unadjusted. Stools were homogenised in the PrecisionTM Stool Collectors by vortexing on a Fisherbrand™ Analog Vortex Mixer (Catalog No. 02-215-365) on setting 9 until the sample appeared homogenous to visual inspection.
Healthy stool collection and processing for longitudinal human DNA quantification
The stool was scooped into a PrecisionTM Stool Collector containing EDTA and six, 6 mm solid-glass beads. Participants were instructed to provide stool samples three times a week. The specimens were then delivered to the laboratory within one hour of bowel movement.
All stool collection kits were weighed before and after stool collection to obtain stool weight. Stools were homogenised into slurries as described below and stored at −80 °C until further analysis.
Homogenisation for EDTA preserved stool
Stool weight to buffer volume ratios were unadjusted. Stools were homogenised in the PrecisionTM Stool Collector by vortexing on high until the sample appeared homogenous to visual inspection.
Clinical Stool Collection and Processing for Longitudinal Human DNA Quantification
Stools from allogeneic hematopoietic cell transplantation (HCT) patients were collected at multiple time points during the patient’s first 100 days post-transplant, both during hospitalisation by nurses and at home following discharge, in a hat that sits on the toilet seat. Caregivers/patients were instructed to collect bowel movements with a maximum of two per day and to collect a sample of ‘native’ stool for Bristol scoring, as well as to scoop stool into PrecisionTM Stool Collectors preloaded with 50 ml EDTA without glass beads for subsequent DNA analysis. The specimens were then delivered to the laboratory within 48 hours of bowel movement, at which time the native sample was assessed for Bristol score and the EDTA-stabilised sample was further processed as follows: Stool weight to buffer volume ratios were not adjusted. Stools were homogenised in the PrecisionTM Stool Collectors by vortexing on a Fisherbrand™ Analog Vortex Mixer (Catalog No. 02-215-365) on setting 9 until the slurry appeared homogenous to visual inspection. The stool slurries were aliquoted and frozen at −80 °C.
DNA ladder extraction and tapestation analysis
Invitrogen’s 1 kb Plus DNA ladder (100 bp–15,000 bp) was diluted in TET buffer (10 mM Tris-HCl, 0.1 mM EDTA, 0.05% Tween 20) to concentrations of 50 and 10 ng/μl. A total of 200 μl of each ladder concentration (corresponding to 10 and 2 μg total DNA input) was purified using the Qiagen QIAamp DNA Stool Mini Kit (human DNA protocol), the Zymo Quick-DNA faecal/Soil Microbe Miniprep Kit, or the Norgen Stool DNA Isolation Kit and the ladder eluted in 200 μl elution buffer. One μl of each of the before-purification DNA ladder (corresponding to 50 or 10 ng) and 1 μl of after-purification DNA ladders were loaded onto a High Sensitivity D1000 ScreenTape side by side and analyzed on an Agilent 2200 TapeStation (Santa Clara, CA), such that if DNA recovery were 100%, the same amount of DNA would have been loaded for both the before- and the after-purification samples.
Oligonucleotide Primers for ddPCR
Primers for the ddPCR analyses of human, mouse, and bacterial DNA targets are provided below in Table 1. Primers for the bacterial 16S ribosomal RNA genes were taken from a published study by Suzuki et al.45. Primers for human and mouse targets were designed for this study using Primer346. Primers were purchased as standard desalted oligonucleotides from IDT (Coralville, IA).
Table 1.
Human nuclear targets | ||
55-bp LINE-1 amplicon | 60-bp LINE-1 amplicon | |
Forward primer | 5′-CTCCACCCCAAATCAACAGAAT-3′ | 5′-AAGACAGTGTGGCGATTCCT-3′ |
Reverse primer | 5′-AATAGGTGTGGTGTGGTGCT-3′ | 5′-GATGGCTGGGTCAAATGGTAT-3′ |
Human mitochondrial targets | ||
83-bp ND5 amplicon | 77-bp CO2 amplicon | |
Forward primer | 5′-AAAACCTGCCCCTACTCCTC-3’ | 5′-CATCCTAGTCCTCATCGCCC-3′ |
Reverse primer | 5′-GGTGGAGATTTGGTGCTGTG-3′ | 5′-TGGTAAGGGAGGGATCGTTG-3′ |
Mouse nuclear targets | ||
58-bp LINE-1 amplicon | 62-bp LINE-1 amplicon | |
Forward primer | 5′-AGGCAACGCTGGAGATAGAA-3′ | 5′-GGAGCTAAAGGGAACTGCAA-3′ |
Reverse primer | 5′-ATGCTCGCATCTATGGTTCC-3’ | 5′-CCGGGGTACTGCTTAGTTCA-3’ |
Bacterial targets | ||
173-bp 16S amplicon | ||
Forward primer | Bact1369F: 5′-CGGTGAATACGTTCYCGG-3′ | |
Reverse primer | Prok1541R: 5′-AAGGAGGTGATCCRGCCGCA-3′ |
Purified Genomic DNA and Synthetic DNA Fragments
For ddPCR assay development and validation, we used both natural and synthetic templates. The natural templates include purified human (Promega), rhesus monkey (BioChain), mouse (BioChain), corn (BioChain), potato (BioChain), rice (BioChain), bovine (BioChain), E. coli (Affymetrix), chicken (BioChain), and wheat (BioChain) genomic DNA (gDNA). Purified human gDNA was digested with the restriction enzyme HaeIII (NEB, Ipswich, Massachusetts) per the manufacturer’s instructions to yield an average of 347-bp DNA fragments22. Digestion reactions were frozen at −20 °C until further use without heat deactivation.
The following synthetic gBlock DNA fragments were used for testing linearity, accuracy, and reproducibility for each ddPCR assay:
A 126-bp long gBlock (IDT, Coralville, IA) containing the 60-bp LINE-1 amplicon with two modified nucleotides (denoted with underlining) to yield an internal EcoRI site (primer binding sequences are shown in bold):
5′-ACTGTGAACTAGTTCAACCATTGTGGAAGACAGTGTGGCGATTCCTGAGGGATCTAGAATTCGAAATACCATTTGACCCAGCCATCCCATTACTGGGTATATACCCAAAGGATTATAAATCATGC-3′
A 196-bp gBlock (IDT, Coralville, IA) containing the 83-bp ND5 amplicon with two modified nucleotides (denoted with underlining) to yield an internal EcoRI site (primer binding sequences are shown in bold):
5′-TCTAGGCCTTCTTACGAGCCAAAACCTGCCCCTACTCCTCCTAGACCTAACCTGACTAGAAAAGCTATTACCTAAAAGAATTCCACAGCACCAAATCTCCACCTCCATCATCACCTCAACCCAAAAAGGCATAATTAAACTTTACTTCCTCTCTTTCTTCTTCCCACTCATCCTAACCCTACTCCTAATCACATAA-3′
Both gDNA and synthetic gBlocks were diluted serially in TET buffer for ddPCR.
Stool DNA extraction for human specimens
Total stool DNA was extracted in duplicate from ~200 μl of stool homogenate from each sample using the Stool DNA Isolation Kit by Norgen Biotek (Thorold, Ontario, Canada), Qiagen QIAamp DNA Stool Mini Kit (human DNA protocol), or the Zymo Quick-DNA faecal/Soil Microbe Miniprep Kit per the manufacturer’s instructions. The duplicate purifications for each sample were processed at the same time. Stool DNA concentrations were measured using the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific).
DNA recovery assessment of purified human genomic DNA using the norgen stool DNA isolation kit
HaeIII-digested human gDNA was spiked into cleared stool lysates, which were generated in the first phase of the Norgen Stool DNA Isolation protocol, following the initial centrifugation. The protocol was modified slightly for this experiment, in that multiple cleared stool lysates were first pooled. Then, different amounts (40 μl of 800, 80, 8, or 0 ng) of HaeIII-digested gDNA were spiked into 560 μl aliquots of the pooled lysates. The DNA isolation procedure was completed and subsequently ddPCR for LINE-1 was carried out. The measured ACN for LINE-1 in this scenario comprises both that derived from the spiked-in gDNA, as well as LINE-1 present endogenously in the stool sample. In order to estimate recovery specifically of the spiked-in gDNA, we also measured ACN of LINE-1 present in aliquots of the same stool lysates without gDNA spike-in. This value was then subtracted from the measured ACN in the “with gDNA spike-in” sample to yield the ACN of the spiked-in gDNA. We then calculated % recovery efficiency as: [(measured ACN for spiked-in gDNA)/(expected ACN for spiked-in gDNA)] × 100%. The expected ACN for spiked-in gDNA was determined by extrapolation, based on analyzing serial dilutions of HaeIII-digested human gDNA directly using ddPCR.
Droplet digital PCR for human stool samples
In a Bio-Rad QX200 droplet generator, 20 μl ddPCR reactions containing 10 μl of 2 × QX200™ ddPCR™ EvaGreen® Supermix (Bio-Rad), 1 μl of 5–500 fold TET buffer-diluted stool DNA or 1 μl of TET buffer (for no-template controls, ntc), 0.2 μl of 10 μM forward primer, and 0.2 μl of 10 μM reverse primer were partitioned into ~20,000 oil-emulsified droplets. One technical replicate for each of the TET buffer-diluted duplicate DNA extracts and four technical replicates of ntc were run on ddPCR. The droplets were transferred into 96-well plates and PCR performed using 10 minutes at 95 °C, 40 cycles of 30 seconds at 95 °C followed by 60 seconds at 60 °C, then 5 minutes at 4 °C, 5 minutes at 95 °C, and finally an infinite hold at 4 °C. The temperature ramp increment was 2 °C/second for all steps. Plates were subsequently read on a Bio-Rad QX200 droplet reader.
Gradient PCR was carried out using the same conditions as described above except for using a combined annealing/extension step with no specified temperature ramp increments between steps.
Mouse strain and husbandry
BALB/c (H-2d) mice were purchased from Charles River Laboratories (Wilmington, MA). Animal care followed protocols reviewed and approved by the Institutional Animal Care & Use Committee of the University of Michigan, based on the University Laboratory Animal Medicine guidelines. Mice were fed PicoLab 5L0D rodent diet (LabDiet), which is referred to as mouse chow hereafter. DNA from mouse chow was obtained by breaking chow into small pieces with a razor blade and carrying it through the Norgen Stool DNA Isolation kit protocol per manufacturer’s instructions.
Stool collection and processing for longitudinal mouse DNA quantification
Mouse stools were collected twice a week for 32 days. Two hundred μl of 0.5 M EDTA at pH 8.0 was added to each bead tube (part of the Norgen Stool DNA Isolation Kit) and weighed. Each time, one to three pellets of mouse stool was collected directly from the anus of each mouse into the bead tubes containing EDTA, and the tubes weighed again. Stool weight from each collection was calculated for subsequent data normalisation. Within an hour of stool collections, the bead tubes were vortexed to aid in homogenisation and stored at −80 °C until DNA isolation was performed using the Norgen Stool DNA Isolation Kit.
Stool DNA extraction for mouse specimens
Total stool DNA was extracted once from ~200 μl of stool homogenate from each sample using the Stool DNA Isolation Kit by Norgen Biotek (Thorold, Ontario, Canada) per the manufacturer’s instructions.
Droplet digital PCR for mouse stool samples
PCR reactions and droplets were prepared as described for human samples. One μl of 3,600-fold TET buffer-diluted mouse stool DNA and 1 μl of TET buffer were used as templates and NTCs (no template controls), respectively, in the PCR reactions. PCR was carried out as follows: 10 minutes at 95 °C, 40 cycles of 30 seconds at 95 °C followed by 60 seconds at 59 °C, then 5 minutes at 4 °C, 5 minutes at 95 °C, and finally an infinite hold at 4 °C. The temperature ramp increment was 2 °C/second for all steps. Plates were subsequently read on a Bio-Rad QX200 droplet reader.
Droplet digital PCR data analysis
QuantaSoft (Version 1.7.4.0917) was used for raw data processing. All samples included in analysis had a minimum of 10,000 accepted droplets during the ddPCR reading. The thresholds for positive and negative droplets were set as follows: 3400 for the four human target assays (the 55-bp and 60-bp human LINE-1 assays, and the 77-bp and 83-bp human mt assays), 6300 for the 172-bp bacterial 16S assay, and 3000 for the two mouse target assays (the 62-bp and 58-bp mouse LINE-1 assays). These thresholds, although chosen arbitrarily, were set using droplet fluorescence values from NTC samples to define what would be considered negative droplets. The copy numbers per reaction of template-containing samples were averaged between the two extraction-duplicates to yield ACNsample. The copy numbers per reaction of ntc were averaged among the four technical replicates to yield ACNntc. Then copy numbers of the template-containing samples were adjusted for background and dilution using the equation: (ACNsample – ACNntc)*dilution-factor, which give the copies per μl DNA extract. To normalise to stool input, copies per μl DNA extract were divided by the mass of stool that was contained in the ~200 μl stool homogenate being added to each extraction.
Supplementary information
Acknowledgements
We thank Elena Stoffel for assistance with obtaining some of the stool specimens from healthy controls; Missy Tuck, Kirk Herman, and Annika Goicochea for coordinating specimen collections; David Hyland, Ryan Lindstrom, Tyler Suciu, Joshua Weber-Townsend, Ricardo Engel, and Ruta Raulickis for assistance with specimen processing; and Nicholas W. Lukacs and Judith Luborsky for critical comments and assistance in editing the manuscript. K.H. was supported by a Clinical and Translational Science Award (CTSA) from the National Institutes of Health. H.F. was supported by JSPS Postdoctoral Fellowships for Research Abroad and The YASUDA Medical Foundation Grants for Research Abroad. M.T. and S.W.C. acknowledge support from an A. Alfred Taubman Medical Research Institute Grand Challenge Award. P.R. acknowledges support from National Institutes of Health grants CA203542; CA217156; HL128046. M.T. acknowledges support from Mary Petrovich through the University of Michigan Fast Forward Gastrointestinal (GI) Innovation Fund and from the University of Michigan Comprehensive Cancer Center Support Grant.
Author Contributions
K.H., P.R., S.W.C. and M.T. conceived the study and interpreted results. K.H., E.S. and M.T. designed the experiments. K.H., H.F. and C.Z. performed the experiments. K.H. and M.T. analysed the results. K.H., E.S. and M.T. wrote the manuscript. All authors reviewed the manuscript.
Data Availability
The data generated or analysed during this study are included in this published article in the Figures and Supplementary information files.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-41753-6.
References
- 1.Tortora, G. J. & Anagnostakos, N. P. The Digestive System. In Principles of Anatomy and Physiology (Fifth ed.) 624 (Harper & Row 1987).
- 2.Taylor RW, Turnbull DM. Mitochondrial DNA mutations in human disease. Nat. Rev. Genet. 2005;6:389. doi: 10.1038/nrg1606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Burgess DJ. Human genetics: somatic mutations linked to future disease risk. Nat. Rev. Genet. 2015;16:69. doi: 10.1038/nrg3889. [DOI] [PubMed] [Google Scholar]
- 4.Zoghbi HY, Beaudet AL. Epigenetics and Human Disease. Cold Spring Harb. Perspect. Biol. 2016;8:a019497. doi: 10.1101/cshperspect.a019497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Song, S. J. et al. Preservation Methods Differ in Fecal Microbiome Stability, Affecting Suitability for Field Studies. mSystems1, (2016). [DOI] [PMC free article] [PubMed]
- 6.Nechvatal JM, et al. Fecal collection, ambient preservation, and DNA extraction for PCR amplification of bacterial and human markers from human feces. J. Microbiol. Methods. 2008;72:124–132. doi: 10.1016/j.mimet.2007.11.007. [DOI] [PubMed] [Google Scholar]
- 7.Zou H, Harrington JJ, Klatt KK, Ahlquist DA. A sensitive method to quantify human long DNA in stool: relevance to colorectal cancer screening. Cancer Epidemiol. Biomarkers Prev. 2006;15:1115–1119. doi: 10.1158/1055-9965.EPI-05-0992. [DOI] [PubMed] [Google Scholar]
- 8.Olson J, Whitney DH, Durkee K, Shuber AP. DNA stabilization is critical for maximizing performance of fecal DNA-based colorectal cancer tests. Diagn. Mol. Pathol. 2005;14:183–191. doi: 10.1097/01.pas.0000176768.18423.7e. [DOI] [PubMed] [Google Scholar]
- 9.De Maio G, Rengucci C, Zoli W, Calistri D. Circulating and stool nucleic acid analysis for colorectal cancer diagnosis. World J. Gastroenterol. 2014;20:957–967. doi: 10.3748/wjg.v20.i4.957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Human Microbiome Project Consortium. A framework for human microbiome research. Nature486, 215–221 (2012). [DOI] [PMC free article] [PubMed]
- 11.Vincent C, Mehrotra S, Loo VG, Dewar K, Manges AR. Excretion of Host DNA in Feces Is Associated with Risk of Clostridium difficile Infection. J Immunol Res. 2015;2015:246203. doi: 10.1155/2015/246203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Schrader C, Schielke A, Ellerbroek L, Johne R. PCR inhibitors - occurrence, properties and removal. J. Appl. Microbiol. 2012;113:1014–1026. doi: 10.1111/j.1365-2672.2012.05384.x. [DOI] [PubMed] [Google Scholar]
- 13.Hindson CM, et al. Absolute quantification by droplet digital PCR versus analog real-time PCR. Nat. Methods. 2013;10:1003–1005. doi: 10.1038/nmeth.2633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hall Sedlak R, Jerome KR. The potential advantages of digital PCR for clinical virology diagnostics. Expert Rev. Mol. Diagn. 2014;14:501–507. doi: 10.1586/14737159.2014.910456. [DOI] [PubMed] [Google Scholar]
- 15.Delgado ME, Grabinger T, Brunner T. Cell death at the intestinal epithelial front line. FEBS J. 2016;283:2701–2719. doi: 10.1111/febs.13575. [DOI] [PubMed] [Google Scholar]
- 16.Arends MJ, Morris RG, Wyllie AH. Apoptosis. The role of the endonuclease. Am. J. Pathol. 1990;136:593–608. [PMC free article] [PubMed] [Google Scholar]
- 17.Peitsch MC, Mannherz HG, Tschopp J. The apoptosis endonucleases: cleaning up after cell death? Trends Cell Biol. 1994;4:37–41. doi: 10.1016/0962-8924(94)90002-7. [DOI] [PubMed] [Google Scholar]
- 18.Lander ES, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
- 19.Rago C, et al. Serial assessment of human tumor burdens in mice by the analysis of circulating DNA. Cancer Res. 2007;67:9364–9370. doi: 10.1158/0008-5472.CAN-07-0605. [DOI] [PubMed] [Google Scholar]
- 20.He X, Chen H, Shi W, Cui Y, Zhang X-X. Persistence of mitochondrial DNA markers as fecal indicators in water environments. Sci. Total Environ. 2015;533:383–390. doi: 10.1016/j.scitotenv.2015.06.119. [DOI] [PubMed] [Google Scholar]
- 21.Chandel NS. Evolution of Mitochondria as Signaling Organelles. Cell Metab. 2015;22:204–206. doi: 10.1016/j.cmet.2015.05.013. [DOI] [PubMed] [Google Scholar]
- 22.Posfai, J. Fragment length distributions after single enzyme digests of the human genome. (2015). Available at: http://tools.neb.com/~posfai/TheoFrag/sngldigest.fragmentlegth.percentages.txt. (Accessed: 11th July 2018)
- 23.Kent WJ, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sonke, E., -S. Kim, W. & Haj-Ahmad, Y. Comparison of Stool DNA Isolation Methods for Bacterial and Mammalian DNA Detection.
- 25.E Doukhanine, et al. AAJ GCL. OMNIgene®•GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport. DNA Genotek (2016). Available at: https://www.dnagenotek.com/us/pdf/PD-WP-00042.pdf (Accessed: 30th September 2018).
- 26.Shuber, A. P., Lapidus, S. N. & Radcliffe, G. E. Methods for stool sample preparation. US Patent (2002).
- 27.Shuber, A. P., Huntress, F. A., Jr. & Moore, J. K. Methods for preserving DNA integrity. US Patent (2003).
- 28.Wilson IG. Inhibition and facilitation of nucleic acid amplification. Appl. Environ. Microbiol. 1997;63:3741–3751. doi: 10.1128/aem.63.10.3741-3751.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shreiner AB, Kao JY, Young VB. The gut microbiome in health and in disease. Curr. Opin. Gastroenterol. 2015;31:69–75. doi: 10.1097/MOG.0000000000000139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Valdes AM, Walter J, Segal E, Spector TD. Role of the gut microbiota in nutrition and health. BMJ. 2018;361:k2179. doi: 10.1136/bmj.k2179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hall AB, Tolonen AC, Xavier RJ. Human genetic variation and the gut microbiome in disease. Nat. Rev. Genet. 2017;18:690–699. doi: 10.1038/nrg.2017.63. [DOI] [PubMed] [Google Scholar]
- 32.Lozupone CA, Stombaugh JI, Gordon JI, Jansson JK, Knight R. Diversity, stability and resilience of the human gut microbiota. Nature. 2012;489:220–230. doi: 10.1038/nature11550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hadrich D. Microbiome Research Is Becoming the Key to Better Understanding Health and Nutrition. Front. Genet. 2018;9:212. doi: 10.3389/fgene.2018.00212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Goodrich JK, et al. Human Genetics Shape the Gut Microbiome. Cell. 2014;159:789–799. doi: 10.1016/j.cell.2014.09.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Liu S, et al. The Host Shapes the Gut Microbiota via Fecal MicroRNA. Cell Host Microbe. 2016;19:32–43. doi: 10.1016/j.chom.2015.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Dingle TC, Sedlak RH, Cook L, Jerome KR. Tolerance of droplet-digital PCR vs real-time quantitative PCR to inhibitory substances. Clin. Chem. 2013;59:1670–1672. doi: 10.1373/clinchem.2013.211045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kavlick MF. Development of a universal internal positive control. Biotechniques. 2018;65:275–280. doi: 10.2144/btn-2018-0034. [DOI] [PubMed] [Google Scholar]
- 38.Nicholls DG. Mitochondrial function and dysfunction in the cell: its relevance to aging and aging-related disease. Int. J. Biochem. Cell Biol. 2002;34:1372–1381. doi: 10.1016/S1357-2725(02)00077-8. [DOI] [PubMed] [Google Scholar]
- 39.Gu F, et al. Alterations in mitochondrial DNA copy number and the activities of electron transport chain complexes and pyruvate dehydrogenase in the frontal cortex from subjects with autism. Transl. Psychiatry. 2013;3:e299. doi: 10.1038/tp.2013.68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Chen S, et al. Elevated mitochondrial DNA copy number in peripheral blood cells is associated with childhood autism. BMC Psychiatry. 2015;15:50. doi: 10.1186/s12888-015-0432-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Campa D, et al. Mitochondrial DNA copy number variation, leukocyte telomere length, and breast cancer risk in the European Prospective Investigation into Cancer and Nutrition (EPIC) study. Breast Cancer Res. 2018;20:29. doi: 10.1186/s13058-018-0955-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Reznik, E. et al. Mitochondrial DNA copy number variation across human cancers. Elife5, (2016). [DOI] [PMC free article] [PubMed]
- 43.Kumar B, et al. Association of mitochondrial copy number variation and T16189C polymorphism with colorectal cancer in North Indian population. Tumour Biol. 2017;39:1010428317740296. doi: 10.1177/1010428317740296. [DOI] [PubMed] [Google Scholar]
- 44.Tanaka T, et al. Increased Copy Number Variation of mtDNA in an Array-based Digital PCR Assay Predicts Ulcerative Colitis-associated Colorectal Cancer. In Vivo. 2017;31:713–718. doi: 10.21873/invivo.11119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Suzuki MT, Béjà O, Taylor LT, Delong EF. Phylogenetic analysis of ribosomal RNA operons from uncultivated coastal marine bacterioplankton. Environ. Microbiol. 2001;3:323–331. doi: 10.1046/j.1462-2920.2001.00198.x. [DOI] [PubMed] [Google Scholar]
- 46.Rozen S, Skaletsky H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 2000;132:365–386. doi: 10.1385/1-59259-192-2:365. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data generated or analysed during this study are included in this published article in the Figures and Supplementary information files.