Abstract
Microbial forensics has been defined as the discipline of applying scientific methods to the analysis of evidence related to bioterrorism, biocrimes, hoaxes, or the accidental release of a biological agent or toxin for attribution purposes. Over the past 15 years, technology, particularly massively parallel sequencing, and bioinformatics advances now allow the characterization of microorganisms for a variety of human forensic applications, such as human identification, body fluid characterization, postmortem interval estimation, and biocrimes involving tracking of infectious agents. Thus, microbial forensics should be more broadly described as the discipline of applying scientific methods to the analysis of microbial evidence in criminal and civil cases for investigative purposes.
INTRODUCTION
The anthrax letter attacks of 2001 in the United States ushered the world into a new reality and increased awareness of global vulnerability to bioterrorism. In addition, the event demonstrated that as a nation, the United States was woefully unprepared to characterize the biological evidence associated with the outbreak. As a consequence, the field of microbial forensics was developed to build a robust forensic capability to help investigate bioterrorism and biocrime. Microbial forensics is the discipline of applying scientific methods for the analysis of evidence from a bioterrorism attack, biocrime, hoax, or inadvertent release of a biological agent or toxin, with attribution as the ultimate goal (1). Attribution of microbial evidence involves determining an associated source and/or perpetrator or group of individuals to the highest degree possible. The microbial forensics field is built on a network of multiple specialties (e.g., microbiology, genetics, bioinformatics, forensic science, immunology, population genetics, biochemistry, molecular biology, epidemiology, etc.) and the law enforcement, public health, policy, and intelligence communities. While the field was formalized shortly after the anthrax letter attacks, its roots are well established, as they are embedded in the same practices used for decades in epidemiology and public health to investigate disease outbreaks. Epidemiologists focus on the outbreak, the population(s) at risk, spread of disease, possible reservoirs, and characterization of the etiologic agent (2), primarily serving the health care system. Epidemiology and microbial forensics are employed together to attempt to determine if an outbreak is natural, accidental, or intentional. Therefore, the two disciplines are integrated, and specialists tend to work together, with microbial forensics concentrating on individualization of the agent or toxin and/or how it was produced and disseminated. In addition, traditional forensic methods, such as fingerprinting, human DNA analysis, trace materials, and handwriting analysis, are exploited in a microbial forensic investigation, as ultimate attribution is the identification of the perpetrators of the crime.
There are more than 1,400 microbial species or strains that are potential health threats to humans (3), and the number expands by orders of magnitude when considering plant and animal pathogens. While high-consequence agents (4) have been targeted for developing preparedness measures, they are but a small percentage of possible biothreats, and it is not practical to continue to develop assays directed at single targets. There are simply far too many targets. Only 20 years ago, it took 13 months and cost >$870,000 for the first bacterial genome to be sequenced and assembled by the Institute of Genome Research (Rockville, MD) (5, 6). Seven years later for about the same amount of time and a lesser cost (approximately $200,000 to $300,000 for the first genome), genomic sequences of the Bacillus anthracis Ames strains were obtained from the evidence in the letter attacks and purported reference samples (7–9). Technical advancements in recent years, through the advent of massively parallel sequencing (MPS) (which also has been referred to as next-generation sequencing [NGS] and high-throughput sequencing [HTS]), allow analysis of microbes with a throughput and speed that were not thought possible a short time ago. MPS, a disruptive technology and a boon to microbial forensics, may overcome the challenge of identifying unknown pathogens, hoax microorganisms, and low-abundance microorganisms even in complex mixture samples. With its substantially increased throughput and continued development of powerful bioinformatics pipelines, MPS may be used to characterize any microbe, abundant or trace, degraded or intact, and even genetically engineered genomes with one unifying approach. MPS provides the ability to rapidly diagnose and monitor infections using culture-independent methods (thereby reducing cost and turnaround time) and track disease outbreaks in real-time using whole-genome comparisons (10–12). Indeed, Cummings et al. (13) showed several years ago the forensic capability of MPS to rapidly and reliably sequence multiple whole genomes. Since then, epidemiologists have applied MPS to several outbreak investigations (10–12, 14–17), and it is anticipated that MPS eventually will become the routine method for genetic analysis. In addition, MPS provides a methodology for human microbiome studies, which provide inference into different health and disease states and impact conditions, such as obesity, inflammatory bowel syndrome, effects from antibiotic use, and cancer (18–20). These same tools have been used to characterize the complex community of the human microbiome and have been demonstrated for use in human forensic applications, such as human identification, body fluid characterization, and time-since-death decomposition analysis.
Since the field of microbial forensics was developed in response to exigent circumstances, it was narrowly defined as concentrating on the immediate concern: bioterrorism. Other examples of microbial forensics investigations included tracing transmission of human immunodeficiency virus (HIV) and hepatitis C virus (HCV) in criminal health-related matters (21–28), which fall under the biocrime category (Table 1). Therefore, microbial forensics focused on investigations where the microbe or its products (e.g., toxins) were used as weapons or biothreats. However, over the past 15 years, technological advances and the realization of the vastness and abundance of the microbial world prompted the field to expand into other areas where microbes and their products may help other types of forensic investigations, including human identification (29) and postmortem interval estimation (30). This expansion in which microbes were exploited forensically beyond an investigation of bioterrorism and biocrime requires a more comprehensive definition for the field of microbial forensics. Microbial forensics now should be broadly defined as the discipline of characterizing microbiological evidence to develop investigative leads in criminal (and civil) cases.
TABLE 1.
Examples of microbial forensic and epidemiologic (of interest to forensics) investigations
Microbial type by yr | Case description | Reference(s) |
---|---|---|
Bacterial | ||
2000–2001 | Characterized Staphylococcus aureus strains from drug paraphernalia to track drug networks | 66 |
Starting in 2002 | Amerithrax investigation; compared different Ames strain morphotypes to determine the origin of the source material released in the 2001 anthrax attacks | 107, 108 |
2009 | Traced the origin of Bacillus anthracis from injectional anthrax cases among heroin users in Scotland | 109 |
2010 | Traced the origin of the source of the Haitian cholera outbreak after the 2010 earthquake | 10, 110, 111 |
2011 | Traced the origin of the 2011 Escherichia coli O104:H4 outbreak in Germany | 11 |
Viral | ||
1992 | First report of molecular tracking of HIV infection from dentist to patients after invasive health care procedure | 21 |
1994 | Phylogenetic analysis to provide evidence that Richard Schmidt intentionally injected his girlfriend with HIV/HCV-contaminated blood | 25 |
1998 | Phylogenetic and molecular clock analysis to provide evidence that a Spanish anesthetist infected 275 patients with hepatitis C virus | 28 |
2014 | Traced the origin and transmission route of the 2014 Ebola outbreak | 112 |
2015 | Parvovirus B19 characterization from skeletal remains revealed the likely origin of World War II casualties | 67 |
The expansion of the microbial forensics field extends into the realm of forensic human identity testing. Human microbiome analysis can be combined with traditional human DNA testing (e.g., short-tandem-repeat [STR] and single nucleotide polymorphism [SNP] analyses) to potentially provide additional data for stronger associations and better chances to exclude individuals falsely associated with biological evidence (Fig. 1). Human DNA-based identity testing allows for an analysis of stable inherited markers in the human genome related to individualization, kinship, ancestry, and phenotype. In contrast, forensic microbiome testing allows for the analysis of both stable and fluctuating changes in microbial communities on and in the body related to individualization, diet, health, recent geolocation, and postmortem time intervals. Microbial genetic markers can be used to expand upon current forensic genetic testing capabilities.
FIG 1.
Diagram of expanded human investigative and forensic testing to include human genome and human microbiome characterization.
MICROBIAL FORENSICS AND HUMAN IDENTITY TESTING
Humans are born with approximately 20,500 genes and die with more than 1,000,000 genes (31–33). This change in gene count is due to the accumulation of microorganisms as part of the normal development and existence of human beings. Indeed, there are 10 times more bacterial cells in and on the body than there are human cells, making us reconsider what makes us human. The substantial microflora that humans carry is known collectively as the human microbiome (32, 33). The majority of human microbes are autonomous, self-replicating, transmissible, unavoidable, and, in general, ubiquitous, although they may vary to some degree or substantially from human to human (34, 35). The microorganisms and their accompanying nucleic acids that are carried by humans are shed, deposited, and exchanged routinely in a similar fashion to human DNA, which is exploited for identifying people involved in sexual assaults, murders, and burglaries. These microbial genomes from many different microbial species collectively are complex and variable and may provide forensic signatures that could be as individualizing as are the current set of human short-tandem-repeat markers (36) used for identity testing purposes. In addition, since microorganisms that reside in different areas of the human body vary, additional investigative value could be obtained for determining the tissue source of forensic biological evidence. Thus, the human microbiome may be another target that could be used to identify (or exclude) humans involved in crimes.
When people touch items, they often transfer their DNA onto objects (via primary, secondary, and tertiary transfer) (37, 38). Therefore, an individualizing signature is left behind that can be exploited to determine the identity of an individual who may have handled an object. Forensic DNA typing characterizes genetic signatures from human biological samples. The current DNA typing methodologies focus on markers in the human genome and are sensitive, highly discriminating, and well validated (for examples, see references 39–41). Unfortunately, the amount of human DNA deposited by touching an object often is very low, and most technologies cannot reliably type such low levels of DNA. To attempt to obtain results from such limited samples, modifications of current methods are made to increase the sensitivity of detection of human DNA. Collectively, the suite of methods that increase the sensitivity of human DNA typing protocols are known as low-copy-number (LCN) typing (42–44). Under LCN typing, limited template analyses suffer from exaggerated stochastic effects, and with increased sensitivity, there is a greater potential for contamination. This lack of reproducibility with LCN typing results has not deterred the interest in LCN typing and its potential use for developing investigative lead purposes. One way to partially overcome the uncertainty of the LCN typing methodology is to employ an orthogonal approach. In fact, each technology, i.e., LCN typing and the orthogonal methodology, may not be robust, but when performed together, the information may be sufficiently corroborating to decrease uncertainty, enabling better use results from trace biological sample analyses for developing investigative leads.
Given the much greater number of bacterial cells compared with human cells, it is conceivable that more bacterial cells and thus gene targets are deposited on touched items than are human markers. Indeed, Grice et al. (45) showed that 10,000 bacteria/cm2 and 50,000 bacteria/cm2 could be collected by swabbing and scraping the skin, respectively. Of course, the population of bacterial cells is composed of many species with various abundance levels. However, a combination of enrichment methods, such as PCR, and sequencing can be used to detect those species that may be used to individualize their human hosts. Goga (46) sought to analyze bacterial DNA from shoes, as the quantity of human DNA there often was at LCN typing levels. Goga demonstrated that there are sufficient bacteria in shoes and the plantar skins of individuals and that the microbiome communities were unique among the individuals tested. A level of “matching” was possible between the shoe and wearer profiles. However, the results varied substantially, and in a few comparisons, the identity of the wearer and another individual could not be resolved. Tims et al. (47) analyzed the microbiome before and after hand washing and observed similar variation. They opined that constant bacterial contamination by touching may impact the results. These findings were not similar to those of Fierer et al. (29, 48), who collected reference bacterial samples without washing and successfully matched reference microbial communities to those deposited on touched objects, such as keyboards and computer mice. Fierer et al. (48) suggested that washing may have changed the composition of the hand skin microbiome. For hand microbiome profiling, an unwashed hand may be a better starting point for studies. More studies are needed to determine the suitability of touch sample analyses.
Gut microbiome profiling also may be an important tool for forensic human identity testing. There is evidence that core gut microflora of an individual are stable (49, 50) but can be affected by environmental changes and antibiotic use (51, 52). Fecal material has been found at some crime scenes, and determining the source of such evidence may be an important investigative lead. Typically, human STR profiling is attempted but with mixed success. However, fecal material, the primary material used to study the gut microbiome, carries a microbiome profile that could be exploited for identification purposes. The stable microbiome may provide identity information, while fluctuating or transient microbiota might indicate recent diet or geolocation.
Identification of the tissue source of forensic biological samples can be critical in some investigations to reconstruct crime scenes and events, but current techniques are limited. Presumptive tests for the specific body fluids are used as screening tests and tend to have specificity limitations. Screening, which is often quick and inexpensive, is used to select the best candidate samples for more-in-depth testing. Confirmatory tests identify tissues with high specificity. While gene expression (mRNA typing) (53, 54) and methylation (55) have been described, most presumptive and confirmatory tests are protein-based enzymatic or immunologic assays. The exception is microscopic visualization of sperm for the identification of semen. Proteins are less stable than DNA, and a negative result may not indicate reliably the tissue source of a sample or the amount of DNA in a sample. Most tissue assays are non- or semiquantitative without a defined threshold for a positive reaction. Because of the different assay formats for each tissue-specific target, presumptive and confirmatory tests cannot be run in a similar parallel manner, adding to labor demand and the difficulty of automation in a cost-effective way.
Unique bacterial genetic signatures may aid in biological sample tissue source determination. For example, Lactobacillus crispatus, Lactobacillus jensenii, and Atopobium vaginae have been associated with vaginal secretions, while Lactobacillus iners, Lactobacillus gasseri, and Gardnerella vaginalis have been found in other body fluids as well (56–58). Tissue specific to the vagina could be informative in rape cases. Nakanishi et al. (59) showed that Streptococcus salivarius and Streptococcus mutans could be detected in mock forensic saliva samples and were not present in other forensically relevant tissue sources. Choi et al. (60) had similar findings for S. salivarius. These species are relatively abundant and thus may be easily detected.
Nose and throat (respiratory and digestive) commensal bacteria are likely candidates for geographical and recent contact information because they demonstrate high temporal stability (persistence), are contagious organisms (airborne and fomites), exhibit clonal and geographical variation and diversity, colonize a large portion of the population, and are easy to collect (e.g., see references 61 and 62). Scheidegger and Zimmerli (63) have shown that Staphylococcus aureus is a high-incidence bacterium among drug users. Shared drug paraphernalia can be an avenue for bacterial transmission (64), in which colonization can be transient or permanent (65). A study of inhalational drug users detected similar S. aureus strains between users and drug paraphernalia (66). The S. aureus strain analysis traced back 14 social networks (known groups of individuals connected by social interaction and reported drug use) linked to a crack house, while only two of the biological networks (nasal culture and drug paraphernalia linked by significant similarity among isolates) were identified by social network analysis. These data demonstrate that behavioral, social, and environmental networks may be identified by analysis of a targeted species of the human microbiome. Because environment is related to social networks, hygiene, diet, geography, etc., microbiome analyses could be an additional tool for intelligence gathering. Last, the profiles could be used to confirm or refute what persons of interest claim about where they have been and with whom they have associated. If the microbial profile of an individual reflects the unique nature of a complex microbial community, microbial profiles may be better than human DNA typing for recent geolocation, as the environment (including contact with groups of individuals) may carry location-specific microorganisms that will be exchanged through human contact.
Tracking sexually transmitted diseases in child molestation, rape cases, and questioned malpractice cases in health care provider outbreaks have been well documented for HIV and HCV cases (21–28) and have been part of the microbial forensics arena. A full-scale analysis that demonstrated the potential of such investigations was reported by Gonzáles-Candelas et al. (28). These authors described in detail a case of a Spanish anesthetist convicted of malpractice by infecting more than 270 of his patients with hepatitis C virus (HCV). The case is particularly interesting due the time frame of the investigation some 25 years after the first suspected transmission(s) and by the number and complexity of potential infectious events (>300 candidates from two hospitals). Thus, scientifically challenging but forensically obvious questions were posed to the experts. (i) Was the suspect the source responsible for the outbreak? (ii) Could it be ascertained whether the patients that had been infected share a source and thus could be included in the outbreak? (iii) Alternatively, which patients could have been infected from other sources? (iv) Could these alternative sources or the existence of different but simultaneous outbreaks be determined? (v) Could the duration of the outbreak be determined? (vi) Could the time of infection for each patient in the outbreak be estimated? Finally, (vii) could the date of infection of the anesthetist be estimated? These questions were tackled by sequencing 229 nucleotides of a conservative nonstructural 5B (NS5B) gene using standard Sanger sequencing technology. After systematic and rigorous analyses, the authors concluded that the suspected health care professional was indeed the source of the HCV infection of a portion of infected individuals and provided likelihood ratios to support the findings versus an alternative hypothesis. As in many other forensic DNA investigations, the data were not used to prove if the suspect was guilty, as the results provide only the identity of the virus and strain (or quasispecies). The data combined with other evidence were used to convict the anesthetist.
The human virome also potentially can be used for characterizing unidentified cadavers, even in cases of skeletonized remains. Toppinen et al. (67) are the first to describe the suitability of bone as a source for exploration of DNA viruses (as ancient DNA researchers have described bacterial strains of plague victims from past pandemics [68–70]) but instead for forensic identity purposes. They showed parvovirus B19 DNA sequences were found abundantly in 70-year-old long bones from putative casualties from World War II. The reported viral sequences were exclusively from one genotype, which disappeared from circulation in the 1970s, or from a genotype that has never been reported in northern Europe. By adding the viral information to that from human mitochondrial and Y-chromosome profiling, the authors concluded a dispute of the origin of two individuals found in the battlefield between Finland and Russia.
These cases provide insight into the potential of exploiting the microbial world for forensic purposes. It is clear that the human microbiome holds substantial information to assist the forensic science community's analysis of a variety of case scenarios.
MPS APPROACHES
Most microbial analysis methods, and in particular metagenomics, have focused on the single target of the 16S rRNA gene, which lacks species-level resolution. Some of the above-mentioned studies targeted only a few known species that may be unique to a body fluid or tissue, such as saliva or vaginal secretions. They only require a positive/negative result for tissue sourcing. However, most of the studies were not sufficient to unequivocally identify the targeted species among a complex metagenomics background necessary for forensic investigations, and generally, they were not sufficient to provide high-depth characterization at the subspecies or genotype level. Lastly, such tissue-sourcing assays often require the redesign of primers to increase specificity and continual evaluation of potential false positives with increased sampling and databases as they are updated with new genomic sequences. If specialized primer design is required for the detection of every, or most, microorganism of interest, there will be great demand on technology development and require substantial and costly validation.
Current major genetic and health efforts describe the diversity of the human microbiome (18, 32, 35, 45, 71–73). However, little work has been carried out on how individualizing the profiles may be, which microorganisms are common to all humans, what portion of these common human microbiome species are stable throughout an individual's lifetime (or at least for reasonably long time periods), and what portions vary due to environmental conditions. Such questions cannot be answered until (i) a defined set of resolving markers are developed to assay the human microbiome, and (ii) a defined set of bacterial species are determined that are best suited for human identity testing.
There are technical challenges to identify bacteria of interest within highly complex metagenomic samples, to distinguish those of interest from near neighbors and from the vast complex background that constitutes a microbial sample taken from body areas, and the degree of confidence that can be assigned to a potential species that is detected in such complex samples. One approach to meet these objectives is to develop a panel of bacterial markers that have the potential to differentiate human hosts and that can be multiplexed in a MPS system. A targeted bacterial marker panel would provide an efficient and cost-effective method that balances sequence coverage and throughput and yet provides species-level resolution. MPS could provide a single unifying methodology so the power of the assay can be realized and resources for testing and validation can be used in a cost-efficient manner. Sensitive and accurate bacterial DNA detection is imperative for forensics analyses. Validation is an essential part of any assay and should be become a routine part of technology development and implementation (see below). The aim to achieve the goals of human identification through the microbiome is essentially targeted genomics to survey the variety of microorganisms present in a specific sample. Microbes of interest for human identity testing should be common to all healthy individuals and sufficiently abundant to be detected routinely. Human microbiomes represent unique ecosystems composed of complex mixtures of microorganisms. For the needs of forensic diagnostics, analysis of metagenomic samples can be more focused than total community structure and organism abundance studies. Instead, molecular analyses can entail the identification of specific microorganisms that provide an individualizing signature to the highest degree possible. Such information requires the resolution of selected microorganisms from near neighbors as well as from all other microorganisms of the community that create noise and reduce the sensitivity of detection of an assay.
Most metagenomic analyses characterize whole microbial communities at the phylum level, often reporting different abundance ratios of partially resolved taxa. While interesting and informative for ecosystem analyses, that level of resolution and the variation in abundance from sample-to-sample likely will not achieve human individualization. Instead, for human identification, it is imperative to (i) identify key bacteria that are common to all individuals so identity testing can be informative with a routine target set of microorganisms, (ii) select those bacteria that are relatively abundant to reduce stochastic sampling effects, and (iii) target genes or sequences that contain sufficient variation to generate a profile that would provide a high degree of individualization of the donor(s) of bacterium-containing biological evidence.
Current metagenomic approaches apply MPS and target a single phylogenetic marker, the 16S rRNA gene, or perform whole-genome shotgun sequencing. MPS with the 16S rRNA gene provides deeper coverage but rarely can differentiate at the species level. Whole-genome shotgun sequencing may be able to differentiate at the species level but lacks depth of coverage, and thus, stochastic effects reduce its ability to achieve taxonomic resolution. The 16S rRNA gene is the most commonly used bacterial genetic marker in phylogenetic studies and broad bacterial identification. The conserved and variable regions of the gene, its presence in a number of databases (e.g., see references 74–76), and the significant volume of 16S rRNA studies add to the appeal of using this marker. However, there are limitations to solely using 16S rRNA, which include insufficient species resolution (77), PCR bias (78, 79), copy number variation (80), and sequence variability among a single bacterium (81), inaccurate phylogenetic relationships based on key variability outside the marker region (82), and horizontal transfer of the entire gene region (83, 84). These limitations of using a single gene target to identify bacteria in a complex community will lead to inaccurate abundance ratios and can confound phylogenetic analyses. Whole-genome shotgun sequencing provides the ability theoretically to sequence all DNA molecules in a given sample, potentially covering any given region(s) of many genomes. Whole-genome shotgun sequencing can obtain species- or strain-level characterization of a given genome by producing sequence reads of species-/strain-specific informative markers. However, the more area of any given genome that is attempted to be sequenced, the less depth of coverage that will be obtained for any particular site, potentially reducing the confidence for species identification. Highly complex metagenomic samples can contain thousands of species within a sample, thereby limiting the coverage of any one genome, especially those at low abundance, such as may be inherent to trace biological samples.
A novel metagenomics approach that employs the use of multiple informative phylogenetic markers can allow greater depth of coverage of targeted genomic regions than would be achieved by whole-genome shotgun sequencing. The markers suited for the task of species-level resolution are housekeeping genes, as they contain appropriate conserved and variable regions to differentiate among species. The use of housekeeping genes for bacterial species- and strain-level identification is not a new concept. Multilocus sequence typing (MLST), first described almost 20 years ago (85), is one of the most commonly used bacterial typing methods. MLST typically uses seven housekeeping genes, and the composite locus sequence creates a profile for comparison purposes. Online MLST databases are available for identification purposes and data storage (86–88). With MPS, an augmented MLST method has been developed using 20 genes (or 21 genes [89]) called MLST-seq (90). In addition, bioinformatics software packages are freely available to analyze MLST profiles from shotgun sequencing data (88, 91, 92). Even with these advances, there are limitations with the MLST method, which include nonuniversal gene panels and insufficient species-level resolution. Seven markers simply are not sufficient for identification to the species level. In these cases, multilocus variable-number tandem-repeat analysis (MLVA) and single nucleotide polymorphism (SNP) analyses have proved beneficial for additional resolution of specific species (93). An expanded MLST approach has been developed, called ribosomal multilocus sequence typing (rMLST), utilizing 53 housekeeping genes to type bacteria down to the subspecies level (94). Another method using housekeeping genes as genetic markers was described by Baldwin et al. (95). Their approach consisted of a PCR-based assay using 16 different primer pairs for amplifying 9 housekeeping genes, followed by electrospray ionization mass spectrometry for genus- and species-level characterization (95). This mass spectrometry-based assay is very appealing and supports the concept of multiple markers for identification to the species level, but it is limited to detecting only the most abundant 1 to 3 species in a sample, i.e., it is better suited for testing infected individuals with a high titer of the target species.
Bioinformatics programs can retrieve and use housekeeping genes as genetic markers from shotgun sequencing data. AMPHORA (AutoMated PHylogenOmic infeRence) has been developed using a panel of 31 housekeeping genes for better taxonomic resolution than solely using 16S rRNA (96). These genes were selected because they are found within all bacteria, mostly present in single copies, and believed to be fairly resistant to horizontal gene transfer (96). This 31-housekeeping-gene panel was modified, and another bioinformatic program, called MetaPhyler, was developed to analyze the data (97). Phyla-AMPHORA was developed utilizing thousands of phylum-specific phylogenetic markers to improve the resolution of phylogenomic analyses over that with the initial 31 genes (98). Additionally, PhyloSift enables phylogenetic analysis of metagenomes, which expands on AMPHORA, and includes 37 bacterial and archaeal genetic markers (PhyEco markers) (99) in the prokaryote core marker set (100). PhyloSift also provides capabilities for the use of extended and custom marker sets and has software packages for metagenomic data set simulation (to test newly generated custom markers) and statistical analyses (100). Another bioinformatics tool, mOTU, was developed using 40 marker genes (101) to profile metagenomes with species-level resolution (102). Additionally, FunGene (the Functional Gene Pipeline and Repository) includes 11 phylogenetic markers in its repository and analysis pipeline (103). Although these programs use different genetic markers (with a subset of overlapping markers), they demonstrate that the technology and software exist to develop a custom genetic panel consisting of informative phylogenetic markers to use for species-level identification in human microbiome samples.
VALIDATION
Validation is essential in the development of diagnostic methods, and those used in microbial forensics are no exception. The most important consideration is that microbial forensic methods may develop information that can impact the health, life, and freedom of individuals, policy decisions, and possibly action by governments on a large scale. Therefore, the results generated from the analyses of any microbial forensics analysis (including collection, transfer, analytical method, and interpretation, as well as proper training) must be understood and limitations defined, so that a proper degree of confidence can be associated with the findings. Accurate and credible results are a requisite. Budowle et al. (104) stressed that failure to properly validate a method or misinterpret the results from a microbial forensic analysis or process may have severe consequences.
Validation is the process that “1) assesses the ability of procedures to obtain reliable results under defined conditions, 2) rigorously defines the conditions that are required to obtain the results, 3) determines the limitations of the procedures, 4) identifies aspects of the analysis that must be monitored and controlled, and 5) forms the basis for development of interpretation guidelines to convey the significance of the findings” (104). The basic features and criteria for validation are listed in quality assurance guidelines for microbial forensics (1) and are addressed elsewhere in detail (104, 105). Budowle et al. (104) define developmental validation as “the acquisition of test data and the determination of conditions and limitations of a newly developed method to analyze samples” and internal validation “as an accumulation of test data within the operational laboratory to demonstrate that established methods and procedures perform within predetermined limits in the laboratory.” Every effort should be made to validate a method thoroughly before implementation. However, cost, resources, and exigent circumstances may require a method to be implemented without extensive validation. Clearly, it is unacceptable to hold off on any analyses because an available method has not been completely validated, especially when an attack is under way. In this context, Schutzer et al. (106) described the process of preliminary validation as “an early evaluation of a method that will be used to investigate a biocrime or bioterrorism event.” There is still a requirement to acquire some test data to evaluate a potential method so peer review of extant data can be performed quickly by a panel of experts. This panel would recommend limitations and additional studies (if necessary) to use prior to analyzing evidentiary material. However, it is stressed that a preliminary validation should not be used as an excuse for obviating a full validation.
Budowle et al. (105) recently described criteria for the validation of MPS procedures. The criteria are not novel, as the same approaches are being used for human genome sequencing and clinical diagnostics. Three common areas of MPS validation are (i) sample preparation, (ii) sequencing, and (iii) data analysis, but these criteria should be defined in terms of the specific application. There are a number of general topics subsumed within these three areas, which are listed in Table 2.
TABLE 2.
Validation criteria for MPS proceduresa
Validation criteria |
---|
Extraction of DNA (or RNA) |
Quality |
Purity |
Enrichment |
PCR |
Capture |
Library preparation |
Multiplexing |
Markers |
Samples |
Sequencing |
Data analysis |
Raw data processing |
Quality scores |
Alignment |
Variant calls |
Reference materials |
Test materials |
Controls |
Databases |
Test materials |
Inferences of results |
Bioinformatics |
Software tools |
Data management |
Data storage |
Interpretation |
Taxonomic assignment |
Abundance |
Organism classification |
Community structure |
Standard operating protocols |
Reporting |
Based on data from reference 105.
Addressing these general areas of the MPS method should provide an analytical tool that is sufficiently robust for the expanding field of microbial forensics. While the field certainly has matured and no longer can be considered solely the domain of bioterrorism and biocrime investigations, MPS technologies are still quite dynamic. It is anticipated that the speed of obtaining sequencing results will increase, and read lengths will be extended, with a concomitant decrease in cost. Applications beyond those discussed here will become part of the microbial forensics arena. Good-quality practices will ensure that the tools of microbial forensics and the results obtained will be robust, accurate, and reliable.
As the microbial forensics field expands to incorporate the use of microbiome characterization for human investigative and forensic testing, additional methods can be created to expand the forensic genetic toolbox. Likely, more-advanced sequencing methods can be developed to better elucidate the data that can be obtained from complex metagenomic samples, which can provide additional investigative value, including geolocation, infection source tracking of biocrimes, postmortem interval, and increased power of discrimination for human identification. As standards and quality practices are developed for MPS and metagenomic microbial forensic testing methods, human microbiome characterization likely will become a routine methodology in the field of microbial forensics.
Biographies
Sarah E. Schmedes, M.S., is a Ph.D. Candidate in the Department of Molecular and Medical Genetics at the University of North Texas Health Science Center (UNTHSC) in Fort Worth, TX. She earned her Bachelor of Science in biochemistry and microbiology from Texas State University in San Marcos, TX, and earned her Master of Science in forensic biology from the University of Albany, State University of New York. Prior to enrolling in the molecular genetics doctoral program at UNTHSC, she served as a forensic research assistant at the Institute of Applied Genetics at UNTHSC. She has published 8 manuscripts and 1 book chapter during her graduate career. Sarah is a student member of the American Society for Microbiology and was a recipient of the 2014 Eugene and Millicent Goldschmidt Graduate Student Award, awarded by the Texas Branch-American Society for Microbiology. Her research interests include metagenomics, microbial forensics, and emerging infectious diseases.
Antti Sajantila, M.D, Ph.D., is a trained forensic pathologist and forensic geneticist from the Department of Forensic Medicine, University of Helsinki, Finland. He received his Ph.D. from the University of Helsinki, Finland, and was a postdoctoral fellow in Ludwig Maximilian's University in Munich, Germany. He has published over 200 articles, book chapters, and reviews. His research interests are in the fields of human population genetics, human identification, disaster victim identification, and cause-of-death investigation. He has testified in forensic court cases in Finland, Australia, and Kenya. He also has served on several national and international committees for the application of DNA methods in human identification. He is currently a member of Interpol's Disaster Victim Identification Standing Committee's Forensic Pathology and Forensic Genetics Working Groups.
Bruce Budowle received a Ph.D. in genetics in 1979 from VA Tech. In 1983, he joined the FBI Laboratory Division to carry out research, development, and validation of methods for forensic biological analyses. He has published approximately 550 articles and testified in well over 250 criminal cases. He has authored or coauthored books on molecular biology techniques, electrophoresis, protein detection, and microbial forensics. He has been directly involved in developing quality assurance standards for the forensic DNA field. Some of his efforts over the last 20 years have been in microbial forensics and bioterrorism. In 2009, B. Budowle became Executive Director of the Institute of Applied Genetics and Professor in the Department of Molecular and Medical Genetics at the University of North Texas Health Science Center at Fort Worth, TX. His current efforts focus on human forensic identification, microbial forensics, and emerging infectious disease.
Footnotes
For a commentary on this article, see doi:10.1128/JCM.01082-16.
REFERENCES
- 1.Budowle B, Schutzer SE, Einseln A, Kelley LC, Walsh AC, Smith JAL, Marrone BL, Robertson J, Campos J. 2003. Building microbial forensics as a response to bioterrorism. Science 301:1852–1853. doi: 10.1126/science.1090083. [DOI] [PubMed] [Google Scholar]
- 2.Morse SA, Budowle B. 2006. Microbial forensics: application to bioterrorism preparedness and response. Infect Dis Clin North Am 20:455–473. doi: 10.1016/j.idc.2006.03.004. [DOI] [PubMed] [Google Scholar]
- 3.Taylor LH, Latham SM, Woolhouse MEJ. 2001. Risk factors for human disease emergence. Philos Trans R Soc Lond B Biol Sci 356:983–989. doi: 10.1098/rstb.2001.0888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Centers for Disease Control and Prevention. 2016. Bioterrorism agents/diseases. Centers for Disease Control and Prevention, Atlanta, GA: http://emergency.cdc.gov/agent/agentlist.asp. [Google Scholar]
- 5.Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb J-F, Dougherty BA, Merrick JM, McKenney K, Sutton G, FitzHugh W, Fields C, Gocayne JD, Scott J, Shirley R, Liu L-I, Glodek A, Kelley JM, Weidman JF, Phillips CA, Spriggs T, Hedblom E, Cotton MD, Utterback TR, Hanna MC, Nguyen DT, Saudek DM, Brandon RC, Fine LD, Fritchman JL, Fuhrmann JL, Geoghagen NSM, Gnehm CL, McDonald LA, Small KV, Fraser CM, Smith HO, Venter JC. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512. doi: 10.1126/science.7542800. [DOI] [PubMed] [Google Scholar]
- 6.Institute of Medicine. 2013. Workshop overview, p 12 In The science and applications of microbial genomics: workshop summary. The National Academies Press, Washington, DC. [PubMed] [Google Scholar]
- 7.Institute of Medicine. 2013. Workshop overview, p 26 In The science and applications of microbial genomics: workshop summary. The National Academies Press, Washington, DC. [PubMed] [Google Scholar]
- 8.Rasko DA, Worsham PL, Abshire TG, Stanley ST, Bannan JD, Wilson MR, Langham RJ, Decker RS, Jiang L, Read TD, Phillippy AM, Salzberg SL, Pop M, Van Ert MN, Kenefic LJ, Keim PS, Fraser-Liggett CM, Ravel J. 2011. Bacillus anthracis comparative genome analysis in support of the Amerithrax investigation. Proc Natl Acad Sci U S A 108:5027–5032. doi: 10.1073/pnas.1016657108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Read TD, Salzberg SL, Pop M, Shumway M, Umayam L, Jiang L, Holtzapple E, Busch JD, Smith KL, Schupp JM, Solomon D, Keim P, Fraser CM. 2002. Comparative genome sequencing for discovery of novel polymorphisms in Bacillus anthracis. Science 296:2028–2033. doi: 10.1126/science.1071837. [DOI] [PubMed] [Google Scholar]
- 10.Chin C-S, Sorenson J, Harris JB, Robins WP, Charles RC, Jean-Charles RR, Bullard J, Webster DR, Kasarskis A, Peluso P, Paxinos EE, Yamaichi Y, Calderwood SB, Mekalanos JJ, Schadt EE, Waldor MK. 2011. The origin of the Haitian cholera outbreak strain. N Engl J Med 364:33–42. doi: 10.1056/NEJMoa1012928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Grad YH, Lipsitch M, Feldgarden M, Arachchi HM, Cerqueira GC, Fitzgerald M, Godfrey P, Haas BJ, Murphy CI, Russ C, Sykes S, Walker BJ, Wortman JR, Young S, Zeng Q, Abouelleil A, Bochicchio J, Chauvin S, DeSmet T, Gujja S, McCowan C, Montmayeur A, Steelman S, Frimodt-Møller J, Petersen AM, Struve C, Krogfelt KA, Bingen E, Weill F-X, Lander ES, Nusbaum C, Birren BW, Hung DT, Hanage WP. 2012. Genomic epidemiology of the Escherichia coli O104:H4 outbreaks in Europe, 2011. Proc Natl Acad Sci U S A 109:3065–3070. doi: 10.1073/pnas.1121491109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mellmann A, Harmsen D, Cummings CA, Zentz EB, Leopold SR, Rico A, Prior K, Szczepanowski R, Ji Y, Zhang W, McLaughlin SF, Henkhaus JK, Leopold B, Bielaszewska M, Prager R, Brzoska PM, Moore RL, Guenther S, Rothberg JM, Karch H. 2011. Prospective genomic characterization of the German enterohemorrhagic Escherichia coli O104:H4 outbreak by rapid next generation sequencing technology. PLoS One 6:e22751. doi: 10.1371/journal.pone.0022751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cummings CA, Bormann Chung CA, Fang R, Barker M, Brzoska P, Williamson PC, Beaudry J, Matthews M, Schupp J, Wagner DM, Birdsell D, Vogler AJ, Furtado MR, Keim P, Budowle B. 2010. Accurate, rapid and high-throughput detection of strain-specific polymorphisms in Bacillus anthracis and Yersinia pestis by next-generation sequencing. Invest Genet 1:5. doi: 10.1186/2041-2223-1-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gardy JL, Johnston JC, Ho Sui SJ, Cook VJ, Shah L, Brodkin E, Rempel S, Moore R, Zhao Y, Holt R, Varhol R, Birol I, Lem M, Sharma MK, Elwood K, Jones SJM, Brinkman FSL, Brunham RC, Tang P. 2011. Whole-genome sequencing and social-network analysis of a tuberculosis outbreak. N Engl J Med 364:730–739. doi: 10.1056/NEJMoa1003176. [DOI] [PubMed] [Google Scholar]
- 15.Lienau EK, Strain E, Wang C, Zheng J, Ottesen AR, Keys CE, Hammack TS, Musser SM, Brown EW, Allard MW, Cao G, Meng J, Stones R. 2011. Identification of a salmonellosis outbreak by means of molecular sequencing. N Engl J Med 364:981–982. doi: 10.1056/NEJMc1100443. [DOI] [PubMed] [Google Scholar]
- 16.Eyre DW, Golubchik T, Gordon NC, Bowden R, Piazza P, Batty EM, Ip CLC, Wilson DJ, Didelot X, O'Connor L, Lay R, Buck D, Kearns AM, Shaw A, Paul J, Wilcox MH, Donnelly PJ, Peto TEA, Walker AS, Crook DW. 2012. A pilot study of rapid benchtop sequencing of Staphylococcus aureus and Clostridium difficile for outbreak detection and surveillance. BMJ Open 2:e001124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Köser CU, Holden MTG, Ellington MJ, Cartwright EJP, Brown NM, Ogilvy-Stuart AL, Hsu LY, Chewapreecha C, Croucher NJ, Harris SR, Sanders M, Enright MC, Dougan G, Bentley SD, Parkhill J, Fraser LJ, Betley JR, Schulz-Trieglaff OB, Smith GP, Peacock SJ. 2012. Rapid whole-genome sequencing for investigation of a neonatal MRSA outbreak. N Engl J Med 366:2267–2275. doi: 10.1056/NEJMoa1109910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cho I, Blaser MJ. 2012. The human microbiome: at the interface of health and disease. Nat Rev Genet 13:260–270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Petrof EO, Gloor GB, Vanner SJ, Weese SJ, Carter D, Daigneault MC, Brown EM, Schroeter K, Allen-Vercoe E. 2013. Stool substitute transplant therapy for the eradication of Clostridium difficile infection: “RePOOPulating” the gut. Microbiome 1:3. doi: 10.1186/2049-2618-1-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Schulfer A, Blaser MJ. 2015. Risks of antibiotic exposures early in life on the developing microbiome. PLoS Pathog 11:e1004903. doi: 10.1371/journal.ppat.1004903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ou CY, Ciesielski CA, Myers G, Bandea CI, Luo CC, Korber BT, Mullins JI, Schochetman G, Berkelman RL, Economou AN. 1992. Molecular epidemiology of HIV transmission in a dental practice. Science 256:1165–1171. [DOI] [PubMed] [Google Scholar]
- 22.Albert J, Wahlberg J, Leitner T, Escanilla D, Uhlen M. 1994. Analysis of a rape case by direct sequencing of the human immunodeficiency virus type 1 pol and gag genes. J Virol 68:5918–5924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Birch CJ, McCaw RF, Bulach DM, Revill PA, Carter JT, Tomnay J, Hatch B, Middleton TV, Chibo D, Catton MG, Pankhurst JL, Breschkin AM, Locarnini SA, Bowden DS. 2000. Molecular analysis of human immunodeficiency virus strains associated with a case of criminal transmission of the virus. J Infect Dis 182:941–944. doi: 10.1086/315751. [DOI] [PubMed] [Google Scholar]
- 24.Machuca R, Jørgensen LB, Theilade P, Nielsen C. 2001. Molecular investigation of transmission of human immunodeficiency virus type 1 in a criminal case. Clin Diagn Lab Immunol 8:884–890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Metzker ML, Mindell DP, Liu X-M, Ptak RG, Gibbs RA, Hillis DM. 2002. Molecular evidence of HIV-1 transmission in a criminal case. Proc Natl Acad Sci U S A 99:14292–14297. doi: 10.1073/pnas.222522599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lemey P, Van Dooren S, Van Laethem K, Schrooten Y, Derdelinckx I, Goubau P, Brun-Vézinet F, Vaira D, Vandamme A-M. 2005. Molecular testing of multiple HIV-1 transmissions in a criminal case. AIDS 19:1649–1658. doi: 10.1097/01.aids.0000187904.02261.1a. [DOI] [PubMed] [Google Scholar]
- 27.Scaduto DI, Brown JM, Haaland WC, Zwickl DJ, Hillis DM, Metzker ML. 2010. Source identification in two criminal cases using phylogenetic analysis of HIV-1 DNA sequences. Proc Natl Acad Sci U S A 107:21242–21247. doi: 10.1073/pnas.1015673107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.González-Candelas F, Bracho MA, Wróbel B, Moya A. 2013. Molecular evolution in court: analysis of a large hepatitis C virus outbreak from an evolving source. BMC Biol 11:76. doi: 10.1186/1741-7007-11-76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Fierer N, Lauber CL, Zhou N, McDonald D, Costello EK, Knight R. 2010. Forensic identification using skin bacterial communities. Proc Natl Acad Sci U S A 107:6477–6481. doi: 10.1073/pnas.1000162107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Pechal JL, Crippen TL, Benbow ME, Tarone AM, Dowd S, Tomberlin JK. 2014. The potential use of bacterial community succession in forensics as described by high throughput metagenomic sequencing. Int J Legal Med 128:193–205. doi: 10.1007/s00414-013-0872-1. [DOI] [PubMed] [Google Scholar]
- 31.Clamp M, Fry B, Kamal M, Xie X, Cuff J, Lin MF, Kellis M, Lindblad-Toh K, Lander ES. 2007. Distinguishing protein-coding and noncoding genes in the human genome. Proc Natl Acad Sci U S A 104:19428–19433. doi: 10.1073/pnas.0709013104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Human Microbiome Project Consortium. 2012. Structure, function and diversity of the healthy human microbiome. Nature 486:207–214. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Human Microbiome Project Consortium. 2012. A framework for human microbiome research. Nature 486:215–221. doi: 10.1038/nature11209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, Knights D, Gajer P, Ravel J, Fierer N, Gordon JI, Knight R. 2011. Moving pictures of the human microbiome. Genome Biol 12:R50. doi: 10.1186/gb-2011-12-5-r50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Costello EK, Lauber CL, Hamady M, Fierer N, Gordon JI, Knight R. 2009. Bacterial community variation in human body habitats across space and time. Science 326:1694–1697. doi: 10.1126/science.1177486. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hares DR. 2015. Selection and implementation of expanded CODIS core loci in the United States. Forensic Sci Int Genet 17:33–34. doi: 10.1016/j.fsigen.2015.03.006. [DOI] [PubMed] [Google Scholar]
- 37.Lowe A, Murray C, Whitaker J, Tully G, Gill P. 2002. The propensity of individuals to deposit DNA and secondary transfer of low level DNA from individuals to inert surfaces. Forensic Sci Int 129:25–34. doi: 10.1016/S0379-0738(02)00207-4. [DOI] [PubMed] [Google Scholar]
- 38.Warshauer DH, Marshall P, Kelley S, King J, Budowle B. 2012. An evaluation of the transfer of saliva-derived DNA. Int J Legal Med 126:851–861. doi: 10.1007/s00414-012-0743-1. [DOI] [PubMed] [Google Scholar]
- 39.Budolwe B, Eisenberg AJ. 2007. Forensic genetics, p 501–517. In Rimoin D, Connor J, Pyeritz R, Korf B (ed), Emery and Rimoin's principles and practice of medical genetics, 5th ed Elsevier, Philadelphia, PA. [Google Scholar]
- 40.Budowle B, Van Daal A. 2008. Forensically relevant SNP classes. Biotechniques 44:603–610. [DOI] [PubMed] [Google Scholar]
- 41.Honda K, Roewer L, de Knijff P. 1999. Male DNA typing from 25-year-old vaginal swabs using Y chromosomal STR polymorphisms in a retrial request case. J Forensic Sci 44:868–872. [PubMed] [Google Scholar]
- 42.Gill P, Whitaker J, Flaxman C, Brown N, Buckleton J. 2000. An investigation of the rigor of interpretation rules for STRs derived from less than 100 pg of DNA. Forensic Sci Int 112:17–40. doi: 10.1016/S0379-0738(00)00158-4. [DOI] [PubMed] [Google Scholar]
- 43.Budowle B, Hobson DL, Smerick JB, Smith JAL. 2001. Low copy number–consideration and caution, p 9–12. In Proceedings of the 12th International Symposium on Human Identification Promega Corporation, Madison, WI. [Google Scholar]
- 44.Budowle B, Eisenberg AJ, van Daal A. 2009. Validity of low copy number typing and applications to forensic science. Croat Med J 50:207–217. doi: 10.3325/cmj.2009.50.207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Grice EA, Kong HH, Conlan S, Deming CB, Davis J, Young AC, NISC Comparative Sequencing Program, Bouffard GG, Blakesley RW, Murray PR, Green ED, Turner ML, Segre JA. 2009. Topographical and temporal diversity of the human skin microbiome. Science 324:1190–1192. doi: 10.1126/science.1171700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Goga H. 2012. Comparison of bacterial DNA profiles of footwear insoles and soles of feet for the forensic discrimination of footwear owners. Int J Legal Med 126:815–823. doi: 10.1007/s00414-012-0733-3. [DOI] [PubMed] [Google Scholar]
- 47.Tims S, van Wamel W, Endtz HP, van Belkum A, Kayser M. 2010. Microbial DNA fingerprinting of human fingerprints: dynamic colonization of fingertip microflora challenges human host inferences for forensic purposes. Int J Legal Med 124:477–481. doi: 10.1007/s00414-009-0352-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fierer N, Hamady M, Lauber CL, Knight R. 2008. The influence of sex, handedness, and washing on the diversity of hand surface bacteria. Proc Natl Acad Sci U S A 105:17994–17999. doi: 10.1073/pnas.0807920105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Rajilić-Stojanović M, Heilig HGHJ, Tims S, Zoetendal EG, de Vos WM. 2013. Long-term monitoring of the human intestinal microbiota composition. Environ Microbiol 15:1146–1159. doi: 10.1111/1462-2920.12023. [DOI] [PubMed] [Google Scholar]
- 50.Martínez I, Muller CE, Walter J. 2013. Long-term temporal analysis of the human fecal microbiota revealed a stable core of dominant bacterial species. PLoS One 8:e69621. doi: 10.1371/journal.pone.0069621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Spor A, Koren O, Ley R. 2011. Unravelling the effects of the environment and host genotype on the gut microbiome. Nat Rev Microbiol 9:279–290. doi: 10.1038/nrmicro2540. [DOI] [PubMed] [Google Scholar]
- 52.Jakobsson HE, Jernberg C, Andersson AF, Sjölund-Karlsson M, Jansson JK, Engstrand L. 2010. Short-term antibiotic treatment has differing long-term impacts on the human throat and gut microbiome. PLoS One 5:e9836. doi: 10.1371/journal.pone.0009836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Juusola J, Ballantyne J. 2003. Messenger RNA profiling: a prototype method to supplant conventional methods for body fluid identification. Forensic Sci Int 135:85–96. doi: 10.1016/S0379-0738(03)00197-X. [DOI] [PubMed] [Google Scholar]
- 54.Zubakov D, Hanekamp E, Kokshoorn M, van Ijcken W, Kayser M. 2008. Stable RNA markers for identification of blood and saliva stains revealed from whole genome expression analysis of time-wise degraded samples. Int J Legal Med 122:135–142. doi: 10.1007/s00414-007-0182-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Frumkin D, Wasserstrom A, Budowle B, Davidson A. 2011. DNA methylation-based forensic tissue identification. Forensic Sci Int Genet 5:517–524. doi: 10.1016/j.fsigen.2010.12.001. [DOI] [PubMed] [Google Scholar]
- 56.Akutsu T, Motani H, Watanabe K, Iwase H, Sakurada K. 2012. Detection of bacterial 16S ribosomal RNA genes for forensic identification of vaginal fluid. Leg Med (Tokyo) 14:160–162. doi: 10.1016/j.legalmed.2012.01.005. [DOI] [PubMed] [Google Scholar]
- 57.Doi M, Gamo S, Okiura T, Nishimukai H, Asano M. 2014. A simple identification method for vaginal secretions using relative quantification of Lactobacillus DNA. Forensic Sci Int Genet 12:93–99. doi: 10.1016/j.fsigen.2014.05.005. [DOI] [PubMed] [Google Scholar]
- 58.Fleming RI, Harbison S. 2010. The use of bacteria for the identification of vaginal secretions. Forensic Sci Int Genet 4:311–315. doi: 10.1016/j.fsigen.2009.11.008. [DOI] [PubMed] [Google Scholar]
- 59.Nakanishi H, Kido A, Ohmori T, Takada A, Hara M, Adachi N, Saito K. 2009. A novel method for the identification of saliva by detecting oral streptococci using PCR. Forensic Sci Int 183:20–23. doi: 10.1016/j.forsciint.2008.10.003. [DOI] [PubMed] [Google Scholar]
- 60.Choi A, Shin K-J, Yang WI, Lee HY. 2014. Body fluid identification by integrated analysis of DNA methylation and body fluid-specific microbial DNA. Int J Legal Med 128:33–41. doi: 10.1007/s00414-013-0918-4. [DOI] [PubMed] [Google Scholar]
- 61.Smith S, Eng R, Padberg FJ Jr. 1996. Survival of nosocomial bacteria at ambient temperature. J Med 27:293–302. [PubMed] [Google Scholar]
- 62.Brooke JS, Annand JW, Hammer A, Dembkowski K, Shulman ST. 2009. Investigation of bacterial pathogens on 70 frequently used environmental surfaces in a large urban U.S. university. J Environ Health 71:17–22. [PubMed] [Google Scholar]
- 63.Scheidegger C, Zimmerli W. 1989. Infectious complications in drug addicts: seven-year review of 269 hospitalized narcotics abusers in Switzerland. Rev Infect Dis 11:486–493. doi: 10.1093/clinids/11.3.486. [DOI] [PubMed] [Google Scholar]
- 64.Holbrook KA, Klein RS, Hartel D, Elliott D, Barsky TB, Rothschild LHA, Lowy FD. 1997. Staphylococcus aureus nasal colonization in HIV-seropositive and HIV-seronegative drug users. J Acquir Immune Defic Syndr Hum Retrovirol 16:301–306. [DOI] [PubMed] [Google Scholar]
- 65.Kluytmans J, van Belkum A, Verbrugh H. 1997. Nasal carriage of Staphylococcus aureus: epidemiology, underlying mechanisms, and associated risks. Clin Microbiol Rev 10:505–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Quagliarello B, Cespedes C, Miller M, Toro A, Vavagiakis P, Klein RS, Lowy FD. 2002. Strains of Staphylococcus aureus obtained from drug-use networks are closely linked. Clin Infect Dis 35:671–677. doi: 10.1086/342196. [DOI] [PubMed] [Google Scholar]
- 67.Toppinen M, Perdomo MF, Palo JU, Simmonds P, Lycett SJ, Söderlund-Venermo M, Sajantila A, Hedman K. 2015. Bones hold the key to DNA virus history and epidemiology. Sci Rep 5:17226. doi: 10.1038/srep17226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Schuenemann VJ, Bos K, DeWitte S, Schmedes S, Jamieson J, Mittnik A, Forrest S, Coombes BK, Wood JW, Earn DJD, White W, Krause J, Poinar HN. 2011. Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death. Proc Natl Acad Sci U S A 108:E746–752. doi: 10.1073/pnas.1105107108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Bos KI, Schuenemann VJ, Golding GB, Burbano HA, Waglechner N, Coombes BK, McPhee JB, DeWitte SN, Meyer M, Schmedes S, Wood J, Earn DJD, Herring DA, Bauer P, Poinar HN, Krause J. 2011. A draft genome of Yersinia pestis from victims of the Black Death. Nature 478:506–510. doi: 10.1038/nature10549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Wagner DM, Klunk J, Harbeck M, Devault A, Waglechner N, Sahl JW, Enk J, Birdsell DN, Kuch M, Lumibao C, Poinar D, Pearson T, Fourment M, Golding B, Riehm JM, Earn DJD, DeWitte S, Rouillard J-M, Grupe G, Wiechmann I, Bliska JB, Keim PS, Scholz HC, Holmes EC, Poinar H. 2014. Yersinia pestis and the Plague of Justinian 541–543 AD: a genomic analysis. Lancet Infect Dis 14:319–326. doi: 10.1016/S1473-3099(13)70323-2. [DOI] [PubMed] [Google Scholar]
- 71.Fodor AA, DeSantis TZ, Wylie KM, Badger JH, Ye Y, Hepburn T, Hu P, Sodergren E, Liolios K, Huot-Creasy H, Birren BW, Earl AM. 2012. The “most wanted” taxa from the human microbiome for whole genome sequencing. PLoS One 7:e41294. doi: 10.1371/journal.pone.0041294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Kuczynski J, Lauber CL, Walters WA, Parfrey LW, Clemente JC, Gevers D, Knight R. 2012. Experimental and analytical tools for studying the human microbiome. Nat Rev Genet 13:47–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Schommer NN, Gallo RL. 2013. Structure and function of the human skin microbiome. Trends Microbiol 21:660–668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Cole JR, Chai B, Farris RJ, Wang Q, Kulam SA, McGarrell DM, Garrity GM, Tiedje JM. 2005. The Ribosomal Database Project (RDP-II): sequences and tools for high-throughput rRNA analysis. Nucleic Acids Res 33:D294–D296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL. 2006. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72:5069–5072. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: improved data processing and Web-based tools. Nucleic Acids Res 41:D590–D596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Janda JM, Abbott SL. 2007. 16S rRNA gene sequencing for bacterial identification in the diagnostic laboratory: pluses, perils, and pitfalls. J Clin Microbiol 45:2761–2764. doi: 10.1128/JCM.01228-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Suzuki MT, Giovannoni SJ. 1996. Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR. Appl Environ Microbiol 62:625–630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Soergel DAW, Dey N, Knight R, Brenner SE. 2012. Selection of primers for optimal taxonomic classification of environmental 16S rRNA gene sequences. ISME J 6:1440–1444. doi: 10.1038/ismej.2011.208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Klappenbach JA, Dunbar JM, Schmidt TM. 2000. rRNA operon copy number reflects ecological strategies of bacteria. Appl Environ Microbiol 66:1328–1333. doi: 10.1128/AEM.66.4.1328-1333.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Wang Y, Zhang Z, Ramanan N. 1997. The actinomycete Thermobispora bispora contains two distinct types of transcriptionally active 16S rRNA genes. J Bacteriol 179:3270–3276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Fox GE, Wisotzkey JD, Jurtshuk P Jr. 1992. How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity. Int J Syst Bacteriol 42:166–170. doi: 10.1099/00207713-42-1-166. [DOI] [PubMed] [Google Scholar]
- 83.Asai T, Zaporojets D, Squires C, Squires CL. 1999. An Escherichia coli strain with all chromosomal rRNA operons inactivated: complete exchange of rRNA genes between bacteria. Proc Natl Acad Sci U S A 96:1971–1976. doi: 10.1073/pnas.96.5.1971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Schouls LM, Schot CS, Jacobs JA. 2003. Horizontal transfer of segments of the 16S rRNA genes between species of the Streptococcus anginosus group. J Bacteriol 185:7241–7246. doi: 10.1128/JB.185.24.7241-7246.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Maiden MCJ, Bygraves JA, Feil E, Morelli G, Russell JE, Urwin R, Zhang Q, Zhou J, Zurth K, Caugant DA, Feavers IM, Achtman M, Spratt BG. 1998. Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci U S A 95:3140–3145. doi: 10.1073/pnas.95.6.3140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Aanensen DM, Spratt BG. 2005. The multilocus sequence typing network: mlst.net. Nucleic Acids Res 33:W728–W733. doi: 10.1093/nar/gki415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Jolley KA, Chan M-S, Maiden MCJ. 2004. mlstdbNet–distributed multi-locus sequence typing (MLST) databases. BMC Bioinformatics 5:86. doi: 10.1186/1471-2105-5-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Jolley KA, Maiden MCJ. 2010. BIGSdb: scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics 11:595. doi: 10.1186/1471-2105-11-595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Singh P, Foley SL, Nayak R, Kwon YM. 2013. Massively parallel sequencing of enriched target amplicons for high-resolution genotyping of Salmonella serovars. Mol Cell Probes 27:80–85. doi: 10.1016/j.mcp.2012.11.004. [DOI] [PubMed] [Google Scholar]
- 90.Singh P, Foley SL, Nayak R, Kwon YM. 2012. Multilocus sequence typing of Salmonella strains by high-throughput sequencing of selectively amplified target genes. J Microbiol Methods 88:127–133. doi: 10.1016/j.mimet.2011.11.004. [DOI] [PubMed] [Google Scholar]
- 91.Inouye M, Conway TC, Zobel J, Holt KE. 2012. Short read sequence typing (SRST): multi-locus sequence types from short reads. BMC Genomics 13:338. doi: 10.1186/1471-2164-13-338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Larsen MV, Cosentino S, Rasmussen S, Friis C, Hasman H, Marvig RL, Jelsbak L, Sicheritz-Pontén T, Ussery DW, Aarestrup FM, Lund O. 2012. Multilocus sequence typing of total-genome-sequenced bacteria. J Clin Microbiol 50:1355–1361. doi: 10.1128/JCM.06094-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Keim P, Van Ert MN, Pearson T, Vogler AJ, Huynh LY, Wagner DM. 2004. Anthrax molecular epidemiology and forensics: using the appropriate marker for different evolutionary scales. Infect Genet Evol 4:205–213. doi: 10.1016/j.meegid.2004.02.005. [DOI] [PubMed] [Google Scholar]
- 94.Jolley KA, Bliss CM, Bennett JS, Bratcher HB, Brehony C, Colles FM, Wimalarathna H, Harrison OB, Sheppard SK, Cody AJ, Maiden MCJ. 2012. Ribosomal multilocus sequence typing: universal characterization of bacteria from domain to strain. Microbiology 158:1005–1015. doi: 10.1099/mic.0.055459-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Baldwin CD, Howe GB, Sampath R, Blyn LB, Matthews H, Harpin V, Hall TA, Drader JJ, Hofstadler SA, Eshoo MW, Rudnick K, Studarus K, Moore D, Abbott S, Janda JM, Whitehouse CA. 2009. Usefulness of multilocus polymerase chain reaction followed by electrospray ionization mass spectrometry to identify a diverse panel of bacterial isolates. Diagn Microbiol Infect Dis 63:403–408. doi: 10.1016/j.diagmicrobio.2008.12.012. [DOI] [PubMed] [Google Scholar]
- 96.Wu M, Eisen JA. 2008. A simple, fast, and accurate method of phylogenomic inference. Genome Biol 9:R151. doi: 10.1186/gb-2008-9-10-r151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Liu B, Gibbons T, Ghodsi M, Treangen T, Pop M. 2011. Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences. BMC Genomics 12:S4. doi: 10.1186/1471-2164-12-S2-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Wang Z, Wu M. 2013. A phylum-level bacterial phylogenetic marker database. Mol Biol Evol 30:1258–1262. doi: 10.1093/molbev/mst059. [DOI] [PubMed] [Google Scholar]
- 99.Wu D, Jospin G, Eisen JA. 2013. Systematic identification of gene families for use as “markers” for phylogenetic and phylogeny-driven ecological studies of bacteria and archaea and their major subgroups. PLoS One 8:e77033. doi: 10.1371/journal.pone.0077033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Darling AE, Jospin G, Lowe E, Matsen FA IV, Bik HM, Eisen JA. 2014. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ 2:e243. doi: 10.7717/peerj.243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Mende DR, Sunagawa S, Zeller G, Bork P. 2013. Accurate and universal delineation of prokaryotic species. Nat Methods 10:881–884. doi: 10.1038/nmeth.2575. [DOI] [PubMed] [Google Scholar]
- 102.Sunagawa S, Mende DR, Zeller G, Izquierdo-Carrasco F, Berger SA, Kultima JR, Coelho LP, Arumugam M, Tap J, Nielsen HB, Rasmussen S, Brunak S, Pedersen O, Guarner F, de Vos WM, Wang J, Li J, Doré J, Ehrlich SD, Stamatakis A, Bork P. 2013. Metagenomic species profiling using universal phylogenetic marker genes. Nat Methods 10:1196–1199. doi: 10.1038/nmeth.2693. [DOI] [PubMed] [Google Scholar]
- 103.Fish JA, Chai B, Wang Q, Sun Y, Brown CT, Tiedje JM, Cole JR. 2013. FunGene: the functional gene pipeline and repository. Front Microbiol 4:1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Budowle B, Schutzer SE, Morse SA, Martinez KF, Chakraborty R, Marrone BL, Messenger SL, Murch RS, Jackson PJ, Williamson P, Harmon R, Velsko SP. 2008. Criteria for validation of methods in microbial forensics. Appl Environ Microbiol 74:5599–5607. doi: 10.1128/AEM.00966-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Budowle B, Connell ND, Bielecka-Oder A, Colwell RR, Corbett CR, Fletcher J, Forsman M, Kadavy DR, Markotic A, Morse SA, Murch RS, Sajantila A, Schmedes SE, Ternus KL, Turner SD, Minot S. 2014. Validation of high throughput sequencing and microbial forensics applications. Invest Genet 5:9. doi: 10.1186/2041-2223-5-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Schutzer SE, Keim P, Czerwinski K, Budowle B. 2009. Use of forensic methods under exigent circumstances without full validation. Sci Transl Med 1:8cm7. [DOI] [PubMed] [Google Scholar]
- 107.Keim P, Budowle B, Ravel J. 2011. Microbial forensic investigation of the anthrax-letter attacks, p 15–25. In Budowle B, Schutzer SE, Breeze RG, Keim PS, Morse SA (ed), Microbial forensics, 2nd ed Academic Press, Amsterdam, The Netherlands. [Google Scholar]
- 108.National Research Council. 2011. Review of the scientific approaches used during the FBI's investigation of the 2001 anthrax letters. The National Academies Press, Washington, DC. [PubMed] [Google Scholar]
- 109.Price EP, Seymour ML, Sarovich DS, Latham J, Wolken SR, Mason J, Vincent G, Drees KP, Beckstrom-Sternberg SM, Phillippy AM, Koren S, Okinaka RT, Chung W-K, Schupp JM, Wagner DM, Vipond R, Foster JT, Bergman NH, Burans J, Pearson T, Brooks T, Keim P. 2012. Molecular epidemiologic investigation of an anthrax outbreak among heroin users, Europe. Emerg Infect Dis 18:1307–1313. doi: 10.3201/eid1808.111343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Hasan NA, Choi SY, Eppinger M, Clark PW, Chen A, Alam M, Haley BJ, Taviani E, Hine E, Su Q, Tallon LJ, Prosper JB, Furth K, Hoq MM, Li H, Fraser-Liggett CM, Cravioto A, Huq A, Ravel J, Cebula TA, Colwell RR. 2012. Genomic diversity of 2010 Haitian cholera outbreak strains. Proc Natl Acad Sci U S A 109:E2010–E2017. doi: 10.1073/pnas.1207359109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Hendriksen RS, Price LB, Schupp JM, Gillece JD, Kaas RS, Engelthaler DM, Bortolaia V, Pearson T, Waters AE, Upadhyay BP, Shrestha SD, Adhikari S, Shakya G, Keim PS, Aarestrup FM. 2011. Population genetics of Vibrio cholerae from Nepal in 2010: evidence on the origin of the Haitian outbreak. mBio 2(4):e00157-11. doi: 10.1128/mBio.00157-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Gire SK, Goba A, Andersen KG, Sealfon RSG, Park DJ, Kanneh L, Jalloh S, Momoh M, Fullah M, Dudas G, Wohl S, Moses LM, Yozwiak NL, Winnicki S, Matranga CB, Malboeuf CM, Qu J, Gladden AD, Schaffner SF, Yang X, Jiang P-P, Nekoui M, Colubri A, Coomber MR, Fonnie M, Moigboi A, Gbakie M, Kamara FK, Tucker V, Konuwa E, Saffa S, Sellu J, Jalloh AA, Kovoma A, Koninga J, Mustapha I, Kargbo K, Foday M, Yillah M, Kanneh F, Robert W, Massally JLB, Chapman SB, Bochicchio J, Murphy C, Nusbaum C, Young S, Birren BW, Grant DS, Scheiffelin JS, et al. 2014. Genomic surveillance elucidates Ebola virus origin and transmission during the 2014 outbreak. Science 345:1369–1372. doi: 10.1126/science.1259657. [DOI] [PMC free article] [PubMed] [Google Scholar]