Skip to main content
Journal of Assisted Reproduction and Genetics logoLink to Journal of Assisted Reproduction and Genetics
. 2024 Oct 21;42(1):15–37. doi: 10.1007/s10815-024-03292-6

Methodological approaches in 16S sequencing of female reproductive tract in fertility patients: a review

I M Davidson 1, E Nikbakht 1, L M Haupt 2,3,4,5, K J Ashton 1, P J Dunn 1,
PMCID: PMC11805751  PMID: 39433639

Abstract

Background

The female genital tract microbiome has become a particular area of interest in improving assisted reproductive technology (ART) outcomes with the emergence of next-generation sequencing (NGS) technology. However, NGS assessment of microbiomes currently lacks uniformity and poses significant challenges for accurate and precise bacterial population representation.

Objective

As multiple NGS platforms and assays have been developed in recent years for microbiome investigation—including the advent of long-read sequencing technologies—this work aimed to identify current trends and practices undertaken in female genital tract microbiome investigations.

Results

Areas like sample collection and transport, DNA extraction, 16S amplification vs. metagenomics, NGS library preparation, and bioinformatic analysis demonstrated a detrimental lack of uniformity. The lack of uniformity present is a significant limitation characterised by gap discrepancies in generation and interpretation of results. Minimal consistency was observed in primer design, DNA extraction techniques, sample transport, and bioinformatic analyses.

Conclusion

With third-generation sequencing technology highlighted as a promising tool in microbiota-based research via full-length 16S rRNA sequencing, there is a desperate need for future studies to investigate and optimise methodological approaches of the genital tract microbiome to ensure better uniformity of methods and results interpretation to improve clinical impact.

Graphical Abstract

graphic file with name 10815_2024_3292_Figa_HTML.jpg

Keywords: ART (assisted reproduction technologies), IVF (in vitro fertilisation), 16S rRNA, NGS (next-generation sequencing), Bioinformatics

Introduction

Infertility is defined as failure to fall pregnant following 12 months of unprotected sexual intercourse [1]. However, for women over 35 years of age, infertility can be diagnosed at six months [2]. Causes of infertility are many and varied and can involve male or female factors or a combination of both factors. Male factors involve sperm abnormalities, including decreased concentration, reduced motility, and irregular morphology [3]. For females, common causes of infertility can be linked to defects in ovulation, transport of ovum, implantation, and tubule diseases [4, 5]. Lifestyle factors such as smoking, obesity or being underweight, alcohol use, poor diet, and strenuous exercise also contribute to infertility in both males and females [6, 7]. Researchers have recently begun hypothesising whether the vaginal microbiome may also contribute to infertility.

The vaginal microbiome is pivotal in maintaining and promoting homeostasis in the vagina whilst out-competing pathogenic bacteria [8]. It is a dynamic environment that changes with age, pregnancy, hormones, menstruation, sexual activity, hygiene practices, and diseases that alter the microbial composition [9]. Due to the dynamic nature of microbiota, there is no definition of a ‘normal’ microbiome and the term healthy is used. This is typically defined by good pregnancy outcomes and the absence of disease and various infections [10]. A healthy vaginal microbiome interacts mutualistically with the host and bacteria by exchanging nutrients and shelter for competition and protection [11]. The host provides a humid and warm environment for the bacteria, whilst the native microbiota manufactures antimicrobial and anti-inflammatory factors for an initial line of defence against pathogenic bacteria [12].

Just as having a ‘healthy’ vaginal microbiome is pivotal for reproductive health, dysbiosis can have an impact on the results of assisted reproduction technology (ART) treatments, negatively impacting ART outcomes [13]. An example of this has been seen where inflammatory infections like bacterial vaginosis have been correlated with preclinical pregnancy loss. The presence of abnormal flora or bacterial vaginosis-associated microbes are commonly associated with infertility with a systematic review performed by van Oostrum et al. found 20% of infertile patients present with bacterial vaginosis-associated microbes and 39% of infertile patients present with abnormal flora [14]. Furthermore, women with bacterial vaginosis have a higher risk of pregnancy loss during in vitro fertilisation-embryo transfer (IVF-ET) cycles along with a decreased implantation rate, particularly if pathogenic microbes like Chlamydia are present [15]. Despite this, women with previous bacterial vaginosis prior to IVF have comparable pregnancy rates to women with no history of infections [16]. Interestingly, Kong et al. found that the increased incidence of Gardnerella vaginalis and Atopobium vaginae caused a failure to fall pregnant following embryo transfer or patients experienced a miscarriage [17]. Infections ascending from the vagina into the upper genital tract, particularly by G. vaginalis and A. vaginae, have been suggested as a major contributor to why biochemical pregnancy is less likely in women with an abnormal microbiome.

With a link between the female genital tract microbiome and reproductive outcomes, testing is needed to determine microbiome composition prior to subsequent procedures. Testing such as Nugent scoring and traditional culture methods require considerable effort, are time-consuming, and face bias in the selectivity of growth rates [18]. For this reason, molecular-based techniques such as next-generation sequencing (NGS) have rapidly become the gold standard for microbiome analysis [10, 18, 19]. More recently, the application of NGS has enhanced our understanding of the human microbiome composition [20]. NGS offers a remarkable depth and high-throughput accumulation of sequence information rapidly assessed from a single sample. NGS also allows for the classification and identification of microbes not identifiable through traditional culture methods, allowing the identification of new unique species.

Just as sequencing methods can influence the diversity of results, DNA extraction methods can also heavily influence the microbial diversity identified from a sample. DNA extraction methods remain the most time-consuming and cause of experimental variability in 16S rRNA NGS analysis, despite recent improvements to reduce errors and biases from the datasets [21]. Many factors influence variability between DNA extraction methods, including cell wall structure, reagent contamination, type of extraction (manual or automatic), and differences between laboratory staff and instrumentation used [22].

This review aims to critically evaluate the methodological approaches used in testing the female reproductive tract in patients undergoing fertility treatment, from the initial sampling site and collection technique through to the bioinformatic tools and pipeline development for analysis and interpretation of sequencing data. We carefully examine the associated bias, limitations, and advantages of each stage of the methodological approach to provide an optimal and uniform procedural approach for sequencing the genital tract microbiome in patients undergoing fertility treatment. In providing this approach, this review also aims to influence subsequent investigations and testing of the female reproductive tract in patients undergoing fertility treatment to inform and improve subsequent clinical practices and outcomes.

Search strategies

An online search was performed using PubMed and google scholar databases of all international publications containing ‘microbiome OR microbiota OR microflora’ AND ‘infertility OR fertility OR sterility OR assisted reproduction OR recurrent implantation failure OR IVF OR in vitro fertilisation’ AND ‘next generation sequencing OR NGS OR 16S’ AND ‘reproductive tract OR genital tract OR vagina OR uterus OR endometrial OR cervical OR urogenital’. Studies were then assessed and used based on inclusion and exclusion criteria (Table 1).

Table 1.

Inclusion and exclusion criteria used for selecting articles to be used in the review of 16S rRNA sequencing of the female reproductive tract in patients undergoing fertility treatment

Inclusion criteria Exclusion criteria
Publishing date Studies published from 2018 onwards (excluding articles in Vitale et al. study) Studies published before 2018 (excluding NGS papers in Vitale et al. study)
Specimen type Human Non-human
Patient characteristics Female and undergoing any form of fertility treatment Non-females and females not undergoing fertility treatment
Anatomical region Studies assessing the human vaginal, cervical, uterine, or endometrial microbiome Studies assessing microbiomes that are not vaginal, endometrial, cervical, or uterine
Sequencing methodology Short-read or long-read NGS sequencing Testing techniques that are not NGS
Sequencing approach 16S or metagenomics Non-NGS sequencing

Only experimental types of research were selected in the PubMed search with reviews (including narrative or systematic reviews) excluded. Inclusion criteria included studies published from 2018 onwards (excluding articles in Vitale et al. paper); studies assessing the human vaginal, cervical, or endometrial microbiome via short-read or long-read sequencing; and studies using a 16S or metagenomics approach. Exclusion criteria included non-human studies, studies published before 2018 (excluding NGS papers in Vitale et al. study); studies assessing microbiomes that are not vaginal, endometrial, cervical, or uterine; and testing techniques that are not NGS. Studies published prior to the 2018 Vitale et al. study were excluded as they had provided some analysis on methodological approaches for DNA sequencing. However, as there was no conclusion provided on optimal methodological approaches, it was thought that an updated review was required. By completing the search strategy from 2018 to 2024, this yielded 29 studies that fit the aim of this review where we compared the methodologies employed for sampling, transport and storage, DNA extractions, and sequencing protocols from each original research article (Table 2).

Table 2.

Summary of all included articles identified highlighting the methodologies and key microorganisms identified within each study

Authors Sample site Sampling method DNA storage/transport method DNA extraction Sequencing approach Sequencing technology Bioinformatics method Key microorganisms identified
Verstraelen et al. [26] Endometrium Endometrial biopsy and endometrial sampler brush Samples stored at − 80 °C prior to transport and extraction Pre-extraction lysis performed. DNA extraction method based off Vilchez-Vargas et al V1-V2 region of 16S rRNA gene using 27F & 338R primers Illumina technology Mothur and R studio Bacteroidetes Pelomonas genus, Bacteroidetes xylanisolvebs, Bacteroidetes thetaiotaomicron, Bacteroidetes fragilis, Lactobacillus iners, Lactobacillus crispatus, and Prevotella amnii
Franasiak et al. [68] Endometrium Tip of a transfer catheter Cell-lysis performed and stored at − 20 °C prior to DNA extraction DNA purification after lysis via bead beating Metagenomics of 16S rRNA gene using Ion 16S™ Metagenomics Kit primer sets Ion PGM ™ System QIIME Flavobacterium and Lactobacillus genus
Tao et al. [70] Endometrium Distal section of a transfer catheter Lysis and extraction performed after sampling Pre-extraction lysis. Bacterial genomic DNA Isolation kit (Norgen Biotek Corp, ON, Canada) V4 region of 16S rRNA gene with Illumina V4 Workflow Primers Illumina QIIME Lactobacillus, Corynebacterium, Bifidobacterium, Staphylococcus, and Streptococcus
Wee et al. [93] Vagina, cervix, and endometrium Swabs Immediately transported and frozen at − 80 °C except for endometrial samples which incubated overnight at 4 °C and then frozen at − 80 °C Enzymatic and mechanical lysis performed prior to DNeasy Blood and Tissue Kit (QIAGEN) used V1-V3 regions of 16S rRNA using 27F & 534R and 27F Mix Illumina QIIME Gardnerella, Ureaplasma, and Lactobacillus
Greenbaum et al. [9] Cervix Swabs Stored at − 80 °C Mechanical cell lysis then Powersoi® DNA Isolation Kit (MoBio V3-V4 region of 16S rRNA gene with custom primers Illumina Mothur and R studio Lactobacillus, Gardnerella, Preveotella, Sneathia
Kyono et al. [51] Endometrium and vagina Vaginal discharge and endometrial fluid Samples put into DNA stabilising solution after collection DNA extracted using Agencourt genfind v2 Blood & Serum DNA Isolation Kit 16S rRNA metagenomics Illumina StatMate V software GardnerellaStreptococcusAtopobiumBifidobacteriumSneathiaPrevotella, and Staphylococcus
Kitaya et al. [73] Endometrium and vagina Endometrial fluid and vaginal swabs Samples added to DNA stabilising solution Pre-lysis performed with Proteinase K. Agencourt Genfind v2 Blood & Serum DNA Isolation Kit V4 region of 16S rRNA gene using 515F and 806rB primers Illumina QIIME Lactobacillus
Cheong et al. [67] Endometrium Swabs Following DNA extraction, sample stored at − 20 °C Mechanical lysis. DNA precipitated via isopropanol and sodium acetate V3-V4 region metagenomics of 16S rRNA gene using Illumina provided primers Illumina Mothur Lactobacillus, Streptococcus, Veillonellaceae, and Preveotella
Liu et al. [69] Endometrium Biopsy and fluid Samples stored at − 80 °C prior to extraction Extraction performed according to published protocol by Yuan et al. (Yuan et al., 2012) V4 region of 16S rRNA gene only V4 primers specified Illumina Mothur Lactobacillus, Atopobium, Bifidobacterium, Preveotella, and Staphylococcus
Bernabeu et al. [13] Vagina Swab Samples stored at − 80 °C prior to extraction and − 20 °C after extraction PureLink microbiome DNA purification kit (ThermoFisher) V3-V4 region of 16S rRNA gene with 357F and 806R primers Illumina QIIME2 Lactobacillus, Gardnerella, Streptococcus, Clostridium, and Ureaplasma
Diaz-Martínez et al. [35] Endometrium and vagina Swabs Frozen at − 80 °C after sampling The PureLink Microbiome DNA Purification Kit (ThermoFisher) 16S rRNA metagenomics using 357F & 806R primers Illumina QIIME2 Lactobacillus, Streptococcus, Preveotella, Ralastonia, Dialister, and Gardnerella
Hao et al. [37] Cervix Swabs Frozen at -80 °C after sampling PureLink Microbiome DNA extraction Kit (ThermoFisher) V3-V4 regions of the 16S rRNA gene using 341F & 806R primers Illumina QIIME Lactobacillus, Gardnerella, Desulfovibrio, and Preveotella
Okwelogy et al. [52] Vagina Swabs DNA preservation buffer DNA extraction was based on a protocol developed by uBiome Inc V4 region of 16S rRNA gene with 515F & 806R primers Illumina QIIME/UCLUST Lactobacillus, Preveotella, and Gardnerella
Karaer et al. [38] Vagina Swabs Swabs stored at − 80 °C The QiAamp DNA Microbiome Kit (QIAGEN) V3-V4 region of 16S rRNA gene with Gene-specific sequences were selected from the study by Klindworth et al. (Klindworth et al., 2013) Illumina QIIME2 Lactobacillus, Gardnerella, and Streptococcus
Ichiyama et al. [49] Endometrium and vagina Vaginal swabs and pipette insertion for endometrium samples Samples immediately soaked in OMNIgene® VAGINAL Agencourt genfind v2 Blood & Serum DNA Isolation Kit or MagNA Pure 24 Pathogen 200 hp 1.0 protocol V4 region of 16S rRNA using 515f & 806rB primers Illumina QIIME Lactobacillus, Streptococcus, Gardnerella, and Preveotella
Oberle et al. [71] Endometrium Leftover endometrial tissue from routine scratching Sample stored at − 20 °C The DNeasy Blood and Tissue Kit (QIAGEN) and PureLink™ Microbiome DNA Purification Kit (ThermoFisher) compared V1-V9 of 16S rRNA gene using primers 27F & 1492R Nanopore EPI2ME Lactobacillus
Lüth et al. [39] Vagina Swabs Swabs frozen at -80 °C after sampling. Isolated DNA stored at − 20 °C Mechanical lysis then DNeasy PowerSoil Kit (QIAGEN) Both V3/V4 and V1-V9 16S rRNA gene using V3F & V4R and 27F-YM & 1492R-Y primers Illumina (V3/V4) and Nanopore (V1-V9) Mothur Lactobacillus, Haemophilus, Gardnerella, Preveotella, and Bifidobacterium
Moreno et al. [40] Endometrium Endometrial fluid and biopsy Samples transferred into RNAlater solution and stored at − 80 °C Pre-digestion step included then QIAamp DNA Blood Mini Kit (QIAGEN) 16S rRNA metagenomics using primers according to the Ion 16S™ Metagenomics Kit (ThermoFisher) Ion PGM™ QIIME2 Gardnerella, Klebsiella, Lactobacillus, Streptococcus, Haemophilus, Neisseria, Chryseobacterium, Atopobium, and Bifidobacterium
Keburiya et al. [83] Uterus Fragment of embryo transfer catheter Sample placed in culture media and cultivated in an anaerobic box DNA extraction was performed using PREP-RAPID DNA Extraction Kit V3-V4 region of 16S rRNA gene using 357F & 806R primers Illumina QIIME Lactobacillus, Staphylococcus, Streptococcus, Gardnerella, Actinomyces, Bifidobacterium, and Candida
Patel et al. [41] Vagina Swabs Swabs stored at − 80 °C DNA extraction was performed using QIAamp DNA Stool Mini Kit (QIAGEN) V2-V3 region of 16S rRNA gene using 101F & 518R primers Ion GeneStudio™ S5 system QIIME2 Lactobacillus, Leptotrichia, Snethia, Gardnerella, and Preveotella
Reschini et al. [25] Endometrium and vagina Vaginal swab and endometrial fluid via aspiration with catheter Both stored at − 80 °C DNA was extracted via QIAamp DNA Microbiome kit (QIAGEN) V3-V4-V6 regions of the 16S rRNA gene using Microbiota solution B kit and degenerative primers Illumina MicrobAT Lactobacillus, Gardnerella, Bifidobacterium, Propinibacterium, Pelomonas, Pseudomonas, Streptococcus, and Escherichia
Villani et al. [53] Cervix Swabs Swabs put into DNA stabilisation buffer. Following isolation, DNA was stored at − 80 °C Mechanical lysis performed then DNA was extracted using QIAamp DNA Blood and Tissue kit (QIAGEN) 16S rRNA gene metagenomics with gene-specific sequences were selected from the study by Klindworth et al. (Klindworth et al., 2013) Illumina BBDUk Lactobacillus, Gardnerella, Atopobium, Bifidobacterium, and Streptococcus
Vajpeyee et al. [42] Vagina Swabs Swabs frozen at − 80 °C DNA extracted via DNeasy PowerSoil kit (QIAGEN) 16S rRNA metageonimcs with 4-primer method PCR Nanopore MinKNOW/GSTK Lactobacillus, Gardnerella, Streptococcus, Atopobium, Preveotella, Snethia, Bifidobacterium, and Clostridium
Komiya et al. [50] Vagina Vaginal lavage and swabs Lavage samples stored at − 30 °C. Swabs stored at room temperature following a preservative liquid DNA extracted via QIAamp UCP Pathogen Mini Kit (QIAGEN) V1-V9, V3-V4 regions of the 16S rRNA gene via metagenomics using S-D-Bact-0008-c-S-20, 341F & 806R primers with anchor sequences on each Nanopore GUPPY/Minimap2 Lactobacillus, Streptococcus, Atopobium, and Staphylococcus
Zou et al. [43] Endometrium Endometrial tissue via endometrium sampler Sample stored at − 80 °C DNA extraction was performed using QIAamp DNA Microbiome Kit (QIAGEN) V2, V4, V8 and V3, V6-7, V9 regions of the 16S rRNA gene via metagenomics using primers provided in 16S Ion Metagenomics Kit (ThermoFisher) Ion S5 sequencing system and Ion 530 chip QIIME2/Trimmomatic Lactobacillus, Gardnerella, Preveotella, Ralstonia, Anaerobacillus, and Streptococcus
Ono et al. [32] Uterine endometrium Aspiration via an aspiration tube Soaked in OMNIgene®- VAGINAL microbiome kit Not stated V4 using Illumina 16S metagenomic sequencing protocol Illumina JMP version 16 Lactobacillus, Ureaplasma, Dialister, and Streptococcus
Su et al. [85] Vagina, cervix, and endometrium Vaginal and cervical mucus swabs endometrial fluid through catheter Samples stored at − 80 °C TIANGEN Kit (G-BIO Bioinformatics Co., Ltd (DP316 China) V3-V4 16S rRNA sequencing using 341F & 805R Illumina QIIME2 Lactobacillus, Gardnerella, Sphingobium, Corynebacterium, Ralstonia, Enterobacter, and Enterococcus
Nishio et al. [86] Cervicovaginal mucus Swab Samples immediately stored at − 20 °C, then transferred to − 80 °C ChargeSwitch Forensic DNA Purification Kit (Thermo Fisher Scientific, Waltham, MA, USA) V3-V4 16S rRNA sequencing using Bakt_341F & Bakt_805R Illumina QIIME2 Lactobacillus, Streptococcus, Gardnerella, Escherichia coli, and Bifidobacterium
Wei et al. [87] Endometrium Catheter Not stated FastDNA Spin Kit (MP Biomedicals, Santa Ana, CA, USA) V3-V4 16S rRNA sequencing using 338F & 806R Illumina QIIME2 Rhodococcus, Pseudomonas, Archomobacter, and Lactobacillus

DNA preparation and influences on sequencing data

Sampling methods

Unique microbial niches exist in different anatomical sites throughout the female reproductive tract with the sampling site influencing specific microbial species abundance and population representations [23]. As such, differing sampling methods—including sampling techniques and differing anatomical site selection—have demonstrated differing microbial population representation and microbial abundance results (Fig. 1) [2426].

Fig. 1.

Fig. 1

The most common sampling techniques and predominant bacteria found in each anatomical site of the female reproductive tract including the uterus, endometrium, cervix, and vagina for microbiome testing (image created in BioRender)

Previous studies investigating the vaginal microbiome used primarily vaginal swabs, with vaginal discharge and lavage samples used less commonly. Interestingly, although limited, data suggests differences in microbial composition in swabs, discharge, and lavage sampling for the vaginal microbiome [24]. In addition, comparison of swabbing types has also been compared, such as cytobrush and a swab, wherein the relative abundance of bacterial species was found to be very similar between both methods [24]. Additionally, several studies have recorded comparable bacterial abundances and diversity in self-collected versus clinically collected vaginal samples, supporting adequate sample collection without the requirement of a gynaecological examination [27, 28]. However, it is possible for microbes to be missed and variation to occur if the exact vaginal location is not disclosed following self-collection [29].

In cervical microbiome studies, swabs obtained from cervical and endocervical areas were the most represented, followed by specific devices used to obtain a transcervical sample. Interestingly, whilst there have been minimal comparisons to date examining cervical sampling techniques for microbiome composition, a cytobrush with an extended-tip spatula has been demonstrated as the best combination of collection for collecting endocervical cells for subsequent analysis [24]. This technique may also be suitable as the best practice for determining microbiome composition, but there was no mention of best practices concerning microbiome collection and minimisation of contamination from the vagina [24]. There must be significantly more studies investigating cervical microbiome collection techniques before a conclusion can be made on best practices.

Sample collections to investigate endometrial microbiome composition most commonly rely on endometrial fluid samples, followed by endometrial tissue from scratching and biopsies. Endometrial sampling poses significant challenges as contamination from the vagina and cervix is extremely difficult to avoid, with different sampling approaches returning significantly varied results regarding microbial content [24]. Endometrial biopsy samples considerably reduce the likelihood of contamination from the lower reproductive tract; however, they are highly invasive and usually obtained from patients with pre-existing conditions not representative of a healthy population [30]. Due to this, obtaining endometrial microbiome samples that lack contamination in a non-invasive way is extremely difficult. Reschini et al. suggest endometrial microbiome sampling via an embryo catheter in conjunction with rigorous aseptic methodologies to minimise contamination and invasiveness [25]. Determining any source of potential contamination is vital as this can significantly influence microbial community abundance, even for the same anatomical site.

Uterine microbiome samples collected via the distal section of a transfer catheter and the distal section of a transfer catheter are commonly used in embryo transfer or via aspiration. Until recently, the uterine microbiome was thought to be sterile, and little is still known about the microbiome of the uterus. Recent studies have identified microbes from uterine samples; however, caution is suggested for these findings with a high possibility of endometrial contamination likely influencing these results [26, 31, 32]. In patients not participating in ARTs, uterine samples are commonly obtained during hysterectomies and hysterectomy procedures [31]. However, for subsequent ART procedures, uterine microbiome samples are only obtainable through the distal section of a transfer catheter, posing significant potential contamination from vaginal, cervical, and endometrial microbes [26, 31]. Transcervical approaches are currently used, with further research and optimisation required to obtain pure uterine microbiome samples [26].

Sample storage and transport

Delays in DNA extraction after sampling require samples to be stored effectively to ensure the composition of microbiota remains representative of the populations at the time of sampling. Frozen storage of samples (− 80 °C) is considered the ‘gold standard’ best practice to ensure preserved microbes and prevention of the microbiota from changing significantly [33, 34].

In terms of sample storage following collection across literature, the majority of microbiome composition samples were stored at either − 20 °C or − 80 °C. [13, 25, 3543]. Whilst freezing bacterial samples has shown to alter the cellular structure of gram-positive bacteria, causing a higher ratio of Firmicutes to Bacteroidetes, the change was of no statistical significance [44]. To ensure the sample is the most representative population of the sample site, immediately freezing samples to − 80 °C is considered the ‘gold standard’ to ensure preservation of microbes and prevention of the microbiota changing significantly at room temperature (21–24 °C) [32, 45]. Prakash et al. (2020) identified that, generally, microbial samples stored at room temperature for an extended period resulted in changes in microbial communities and overall composition [46]. Choo et al. also found substantial microbe compositional changes in samples stored at room temperature when compared to samples stored at 4 °C or − 80 °C [43]. Whilst sample storage temperatures can significantly impact microbial composition, sample storage methods can also heavily influence alpha and beta diversity [47]. Alpha and beta diversity allow for the microbial differences within a sample (alpha) or across multiple samples (beta) to be quantified [48].

Additionally, some studies opt to add some form of DNA stabilising solution following sample collection prior to storage or bypassing freezing to maintain microbial composition of samples [40, 4953]. The addition of DNA stabilising solutions before freezing have been found to preserve microbial DNA integrity, allowing for the transportation of microbiome samples [54]. However, different stabilising solutions can influence microbial alteration. As an example, Choo et al. found that samples stored in Tris–EDTA buffer showed the most significant bacterial change when compared to the non-commercial DNAgenotek produced OMNIgene.GUT that aims to optimise collection for nucleic acids [44, 55]. Samples stored in Tris–EDTA buffer demonstrated an increased operational taxonomic unit abundance of facultative anaerobes such as Proteobacteria and decreased abundance of Firmicutes and Actinobacteria [44]. The reduction in specifically Firmicute abundance would significantly impact female tract microbiome samples as the Lactobacillus genera results from the Firmicutes/Bacillota phylum and is not only the most abundant genera found but the biggest indicator of a healthy or dysbiotic environment [26, 56, 57]. It is also worth noting that the use of lysis buffers in sample storage has resulted in an overall reduction in DNA yield but not in DNA integrity [58]. Additionally, samples stored in an RNA preservation buffer such as RNA later have demonstrated significant changes in bacterial diversity when compared with storage alone at − 80 °C along with a reduced DNA extraction yield and purity [44].

Choosing to complete DNA extraction directly after sampling rather than DNA extraction directly following sample freezing has also shown different impacts on microbial composition. Vogtmann et al. found that compared to immediate DNA extraction, freezing microbial samples resulted in an increased abundance of the Firmicutes strain and a decreased abundance of Bacteriodetes microbes [59]. However, due to factors such as transport and time constraints, DNA extraction after sample storage is the most common method of choice [59].

Whilst sampling methods influence microbial composition, sampling methods are less influential over microbiome abundance and diversity than individual variation [60, 61]. Overall, it should be recommended that to enable the most accurate microbial representation and provide high DNA integrity and yield, it is best practise to immediately extract DNA from samples and store extracted DNA at − 80 °C. If DNA extraction is unable to be completed at that time, samples should be immediately frozen at samples at − 80 °C until DNA extraction can occur.

DNA extraction methods

Just as sampling methods influence microbial composition, DNA extraction methods also impact the presence and abundance of certain microbes and vary significantly from study to study (Table 1). Choosing the correct DNA extraction method is vital as vaginal samples contain high proportions of host DNA (> 90%), which can impair species detection [62]. Specific DNA extraction techniques are believed to enhance the yield of gram-positive bacteria but may not be as effective for gram-negative bacteria, potentially misrepresenting the true population composition. Due to the increased rigidity and strength of gram-positive bacterial cell walls, more extensive lysis protocols are required to obtain their DNA. However, these rigorous methods can over-degrade gram-negative bacteria, thereby reducing their yields [63]. The inverse is also demonstrated when lysis techniques are unable to effectively lyse gram-positive bacteria, leading to decreased DNA yields and misrepresenting population diversity [63, 64]. However, efficient mycobiome lysis is dependent on repeated bead-beating and enzymatic lysis which are integral for fungal DNA extraction [65].

Interestingly, some results demonstrated the use of DNA extraction kits non-specific to microbiomes and including a pre-lysis step, on average, increased DNA yield and microbial diversity when compared to DNA extraction kits specific to microbiomes and lacking a pre-lysis step [66]. The use of optimised methodologies from previously published articles or custom protocols is the most common choice for DNA extraction [26, 35, 39, 42, 43, 71]. For studies using DNA extraction kits, the most commonly used kit used is the DNeasy PowerSoil kit, followed by other kits such as Agencourt Genfind v2 Blood & Serum isolation kit, PureLink Microbiota DNA extraction kit, and the QIAamp DNA Microbiome kit [26, 31, 35, 39, 42, 43, 71]. A comparative study by Mattei et al. recorded that the QIAGEN DNeasy method achieved a higher yield and quality than a modified MoBio PowerSoil kit. However, the MoBio PowerSoil kit demonstrated higher microbial diversity when compared to the DNeasy method and higher diversity than the standard protocol [72]. A summary of these comparisons is provided in Table 3 including the extraction technique, supplier, key steps used in template isolation, and the perceived advantages and limitations of the kits commonly used.

Table 3.

Comparison of different DNA extraction techniques key steps, advantages, and limitations [39, 41, 43, 53]

Extraction technique Key steps Advantages Limitations References
DNeasy PowerSoil Kit (QIAGEN)

1. Samples lysed via chemical and mechanical homogenisation

2. Bead beating and inhibitor removal

3. Purified lysate mixed with DNA binding solution, filtered and washed

4. Silica-bound DNA eluted

Easy to replicate

Increased purity of isolated DNA

Removes contaminants efficiently

Effective for highly contaminated samples like stool or soil

Aggressive lysis throughout protocol can cause fragmentation of DNA [39, 53, 71, 88]
Agencourt Genfind v2 Blood & Serum Isolation Kit (Beckman Coulter)

1. Sample lysed via chemicals

2. Sample bound to paramagnetic beads

3. Beads separated from contaminants via washing

DNA eluted

Easy to replicate protocol

High DNA yield

It does not provide an expected concentration range for samples, and it can be unclear how much of a sample is required for DNA extraction [49, 51, 72]
PureLink Microbiota Kit (Thermo Fisher Scientific)

1. DNA lysed via bead beating and heat

2. DNA bound to silica-based column

3. DNA washed in a silica-based column

4. DNA eluted

Easy to replicate and different protocols for different sample types

Encompasses a wide variety of sample types

It recovers DNA with high purity and eliminates inhibitory compounds

Typical recovery is 0.1–5 µg DNA

Not high-throughput compatible [13, 35, 37, 71]
QIAamp DNA Microbiome Kit (QIAGEN)

1. Host cells are gently lysed to leave bacterial cells intact

2. Released host DNA is enzymatically degraded via enzymes

3. Combination of mechanical and chemical lysis used to degrade bacteria cells

4. Bacterial DNA bound to the silica membrane

4. Membrane washed to remove contaminants

5. DNA eluted

Depletes host DNA to maximise bacterial DNA

Allows for highly sensitive 16S rDNA-based microbiome analysis

Host DNA enzymatic degradation may degrade the DNA of bacteria released via initial lysis

DNA purity decreased as contaminant removal is minimal

[25, 38, 43]

Additionally, some studies choose to complete a pre-digestion step before DNA extraction consisting of enzymatic and/or mechanical cell lysis to degrade bacterial cell walls for difficult-to-lyse bacteria in an effort to increase the overall yield of isolated DNA [51, 67, 68, 73, 74]. However, Gill et al. suggest that the choice of lysis method does not influence the detection of microbial communities yet do recommend a uniform method be applied to all samples [75]. Interestingly, DNA extraction methods that include bead beating have been demonstrated to improve microbial identification and representation [76, 77]. However, the use of bead beating has also been identified to increase contamination in samples [75].

Choosing automated versus manual DNA extraction practises can influence DNA purity, yield, and the representation of microbes in downstream analysis. Whilst robotic DNA extraction methods can prove to be time efficient and consistent across large sample sizes, their use is rarely tested and validated for non-human DNA extractions. The use of liquid handling robots throughout DNA extraction has shown to increase cross contamination from well-to-well plate-based DNA extractions than manual DNA extraction. To reduce this risk, it is suggested that additional positive and negative controls be established [78].

Unfortunately, the lack of consistency in DNA extraction methods for microbial studies across the literature makes it extremely difficult to compare results between studies and can lead to inaccurate conclusions. As a result, no specific DNA extraction kits can be recommended as superior in microbial analysis. However, it is recommended that DNA is extracted manually rather than a well-to-well robotic process to decrease cross contamination. The use of DNA extraction kits with a bead-beating step is also highly recommended to enable accurate microbiome representation and degradation of certain cell components along with the inclusion of positive and negative controls [22].

Sequence methodologies and robustness

General overview of sequencing methods

16S rRNA sequencing is currently the most widespread method for determining microbiome composition in female reproductive tracts; however, some studies have highlighted the utility of using a metagenomics approach rather than amplicon sequencing. Amplicon sequencing entails amplifying a variable region of highly conserved genes such as the 16S ribosomal RNA (rRNA) gene in DNA via PCR, followed by NGS for species identification and quantification [79]. However, a metagenomic approach sequences all the DNA present within a sample for identification and quantification [80].

16S sequencing

Amplicon sequencing performed by NGS offers remarkable depth and high-throughput information, with large quantities of sequence information rapidly assessed from a single sample. The selected amplicon for sequencing must be present in all microbes and contain regions of high conservation (to allow for consistent primer binding) in addition to regions of high variability to provide species-specific signatures. For this reason, the 16S rRNA gene is commonly used [79]. The 16S rRNA gene is approximately 1500 bp in size and has several highly conserved regions interleaved with nine hypervariable regions that demonstrate considerable sequence diversity amongst different microbial species and genera (Fig. 2) [81].

Fig. 2.

Fig. 2

Structure of the 16S rRNA gene showing the conserved (blue) and nine hypervariable (red) regions. Popular primer sets (green, orange arrows) for different NGS platforms are shown—adapted with permission [82]

Many combinations exist for choice of 16S rRNA region to be used in investigations using amplicon sequencing. Targeting 1–2 hypervariable regions is the most popular length of choice with specifically the V3-V4 hypervariable regions amongst the most popular (Table 4) [13, 25, 3539, 48, 51, 67, 8387]. However, it is not uncommon for studies to examine a single hypervariable region or many pairs at once. For example, Kyono et al. investigated the V4, V1-V2, and V3-V4 hypervariable regions, whereas Tao et al. only investigated hypervariable region V2 [51, 70]. Regions V1, V2, and V3 are consistent throughout hypervariable region selection with the V1-V2 fragments of 16S rRNA possessing the highest resolution for species and genera taxa. Thus, choosing either of these regions for examination allows for a more specific separation of the sequencing data into species-level operational taxonomic units (OTUs) for subsequent downstream analysis [88, 89].

Table 4.

Comparison of each hypervariable region chosen, the associated primer sequences, the limitations and advantages of hypervariable region selection and potential primer design, and reference

Region chosen Primer Primer sequences Advantages Limitations Reference
V3-V4

Forward: 341F, 357F &515F

Reverse: 806R

Forward 341F: 5′-CCTACGGGNGGCWGCAG-3′

Forward 357F: 5′-CCTACGGGAGGCAGCAG-3′

Forward 515F: 5′-GTGCCAGCMGCCGCGGTAA-3′

Reverse: 5′- GGACTACHVGGGTWTCTAAT-3′

Results in the highest richness and alpha diversity and provides the most extensive microbial population for partial 16S rRNA sequencing Can result in low resolution for taxonomies causing a significantly lower discriminatory power [13, 31, 35, 3739, 51, 53, 67]
V4

Forward: 515F

Reverse: 806R

Forward: 5′-GTGCCAGCMGCCGCGGTAA

Reverse: 5′- GGACTACHVGGGTWTCTAAT-3′

Highly conserved region which can be easily targeted Only one hypervariable region decreases resolution and decreases discriminatory power. Species level identification reduced due to high conservation [51, 52, 69, 70, 73]
V1-V3

Forward: 27F

Reverse: 515R

Forward: 5′-AGA GTT TGA TCC TGG CTC AG-3′

Reverse: A 5′-TTACCGCGGCTGCTGG -3′

Accurately identifies microbes and has a high taxonomic resolution for partial 16S rRNA sequencing Can result in low resolution for taxonomies causing a significantly lower discriminatory power [36, 93, 99]
V1-V2

Forward: 27F

Reverse: 338R

Forward: 5′-AGA GTT TGA TCC TGG CTC AG-3′

Reverse: 5′-TGCTGCCTCCCGTAGGAGT-3′

Poses the highest level of resolution for species and genera for partial 16S rRNA sequencing Can result in low resolution for taxonomies causing a significantly lower discriminatory power. Does not include the most conserved (V4) or most variable regions (V3) [49, 51]
V1-V9

Forward: 27F & 27F-YM

Reverse:1492R & 1492R-Y

Forward 27F: 5′-AGA GTT TGA TCC TGG CTC AG-3′

Forward 27F-YM: 5′AGAGTTTGATYMTGGCTCAG-3′

Reverse 1492: 5′-GGT TAC CTT GTT ACG ACT T-3′

Reverse 1492R-Y: 5′-GGTTACCTTGTTAYGACTT-3′

Allows for the whole 16S rRNA gene to be sequenced increasing resolution and discriminatory power Primers required to sequence whole 16S rRNA gene (V1-V9), represent microbial populations inaccurately through misidentifying microbes [39, 71]
V2-V3

Forward: 27F

Reverse:338R

Forward: 5′-AGA GTT TGA TCC TGG CTC AG-3′

Reverse: 5′-TGCTGCCTCCCGTAGGAGT-3′

Poses a high level of resolution for species and genera for partial 16S rRNA sequencing Doesn’t encompass all hypervariable regions decreasing taxonomic resolution and accurate microbial representation [41]

Whilst not as popular as analysis of 1–2 hypervariable regions, Oberele et al. and Lüth et al. chose to analyse all hypervariable regions (V1–V9) [39, 71]. The choice of 1–2 hypervariable regions is most commonly used as the majority of NGS technologies are only capable of generating short read lengths, allowing for a small number of hypervariable regions to be investigated. However, this approach can create issues for accurate taxonomic classification and result in low resolution for taxonomies causing a significantly lower discriminatory power. In comparison to full-length 16S rRNA sequencing, the sequencing of 1–2 hypervariable regions also limits the ability to characterise microbial communities, across a potential nine hypervariable regions [20, 79, 90, 91].

Just as hypervariable region selection can be variable, as can primer design as shown in Table 3. When investigating hypervariable regions, V3-V4 primer pair 515F and 806R are commonly used with individual modifications made per study such as in Kitaya et al., where the primers were modified to give 515FB and 806RB where B distinguishes updated primers from older primers [73]. For those who examined all hypervariable regions—including Lüth et al. and Oberle et al.—primers 27F & 1492R were consistently used across these studies, with Lüth et al. slightly modifying the primers to act as degenerative primers, to allow for variability of bases during amplification [39, 71, 92].

Remarkably, Wee et al. modified the 27F primer by including a mix of seven other primers, including a four-oligo mix, Bifidobacterium specific sequence, Borrelia and Chlamydia sequence-specific sequences [93]. Unfortunately, the 27F primer demonstrated significant limitations in microbial detection and accurate representation, making it a significantly less robust choice than other primers, such as 515F. Lüth et al. highlight the limitations associated with the 27F primer which was particularly problematic and failed to identify Chlamydia trachomatis and overestimated Lactobacillus iners, significantly impacting the accuracy of microbial representations for clinical interpretation, making the 27F, a significantly less robust choice compared to other primers, such as 515F for microbial detection and representation [39].

Whilst similar primers are used and can be slightly altered, under-representation and over-representation of species within the sample pose a significant issue for studies assessing the pathophysiology and abundances of microbes in microbiome samples. The use of primers 27F and 1492R and subsequent modifications pose the most significant risk of this overestimation and underestimation of specific microbes due to nucleotide mismatches [39]. In a clinical setting, this can result in inaccurate treatment due to clinical misinterpretation, potentially causing harm to the patient or having no effect in treating the dysbiosis. Therefore, it is critical to design and use primers that equally report the microbes within samples to allow for the most accurate representation of patient microbiomes for subsequent analysis and clinical decisions.

Metagenomics

Metagenomic approaches can be used to identify different microbial populations without the inherent biases that arise from the amplification of a single gene by PCR and categorising the genes present in a microbial community via selective sequencing of all DNA extracted [94, 95]. The need for amplifying a single gene is removed in a metagenomic approach, allowing for vigorous microbial population estimations of composition and diversity [94]. The diversity and composition of microbial communities are analysed via data sequencing following the merging of the genomes of microflora [96]. However, as all DNA is indiscriminately sequenced, amplification can still demonstrate bias and a metagenomic approach demands an increase in the depth of sequencing, raising overall costs [80]. Whilst metagenomics allows for whole DNA sequencing, sequencing 1–2 hypervariable regions at a time via metagenomics remains a popular approach throughout female reproductive tract microbiome investigations using primers provided in metagenomics kits [35, 40, 43, 53, 67, 68, 74].

16S vs. metagenomic sequencing approach

The utilisation of both amplicon sequencing and metagenomics have limitations, with microbial community identification with amplicon sequencing potentially introducing bias due to unequal amplification of the 16S rRNA genes of some species [84]. A 16S rRNA sequencing approach limits taxonomic resolution due smaller length of the amplicon product. Additionally, the 16S amplicon sequencing approach only allows for identification of bacteria and archaea, excluding fungi, viruses, and micro-eukaryotes. Despite these limitations, due to the small amplicon size of 16S rRNA sequencing, higher read depths are more easily obtained compared to metagenomic sequencing, which reduces the overall cost of this approach. Metagenomic sequencing also has some limitations, in that it relies more heavily on a reference database for assembly. This has been shown to reduce its capacity in identifying novel microbes within samples and increases the false positive likelihood whilst also demanding more computationally [80, 84, 95, 96]. Interestingly, both approaches have been demonstrated to give different community structures for microbial communities in samples obtained from multiple body sites [84] Whilst 16S rRNA sequencing proves a time and cost-efficient method, a metagenomics approach of sequencing microbial DNA within a sample is recommended to increase representation of bacteria, archaea, fungi, viruses, and micro-eukaryotes especially within the female reproductive microbiome [80, 96].

Comparison of the different NGS sequencing platforms

NGS technology encompassing short-read sequencing includes Illumina technology and Ion Torrent Systems, along with third-generation sequencing technologies (long-read sequencing) such as provided by nanopore and PacBio technology. All of these technologies allow for high-throughput sequencing whilst dramatically reducing costs per sample compared to more iterative sequencing methods such as Sanger sequencing.

Illumina-based studies

The NGS technology provided by Illumina sequences DNA via bridge amplification PCR, with DNA sequenced via the incorporation of fluorescent nucleotides to determine the DNA sequence. Illumina sequencing allows for sequencing to occur from both ends of a DNA fragment, termed paired-end sequencing, generating sequence data with in-depth coverage and high quality [97, 98].

In contrast to the high variability of hypervariable region selection, primer design and synthesis applied to the examination of microbiome samples, with Illumina sequencing, emerged as the most common method of choice [36, 38, 49, 51, 52, 70, 80, 99]. This can likely be attributed to the low error rate, reported accuracy of > Q30, and low DNA input (50–1000 ng) requirement of the Illumina NGS protocol [100, 101]. A quality score—or Q-score—represents the probability that a base is called wrong [102, 103]. For example, if the reported accuracy of Illumina sequencing is > Q30, that means that there is a 1 in 1000 or 0.001% chance the base was called incorrectly.

However, Illumina sequencing does demonstrate low taxonomic resolution and decreased accuracy in taxonomic classification due to the inability to sequence the whole 16S rRNA gene in one fragment. Although Illumina NGS is currently the most common method of choice, it is limited to a maximum of 600 bp (2 × 300 bp) read lengths per sequencing run. This only allows for a maximum of two of the nine hypervariable regions to be amplified in one fragment, impacting genus-level identification and creating discrepancies in diversity dependent upon the choice of regions [20, 79, 90, 91, 100].

Ion Torrent-based studies

The Ion Torrent NGS technology amplification relies on emulsion PCR, a bead-based method by which PCR occurs in micelles. The Ion Torrent technology utilises a semiconductor chip that contains a pH sensor to detect nucleotides via hydrogen ion release which enables read lengths between 200 and 600 bp and run times between 2 and 4 h. Ion Torrent systems are most suited to sequencing RNA and DNA templates of smaller size and the characterisation of small variants [98].

Compared to the multitude of studies using Illumina technology, research using Ion Torrent systems less abundant, with Illumina the most utilised NGS technology. However, studies using Ion Torrent technologies have recorded limited resolution of the 16S rRNA sequence and were unable to accurately identify specific bacterial species [41]. Whilst species-level identification was not obtained, the use of Ion Torrent systems has allowed for robust characterisation at genera-level identification [41, 68]. Moreno et al. found that the platform might detect DNA contamination skewing the microbiome community interpretation and composition [40]. Whilst Ion Torrent systems have a low error rate and sequence significantly faster than the Illumina platforms compared (up to 56 h), this technology is unable to recognise long repetitive sequences as well as large structural variants and assemble genomes [41, 97, 98, 100].

Third-generation sequencing-based studies

The emergence of third-generation sequencing platforms including Oxford Nanopore (nanopore sequencing) and Pacific Biosciences (PacBio sequencing) technologies have allowed for advancements in DNA sequencing, enabling sequences of over 10 kb in length to be generated and analysed [97, 98]. PacBio technology sequences DNA via a single-stranded circular DNA where a polymerase binds to an adapter sequence and amplifies DNA with fluorescent nucleotides, which, when bound, emit a light pulse interpreted as a nucleotide sequence [98]. Nanopore sequencing utilises a nanoscale protein pore (nanopore) which acts as a biosensor with DNA driven through the nanopore via a motor protein, disrupting an electric current, which is then decoded using algorithms to give real-time sequencing [104].

Unfortunately, no studies have yet investigated the microbiome of the female reproductive tract undergoing fertility treatment using PacBio sequencing. However, findings from a study assessing PacBio full-length sequencing for vaginal bacterial 16S rRNA gene classification by Wagner et al. recommended pooling no more than ten samples to enable data differentiation. Wagner et al. also found that high stringency post-screening on the PacBio platform removed true positive OTUs and therefore removed species from the dataset, with low stringency data containing additional false positive species [105].

Studies using nanopore technology have found a time and cost-efficient method for microbiota sequencing of the human vaginal microbiome with precise results. Additionally, nanopore sequencing demonstrated a rapid means for identifying microbes with a high resolution, enabling real-time analysis [39, 42, 50, 71]. In addition, nanopore sequencing has been highlighted as a potential feasible method for interventional studies to develop a portable microbiome-based clinical decision platform. This aligns with literature suggesting the potential for nanopore sequencing to contribute to clinical decision-making regarding ARTs [39]. However, the current 16S rRNA barcoding kit provided by ONT failed to identify bacteria including G. vaginalis and Bifidobacterium, decreasing overall resolution and an inaccurate determination of the microbial population representation [15]. Lüth et al. highlight the application of nanopore sequencing to species-level identification along with a reduction in false positives by adding an Emu algorithm [39]. The Emu algorithm creates microbial abundance profiles from full-length 16S rRNA sequencing reads by utilising an expectation–maximisation algorithm [106].

Overall, third-generation sequencing—specifically nanopore sequencing—has shown promise as a possible future microbiota-based diagnostic platform and potential therapeutic indicator for future research. The utilisation of third-generation sequencing in the clinical may enable clinicians to perform microbial analysis accurately as well as more efficiently and quickly. However, further in-depth investigations comparing methodological approaches for 16S sequencing of the female upper reproductive tract need to be performed to ensure a clinically robust methodology.

Bioinformatic pipeline tools and workflows

Bioinformatic pipeline usage and development vary significantly across the literature, with a multitude of tools available for each step and an abundance of data analysis options. Additionally, bioinformatic tools differ depending on which technology was selected for the study, with software compatible with only short-read lengths, or specifically tailored to long-read lengths.

Databases

To achieve accurate taxonomic classification, to provide precise population representation, reference sequence databases and their associated taxonomical classifications need to be carefully selected. The choice of reference database can significantly influence the accuracy of taxonomic classification results, particularly from understudied environments and outdated databases [107]. The Greengenes, NCBI, and SILVA databases are currently the most commonly used throughout microbial investigations [13, 31, 38, 40, 51, 71]. The Greengenes database provides full-length 16S rRNA curated taxonomy based on de novo tree inference and is regularly updated [108]. As such, the application of nanopore technologies to studies would demonstrate different microbial communities due to the limited compatible bioinformatic tools currently available. In contrast, the NCBI database uses phylogeny classification, consisting of a single, hierarchically organised list that is compatible with short- and long-read data to identify microbial communities [109]. The NCBI taxonomy database also supplies the hierarchical taxon tree and standard nomenclature used by databases like GenBank, DNA Data Bank of Japan (DDBJ), and European Nucleotide Archive (ENA) that collectively form the International Nucleotide Sequence Database Collaboration (INSDC) which is adopted by most metagenomic classifiers. The SILVA database is a quality-controlled database of aligned rRNA sequences of bacteria, eukaryotes, and archaea. This database features regular releases that are made available rather than through continuous updates. As a result, the SILVA database has been highlighted as an ideal reference for high-throughput classification of NGS data for 16S rRNA approaches whereas the utilisation of INSDC or NCBI (home of GenBank) are recommended for metagenomic investigations [110, 111].

Workflows

Regardless of the methodological techniques, technology, and platform selected, all bioinformatic workflows begin with quality checks of raw data. The development of operational taxonomic units (OTUs) or amplicon sequence variants (ASVs) through read assignment appears in almost every 16S bioinformatic work for microbial classification. OTUs are formulated to enable sequencing errors to be distinguished from real nucleotide data. The formation of OTUs occurs via clustering reads on a pre-set identity threshold (commonly 97%), to repair sequencing errors [112]. Whilst OTUs were the most common throughout this review, the use of ASVs has overtaken the use of OTUs and also shown to significantly benefit direct compatibility between studies without the need for reanalysis of data with alpha diversity assessments influenced depending on the method used [113].

A metagenomic workflow can perform either read-based taxonomic classification or assembly based taxonomic classification which impact microbial identification differently [114]. The clustering process required in an assembly-based process or 16S approach can be bypassed by the trimmed reads being aligned to a known database of bacterial sequences for taxonomy. However, read-based taxonomic classification is commonly performed when handling well researched organisms with relatively small metagenomes and has shown to gather more functional annotation than an assembly-based approach. Assembly-based taxonomic classification, however, essentially group reads into specific genomes which are then assigned to bacteria and classified accordingly through reconstructing microbial genomes. This process has a superior performance than read-based taxonomic assignments and is more favourable for longer read lengths [114117].

Studies that utilise short-read technology tend to use bioinformatic tools such as QIIME/QIIME2, USSEARCH, and DADA2 software for filtering, data quality parameters, and read clustering. QIIME/QIIME2 software was the most consistent tool used in data trimming, read clustering, chimera identification, and deletion and clustering of reads into OTUs for short read technology with the use of QIIME or QIIME2 dependent on the time of data analysis as QIIME2 superseded QIIME [13, 35, 40, 41, 49, 83]. Filtering, trimming, and quality checking are essential in the bioinformatic pipeline as raw reads are filtered for overall low quality, adapter sequences, low quality 5′ and 3′ tails, and contaminants, which are later removed from the data to provide more accurate and precise results. Chimeric sequences are incomplete remnants of previous PCR, with identification and removal of these essential in any bioinformatic pipeline [112]. Read clustering assembles the data into groups and, using specific statistics, reconstructs the genome and following this clustering, taxonomic assignment can occur. The taxonomic assignment entails a taxon of each sequence identified for a particular tool to match the sequence to a known database [112]. OTU assignment involves the use of QIIME, QIIME2, and Ribosomal Database Classifier (RDP). Some studies specifically include the Recombination Detection Program, which employs a pairwise scanning approach to analyse and identify recombinant DNA amongst a group of aligned sequences [118]. Finally, R studio and various R packages continue to provide the most popular tools used for data analysis across both short- and long-read sequencing outputs [12, 26, 36, 37, 39, 53, 67, 68, 71]. R studio provides a standard tool utilised in both technology studies for graphical taxonomic representation and has been demonstrated as advantageous in data analysis and taxonomic datasets.

Interestingly, there remains a lack of studies using mock communities with which to compare findings, ultimately making the results less robust for both short- and long-read technologies. The use of a mock microbial community for sequencing protocols and bioinformatic methodologies alongside unknown samples is necessary to validate the accuracy of methods selected. Synthetic mock community standards should be included in studies investigating microbiomes to identify potential biases and increase robustness [75].

Long-read sequencing data is typically analysed using Oxford Nanopore Technologies (ONT) software, such as the cloud-based data-analysis platform EPI2ME, and the base calling software GUPPY. Unlike short-read technologies, nanopore sequencing requires base-calling either during sequencing or following sequencing to allow for data output [119, 120]. QIIME software was developed for short-read length data analysis and its associated technologies and is not compatible with long-read data that generated from nanopore sequencing [121]. As such the use of bioinformatic tools for nanopore sequencing data analysis need to be specifically developed to accurately interpret long-read length data for accurate community identification.

Data availability and deposition

Reproducibility and repeatability of results is essential in microbiome investigations to validate a study’s findings and assess the consistency of results. To evaluate reproducibility and repeatability, however, datasets require re-analysis by researchers which can only be achieved if studies publish raw data or publish raw data into a microbiome data repository [122]. Whilst deposition of raw data into repositories like the International Nucleotide Sequence Database Collaboration (INSDC) centre are positive steps to increasing raw data availability across microbiome investigations, the data deposited is not integrated or processed within the INSDC centre due to a lack of funding [122, 123]. Due to the lack of data integration and processing after data deposition, the sequences have no reusability beyond the scope of the original investigation and unable to be repeated and reproduced within different studies. The lack of raw data available for re-analysis hinders meta-analysis and systematic reviews attempting to compare different methodological techniques and technologies within microbiome investigations due to the minimal consistency in methods. Additionally, tools that cause high variability within analysis protocols—like bioinformatic workflow and database selection—cannot be accurately compared. For example, the determination of QIIME vs. QIIME2 accuracy cannot be determined without comparing re-analysed data [123].

Whilst a comprehensive microbiome databank remains the ideal standard for data deposition in microbiome research, the lack of funding presents a significant barrier to its realisation. In the interim, ensuring that all raw microbiome data are made publicly available is essential for enhancing data reproducibility and repeatability, thereby facilitating more robust benchmarking and comparison across different methodological approaches.

Conclusion

This study highlighted the methodological approaches across the literature used for DNA sequencing of the genital tract microbiome for women undergoing fertility treatment. Unfortunately, no uniformity was found in the reported DNA extraction techniques, primer design, and chosen hypervariable regions of the NGS approaches. However, the choice of bioinformatic pipeline used in these studies demonstrated that QIIME and QIIME2 software were the most commonly used tools in short-read sequencing analysis. It is important to note that no clear differences were observed in sequencing methodologies or DNA extraction approaches based on female reproductive tract anatomical sites. This highlights a potential limitation and an area for future investigation, as understanding the differences could lead to more tailored and effective microbial assessments.

This review recommends the application of nanopore sequencing in microbiome investigations, to increase sequencing accuracy, precision, and comparability between different studies using an assembly-based metagenomics approach with INSDC or NCBI (home of GenBank) databases. This study also identified a limited number of studies that used nanopore sequencing to assess vaginal, endometrial, uterine, or cervical microbiomes in women undergoing fertility treatment. In conclusion, future studies are required to develop a uniform and robust methodology for accurate and precise microbial representation for use in these samples. This could include standardised protocols for sampling, DNA extraction and bioinformatics, and development of a publicly available data repository for benchmarking as well as longitudinal studies to track changes in the microbiome over time.

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions

Declarations

We confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed. We further confirm that the order of authors listed in the manuscript has been approved by all of us. We confirm that we have given due consideration to the protection of intellectual property associated with this work and that there are no impediments to publication, including the timing of publication, with respect to intellectual property. In so doing, we confirm that we have followed the regulations of our institutions concerning intellectual property. We understand that the corresponding author is the sole contact for the Editorial process (including Editorial Manager and direct communications with the office). He is responsible for communicating with the other authors about progress, submissions of revisions and final approval of proofs. We confirm that we have provided a current, correct email address which is accessible by the corresponding author.

Conflict of interest

The authors declare no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Soave I, Lo Monte G, Marci R. Spontaneous pregnancy and unexplained infertility: a gift with many whys. North Am J Med Sci. 2012;4(10):512–3. 10.4103/1947-2714.102010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Eniola O, Adetola A, Abayomi B. A review of female infertility; important etiological factors and management. J Microbiol Biotechnol Res. 2012;2(3):379–85. [Google Scholar]
  • 3.Kumar N, Singh AK. Trends of male factor infertility, an important cause of infertility: a review of literature. J Hum Reprod Sci. 2015;8(4):191–6. 10.4103/0974-1208.170370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Anwar S, Anwar A. Infertility: a review on causes, treatment and management. Womens Health Gynecol. 2016;2(6):1–5. [Google Scholar]
  • 5.Soto E, Copperman AB. Factors affecting fertility. In: Arici A, Seli E, editors. Infertility. Wiley-Blackwell; 2011. pp. 1–7. 10.1002/9781444393958.ch1.
  • 6.Bala R, Singh V, Rajender S, Singh K. Environment, lifestyle, and female infertility. Reprod Sci Thousand Oaks, Calif. 2021;28(3):617–38. 10.1007/s43032-020-00279-3. [DOI] [PubMed] [Google Scholar]
  • 7.Biggs SN, Kennedy J, Lewis SL, Hearps S, O’Bryan MK, McLachlan R, von Saldern S, Chambers G, Halliday J. Lifestyle and environmental risk factors for unexplained male infertility: study protocol for Australian male infertility exposure (AMIE), a case-control study. Reprod Health. 2023;20(1):32. 10.1186/s12978-023-01578-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Huang B, Fettweis JM, Paul Brooks J, Jefferson KK, Buck GA. The changing landscape of the vaginal microbiome. Clin Lab Med. 2014;34(4):747–61. 10.1016/j.cll.2014.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Greenbaum S, Greenbaum G, Moran-Gilad J, Weintraub AY. Ecological dynamics of the vaginal microbiome in relation to health and disease. Am J Obstet Gynecol. 2019;220(4):324–35. 10.1016/j.ajog.2018.11.1089. [DOI] [PubMed] [Google Scholar]
  • 10.García-Velasco JA, Menabrito M, Bruna Catalán I. What fertility specialists should know about the vaginal microbiome: a review. Reprod Biomed Online. 2017;35(1):103–12. 10.1016/j.rbmo.2017.04.005. [DOI] [PubMed] [Google Scholar]
  • 11.Holdcroft AM, Ireland DJ, Payne MS. The vaginal microbiome in health and disease—what role do common intimate hygiene practices play? Microorganisms (Basel). 2023;11(2):298. 10.3390/microorganisms11020298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chen X, Lu Y, Chen T, Li R. The female vaginal microbiome in health and bacterial vaginosis. Front Cel Infect Microbiol. 2021;11:631972. 10.3389/fcimb.2021.631972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Bernabeu A, Lledo B, Díaz MC, Lozano FM, Ruiz V, Fuentes A, Lopez-Pineda A, Moliner B, Castillo JC, Ortiz JA, Ten J, Llacer J, Carratala-Munuera C, Orozco-Beltran D, Quesada JA, Bernabeu R. Effect of the vaginal microbiome on the pregnancy rate in women receiving assisted reproductive treatment. J Assist Reprod Genet. 2019;36(10):2111–9. 10.1007/s10815-019-01564-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.van Oostrum N, De Sutter P, Meys J, Verstraelen H. Risks associated with bacterial vaginosis in infertility patients: a systematic review and meta-analysis. Human Reprod. 2013;28(7):1809–15. 10.1093/humrep/det096. [DOI] [PubMed] [Google Scholar]
  • 15.Hyman R, Herndon C, Jiang H, Palm C, Fukushima M, Bernstein D, Vo K, Zelenko Z, Davis R, Giudice L. The dynamics of the vaginal microbiome during infertility therapy with in vitro fertilization-embryo transfer. Assist Reprod Technol. 2012;29(2):105–15. 10.1007/s10815-011-9694-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gaudoin M, Rekha P, Morris A, Lynch J, Acharya U. Bacterial vaginosis and past chlamydial infection are strongly and independently associated with tubal infertility but do not affect in vitro fertilization success rates. Fertil Steril. 1999;72(4):730–2. 10.1016/S0015-0282(99)00310-6. [DOI] [PubMed] [Google Scholar]
  • 17.Kong Y, Liu Z, Shang Q, Gao Y, Li X, Zheng C, Deng X, Chen T. The disordered vaginal microbiota is a potential indicator for a higher failure of in vitro fertilization. Front Med. 2020;7:217. 10.3389/fmed.2020.00217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Datcu R, Gesink D, Mulvad G, Montgomery-Andersen R, Rink E, Koch A, Ahrens P, Jensen JS. Vaginal microbiome in women from Greenland assessed by microscopy and quantitative PCR. BMC Infect Dis. 2013;13(1):480. 10.1186/1471-2334-13-480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.van de Wijgert J, Jespers V. The global health impact of vaginal dysbiosis. Res Microbiol; Spec Issue Vaginal Microbiol. 2017;168(9):859–64. 10.1016/j.resmic.2017.02.003. [DOI] [PubMed] [Google Scholar]
  • 20.Hang J, Desai V, Zavaljevski N, Yang Y, Lin X, Satya R, Martinez L, Blaylock J, Jarman R, Thomas S, Kuschner R. 16S rRNA gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles. Microbiome. 2014;2(1):31. 10.1186/2049-2618-2-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Davis A, Kohler C, Alsallaq R, Hayden R, Maron G, Margolis E. Improved yield and accuracy for DNA extraction in microbiome studies with variation in microbial biomass. BioTechniques. 2019;66(6):285–9. 10.2144/btn-2019-0016. [DOI] [PubMed] [Google Scholar]
  • 22.Greathouse KL, Sinha R, Vogtmann E. DNA extraction for human microbiome studies: the issue of standardization. Genome Biol. 2019;20(1):212. 10.1186/s13059-019-1843-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Huang Y, Wang Y, He Y, Ji Y, Wang L, Sheng H, Zhang M, Huang Q, Zhang D, Wu J, Zhong M, Zhou H. Homogeneity of the vaginal microbiome at the cervix, posterior fornix, and vaginal canal in pregnant Chinese women. Microb Ecol. 2015;69(2):407–14. 10.1007/s00248-014-0487-1. [DOI] [PubMed] [Google Scholar]
  • 24.Mitra A, MacIntyre DA, Mahajan V, Lee YS, Smith A, Marchesi JR, Lyons D, Bennett PR, Kyrgiou M. Comparison of vaginal microbiota sampling techniques: cytobrush versus swab. Sci Rep. 2017;7(1):9802–10. 10.1038/s41598-017-09844-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Reschini M, Benaglia L, Ceriotti F, Borroni R, Ferrari S, Castiglioni M, Guarneri D, Porcaro L, Vigano’ P, Somigliana E, Uceda Renteria S. Endometrial microbiome: sampling, assessment, and possible impact on embryo implantation. Sci Rep. 2022;12(1):8467. 10.1038/s41598-022-12095-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Verstraelen H, Vilchez-Vargas R, Desimpel F, Jauregui R, Vankeirsbilck N, Weyers S, Verhelst R, De Sutter P, Pieper DH, Van De Wiele T. Characterisation of the human uterine microbiome in non-pregnant women through deep sequencing of the V1–2 region of the 16S rRNA gene. PeerJ. 2016;4:e1602. 10.7717/peerj.1602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Forney LJ, Gajer P, Williams CJ, Schneider GM, Koenig SSK, McCulle SL, Karlebach S, Brotman RM, Davis CC, Ault K, Ravel J. Comparison of self-collected and physician-collected vaginal swabs for microbiome analysis. J Clin Microbiol. 2010;48(5):1741–8. 10.1128/JCM.01710-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Virtanen S, Kalliala I, Nieminen P, Salonen A. Comparative analysis of vaginal microbiota sampling using 16S rRNA gene analysis. PloS One. 2017;12(7):e0181477. 10.1371/journal.pone.0181477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Blankenship SA, Wylie KM, Tuuli MG, Macones GA, Stout MJ. Evaluation of patient- versus provider-collected vaginal swabs for microbiome analysis during pregnancy. Am J Obstet Gynecol. 2018;219(6):645. 10.1016/j.ajog.2018.10.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Molina NM, Sola-Leyva A, Haahr T, Aghajanova L, Laudanski P, Castilla JA, Altmäe S. Analysing endometrial microbiome: methodological considerations and recommendations for good practice. Human Reprod (Oxford). 2021;36(4):859–79. 10.1093/humrep/deab009. [DOI] [PubMed] [Google Scholar]
  • 31.Keburiya LK, Smolnikova VY, Priputnevich TV, Muravieva VV, Gordeev AB, Trofimov DY, Shubina ES, Kochetkova TO, Rogacheva MS, Kalinina EA, Sukhikh GT. Correction to: Does the uterine microbiota affect the reproductive outcomes in women with recurrent implantation failures? BMC Women’s Health. 2023;23(1):265. 10.1186/s12905-023-02302-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ono Y, Kobayashi Y, Shimada S, Fukushi Y, Yoshino O, Wada S, Yamada H. Uterine endometrium microbiome in women with repeated implantation failure complicated by endometriosis. J Clin Med. 2024;13(16):4605. 10.3390/jcm13164605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hickl O, Heintz-Buschart A, Trautwein-Schult A, Hercog R, Bork P, Wilmes P, Becher D. Sample preservation and storage significantly impact taxonomic and functional profiles in metaproteomics studies of the human gut microbiome. Microorganisms (Basel). 2019;7(9):367. 10.3390/microorganisms7090367. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zigui C, Hui PC, Mamie H, Yeoh YK, Yee WP, Chan Martin CW, Wong Martin CS, Ng Siew C, Chan Francis KL, Chan Paul KS. Impact of preservation method and 16S rRNA hypervariable region on gut microbiota profiling. mSystems. 2019;4(1):271. 10.1128/mSystems.00271-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Diaz-Martínez MdC, Bernabeu A, Lledó B, Carratalá-Munuera C, Quesada JA, Lozano FM, Ruiz V, Morales R, Llácer J, Ten J, Castillo JC, Rodríguez A, Nouni-García R, López-Pineda A, Moliner B, Bernabeu R. Impact of the vaginal and endometrial microbiome pattern on assisted reproduction outcomes. J Clin Med. 2021;10(18):4063. 10.3390/jcm10184063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Graspeuntner S, Bohlmann MK, Gillmann K, Speer R, Kuenzel S, Mark H, Hoellen F, Lettau R, Griesinger G, König IR, Baines JF, Rupp J. Microbiota-based analysis reveals specific bacterial traits and a novel strategy for the diagnosis of infectious infertility. PLoS ONE. 2018;13(1):e0191047. 10.1371/journal.pone.0191047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hao X, Li P, Wu S, Tan J. Association of the cervical microbiota with pregnancy outcome in a subfertile population undergoing in vitro fertilization: a case-control study. Frontiers in Cellular and Infection Microbiology. 2021;11:654202. 10.3389/fcimb.2021.654202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Karaer A, Doğan B, Günal S, Tuncay G, Arda Düz S, Ünver T, Tecellioğlu N. The vaginal microbiota composition of women undergoing assisted reproduction: a prospective cohort study. BJOG : An Int J Obstet Gynaecol. 2021;128(13):2101–9. 10.1111/1471-0528.16782. [DOI] [PubMed] [Google Scholar]
  • 39.Lüth T, Graspeuntner S, Neumann K, Kirchhoff L, Masuch A, Schaake S, Lupatsii M, Tse R, Griesinger G, Trinh J, Rupp J. Improving analysis of the vaginal microbiota of women undergoing assisted reproduction using nanopore sequencing. J Assist Reprod Genet. 2022;39(11):2659–67. 10.1007/s10815-022-02628-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Moreno I, Codoñer FM, Vilella F, Valbuena D, Martinez-Blanch JF, Jimenez-Almazán J, Alonso R, Alamá P, Remohí J, Pellicer A, Ramon D, Simon C. Evidence that the endometrial microbiota has an effect on implantation success or failure. Am J Obstet Gynecol. 2016;215(6):684–703. 10.1016/j.ajog.2016.09.075. [DOI] [PubMed] [Google Scholar]
  • 41.Patel N, Patel N, Pal S, Nathani N, Pandit R, Patel M, Patel N, Joshi C, Parekh B. Distinct gut and vaginal microbiota profile in women with recurrent implantation failure and unexplained infertility. BMC Women’s Health. 2022;22(1):113. 10.1186/s12905-022-01681-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Vajpeyee M, Tiwari S, Yadav LB, Tank P. Assessment of bacterial diversity associated with assisted reproductive technologies through next-generation sequencing. Middle East Fertil Soc J. 2022;27(1):28. 10.1186/s43043-022-00117-3. [Google Scholar]
  • 43.Zou Y, Liu X, Chen P, Wang Y, Li W, Huang R. The endometrial microbiota profile influenced pregnancy outcomes in patients with repeated implantation failure: a retrospective study. J Reprod Immunol. 2023;155:103782. 10.1016/j.jri.2022.103782. [DOI] [PubMed] [Google Scholar]
  • 44.Choo JM, Leong LEX, Rogers GB. Sample storage conditions significantly influence faecal microbiome profiles. Sci Rep. 2015;5(1):16350. 10.1038/srep16350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ketter PM, Kamucheka R, Arulanandam B, Akers K, Cap AP. Platelet enhancement of bacterial growth during room temperature storage: mitigation through refrigeration. Transfusion. 2019;59(S2):1479–89. 10.1111/trf.15255. [DOI] [PubMed] [Google Scholar]
  • 46.Prakash O, Nimonkar Y, Desai D. A recent overview of microbes and microbiome preservation. Indian J Microbiol. 2020;60(3):297–309. 10.1007/s12088-020-00880-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Blekhman R, Tang K, Archie EA, Barreiro LB, Johnson ZP, Wilson ME, Kohn J, Yuan ML, Gesquiere L, Grieneisen LE, Tung J. Common methods for fecal sample storage in field studies yield consistent signatures of individual identity in microbiome sequencing data. Sci Rep. 2016;6(1):31519. 10.1038/srep31519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Kers JG, Saccenti E. The power of microbiome studies: some considerations on which alpha and beta metrics to use and how to report results. Front Microbiol. 2022;12:796025. 10.3389/fmicb.2021.796025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Ichiyama T, Kuroda K, Nagai Y, Urushiyama D, Ohno M, Yamaguchi T, Nagayoshi M, Sakuraba Y, Yamasaki F, Hata K, Miyamoto S, Itakura A, Takeda S, Tanaka A. Analysis of vaginal and endometrial microbiota communities in infertile women with a history of repeated implantation failure. Reprod Med Biol. 2021;20(3):334–44. 10.1002/rmb2.12389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Komiya S, Matsuo Y, Nakagawa S, Morimoto Y, Kryukov K, Okada H, Hirota K. MinION, a portable long-read sequencer, enables rapid vaginal microbiota analysis in a clinical setting. BMC Med Genomics. 2022;15(1):68. 10.1186/s12920-022-01218-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kyono K, Hashimoto T, Nagai Y, Sakuraba Y. Analysis of endometrial microbiota by 16S ribosomal RNA gene sequencing among infertile patients: a single-center pilot study. Reprod Med Biol. 2018;17(3):297–306. 10.1002/rmb2.12105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Okwelogu SI, Ikechebelu JI, Agbakoba NR, Anukam KC. Microbiome compositions from infertile couples seeking in vitro fertilization, using 16S rRNA gene sequencing methods: any correlation to clinical outcomes? Front Cell Infection Microbiol. 2021;11:709372. 10.3389/fcimb.2021.709372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Villani A, Fontana A, Barone S, de Stefani S, Primiterra M, Copetti M, Panebianco C, Parri C, Sciannamè N, Quitadamo PA, Tiezzi A, Santana L, Maglione A, D’Amato F, Perri F, Palini S, Pazienza V. Identifying predictive bacterial markers from cervical swab microbiota on pregnancy outcome in woman undergoing assisted reproductive technologies. J Clin Med. 2022;11(3):680. 10.3390/jcm11030680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Kero K, Hieta N, Kallonen T, Ahtikoski A, Laine HK, Rautava J, Munukka E. Optimal sampling and analysis methods for clinical diagnostics of vaginal microbiome. Eur J Clin Microbiol Infect Dis. 2023;42(2):201–8. 10.1007/s10096-022-04545-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.DNAgenotek. DNAgenotek - company overview. 2024. https://www.dnagenotek.com/US/company/overview.html
  • 56.Miller EA, Beasley DE, Dunn RR, Archie EA. Lactobacilli dominance and vaginal pH: why is the human vaginal microbiome unique? Front Microbiol. 2016;7:1936. 10.3389/fmicb.2016.01936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Younas H. Chapter 19 - Secondary metabolites from marine epiphytic bacteria against plant pathogens. In: Bacterial secondary metabolites. Elsevier Inc.; 2024. pp. 353–79. 10.1016/B978-0-323-95251-4.00012-0.
  • 58.Jenkins SV, Vang KB, Gies A, Griffin RJ, Jun S, Nookaew I, Dings RPM. Sample storage conditions induce post-collection biases in microbiome profiles. BMC Microbiolo. 2018;18(1):227. 10.1186/s12866-018-1359-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Vogtmann E, Chen J, Amir A, Shi J, Abnet CC, Nelson H, Knight R, Chia N, Sinha R. Comparison of collection methods for fecal samples in microbiome studies. Am J Epidemiol. 2017;185(2):115–23. 10.1093/aje/kww177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kumar T, Bryant M, Cantrell K, Song SJ, McDonald D, Tubb HM, Farmer S, Lewis A, Lukacz ES, Brubaker L, Knight R. Effects of variation in sample storage conditions and swab order on 16S vaginal microbiome analyses. Microbiol Spectr. 2024;12(1):e0371223. 10.1128/spectrum.03712-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Wu GD, Lewis JD, Hoffmann C, Chen Y, Knight R, Bittinger K, Hwang J, Chen J, Berkowsky R, Nessel L, Li H, Bushman FD. Sampling and pyrosequencing methods for characterizing bacterial communities in the human gut using 16S sequence tags. BMC Microbiol. 2010;10(1):206. 10.1186/1471-2180-10-206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Marquet M, Zöllkau J, Pastuschek J, Viehweger A, Schleußner E, Makarewicz O, Pletz MW, Ehricht R, Brandt C. Evaluation of microbiome enrichment and host DNA depletion in human vaginal samples using oxford nanopore’s adaptive sequencing. Sci Rep. 2022;12(1):4000. 10.1038/s41598-022-08003-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Mahalanabis M, Al-Muayad H, Kulinski MD, Altman D, Klapperich CM. Cell lysis and DNA extraction of gram-positive and gram-negative bacteria from whole blood in a disposable microfluidic chip. Lab Chip. 2009;9(19):2811–7. 10.1039/b905065p. [DOI] [PubMed] [Google Scholar]
  • 64.Weerakkody LR, Witharana C. A rapid, inexpensive and effective method for the efficient isolation of genomic DNA from gram-negative bacteria. Mol Genet Genomics. 2024;299(1):26. 10.1007/s00438-024-02120-x. [DOI] [PubMed] [Google Scholar]
  • 65.Thielemann N, Herz M, Kurzai O, Martin R. Analyzing the human gut mycobiome – a short guide for beginners. Comput Struct Biotechnol J. 2022;20:608–14. 10.1016/j.csbj.2022.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Shvartsman E, Richmond MEI, Schellenberg JJ, Lamont A, Perciani C, Russell JNH, Poliquin V, Burgener A, Jaoko W, Sandstrom P, MacDonald KS. Comparative analysis of DNA extraction and PCR product purification methods for cervicovaginal microbiome analysis using cpn60 microbial profiling. PloS One. 2022;17(1):e0262355. 10.1371/journal.pone.0262355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Cheong HC, Yap PSX, Chong CW, Cheok YY, Lee CYQ, Tan GMY, Sulaiman S, Hassan J, Sabet NS, Looi CY, Gupta R, Arulanandam B, AbuBakar S, Teh CSJ, Chang LY, Wong WF. Diversity of endocervical microbiota associated with genital chlamydia trachomatis infection and infertility among women visiting obstetrics and gynecology clinics in malaysia. PLoS ONE. 2019;14(11):e0224658. 10.1371/journal.pone.0224658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Franasiak JM, Werner MD, Juneau CR, Tao X, Landis J, Zhan Y, Treff NR, Scott RT. Endometrial microbiome at the time of embryo transfer: next-generation sequencing of the 16S ribosomal subunit. J Assist Reprod Genet. 2016;33(1):129–36. 10.1007/s10815-015-0614-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Liu Y, Ko EY, Wong KK, Chen X, Cheung W, Law TS, Chung JP, Tsui SK, Li T, Chim SS. Endometrial microbiota in infertile women with and without chronic endometritis as diagnosed using a quantitative and reference range-based method. Fertil Steril. 2019;112(4):707-717.e1. 10.1016/j.fertnstert.2019.05.015. [DOI] [PubMed] [Google Scholar]
  • 70.Tao X, Franasiak JM, Zhan Y, Scott RT, Rajchel J, Bedard J, Newby R, Scott RT, Treff NR, Chu T. Characterizing the endometrial microbiome by analyzing the ultra-low bacteria from embryo transfer catheter tips in IVF cycles: next generation sequencing (NGS) analysis of the 16S ribosomal gene. Human Microbiome J. 2017;3:15–21. 10.1016/j.humic.2017.01.004. [Google Scholar]
  • 71.Oberle A, Urban L, Falch-Leis S, Ennemoser C, Nagai Y, Ashikawa K, Ulm PA, Hengstschläger M, Feichtinger M. 16S rRNA long-read nanopore sequencing is feasible and reliable for endometrial microbiome analysis. Reprod BioMed Online. 2021;42(6):1097–107. 10.1016/j.rbmo.2021.03.016. [DOI] [PubMed] [Google Scholar]
  • 72.Mattei V, Murugesan S, Al Hashmi M, Mathew R, James N, Singh P, Kumar M, Lakshmanan AP, Terranegra A, Al Khodor S, Tomei S. Evaluation of methods for the extraction of microbial DNA from vaginal swabs used for microbiome studies. Front Cell Infect Microbiol. 2019;9:197. 10.3389/fcimb.2019.00197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Kitaya K, Nagai Y, Arai W, Sakuraba Y, Ishikawa T. Characterization of microbiota in endometrial fluid and vaginal secretions in infertile women with repeated implantation failure. Mediat Inflammation. 2019;2019:4893437–10. 10.1155/2019/4893437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Moreno I, Garcia-Grau I, Perez-Villaroya D, Gonzalez-Monfort M, Bahçeci M, Barrionuevo MJ, Taguchi S, Puente E, Dimattina M, Lim MW, Meneghini G, Aubuchon M, Leondires M, Izquierdo A, Perez-Olgiati M, Chavez A, Seethram K, Bau D, Gomez C, Simon C. Endometrial microbiota composition is associated with reproductive outcome in infertile patients. Microbiome. 2022;10(1):1. 10.1186/s40168-021-01184-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Gill C, van de Wijgert Janneke HHM, Blow F, Darby AC. Evaluation of lysis methods for the extraction of bacterial DNA for analysis of the vaginal microbiota. PloS One. 2016;11(9):e0163148. 10.1371/journal.pone.0163148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Knudsen BE, Bergmark L, Munk P, Lukjancenko O, Priemé A, Aarestrup FM, Pamp SJ. Impact of sample type and DNA isolation procedure on genomic inference of microbiome composition. mSystems. 2016;1(5):e00095–16. 10.1128/mSystems.00095-16. [DOI] [PMC free article] [PubMed]
  • 77.Yuan S, Cohen DB, Ravel J, Abdo Z, Forney LJ. Evaluation of methods for the extraction and purification of DNA from the human microbiome. PloS One. 2012;7(3):e33865. 10.1371/journal.pone.0033865. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Wohlfahrt D, Brooks J Paul, Seashols-Williams S, Singh B. Man v. cube: a comparison of the effect of manual versus robotic DNA extraction methods on the microbial communities associated with forensically relevant human body fluids. Unpublished manuscript. 2024.
  • 79.Wensel CR, Pluznick JL, Salzberg SL, Sears CL. Next-generation sequencing: insights to advance clinical investigations of the microbiome. J Clin Investig. 2022;132(7):1–12. 10.1172/JCI154944. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Peterson D, Bonham KS, Rowland S, Pattanayak CW, Klepac-Ceraj V. Comparative analysis of 16S rRNA gene and metagenome sequencing in pediatric gut microbiomes. Front Microbiol. 2021;12:670336. 10.3389/fmicb.2021.670336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Greay TL, Gofton AW, Paparini A, Ryan UM, Oskam CL, Irwin PJ. Recent insights into the tick microbiome gained through next-generation sequencing. Parasites Vectors. 2018;11(1):12. 10.1186/s13071-017-2550-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.CJ Bioscience INC. (2017, 16S RRNA and 16S RRNA gene. EZ BioCloud. https://help.ezbiocloud.net/16s-rrna-and-16s-rrna-gene/
  • 83.Keburiya LK, Smolnikova VY, Priputnevich TV, Muravieva VV, Gordeev AB, Trofimov DY, Shubina ES, Kochetkova TO, Rogacheva MS, Kalinina EA, Sukhikh GT. Does the uterine microbiota affect the reproductive outcomes in women with recurrent implantation failures? BMC Women’s Health. 2022;22(1):168. 10.1186/s12905-022-01750-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Shah N, Tang H, Doak TG, Ye Y. Comparing bacterial communities inferred from 16S rRNA gene sequencing and shotgun metagenomics. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 2011; 165–176. https://www.ncbi.nlm.nih.gov/pubmed/21121044 [DOI] [PubMed]
  • 85.Su W, Gong C, Zhong H, Yang H, Chen Y, Wu X, Jin J, Xi H, Zhao J. Vaginal and endometrial microbiome dysbiosis associated with adverse embryo transfer outcomes. Reprod Biol Endocrinol. 2024;22(1):111–2. 10.1186/s12958-024-01274-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Nishio E, Iwata A, Kawasaki R, Iwao K, Nishizawa H, Fujii T. Metabolomic and microbiome analysis of cervicovaginal mucus in in vitro fertilization–embryo transfer: toward predicting pregnancy success. Reprod Med Biol. 2024;23(1):e12568. 10.1002/rmb2.12568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Wei Q, Chen H, Zou H, Zhang H, Liu S, Zheng J, Zhang S, Hu L. Impact of vaginal microecological differences on pregnancy outcomes and endometrial microbiota in frozen embryo transfer cycles. J Assist Reprod Genet. 2024;41(4):929–38. 10.1007/s10815-024-03066-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Bukin YS, Galachyants YP, Morozov IV, Bukin SV, Zakharenko AS, Zemskaya TI. The effect of 16S rRNA region choice on bacterial community metabarcoding results. Sci Data. 2019;(6). 10.1038/sdata.2019.7. [DOI] [PMC free article] [PubMed]
  • 89.López-Aladid R, Fernández-Barat L, Alcaraz-Serrano V, Bueno-Freire L, Vázquez N, Pastor-Ibáñez R, Palomeque A, Oscanoa P, Torres A. Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples. Sci Rep. 2023;13(1):3974. 10.1038/s41598-023-30764-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Pervez T, Hasnain M, Abbas H, Moustafa MF, Aslam N, Shah SSM. A comprehensive review of performance of next-generation sequencing platforms. BioMed Res Int. 2022;2022:1–12. 10.1155/2022/3457806. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 91.Zaura E. Next-generation sequencing approaches to understanding the oral microbiome. Adv Dental Res. 2012;24(2):81–5. 10.1177/0022034512449466. [DOI] [PubMed] [Google Scholar]
  • 92.Espinosa Prieto A, Hardion L, Debortoli N, Beisel J. Finding the perfect pairs: a matchmaking of plant markers and primers for multi-marker eDNA metabarcoding. Mol Ecol Resour. 2024;24:e13937. 10.1111/1755-0998.13937. [DOI] [PubMed]
  • 93.Wee BA, Thomas M, Sweeney EL, Frentiu FD, Samios M, Ravel J, Gajer P, Myers G, Timms P, Allan JA, Huston WM. A retrospective pilot study to determine whether the reproductive tract microbiota differs between women with a history of infertility and fertile women. Aust New Zealand J Obstet Gynaecol. 2018;58(3):341–8. 10.1111/ajo.12754. [DOI] [PubMed] [Google Scholar]
  • 94.Poretsky R, Rodriguez-R LM, Luo C, Tsementzi D, Konstantinidis KT. Strengths and limitations of 16S rRNA gene amplicon sequencing in revealing temporal microbial community dynamics. PloS One. 2014;9(4):e93827. 10.1371/journal.pone.0093827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Wang W, Xu S, Ren Z, Tao L, Jiang J, Zheng S. Application of metagenomics in the human gut microbiome. World J Gastroenterol. 2015;21(3):803–14. 10.3748/wjg.v21.i3.803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Rezasoltani S, Ahmadi Bashirzadeh D, Nazemalhosseini Mojarad E, Asadzadeh Aghdaei H, Norouzinia M, Shahrokh S. Signature of gut microbiome by conventional and advanced analysis techniques: advantages and disadvantages. Middle East J Dig Dis. 2020;12(1):5–11. 10.15171/mejdd.2020.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hu T, Chitnis N, Monos D, Dinh A. Next-generation sequencing technologies: an overview. Human Immunol. 2021;82(11):801–11. 10.1016/j.humimm.2021.02.012. [DOI] [PubMed] [Google Scholar]
  • 98.Tlili C, Djebbi K, Elaguech MA, Bahri M, Zhou D, Shi B, Wang D. Next-generation DNA sequencing: Ion Torrent sequencers versus nanopore technology. Handbook of biochips (pp. 651–666). Springer New York. 2021; 10.1007/978-1-4614-3447-4_56
  • 99.Verstraelen H, Vieira-Baptista P, De Seta F, Ventolini G, Lonnee-Hoffmann R, Lev-Sagie A. The vaginal microbiome: I. Research development, lexicon, defining “normal” and the dynamics throughout women’s lives. J Lower Genital Tract Dis. 2022;26(1):73–8. 10.1097/LGT.0000000000000643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Guerrero-Sanchez VM, Maldonado-Alconada AM, Amil-Ruiz F, Verardi A, Jorrín-Novo JV, Rey M. Ion torrent and lllumina, two complementary RNA-seq platforms for constructing the holm oak (quercus ilex) transcriptome. PloS One. 2019;14(1):e0210356. 10.1371/journal.pone.0210356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Quail MA, Smith M, Coupland P, Otto TD, Harris SR, Connor TR, Bertoni A, Swerdlow HP, Gu Y. A tale of three next generation sequencing platforms: comparison of ion torrent, pacific biosciences and Illumina MiSeq sequencers. BMC Genomics. 2012;13(1):341. 10.1186/1471-2164-13-341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Cosgun E, Oh M. Exploring the consistency of the quality scores with machine learning for next-generation sequencing experiments. BioMed Res Int. 2020;2020:8531502–6. 10.1155/2020/8531502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Gurson, N. (2015, What is accuracy in next-generation sequencing (NGS)? . ThermoFisher Scientific. https://www.thermofisher.com/blog/behindthebench/what-is-accuracy-in-next-generation-sequencing-ngs/
  • 104.Benítez-Páez A, Portune KJ, Sanz Y. Species-level resolution of 16S rRNA gene amplicons sequenced through the MinION™ portable nanopore sequencer. GigaScience. 2016;5(1):4. 10.1186/s13742-016-0111-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Wagner J, Coupland P, Browne HP, Lawley TD, Francis SC, Parkhill J. Evaluation of PacBio sequencing for full-length bacterial 16S rRNA gene classification. BMC Microbiol. 2016;16(1):274. 10.1186/s12866-016-0891-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Curry KD, Wang Q, Nute MG, Tyshaieva A, Reeves E, Soriano S, Wu Q, Graeber E, Finzer P, Mendling W, Savidge T, Villapol S, Dilthey A, Treangen TJ. Emu: species-level microbial community profiling of full-length 16S rRNA oxford nanopore sequencing data. Nat Methods. 2022;19(7):845–53. 10.1038/s41592-022-01520-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Smith RH, Glendinning L, Walker AW, Watson M. Investigating the impact of database choice on the accuracy of metagenomic read classification for the rumen microbiome. Anim Microbiome. 2022;4(1):57. 10.1186/s42523-022-00207-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.McDonald D, Price MN, Goodrich J, Nawrocki EP, DeSantis TZ, Probst A, Andersen GL, Knight R, Hugenholtz P. An improved greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. 2012. https://escholarship.org/uc/item/5zm7f8ft [DOI] [PMC free article] [PubMed]
  • 109.Schoch CL, Ciufo S, Domrachev M, Hotton CL, Kannan S, Khovanskaya R, Leipe D, Mcveigh R, O’Neill K, Robbertse B, Sharma S, Soussov V, Sullivan JP, Sun L, Turner S, Karsch-Mizrachi I. NCBI taxonomy: a comprehensive update on curation, resources and tools. Database : The Journal of Biological Databases and Curation. 2020; 10.1093/database/baaa062 [DOI] [PMC free article] [PubMed]
  • 110.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590–6. 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed]
  • 111.Breitwieser FP, Lu J, Salzberg SL. A review of methods and databases for metagenomic classification and assembly. Brief Bioinforma. 2019;20(4):1125–36. 10.1093/bib/bbx120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Bharti R, Grimm DG. Current challenges and best-practice protocols for microbiome analysis. Brief Bioinforma. 2021;22(1):178–93. 10.1093/bib/bbz155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Jeske JT, Gallert C. Microbiome analysis via OTU and ASV-based pipelines—a comparative interpretation of ecological data in WWTP systems. Bioengineering (Basel). 2022;9(4):146. 10.3390/bioengineering9040146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Portik DM, Brown CT, Pierce-Ward NT. Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets. BMC Bioinformatics. 2022;23(1):541. 10.1186/s12859-022-05103-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Tamames J, Cobo-Simón M, Puente-Sánchez F. Assessing the performance of different approaches for functional and taxonomic annotation of metagenomes. BMC Genomics. 2019;20(1):960. 10.1186/s12864-019-6289-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Tran Q, Phan V. Assembling reads improves taxonomic classification of species. Genes. 2020;11(8):946. 10.3390/genes11080946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Spohr P, Scharf S, Rommerskirchen A, Henrich B, Jäger P, Klau GW, Haas R, Dilthey A, Pfeffer K. Insights into gut microbiomes in stem cell transplantation by comprehensive shotgun long-read sequencing. Sci Rep. 2024;14(1):4068. 10.1038/s41598-024-53506-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Martin D, Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics. 2000;16(6):562–3. 10.1093/bioinformatics/16.6.562. [DOI] [PubMed] [Google Scholar]
  • 119.Heikema AP, Horst-Kreft D, Boers SA, Jansen R, Hiltemann SD, de Koning W, Kraaij R, de Ridder MAJ, van Houten CB, Bont LJ, Stubbs AP, Hays JP. Comparison of Illumina versus nanopore 16S rRNA gene sequencing of the human nasal microbiota. Genes. 2020;11(9):1105. 10.3390/genes11091105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Werner D, Acharya K, Blackburn A, Zan R, Plaimart J, Allen B, Mgana SM, Sabai SM, Halla FF, Massawa SM, Haile AT, Hiruy AM, Mohammed J, Vinitnantharat S, Thongsamer T, Pantha K, Mota Filho CR, Lopes BC. MinION nanopore sequencing accelerates progress towards ubiquitous genetics in water research. Water (Basel). 2022;14(16):2491. 10.3390/w14162491. [Google Scholar]
  • 121.Santos A, van Aerle R, Barrientos L, Martinez-Urtaza J. Computational methods for 16S metabarcoding studies using nanopore sequencing data. Comput Struct Biotechnol J. 2020;18:296–305. 10.1016/j.csbj.2020.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Moossavi S, Fehr K, Khafipour E, Azad MB. Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota. Microbiome. 2021;9(1):41. 10.1186/s40168-020-00998-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Kyrpides NC, Eloe-Fadrosh EA, Ivanova NN. Microbiome data science: understanding our microbial planet. Trends Microbiol. 2016;24(6):425–7. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Reproducibility and repeatability of results is essential in microbiome investigations to validate a study’s findings and assess the consistency of results. To evaluate reproducibility and repeatability, however, datasets require re-analysis by researchers which can only be achieved if studies publish raw data or publish raw data into a microbiome data repository [122]. Whilst deposition of raw data into repositories like the International Nucleotide Sequence Database Collaboration (INSDC) centre are positive steps to increasing raw data availability across microbiome investigations, the data deposited is not integrated or processed within the INSDC centre due to a lack of funding [122, 123]. Due to the lack of data integration and processing after data deposition, the sequences have no reusability beyond the scope of the original investigation and unable to be repeated and reproduced within different studies. The lack of raw data available for re-analysis hinders meta-analysis and systematic reviews attempting to compare different methodological techniques and technologies within microbiome investigations due to the minimal consistency in methods. Additionally, tools that cause high variability within analysis protocols—like bioinformatic workflow and database selection—cannot be accurately compared. For example, the determination of QIIME vs. QIIME2 accuracy cannot be determined without comparing re-analysed data [123].

Whilst a comprehensive microbiome databank remains the ideal standard for data deposition in microbiome research, the lack of funding presents a significant barrier to its realisation. In the interim, ensuring that all raw microbiome data are made publicly available is essential for enhancing data reproducibility and repeatability, thereby facilitating more robust benchmarking and comparison across different methodological approaches.


Articles from Journal of Assisted Reproduction and Genetics are provided here courtesy of Springer Science+Business Media, LLC

RESOURCES