Abstract
Microbial involvement in the pathogenesis have been suggested in both antineutrophil cytoplasmic antibody-associated vasculitis (AAV) and sarcoidosis, both of which have lung involvement. However, exhaustive research to assess the bacteria in the lung in AAV and in sarcoidosis have not been performed. We sought to elucidate the distinct dysbiotic lung microbiota between AAV and sarcoidosis. We used 16S rRNA gene high-throughput sequencing to obtain the bacterial community composition of bronchoalveolar lavage fluid (BALF) in patients with AAV (n = 16) compared to patients with sarcoidosis (n = 21). The patients had not undergone therapy with immunosuppressive medication when their BALF was acquired. No difference was observed in α-diversity between patients with AAV and patients with sarcoidosis when using all the detected taxa. We defined the taxa of the oral cavity by using the data of oral microbiota of healthy individuals from the Human Microbiome Project (HMP). The analysis using only oral taxa made the difference in α-diversity between AAV and sarcoidosis clearer compared with those using all the detected taxa. Besides, the analysis using detected taxa except for oral taxa also made the difference in α-diversity between AAV and sarcoidosis clearer compared with those using all the detected taxa. A linear negative relationship between the α-diversity and Birmingham vasculitis activity score (BVAS) was detected in the AAV group. The observed p-value for the effect of the disease groups on the ß-diversity was small while the effect of other factors including sex and smoking status did not have small p-values. By excluding oral taxa from all the detected taxa, we found a cluster mainly consisted of sarcoidosis patients which was characterized with microbial community monopolized by Erythrobacteraceae family. Our results suggested the importance of considering the influence of oral microbiota in evaluating lung microbiota.
Subject terms: Vasculitis syndromes, Vasculitis syndromes
Introduction
Granulomatosis with polyangiitis (GPA) and microscopic polyangiitis (MPA) are forms of antineutrophil cytoplasmic antibody (ANCA)-associated vasculitis (AAV), a systemic disease affecting multiple organs including the lungs and kidneys. The production of ANCA, playing a pathogenetic role in AAV, is thought to be initiated by exogenous or endogenous antigens including microbes, drugs, or dysregulated autoantigen expression1. Patients with GPA have a higher rate of Staphylococcus aureus carriage in their noses2 and bronchoalveolar lavage fluid (BALF)3.
Sarcoidosis is a granulomatous disorder affecting multiple organs, characterized by a non-caseating granuloma, the hallmark of sarcoidosis. The non-caseating granuloma is thought to be the result of immunological responses to antigenic triggers including spatial, seasonal, occupational, and infectious factors4. Numerous infectious agents have been suggested as possible etiologic agents of sarcoidosis, including mycobacteria and cutibacteria (formerly propionibacteria)5.
Contributions of mycobacteria to sarcoidosis have been suggested by studies of acid-fast cell wall-deficient forms of bacteria6 and a mycobacterial antigen, Mycobacterium tuberculosis catalase-peroxidase (mKatG)7. Propionibacterium acnes (Cutibacterium acnes) is also associated with sarcoidosis, as described in a study using lymph nodes of patients with sarcoidosis8. The identification of etiologic bacteria to sarcoidosis is challenging despite the above-mentioned findings, and novel concepts and techniques are required for this purpose.
The advent of new technologies using high-throughput sequencing has made it possible for us to evaluate and understand compositional differences in the microbiota of body sites between health and disease9. Because novel techniques for microbial identification are culture-independent, they have demonstrated diverse communities of microbes even in body sites including lung which are historically considered sterile in healthy status10. As for autoimmune diseases, patients with rheumatoid arthritis had less diversity and abundance of microbiota compared with healthy controls11.
We hypothesized that the difference of lung microbiota between AAV and sarcoidosis, both are diseases with lung involvement, would characterize each disease. In addition, we assumed that disease activity of AAV would associate with lung microbiota. Given these hypotheses, we herein evaluated the lung microbiota of AAV and sarcoidosis. We detected microbes that were reported to exist in the oral cavity also in our BALF samples in the analytic process and had a hypothesis that oral microbes might affect lung microbiota. However, we had no our own samples from the oral cavity nor data of oral microbiota. Therefore, we used data of oral microbiota from the Human Microbiome Project (HMP)12 to evaluate the effect of oral microbiota on lung microbiota.
METHODS
Patients and study criteria
We enrolled patients newly diagnosed with AAV or sarcoidosis between October 2014 and February 2018 at Nagasaki University Hospital and Sasebo Chuo Hospital. All patients with AAV were diagnosed based on the Chapel Hill Consensus Conference criteria13 and the European Medicines Agency algorithm14. The patients’ diagnoses included two types of AAV: GPA and MPA.
The diagnosis of sarcoidosis was histopathologically confirmed according to the consensus criteria of the American Thoracic Society/European Respiratory Society15. We excluded patients with tuberculosis and nontuberculous mycobacteria based on the results of the BALF culture and polymerase chain reaction (PCR). We also excluded patients with other diseases that could cause the same histopathological findings as sarcoidosis16.
We collected the demographic and clinical characteristics including organ involvement, and laboratory data at diagnosis. Smoking status was defined as “current smoker”, “former smoker”, and “never smoker.” Disease activities of AAV at diagnosis were assessed using Birmingham vasculitis activity score (BVAS)17. None of the patients had received corticosteroids or other immunosuppressive therapies at the time of BALF collection.
This study was performed in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board of Nagasaki University Hospital (registration no.: 14122251) and Sasebo Chuo Hospital (registration no.: 2018-29). Informed consent for the use of their data was obtained from all of the patients.
Bronchoalveolar lavage and cell preparation
BALF was collected with four instillations of sterile physiological saline (50 mL) through a flexible bronchoscope, and the fluid was immediately retrieved by gentle suction using a sterile syringe. The fluid was gently suctioned back to a bottle kept on ice. We used the fourth BALF withdrawal for the DNA extraction.
DNA extraction
We collected pellets from BALF after centrifugation for 30 min at 13,000 g at 4 °C and the removal of supernatants. We stored pellets at −80 °C until processing. DNA extraction from the pellets was performed using a PowerBiofilm DNA Isolation Kit (MoBio Laboratories, Carlsbad, CA).
Sequence analysis
High-throughput sequencing of bacterial 16S rRNA genes amplicon and phylogenetic tree construction from the FASTQ-format outputs were conducted by the Bioengineering Lab Co. (Kanagawa, Japan). Bacterial 16S rRNA gene amplicons encoding the V4 region (300 or 250 read length, paired-end protocol) were sequenced using a MiSeq Illumina sequencer (Illumina, San Diego, CA). The reads started with a 515F-806R primer pair18 and sequences were extracted as the 16S rRNA V4 region. The primer sequences and deeper than 251 bases from the primer sequences were trimmed using the FASTX-Toolkit (ver. 0.0.14) (http://hannonlab.cshl.edu/fastx_toolkit/).
For the quality filtering, the threshold of quality score and length were set at 20 and 40, respectively, and basecalls which did not fulfill these criteria were not adopted. Forward and reverse reads with the length of 250 and 230 base pairs, respectively, were merged using FLASH (ver. 1.2.11, http://www.cbcb.umd.edu/software/flash) with default parameters other than these lengths. The merged reads with lengths of 240–260 base pairs were extracted using SeqIO in biopython. Chimeric reads identified using USEARCH with the reference sequence of Greengenes 13_8 were removed. The remaining sequences were clustered into operational taxonomic units (OTUs) using a 97% similarity threshold (without any external reference sequence collection) with the Quantitative Insights into Microbial Ecology (QIIME)19 pipeline with the default parameters (this process was accessed through pick_de_novo_otus.py command20).
Body-site specific OTU table from healthy population
We downloaded the Human Microbiome Project (HMP)12 dataset, that was generated from samples obtained from 5 body sites and 15 or 18 subsites (the difference of three depends on the subjects’ sex) of 242 healthy adult without evidence of disease12, from the HMP data analysis and coordinating center (DACC) (https://www.hmpdacc.org/hmp/) via the R package HMP16SData v.1.4.121. The details on data generation are published as two articles12,22.
Statistical analysis
The association between variables was assessed using Fisher’s exact test for categorical variables and the Mann-Whitney U test for quantitative variables.
The α-diversity was measured by the inverse Simpson index that is derived from the Simpson index23. The Simpson index is known to be more robust against variation in sampling effort24 than the Shannon index. This robustness is inherited to the inverse Simpson index because the inverse Simpson index is simply an inverse of the subtraction of the Simpson index from 1. The β-diversity was measured by the Morisita-Horn dissimilarity index from the perspective of robustness against variation in sampling effort25. The differences in the α-diversity indices between diseases was evaluated with the Mann-Whitney U statistic. The p-values for the Mann-Whitney U tests were calculated via permutation test26,27. Among the AAV patients, the linear relationships between the α-diversity index and BVAS were analyzed as regression coefficients for the α-diversity index on BVAS by the linear regression. Confidence intervals for the regression coefficients were constructed via bootstrap resampling28. The contour of two-dimensional probability densities was drawn based on a kernel density estimate with a Gaussian kernel with bandwidth selected by the “solve-the equation” estimator29,30.
From an interest of the microbial mouth-lung immigration31, we evaluated the α- and β-diversities in the lung microbiota after extracting a set of taxa of “inhabitant” in the oral cavity. We did not collect oral specimens, therefore we presumed that bacterial taxa of the oral cavity “inhabitant” were present in our subjects’ oral cavity. A set of taxa of “inhabitant” was determined based on the prevalence of each taxon among the HMP subjects, for each of the nine oral cavity and oropharynx subsites from which specimens were collected in the HMP; concretely, taxon with a frequency (read count > 0) of over 98% of the specimens obtained from a subsite (that was determined based on the infimum of a binomial probability with which the lower boundary of >0.95 of the 95% confidence interval by the Wilson’s score method32 was determined as a member of the “inhabitant” in the subsite. This procedure was performed using the ANCOM-II33. In addition, because we assumed that microbes of oral sites can be the source of noise in lung microbiota, we analyzed the lung microbiota after exclusion of oral microbes which can be noise. We defined taxa of the oral sites that over 5% of the HMP subjects had as the“vagrant.” An effect of selecting a specific set of taxa, which owed to extraction of “inhabitants” or exclusion of “vagrant”, on the differences in α-diversities between disease groups, were measured as a percentile rank of a Mann-Whitney’s U statistic from observed data on the empirical cumulative distribution function (ECDF) of the U statistics constructed from 2,000 sets of random drawn taxa. The respective numbers of the drawn taxa in the analysis was determined by the number of the taxa in the analysis. The random drawings were conducted without replacement. On the AAV patients, by switching the U statistic to the regression coefficients for α-diversity regressed on BVAS, the effect of selecting specific set of the taxa were evaluated on the ECDF constructed from the regression coefficients for the inverse Simpson index regressed on BVAS for random drawn 2,000 set of taxa.
The hierarchical clustering was conducted on the β-diversity by the complete linkage method. In the respective heatmaps, taxa were sorted with single linkage method on the Jaccard distances between taxa.
The effects of clinical factors on β-diversity were determined by a permutational multivariate analysis of variance (PERMANOVA)34. The PERMANOVA was conducted with complete cases for each dependent variable (there was a sample with missing values regarding the percentages of macrophages, lymphocytes, eosinophils, and neutrophils of BALF). All the reported p-values are descriptive35. All the statistical analyses were conducted under the R environment v. 3.6.036 using relevant packages (vegan v. 2.5.537, metagenomeSeq v.1.26.038, phyloseq v. 1.28.039, coin v. 1.3.040, and boot ver. 1.3–2241). Source codes used in the statistical analyses are available from https://github.com/mrmtshmp/microbiome_AAV.
RESULTS
Patient characteristics
Total 37 patients (16 AAV and 21 sarcoidosis) were recruited in this study (Fig. 1). Table 1 summarizes the demographic and clinical characteristics of the patients. The patients with AAV were older than the patients with sarcoidosis (median 78 yrs vs. 62 yrs, p = 0.0002). The median ages at diagnosis were reported to be 66.8 years old in GPA and 70.5 years old in MPA in the Japanese population42. Whereas the median age at diagnosis of sarcoidosis was reported to be 54 years old in the Japanese population43. Therefore, the older age at diagnosis of AAV compared with sarcoidosis in our patients was consistent with those in the nation-wide studies. Smoking status was not different between the disease groups. Thirteen patients were positive for myeloperoxidase (MPO)-ANCA. The median BVAS was 16 (interquartile range (IQR) : 8 to 19). All patients with AAV had lung involvement. Eighteen of the 21 patients with sarcoidosis had lung involvement. The patients with AAV had a lower recovery percentage of BALF compared to that of the patients with sarcoidosis. The patients with AAV had lower percentages of lymphocytes and higher percentages of neutrophils in BALF cells compared to the sarcoidosis group. Because BALF from patients with GPA with high disease activities had lower percentages of lymphocytes and higher percentages of neutrophils compared with those from patients with sarcoidosis44, our data were consistent with the previous report.
Table 1.
AAV, n = 16 | Sarcoidosis, n = 21 | p-value | |
---|---|---|---|
Female, n (%) | 11 (69) | 15 (71) | 1.00 |
Age, yrs, median (IQR) | 78 (75–81) | 62 (45–73) | 0.0002 |
Smoking: | |||
Never smokers, n (%) | 11 (69) | 11 (52) | |
Former smokers, n (%) | 3 (19) | 4 (19) | 0.5234* |
Current smokers, n (%) | 2 (13) | 6 (29) | |
MPO-ANCA-positive, n (%) | 13 (81) | ||
MPO-ANCA titer, U/mL (range) | 43 (15–172) | — | |
PR3-ANCA positive, n (%) | 1 (6) | ||
PR3-ANCA titer, U/mL | 34 (n = 1) | — | |
GPA/MPA | 2/14 | — | |
AAV involvements, n (%): | |||
Lung | 16 (100) | — | |
Kidney | 7 (44) | — | |
ENT | 1 (6) | — | |
Nerve | 3 (19) | — | |
Eye | 1 (6) | — | |
Joint | 2 (13) | — | |
BVAS, median (IQR) | 16 (8–19) | — | |
Sarcoidosis involvements, n (%): | |||
Lung | — | 18 (86) | |
Eye | — | 11 (52) | |
Lymph node | — | 18 (86) | |
Skin | — | 5 (24) | |
Heart | — | 3 (14) | |
Pancreas | — | 1 (5) | |
Bronchoscopy: | |||
Recovery percentage of BALF (%), median (IQR) | 34 (31–44) | 53 (48–62) | <0.0001 |
BAL fluid cell concentration (105 cells/mL), median (IQR) | 2.9 (2.0–6.0) | 2.2 (1.9–3.5) | 0.2102 |
Macrophages (%), median (IQR) | 52 (46–61) | 63 (48–83) | 0.1352 |
Lymphocytes (%), median (IQR) | 12 (6–28) | 36 (14–50) | 0.0052 |
Neutrophils (%), median (IQR) | 22 (5–45) | 1 (0–3) | <0.0001 |
Eosinophils (%), median (IQR) | 0 (0–3) | 0 (0–2) | 0.5302 |
*Cochran-Armitage test among never smokers, former smokers, and current smokers. AAV: ANCA-associated vasculitis, ANCA: antineutrophil cytoplasmic antibody, BALF: bronchoalveolar lavage fluid, ENT: ear: nose: throat, IQR: interquartile range, MPO: myeloperoxidase, PR3: proteinase 3.
α-diversity and β-diversity in lung-microbiota
The depth of sequencing was 14,986 reads per sample in median (IQR: 8,467 to 24,354) which passed the quality check explained in the section for Sequence analysis. There was no difference in depth of sequencing between the groups of subjects with AAV and sarcoidosis (median [IQR]: 16,786 [4,705 to 29,857] in AAV, 14,148 [7,608 to 24,354] in sarcoidosis, p = 0.57). The proportions of read counts from OTUs without taxonomic names in each subject’s sample were depicted as boxplots (Fig. S1A).
The difference in the inverse Simpson index between disease groups (AAV and sarcoidosis groups) was not observed (p = 0.830, Fig. 2A). We did not observe the association between the recovery percentage of BALF and the inverse Simpson index (Fig. S2). The observed p-values for differences in the inverse Simpson index between sexes or the smoking status were as follows; p = 0.273 for female vs. male subjects, p = 0.778 for current vs. never smokers, p = 0.575 for former vs. never smokers, p = 0.817 for current vs. former smokers. (Figs. S3 and S4).
A linear relationship between the inverse Simpson index and BVAS was shown in a scatter plot (Fig. 2B). The regression coefficient for the inverse Simpson index on BVAS was −0.206 (95%CI: −0.373 to −0.085).
The observed p-value for the effect of the disease groups on the Morisita-Horn dissimilarity index at the family rank was 0.0331 from the PERMANOVA. The dendrogram on the heatmap (Figs. 3A and S6) showed that small clusters consisted only of patients with each disease (namely, clusters with patient’s IDs of {12, 24}, {27, 23, 25}, {29, 34, 38}, {10, 41}, {22, 30, 11, 46}, {13, 21} and {32, 42, 31, 43}). However, we observed neither large clusters nor evident patterns in the heatmap that were shared with the patients with each disease despite the small p-value was observed from the PERMANOVA.
We evaluated the association between BVAS and the Morisita-Horn dissimilarity index (Fig. 3B, p = 0.3588, PERMANOVA). The effect of other physiological factors (age, sex, smoking status, percentage of macrophages, lymphocytes, eosinophils, and neutrophils in BALF) on the Morisita-Horn dissimilarity index were tabulated in the Table 2. We did not observe any patterns of clusters associated with BVAS in the heatmap (Figs. 3B and S6).
Table 2.
Taxonomic rank | AAV/Sarcoidosis n = 37 | BVAS n = 16 | Age n = 37 | Sex n = 37 | Smoking status Never + Former vs. Curr. n = 37 | Smoking status Never vs. Curr. + Former n = 37 | Macrophage n = 36 | Lymphocyte n = 36 | Neutrophil n = 36 | Eosinophil n = 36 |
---|---|---|---|---|---|---|---|---|---|---|
Family | 0.0331 | 0.3588 | 0.9241 | 0.9793 | 0.7986 | 0.9221 | 0.2027 | 0.1364 | 0.0599 | 0.4697 |
Genus | 0.0792 | 0.3436 | 0.9103 | 0.9623 | 0.7866 | 0.8253 | 0.2014 | 0.0618 | 0.0935 | 0.4943 |
We detected Prevotellaceae, Veillonellaceae, and Streptococcaceae, which have been detected in BALF of healthy individuals in previous researches31,45–47 in our BALF samples (Fig. 3A). Therefore, we aimed to evaluate the contributions of oral microbes on the lung microbiota.
Lung-microbiota diversity analysis focusing on oral inhabitant taxa
As we did not collect oral specimen in our study, we aimed to determine the oral taxa commonly shared in healthy individuals by using the HMP dataset. We extracted taxa that were detected with 0.98 or higher frequency in each oral subsite and determined them as subsite’s inhabitant taxa. Furthermore, we validated the stability of the selected taxa by seeing how many numbers of taxa shared by 98% of subjects were retained as the percentage of subjects was increased up to 99.5% (Fig. S1B). We found that “hard palate” as an oral subsite had a stable number of taxa because 11 taxa at family rank of “hard palate inhabitant” shared by 98% of subjects were retained by 99% of subjects while other body sites’ numbers of taxa shrank. Therefore, we presumed that the 11 taxa of “hard palate inhabitant” were shared commonly in our subjects. The identified 11 taxa as “hard palate inhabitant” were as follows (alphabetical order); Actinomycetaceae, Carnobacteriaceae, Fusobacteriaceae, Gemellaceae, Lachnospiraceae, Neisseriaceae, Pasteurellaceae, Porphyromonadaceae, Prevotellaceae, Streptococcaceae, and Veillonellaceae.
We observed a clearer difference in α-diversity between the disease groups on the 11 taxa of “hard palate inhabitant” (Fig. 4A, p = 0.262, Wilcoxon rank sum test for the inverse Simpson index) than that on all the detected taxa. The percentile rank of the U statistic was 9.1% on the ECDF constructed from U statistics for 2,000 random drawing of 11 taxa (Fig. 4B). To confirm that observed effect of selecting the set of taxa of “hard palate inhabitant” was not a result from the association between sexes and α-diversity, we reanalyzed after stratification by the sexes. As results, we observed percentile ranks those were closer to 50.0% than analysis without the stratification (17.6% for female subjects and 26.1% for male subjects). Those increment in the percentile ranks, especially in the male subjects, suggested that the observed relationship was partly evoked from those between sexes and the α-diversity. The differences in the inverse Simpson index among smoking status were made clearer, but not sexes, when limiting to “hard palate inhabitant” (Figs. 3S and 4S).
The regression coefficient for the inverse Simpson index regressed on BVAS was estimated −0.086 (95%CI: −0.186 to −0.001) when limiting the included taxa to “hard palate inhabitant”. The absolute value of the regression coefficient became smaller compared with that estimated from all the detected taxa (−0.206 (95%CI: −0.373 to −0.085)) because of the decreased α-diversity by downsizing of the number of taxa from 144 to 11. While the regression coefficient for the inverse Simpson index shrank after limiting to “hard palate inhabitant”, the percentile rank was 1.1% on the ECDF. In the distribution of BVAS, we observed separation between sexes (green arrows in Fig. 4C depicted the plots of male subjects). Because we obtained the percentile rank of 0.4% for the female subjects and also 0.4% for the male subjects on the ECDF when stratifying subjects by sexes, we concluded the observed effect was not resulted from the separation of BVAS between sexes. The regression coefficients were −0.185 (95%CI: −0.321 to −0.076) for female subjects and −0.109 (95%CI: −0.511 to 0.133) for male subjects.
The effect of disease groups on the Morisita-Horn dissimilarity index (p = 0.1410, PERMANOVA) was diminished when limiting to “hard palate inhabitant” compared with that from all the detected taxa. We observed no evident patterns in the heatmap that were shared with the patients with each disease (Figs. 5A and S6). We observed the diminished effect of BVAS on the β-diversity (p = 0.5907, PERMANOVA for Morisita-Horn dissimilarity index) when limiting to “hard palate inhabitant” compared with that from all the detected taxa. We did not observe any patterns of clusters associated with BVAS in the heatmap (Figs. 5B and S6).
Results from analyses on taxa of inhabitant in other oral subsites (that include saliva, attached keratinized gingiva, buccal mucosa, hard palate, palatine tonsils, sub and supra gingival plaque, throat and tongue dorsum) are shown in supplementary data (Figs. S7–S10).
Lung-microbiota diversity analysis after noise reduction
To avoid the influence of oral microbes on lung microbiota, we evaluated associations between the diseases and the lung microbiota after excluding the taxa of “vagrant”. After the exclusion, the number of taxa in the OTU table was 48 at family rank.
The observed p-value for the difference in the inverse Simpson index between diseases on the 48 taxa was 0.126 (Fig. 6A), which was smaller than both of the observed p-value from that on all the detected 144 taxa (p = 0.830) and that on the 11 taxa of “hard palate inhabitant” (p = 0.262). In addition, in the inverse Simpson index between diseases, we observed oppositely directed difference, compared with the result from the analysis on the taxa of “hard palate inhabitant”. The percentile rank of the U statistic was 94.2% on the ECDF (Fig. 6B). To confirm that this effect was not a result from the association between sexes and α-diversity, we reanalyzed after stratification by the sexes. As results, we observed that percentile ranks were still high but closer to 50.0% than that obtained from analysis without stratification (the percentile ranks of 90.6% for female subjects and 85.6% for male subjects). Consequently, we concluded that the observed effect was mainly of exclusion of the taxa of “vagrant” though was partly evoked from those between sexes and the α-diversity.
The regression coefficient for the inverse Simpson index regressed on BVAS was estimated 0.022 (95%CI: −0.135 to 0.097) when excluding “vagrant” (Fig. 6C). The percentile rank of the observed regression coefficient was 97.0% on the ECDF (Fig. 6D). We observed separation in BVAS between sexes (green arrows in Fig. 6C depicted the plots of male subjects), therefore we again stratified the subjects by the sex and reanalyzed on the association between these two variables. The analysis resulted to reveal that the effect of the exclusion of the “vagrant” on the association between BVAS and the inverse Simpson index did not differ from those of drawing randomly 48 taxa in the female subjects (the percentile rank was 59.4%). Regarding the male subjects, the percentile rank was remained high (94.5%). We show the effects of the sex and smoking status on the inverse Simpson index in Fig. S5.
Through the β-diversity analyses with the OTU table after exclusion of the taxa of “vagrant”, we observed the diminished effect of disease groups on the microbiota composition dissimilarity between subjects (p = 0.1514, PERMANOVA for Morisita-Horn dissimilarity index) compared with that on all the detected taxa. The clustering analysis revealed a large cluster consisted of mainly subjects with sarcoidosis (9 of the 11 members) that was characterized by the monopoly in the microbiota composition by the Erythrobacteraceae family (Figs. 7A and S6). Regarding the effect of BVAS on the β-diversity, observed p-value was smaller (p = 0.2185, PERMANOVA for Morisita-Horn dissimilarity index) than that from observation on all the detected taxa. We did not observe any patterns of clusters associated with BVAS in the heatmap (Figs. 7B and S6).
DISCUSSION
We have found associations between lung microbiota and two diseases as follows; first, the α-diversity negatively related with BVAS in the AAV group. Secondly, although no differences in the α-diversity was detected between AAV and sarcoidosis when evaluating all of the detected taxa, we found clearer differences between them when limiting the included taxa to “hard palate inhabitant” or excluding “vagrant”. Thirdly, we found the effect of the disease groups on the β-diversity with small p-value. Fourthly, although we found no distinctive patterns of clusters by all the detected taxa, the cluster characterized by the existence of Erythrobacteraceae family was consistent with the sarcoidosis group when excluding the “vagrant.”
Our result that the α-diversity negatively related with BVAS is consistent with the previous study using nasal swabs in GPA, which showed reduced α-diversity in patients with active disease (BVAS ≥ 1) compared with those in remission (BVAS = 0) while it was not significant48. The relationships between the disease severity and α-diversity have been reported in chronic obstructive pulmonary disease (COPD). The reduced α-diversity of the microbiota were observed in severe COPD when compared with mild COPD in BALF49 and sputum50,51. Although the question is whether the reduced α-diversity is the direct cause of increased disease activity (local or systemic inflammation) or merely its consequence, it remains unanswered by our result. Interaction with each other may form the vicious circle that causes persistent inflammation and dysbiosis in lung.
Microbes are shared between upper and lower respiratory tracts in healthy individuals31,45–47. We suggested the influence of migration of oral microbes into lung should be taken in consideration when evaluating lung microbiota even in patients with diseases. The difference in α-diversity between two diseases depended on the selection of taxa which limited to “hard palate inhabitant” or excluded “vagrant.” Although we cannot confirm which selection of taxa appropriately reflects the difference between two diseases, opposite effects of selecting taxa to the difference of α-diversity between two diseases may suggest importance of oral microbes or microbes in lung that do not exist in oral cavity.
Whether AAV or sarcoidosis themselves associated with the detection of taxa of oral microbes is another question. We had no patients who had obvious dysphagia, aspiration, or other manifestations that cause the migration of oral microbes into lung. In addition, dysphagia is a rare manifestation in both AAV52 and sarcoidosis53. Whereas, it is suggested that gastroesophageal reflux disease (GERD)-associated microaspiration may lead to the progression of idiopathic pulmonary fibrosis (IPF) and IPF may increase intrathoracic pressure, which can aggravate GERD vice versa54. GERD is a common problem in sarcoidosis55, but not in AAV56. Therefore, diseases themselves, especially sarcoidosis, may have had impacts on the detection of taxa of oral microbes in our study.
Limiting taxa based on oral taxa led to the intriguing finding that a cluster with Erythrobacteraceae family was consistent with the part of sarcoidosis group when excluding “vagrant.” Members of Erythrobacteraceae family are Gram-negative, aerobic, rod-shaped or pleomorphic coccoid bacteria57. They are isolated from wild rice, cold-seep sediment, desert sand, tepid water, seawater, tidal flats, marine sediment, and marine invertebrates. No information on pathogenicity of Erythrobacteraceae for human is available.
We found the effect of the disease groups on the β-diversity with small p-value while other physiological factors did not have. For example, smoking status has been reported to have significant impacts on β-diversity in lung microbiota58, but, it did not have the effect with small p-value in our study.
Several microbes have been reported to associate with AAV or sarcoidosis. As for AAV, a study using quantitative culture experiments reported that Staphylococcus aureus in BALF was particularly associated with patients with GPA compared with patients with idiopathic pulmonary fibrosis (IPF)3. One study using nasal swabs reported decreased Propionibacterium acnes and Staphylococcus epidermidis in patients with GPA when compared with healthy controls59. Another study using nasal swabs reported increased relative abundance of Planococcaceae family and decreased Moraxellaceae, Tissierellaceae, Staphylococcaceae, and Propionibacteriaceae families in GPA when compared with healthy controls48. A study with culture experiments with nasal swabs reported Staphylococcus pseudintermedius in patients with GPA60. As for sarcoidosis, in addition to Mycobacterium7 and Propionibacterium acnes8, Atopobium and Fusobacterium have been reported as candidates for sarcoidosis-associated microbiota in BALF61. However, we could not find any differences in the relative abundance of taxa at family rank corresponding to these taxa between AAV and sarcoidosis in the present study. A research including BALF samples from 16 patients with sarcoidosis and 12 healthy controls also showed no significant microbial differences between two groups62.
We enumerate 4 probable causes of theses inconsistencies of results between ours and previous researches. First, because previous studies of these bacteria used the nasal swabs48,59,60, nasal carriage2, blood samples6, or lymph nodes8,63, these results may be attributable to the differences of specimens. Second, we compared AAV with sarcoidosis although one study compared GPA with IPF3 and others compared AAV or sarcoidosis with healthy controls. Third, because the 16S rRNA gene sequencing approach is reported to be not sensitive in identifying nontuberculous mycobacteria among airway samples64, our attempt to detect mycobacteria using 16S rRNA may be inappropriate. Finally, this study predominantly included MPA, not GPA.
This study has some limitations. First, we could not evaluate BALF specimens of healthy individuals, and we thus could not identify bacteria that are present in both AAV and sarcoidosis but not in healthy subjects. In addition, the absence of data from healthy persons may have affected the interpretation of α-diversity and β-diversity regarding AAV and sarcoidosis. Second, we could not assess the effects of GPA or MPA and MPO-ANCA or PR3-ANCA, because most of the patients with AAV were MPA. Further research is necessary to address the question of whether distinct dysbiosis exists among types of AAV. Third, we could not evaluate the contamination. In samples with very low amounts of microbial biomass including BALF, many of the true signals are masked by contaminating DNA65. The assessment of processes of bronchoscope, extraction of DNA, amplification and library preparation might have been helpful to exclude contamination. Fourth, we did not evaluate the background microbial taxa. The significant intrusion of the background microbiota through the bronchoscopic procedure and the kit may obscure results as suggested66. Fifth, although we used the fourth BALF withdrawal for the DNA extraction, we had no data regarding the microbial differences between the first to third BALF and the fourth BALF. Sixth, the present study was done in a small number of patients to assess lung microbiota from two hospitals in a regional part of Japan. Predominant patients with AAV are MPA and MPO-ANCA positive patients, not GPA nor PR3-ANCA positive patients, which is quite different from those of the US and European countries. Our results may not apply to patients with AAV and sarcoidosis in other regions. Lastly, we used data of oral microbiota from the HMP to select “hard palate inhabitant” and exclude “vagrant.” Because the subjects of HMP are completely different from our subjects in terms of many physiological factors including age and race. These large differences may hamper the evaluation of our data.
In conclusion, α-diversity between AAV and sarcoidosis did not differ, but the observed p-value for the effect of the disease groups on the β-diversity was small. The α-diversity negatively related with BVAS in the AAV group. Limiting taxa to oral taxa or excluding oral taxa made the difference in α-diversity between AAV and sarcoidosis clearer. By excluding oral taxa, we found a cluster with Erythrobacteraceae family which mainly consisted of sarcoidosis. Our results suggested the importance of considering the influence of oral microbiota in evaluating lung microbiota.
Supplementary information
Acknowledgements
This work was supported by the Japan Rheumatism Foundation.
Author contributions
K.I. made substantial contributions to the study’s concept. S.N., H.I., A.H., T.K. and N.S. obtained the BALF. S.F. and K.I. collected the bacterial DNA from BALF. S.S. provided critical advice regarding the collection of bacterial DNA data. S.M., a biostatistician, conducted the statistical analyses. S.F. and S.M. drafted the manuscript with the assistance and supervision of K.I. Authors S.F., K.I., S.N., H.I., A.H., T.K., N.S., Y.T., T.A., T.K., S.K., N.I., M.T., H.N., T.O., Y.U., H.M. and A.K. treated the patients and collected the primary data. K.I. critically revised the manuscript. K.I. and A.K. supervised the entire study and gave final approval of the article. All authors read and approved the final manuscript.
Data availability
Data are available from DDBJ Sequence Read Archive (accession number: PRJDB8270). Data are accessed from the following URLS: (http://trace.ddbj.nig.ac.jp/BPSearch/bioproject?acc=PRJDB8270) (https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJDB8270).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Shoichi Fukui and Shimpei Morimoto.
Supplementary information
is available for this paper at 10.1038/s41598-020-66178-4.
References
- 1.Jennette JC, Falk RJ. Pathogenesis of antineutrophil cytoplasmic autoantibody-mediated disease. Nat Rev Rheumatol. 2014;10:463–473. doi: 10.1038/nrrheum.2014.103. [DOI] [PubMed] [Google Scholar]
- 2.Stegeman CA, et al. Association of chronic nasal carriage of Staphylococcus aureus and higher relapse rates in Wegener granulomatosis. Ann Intern Med. 1994;120:12–17. doi: 10.7326/0003-4819-120-1-199401010-00003. [DOI] [PubMed] [Google Scholar]
- 3.Richter AG, Stockley RA, Harper L, Thickett DR. Pulmonary infection in Wegener granulomatosis and idiopathic pulmonary fibrosis. Thorax. 2009;64:692–697. doi: 10.1136/thx.2008.110445. [DOI] [PubMed] [Google Scholar]
- 4.Baughman RP, Lower EE, du Bois RM. Sarcoidosis. Lancet. 2003;361:1111–1118. doi: 10.1016/S0140-6736(03)12888-7. [DOI] [PubMed] [Google Scholar]
- 5.Chen ES, Moller DR. Etiologies of Sarcoidosis. Clin Rev Allergy Immunol. 2015;49:6–18. doi: 10.1007/s12016-015-8481-z. [DOI] [PubMed] [Google Scholar]
- 6.Almenoff PL, Johnson A, Lesser M, Mattman LH. Growth of acid fast L forms from the blood of patients with sarcoidosis. Thorax. 1996;51:530–533. doi: 10.1136/thx.51.5.530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Song Z, et al. Mycobacterial catalase-peroxidase is a tissue antigen and target of the adaptive immune response in systemic sarcoidosis. J Exp Med. 2005;201:755–767. doi: 10.1084/jem.20040429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ishige I, Usui Y, Takemura T, Eishi Y. Quantitative PCR of mycobacterial and propionibacterial DNA in lymph nodes of Japanese patients with sarcoidosis. Lancet. 1999;354:120–123. doi: 10.1016/S0140-6736(98)12310-3. [DOI] [PubMed] [Google Scholar]
- 9.Cho I, Blaser MJ. The human microbiome: At the interface of health and disease. Nat. Rev. Genet. 2012;13:260–270. doi: 10.1038/nrg3182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Dickson RP, Erb-Downward JR, Huffnagle GB. The role of the bacterial microbiome in lung disease. Expert Rev. Respir. Med. 2013;7:245–257. doi: 10.1586/ers.13.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Scher JU, et al. The lung microbiota in early rheumatoid arthritis and autoimmunity. Microbiome. 2016;4:60. doi: 10.1186/s40168-016-0206-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Human Microbiome Project Consortium Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–14. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jennette JC, et al. 2012 revised International Chapel Hill Consensus Conference Nomenclature of Vasculitides. Arthritis Rheum. 2013;65:1–11. doi: 10.1002/art.37715. [DOI] [PubMed] [Google Scholar]
- 14.Watts R, et al. Development and validation of a consensus methodology for the classification of the ANCA-associated vasculitides and polyarteritis nodosa for epidemiological studies. Ann Rheum Dis. 2007;66:222–227. doi: 10.1136/ard.2006.054593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Statement on sarcoidosis Joint Statement of the American Thoracic Society (ATS), the European Respiratory Society (ERS) and the World Association of Sarcoidosis and Other Granulomatous Disorders (WASOG) adopted by the ATS Board of Directors and by the ER. Am. J. Respir. Crit. Care Med. 1999;160:736–755. doi: 10.1164/ajrccm.160.2.ats4-99. [DOI] [PubMed] [Google Scholar]
- 16.Judson MA. The diagnosis of sarcoidosis. Clin. Chest Med. 2008;29:415–27, viii. doi: 10.1016/j.ccm.2008.03.009. [DOI] [PubMed] [Google Scholar]
- 17.Mukhtyar C, et al. Modification and validation of the Birmingham vasculitis activity score (version 3) Ann. Rheum. Dis. 2009;68:1827–1832. doi: 10.1136/ard.2008.101279. [DOI] [PubMed] [Google Scholar]
- 18.Caporaso JG, et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proc Natl Acad Sci USA. 2011;108(Suppl):4516–4522. doi: 10.1073/pnas.1000080107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Caporaso JG, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7:335–336. doi: 10.1038/nmeth.f.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Rideout JR, et al. Subsampled open-reference clustering creates consistent, comprehensive OTU definitions and scales to billions of sequences. PeerJ. 2014;2:e545. doi: 10.7717/peerj.545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Schiffer L, et al. HMP16SData: Efficient Access to the Human Microbiome Project Through Bioconductor. Am. J. Epidemiol. 2019;188:1023–1026. doi: 10.1093/aje/kwz006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Methé BA, et al. A framework for human microbiome research. Nature. 2012;486:215–221. doi: 10.1038/nature11209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Jost L. Entropy and diversity. Oikos. 2006;113:363–375. [Google Scholar]
- 24.Magurran, A. E. & Mcgill, B. J. Biological Diversity: Frontiers in Measurement and Assessment. (Oxford Univ Pr, 2011).
- 25.Beck J, Holloway JD, Schwanghart W. Undersampling and the measurement of beta diversity. Methods Ecol. Evol. 2013;4:370–382. [Google Scholar]
- 26.Hothorn T, Hornik K, van de Wiel MA, Zeileis A. A Lego System for Conditional Inference. Am. Stat. 2006;60:257–263. [Google Scholar]
- 27.Strasser H, Weber C. The asymptotic theory of permutation statistics. Math. Methods Stat. 1999;8:220–250. [Google Scholar]
- 28.Efron, B. Computer age statistical inference: algorithms, evidence, and data science. (Cambridge University Press, 2016).
- 29.Venables, W. N. & Ripley, B. D. Modern applied statistics with S-PLUS. (Springer Science & Business Media, 2013).
- 30.Sheather SJ, Jones MC. A Reliable Data-Based Bandwidth Selection Method for Kernel Density Estimation. J. R. Stat. Soc. Ser. B. 1991;53:683–690. [Google Scholar]
- 31.Dickson, R. P. et al. Bacterial Topography of the Healthy Human Lower Respiratory Tract. MBio8 (2017). [DOI] [PMC free article] [PubMed]
- 32.Wilson EB. Probable Inference, the Law of Succession, and Statistical Inference. J. Am. Stat. Assoc. 1927;22:209–212. [Google Scholar]
- 33.Kaul A, Mandal S, Davidov O, Peddada SD. Analysis of microbiome data in the presence of excess zeros. Front. Microbiol. 2017;8:1–10. doi: 10.3389/fmicb.2017.02114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Anderson MJ. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 2001;26:32–46. [Google Scholar]
- 35.Amrhein V, Trafimow D, Greenland S. Inferential Statistics as Descriptive Statistics: There Is No Replication Crisis if We Don’t Expect Replication. Am. Stat. 2019;73:262–270. [Google Scholar]
- 36.R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2018).
- 37.Jari, O. et al. vegan: Community Ecology Package. (2018).
- 38.Joseph, N. P., Mihai, P. & Hector Corrada, B. metagenomeSeq: Statistical analysis for sparse high-throughput sequncing. (2013).
- 39.McMurdie PJ, Holmes S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. Plos One. 2013;8:e61217. doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hothorn T, Hornik K, Wiel MAvande, Zeileis A. Implementing a Class of Permutation Tests: The coin Package. J. Stat. Softw. 2008;28:1–23. [Google Scholar]
- 41.Canty, A. & Ripley, B. D. boot: Bootstrap R (S-Plus) Functions. (2019).
- 42.Sada KE, et al. Comparison of severity classification in Japanese patients with antineutrophil cytoplasmic antibody-associated vasculitis in a nationwide, prospective, inception cohort study. Mod Rheumatol. 2016;26:730–737. doi: 10.3109/14397595.2016.1140274. [DOI] [PubMed] [Google Scholar]
- 43.Hattori T, et al. Nationwide survey on the organ-specific prevalence and its interaction with sarcoidosis in Japan. Sci. Rep. 2018;8:1–7. doi: 10.1038/s41598-018-27554-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Schnabel A, Reuter M, Gloeckner K, Müller-Quernheim J, Gross WL. Bronchoalveolar lavage cell profiles in Wegener’s granulomatosis. Respir. Med. 1999;93:498–506. doi: 10.1016/s0954-6111(99)90093-8. [DOI] [PubMed] [Google Scholar]
- 45.Segal LN, et al. Enrichment of lung microbiome with supraglottic taxa is associated with increased pulmonary inflammation. Microbiome. 2013;1:1–12. doi: 10.1186/2049-2618-1-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bassis CM, et al. Analysis of the upper respiratory tract microbiotas as the source of the lung and gastric microbiotas in healthy individuals. MBio. 2015;6:1–10. doi: 10.1128/mBio.00037-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Segal LN, et al. Enrichment of the lung microbiome with oral taxa is associated with lung inflammation of a Th17 phenotype. Nat. Microbiol. 2016;1:1–11. doi: 10.1038/nmicrobiol.2016.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lamprecht P, et al. Changes in the composition of the upper respiratory tract microbial community in granulomatosis with polyangiitis. J. Autoimmun. 2019;97:29–39. doi: 10.1016/j.jaut.2018.10.005. [DOI] [PubMed] [Google Scholar]
- 49.Erb-Downward, J. R. et al. Analysis of the lung microbiome in the ‘healthy’ smoker and in COPD. Plos One6 (2011). [DOI] [PMC free article] [PubMed]
- 50.Galiana A, et al. Sputum microbiota in moderate versus severe patients with COPD. Eur. Respir. J. 2014;43:1787–1790. doi: 10.1183/09031936.00191513. [DOI] [PubMed] [Google Scholar]
- 51.Garcia-Nuñez M, et al. Severity-related changes of bronchial microbiome in chronic obstructive pulmonary disease. J. Clin. Microbiol. 2014;52:4217–4223. doi: 10.1128/JCM.01967-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Yamaguchi T, et al. A case of Wegener’s granulomatosis associated with progressive dysphagia owing to esophageal involvement. Mod. Rheumatol. 2007;17:521–525. doi: 10.1007/s10165-007-0633-4. [DOI] [PubMed] [Google Scholar]
- 53.Abdallah T, et al. Isolated dysphagia unmasking bulbar neurosarcoidosis and pulmonary sarcoidosis. Arab J. Gastroenterol. 2014;15:85–87. doi: 10.1016/j.ajg.2014.03.001. [DOI] [PubMed] [Google Scholar]
- 54.Wang Z, et al. Gastroesophageal Reflux Disease in Idiopathic Pulmonary Fibrosis: Uncertainties and Controversies. Respiration. 2018;96:571–587. doi: 10.1159/000492336. [DOI] [PubMed] [Google Scholar]
- 55.Reynolds HY. Sarcoidosis: impact of other illnesses on the presentation and management of multi-organ disease. Lung. 2002;180:281–299. doi: 10.1007/s004080000104. [DOI] [PubMed] [Google Scholar]
- 56.Matsumoto M, et al. Esophageal involvement in microscopic polyangiitis: A case report and review of literature. Intern. Med. 2007;46:663–668. doi: 10.2169/internalmedicine.46.6115. [DOI] [PubMed] [Google Scholar]
- 57.Rosenberg, E. The prokaryotes: Alphaproteobacteria and betaproteobacteria. The Prokaryotes: Alphaproteobacteria and Betaproteobacteria 1–1012, 10.1007/978-3-642-30197-1 (2013).
- 58.Panzer AR, et al. Lung microbiota is related to smoking status and to development of acute respiratory distress syndrome in critically Ill trauma patients. Am. J. Respir. Crit. Care Med. 2018;197:621–631. doi: 10.1164/rccm.201702-0441OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Rhee RL, et al. Characterisation of the nasal microbiota in granulomatosis with polyangiitis. Ann. Rheum. Dis. 2018;77:1448–1453. doi: 10.1136/annrheumdis-2018-213645. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Kronbichler, A. et al. Nasal carriage of Staphylococcus pseudintermedius in patients with granulomatosis with polyangiitis. Rheumatology (Oxford). 10.1093/rheumatology/key317 (2018). [DOI] [PMC free article] [PubMed]
- 61.Zimmermann, A. et al. Atopobium and Fusobacterium as novel candidates for sarcoidosis-associated microbiota. Eur Respir J 50, (2017). [DOI] [PubMed]
- 62.Clarke EL, et al. Microbial Lineages in Sarcoidosis. A Metagenomic Analysis Tailored for Low-Microbial Content Samples. Am J Respir Crit Care Med. 2018;197:225–234. doi: 10.1164/rccm.201705-0891OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Zhao M-M, et al. High throughput 16SrRNA gene sequencing reveals the correlation between Propionibacterium acnes and sarcoidosis. Respir. Res. 2017;18:28. doi: 10.1186/s12931-017-0515-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Sulaiman, I. et al. Evaluation of the airway microbiome in nontuberculous mycobacteria disease. 10.1183/13993003.00810-2018. [DOI] [PubMed]
- 65.Goffau MCD, et al. Recognizing the reagent microbiome. Nat. Microbiol. 2018;3:851–853. doi: 10.1038/s41564-018-0202-y. [DOI] [PubMed] [Google Scholar]
- 66.Carney SM, et al. Methods in lung microbiome research. Am. J. Respir. Cell Mol. Biol. 2020;62:283–299. doi: 10.1165/rcmb.2019-0273TR. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data are available from DDBJ Sequence Read Archive (accession number: PRJDB8270). Data are accessed from the following URLS: (http://trace.ddbj.nig.ac.jp/BPSearch/bioproject?acc=PRJDB8270) (https://www.ncbi.nlm.nih.gov/bioproject/?term=PRJDB8270).