Skip to main content
Microorganisms logoLink to Microorganisms
. 2020 Aug 6;8(8):1196. doi: 10.3390/microorganisms8081196

Gastric Microbiome Diversities in Gastric Cancer Patients from Europe and Asia Mimic the Human Population Structure and Are Partly Driven by Microbiome Quantitative Trait Loci

Bruno Cavadas 1,2,3,*, Rui Camacho 4,5, Joana C Ferreira 1,2,3, Rui M Ferreira 1,2, Ceu Figueiredo 1,2,6, Alvis Brazma 7, Nuno A Fonseca 8, Luísa Pereira 1,2,6
PMCID: PMC7463948  PMID: 32781641

Abstract

The human gastrointestinal tract harbors approximately 100 trillion microorganisms with different microbial compositions across geographic locations. In this work, we used RNASeq data from stomach samples of non-disease (164 individuals from European ancestry) and gastric cancer patients (137 from Europe and Asia) from public databases. Although these data were intended to characterize the human expression profiles, they allowed for a reliable inference of the microbiome composition, as confirmed from measures such as the genus coverage, richness and evenness. The microbiome diversity (weighted UniFrac distances) in gastric cancer mimics host diversity across the world, with European gastric microbiome profiles clustering together, distinct from Asian ones. Despite the confirmed loss of microbiome diversity from a healthy status to a cancer status, the structured profile was still recognized in the disease condition. In concordance with the parallel host-bacteria population structure, we found 16 human loci (non-synonymous variants) in the European-descendent cohorts that were significantly associated with specific genera abundance. These microbiome quantitative trait loci display heterogeneity between population groups, being mainly linked to the immune system or cellular features that may play a role in enabling microbe colonization and inflammation.

Keywords: gastric microbiome, gastric cancer, European and Asian diversity, biomarkers, microbiome quantitative trait loci, miQTL

1. Introduction

Current estimates place the beginning of the cooperative interaction between microbes and their animal hosts in the last 500 million years [1]. Examples of such close interactions in the gastrointestinal tract are Bacteroidaceae and Bifidobacteriaceae, which have been symbionts of primates for over 15 million years [2]. A big increase in normal human microbiome knowledge is being contributed by large scale studies such as the Human Microbiome Project (HMP, led by the United States of America) [3] and MetaHIT (led by the European Union and China) [4]. In the particular case of the largest human microbiome reservoir, the gastrointestinal tract, it harbors approximately 100 trillion microorganisms in a proportion of approximately 1:1 to human cells [5,6], with up to 100 times more genes than the human genome [4,7].

The microbiome composition is dynamic and is affected by factors such as diet, antibiotics, age and ethnicity. By analysing the gut microbiome in stool samples from BaAka pygmies and its neighbor Bantus, and comparing them with samples collected in the United States, Gomez et al. [8] showed that hunter-gatherers harbored an increased abundance of Prevotellaceae, Treponema, and Clostridiaceae, associated with a profile of increased virulence, and amino acid and vitamin metabolism. The gut microbiome of Bantu was dominated by Firmicutes and displayed western-like features, such as an increased abundance of predictive carbohydrates and xenobiotic metabolic pathways. Nevertheless, the two African groups were more similar to each other in terms of their gut microbiome diversity than either group against the USA cohort. Additionally, there was a loss of (alpha) diversity in the industrialized population, mainly due to the loss of the bacterial taxa involved in fiber processing, coupled with a gain of lineages associated with the consumption of agricultural products. The gut microbiome of the Danish is generally enriched with the phylum Firmicutes, including Oenococcus and other lactic acid bacteria [9], whereas that of the Chinese has greater abundance of Proteobacteria. A genome-wide association study of the gut microbiota in 1812 Germans [10] identified significant associations for the overall microbial variation and individual taxa at multiple loci, especially with the VDR (vitamin D receptor) gene, several disease susceptibility genes and sterol metabolism pathway components. The authors estimate that non-genetic and genetic factors account each for 10% of gut microbiota variations. A recent study conducted in a large cohort of Flemish and German individuals lead to the identification of further genetic associations involving multiple microbial traits, including that between a single nucleotide polymorphism (SNP) on the RAPGEF1 gene, which is involved in the regulation of the gastrointestinal tract’s physiology, and Ruminococcus [11].

Microbiome homeostasis is critical for human health. Changes in the microbiome composition, also known as dysbiosis, have now been associated with several diseases, such as obesity [12], atherosclerosis [13], inflammatory bowel disease [14], Alzheimer’s [15], and cancer [16]. In the gastric cancer setting, studies that characterized the gastric microbiome found complex bacterial communities dominated by different species with a low abundance of H. pylori in the stomach of cancer patients [17,18,19,20]. Divergent microbial profiles associated with cancer were described in these studies, which may be explained in part by the different technical approaches, namely if the original sample was obtained from a gastric juice or mucosal specimen, and also due to the diverse geographic origins of the patients studied.

The Cancer Genome Atlas (TCGA) has collected the largest adenocarcinoma cohort of patients from several countries in Europe (Germany, Poland, Ukraine and Russia), Asia (Vietnam and South Korea), Oceania (Australia) and America (the United States (USA), Canada and Brazil), and makes freely available whole genome/exome sequences (WGS/WES) and transcriptomes [21]. While the number of WGS-typed samples is limited to around 50, RNASeq has been performed in all cancer samples and in around 30 samples from adjacent normal tissue. It has been shown that it is possible to identify and quantify non-human sequenced fragments (including bacteria and viruses) from human-centred WGS [22,23] and RNASeq [23,24]. Another interesting public database is the Genotype-Tissue Expression (GTEx), which provides RNASeq of several tissues from deceased non-cancer patients [25] in the United States, mostly of European descent, including approximately 200 gastric samples. The GTEx has been intensively used to infer expression quantitative trait loci (eQTL) in the various human tissues. Currently, these two databases are the richest in terms of the geographical origin of samples, completeness of human omic characterization, clinical classification, and direct gastric tissue sampling. Nevertheless, the microbiome was not profiled in these samples. In this sense, the inference of the microbial community in the TCGA and GTEx datasets would provide a comprehensive investigation on the role of host diversity in shaping the microbiome dynamics.

In this work, we departed from the TCGA and GTEx RNASeq datasets to investigate and to infer the bacterial microbiome in cancerous and non-cancerous gastric tissues. Furthermore, we evaluated the microbiome variation attending to the human population structure and searched for microbiome quantitative trait loci (miQTL).

2. Materials and Methods

2.1. Samples

Gastric adenocarcinoma RNASeq raw reads, unmapped to the GRCh38 human genome encompassing 375 tumors and 27 matched normal tissues, were obtained from aligned RNASeq data on the TCGA Genomic Data Commons repository (https://gdc.cancer.gov/). For these samples, information regarding their origin, sex, age, ethnicity, pathogenic (type and location of tumor, pathogenicity scoring), and clinical information (treatment and vital status) was available. For this tissue, the geographic collection of samples spanned 11 countries (Australia, Brazil, Canada, Germany, Moldova, Poland, South Korea, Ukraine, the United Kingdom, the United States and Vietnam). Following the same rationale, we have also inferred the microbiome profile of stomach samples of non-disease individuals collected in the United States through the analysis of RNASeq data from the GTEx database [25]. From the 202 stomach GTEx samples, we excluded samples with an ethnicity other than “white” and with clinical evidence of gastritis, acid reflux or other gastric pathologies, leaving a total of 164 samples. The distribution of the samples used in this work is summarized in Supplementary Table S1. The original material was frozen tissue, collected under careful standard operating procedures available at the consortia websites (https://brd.nci.nih.gov/brd/sop-compendium/show/701; https://biospecimens.cancer.gov/resources/sops/default.asp).

This study was carried in accordance with the recommended data access guidelines from the TCGA and GTEx datasets. We received administrative permission for the download of restricted-access data from stomach cancer patients (TCGA) and patients with normal stomachs from deceased individuals (GTEx). Additional approval by the local Ethics Committee was not required.

2.2. Microbiome Inference

A total of 900 bacterial whole genomes (one representative strain per species; Supplementary Table S2) were collected from NCBI following the species identified by the HMP [3] and MetaHit [9] projects in the gastrointestinal tract and complemented the species identified in the works of Rajilić-Stojanović and de Vos [26] and Ferreira et al. [20].

The pipeline used in this manuscript to characterize and quantify the bacterial species is described in detail elsewhere [27], and summed up in Supplementary Figure S1. Briefly, QmihR began by trimming reads using Trimmomatic (v0.36) [28], checking if: (1) the average Phred of two consecutive reads was below 20, and (2) the read length was smaller than 40 bases. These reads were then aligned, by global alignment, against the bacterial reference database with Bowtie2 (v2.2.7) [29] and the quantification of bacterial genera was performed through RSEM (v1.2.29) [30] for the probabilistic assignment of multi-mapping reads. The pipeline aggregates the counts of the mapped reads in the various genes within a species to produce counts of reads aligned per species. After analyses, we limited counting reads to those mapped to bacterial rRNA genes. Reads belonging to the same genera were combined and normalized by the library size for the total mapped reads against the bacterial reference database.

2.3. Checking for Genus Coverage, Richness and Evenness

Estimations of the genus coverage, richness, evenness, and diversity were calculated in R using the Phyloseq package (v1.23.1) [31]. The alpha diversity was determined by the Shannon index. Good’s coverage calculated an estimator of the sample coverage of a community. The Chao1 estimator returned the genus richness based on a vector or matrix of abundance data, reflecting the proportion of singletons and doubletons in the dataset. The evenness reflects the closeness in the number of genera, corresponding to Shannon’s index divided by the number of genera.

2.4. Phylogenetic Analyses

The phylogenetic information for the genera present in our samples was retrieved (on the 20th January 2019) from the Time Tree website (http://www.timetree.org/). This information was used to calculate unweighted and weighted UniFrac distances. The UniFrac distances are beta measures (the mean genera diversity among samples) that take into account the similarities and differences among genera and including the weighted or unweighted genera abundances in calculations [32]. The Bray–Curtis dissimilarity, another distance-based method, takes into account the counts at each site. The unrooted neighbor-joining tree was used to represent the weighted UniFrac distances between TCGA population samples. A permutation multivariate analysis (1000 permutations) of variance (PERMANOVA; [33]) of the weighted UniFrac distances (to reflect the phylogenetic diversity between genera) was also performed and plotted in a non-metric multidimensional scale (nMDS). The null hypothesis tested by PERMANOVA was that the centroids of each group are equivalent, under the assumption of exchangeability of the samples among groups.

A t-distributed stochastic neighbor embedding (t-SNE) analysis based on the normalized values was estimated to check for the batch influence on the beta-diversity. This analysis was conducted in R using Rtsne package (v0.15) [34].

2.5. Evaluating Microbiota Composition in Gastric Tumorigenesis

A differential expression was applied to evaluate the changes in the microbiome abundance (for genera and phyla) by using the DESeq2 package (v1.24.0) [35]. The heatmap reflecting these changes was obtained in R using the gplots (v3.0.1.1) package.

2.6. Host-Genome and Microbiome Associations

The association of host germline genotypes with microbiome genera abundance was tested in the TCGA and GTEx datasets. The tested genera were the ones listed in Figure 1, which displayed frequencies above 1% in at least one population. Genera abundance was further normalized by an arc-sin transformation to obtain normality. These miQTLs were identified by employing Matrix eQTL software (v2.2) [36]. Host germline variants, common to both datasets, were limited to the MAF > 0.05 (minimum allele frequency), non-synonymous type and were in a Hardy–Weinberg equilibrium (p-value > 1 × 10−5). The country of origin, sex, age, sequencing center and year of sample shipment were considered as confounding variables in all analyses. The p-values were subsequently corrected for multiple testing using Storey’s q-value method [37]. To check the FST distances between the human population groups for these variants, we used the African, Asian and European populations from the 1000 Genomes project [38].

Figure 1.

Figure 1

Microbiota composition in gastric cancer and non-cancerous samples. (A) Heatmap of statistically significant differentiated genera and phyla abundance when comparing the control Genotype-Tissue Expression (GTEx) non-cancer gastric microbiome with the European ancestry (Europe and USA) The Cancer Genome Atlas (TCGA) cancer samples. (B) Log2 fold change of the statistically significant differentiated genera and phyla between the non-cancer (negative axis) and cancer (positive axis) cohorts.

3. Results

3.1. Quality Control of RNASeq Data for Microbiome Inference

After retrieving bacterial reads from the transcriptomic data of the TCGA and GTEx and, since batch effects in TCGA samples have been described [39,40], an initial quality control of the data was performed. A characterization based on diversity revealed that samples in batches with a year of shipment between 2010 and 2012 had a statistically significant (p < 2.2 × 10−16) higher diversity than the batches from 2013 and 2014 (median Shannon diversity of 2.92 (2010–2012); 1.31 (2013 to 2014), Supplementary Figure S2A). A detailed inspection of the relative abundance of bacteria per sample showed that some samples had an almost monophyletic microbiome. The most extreme case was found in the samples which were shipped in 2013 and 2014, which contained almost exclusively Pseudomonas (128 samples, mean relative abundance of 95% (Supplementary Table S3)). The samples shipped in 2010, 2011 and 2012 had a more variable microbiome (247 samples in total; with a 20% mean relative abundance of Pseudomonas), mostly made of three similarly frequent genera and many less frequent ones. These results led us to conclude that the 2013 and 2014 batches were possibly contaminated by Pseudomonas (Supplementary Figure S2B) and were thus removed from further analyses. No discernible differences between batches in the GTEx samples were observed (data not shown). Even though the alpha diversity analysis showed similar values of diversity for the 2010, 2011 and 2012 batches, we further assessed the batch-specific enrichment of genera. The results showed (Supplementary Figure S2C) that the 2010 and 2011 batches cluster together into two different sub-clusters, while the 2012 samples showed a more homogeneous distribution. Given the small sample sizes of the 2010 and 2011 batches (n = 37 and n= 73, respectively) comparatively to the 2012 batch (n = 137), we conservatively limited further analyses to the TCGA 2012 cohort (largely European and East Asian samples).

We then checked where in the bacterial genome these reads were mapping to, observing that 81.1% of all the aligned RNASeq reads mapped to the ribosomal RNA (rRNA) genes. On average, 9356 and 7171 reads in the TCGA and GTEx samples, respectively, mapped to rRNA genes with the following equivalent distributions (p = 0.74): 39.66% and 36.47% on 16S rRNA; 60.12% and 63.17% on 23S rRNA; 0.22% and 0.36% on 5S rRNA. To avoid biases due to differentially expressed protein coding-genes, we limited all further analyses to bacterial rRNA-mapped reads.

3.2. Microbiome Composition in Cancer vs. Non-Cancer Samples from European Ancestry

Previous investigations using 16S rRNA-based methods have disclosed differences in the gastric microbiome between patients with and without cancer [18,19,20], mainly in the decreased microbiome diversity in patients with cancer status. Therefore, we inferred the gastric microbiome from the TCGA (cancer cohort) and GTEx (non-cancer cohort) datasets. However, since the GTEx dataset originates from USA individuals, who are mostly of European ancestry (n = 164), we limited this comparison to the TCGA samples from European and USA (n= 83) origins.

We assessed the microbial diversity in our inferred data, through metrics such as Good’s coverage, evenness, alpha diversity (Shannon index) and Chao1 estimator. The median estimated coverage was above 97% in each group (Supplementary Figure S3). The bacterial richness, evenness, and Shannon diversity were significantly lower in cancer than in non-cancer samples (p = 1.1 × 10−9, p < 2.2 × 10−16, and p < 2.2 × 10−16, respectively; Supplementary Figure S3).

To evaluate whether there were differences in taxa that could explain the variation in diversity, we performed a differential analysis of taxa. The results showed that some genera reached statistically significant differences above two log fold changes between non-cancer and cancer cohorts: Bacillus (p = 1.5 × 10−113), Parasutterella (p = 7.5 × 10−27), Brevibacillus (p = 4.9 × 10−18), Fusobacterium (p = 9.4 × 10−14), Enterobacter (p = 8.6 × 10−24), Cloacibacterium (p = 1.48 × 10−17) and Suterella (p = 2.3 × 10−21) were enriched in the microbiome of cancer samples, whereas, Kocuria (p = 2.6 × 10−10), Aeromicrobium (p = 1.0 × 10−10), Stenotrophomonas (p = 6.0 × 10−15), Phyllobacterium (p = 8.6 × 10−24), Brachybacterium (p = 5.4 × 10−10), Staphylococcus (p = 2.5 × 10−22), Blastococcus (p = 5.6 × 10−14), Micrococcus (p = 3.2 × 10−15), Rhodococcus (p = 3.1 × 10−20), Propionibacterium (p = 1.5 × 10−38), Escherichia (p = 7.7 × 10−19), Anaerococcus (p = 3.0 × 10−16), Serratia (p = 6.9 × 10−27), Brevibacterium (p = 4.9 × 10−18) and Nevskia (p = 1.2 × 10−47) were significantly enriched in non-cancer samples (Figure 1A,B). There were no differences in the relative abundance of these genera between the four cancer stages (AJCC pathological tumor staging; Supplementary Figure S4). The differences in genera abundance led to phylum Actinobacteria (p = 9.4 × 10−51) being more abundant (with statistical significance) in the non-cancer cohort (Figure 1A,B).

3.3. Microbiome Profiling in Function of Host Geographic Origin

Taking into consideration that, for the gastric adenocarcinoma, the TCGA cohort includes samples collected from multiple geographic locations, we leveraged this information to evaluate whether the microbiome varied between the main human ancestral backgrounds.

We first checked if the RNASeq-based inference had enough resolution to reliably characterize the microbiome diversity from all the geographic locations available. As can be seen in Figure 2A, Good’s coverage per sample was always higher than 84%, with 57% of samples having this estimate > 98%, and 25% of samples having > 99%. Chao1 estimated a median of 196 and 264 genera for the various populations in the TCGA and GTEx datasets, respectively. Overall, within the TCGA cancer cohort, the bacterial richness and evenness were similar in all populations, and the Shannon index was not significantly different between continents.

Figure 2.

Figure 2

Alpha diversity and phylogenetic analysis of the gastric microbiome. (A) Good’s coverage, Chao1, Shannon and evenness in the inferred microbiome patterns. (B) Phylogenetic tree and respective abundance of genera found in the gastric microbiome (only genera that passed a mean threshold of 1% in at least one population are displayed in the graph). Samples are from non-cancer individuals from the GTEx dataset (United States n = 164, in brink pink) and cancer patients from the TCGA 2012 batch (United States n = 14 in gold; Ukraine n = 24 in limeade, Poland n = 11 in Caribbean green; Russia n = 34 in cerulean; South Korea n = 28 in purple and Vietnam n = 26 in pink).

We next evaluated if the microbiome had a different signature across continents. In agreement with previous descriptions of the gastric microbiome [41,42], genera found in the GTEx and TCGA stomach samples belonged mainly to Proteobacteria, Firmicutes, Actinobacteria, Bacteroidetes and Fusobacteria phyla (Figure 2B). Firmicutes was the dominant phylum observed in Asia, while Proteobacteria was dominant in Europe (53.6% and 39.7%, and 34.0% and 47.3%, respectively). These differences between continents were statistically significant (p = 1.9 × 10−6 and p = 2.9 × 10−5, respectively). No statistically significant differences were observed between the remaining phyla.

Within the cancer cohort, Bacillus was the dominant genus in European and East Asian samples, despite the relative abundance of this genus being significantly higher in the latter (30.4% and 38.1%, respectively; p = 4.1 × 10−3). Bacillus was followed by Pseudomonas (24.2% and 15.9%; p = 7.9 × 10−5), Shigella (5.4% and 4.2%; p = 4.9 × 10−3), and Lactobacillus (2.3% and 6.4%; p = 0.1), while Helicobacter was vestigial (lower than 1%).

Still within the cancer cohort, but considering the genera per country of origin, the USA had a high relative abundance of Fusobacterium (4.0%) and a low relative abundance of Lactobacillus (0.3%), while the opposite was observed in the central-eastern group of Ukraine, Poland and Russia (1.0% and 1.5% in Ukraine, 0.3% and 1.7% in Poland, and 1.6% and 3.9% in Russia). Bacillus accounted for 28% of the reads in samples from Russia. In Asia, the samples from South Korean and Vietnamese had a high relative abundance of Bacillus (40.0% and 37.1%, respectively) and Pseudomonas (10.0% and 22.1%, respectively), but while South Korea had more Lactobacillus (11.0%) and Fusobacterium (2.6%) and less Achromobacter (0.8%), Vietnam had the opposite trend (1.4%, 0.4% and 2.7%, respectively). These profiles are statistically different for the comparisons between European and East Asian groups, as evaluated in the nMDS plot for the weighted UniFrac distances and PERMANOVA analysis (stress = 0.17; r2 = 0.05; p = 0.054; Figure 3A): USA vs. South Korea, p = 0.048; Poland vs. South Korea, p = 0.024; South Korea vs. Vietnam, p = 0.006. By displaying the mean weighted UniFrac distances between groups in a neighbor-joining tree (Figure 3B), it was observed that the distances between the microbiomes of cancer samples from different geographic locations reflect the pattern of the genetic distances between main human populations: European populations clustering together, and East Asian populations in another branch. Thus, the population structure was observed in the microbiome of tumor samples. The microbiome of the non-cancerous samples GTEx (mainly European-Americans) was closer to the European tumor cohort (Figure 3C), reflecting a common human ancestry.

Figure 3.

Figure 3

Gastric cancer microbiome profiling in function of the host geographical origin. (A) non-metric multidimensional scale (nMDS) plot for the weighted UniFrac distances and PERMANOVA analysis (stress = 0.17; r2 = 0.05; p~0.054) between the geographic groups in the cancer cohort (same colors as in Figure 1). (B) Neighbor-joining tree of the mean weighted UniFrac distances between the geographic groups in the cancer cohort. (C) Neighbor-joining tree of the mean weighted UniFrac distances between the geographic groups in the cancer and non-cancer cohorts.

3.4. Identification of Host Genetic Variation Associated with the Microbiome

Table 1 summarizes the 16 significant miQTLs, 12 in the TCGA European and 4 in the non-diseased gastric GTEx cohort. No miQTLs were observed at a q-value lower than 0.1 in the TCGA Asian cohort, most probably due to the low power of the limited Asian sample size. Some of the European miQTLs identified in this study already had a known interaction with the microbiome. The association between the DPH1 gene and Corynebacterium was described before—this gene encodes an enzyme involved in the secondary transformation (diphtamidation) of histidines in elongation factor 2 (EEF2), which is a target of the diphtheria toxin secreted by Corynebacterium diphtheriae [43]. The association between the ZC3H12D gene and Lactobacillus and Fusobacterium may be related with the regulation of the inflammatory response [44]. Other miQTLs found in this study have been previously linked to microbial features, although not directly associated with specific genera. The TRIM31 gene was found to inhibit invasive bacteria by the induction of an autolysosome. The downregulation of this gene by the human cytomegalovirus results in the hyperproliferation of invasive bacteria [45]. Multiple kinesin motors, including KIF24, are required for the formation of Salmonella-induced filaments during infection by Salmonella enterica [46]. The other miQTL are novel and need a more extensive investigation.

Table 1.

Statistically significant (q-value < 0.15) microbiome quantitative trait loci (miQTLs) in the TCGA-European and GTEx cohorts. No miQTLs were identified in the Asian cohort.

SNP Genus GeneID p -Value q -Value Genotype Frequency Fst (p-Values)
MAF_EUR MAF_EAS MAF_AFR EUR_AFR EUR_EAS EAS_AFR
TCGA-EUROPE rs1131600 Corynebacterium DPH1 7.97104 × 10−8 0.01501 G|G: 0.028
A|G: 0.252
A|A: 0.720
G|G: 0.004
A|G: 0.105
A|A: 0.891
A|G: 0.020
A|A: 0.980
0.14323
(0.00000 ± 0.0000)
0.04831
(0.00000 ± 0.0000)
0.0359
(0.00000 ± 0.0000)
TCGA-EUROPE rs61997220 Lactobacillus ZC3H12D 3.07635 × 10−7 0.02414 T|T: 0.907
C|T: 0.093
T|T: 1.000 T|T: 0.995
C|T: 0.005
0.04507
(0.00000 ± 0.0000)
0.04582
(0.00000 ± 0.0000)
0.00113
(0.25978 + 0.0048)
TCGA-EUROPE rs9262143 Acinetobacter PPP1R18 5.68970 × 10−7 0.02414 C|C: 0.845
C|T: 0.151
T|T: 0.004
C|C: 1.000 C|C: 0.995
C|T: 0.005
0.08232
(0.00000 ± 0.0000)
0.07868
(0.00000 ± 0.0000)
0.00113
(0.26255 ± 0.0046)
TCGA-EUROPE rs17350674 Cloacibacterium KIF24 5.97391 × 10−7 0.02414 C|C: 0.614
A|A: 0.050
A|C: 0.336
C|C: 0.988
A|A: 0.002
A|C: 0.010
C|C: 0.980
A|C: 0.020
0.21379
(0.00000 ± 0.0000)
0.19977
(0.00000 ± 0.0000)
0
(0.52688 ± 0.0052)
TCGA-EUROPE rs2219078 Acinetobacter SULT1C3 6.40940 × 10−7 0.02414 G|G: 0.658
A|G: 0.300
A|A: 0.042
G|G: 0.097
A|G: 0.452
A|A: 0.450
G|G: 0.472
A|G: 0.443
A|A: 0.085
0.03291
(0.00000 ± 0.0000)
0.38529
(0.00000 ± 0.0000)
0.24086
(0.00000 ± 0.0000)
TCGA-EUROPE rs2523989 Acinetobacter TRIM31 1.03982 × 10−6 0.03263 C|C: 0.775
C|T: 0.209
T|T: 0.016
C|C: 0.905
C|T: 0.093
T|T: 0.002
C|C: 0.903
C|T: 0.095
T|T: 0.002
0.03296
(0.00000 ± 0.0000)
0.03172
(0.00000 ± 0.0000)
0
(0.99990 ± 0.0000)
TCGA-EUROPE rs7198494 Achromobacter C16orf46 1.75767 × 10−6 0.04728 A|A: 0.648
A|G: 0.294
G|G: 0.058
A|A: 0.962
A|G: 0.036
G|G: 0.002
A|A: 0.526
A|G: 0.389
G|G: 0.085
0.01389
(0.00010 ± 0.0001)
0.15724
(0.00000 ± 0.0000)
0.21452
(0.00000 ± 0.0000)
TCGA-EUROPE rs61997220 Fusobacterium ZC3H12D 2.66174 × 10−6 0.05857 T|T: 0.907
C|T: 0.093
T|T: 1.000 T|T: 0.995
C|T: 0.005
0.04507
(0.00000 ± 0.0000)
0.04582
(0.00000 ± 0.0000)
0.00113
(0.25978 ± 0.0048)
TCGA-EUROPE rs62572859 Lactobacillus C9orf129 2.79953 × 10−6 0.05857 C|C: 0.750
C|T: 0.221
T|T: 0.030
C|C: 0.919
C|T: 0.081
C|C: 0.433
C|T: 0.475
T|T: 0.092
0.09115
(0.00000 ± 0.0000)
0.05751
(0.00000 ± 0.0000)
0.22638
(0.00000 ± 0.0000)
TCGA-EUROPE rs1014867 Lactobacillus FAT4 5.53680 × 10−6 0.09807 C|C: 0.895
C|T: 0.103
T|T: 0.002
C|C: 0.885
C|T: 0.109
T|T: 0.006
C|C: 0.865
C|T: 0.126
T|T: 0.009
0.00188
(0.09425 ± 0.0026)
0
(0.56796 ± 0.0055)
0.00016
(0.33403 ± 0.0052)
TCGA-EUROPE rs1782360 Lactobacillus LRBA 6.08310 × 10−6 0.09807 G|G: 0.855
C|C: 0.012
C|G: 0.133
G|G: 0.754
C|C: 0.022
C|G: 0.224
G|G: 0.495
C|C: 0.077
C|G: 0.428
0.13144
(0.00000 ± 0.0000)
0.01505
(0.00040 ± 0.0002)
0.06804
(0.00000 ± 0.0000)
TCGA-EUROPE rs4963198 Corynebacterium LRRC56 6.84488 × 10−6 0.09914 G|G: 0.109
A|A: 0.437
A|G: 0.453
G|G: 0.022
A|A: 0.768
A|G: 0.210
G|G: 0.319
A|A: 0.194
A|G: 0.487
0.09743
(0.00000 ± 0.0000)
0.11482
(0.00000 ± 0.0000)
0.33553
(0.00000 ± 0.0000)
GTEx rs61733127 Streptococcus PHLPP2 3.22625 × 10−11 0.00001 G|G: 0.024
A|G: 0.249
A|A: 0.728
G|G: 0.012
A|G: 0.109
A|A: 0.879
A|G: 0.056
A|A: 0.944
0.09268
(0.00000 ± 0.0000)
0.03325
(0.00000 ± 0.0000)
0.01635
(0.00000 ± 0.0000)
GTEx rs74344827 Streptococcus TAT 9.87962 × 10−8 0.01160 G|G: 0.700
A|G: 0.274
A|A: 0.026
G|G: 0.905
A|G: 0.087
A|A: 0.008
G|G: 0.663
A|G: 0.309
A|A: 0.029
0.00052
(0.22265 ± 0.0035)
0.06188
(0.00000 ± 0.0000)
0.07456
(0.00000 ± 0.0000)
GTEx rs73229817 Corynebacterium PDLIM2 2.60383 × 10−7 0.02039 C|C: 0.901
C|T: 0.097
T|T: 0.002
C|C: 1.000 C|C: 0.998
C|T: 0.002
0.05466
(0.00000 ± 0.0000)
0.0498
(0.00000 ± 0.0000)
0
(0.99990 ± 0.0000)
GTEx rs12807209 Nevskia MUC6 5.79755 × 10−7 0.03405 G|G: 0.998
C|G: 0.002
G|G: 1.000 G|G: 0.750
C|C: 0.018
C|G: 0.231
0.11679
(0.00000 ± 0.0000)
0
(0.50579 ± 0.0054)
0.11913
(0.00000 ± 0.0000)

Interestingly, most of the miQTLs displayed heterogeneity between population groups, thus explaining in part the microbiome heterogeneity related with human ancestry. The heterogeneity was inferred from the FST distances between population groups based on the genotype frequencies from the 1000 Genomes project [38] (Table 1). All miQTLs except rs1014867-FAT4 showed significant genetic distances between at least two population groups, indicating that these SNPs are very heterogeneous. An illustrative example of this is the association between the rs2219078 SNP located in the SULT1C3 gene with Acinetobacter (Figure 4), with allele A being positively associated with a higher frequency of this bacteria (statistically significant in the TCGA European cohort (False discovery rate (FDR) adjusted p-value of 0.024), and following the same tendency in the TCGA East Asian and GTEx European cohorts). Homozygous AA are relatively rare in Europe (4.2%) and Africa (8.5%), but very frequent in East Asia (45.0%). This SNP was also identified as an eQTL (expression quantitative trait loci) in the stomach (allele A associated with a higher gene expression) by the GTEx project, and the SULT1C3 gene plays a role in sulfotransferase activity, being involved in the biotransformation of xenobiotics [47]. Curiously, several Acinetobacter strains are used in environmental and biotechnological applications for the biodegradation, leaching and removal of several organic and inorganic compounds [48]. The associated human gene and bacteria may have an additive function in the metabolism of xenobiotics.

Figure 4.

Figure 4

Example of an identified microbiome quantitative trait locus. (A) Worldwide genotype frequencies of rs2219078 SNP located in the SULT1C3 gene for 1000 Genomes project populations. (B) Acinetobacter relative abundances of the rs2219078 genotypes in the TCGA European cohort. (C) The TCGA East Asian cohort. (D) The GTEx European-descendent cohort.

4. Discussion

In this work, we have indirectly characterized the gastric microbiome of gastric cancer patients and deceased non-cancer individuals from RNASeq data obtained from the TCGA and GTEx projects, respectively. We demonstrated that the microbiome inference was reliable, even and rich for genera coverage, after a careful evaluation of batch-contamination by Pseudomonas (extending the contamination issues already noticed by [40]) and batch-specific microbiome compositions. Due to the indirect classification from human-based RNASeq data, some inherent limitations were unavoidable. The samples were collected and processed using human specific protocols lacking microbial DNA/RNA-free reagents, microbiome-related negative controls, as well as microbial RNA-specific enrichment steps. RNASeq in both these projects was enriched by poly-A selection. Polyadenylation was thought to be scarcely present in bacteria [49]. However, in contrast to eukaryotic cells, polyadenylation is known to not be restricted to the mRNA in bacteria, with 5S [50], 16S and 23S rRNA also being polyadenylated [51]. Given that these bacterial rRNAs were the most abundant transcripts found in this study, it is not surprising that we were able to identify abundant bacterial reads in the non-human RNASeq databases.

The microbiome inference in large well-characterized cohorts of samples from diverse populations, directly in the stomach tissue, confirmed known evidence and revealed new findings. Specifically, our results confirmed the loss of microbiome diversity in gastric cancer patients [19,20] with a recognizable structured microbiome profile in the disease condition, and uncovered novel relationships between bacteria and host ancestry. Within the European-descendant background, a low microbial diversity in cancer cases was accompanied by a decrease in the abundance of Rhodococcus, Phyllobacterium and Staphylococcus, which was counterbalanced by the increase in Bacillus, Enterobacter, Fusobacterium and Sutterella. At the phylum level, a statistically significant difference in Actinobacteria was observed. The low frequency of non-cancer patients of Asian ancestry in GTEx did not help ascertain whose bacteria could be candidate cancer biomarkers in this ancestry. Using a non-metric multidimensional scaling analysis with the UniFrac distance, we verified the significant differences between the European and East Asian groups of patients, which reinforces the role of geographic origins in shaping the microbiome composition. In fact, these differences observed between continents may help to explain differences in the identification of bacterial species associated with gastric cancer in the various studies [17,18,19,20].

This mimicking of the microbiome diversity in gastric cancer and host diversity across the world is, in part, driven by the associations between microbe genera with human SNPs that are heterogeneous between African, European and Asian populations. Many of these non-synonymous SNPs are located in genes that play a role in the immune response. Our major observations of microbiome structure across the globe, driven by the microbiome quantitative trait loci, reinforce the importance of the microbiome–host binomial. This knowledge is essential for the identification of predictive biomarkers, by which the host and microbe genetic factors must be taken into account to predict microbiome-related conditions. Therefore, we propose that the results of microbiome studies should be interpreted considering the geographic origin and population genetic background of the host.

Acknowledgments

The results shown here are in part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga. The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS

Supplementary Materials

The following supplementary materials are available online at https://www.mdpi.com/2076-2607/8/8/1196/s1, Table S1: Distribution of samples used in this study by cohort, date of collection or DNA extraction (GTEx) and population. Table S2: List of bacterial species used in this study. Figure S1: Schematic representation of the QmihR pipeline. Figure S2: Quality control assessment of the TCGA gastric adenocarcinoma samples based on the raw bacterial counts and alpha diversity (Shannon). Table S3: Percentage of abundance per sample of Helicobacter and of genera that reach a value of 90% (Escherichia, Lactobacillus and Pseudomonas), with information on the country of origin and year of shipment. Figure S3: Genus richness in the inferred microbiome patterns between the GTEx non-cancer samples and the TCGA cancer samples from the sample ancestral background (European and United states). Figure S4: Violin plot representation of the relative abundance of the statistically significant differentiated genera in the four cancer stages (AJCC pathological tumor stating) and GTEx non-cancer dataset.

Author Contributions

All authors declare that everyone contributed adequately to the experimental procedures and writing of the manuscript. Conceptualization, B.C., R.C., N.A.F. and L.P.; methodology, B.C., R.C., R.M.F., C.F., N.A.F. and L.P.; formal analysis, B.C. and J.C.F.; resources, A.B.; writing, B.C., R.C., J.C.F., R.M.F., C.F. A.B., N.A.F. and L.P. All authors have read and agreed to the published version of the manuscript.

Funding

Funding was provided through the project “Advancing cancer research: from basic knowledge to application”; NORTE-01-0145-FEDER-000029; “Projetos Estruturados de I&D&I”, by Norte 2020—Programa Operacional Regional do Norte. N.F. was partially supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 668981. R.M.F. is funded by the “FCT Scientific Employment Stimulus—Individual Call” program (CEECIND/01854/2017).

Conflicts of Interest

R.M.F. and C.F. own patent WO/2018/169423 on microbiome markers for gastric cancer. The remaining authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  • 1.Cho I., Blaser M.J. The human microbiome: At the interface of health and disease. Nat. Rev. Genet. 2012;13:260–270. doi: 10.1038/nrg3182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Moeller A.H., Caro-Quintero A., Mjungu D., Georgiev A.V., Lonsdorf E.V., Muller M.N., Pusey A.E., Peeters M., Hahn B.H., Ochman H. Cospeciation of gut microbiota with hominids. Science. 2016;353:380–382. doi: 10.1126/science.aaf3951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Turnbaugh P.J., Ley R.E., Hamady M., Fraser-Liggett C.M., Knight R., Gordon J.I. The human microbiome project. Nature. 2007;449:804–810. doi: 10.1038/nature06244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Qin J., Li R., Raes J., Arumugam M., Burgdorf K.S., Manichanh C., Nielsen T., Pons N., Levenez F., Yamada T., et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464:59–65. doi: 10.1038/nature08821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Sender R., Fuchs S., Milo R. Are We Really Vastly Outnumbered? Revisiting the Ratio of Bacterial to Host Cells in Humans. Cell. 2016;164:337–340. doi: 10.1016/j.cell.2016.01.013. [DOI] [PubMed] [Google Scholar]
  • 6.Sender R., Fuchs S., Milo R. Revised Estimates for the Number of Human and Bacteria Cells in the Body. PLoS Biol. 2016;14:e1002533. doi: 10.1371/journal.pbio.1002533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Bäckhed F., Roswall J., Peng Y., Feng Q., Jia H., Kovatcheva-Datchary P., Li Y., Xia Y., Xie H., Zhong H., et al. Dynamics and Stabilization of the Human Gut Microbiome during the First Year of Life. Cell Host Microbe. 2015;17:690–703. doi: 10.1016/j.chom.2015.04.004. [DOI] [PubMed] [Google Scholar]
  • 8.Gomez A., Petrzelkova K.J., Burns M.B., Yeoman C.J., Amato K.R., Vlckova K., Modry D., Todd A., Jost Robinson C.A., Remis M.J., et al. Gut Microbiome of Coexisting BaAka Pygmies and Bantu Reflects Gradients of Traditional Subsistence Patterns. Cell Rep. 2016;14:2142–2153. doi: 10.1016/j.celrep.2016.02.013. [DOI] [PubMed] [Google Scholar]
  • 9.Li J., Jia H., Cai X., Zhong H., Feng Q., Sunagawa S., Arumugam M., Kultima J.R., Prifti E., Nielsen T., et al. An integrated catalog of reference genes in the human gut microbiome. Nat. Biotechnol. 2014;32:834–841. doi: 10.1038/nbt.2942. [DOI] [PubMed] [Google Scholar]
  • 10.Wang J., Thingholm L.B., Skiecevičienė J., Rausch P., Kummen M., Hov J.R., Degenhardt F., Heinsen F.A., Rühlemann M.C., Szymczak S., et al. Genome-wide association analysis identifies variation in vitamin D receptor and other host factors influencing the gut microbiota. Nat. Genet. 2016;48:1396–1406. doi: 10.1038/ng.3695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hughes D.A., Bacigalupe R., Wang J., Rühlemann M.C., Tito R.Y., Falony G., Joossens M., Vieira-Silva S., Henckaerts L., Rymenans L., et al. Genome-wide associations of human gut microbiome variation and implications for causal inference analyses. Nat. Microbiol. 2020 doi: 10.1038/s41564-020-0743-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Parekh P.J., Balart L.A., Johnson D.A. The Influence of the Gut Microbiome on Obesity, Metabolic Syndrome and Gastrointestinal Disease. Clin. Transl. Gastroenterol. 2015;6:e91. doi: 10.1038/ctg.2015.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Chistiakov D.A., Bobryshev Y.V., Kozarov E., Sobenin I.A., Orekhov A.N. Role of gut microbiota in the modulation of atherosclerosis-associated immune response. Front. Microbiol. 2015;6:671. doi: 10.3389/fmicb.2015.00671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ribaldone D.G., Caviglia G.P., Abdulle A., Pellicano R., Ditto M.C., Morino M., Fusaro E., Saracco G.M., Bugianesi E., Astegiano M. Adalimumab Therapy Improves Intestinal Dysbiosis in Crohn’s Disease. J. Clin. Med. 2019;8:1646. doi: 10.3390/jcm8101646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.He Y., Li B., Sun D., Chen S. Gut Microbiota: Implications in Alzheimer’s Disease. J. Clin. Med. 2020;9:2042. doi: 10.3390/jcm9072042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Thomas R.M., Jobin C. The Microbiome and Cancer: Is the ‘Oncobiome’ Mirage Real? Trends Cancer. 2015;1:24–35. doi: 10.1016/j.trecan.2015.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dicksved J., Lindberg M., Rosenquist M., Enroth H., Jansson J.K., Engstrand L. Molecular characterization of the stomach microbiota in patients with gastric cancer and in controls. J. Med. Microbiol. 2009;58:509–516. doi: 10.1099/jmm.0.007302-0. [DOI] [PubMed] [Google Scholar]
  • 18.Aviles-Jimenez F., Vazquez-Jimenez F., Medrano-Guzman R., Mantilla A., Torres J. Stomach microbiota composition varies between patients with non-atrophic gastritis and patients with intestinal type of gastric cancer. Sci. Rep. 2014;4:4202. doi: 10.1038/srep04202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Coker O.O., Dai Z., Nie Y., Zhao G., Cao L., Nakatsu G., Wu W.K., Wong S.H., Chen Z., Sung J.J.Y., et al. Mucosal microbiome dysbiosis in gastric carcinogenesis. Gut. 2018;67:1024–1032. doi: 10.1136/gutjnl-2017-314281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ferreira R.M., Pereira-Marques J., Pinto-Ribeiro I., Costa J.L., Carneiro F., Machado J.C., Figueiredo C. Gastric microbial community profiling reveals a dysbiotic cancer-associated microbiota. Gut. 2018;67:226–236. doi: 10.1136/gutjnl-2017-314205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Cancer Genome Atlas Research Network Comprehensive molecular characterization of gastric adenocarcinoma. Nature. 2014;513:202–209. doi: 10.1038/nature13480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kostic A.D., Gevers D., Pedamallu C.S., Michaud M., Duke F., Earl A.M., Ojesina A.I., Jung J., Bass A.J., Tabernero J., et al. Genomic analysis identifies association of Fusobacterium with colorectal carcinoma. Genome Res. 2012;22:292–298. doi: 10.1101/gr.126573.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Zhang W., Meehan J., Su Z., Ng H.W., Shu M., Luo H., Ge W., Perkins R., Tong W., Hong H. Whole genome sequencing of 35 individuals provides insights into the genetic architecture of Korean population. BMC Bioinform. 2014;15(Suppl. 11):S6. doi: 10.1186/1471-2105-15-S11-S6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Castellarin M., Warren R.L., Freeman J.D., Dreolini L., Krzywinski M., Strauss J., Barnes R., Watson P., Allen-Vercoe E., Moore R.A., et al. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Res. 2012;22:299–306. doi: 10.1101/gr.126516.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.GTEx Consortium Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science. 2015;348:648–660. doi: 10.1126/science.1262110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rajilić-Stojanović M., de Vos W.M. The first 1000 cultured species of the human gastrointestinal microbiota. FEMS Microbiol. Rev. 2014;38:996–1047. doi: 10.1111/1574-6976.12075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Cavadas B., Ferreira J., Camacho R., Fonseca N., Pereira L. QmihR: Pipeline for Quantification of Microbiome in Human RNA-seq; Proceedings of the 11th International Conference on Practical Applications of Computational Biology & Bioinformatics, PACBB 2017; Porto, Portugal. 21–23 June 2017; pp. 173–179. [Google Scholar]
  • 28.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Li B., Dewey C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.McMurdie P.J., Holmes S. phyloseq: An R package for reproducible interactive analysis and graphics of microbiome census data. PLoS ONE. 2013;8:e61217. doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hamady M., Lozupone C., Knight R. Fast UniFrac: Facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data. ISME J. 2010;4:17–27. doi: 10.1038/ismej.2009.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Anderson M. A new method for non-parametric multivariate analysis of variance. Aust. J. Ecol. 2001;26:2–46. doi: 10.1111/j.1442-9993.2001.tb00081.x. [DOI] [Google Scholar]
  • 34.Krijthe J.H. Rtsne: T-Distributed Stochastic Neighbor Embedding Using Barnes-Hut Implementation. [(accessed on 28 August 2019)];2015 Available online: https://github.com/jkrijthe/Rtsne.
  • 35.Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Shabalin A.A. Matrix eQTL: Ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012;28:1353–1358. doi: 10.1093/bioinformatics/bts163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Storey J.D., Tibshirani R. Statistical significance for genomewide studies. Proc. Natl. Acad. Sci. USA. 2003;100:9440–9445. doi: 10.1073/pnas.1530509100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Auton A., Brooks L.D., Durbin R.M., Garrison E.P., Kang H.M., Korbel J.O., Marchini J.L., McCarthy S., McVean G.A., Abecasis G.R. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Cantalupo P.G., Katz J.P., Pipas J.M. HeLa nucleic acid contamination in the cancer genome atlas leads to the misidentification of human papillomavirus 18. J. Virol. 2015;89:4051–4057. doi: 10.1128/JVI.03365-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Robinson K.M., Crabtree J., Mattick J.S., Anderson K.E., Dunning Hotopp J.C. Distinguishing potential bacteria-tumor associations from contamination in a secondary data analysis of public cancer genome sequence data. Microbiome. 2017;5:9. doi: 10.1186/s40168-016-0224-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Bik E.M., Eckburg P.B., Gill S.R., Nelson K.E., Purdom E.A., Francois F., Perez-Perez G., Blaser M.J., Relman D.A. Molecular analysis of the bacterial microbiota in the human stomach. Proc. Natl. Acad. Sci. USA. 2006;103:732–737. doi: 10.1073/pnas.0506655103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Delgado S., Cabrera-Rubio R., Mira A., Suárez A., Mayo B. Microbiological survey of the human gastric ecosystem using culturing and pyrosequencing methods. Microb. Ecol. 2013;65:763–772. doi: 10.1007/s00248-013-0192-5. [DOI] [PubMed] [Google Scholar]
  • 43.Tsuda-Sakurai K., Miura M. The hidden nature of protein translational control by diphthamide: The secrets under the leather. J. Biochem. 2019;165:1–8. doi: 10.1093/jb/mvy071. [DOI] [PubMed] [Google Scholar]
  • 44.Matsushita K., Takeuchi O., Standley D.M., Kumagai Y., Kawagoe T., Miyake T., Satoh T., Kato H., Tsujimura T., Nakamura H., et al. Zc3h12a is an RNase essential for controlling immune responses by regulating mRNA decay. Nature. 2009;458:1185–1190. doi: 10.1038/nature07924. [DOI] [PubMed] [Google Scholar]
  • 45.Ra E.A., Lee T.A., Won Kim S., Park A., Choi H.J., Jang I., Kang S., Hee Cheon J., Cho J.W., Eun Lee J., et al. TRIM31 promotes Atg5/Atg7-independent autophagy in intestinal cells. Nat. Commun. 2016;7:11726. doi: 10.1038/ncomms11726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kaniuk N.A., Canadien V., Bagshaw R.D., Bakowski M., Braun V., Landekic M., Mitra S., Huang J., Heo W.D., Meyer T., et al. Salmonella exploits Arl8B-directed kinesin activity to promote endosome tubulation and cell-to-cell transfer. Cell Microbiol. 2011;13:1812–1823. doi: 10.1111/j.1462-5822.2011.01663.x. [DOI] [PubMed] [Google Scholar]
  • 47.Daniels J., Kadlubar S. Sulfotransferase genetic variation: From cancer risk to treatment response. Drug Metab. Rev. 2013;45:415–422. doi: 10.3109/03602532.2013.835621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Abdel-El-Haleem D. Acinetobacter: Environmental and biotechnological applications. Afr. J. Biotechnol. 2003;2:71–74. [Google Scholar]
  • 49.Dreyfus M., Régnier P. The poly(A) tail of mRNAs: Bodyguard in eukaryotes, scavenger in bacteria. Cell. 2002;111:611–613. doi: 10.1016/S0092-8674(02)01137-6. [DOI] [PubMed] [Google Scholar]
  • 50.Li Z., Pandit S., Deutscher M.P. Polyadenylation of stable RNA precursors in vivo. Proc. Natl. Acad. Sci. USA. 1998;95:12158–12162. doi: 10.1073/pnas.95.21.12158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Mohanty B.K., Kushner S.R. Analysis of the function of Escherichia coli poly(A) polymerase I in RNA metabolism. Mol. Microbiol. 1999;34:1094–1108. doi: 10.1046/j.1365-2958.1999.01673.x. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Microorganisms are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES