Skip to main content
BioMed Research International logoLink to BioMed Research International
. 2017 Aug 20;2017:8184160. doi: 10.1155/2017/8184160

Evaluation of the Bacterial Diversity in the Human Tongue Coating Based on Genus-Specific Primers for 16S rRNA Sequencing

Beili Sun 1, Dongrui Zhou 2, Jing Tu 1, Zuhong Lu 1,2,*
PMCID: PMC5585543  PMID: 28904972

Abstract

The characteristics of tongue coating are very important symbols for disease diagnosis in traditional Chinese medicine (TCM) theory. As a habitat of oral microbiota, bacteria on the tongue dorsum have been proved to be the cause of many oral diseases. The high-throughput next-generation sequencing (NGS) platforms have been widely applied in the analysis of bacterial 16S rRNA gene. We developed a methodology based on genus-specific multiprimer amplification and ligation-based sequencing for microbiota analysis. In order to validate the efficiency of the approach, we thoroughly analyzed six tongue coating samples from lung cancer patients with different TCM types, and more than 600 genera of bacteria were detected by this platform. The results showed that ligation-based parallel sequencing combined with enzyme digestion and multiamplification could expand the effective length of sequencing reads and could be applied in the microbiota analysis.

1. Introduction

The complex microbial flora living on or within the human body has long been proposed to contribute to the human health as well as disease [18] (Eckburg et al., 2005). Using culturing or unculturing methodology, over 25,000 bacterial phylotypes and over 700 prevalent taxa at the species level have been identified in the oral microbiome, colonizing in the oral cavity including teeth, gingival sulcus, tongue, cheeks, palates, and tonsils [920] (Keijser et al., 2008; Nasidze et al., 2009).

As a reservoir for oral microorganisms, food scraps, saliva, and shed epithelial cells, the human tongue has been investigated showing significant association with the microbial communities of the gut and diseases such as gastritis or halitosis [11, 2124]. In traditional Chinese medicine (TCM), according to the color and thickness, the human tongue coating can be divided into a few types consisting of thin-white coating, thick-white coating, sticky-white coating, thin-yellow coating, and so forth [25]. The color and shape of tongue coating may reflect the composition of the bacteria colonizing the tongue dorsum. Several researches have considered the association between tongue coating microbiome and traditional tongue diagnosis [2629]. Jiang et al. investigated 19 gastritis patients with a TCM Cold Syndrome or TCM Hot Syndrome tongue coating and 8 healthy controls by Illumina paired-end, double-barcode 16S rRNA V6 tag sequencing. Han et al. sequenced the V2–V4 region of 16S rRNA gene by pyrosequencing to investigate the tongue coating microbiome in patients with colorectal cancer and healthy controls. Their results indicated that the richness of the bacterial communities in the patients with thin tongue coating and healthy controls was higher than in the patients with thick tongue coating.

In the past few years, pyrosequencing of the 16S rRNA gene and sequencing-by-synthesis of metagenomics have been the widely applied technologies in the study of microbial communities. [3039]. Pyrosequencing technology has the benefits of relatively long length of sequencing read and the drawbacks of high reagent cost and high error rates in homopolymer repeats. Metagenomic analysis requires deep sequencing data mining and large amounts of sequencing reads for gene assembly and annotation [24, 27, 40]. Ligation-based sequencing technology provides inherent error correction by two-base encoding, which makes the platform much more accurate [34]. Due to the short sequencing length, this methodology has a few restrictions in the microbiome diversity research.

In the present study, we develop an effective approach using multigroup amplification and massively parallel ligation-based sequencing technology to determine the bacterial diversity. In addition, we compare the tongue coating microbial diversities of different TCM tongue coating types using this method.

2. Materials and Methods

2.1. Samples

Tongue coating samples were collected from two groups of lung cancer patients in Nanjing Chest Hospital. Each group represented a specific TCM tongue coating type: the thin-white type and the white-greasy type (Figure 1). A total of 13 subjects were collected in the morning and all the volunteers had no breakfast before sampling. All subjects had no oral inflammation and had refrained from brushing teeth and drinking colored beverage for 16 hours before testing. The tongue coating samples were collected with sterilized cotton swabs, mixed with 1 ml of Ringer's solution, and stored in −20°C immediately until extracting the total microbial genome DNA. All the 13 samples were investigated by DGGE analysis (data not shown). Based on the DGGE results, the bacteria composition was significantly different between different tongue coating types. Six typical samples from two tongue coating types were chosen for thorough parallel sequencing, and the clinical parameters of subjects were shown in Table 1.

Figure 1.

Figure 1

Thin-white tongue coating type (a) and white-greasy tongue coating type (b) in TCM theory.

Table 1.

Clinical parameters and barcode primer sequences of study subjects.

Subjects ID Gender Age TCM tongue coating type Barcode sequence (5′ to 3′)
A4 Male 30 Thin-white CTGCCCCGGGTTCCTCATTCTCTAAGCCCCTGCTGTACGGCCAAGGCG
A5 Male 58 Thin-white CTGCCCCGGGTTCCTCATTCTCTCACACCCTGCTGTACGGCCAAGGCG
A6 Female 72 Thin-white CTGCCCCGGGTTCCTCATTCTCTTCCCTTCTGCTGTACGGCCAAGGCG
B2 Male 57 White-greasy CTGCCCCGGGTTCCTCATTCTCTCCGATTCTGCTGTACGGCCAAGGCG
B3 Female 34 White-greasy CTGCCCCGGGTTCCTCATTCTCTTCGTTGCTGCTGTACGGCCAAGGCG
B4 Male 62 White-greasy CTGCCCCGGGTTCCTCATTCTCTGGGCACCTGCTGTACGGCCAAGGCG

2.2. Multiple Group PCR Primers Design

The multigroup PCR primers were designed based on Human Oral Microbiome Database (HOMD) (http://www.homd.org/). The 16S rRNA gene sequences of a genus containing 4 species at least in HOMD were picked out as a group and aligned by MEGA (v 4.0.2). Totally, 421 species belonging to 27 genera were selected. The conserved regions flanking the variable region can be used for the multigroup PCR primers design. A specific group primer consisted of 22 bp of sequences containing Eco57I recognition site and 20 bp of conserved region sequences (Figure 2). Table 2 indicates all the 26 pairs of group primers generated.

Figure 2.

Figure 2

Schematic diagram of the primer design and library preparation. All the 16S rRNA gene sequences from the same genera were aligned to generate 20 bp of the conserved forward sequence (CF) and conserved reverse sequence (CR) flanking the variable region. The forward primer (PF) and reverse primer (PR) were designed by adding 6 bp of Eco57I recognition site and 16/18 bp of auxiliary sequence at the 5′ end of CF and CR. The microbial genome DNA of each subject was amplified by all the pairs of primers from different genus, producing the amplicons containing the Eco57I cutting site (CS) which was 16 bp downstream of the recognition site. Followed by Eco57I digestion, products generated by all the genus-specific primers, conserving 4 bp of each primer sequence, were mixed together to produce the sequencing library. The results of ligation-based sequencing were millions of reads, which were the first 35 bases at the 5′ end from either forward strand or reverse strand of library.

Table 2.

The designed multigroup PCR primers.

Primer ID Genus name Primer sequence (5′ - 3′) T m (°C) Amplicon length (bp)
ActF Actinomyces Pa-Eb-GCGAAGAACCTTACCAAGGC 56.2 142
ActR Actinomyces Qc-E-TGACGACAACCATGCACCAC
PreF Prevotella P-E-GAACCTTACCCGGGCTTGAA 54.2 138
PreR Prevotella Q-E-TGACGACAACCATGCAGCAC
StrF Streptococcus P-E-AACGATAGCTAATACCGCAT 46.1 139
StrR Streptococcus Q-E-TAATACAACGCAGGTCCATC
TreF Treponema P-E-CGCGAGGAACCTTACCTGGG 53.7 137
TreR Treponema Q-E-ACGACAGCCATGCAGCACCT
LepF Leptotrichia P-E-ACGCGAGGAACCTTACCAGA 53.5 139
LepR Leptotrichia Q-E-CAGCCATGCACCACCTGTCT
NeiF Neisseria P-E-CGGGTTGTAAAGGACTTTTG 49.2 135
NeiR Neisseria Q-E-AGTTAGCCGGTGCTTATTCT
SelF Selenomonas P-E-CTTCGGATCGTAAAGCTCTG 51.4 142
SelR Selenomonas Q-E-TAGTTAGCCGTGGCTTCCTC
CapF Capnocytophaga P-E-TACGCGAGGAACCTTACCAA 51.3 138
CapR Capnocytophaga Q-E-ACAACCATGCAGCACCTTGA
PorF Porphyromonas P-E-CAGCCAAGTCGCGTGAAGGA 54.0 172
PorR Porphyromonas Q-E-CTGGCACGGAGTTAGCCGAT
FusF Fusobacterium P-E-ACGCGTAAAGAACTTGCCTC 51.4 75
FusR Fusobacterium Q-E-ACGCGTAAAGAACTTGCCTC
MycF Mycoplasma P-E-GATGGAGCGACACAGCGTGC 55.2 187
MycR Mycoplasma Q-E-GCGGCTGCTGGCACATAGTT
DiaF Dialister P-E-CGGAATTATTGGGCGTAAAG 53.1 161
DiaR Dialister Q-E-CTTTCCTCTCCGATACTCCA
EubF Eubacterium P-E-GATACCCTGGTAGTCCACGC 51.6 146
EubR Eubacterium Q-E-CTCCCCAGGTGGAATACTTA
FirF Firmicutes P-E-GATACCCTGGTAGTCCACGC 51.6 146
FirR Firmicutes Q-E-CTCCCCAGGTGGAATACTTA
PepsF Peptostreptococcus P-E-TAATTCGAAGCAACGCGAAG 52.7 154
PepsR Peptostreptococcus Q-E-CGACAACCATGCACCACCTG
KinF Kingella P-E-GGAATTACTGGGCGTAAAGC 52.4 168
KinR Kingella Q-E-AATTCTACCCCCCTCTGACA
PepnF Peptoniphilus P-E-ATCACTGGGCGTAAAGGGTT 51.1 186
PepnR Peptoniphilus Q-E-CGCATTTCACCGCTACACTA
LachF Lachnospiraceae P-E-CAACGCGAAGAACCTTACCA 54.2 146
LachR Lachnospiraceae Q-E-GACGACAACCATGCACCACC
PasF Aggregatibacter b P-E-CGGGTTGTAAAGTTCTTTCG 48.4 135
PasR Aggregatibacter Q-E-TTAGCCGGTGCTTCTTCTGT
PasF Haemophilus P-E-CGGGTTGTAAAGTTCTTTCG 48.4 135
PasR Haemophilus Q-E-TTAGCCGGTGCTTCTTCTGT
LactF Lactobacillus P-E-CCGCAACGAGCGCAACCCTT 55.3 103
LactR Lactobacillus CCTCCGGTTTGTCACCGGCA
TM7F TM7 P-E-GGGCGTAAAGAGTTGCGTAG 49.2 185
TM7R TM7 Q-E-TACGGATTTCACTCCTACAC
GemF Gemella AAAGCTCTGTTGTTAGGGAA 46.8 98
GemR Gemella P-E-GGTGGCTTTCTGGTTAGGTA
PseF Pseudomonas GGCGGCAGGCCTAACACATG 56.5 99
PseR Pseudomonas P-E-TTACTCACCCGTCCGCCGCT
VeiF Veillonella P-E-GTAAAGCTCTGTTAATCGGG 48.1 100
VeiR Veillonella GTGGCTTTCTATTCCGGTAC
MogF Mogibacterium P-E-CACGTGCTACAATGGTCGGT 50.7 101
MogR Mogibacterium ATCCGAACTGGGATCGGTTT
CatF Catonella P-E-CATGCAAGTCGAACGGAGAT 50.8 101
CatR Catonella GTTACTCACCCGTCCGCCAC

aCCAAGGCGGCCGTACG; bCTGAAG; cCCGACGTCGACTATCCAT.

2.3. DNA Extraction, Amplification, and Ligation Sequencing Library Preparation

All the samples were centrifuged and resuspended in 1 ml of 10x TE buffer (pH 8). The suspension was mixed with 100 μl of lysozyme (200 mg/ml) and incubated at 37°C for 1 h. Lysis solution containing 50 μl of SDS (10%, v/v) and 20 μl of Proteinase K (20 mg/ml) was added and the mixture was incubated at 55°C for 3 h. The total bacterial genome DNA was obtained by phenol-chloroform extraction and ethanol precipitation.

The PCR amplification was carried out separately by each of group primers for each sample with the same condition except for the annealing temperature. All the PCR reactions were performed in a final volume of 25 μl containing 1x PCR buffer (Takara), 2.5 mM MgCl2, 200 μM each dNTP, 2.5 U of Taq polymerase, 20 ng of template DNA, and 1 μM each forward and reverse primer. The PCR mixtures were initially denatured at 95°C for 3 min, followed by 30 cycles: 45 s at 94°C, 45 s at (Tm − 5)°C, and 45 s at 72°C. The PCR amplicons were digested using the enzyme Eco57I (Fermentas, Burlington, Canada) according to manufacturer's instructions. The expected fragment DNA was acquired by agarose gel electrophoresis and purified by QIAEX II Gel Extraction Kit (Qiagen, Hilden, Germany). PCR amplicon mixture was prepared by pooling approximately equal amounts of recovered fragments from the same sample. After blunting the 3′ protruding termini of mixtures by T4 DNA polymerase, four sequencing libraries were constructed for the ligation-based sequencing system.

2.4. Massively Parallel Ligation Sequencing

In order to maximize the sequencing capacity and simplify the workflow of sample preparation, all the samples were operated in a single sequencing run with barcodes added. Six barcodes were ligated to the 3′ end of the library templates with T4 DNA ligase as unique adaptors (Table 1).

SOLiD P1 adaptors (5′-CCACTACGCCTCCGCTTTCCTCTCTATGGGCAGTCGGTGAT-3′) were ligated at the 5′ end of templates, followed by standard SOLiD library preparation protocol.

2.5. Construction of Reference Sequences

Due to the characteristic of 2-base encoding in SOLiD sequencing, the sequencing reads are in color-space format “0, 1, 2, 3,” which stands for the permutation of the adjacent bases. The brief data processing pipeline in our study was designed as Figure 2 shows, because the novel microbial analysis methods are inappropriate. A total of 1,049,433 unaligned 16S rDNA sequences of both bacteria and archaea were downloaded from Ribosomal Database Project (RDP) (http://rdp.cme.msu.edu) resource (release 10, update 13). Using in silico analysis of amplification by the designed primers and endonuclease digestion, 26 groups of fragments were selected. Repetitive sequences were moved to construct 26 groups of reference sequences (REF-DB). Taxonomies of REF-DB (TAXA-DB) were assigned using the original full-length 16S rRNA gene sequences by the RDP online classifier (http://rdp.cme.msu.edu/classifier/classifier.jsp) at 80% confidence cut-off. If the reference was yielded from more than one full-length 16S rRNA gene sequence, each of the original sequences was assigned separately. From the genus level, the taxonomy shared by three-fourths majority of the full length sequences was defined as the taxonomy of the reference. Otherwise, taxonomy of higher level was compared until the domain level.

2.6. Analysis of Sequencing Data

Sequencing reads were split into six samples according to the 4-mer barcodes in P2 adaptor. In color space, all the samples were aligned separately to REF-DB by Corona Lite Program (v4.2, http://solidsoftwaretools.com) with up to 3 color-space mismatches. For unique reads (uniquely placed matches), a few steps of filtering were performed. Firstly, the matching position should be at the 5′ end of the reference sequence (forward-strand matching) or reverse compliment (reverse-strand matching). Meanwhile, the top four positions of reads were checked with one mismatch in color space to filter out nonspecific amplification or false endonuclease-digested products. Secondly, the abundance of reads matching “forward-strand” and “reverse-strand” references was counted separately and the reads with less than 5 counts in both strands were discarded. Finally, Operational Taxonomic Units (OTUs) were identified according to matched sequences in the REF-DB and corresponding taxonomy information in TAXA-DB. For the nonunique reads that matched the reference in more than one location, all of the matched references were assigned at the phylum level according to TAXA-DB. A matched read was assigned to a specific taxonomy, while all the references it matched belonged to accordant phylum. Otherwise, the read was denoted as “undefined.” Both the unique and nonunique matched reads were summarized by a set of Perl scripts.

2.7. Phylogenetic Analysis

The original full-length sequences of matched OTUs were aligned using ClustalW, and a relaxed neighbor-joining tree was built by PHILIP 3.68. Weighted and unweighted UniFrac were run using the resulting tree and environment annotation of number of hits. PCA was performed on the resulting matric of distances between each pair of samples.

2.8. Prediction of Metagenome from 16S rRNA Gene Data

The PICRUSt project aims to support prediction of the unobserved character states in a community of organisms from phylogenetic information about the organisms in that community. This program was used to predict metagenome abundance from 16S rRNA gene data.

3. Results

3.1. Sequencing Performance of SOLiD Reads

One-quarter of slide was used to perform the ligation-based sequencing, and 84,399,432 reads of 35-base length (2.75 gigabases in total) were captured after removing sequences of insufficient quality. Splitted by 4-mer barcodes, a total number of 75,115,494 reads (89%) were generated (Table 3). The difference of sequencing throughput among six samples was probably due to the sample quality or the procedure of library preparation.

Table 3.

Statistical results of SOLiD sequencing tags mapping with RDP references.

Subject ID Raw reads Matched reads Uniquely matched reads
Counts Frequency Counts Frequency
A4 12,455,117 2,043,364 16.41% 119,454 0.96%
A5 5,043,037 861,788 17.09% 63,751 1.26%
A6 27,658,749 4,965,785 17.95% 418,203 1.51%
B2 10,171,189 1,406,962 13.83% 160,263 1.58%
B3 8,938,854 1,968,093 22.02% 96,206 1.08%
B4 10,848,548 1,915,504 17.66% 119,292 1.1%

3.2. Taxa Assignment of the Sequencing Reads

Since SOLiD system employs 2-base encoding and color-space strategy, a single color change is a measurement error, two adjacent color changes may result from a single nucleotide variation in base space, and three color-space mismatches might imply two adjacent variants compared to reference. We used up to three color changes in 35-base length as the alignment parameter, which implied at most two nucleotide mismatches. Consequently, the sequence similarity was more than 94%, which corresponded to the genus level classification. Aligned to REF-DB, matched and uniquely matched reads were summarized in Table 3.

To increase the accuracy of taxonomy assignment, two steps of validation were performed before OTUs definition as Methods described. Altogether, we discarded 33.75% of the uniquely matched reads, leaving 647,375 sequencing reads for taxonomy analysis. Compared to TAXA-DB, number of OTUs was identified, ranging from 409 to 669 for six samples, respectively. Most of the OTUs (more than 93%) could be assigned at the genus or lower level, while a very small amount of OTUs was just assigned at the phylum or higher level (Table 4).

Table 4.

OTUs assignment of unique tags based on RDP classifier.

OTU A4 A5 A6 B2 B3 B4
Number PCT (%) Number PCT (%) Number PCT (%) Number PCT (%) Number PCT (%) Number PCT (%)
Genus 548 93.20 406 94.90 495 95.93 464 94.90 388 94.90 637 95.22
Family 15 2.55 11 2.57 10 1.94 12 2.45 10 2.45 17 2.54
Order 9 1.53 3 0.70 4 0.78 4 0.73 3 0.73 6 0.90
Class 9 1.53 3 0.70 6 1.16 7 1.22 5 1.22 8 1.20
Phylum 1 1.70 0 0 0 0 1 0.24 1 0.24 0 0
Domain 6 1.02 5 1.17 1 0.19 3 0.49 2 0.49 1 0.15
Total OTUs 588 428 516 491 409 669

3.3. Microbiome Diversity of the Tongue Coating Samples

The dominant phyla (relative abundance > 1%) across all samples were Firmicutes (55.19%  ± 16.00%), Proteobacteria (26.11%  ± 14.85%), Bacteroidetes (14.80%  ± 2.36%), Actinobacteria (2.03%  ± 1.45%), and Chlorobi (1.17%  ± 0.79%). Ultimately, we identified 209 genera from the six samples, and the abundance of each genus was defined in Methods. As shown in Figure 3, 92 genera were assigned to phylum Firmicutes, with the maximum abundance as well. Phyla Proteobacteria and Bacteroidetes contained some dominant genera in our results, while only 68 and 30 genera were assigned to these two phyla separately. The fifteen most abundant genera of each sample were illustrated in Figure 4, with total abundance ranging from 70.2% to 82%. There were 163 genera assigned from 79,762 reads for sample A4, and the most abundant genera were Haemophilus (13.38%, Proteobacteria), Haliscomenobacter (10.23%, Bacteroidetes), Enterococcus (9.50%, Firmicutes), Streptococcus (5.56%, Firmicutes), and Acetanaerobacterium (4.17%, Firmicutes). For sample A5, 134 genera were identified from 41,580 reads, having the most abundant genera of Haemophilus (11.40%), Haliscomenobacter (7.81%), Bacillus a. (6.40%, Firmicutes), Streptococcus (5.50%), and Enterococcus (4.62%). There were 141 genera assigned from 242,085 reads for sample A6, and the most abundant genera were Ralstonia (17.31%, Proteobacteria), Anaerostipes (10.32%, Firmicutes), Bacillus a (9.53%, Firmicutes), Haemophilus (4.96%), and Coprococcus (3.69%, Firmicutes). For white-greasy tongue coating type samples, for sample B2, 119 genera were found within 155,093 reads, and the dominant genera were Roseburia (26.91%, Firmicutes), Anaerotruncus (9.8%, Firmicutes), Coprococcus (7.91%), Sphingobacterium (5.81%, Bacteroidetes), and Dorea (3.48%, Firmicutes). For sample B3, 128 genera were assigned from 47,717 reads, with the largest proportion genera of Streptococcus (34.25%), Sphingobacterium (7.14%), Prevotella (7.08%, Bacteroidetes), Enterococcus (6.79%), and Acetanaerobacterium (4.50%). For sample B4, 143 genera were found within 81,138 reads, and the dominant genera were Haemophilus (19.03%), Sphingobacterium (10.65%), Enterococcus (7.94%), Acetanaerobacterium (5.31%), and Bacillus c (4.32%, Firmicutes). Overall, the dominant microbial phyla of the three thin-white tongue coating samples were Firmicutes, Proteobacteria, and Bacteroidetes, while the white-greasy samples B2 and B3 had the second abundant phylum of Bacteroidetes, and the sample B4 had the most abundant phylum of Proteobacteria.

Figure 3.

Figure 3

Relative abundances of taxonomy classification of the tongue coating microbiome at the phylum level in the six tongue coating samples.

Figure 4.

Figure 4

Dominant genera (top 15) assigned in the six samples.

In order to maximize the high-throughput usable data for deep analysis, both the uniquely and nonuniquely matched reads were classified at phylum level. The percent of assigned reads was ranging from 58.8% to 65.6%. Compared with the results based on unique reads, the proportion of phyla Firmicutes and Actinobacteria had dramatically increased (data not shown).

3.4. Shared Genera of the Same Tongue Coating Type Samples

We compared the shared genera within the same tongue coating type samples. There were 117 genera shared between the thin-white tongue coating type samples. Of these genera, 58 were from Firmicutes, 37 genera were from Proteobacteria, and 12 genera belonged to Bacteroidetes. For the white-greasy tongue coating type samples, 96 genera were observed to be shared. 48 genera belonged to Firmicutes, 28 genera belonged to Proteobacteria, and 11 genera belonged to Bacteroidetes, respectively.

3.5. Comparison of Bacterial Diversity between Different Tongue Coating Type Samples

There were 77 genera shared across the six samples (Figure 5). 35 genera belonged to Firmicutes, 23 genera belonged to Proteobacteria, and 8 genera belonged to Bacteroidetes. We compared the genera observed in one tongue coating type but not the other. Figure 6 showed the genera that appeared in all the thin-white tongue coating type but existed in none white-greasy tongue coating type sample. A total of 12 genera were observed only in the thin-white tongue coating type samples: Pelomonas, Haemophilus, Thioalkalispira, and Zoogloea (Proteobacteria); Jeotgalibacillus, Granulicatella, Lachnobacterium, Peptostreptococcus, Anaeromusa, Parasporobacterium, and Sporobacterium (Firmicutes); and Prevotella (Bacteroidetes). Figure 7 showed the genera that appeared in all the white-greasy tongue coating type but did not exist in all the thin-white tongue coating type sample. The white-greasy tongue coating type samples were specifically unique for 6 genera: Anaerostipes, Lactobacillus, Anaerobacter, Ruminococcaceae Incertae sedis, and Oribacterium (Firmicutes) and Acidithiobacillus (Proteobacteria). Analyzed by Student's t-test, 19 genera performed significantly different between two tongue coating types (Table 5).

Figure 5.

Figure 5

Relative abundance of assigned taxonomy at genus level shared in the six tongue coating samples.

Figure 6.

Figure 6

Relative abundance of assigned taxonomy at genus level shared only in the thin-white tongue coating samples.

Figure 7.

Figure 7

Relative abundance of assigned taxonomy at genus level shared only in the white-greasy tongue coating samples.

Table 5.

Relative abundance comparison of significantly different genera between thin-white and white-greasy tongue coating types.

Phylum Genus A4 A5 A6 B2 B3 B4 P
Bacteroidetes Proteiniphilum 85 71 60 0 0 12 0.003
Bacteroidetes Salinibacter 228 274 328 68 25 13 0.004
Proteobacteria Thiobacter 166 199 226 0 0 10 0.006
Firmicutes Cryptanaerobacter 219 320 257 42 16 0 0.007
Firmicutes Peptococcus 439 295 338 26 16 0 0.013
Firmicutes Veillonella 2047 2501 1434 383 861 111 0.019
Bacteroidetes Bacteroides 219 313 188 111 16 26 0.019
Firmicutes Anaerostipes 0 0 0 100 168 158 0.021
Firmicutes Sporobacterium 9 15 11 0 0 0 0.022
Firmicutes Fastidiosipila 11 15 6 31 23 21 0.026
Bacteroidetes Prevotella 21 13 15 0 0 0 0.027
Firmicutes Parasporobacterium 20 15 11 0 0 0 0.028
Proteobacteria Smithella 23 19 22 0 11 0 0.028
Firmicutes Anaerobacter 0 0 0 31 39 20 0.033
Proteobacteria Thioalkalispira 47 31 24 0 0 0 0.038
Firmicutes Trichococcus 127 171 81 4652 5795 9135 0.041
Firmicutes Faecalibacterium 1504 2974 1936 95 47 232 0.042
Firmicutes Lachnobacterium 106 79 47 0 0 0 0.046
Bacteroidetes Subsaxibacter 0 17 0 26 54 65 0.047

Analyzed by UniFrac software, the thin-white tongue coating type and white-greasy tongue coating type were observably different at the genus level as shown in principal component analysis (PCA) plot (Figure 8). The three thin-white tongue coating subjects had relatively similar microbial diversity (Bonferroni-corrected P value is 0.25), while three white-greasy tongue coating subjects behaved clear deviation (Bonferroni-corrected P value, all >0.5).

Figure 8.

Figure 8

Principal component analysis of microbial diversities of thin-white tongue coating patients (blue squares) and white-greasy tongue coating patients (red circles).

3.6. Metagenome Prediction

PICRUSt was used to predict a microbial community metagenome based on 16S gene data. PICRUSt results were then analyzed using LEfSe to identify microbial functions that were significantly different in their relative abundance among groups. The top of differentially abundant bacterial functions were “oxidative phosphorylation,” “ribosome,” “amino sugar and nucleotide sugar metabolism,” and “secretion system.”

4. Discussion

In this study, we explored a method to detect the microbial diversity using 16S rRNA gene by high-throughput SOLiD sequencing system. The ligation-based system features 2-base encoding, which is a proprietary mechanism that interrogates each base twice. The 2-base encoding algorithm filtered raw errors after sequencing, providing built-in error correction. The output of sequencing reads is in the format of color space, which means that the output reads must align to color-space reference. Although the short length is the main drawback of SOLiD platform compared to other sequencing technologies, our method could extend the effective length of sequencing read to more than 100 bp. First, the sequencing length was extended to 51 bp, by adding 16 nucleotides digested by Eco57I. Second, concerning that the two strands of library DNA could be sequenced in 5′ to 3′ direction, the usable sequencing length was equivalent to 102 bp, which was close to the length of one hypervariable region of 16S rRNA gene. Meanwhile, the validation of first four sequenced nucleotides, which were conserved from the designed primers, could enhance the sequencing accuracy in the validation step.

To evaluate the effect of this methodology, six tongue coating samples from different TCM tongue coating types were investigated. In TCM theories, the tongue coating reflects the status of physiological and clinicopathological changes of inner parts of body. As a common type, thin-white tongue coating is a symbol of good health. A white-greasy tongue coating like powder indicates turbidity and external pathogenic heat. The abundant bacterial groups found in our study are similar to those found in most other studies. Several studies completing microbial analysis of the healthy human tongue using 16S rRNA sequencing showed the most abundant phyla to be Fusobacteria, Actinobacteria, Firmicutes, Proteobacteria, and Bacteroidetes. Jiang et al. investigated 27 tongue coating samples by Illumina technology and identified 715 differentially abundant, species-level OTUs on tongue coatings of the enrolled patients compared to healthy controls. Furthermore, 123 and 258 species-level OTUs were identified in patients with Cold/Hot Syndrome. In Jiang et al.'s report, the dominant phyla in Chinese tongue coating microflora samples were Firmicutes, Bacteroidetes, Proteobacteria, Actinobacteria, and Fusobacteria. The dominant phyla in our six tongue coating samples were Firmicutes, Proteobacteria, Bacteroidetes, Actinobacteria, and Chlorobi. The similar results represented the effectiveness of our methodology compared with other sequencing systems. Han et al. sequenced the V2–V4 region of 16S rRNA gene by pyrosequencing to investigate the tongue coating microbiome in patients with colorectal cancer and healthy controls. Prevotella, Haemophilus, and Streptococcus were dominant in Han et al.'s samples, which is the same result in our study. Based on these conclusions, the method combining ligation-based sequencing and Eco57I digestion exhibited equivalent effect compared with Illumina or pyrosequencing technologies.

Using the 16S rRNA gene, the core function of tongue coating microbiome could be predicted by the PICRUSt software. Pathways encoding for carbohydrate metabolism and oxidative phosphorylation metabolism were detected. The oral cavity is a major gateway to the human body. Microorganisms colonizing in the oral cavity have a significant probability of spreading to the stomach, lung, and intestinal tract. The metagenome prediction validated the role of tongue coating.

In our results, except for some common genera in human body, a few environmental bacteria were observed as well. One of the explanations is that the sequencing results are not precise. The other hypothesis is that these genera are still unknown bacteria, which have similar sequences with the environmental bacteria from database. The more bacteria are sequenced, the more affirmatory genera could be defined.

The composition of the microbial communities on the tongue coating varies between individuals. UniFrac principal coordinates analysis showed no apparent clustering of microbial communities between two types of tongue coating. This may be indicative of the fact that, despite the many different habitats on the human tongue, many bacterial species are shared among those habitats.

Acknowledgments

The Hi-Tech Research and Development Program of China (Project no. 2006AA020702) and the National Natural Science Foundation of China (Project no. 30600152) supported this work.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

References

  • 1.Andersson A. F., Lindberg M., Jakobsson H., Bäckhed F., Nyrén P., Engstrand L. Comparative analysis of human gut microbiota by barcoded pyrosequencing. PLoS ONE. 2008;3(7, article e2836) doi: 10.1371/journal.pone.0002836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bik E. M., Eckburg P. B., Gill S. R., et al. Molecular analysis of the bacterial microbiota in the human stomach. Proceedings of the National Academy of Sciences of the United States of America. 2006;103(3):732–737. doi: 10.1073/pnas.0506655103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Deng W., Xi D., Mao H., Wanapat M. The use of molecular techniques based on ribosomal RNA and DNA for rumen microbial ecosystem studies: a review. Molecular Biology Reports. 2008;35(2):265–274. doi: 10.1007/s11033-007-9079-1. [DOI] [PubMed] [Google Scholar]
  • 4.Segata N., Kinder Haake S., Mannon P., et al. Composition of the adult digestive tract bacterial microbiome based on seven mouth surfaces, tonsils, throat and stool samples. Genome Biology. 2012;13(6):1–18. doi: 10.1186/gb-2012-13-6-r42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Tschöp M. H., Hugenholtz P., Karp C. L. Getting to the core of the gut microbiome. Nature Biotechnology. 2009;27(4):344–346. doi: 10.1038/nbt0409-344. [DOI] [PubMed] [Google Scholar]
  • 6.Turnbaugh P. J., Hamady M., Yatsunenko T., et al. A core gut microbiome in obese and lean twins. Nature. 2009;457(7228):480–484. doi: 10.1038/nature07540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Consortium T. H. M. P. Structure, function and diversity of the healthy human microbiome. Nature. 2012;486(7402):207–214. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Turnbaugh P. J., Ley R. E., Hamady M., Fraserliggett C. M., Knight R., Gordon J. I. The human microbiome project. Indian Journal of Microbiology. 2012;52(3):804–810. [Google Scholar]
  • 9.Aas J. A., Paster B. J., Stokes L. N., Olsen I., Dewhirst F. E. Defining the normal bacterial flora of the oral cavity. Journal of Clinical Microbiology. 2005;43(11):5721–5732. doi: 10.1128/JCM.43.11.5721-5732.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bik E. M., Long C. D., Armitage G. C., et al. Bacterial diversity in the oral cavity of 10 healthy individuals. ISME Journal. 2010;4(8):962–974. doi: 10.1038/ismej.2010.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bordas A., McNab R., Staples A. M., Bowman J., Kanapka J., Bosma M. P. Impact of different tongue cleaning methods on the bacterial load of the tongue dorsum. Archives of Oral Biology. 2008;53:S13–S18. doi: 10.1016/S0003-9969(08)70004-9. [DOI] [PubMed] [Google Scholar]
  • 12.Crielaard W., Zaura E., Schuller A. A., Huse S. M., Montijn R. C., Keijser B. J. F. Exploring the oral microbiota of children at various developmental stages of their dentition in the relation to their oral health. BMC Medical Genomics. 2011;4, article 22 doi: 10.1186/1755-8794-4-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Lazarevic V., Whiteson K., Hernandez D., Francois P., Schrenzel J. Study of inter- and intra-individual variations in the salivary microbiota. BMC Genomics. 2010;11(523) doi: 10.1186/1471-2164-11-523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lazarevic V., Whiteson K., Huse S., et al. Metagenomic study of the oral microbiota by Illumina high-throughput sequencing. Journal of Microbiological Methods. 2009;79(3):266–271. doi: 10.1016/j.mimet.2009.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Pushalkar S., Mane S. P., Ji X., et al. Microbial diversity in saliva of oral squamous cell carcinoma. FEMS Immunology and Medical Microbiology. 2011;61(3):269–277. doi: 10.1111/j.1574-695X.2010.00773.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Watanabe R., Hojo K., Nagaoka S., Kimura K., Ohshima T., Maeda N. Antibacterial activity of sodium citrate against oral bacteria isolated from human tongue dorsum. Journal of Oral Biosciences. 2011;53(1):87–92. doi: 10.2330/joralbiosci.53.87. [DOI] [Google Scholar]
  • 17.Zaura E., Keijser B. J., Huse S. M., Crielaard W. Defining the healthy core microbiome of oral microbial communities. BMC Microbiology. 2009;9(259) doi: 10.1186/1471-2180-9-259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Abreu M. T., Peek Jr. R. M. Gastrointestinal malignancy and the microbiome. Gastroenterology. 2014;146(6):1534–1546. doi: 10.1053/j.gastro.2014.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Dewhirst F. E., Chen T., Izard J., et al. The human oral microbiome. Journal of Bacteriology. 2010;192(19):5002–5017. doi: 10.1128/JB.00542-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Jakobsson H. E., Jernberg C., Andersson A. F., Sjolund-Karlsson M., Jansson J. K., Engstrand L. Short-term antibiotic treatment has differing long-term impacts on the human throat and gut microbiome. PLoS One. 2010;5(3, article e9836) doi: 10.1371/journal.pone.0009836. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kazor C. E., Mitchell P. M., Lee A. M., et al. Diversity of bacterial populations on the tongue dorsa of patients with halitosis and healthy patients. Journal of Clinical Microbiology. 2003;41(2):558–563. doi: 10.1128/JCM.41.2.558-563.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Li R., Ma T., Jin G., Liang X., Li S. Imbalanced network biomarkers for traditional Chinese medicine Syndrome in gastritis patients. Scientific Reports. 2013;3(3):1543–1543. doi: 10.1038/srep01543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Marsland B. J., Gollwitzer E. S. Host-microorganism interactions in lung diseases. Nature Reviews Immunology. 2014;14(12):827–835. doi: 10.1038/nri3769. [DOI] [PubMed] [Google Scholar]
  • 24.Xu P., Gunsolley J. Application of metagenomics in understanding oral health and disease. Virulence. 2014;5(3):424–432. doi: 10.4161/viru.28532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Tang J. L., Liu B. Y., Ma K. W. Traditional Chinese medicine. Lancet. 2008;372(9654):1938–1940. doi: 10.1016/S0140-6736(08)61354-9. [DOI] [PubMed] [Google Scholar]
  • 26.Han S., Chen Y., Hu J., Ji Z. Tongue images and tongue coating microbiome in patients with colorectal cancer. Microbial Pathogenesis. 2014;77:1–6. doi: 10.1016/j.micpath.2014.10.003. [DOI] [PubMed] [Google Scholar]
  • 27.Jiang B., Liang X., Chen Y., et al. Integrating next-generation sequencing and traditional tongue diagnosis to determine tongue coating microbiome. Scientific Reports. 2012;2, article 936 doi: 10.1038/srep00936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sun Z., Zhao J., Qian P., et al. Metabolic markers and microecological characteristics of tongue coating in patients with chronic gastritis. BMC Complementary and Alternative Medicine. 2013;13, article 227 doi: 10.1186/1472-6882-13-227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Zhao Y., Gou X., Dai J., et al. Differences in metabolites of different tongue coatings in patients with chronic hepatitis B. Evidence-Based Complementary and Alternative Medicine. 2013;2013:12. doi: 10.1155/2013/204908.204908 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bentley D. R., Balasubramanian S., Swerdlow H. P., et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 456;7218:53–59. doi: 10.1038/nature07517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Harris T. D., Buzby P. R., Babcock H., et al. Single-molecule DNA sequencing of a viral genome. Science. 2008;320(5872):106–109. doi: 10.1126/science.1150427. [DOI] [PubMed] [Google Scholar]
  • 32.Hudson M. E. Sequencing breakthroughs for genomic ecology and evolutionary biology. Molecular Ecology Resources. 2008;8(1):3–17. doi: 10.1111/j.1471-8286.2007.02019.x. [DOI] [PubMed] [Google Scholar]
  • 33.Margulies M., Egholm M., Altman W. E., Attiya S., Bader J. S., Bemben L. A. Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005;437(7057):376–380. doi: 10.1038/nature03959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Valouev A., Ichikawa J., Tonthat T., et al. A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. Genome Research. 2008;18(7):1051–1063. doi: 10.1101/gr.076463.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Claesson M. J., Cusack S., O'Sullivan O., et al. Composition, variability, and temporal stability of the intestinal microbiota of the elderly. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(1):4586–4591. doi: 10.1073/pnas.1000097107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ding T., Schloss P. D. Dynamics and associations of microbial community types across the human body. Nature. 2014;509(7500):357–360. doi: 10.1038/nature13178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Huber J. A., Mark Welch D. B., Morrison H. G., et al. Microbial population structures in the deep marine biosphere. Science. 2007;318(5847):97–100. doi: 10.1126/science.1146689. [DOI] [PubMed] [Google Scholar]
  • 38.Huttenhower C., Gevers D., Knight R., et al. Structure, function and diversity of the healthy human microbiome. Nature. 2011;486(7402):207–214. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Qin J. J., Li Y. R., Cai Z. M., et al. A metagenome-wide association study of gut microbiota in type 2 diabetes. Nature. 2012;490(7418):55–60. doi: 10.1038/nature11450. [DOI] [PubMed] [Google Scholar]
  • 40.Ye J., Cai X., Cao P. Problems and prospects of current studies on the microecology of tongue coating. Chinese Medicine. 2014;9(1, article 9) doi: 10.1186/1749-8546-9-10. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from BioMed Research International are provided here courtesy of Wiley

RESOURCES