Abstract
Background
Diagnosis of the cause of cerebral thrombi is vital for recurrence prevention but also challenging. The presence of the microbiome has recently been confirmed in thrombus, suggesting a novel approach to distinguish cerebral thrombi of different origins. However, little is known about whether there is heterogeneity in microbiological colonization of cerebral thrombi of different sources.
Methods and Results
Forty patients experiencing acute ischemic stroke were included and clinical data were collected. Metagenomic next‐generation sequencing was adopted to detect bacterial and genomic signatures of human cerebral thrombi samples. We found similar species diversity between the large‐artery atherosclerosis thrombi and cardioembolic thrombi but different species composition and distribution of cerebral thrombus microbiota. Compared with the group with cardioembolism, the group with large‐artery atherosclerosis showed a significantly higher relative abundance of Ralstonia insidiosa among the top 10 bacterial species in cerebral thrombi. Twenty operational taxonomy units were correlated with 11 clinical indicators of ischemic stroke. The Gene Ontology enrichment analysis revealed 9 different enriched biological processes (translation and carbohydrate metabolic process, etc). The enriched Kyoto Encyclopedia of Genes and Genomes pathways included ribosome, butanoate metabolism, and sulfur metabolism.
Conclusions
This study, based on the approach of metagenomic next‐generation sequencing, provides a diagnostic microbiological method to discriminate individuals with cardioembolic thrombi from those with large‐artery atherosclerosis thrombi with human cerebral thrombi samples. Our findings provide a fresh perspective on microbial heterogeneity of cerebral thrombi and demonstrate biological processes and pathway features of cerebral thrombi.
Keywords: cerebral thrombus, metagenomic next‐generation sequencing, microbiome, thrombectomy
Subject Categories: Ischemic Stroke, Embolism, Thrombosis
Nonstandard Abbreviations and Acronyms
- AIS
acute ischemic stroke
- KEGG
Kyoto Encyclopedia of Genes and Genomes
- LAA
large‐artery atherosclerosis
- mNGS
metagenomic next‐generation sequencing
- mRS
modified Rankin Scale
- NIHSS
National Institutes of Health Stroke Scale
Clinical Perspective.
What Is New?
Metagenomic next‐generation sequencing is used to identify the microbial distribution and genome information of human cerebral thrombus.
Microbial heterogeneity of cerebral thrombi is associated with different origins.
Microbial genes further reveal biological processes and pathway features of cerebral thrombi.
What Are the Clinical Implications?
Our study provides a metagenomic next‐generation sequencing‐based diagnostic microbiology method to distinguish atherosclerotic thrombus and cardiogenic thrombus.
Our study also reveals the close relevance of the thrombus microbiome and clinical indicators of patients with stroke, indicating the potential effects of the thrombus microbiota on host physiological function.
These findings facilitate a new direction to develop metagenomic next‐generation sequencing‐based treatment strategies for thrombolysis and thrombosis prevention.
Stroke is the second leading cause of death and the third leading cause of disability in the world. 1 Acute ischemic stroke (AIS) is the most common subtype of stroke. Thrombi‐based cerebral blood vessel occlusion can directly lead to ischemic stroke onset. The pathogenetic diagnosis of AIS subtypes is crucial for determining secondary prevention therapy against stroke recurrence, which occurs in approximately 25% of individuals with stroke within 5 years after the initial occurrence. 2 Currently, the pathogenetic diagnosis of stroke is a comprehensive diagnosis dependent on atrial fibrillation rates, risk factors, postoperative angiography, and thrombus cellular component identification. 3 Several recent studies have provided novel evidence to help distinguish cerebral thrombi origin, such as molecular characteristics of thrombus, 4 thrombus‐related imaging sign, 5 and morphological signs of intravital clot contraction. 6 We have previously found that thrombotic component analysis has important implications for stroke therapeutic effect, secondary prevention, and the search for new targets for antithrombotic therapy. 7 However, our understanding of thrombus heterogeneity is far from sufficient.
The microbiota–gut–brain axis is well known to play a vital role in the reciprocal interactions between the gut and the brain during acute cerebral ischemia, mainly mediated by microbiota‐related metabolites, bacterial components (eg, lipopolysaccharide) and the neuroimmune system. 8 In addition, gut microbe‐derived metabolites, such as trimethylamine‐N‐oxide and short‐chain fatty acids, can promote thrombosis potential 9 and affect atherosclerotic plaque stability. 10 Moreover, thrombus‐related microbiota is highly reported in coronary thrombus, 11 where high levels of microbiota and microbe‐derived metabolites are associated with the survival rate of patients with myocardial infarction. 12 Recently, studies have proven microbiota colonization in cerebral thrombus, 13 , 14 which mainly originated from the plasma 15 of patients with AIS. Thrombi from cardioembolic and large‐artery atherosclerotic strokes had distinct bacterial concentrations, dominant species, and distribution patterns. 15 Nevertheless, alternative approaches for analyzing complex microbial communities, such as metagenomic next‐generation sequencing (mNGS), may provide more robust information on detailed microbial identification, gene diversity, and functional profiles. 16
In the present study, we aimed to analyze mNGS data of 40 thrombi obtained from patients with AIS undergoing endovascular thrombectomy and discriminate cardioembolic and atherothrombotic thrombi origins according to their microbiota population and gene diversity. We sought to establish a more accurate pathogenetic diagnosis to provide more information for secondary stroke prevention therapy and to correlate microbial features with clinical characteristics. Based on mNGS data, Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Ontology analyses were conducted to offer additional details about the potential impacts of microbial gene enrichment, which may help to provide more insight into the thrombotic microenvironment and host function.
METHODS
The data related to this study are available from the corresponding author upon reasonable request. The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Study Patients and Sample Preparation
In the present study, we continuously investigated thrombi (n=40) from patients with AIS who underwent endovascular thrombectomy in the First Affiliated Hospital of Army Medical University between November 2020 and February 2022. Our methods and experimental protocols were conducted according to the recommendations of guidelines from the Institutional Review Board of the First Affiliated Hospital of Army Medical University and were granted by the Ethics Committee of the First Affiliated Hospital of Army Medical University [(A)KY2021023]. All patients or their legal representatives signed informed consent for medical research of their images and specimens.
Baseline clinical data and laboratory examination indices were immediately recorded when the patient with AIS was admitted to the hospital. The severity of patients with AIS was evaluated with the National Institutes of Health Stroke Scale (NIHSS) score 17 on the day of admission, at 24 hours after endovascular thrombectomy and at discharge. The ischemic stroke subtypes were assessed using the TOAST (Trial of Org 10172 in Acute Stroke Treatment) criteria. 18 The functional outcomes were assessed with a 90‐day modified Rankin Scale (mRS) score, 19 and independent functional outcome was defined as mRS score 0 to 2. Vascular recanalization was evaluated by the modified cerebral infarction thrombolytic classification scoring standard. 20 (Grades 0–2a were defined as nonrecanalized blood vessels, and grades 2b–3 were defined as successful blood vessel recanalization.)
Thrombi from 40 patients with AIS who met the following criteria were consecutively included: (1) patients over 18 years old; (2) clinical indication for mechanical recanalization of a proximal intracranial occlusion of the anterior or posterior circulation and at least 1 thrombus fragment was collected during the procedure.
The exclusion criteria were as follows: (1) currently suffering from infection, (2) autoimmune disease, (3) severe liver function and renal function impairment, and (4) active or recurrent cancer.
Thrombi were retrieved using a stent retriever or aspiration device according to the judgment of the 2 treating neurological intervention physicians. If the thrombus material collected in a patient by any pass was pooled, it was further considered 1 thrombus. Of note, because thrombi often adhere to the device, it must be rinsed with sterile saline and gently manipulated for transfer to a sterile collection container. The collected thrombus was placed into a preprepared sterile collection container with PBS. The thrombus was then frozen in liquid nitrogen and stored at −80 °C until mNGS analysis.
Nucleic Acid Extraction, Library Preparation, and Sequencing
All thrombus samples were collected by following the standards of aseptic processing procedures. The thrombus tissue specimen was cut into small pieces and then wall broken. The specific wall breaking method was as follows: (1) 1 g of 0.5 mm diameter glass beads was added to the broken wall tube, (2) 0.6 mL of treated specimen was added, (3) the tube was shaken at high speed at 2800 to 3200 r/min for 30 minutes, (4) nucleic acid extraction was performed in 300 L according to the TIANamp Micro DNA Kit (DP316, China Tiangen Biochemical Technology Co., Ltd.), and (5) after extraction, DNA was sonicated to fragments of aproximately 200 to 300 bp.
The DNA library libraries were constructed by fragmenting DNA, end repair, adapter ligation, and amplifying the DNA with polymerase chain reaction. On an Illumina NextSeq 550 system (Illumina, San Diego, CA), pooled libraries were sequenced using a 75‐bp, single‐end sequencing kit. The qualified results had a Q30 score of at least 90% and at least 15 million readings per sample. Negative controls were conducted and environmental contaminants were excluded via a background pathogen detection method of bioinformatics analysis.
Bioinformatics Analyses
After removing low‐quality and short read sequences (length<35 bp), 75 bp single‐end reads from Illumine Nextseq 550 were analyzed with IDseq software developed in house to determine the abundance of each microorganism. The high‐quality read data were aligned to the human reference genome sequence, the sequence on the alignment was removed, and the part was determined to be a human host sequence 21 The remaining read sequences were aligned using a pathogen‐specific microbial database containing 6350 bacteria, 1064 fungi, 4945 viruses, and 234 parasites. The number of reads that could match the various pathogen‐specific sequences was obtained by the alignment algorithm, and the possible pathogens were judged according to the number of sequences and other clinical tests.
Bacterial Diversity and Taxonomic Analysis
Bacterial diversity was evaluated using a sampling‐based operational taxonomic unit analysis, with the Shannon index, Chao index, and Abundance‐Based Coverage Estimator index serving as indicators. Calculations were facilitated by the R program package “vegan.” Additionally, principal component analysis and principal coordinate analysis were employed through the R package (http://www.R‐project.org/) to visualize the microbiome space among the group samples. A heatmap, generated using the heatmap builder, was used to identify key variables.
Taxonomic analyses of bacteria and comparisons between the 2 groups were conducted using the Wilcoxon rank‐sum test, encompassing both bacterial phylum and genus. Furthermore, the linear discriminant analysis effect size method (http://huttenhower.sph.harvard.edu/lefse/) was applied to analyze the microbial characterization of thrombi between the groups with large‐artery atherosclerosis (LAA) and with cardioembolism, based on the normalized relative abundance matrix. This approach initially identified features with significant differential abundance using the Kruskal–Wallis rank‐sum test (P<0.05) and then assessed the effect size of each feature through linear discriminant analysis (LDA), with an LDA score (log10) threshold of 2.
Random Forest Model Prediction
Taking into account the robustness of the algorithm, we used the random forest function from the R package “RandomForest” to construct a prediction model aimed at pinpointing potential diagnostic biomarkers. The study gathered abundance profiles, and all of this sample set was used for model training. Core genera specific to the groups with LAA or cardioembolism were selected as predictive input variables. To identify the genera that significantly contributed to the predictions, we employed a nested 10‐fold cross‐validation procedure. Briefly, the top 30 species in the ranking of importance were selected according to the feature importance evaluated by the random forest model. Then, the feature selection algorithm is used to select the 30 species on the basis of the feature selection to obtain the final species set for model construction. Furthermore, the area under the curve index and receiver operating characteristic analysis were conducted to assess the efficacy of potential cutoff values for the tests. A 10‐fold cross‐validation was used to evaluate the area under the curve, sensitivity and specificity of the model.
Statistical Analysis
IBM SPSS Statistics 22 (SPSS, Chicago, IL) was used to analyze differences between the 2 groups' clinical indices. A Mann–Whitney U test was used to determine whether there was statistical significance between the 2 groups. The statistical significance was set at P < 0.05. Analysis of all sequenced data was conducted using R (R Foundation for Statistical Computing, Vienna, Austria). The Kruskal–Wallis test was used for α ‐diversity between groups and for species abundance differences between groups, and the Spearman correlation test was used for species abundance. The nonparametric factor Kruskal–Wallis rank‐sum test was used to select metagenomic biomarker species (P<0.05) for LDA effect size analysis. The influence of differential species (LDA score) on biomarker species was then evaluated by LDA. It was considered statistically significant to identify species with LDA>2.0.
RESULTS
Characteristics of the Study Participants
The baseline characteristics of the patients are shown in the Table. In total, 40 patients were included in this study, all of whom underwent endovascular thrombectomy; 24 were men, and the mean age of all patients was 70.55±11.01 years. Overall, 15 patients (37.5%) were classified into the group with LAA, and 25 (62.5%) were classified into the group with cardioembolism based on the TOAST classification. In the group with cardioembolism, 15 patients (60%) had atrial fibrillation and 11 patients (44%) had >2 stent passages, which were significantly higher than those in the group with LAA. The rates of having a smoking history and using rescue therapy in the group with LAA were significantly higher than those in the group with cardioembolism. Besides laboratory tests there were no significant differences in white blood cells, neutrophils, monocytes, lymphocytes, platelets, fibrinogen, prothrombin time, and activated partial thromboplastin time between the 2 groups.
Table .
Baseline Characteristics
| Variables | Entire population (n=40) | P value | |
|---|---|---|---|
| Group with LAA (n=15) | Group with cardioembolism (n=25) | ||
| Demographics | |||
| Male, n (%) | 10 (66.7) | 14 (56.0) | 0.505 |
| Age, y, median (IQR) | 72.0 (68.3–78.8) | 74.0 (58.0–80.0) | 0.720 |
| Vascular risk factors | |||
| Hypertension, n (%) | 10 (66.7) | 13 (52.0) | 0.364 |
| Coronary artery disease, n (%) | 2 (13.3) | 5 (20.0) | 0.591 |
| Atrial fibrillation, n (%) | 1 (6.7) | 15 (60.0) | 0.001* |
| Hyperlipidemia, n (%) | 2 (13.3) | 3 (12.0) | 0.902 |
| Diabetes, n (%) | 2 (13.3) | 3 (12.0) | 0.902 |
| Smoking, n (%) | 9 (60.0) | 7 (28.0) | 0.046* |
| Drinking, n (%) | 5 (33.3) | 6 (24.0) | 0.522 |
| Clinical characteristics | |||
| Admission National Institutes of Health Stroke Scale score, median (IQR) | 18 (14–24) | 17 (11–18) | 0.088 |
| Admission modified Rankin Scale score, median (IQR) | 4 (4–5) | 4 (3–4) | 0.080 |
| Stroke onset to thrombectomy, min, median (IQR) | 310 (212–600) | 285 (220–453) | 0.439 |
| Thrombolysis, n (%) | 4 (26.7) | 3 (12.0) | 0.452 |
| Vascular occlusion site | |||
| Internal carotid artery, n (%) | 6 (40.0) | 11 (44.0) | 0.804 |
| Middle cerebral artery, n (%) | 8 (53.3) | 12 (48.0) | 0.744 |
| Basilar artery, n (%) | 1 (6.7) | 2 (8.0) | 0.877 |
| Procedure details | |||
| Number of stent passage >2, n (%) | 2 (13.3) | 11 (44.0) | 0.045* |
| Rescue therapy, n (%) | 4 (26.7) | 1 (4.0) | 0.036* |
| Modified thrombolysis in cerebral infarction 2b/3, n (%) | 13 (86.7) | 23 (92.0) | 0.586 |
| Laboratory tests | |||
| White blood cells, ×109/L, median (IQR) | 9.2 (7.6–11.7) | 7.5 (6.2.9.2) | 0.112 |
| Neutrophils, ×109/L, median (IQR) | 7.0 (6.0–9.1) | 5.9 (4.3–7.0) | 0.074 |
| Monocytes, ×109/L, median (IQR) | 0.5 (0.3–0.6) | 0.5 (0.4–0.7) | 0.524 |
| Lymphocytes, ×109/L, median (IQR) | 1.0 (0.9–1.7) | 1.1 (0.9–1.7) | 0.720 |
| Platelets, ×109/L, median (IQR) | 152.5 (138.3–215.0) | 171.0 (151.0–229.0) | 0.699 |
| Fibrinogen, g/L, median (IQR) | 2.7 (2.2–3.6) | 2.5 (2.1–3.5) | 0.761 |
| Prothrombin time, s, median (IQR) | 11.4 (10.8–12.3) | 12.1 (11.0–12.9) | 0.192 |
| Activated partial thromboplastin time, s, median (IQR) | 25.8 (25.3–28.6) | 27.3 (25.6–28.7) | 0.361 |
IQR indicates interquartile range; and LAA, large‐artery atherosclerosis.
P<0.05.
Thrombus Microbiota Profile and Thrombus Microbial Diversity
In this study, the microbial composition of thrombi was identified via mNGS. The hierarchical clustering results of all 40 samples comparing data in phylum, genus, and species classification are shown in Figure 1, including the top 10 compositions and relative abundance of the bacterial community in both groups.
Figure 1. Species composition and distribution of cerebral thrombi from different sources at the phylum, genus, and species levels. Each cerebral thrombus sample (total=40), including the top 10 bacterial species compositions at the (A) phylum, genus (C), and species (E) levels in abundance is shown in the heat color map. The top 10 relative abundance species compositions at phylum (B), genus (D), and species (F) levels for all samples (group with LAA vs group with cardioembolism) are displayed in a bar chart. **P<0.01.

LAA indicates large‐artery atherosclerosis.
The top 10 most abundant phyla in the thrombus were Proteobacteria, Actinobacteria, Firmicutes, Ascomycota, Bacteroidetes, Basidiomycota, Deinococcus, Thermus, Discosea, Cyanobacteria, and Fusobacteria (Figure 1A). The top 10 most abundant genera of bacteria in the thrombus were Acinetobacter, Ralstonia, Corynebacterium, Staphylococcus, Lactobacillus, Pseudomonas, Mycobacterium, Sphingomonas, Aquincola, and Bacteroides (Figure 1C). The top 10 most abundant species of bacteria in the thrombus were Acinetobacter_parvus, Ralstonia_insidiosa, Lactobacillus_iners, Aquincola_tertiaricarbonis, Ralstonia_pickettii, Mycobacterium_iranicum, Bradyrhizobium_oligotrophicum, Bacteroides_graminisolvens, Sphingomonas_echinoides, Candida_parapsilosis (Figure 1E). Based on further analysis of the top 10 microbiota composition of thrombus at the phylum, genus, and species levels, the dominant taxa in phylum were not different between the group with LAA and the group with cardioembolism (P>0.05, Figure 1B). However, the abundance of Ralstonia at the top 10 microbiota compositions of thrombi at genus level was significantly higher in the group with LAA group in the group with cardioembolism (P=0.00999, Figure 1D). Further analysis showed that the abundance of Ralstonia_insidiosa in the top 10 microbiota compositions of thrombi at the species level was significantly higher in the group with LAA than in the group with cardioembolism (P=0.00791, Figure 1F). More data on differential abundance at the phylum, class, order, family, genus, and species levels between the 2 groups are presented in Figure S1.
We further assessed the thrombus‐related changes in bacterial alpha diversity. However, there were no significant differences between the group with LAA and the group with cardioembolism using 5 methods to estimate the thrombus‐related change in alpha diversity (P>0.05, Figure 2A through 2E). Moreover, in the beta‐diversity analysis, the principal coordinate analysis and the principal component analysis based on genus profiles showed no separation between the group with LAA and the group with cardioembolism (P>0.05, Figure 2F through 2G).
Figure 2. Species diversity of cerebral thrombi from different origins.

The calculated α‐diversity indices of cerebral thrombi: A, the richness index, B, the ACE index, C, the Chao1 index, D, the Shannon index, and E, the Simpson index. The calculated β‐diversity indices of cerebral thrombi: F, PCoA of the samples in a 2‐dimensional graph (PCoA1=13.18%, PCoA2=9.12), (G) PCA of the samples in a 2‐dimensional graph (PCA1=18.12%, PCA2=12.98%). ACE indicates Abundance‐Based Coverage Estimator; LAA, large‐artery atherosclerosis; PCA, principal component analysis; and PCoA, principal coordinate analysis.
Differences in the Microbial Composition of Thrombi Between the Groups With LAA and Cardioembolism
To identify the specific bacterial taxa and predominant bacteria distinguishing LAA and cardioembolic emboli, linear discriminant analysis effect size was used to assess the maximum difference in the microbial structures in patients with LAA stroke versus those in patients with cardioembolic stroke. The contributions of the thrombus microbial composition to the differences between the 2 groups were evaluated using the LDA score. As shown in Figure 3, 77 genera were significantly enriched, and 19 genera were significantly depleted in the group with LAA compared with the group with cardioembolism (all P<0.05).
Figure 3. Biomarker identification by LEfSe analysis.

A, LEfSe analysis (taxa with LDA score>2); B, cladogram of the main taxa of microbiota that were different between the group with LAA and the group with cardioembolism on the basis of LEfSe analysis. (Color code: Yellow indicates no significant difference in taxa; Green indicates significantly different taxa, with their highest relative abundance in the group with LAA; Red indicates significantly different taxa, with their highest relative abundance in the group with cardioembolism). LAA indicates large‐artery atherosclerosis; and LEfSe, Linear discriminant analysis effect size.
Distinguishing Potential of Cerebral Thrombus Origin Based on Multiple Gene Markers
In this study, we further constructed a random forest classifier model between 15 patients with LAA stroke and 25 patients with cardioembolic stroke to assess the potential of multiple gene markers as a noninvasive distinguishing tool for cerebral thrombus origin. A total of 9 operational taxonomic units were selected as the optimal marker set of cerebral thrombus origin in a 10‐fold cross‐validation with 5 repeats random forest model (Figure 4A). The list of the top 9 bacterial taxa at the species level is shown in Figure 4B. In addition, according to the results of the receiver operating characteristic analysis (Figure 4C), the distinguishing potential value of cerebral thrombus origin showed good discrimination, with an area under the receiver operating characteristic curve of 0.903 (95% CI, 0.808–0.998; P<0.001). The sensitivity and specificity were 100% and 85.7%, respectively.
Figure 4. Multiple gene markers for distinguishing cerebral thrombi origins.

A, Nine microbial OTUs were found to be the best set of markers by a random forest model; B, Chart of the 9 microbial OTUs variables importance ranking based on mean decrease accuracy; C, ROC curve analysis of the predictive value of the 9 microbial gene markers for distinguishing cerebral thrombi origins. AUC indicates area under the curve; FPR, false positive rate; OTU, operational taxonomic unit; ROC, receiver operating characteristics; and TPR, true positive rate.
Correlation Between the Thrombus Microbiome and Clinical Indicators of Patients With LAA or Cardioembolic Stroke
In this study, we also analyzed the correlations between the thrombus microbiome and clinical indicators of cerebral thrombus. Of note, 9 clinical indicators, including neutrophil counts, lymphocyte counts, lymphocyte to monocyte ratio, neutrophil to lymphocyte ratio, ejection fraction, NIHSS score (preadmission), mRS score (preadmission), NIHSS score (postoperation), and mRS score (postoperation), were closely related to the microbiome of cerebral thrombi (as shown in Figure 5). Neutrophil counts were positively correlated with Elizabethkingia meningoseptica. Lymphocyte counts were positively correlated with Schizophyllum commune but negatively associated with Massilia putida. Lymphocyte to monocyte ratio was positively correlated with Lactococcus raffinolactis. The neutrophil to lymphocyte ratio was positively correlated with Massilia putida. Ejection fraction % was negatively correlated with Pannonibacter phragmitetus. NIHSS score (preadmission) was positively correlated with Acinetobacter ursingii, Corynebacterium jeikeium, and Staphylococcus arlettae but negatively correlated with Aspergillus chevalieri and Serinicoccus chungangensis. mRS score (preadmission) was positively correlated with Acinetobacter ursingii, Alishewanella agri, Corynebacterium jeikeium, and Schizophyllum commune but negatively correlated with Aspergillus chevalieri. NIHSS score (postoperation) was positively correlated with Bacillus thermoamylovorans, Castellaniella caeni, Corynebacterium jeikeium, and Providencia stuartii but negatively correlated with Aspergillus chevalieri and Prevotella loescheii. mRS score (postoperation) was positively correlated with Bacillus thermoamylovorans, Castellaniella caeni, Corynebacterium glutamicum, Corynebacterium jeikeium, Providencia stuartii, and Staphylococcus arlettae but negatively correlated with Aspergillus chevalieri.
Figure 5. Correlation between different species and clinical data.

Heat map illustrating the partial Spearman's correlation coefficients among 20 OTUs and 11 clinical indicators of acute ischemic stroke (N=40). Red and blue denote positive and negative correlations, respectively. ρ, Spearman's correlation coefficient. *P<0.05. EF indicates ejection fraction; LMR, lymphocyte to monocyte ratio; mRS, modified Rankin Scale score; NIHSS, National Institutes of Health Stroke Scale; NLR, neutrophil to lymphocyte ratio; pre, preadmission; post, postoperation; and WBC, white blood cell.
Functional Analysis of Genes Associated With the Cerebral Thrombus Microbiota
Furthermore, we performed both Gene Ontology annotation and KEGG pathway enrichment analysis to obtain deeper insight into the biological functions of microbial genes associated with the thrombus microbiota.
Here, we identified a total of 608 microbial genes. The Gene Ontology enrichment analysis results are shown in Figure 6A. Differentially expressed genes associated with the thrombus microbiota of biological process were involved in translation, carbohydrate metabolic process, and phosphoenolpyruvate‐dependent sugar phosphotransferase system. The cellular component analysis revealed that differentially expressed genes were significantly enriched in the ribosome. For molecular function analysis, the top 6 markedly enriched terms were structural constituent of ribosome, RNA binding, GTP binding, GTPase activity, transferase activity transferring glycosyl groups, and protein‐N(PI)‐phosphohistidine‐sugar phosphotransferase activity. In addition, the enriched KEGG pathways presented in Figure 6B included ribosome, butanoate metabolism, and sulfur metabolism.
Figure 6. Functional analysis of cerebral thrombus using GO and KEGG databases.

A, Bubble plot of GO enrichment analysis of DEGs including BPs, CCs, and MFs between the group with LAA and the group with cardioembolism; B, Bubble plot of KEGG enrichment analysis of DEGs between the group with LAA and the group with cardioembolism. The x axis indicates the proportion of the number of genes that differ from the total gene count in a certain pathway. The size and color of the dots indicate the extent of gene enrichment and the significance of the data, respectively. BP indicates biological processes; CC, cellular components; DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; LAA, large‐artery atherosclerosis; and MF, molecular functions.
DISCUSSION
Our study presented the microbiome distribution and genome information of cerebral thrombi with the use of mNGS to identify stroke cause. The relative abundance of Ralstonia insidiosa may differ between the groups with LAA and cardioembolism. Microbial gene markers provided as a predictive model were identified to classify stroke cause diagnosis. Furthermore, this study demonstrated the close relevance of the thrombus microbiome and clinical indicators of patients with stroke, suggesting the potential effects of the thrombus microbiota on host physiological function. Based on Gene Ontology analysis, microbial genes enriched in thrombi participated in multiple biological processes (translation, carbohydrate metabolic process, phosphoenolpyruvate‐dependent sugar phosphotransferase system), cellular components (ribosome), and molecular functions (structural constituent of ribosome, RNA binding, GTP binding, GTPase activity, transferase activity transferring glycosyl groups, and protein‐N(PI)‐phosphohistidine‐sugar phosphotransferase activity), whereas KEGG analysis showed pathways relevant to ribosome, butanoate metabolism, and sulfur metabolism. Our findings provided a novel strategy to identify the different sources of cerebral thrombi with microbial heterogeneity and suggested the effect of microbial colonization on translational process and cellular metabolism.
Among all of the pathogens detected by mNGS, the relative abundance of Ralstonia insidiosa seem to be different between the group with LAA group and the group with cardioembolism. A previous study demonstrated that Ralstonia can release lipopolysaccharide (LPS), 22 which contributes to the neuroinflammatory response. 23 LPS may play important roles in different causes of stroke. It has been found that LPS is present in human carotid atherosclerotic plaques, 24 and experimentally LPS aggravates atherosclerosis. 25 Animal studies have shown that LPS may trigger dysfunction of serum lipids, which further accumulate on the aortic walls to form atherosclerotic plaques in apolipoprotein E−/− mice. 26 Therefore, our results preliminarily indicate the possible high accumulation of Ralstonia insidiosa in human atherosclerotic thrombus samples, which still needs to be further confirmed by more samples.
Previous studies have confirmed the correlation of thrombus composition with functional parameters such as stroke severity (NIHSS) and clinical outcome (mRS). 27 In our study, we reported that NIHSS and mRS scores were positively related to Corynebacterium jeikeium in thrombi but negatively related to Aspergillus chevalieri in thrombi. Corynebacterium jeikeium has recently been found to be the most common species in clinical specimen cultures, particularly blood, pus, urine, and pleural effusion. Corynebacterium jeikeium has become more frequently identified as a severe cause of serious bloodstream infections, infective endocarditis, pneumonia, meningitis, and skin infections. 28 Thus, the level of Corynebacterium jeikeium in thrombi might be an indicator of severity and clinical outcomes.
Although clots form in blood, the microenvironments of arterial thrombosis and venous thrombosis differ. As our results showed, the diversity of the microbiome in LAA and cardioembolic‐derived thrombi may be attributed to the diverse blood microenvironment of thrombosis. As reported, circulating microbiota and microbiota‐related metabolites can play a prothrombotic role, which enhances thrombin generation, endothelial dysfunction, venous stasis, platelet activation, and eventually hypercoagulation. 29 Therefore, microbiota and microbiota‐related metabolites may contribute to thromboinflammation, which results in cerebral artery blockage, inflammatory responses, and severe neuronal damage following ischemic events. Further study is needed to determine the potential differences in microbiota and microbiota‐related metabolite distributions in the arterial system and venous system.
mNGS has been widely used in multiple infectious diseases and performs massive sequencing of DNA, allowing the simultaneous detection of multiple genes even at very low mutational levels. Because microbiology identification can guide diagnosis, management, and treatment strategies for patients with infection, 30 mNGS application in cerebral thrombus samples allows for a novel understanding of microbial compositions and microbe‐phenotype associations. Moreover, microbial genes of thrombi were mainly mapped to ribosomes. In line with this finding, disturbances of ribosome proteins such as S6 kinase have been prominent in platelet activation and aggregation. 31
Regarding the limitations of this study, an additional large sample size is needed to further clarify whether and how the thrombus microbiome can reveal stroke cause and guide second prevention strategies using anticoagulant therapy or antiplatelet therapy. Additionally, due to the limited size of individual thrombus samples, we did not analyze microbial metabolites. Whether targeting specific bacteria or their metabolites is effective for thrombolysis and recurrence prevention remains to be discussed.
CONCLUSIONS
Together, this mNGS‐based approach has presented evidence that microbial diversity in thrombus ecology provides a diagnostic microbiology method to distinguish atherosclerotic thrombus and cardiogenic thrombus. These findings facilitate new directions to uncover thrombus microbiome disturbances in depth and develop treatment strategies for thrombolysis and thrombosis prevention.
Sources of Funding
This work was supported by the National Natural Science Foundation of China (Nos. 81971130 and 8220052770), the technical innovation project in major clinical fields of Third Military Medical University (CX2019LC103), and Chongqing Technology Innovation and Application Development Program (no. cstc2019jscx‐gksbX0064).
Disclosures
None.
Supporting information
Figures S1–S7
Acknowledgments
Zhenhua Zhou: conception, supervision and design of this article. Xi Chen, Yuxuan He, Linke Zhou, Xingtong Li, Meiling Jiang, Min Wu, Jing Li, Li Chen, and Huan Xu: experiments and data acquisition. Xi Chen, Yuxuan He, Huan Xu, and Li Gui: data analysis. Xi Chen and Yuxuan He: article drafting and figures creating. All authors contributed to the article and approved the submitted version.
This article was sent to Luciano A. Sposato, MD, MBA, FRCPC, Associate Editor, for review by expert referees, editorial decision, and final disposition.
Supplemental Material is available at https://www.ahajournals.org/doi/suppl/10.1161/JAHA.123.033221
For Sources of Funding and Disclosures, see page 11.
References
- 1. Collaborators GBDS . Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease study 2019. Lancet Neurol. 2021;20:795–820. doi: 10.1016/S1474-4422(21)00252-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Benjamin EJ, Blaha MJ, Chiuve SE, Cushman M, Das SR, Deo R, de Ferranti SD, Floyd J, Fornage M, Gillespie C, et al. Heart disease and stroke statistics‐2017 update: a report from the American Heart Association. Circulation. 2017;135:e146–e603. doi: 10.1161/CIR.0000000000000485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Radu RA, Terecoasă EO, Băjenaru OA, Tiu C. Etiologic classification of ischemic stroke: where do we stand? Clin Neurol Neurosurg. 2017;159:93–106. doi: 10.1016/j.clineuro.2017.05.019 [DOI] [PubMed] [Google Scholar]
- 4. Suissa L, Guigonis JM, Graslin F, Robinet‐Borgomano E, Chau Y, Sedat J, Lindenthal S, Pourcher T. Combined omic analyzes of cerebral thrombi: a new molecular approach to identify cardioembolic stroke origin. Stroke. 2021;52:2892–2901. doi: 10.1161/STROKEAHA.120.032129 [DOI] [PubMed] [Google Scholar]
- 5. Wei L, Zhu Y, Deng J, Li Y, Li M, Lu H, Zhao Y. Visualization of thrombus enhancement on thin‐slab maximum intensity projection of CT angiography: an imaging sign for predicting stroke source and thrombus compositions. Radiology. 2021;298:374–381. doi: 10.1148/radiol.2020201548 [DOI] [PubMed] [Google Scholar]
- 6. Khismatullin RR, Nagaswami C, Shakirova AZ, Vrtkova A, Prochazka V, Gumulec J, Macak J, Litvinov RI, Weisel JW. Quantitative morphology of cerebral thrombi related to intravital contraction and clinical features of ischemic stroke. Stroke. 2020;51:3640–3650. doi: 10.1161/STROKEAHA.120.031559 [DOI] [PubMed] [Google Scholar]
- 7. Chen X, Wang L, Jiang M, Lin L, Ba Z, Tian H, Li G, Chen L, Liu Q, Hou X, et al. Leukocytes in cerebral thrombus respond to large‐vessel occlusion in a time‐dependent manner and the association of NETs with collateral flow. Front Immunol. 2022;13:834562. doi: 10.3389/fimmu.2022.834562 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Yamashiro K, Kurita N, Urabe T, Hattori N. Role of the gut microbiota in stroke pathogenesis and potential therapeutic implications. Ann Nutr Metab. 2021;77(Suppl 2):36–44. doi: 10.1159/000516398 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Zhu W, Gregory JC, Org E, Buffa JA, Gupta N, Wang Z, Li L, Fu X, Wu Y, Mehrabian M, et al. Gut microbial metabolite TMAO enhances platelet hyperreactivity and thrombosis risk. Cell. 2016;165:111–124. doi: 10.1016/j.cell.2016.02.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Shen X, Li L, Sun Z, Zang G, Zhang L, Shao C, Wang Z. Gut microbiota and atherosclerosis‐focusing on the plaque stability. Front Cardiovasc Med. 2021;8:668532. doi: 10.3389/fcvm.2021.668532 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Pessi T, Karhunen V, Karjalainen PP, Ylitalo A, Airaksinen JK, Niemi M, Pietila M, Lounatmaa K, Haapaniemi T, Lehtimaki T, et al. Bacterial signatures in thrombus aspirates of patients with myocardial infarction. Circulation. 2013;127(1219–1228):e1211–e1216. doi: 10.1161/CIRCULATIONAHA.112.001254 [DOI] [PubMed] [Google Scholar]
- 12. Sardu C, Consiglia Trotta M, Santella B, D'Onofrio N, Barbieri M, Rizzo MR, Sasso FC, Scisciola L, Turriziani F, Torella M, et al. Microbiota thrombus colonization may influence athero‐thrombosis in hyperglycemic patients with ST segment elevation myocardial infarction (STEMI). Marianella study. Diabetes Res Clin Pract. 2021;173:108670. doi: 10.1016/j.diabres.2021.108670 [DOI] [PubMed] [Google Scholar]
- 13. Vajpeyee A, Chauhan PS, Pandey S, Tiwari S, Yadav LB, Shroti AK, Vajpeyee M. Metagenomics analysis of thrombus samples retrieved from mechanical thrombectomy. Neurointervention. 2021;16:39–45. doi: 10.5469/neuroint.2020.00353 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Walker M, Uranga C, Levy SH, Kelly C, Edlund A. Thrombus‐associated microbiota in acute ischemic stroke patients. Surg Neurol Int. 2022;13:247. doi: 10.25259/SNI_104_2022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Liao Y, Zeng X, Xie X, Liang D, Qiao H, Wang W, Guan M, Huang S, Jing Z, Leng X, et al. Bacterial signatures of cerebral thrombi in large vessel occlusion stroke. MBio. 2022;13:e0108522. doi: 10.1128/mbio.01085-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Wensel CR, Pluznick JL, Salzberg SL, Sears CL. Next‐generation sequencing: insights to advance clinical investigations of the microbiome. J Clin Invest. 2022;132:132. doi: 10.1172/JCI154944 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Brott T, Adams HP Jr, Olinger CP, Marler JR, Barsan WG, Biller J, Spilker J, Holleran R, Eberle R, Hertzberg V, et al. Measurements of acute cerebral infarction: a clinical examination scale. Stroke. 1989;20:864–870. doi: 10.1161/01.str.20.7.864 [DOI] [PubMed] [Google Scholar]
- 18. Adams HP Jr, Bendixen BH, Kappelle LJ, Biller J, Love BB, Gordon DL, Marsh EE III. Classification of subtype of acute ischemic stroke. definitions for use in a multicenter clinical trial. TOAST. Trial of Org 10172 in acute stroke treatment. Stroke. 1993;24:35–41. doi: 10.1161/01.str.24.1.35 [DOI] [PubMed] [Google Scholar]
- 19. Sulter G, Steen C, De Keyser J. Use of the Barthel index and modified Rankin scale in acute stroke trials. Stroke. 1999;30:1538–1541. doi: 10.1161/01.str.30.8.1538 [DOI] [PubMed] [Google Scholar]
- 20. Higashida RT, Furlan AJ, Roberts H, Tomsick T, Connors B, Barr J, Dillon W, Warach S, Broderick J, Tilley B, et al. Trial design and reporting standards for intra‐arterial cerebral thrombolysis for acute ischemic stroke. Stroke. 2003;34:e109–e137. doi: 10.1161/01.STR.0000082721.62796.09 [DOI] [PubMed] [Google Scholar]
- 21. Shi Y, Wang G, Lau HC, Yu J. Metagenomic sequencing for microbial DNA in human samples: emerging technological advances. Int J Mol Sci. 2022;23:23. doi: 10.3390/ijms23042181 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Hong YH, Huang C, Wang KC, Chu TH, Li CH, Chu YJ, Cheng CP. Mutations in Ralstonia solanacearum loci involved in lipopolysaccharide biogenesis, phospholipid trafficking and peptidoglycan recycling render bacteriophage infection. Arch Microbiol. 2014;196:667–674. doi: 10.1007/s00203-014-1002-2 [DOI] [PubMed] [Google Scholar]
- 23. Singh S, Sahu K, Singh C, Singh A. Lipopolysaccharide induced altered signaling pathways in various neurological disorders. Naunyn Schmiedeberg's Arch Pharmacol. 2022;395:285–294. doi: 10.1007/s00210-021-02198-9 [DOI] [PubMed] [Google Scholar]
- 24. Carnevale R, Nocella C, Petrozza V, Cammisotto V, Pacini L, Sorrentino V, Martinelli O, Irace L, Sciarretta S, Frati G, et al. Localization of lipopolysaccharide from Escherichia coli into human atherosclerotic plaque. Sci Rep. 2018;8:3598. doi: 10.1038/s41598-018-22076-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Trojova I, Kozarova M, Petrasova D, Malachovska Z, Paranicova I, Joppa P, Tkacova R. Circulating lipopolysaccharide‐binding protein and carotid intima‐media thickness in obstructive sleep apnea. Physiol Res. 2018;67:69–78. doi: 10.33549/physiolres.933632 [DOI] [PubMed] [Google Scholar]
- 26. Zhou L, Long J, Sun Y, Chen W, Qiu R, Yuan D. Resveratrol ameliorates atherosclerosis induced by high‐fat diet and LPS in ApoE(−/−) mice and inhibits the activation of CD4(+) T cells. Nutr Metab (Lond). 2020;17:41. doi: 10.1186/s12986-020-00461-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Staessens S, François O, Brinjikji W, Doyle KM, Vanacker P, Andersson T, De Meyer SF. Studying stroke thrombus composition after thrombectomy: what can we learn? Stroke. 2021;52:3718–3727. doi: 10.1161/strokeaha.121.034289 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Yamamuro R, Hosokawa N, Otsuka Y, Osawa R. Clinical characteristics of Corynebacterium bacteremia caused by different species, Japan, 2014‐2020. Emerg Infect Dis. 2021;27:2981–2987. doi: 10.3201/eid2712.210473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Violi F, Pignatelli P, Castellani V, Carnevale R, Cammisotto V. Gut dysbiosis, endotoxemia and clotting activation: a dangerous trio for portal vein thrombosis in cirrhosis. Blood Rev. 2023;57:100998. doi: 10.1016/j.blre.2022.100998 [DOI] [PubMed] [Google Scholar]
- 30. Chiu CY, Miller SA. Clinical metagenomics. Nat Rev Genet. 2019;20:341–355. doi: 10.1038/s41576-019-0113-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Aslan JE, Tormoen GW, Loren CP, Pang J, McCarty OJ. S6K1 and mTOR regulate Rac1‐driven platelet activation and aggregation. Blood. 2011;118:3129–3136. doi: 10.1182/blood-2011-02-331579 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figures S1–S7
