Abstract
Background
Nonsense-mediated decay (NMD) is a cellular mechanism that degrades mRNAs with premature termination codons (PTCs), preventing the production of truncated, potentially harmful proteins. While its role in viral infections is increasingly recognized, the relationship between NMD-linked mutations in SARS-CoV-2 and COVID-19 severity remains poorly understood.
Objective
To investigate the presence of SARS-CoV-2 nonsense mutations predicted to trigger NMD and assess their association with clinical disease severity and viral genomic.
Methods
We conducted whole-genome sequencing on samples from 129 hospitalized COVID-19 patients. A panel of 21 nonsense mutations predicted to activate the NMD pathway was identified and analyzed. Statistical correlations with clinical severity were assessed, including multivariate analysis. Receiver Operating Characteristic (ROC) curves and interdependency analysis of mutation combinations were also performed.
Results
Five NMD-associated mutations (Variants 5, 6, 7, 9, and 15) showed significant associations with mild disease. These mutations, located in key SARS-CoV-2 genes, include Variant 5 (g.C5575G, T, synonymous substitution in ORF1ab), Variant 6 (g.T5653G, substitution in ORF1ab), Variant 7 (g.G6094GGACAGACTTT/GCCTACACGACGCTAATC, insertion in spike protein), Variant 9 (g.G6446GAATGA, insertion in spike protein), and Variant 15 (g.T10968TATATTGA, insertion in N protein). In the host-adjusted multivariable model, the presence of at least one NMD-inducing mutation was an independent protective factor (OR = 0.34, 95% CI: 0.16–0.72, p = 0.005). However, after adjusting for SARS-CoV-2 lineage, this association was attenuated (OR = 0.67, 95% CI: 0.03–13.84, p = 0.793). In the lineage-only model, Omicron infection showed higher odds of severe disease compared to Delta (OR = 1.91, 95% CI: 0.77–4.77, p = 0.164). ROC analysis indicated limited predictive value for individual variants (AUC = 0.37–0.48), but specific combinations, such as Variants 5 and 7, markedly reduced severe case incidence (92.9–4.8%, p = 0.0001).
Conclusion
NMD-inducing nonsense mutations were associated with reduced COVID-19 severity in host-adjusted analyses, but this effect diminished after accounting for viral lineage, suggesting that variant distribution, particularly Omicron may influence these associations. Integrating viral genomic background with host and clinical data may enhance risk prediction and inform antiviral strategies.
Supplementary Information
The online version contains supplementary material available at 10.1186/s12879-025-11609-8.
Keywords: COVID-19, Nonsense-mediated decay, SARS-CoV-2, Variants, Whole-genome sequencing
Introduction
The coronavirus disease 2019 (COVID-19), caused by SARS-CoV-2, has been one of the most impactful global health crises in modern history [1, 2]. Since its emergence in late 2019, the virus has infected over 700 million individuals and led to more than 7 million deaths worldwide [3, 4]. While many patients experience mild to moderate illness, others develop severe complications such as acute respiratory distress syndrome (ARDS), multi-organ failure, and death [5, 6]. This wide variability in clinical outcomes has prompted intense research into the virological and host-related factors that influence disease severity.
One emerging area of interest is the role of nonsense-mediated decay (NMD) in viral infections, including COVID-19 [7, 8]. NMD is a conserved cellular surveillance mechanism that degrades messenger RNA (mRNA) transcripts containing premature termination codons (PTCs) [9]. This process protects cells from producing truncated, potentially harmful proteins [10]. Beyond its housekeeping role, NMD also regulates the expression of many normal genes and can influence immune responses, stress reactions, and viral replication [11, 12].
Viruses, including SARS-CoV-2, can interact with the host NMD pathway in complex ways [10]. Some viral RNAs may be targeted by NMD, reducing their expression and limiting replication [11, 13]. However, viruses can also evolve to evade or suppress NMD, allowing even defective transcripts to persist and potentially confer advantages such as immune evasion or altered pathogenesis [14]. Therefore, mutations in the viral genome that introduce PTCs or affect RNA splicing, collectively termed NMD-inducing or NMD-associated mutations, may play a role in modulating the clinical course of infection [15].
Recent sequencing studies have revealed the presence of NMD-associated mutations across various SARS-CoV-2 lineages, including Variants of Concern (VOCs), such as Alpha, Delta, and Omicron [16]. The frequency and distribution of these NMD-related mutations differ between variants, possibly reflecting evolutionary pressures and host interactions [17]. These mutations, which can introduce premature termination codons (PTCs) or affect RNA splicing, may influence viral gene expression and pathogenicity [18]. It has been demonstrated that the Delta variant, associated with more severe disease, has shown a higher frequency of certain NMD-associated mutations [19], whereas the Omicron variant, despite its high transmissibility, presents a more complex mutational profile with varying impacts on severity [20].
Importantly, the relationship between NMD-associated mutations and COVID-19 disease severity remains poorly understood. On one hand, NMD-triggered degradation of essential viral transcripts may reduce viral load and attenuate disease [7]. On the other hand, mutations that allow the virus to escape NMD could support ongoing replication and immune evasion, thereby worsening clinical outcomes [21, 22]. Additionally, host genetic differences in the efficiency of NMD machinery may further influence how these mutations affect disease progression [23].
Preliminary studies suggest that the presence of NMD-inducing mutations in SARS-CoV-2 may correlate with increased disease severity in some cases [24, 25], potentially through enhanced inflammatory responses or prolonged viral persistence [26, 27]. However, systematic investigations linking these mutations with clinical data across large patient populations are limited.
This study aims to explore the association between NMD-inducing mutations in the SARS-CoV-2 genome and the severity of COVID-19 in infected individuals. By analyzing the genomic sequences of viral isolates from patients with varying clinical presentations, we will assess the frequency, distribution, and functional impact of these mutations. Understanding how viral mutations interact with host RNA surveillance mechanisms such as NMD can provide deeper insights into the molecular underpinnings of COVID-19 pathogenesis. This knowledge may help identify novel biomarkers for disease severity, guide therapeutic development, and inform public health responses to emerging variants.
Methods
Patient recruitment and clinical categorization
This study was designed as a retrospective observational cohort study conducted between December 2021 and December 2022. It involved the collection and analysis of clinical and genomic data from 126 patients with RT-PCR–confirmed SARS-CoV-2 infection who were admitted to military hospitals. The objective was to investigate the association between viral NMD-associated mutations and the clinical severity of COVID-19. Ethical approval was obtained from the Egypt Center For Research and Regenerative Medicine (1/11-2021), and written informed consent was obtained from all participants or their legal guardians. Eligible participants were adults aged 18 years or older who had tested positive for SARS-CoV-2 by real-time reverse transcription polymerase chain reaction (RT-PCR) and were admitted to one of the participating hospitals. Patients were clinically categorized into three severity groups based on the World Health Organization (WHO) guidelines and institutional criteria: mild (symptomatic without radiologic or clinical signs of pneumonia), moderate (radiographic evidence of pneumonia without need for oxygen supplementation), and severe (requiring supplemental oxygen, intensive care unit admission, or mechanical ventilation). Patients were recruited from a single outbreak during a defined time, ensuring a homogeneous cohort, and statistical analyses were adjusted for comorbidities and age confounders to minimize bias.
Exclusion criteria included the absence of confirmatory RT-PCR results, incomplete clinical records, prior receipt of antiviral therapy before hospital admission, immunocompromised status such as active cancer or organ transplantation, and patient refusal or withdrawal of consent. Clinical and demographic data, including age, sex, comorbidities, date of symptom onset, hospital stay duration, and clinical outcomes, were collected using a standardized case report form completed by trained study staff from patient interviews and electronic medical records.
Sample collection
Nasopharyngeal (NP) swab specimens were collected from all patients for virological analysis. Swabs were obtained using sterile flocked swabs by trained healthcare professionals following WHO-recommended procedures. The swab was gently inserted through one nostril to reach the posterior nasopharynx, rotated to absorb secretions, and then immediately placed into 3 mL of sterile viral transport medium (VTM).
Specimens were transported to the central virology laboratory within two hours of collection using cold-chain procedures at 2–8 °C. In cases where transport exceeded two hours, samples were stored at 2 °C and shipped under temperature-controlled conditions. Upon arrival, samples were handled in a Class II biosafety cabinet in a Biosafety Level 2 Plus facility. Each sample was assigned a unique barcode and processed according to standard laboratory protocols.
Aliquots were prepared for RNA extraction, RT-PCR confirmation, and whole-genome sequencing. The remaining sample material was stored at − 80 °C for future analyses. All laboratory personnel wore appropriate personal protective equipment (PPE), and strict biosafety and quality control protocols were followed throughout the process to ensure sample integrity and prevent contamination.
Viral genome sequences were obtained from nasopharyngeal swabs collected upon patient admission, followed by RNA extraction and whole-genome sequencing using standard protocols. Sequences were processed and aligned to the reference SARS-CoV-2 genome using bioinformatics tools to identify mutations.
RNA extraction and sequencing
Viral RNA was extracted from nasopharyngeal swab samples using the QIAamp Viral RNA Mini Kit (QIAGEN, Hilden, Germany) according to the manufacturer’s instructions. All extraction steps were performed under sterile conditions in a Class II biosafety cabinet within a Biosafety Level 2 Plus (BSL-2+) facility. Extracted RNA samples were quantified using the Qubit Fluorometer (Thermo Fisher Scientific, USA) to ensure sufficient concentration and quality for downstream sequencing applications [28].
Whole genome sequencing (WGS) of SARS-CoV-2
Whole genome sequencing of SARS-CoV-2 was conducted using the Illumina iSeq 100 platform to generate high-quality viral genomic data for mutation analysis. Following RNA extraction and quantification, complementary DNA (cDNA) synthesis and targeted amplification of the SARS-CoV-2 genome were performed using a tiled amplicon-based approach optimized for high genome coverage. Library preparation was conducted according to the manufacturer’s protocol using the iSeq 100 i1 Reagent v2 (300-cycle) kit (Illumina, PN 20021535), which allows for paired-end sequencing with 2 × 150 bp reads.
Amplicon libraries were prepared by reverse transcription of extracted viral RNA, followed by multiplex PCR using SARS-CoV-2 specific primers designed to amplify overlapping regions across the entire viral genome. After amplification, libraries were cleaned using magnetic bead-based purification and indexed through ligation with unique dual indices to allow for multiplexing of multiple samples per run. The quality and fragment size distribution of the libraries were verified using an Agilent Bioanalyzer or Qubit fluorometric analysis.
The pooled libraries were denatured, diluted, and loaded onto the Illumina iSeq 100 System according to the manufacturer’s instructions. Sequencing runs were monitored in real time to ensure optimal performance metrics, including cluster density, base quality scores, and Q30 values. Each sequencing batch included a no-template control and a positive SARS-CoV-2 control to validate run performance and detect potential contamination [29].
Data analysis & bioinformatics
Raw reads processing and mapping
Raw sequencing reads were demultiplexed and converted to FASTQ format (Version 0.11.9) and MultiQC (Version 1.13) using Illumina’s bcl2fastq software. Quality control checks were performed using FastQC, and adapter sequences and low-quality bases were trimmed using Trimmomatic [30]. High-quality reads were aligned to the SARS-CoV-2 reference genome (NC_045512.2) using the Burrows-Wheeler Aligner (BWA-MEM) (Version 0.7.17-r1188) algorithm. PCR duplicates were marked and removed using Picard Tools, and alignment statistics were reviewed to ensure adequate coverage (> 95% of the genome, with an average depth of ≥ 100×) [31].
Variant detection
Variant calling was performed using the iVar pipeline in conjunction with SAMtools (Version 1.17) [32], followed by refinement and confirmation using the Genome Analysis Toolkit (GATK) HaplotypeCaller [33]. Variants were filtered to exclude sequencing artifacts, low-quality calls (Phred score < 30), and regions with coverage < 10×. A consensus genome sequence for each sample was generated based on the high-confidence variants.
Variants were functionally annotated using SnpEff with a custom SARS-CoV-2 database to identify synonymous, non-synonymous, frameshift, and nonsense mutations [34]. We focused specifically on identifying NMD-associated mutations, defined as:
Nonsense mutations introduce premature termination codons (PTCs) in protein-coding regions.
Frameshift insertions or deletions (indels) are predicted to introduce PTCs downstream.
Splice site mutations located within ± 2 base pairs of canonical splice donor or acceptor sites.
To assess the likelihood of NMD targeting, mutations were further analyzed in the context of transcript structure. Specifically, premature stop codons located > 50–55 nucleotides upstream of the final exon-exon junction were flagged as potential NMD-triggering events based on established criteria for NMD recognition in eukaryotic cells. These predictions were cross-referenced with published SARS-CoV-2 transcriptome models and viral open reading frame (ORF) annotations.
All identified NMD-associated mutations were cataloged by genomic position, variant type, affected gene or ORF, and patient severity classification (mild, moderate, or severe). Frequencies were compared across clinical groups to explore potential correlations between NMD mutation burden and disease severity.
Where applicable, mutational profiles were also mapped to known Variants of Concern (VOCs) using lineage assignment tools such as Pangolin and Nextclade, and results were validated against publicly available datasets (GISAID and CoV-GLUE) [35].
Finally, Spearman correlation coefficient analysis between selected mutations was done using the “cor” function in R (Version 4.2), and the sample clustering was performed using the “hclust” function in R (Version 4.2).
Viral lineage determination
Whole-genome SARS-CoV-2 sequences from all study participants were analyzed to assign Pango lineages using pangolin v4.3 with the latest pangoLEARN model (accessed July 2025). WHO-recognized variant classifications were mapped from Pango lineages, resulting in four lineage categories in our dataset: Delta (B.1.617.2 and sublineages), Alpha (B.1.1.7 and sublineages), Omicron (BA., XBB., and sublineages), and VOIs/VOCs (other) (e.g., C.36.3, C.36, B.1.351). The results were validated against publicly available datasets (GISAID and CoV-GLUE) [35].
Sample size calculation
The sample size for this study was determined using power analysis to detect a significant association between the presence of NMD-associated mutations and COVID-19 disease severity. The calculation was based on a two-sided Chi-square test for comparing proportions between groups, with a significance level (α) of 0.05 and a power (1–β) of 80%. An effect size (Cohen’s w) of 0.25 was assumed, representing a medium effect based on preliminary literature and pilot data. The minimum required sample size was calculated to be 120 participants. To account for a potential 5% rate of data exclusion due to poor sequencing quality or incomplete clinical data, the target enrollment was increased slightly. As a result, 129 patients were recruited between December 2021 and December 2022.
The sample size calculation was performed using G*Power version 3.1.9.7 (Heinrich Heine University Düsseldorf, Germany), employing the Chi-square test family for goodness-of-fit (contingency tables). This sample size was deemed adequate to detect statistically significant differences in the frequency of NMD-associated mutations across clinical severity groups (mild, moderate, and severe) with sufficient precision and reliability.
Statistical analysis
Statistical analyses were performed using SPSS version 26.0 (IBM Corp., Armonk, NY) and GraphPad Prism version 9.0 (GraphPad Software, San Diego, CA). Patient demographic and clinical characteristics were summarized using descriptive statistics. Continuous variables were presented as means ± standard deviations (SD) or medians with interquartile ranges (IQR), depending on data distribution assessed by the Shapiro–Wilk test. Categorical variables were expressed as counts and percentages.
To evaluate the association between NMD-associated mutations and COVID-19 disease severity, patients were grouped into three severity categories (mild, moderate, and severe). The frequency of NMD-inducing mutations was compared across severity groups using the Chi-square test or Fisher’s exact test, as appropriate. Continuous variables such as age, viral load (Ct values), and number of NMD-associated mutations per genome were compared between groups using one-way ANOVA for normally distributed data or the Kruskal–Wallis test for non-parametric data.
We conducted multivariable logistic regression to evaluate the association between NMD-linked mutations, viral lineage, and COVID-19 severity, adjusting for host factors (age, sex, comorbidities). Analyses were conducted in three steps: (1) NMD-positive vs. NMD-negative model, Host-adjusted model assessing the association between NMD status and severity, without including viral lineage, (2) Viral lineage model, Model with WHO variant (Delta as reference category) as the primary exposure, adjusting for host factors, to assess association between lineage and severity, and (3) Lineage-adjusted NMD model, Model including both NMD status and WHO variant (Delta as reference) to assess the independent association between NMD status and severity after accounting for viral lineage.
Odds ratios (ORs), 95% confidence intervals (CIs), and p-values were calculated. Models with insufficient exposure variation (e.g., where all patients in a lineage shared the same NMD status) were noted as not estimable (NE). A p-value < 0.05 was considered statistically significant.
Receiver operating characteristic (ROC) curve analysis was also performed to evaluate the predictive value of the NMD mutation burden for distinguishing severe from non-severe cases. The area under the curve (AUC) was used to quantify diagnostic performance.
Where relevant, correlations between the number of NMD-associated mutations and clinical outcomes, such as length of hospital stay, were assessed using Spearman’s rank correlation coefficient.
Results
Variant filtering and identification of NMD-Associated mutations
Whole-genome sequencing was performed on 129 SARS-CoV-2 samples collected from patients admitted to military hospitals. Following quality control and bioinformatics filtering, an initial total of 2,192 variants were identified. Applying depth (Dp ≥ 100) and functional annotation filters, 444 high-confidence variants were retained for further analysis. Among these, 47 loss-of-function mutations were identified, all located within the open reading frame 1ab (ORF1ab) region. Of particular interest, 21 nonsense mutations predicted to activate the nonsense-mediated decay (NMD) mechanism were selected as the focus of this study.
Demographic and clinical characteristics
The clinical and demographic distribution of the studied cohort is shown in Table 1. The cohort consisted of 52 females (39.8%) and 77 males (60.2%), with no statistically significant difference in gender distribution across severity levels (p = 0.958). Similarly, age stratification showed no significant association with disease severity (p > 0.05 for all age groups).
Table 1.
Demographic and clinical data for the studied cohort:
| Clinical data | Positivity | N (%) | Mild | Moderate | Severe | P-value |
|---|---|---|---|---|---|---|
| Gender | Female | 52(39.8) | 8(36.8) | 27(40.3) | 17(40.5) | 0.958 |
| Male | 77(60.2) | 12(63.2) | 40(59.7) | 25(59.5) | ||
| Age (18–24) | Negative | 12(9.4) | 4(21.1) | 5(7.5) | 3(7.1) | 0.153 |
| Positive | 116(90.6) | 15(78.9) | 62(92.5) | 39(92.9) | ||
| Age (25–64) | Negative | 123(96.1) | 17(89.5) | 66(98.5) | 40(95.2) | 0.540 |
| Positive | 5(3.9) | 2(10.5) | 1(1.5) | 2(4.8) | ||
| Age (≥ 65) | Negative | 121(94.5) | 17(89.5) | 63(94) | 41(97.6) | 0.190 |
| Positive | 7(5.5) | 2(10.5) | 4(6) | 1(2.4) | ||
| Fever ≥ 38 °C | Negative | 20(14.8) | 4(15.8) | 12(63.2) | 4(21.1) | 0.484 |
| Positive | 109(85.2) | 16(14.7) | 55(50.5) | 38(34.9) | ||
| Muscle pain (Myalgia) | Negative | 62(47.7) | 13(19.7) | 28(45.9) | 21(34.4) | 0.241 |
| Positive | 67(52.3) | 7(10.4) | 39(58.2) | 21(31.3) | ||
| Sore throat | Negative | 69(53.5) | 14(20) | 33(48) | 22(32) | 0.218 |
| Positive | 60(46.5) | 5(9) | 35(58) | 20(33) | ||
| Cough | Negative | 35(26.6) | 6(14.7) | 20(58.8) | 9(26.5) | 0.625 |
| Positive | 94(73.4) | 14(14.9) | 47(50) | 33(35.1) | ||
| Dyspnea | Negative | 84(64.8) | 16(19.3) | 40(47) | 28(33.7) | 0.106 |
| Positive | 45(35.2) | 3(6.7) | 28(62.2) | 14(31.1) | ||
| Wheezing | Negative | 125(97.7) | 19(15.2) | 64(51.2) | 42(33.6) | 0.247 |
| Positive | 4(2.3) | 0(0) | 3(100) | 0(0) | ||
| Chest pain | Negative | 91(71.1) | 17(18.7) | 45(49.5) | 29(31.9) | 0.156 |
| Positive | 38(28.9) | 2(5.4) | 22(95.5) | 14(35.1) | ||
| Headache | Negative | 91(71.1) | 14(15.4) | 45(49.5) | 32(35.2) | 0.578 |
| Positive | 38(28.9) | 6(13.5) | 22(59.5) | 10(27 | ||
| Joint Pain | Negative | 104(81.3) | 16(15.4) | 53(51) | 35(33.7) | 0.806 |
| Positive | 25(18.8) | 3(12.5) | 15(58.3) | 7(29.2) | ||
| Nausea/Vomiting | Negative | 107(83.6) | 17(15.9) | 55(51.4) | 35(32.7) | 0.744 |
| Positive | 22(16.4) | 2(9.5) | 12(57.1) | 8(33.3) | ||
| Coma | Negative | 127(99.2) | 19(15) | 66(52) | 42(33.1) | 0.632 |
| Positive | 2(0.8) | 0(0) | 2(100) | 0(0) |
N (%): N: Number of patients, (%): percentage within infection severity, P-value ≤0.05 considered significant.
Patients were stratified into three clinical severity categories, “mild, moderate, and severe,” based on chest Computerized Axial Tomography (CT) findings and clinical parameters. The cohort was then divided into two groups based on the presence (positive) or absence (negative) of one or more of the 21 identified NMD-related variants.
Clinical symptoms such as fever, myalgia, sore throat, cough, and dyspnea did not demonstrate statistically significant differences between the NMD variant-positive and variant-negative groups across severity categories (p > 0.05 for all symptoms).
Association between NMD-Associated variants and disease severity
Tables 2 and 3, and Fig. 1. presents the distribution of the 21 NMD-associated variants across patient severity groups and their statistical associations. Among these:
Table 2.
Frequencies and distribution of the detected variants in the studied cohort
| Variant | Positivity | N (%) | Mild | Moderate | Severe | Statistics |
|---|---|---|---|---|---|---|
| Variant 1 | Negative | 128(99.20 | 18(94.7) | 68(52.8) | 42(32.8) | 0.056 |
| Positive | 1(0.8) | 1(100) | 0(0) | 0(0) | ||
| Variant 2 | Negative | 116(89.8) | 17(14.8) | 62(53) | 37(32.2) | 0.883 |
| Positive | 13(10.2) | 2(15.4) | 6(46.2) | 5 (38.5) | ||
| Variant 3 | Negative | 123(96.1) | 16(84.2 | 65(97) | 42(99.2) | 0.011 |
| Positive | 6(3.9) | 3(15.8) | 2(3) | 1(0.8) | ||
| Variant 4 | Negative | 125(96.9) | 17(89.5) | 65(97) | 43(100) | 0.091 |
| Positive | 4(3.1) | 2(10.5) | 2(3) | 0(0) | ||
| Variant 5 | Negative | 98(76.6) | 7(36.8) | 54(80.6) | 37(88.1) | 0.000 |
| Positive | 31(23.4) | 13(63.2) | 13(19.4) | 5(11.9) | ||
| Variant 6 | Negative | 117(9.6) | 11(52.6) | 64(95.5) | 42(100) | 0.0001 |
| Positive | 12(9.4) | 9(47.4) | 3(4.5) | 0(0) | ||
| Variant 7 | Negative | 104(80.5) | 11(52.6) | 54(80.6) | 39(92.9) | 0.001 |
| Positive | 25(19.5) | 9(47.4) | 13(19.4) | 3(7.1) | ||
| Variant 8 | Negative | 102(78.9) | 14(73.7) | 54(80.6) | 34(78.6) | 0.807 |
| Positive | 27(21.1) | 5(26.3) | 13(19.4) | 9(21.4) | ||
| Variant 9 | Negative | 117(90.6) | 12(63.2) | 64(94) | 41(97.6) | 0.000 |
| Positive | 12(9.4) | 7(36.8) | 4(6) | 1(2.4) | ||
| Variant 10 | Negative | 117(91.4) | 18(94.7) | 60(89.6) | 39(92.9) | 0.714 |
| Positive | 12(8.6) | 1(5.3) | 7(10.4) | 4(2.3) | ||
| Variant 11 | Negative | 124(96.9) | 18(94.7) | 65(97) | 41(97.6) | 0.832 |
| Positive | 5(3.1) | 1(5.3) | 2(3) | 2(4) | ||
| Variant 12 | Negative | 125(97.7) | 17(98.5) | 66(98.5) | 42(100) | 0.034 |
| Positive | 4(2.3) | 2(10.5) | 2(1.5) | 0(0) | ||
| Variant 13 | Negative | 127(99.2) | 19(100) | 66(98.5) | 42(100) | 0.632 |
| Positive | 2(0.8) | 0(0) | 2(1.8) | 0(0) | ||
| Variant 14 | Negative | 125(97.7) | 17(89.5) | 67(100) | 41(97.4) | 0.028 |
| Positive | 4(2.3) | 2(10.5) | 0(0) | 2(2.6) | ||
| Variant 15 | Negative | 125 (97.7) | 16(84.2) | 67(100) | 42(100) | 0.0001 |
| Positive | 4(2.3) | 3(15.8) | 0(0) | 0(0) | ||
| Variant 16 | Negative | 105(81.3) | 14(68.4) | 57(85.1) | 34(81) | 0.259 |
| Positive | 24(18.8) | 6(31.6) | 10(14.9) | 9(19) | ||
| Variant 17 | Negative | 106(82.8) | 14(73.7) | 58(86.6) | 34(81) | 0.391 |
| Positive | 23(17.2) | 5(26.3) | 9(13.4) | 9(19) | ||
| Variant 18 | Negative | 80(62.5) | 8(42.1) | 44(65.7) | 28(66.7) | 0.137 |
| Positive | 49(37.5) | 11(57.9) | 23(34.4) | 15 (33.3) | ||
| Variant 19 | Negative | 123(96.1) | 17(89.5) | 64(95.5) | 42(100) | 0.136 |
| Positive | 6(3.9) | 2(10.5) | 4(4.5) | 0(0) | ||
| Variant 20 | Negative | 107(83.6) | 11(57.9) | 59(88.1) | 37(88.1) | 0.005 |
| Positive | 22(16.4) | 9(42.1) | 8(11.9) | 5(11.9) | ||
| Variant 21 | Negative | 31(23.4) | 9(47.4) | 15(20.9) | 7(16.7) | 0.025 |
| Positive | 98(76.6) | 10(52.6) | 53(79.1) | 35(83.3) |
N (%): N: Number of patients, (%): percentage within infection severity, P-value ≤ 0.05 considered significant
Table 3.
NMD-associated variants
| No | Variants | Variants |
|---|---|---|
| 1 | g.C589CTAGT | ins TAGT |
| 2 | g.G2654T | Substitution |
| 3 | g.C3444CCGGCAGCCCCATCGGTCCTCTCGCATGGAGCTTCTGTGTGGAAACTAA, CCCTACACGGAACGTTCTGAAAAG | insCGGCAGCCCCATCGGTCCTCTCGCATGGAGCTTCTGTGTGGAAACTAA/ins CCTACACGGAACGTTCTGAAAAG |
| 4 | g.T3836TGTTAAAAACAGTACAATTCTG | insGTTAAAAACAGTACAATTCTG |
| 5 | g.C5575G, T | Substitution/Substitution synonymous substitution |
| 6 | g.T5653G | Substitution |
| 7 | g.G6094GGACAGACTTT, GCCTACACGACGCTAATC | insGACAGACTTT/insCCTACACGACGCTAATC |
| 8 | g.G6422T | Substitution |
| 9 | g.G6446GAATGA | insAATGA |
| 10 | g.T7258A | Substitution |
| 11 | g.T7672TTAAGGAAGGTGTAGAGTTTCTTA | insTAAGGAAGGTGTAGAGTTTCTTA |
| 12 | g.C8316CTTCTTTAAGTTTAGAATAG, CTACACACCCTCTTTTAAGAAAGG, CTGATCA | insTTCTTTAAGTTTAGAATAG/insTACACACCCTCTTTTAAGAAAGG/insTGATCA |
| 13 | g.A8378T | Substitution |
| 14 | g.A10888AGCTGATGTTACTAAAATAAAACCT, ATACAACTAGCTACAGAGAAG | insGCTGATGTTACTAAAATAAAACCT/insTACAACTAGCTACAGAGAAG |
| 15 | g.T10968TATATTGA | insATATTGA |
| 16 | g.T11187A | Substitution |
| 17 | g.T11880A | Substitution |
| 18 | g.C12420CTAGTTA | insTAGTTA |
| 19 | g.A12615AGCTGATGTTACTAAAATAAAACCT | insGCTGATGTTACTAAAATAAAACCT |
| 20 | g.C12713CAG, CACACCCTCTTTTAAGAAAG | insACACCCTCTTTTAAGAAAG |
| 21 | g.T13408A | Substitution |
Fig. 1.
Heatmap of Positive NMD-Associated Variants Across COVID-19 Severity Levels. Heatmap displaying the number of patients who tested positive for each of the 21 identified NMD-associated variants, stratified by COVID-19 severity (mild, moderate, and severe). Each row represents a specific NMD-related nonsense mutation in the SARS-CoV-2 ORF1ab region, and each column indicates a severity level. The color intensity corresponds to the number of patients in each category, with darker shades indicating higher counts
Variants 5, 6, 7, 9, and 15 showed highly significant associations with disease severity (p < 0.001). In each case, patients positive for these variants were more likely to have mild disease, while negative cases showed a higher frequency of severe outcomes, suggesting a potential protective role of these variants.
Variants 3 and 20 demonstrated moderate significance (p ≤ 0.01), indicating a possible association with disease progression, though less pronounced than the highly significant variants.
Variants 12, 14, and 21 showed mild statistical significance (p < 0.05), suggesting weaker but potentially relevant associations.
The remaining 11 variants (Variants 1, 2, 4, 8, 10, 11, 13, 16, 17, 18, and 19) did not show any statistically significant correlation with disease severity (p ≥ 0.05), indicating no detectable relationship in this cohort.
These findings highlight a heterogeneous distribution of NMD-associated mutations and their varied influence on clinical severity. Figure 2. Presents that the most significant variants appear to correlate with reduced severity, which means that they have potential protective or neutral effects, supporting the hypothesis that certain NMD-triggering mutations may attenuate viral pathogenicity by interfering with essential viral gene expression and modulating disease outcome.
Fig. 2.
Distribution of COVID-19 Severity Among Positive Cases for Highly Significant NMD-Associated Variants. Bar chart showing the distribution of disease severity (mild, moderate, severe) among patients who tested positive for five NMD-associated SARS-CoV-2 variants that showed high statistical significance in relation to clinical severity (Variants 5, 6, 7, 9, and 15; p < 0.001 for each)
Comparative ROC analysis of NMD variant burden
To evaluate the discriminative power of NMD-associated mutations for predicting severe COVID-19, two ROC analyses were conducted. The first model included all 21 NMD-associated nonsense mutations, while the second included only the 12 variants that demonstrated statistically significant associations with disease severity.
The ROC curve using all 21 variants produced an AUC of 0.37, suggesting poor predictive ability. This may be due to the inclusion of variants with no functional relevance, which likely diluted the impact of protective variants. In contrast, the model restricted to the 12 significant variants yielded a slightly improved AUC of 0.48 but still failed to achieve acceptable discriminative performance (Fig. 3). These findings indicate that although certain NMD mutations are statistically associated with reduced severity, they do not function as effective standalone biomarkers for predicting disease outcomes. They may, however, contribute value as part of a multivariable model incorporating host factors and clinical data.
Fig. 3.
Comparison of ROC Curves for NMD Mutation Burden. ROC curves comparing the predictive value of two models for identifying severe COVID-19 cases based on NMD mutation burden. The grey dashed line represents the ROC curve using all 21 NMD-associated variants (AUC = 0.37). The solid green line represents the ROC curve using only the 12 statistically significant variants (AUC = 0.48). Both curves fall below the line of no discrimination (diagonal), indicating that NMD mutation burden alone, whether broadly defined or refined to significant variants, has limited predictive power for distinguishing severe from non-severe COVID-19 outcomes
Multivariable logistic regression analyses of NMD status, viral lineage, and severe COVID-19
A total of 129 patients were included, with WHO variant distribution as follows: Delta (n = 47, 36.7%), Omicron (n = 41, 32.0%), VOIs/VOCs (n = 31, 24.2%), and Alpha (n = 10, 7.0%). Severity rates varied by lineage: Alpha 11.1%, Delta 25.5%, Omicron 39.0%, and VOIs/VOCs 45.2%. The complete list of patient-level lineage assignments is provided in Supplementary Table S1.
Association between NMD-positive status and disease severity
In the host-adjusted model (age, sex, comorbidities), NMD-positive status was significantly associated with reduced odds of severe COVID-19 (adjusted OR = 0.34, 95% CI: 0.16–0.72, p = 0.005).
Association between viral lineage and disease severity
When WHO variant was modeled as the primary exposure (Delta as the reference), Omicron infection was associated with 1.91 times higher odds of severe disease compared to Delta, although this did not reach statistical significance (95% CI: 0.77–4.77, p = 0.164). Alpha and VOIs/VOCs could not be stably estimated due to small sample sizes and/or lack of variation in severity within the group.
Association between NMD status and disease severity, adjusted for viral lineage
When WHO variant was added to the NMD model (Delta reference), the association between NMD-positive status and severity was attenuated (adjusted OR = 0.67, 95% CI: 0.03–13.84, p = 0.793) and lost statistical significance. In this lineage-adjusted model, Omicron retained a similar effect size compared to the lineage-only model (adjusted OR = 1.87, 95% CI: 0.74–4.73, p = 0.184). The attenuation of the NMD association reflects that part of the observed severity variation was explained by differences in viral lineage. The full regression outputs for all three models are presented in Table 4.
Table 4.
Multivariate logistic regression models predicting severe COVID-19 disease
| Variable | NMD vs. Severity Model | Lineage vs. Severity Model | NMD vs. Severity, lineage-adjusted | ||||||
|---|---|---|---|---|---|---|---|---|---|
| OR | 95% CI | P-value | OR | 95% CI | P-value | OR | 95% CI | P-value | |
| Age | 1.02 | 1.00–1.04 | 0.078 | 1.02 | 1.00–1.04 | 0.078 | 1.02 | 1.00–1.04 | 0.078 |
| Sex (Male) | 1.18 | 0.56–2.49 | 0.670 | 1.18 | 0.56–2.49 | 0.670 | 1.18 | 0.56–2.49 | 0.670 |
| Comorbidities | 1.94 | 0.92–4.09 | 0.082 | 1.94 | 0.92–4.09 | 0.082 | 1.94 | 0.92–4.09 | 0.082 |
| NMD_Positive | 0.34 | 0.16–0.72 | 0.005 | — | — | — | 0.67 | 0.03–13.84 | 0.793 |
| Viral lineage (WHO variant) | |||||||||
| Alpha | — | — | — | NE | NE | NE | NE | NE | NE |
| Omicron | — | — | — | 1.9 | 0.77–4.72 | 0.164 | 1.87 | 0.74–4.73 | 0.184 |
| VOIs/VOCs | — | — | — | NE | NE | NE | NE | NE | NE |
Multivariable logistic regression models evaluating the association between NMD-positive status, viral lineage, and severe COVID-19. Model 1: Association between NMD-positive status and severity, adjusted for age, sex, and comorbidities. Model 2: Association between WHO variant (Delta reference) and severity, adjusted for age, sex, and comorbidities. Model 3: Association between NMD-positive status and severity after adjustment for WHO variant (Delta reference) and host factors. NE: Not estimable due to no variation in NMD status or insufficient cell counts.
Age showed a trend toward significance (p = 0.078), indicating a possible increase in risk with advancing age, though not reaching conventional significance. Comorbidities also demonstrated a borderline effect (p = 0.082), consistent with previous findings on increased risk of severe outcomes. Sex (male vs. female) was not significantly associated with severity in this model (p = 0.670).
These findings support the hypothesis that NMD-triggering mutations may attenuate the pathogenic potential of SARS-CoV-2 and reinforce the importance of incorporating viral genomic features in clinical severity prediction models.
Synergistic influence of NMD-Related mutations on infection severity
To assess the association between NMD-associated mutations and COVID-19 severity, Chi-square tests were conducted, followed by False Discovery Rate (FDR) correction to account for multiple hypothesis testing. Variants 5, 7, 5 + 7, 5 + 18, and 15 exhibited significant associations with disease severity, with p-values < 0.05, and these associations remained robust after FDR correction (FDR < 0.05). Specifically, Variants 5 and 15 demonstrated the strongest associations, with p-values of 0.0000 and 0.0001, respectively, and corresponding FDR values of 0.0000 and 0.0002. In contrast, Variant 18, with a p-value of 0.137, did not show a significant association after FDR correction (FDR = 0.137), indicating no strong evidence for its role in disease severity. These results suggest that, while several mutations are strongly associated with COVID-19 severity, others, such as Variant 18, require further investigation to confirm their potential relevance. The use of FDR adjustment ensures that the identified associations are robust and not due to random chance, addressing concerns about multiple comparisons (Table 5).
Table 5.
Association between COVID-19 severity and NMD-Associated variants with high interdependency (ID ≥ 0.65)
| Infection severity | Positivity | N (%) | Mild | Moderate | Severe | P-value |
|---|---|---|---|---|---|---|
| Variant 5 | Negative | 98(76.6) | 7(36.8) | 54(80.6) | 37(88.1) | 0.0001 |
| Positive | 30(23.4) | 12(63.2) | 13(19.4) | 5(11.9) | ||
| Variant 7 | Negative | 103(80.5) | 10(52.6) | 54(80.6) | 39(92.9) | 0.001 |
| Positive | 25(19.5) | 9(47.4) | 13(19.4) | 3(7.1) | ||
| Variant 5 + 7 | Negative | 108(84.4) | 10(52.6) | 58(86.6) | 40(95.2) | 0.0001 |
| Positive | 20(15.6) | 9(47.4) | 9(13.4) | 2(4.8) | ||
| Variant 5 | Negative | 98(76.6) | 7(36.8) | 54(80.6) | 37(88.1) | 0.000 |
| Positive | 30(23.4) | 12(63.2) | 13(19.4) | 5(11.9) | ||
| Variant 18 | Negative | 80(62.5) | 8(42.1) | 44(65.7) | 28(66.7) | 0.137 |
| Positive | 48(37.5) | 11(57.9) | 23(34.4) | 14(33.3) | ||
| Variant 5 + 18 | Negative | 101(78.9) | 9(47.4) | 55(82.1) | 37(88.1) | 0.001 |
| Positive | 27(21.1) | 10(52.6) | 12(17.9) | 5(11.9) | ||
| Variant 12 | Negative | 125(97.7) | 17(98.5) | 66(98.5) | 42(100) | 0.034 |
| Positive | 3(2.3) | 2(10.5) | 1(1.5) | 0(0) | ||
| Variant 15 | Negative | 98(76.6) | 16(84.2) | 67(100) | 42(100) | 0.000 |
| Positive | 30(23.4) | 3(15.8) | 0(0) | 0(0) | ||
| Variant 12 + 15 | Negative | 125(97.7) | 17(98.5) | 66(98.5) | 42(100) | 0.034 |
| Positive | 3(2.3) | 2(10.5) | 1(1.5) | 0(0) |
These findings indicate that specific NMD-related mutations, particularly Variants 5, 7, and 15, may function as independent protective markers. Moreover, their combined presence appears to exert a cumulative, synergistic effect, further reducing the risk of severe disease.
Discussion
This study provides novel evidence that specific SARS-CoV-2 nonsense-mediated decay (NMD)-associated mutations are significantly associated with reduced COVID-19 severity. Through comprehensive whole-genome sequencing and targeted analysis of nonsense mutations predicted to activate the host NMD pathway, we identified a panel of 21 NMD-inducing mutations primarily within ORF1ab and examined their correlation with clinical outcomes. Our results demonstrate that a subset of these mutations, particularly when co-occurring in specific combinations, may confer a protective effect by attenuating viral virulence.
NMD is a highly conserved post-transcriptional surveillance mechanism that degrades mRNAs harboring premature termination codons (PTCs) [36], thereby preventing the production of truncated, potentially deleterious proteins [37]. While traditionally considered a housekeeping function in eukaryotic cells, emerging research has revealed its role in host-pathogen interactions [38]. Several viruses have evolved mechanisms to evade or suppress the NMD pathway to enhance their replication [9]. Conversely, some viral mutations may inadvertently trigger NMD, potentially reducing viral fitness and pathogenesis [7, 38].
In the context of SARS-CoV-2, the role of NMD-associated mutations in shaping viral behavior remains underexplored. A study by Karousis et al.. (2021) [39]suggested that coronaviruses possess RNA structural features that may modulate susceptibility to NMD, particularly in subgenomic RNAs [40, 41]. However, this study did not focus on nonsense mutations nor link these features to clinical outcomes. Similarly, Bakhshandeh et al.. (2021) noted that the SARS-CoV-2 genome contains sequences that could interact with RNA surveillance pathways but stopped short of evaluating the phenotypic consequences of PTC-generating mutations [7, 42]. In contrast, our study explicitly investigates the presence of nonsense mutations, predicts their likelihood of inducing NMD, and directly correlates them with clinical severity, marking a clear advancement in the understanding of virus-host RNA interactions in COVID-19.
Beyond SARS-CoV-2, NMD has been implicated in other viral infections. In HIV-1, NMD is known to degrade viral transcripts with PTCs, and suppression of NMD enhances viral replication [10, 43, 44]. Hepatitis C virus (HCV) similarly interacts with the NMD pathway, where disruption of host NMD enhances virus stability and pathogenicity [45]. These findings support the broader concept that host NMD functions as a molecular barrier against viral infection [10, 45]. The current study builds on this body of literature by providing clinical-genomic evidence of this relationship in a large cohort of COVID-19 patients.
Importantly, five NMD-associated variants (Variants 5, 6, 7, 9, and 15) showed highly significant associations with mild disease, while three others demonstrated moderate or mild statistical significance. Multivariate analysis confirmed that the presence of at least one NMD-inducing variant was an independent protective factor (OR = 0.34, p = 0.005) after adjusting for age, sex, and comorbidities. Notably, neither age nor comorbidities reached conventional statistical significance, further underscoring the impact of viral genomic features on clinical outcomes.
While this study focuses on NMD-associated mutations, it is important to acknowledge that clinical effects may also be influenced by loss-of-function mutations, such as those in ORF3b and ORF8, which modulate immune responses. NMD-induced mutations may lead to the degradation of mRNAs with premature termination codons, preventing the production of truncated proteins that could impact viral replication or host interactions [46].
ROC curve analysis illustrated the limitations of using NMD mutation presence as a standalone predictor, with AUC values of 0.37 (all 21 variants) and 0.48 (only significant variants). This indicates that while such mutations correlate with severity on a population level, they do not suffice as individual predictive markers. Nevertheless, they may contribute to composite risk scores when integrated with host and clinical variables. Given the sparse, binary nature of the mutation features and the limited number of variants analyzed, the ROC curve analysis highlights the challenges of using NMD mutation presence as a reliable predictor. The relatively low AUC values suggest that while these mutations correlate with severity at the population level, they are not strong enough to serve as standalone predictive markers. However, they may still hold value when integrated into composite risk models alongside host and clinical variables.
A key novel contribution of this study lies in the analysis of synergistic effects among interdependent variants (ID ≥ 0.65). We observed that combinations of specific NMD variants conferred greater protective effects than individual mutations alone. For instance, the co-presence of Variants 5 and 7 was associated with a dramatic reduction in severe cases (from 92.9 to 4.8%, p = 0.000). Similar synergistic trends were seen with combinations involving Variants 12, 15, and 18. To our knowledge, this is the first study to explore such inter-variant interactions and their impact on disease severity in the context of SARS-CoV-2.
Another strength is the real-world clinical relevance of the cohort, drawn from 129 hospitalized patients with confirmed SARS-CoV-2 infection over a 12-month period. This provides a diverse and temporally representative dataset, improving generalizability across different phases of the pandemic. Moreover, the use of a rigorous filtering pipeline (including coverage depth ≥ 100, high-confidence variant calling, and NMD prediction based on position relative to exon-exon junctions) enhances the reliability of the mutation annotations.
The multivariable analyses demonstrate that while NMD-positive status was initially associated with a reduced risk of severe COVID-19, this relationship was attenuated after adjustment for SARS-CoV-2 lineage, with Omicron showing higher (though nonsignificant) odds of severity compared to Delta. This finding aligns with reports that variant-specific differences, particularly for Omicron may influence disease outcomes through changes in transmissibility, immune escape, and virulence profiles [47, 48]. Several prior studies have noted that host genetic variants, including those affecting mRNA processing and decay pathways, can modulate COVID-19 severity [49], but our results suggest that the apparent effect of NMD-linked mutations must be interpreted in the context of circulating viral lineages. The attenuation observed after lineage adjustment mirrors findings from genomic epidemiology studies, which have shown that viral genetic background can act as a confounder when assessing host genetic effects [50]. Together, these data emphasize that integrated host–virus analyses are necessary to disentangle the contributions of human genomic factors from variant-specific pathogenicity. Future studies with balanced representation of NMD-positive and NMD-negative individuals within each lineage are needed to confirm whether the observed protective association is consistent across SARS-CoV-2 variants or is modified by viral genomic background.
However, several limitations should be acknowledged. Functional validation of NMD induction was based on bioinformatic predictions rather than experimental data. Host genetic variation in NMD efficiency, which may impact the effects of viral mutations, was not considered. Although interdependency analysis suggests co-evolution of protective variants, the underlying mechanisms require further investigation. Future studies with broader range of mutation types, including protein-coding and non-coding variants are needed to better elucidate the genetic underpinnings of COVID-19 severity.
Conclusion
In conclusion, this study adds a new dimension to the understanding of COVID-19 pathogenesis by highlighting the protective role of NMD-triggering nonsense mutations in SARS-CoV-2. The identification of specific variants and synergistic combinations that reduce disease severity underscores the importance of viral genomic surveillance not only for tracking transmission and immune escape but also for understanding disease-modulating features. These findings may inform future research on antiviral targets and risk prediction models that incorporate viral genetic data alongside host and clinical markers. While these mutations are associated with milder outcomes, causality cannot be inferred without further experimental validation.
Supplementary Information
Acknowledgements
Not applicable.
Abbreviations
- NMD
Nonsense-mediated decay
- PTCs
Premature termination codons
- COVID-19
Coronavirus that cause pandemic COVID-19
- ROC
Receiving operating characteristics
- OR
Odd ratio
- AUC
Area under the curve
- ARDS
Acute respiratory distress syndrome
- mRNA
Messenger RNA
- RT-PCR
Real-time reverse transcription polymerase chain reaction
- WHO
World Health Organization
- NP
Nasopharyngeal
- VTM
Viral transport medium
- PPE
Personal protective equipment
- BSL-2+
Biosafety Level 2 Plus
- cDNA
Complementary DNA
- ANOVA
Analysis of variances
- CIs
Confidence intervals
- ORF1ab
Open reading frame 1ab
Authors’ contributions
All authors contributed to the study’s conception and design. W.A.H., A.A., and O.M.A. were responsible for material preparation. M.G.S., O.M.A., U.B., and A.F.G. were responsible for data collection and analysis. N.E. was responsible for writing the first draft of the manuscript. All authors commented on previous versions of the manuscript and read and approved the final manuscript.
Funding
Open access funding provided by The Science, Technology & Innovation Funding Authority (STDF) in cooperation with The Egyptian Knowledge Bank (EKB). The authors received no financial support for the research, authorship, and/or publication of this article.
Data availability
The complete genome sequences of SARS-CoV-2 generated in this study (n=129) have been deposited in the GISAID database (Global Initiative on Sharing All Influenza Data) under the accession numbers listed in **Supplementary Table 2**. These data are publicly available to support global efforts in monitoring viral evolution and facilitate ongoing scientific collaboration. The submission follows the GISAID data-sharing framework as described by Shu and McCauley in 2017 **.**.
Declarations
Ethics approval and consent to participate
The study was approved by the Research Ethics Committee at the Egypt Center for Research and Regenerative Medicine, approval No: (1/11-2021). Informed consent was obtained from all participants prior to their involvement in the study. The study was performed following all relevant ethical standards of the Declaration of Helsinki.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Mohan BS, Vinod N. COVID-19: an insight into SARS-CoV2 pandemic originated at Wuhan City in Hubei Province of China. J Infect Dis Epidemiol. 2020;6(4):146. 10.23937/2474-3658/1510146
- 2.Mallah SI, Ghorab OK, Al-Salmi S, Abdellatif OS, Tharmaratnam T, Iskandar MA, et al. COVID-19: breaking down a global health crisis. Ann Clin Microbiol Antimicrob. 2021;20(1):35. 10.1186/s12941-021-00438-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Pollard CA, Morran MP, Nestor-Kalinoski AL. The COVID-19 pandemic: a global health crisis. Physiol Genom. 2020;52(11):549–57. 10.1152/physiolgenomics.00089.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Vannabouathong C, Devji T, Ekhtiari S, Chang Y, Phillips SA, Zhu M, et al. Novel coronavirus COVID-19: current evidence and evolving strategies. J Bone Joint Surg Am. 2020;102(9):734–44. 10.2106/JBJS.20.00396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Krynytska I, Marushchak M, Birchenko I, Dovgalyuk A, Tokarskyy O. COVID-19-associated acute respiratory distress syndrome versus classic al acute respiratory distress syndrome (a narrative review). Iran J Microbiol. 2021;13(6):737–47. 10.18502/ijm.v13i6.8072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pfortmueller CA, Spinetti T, Urman RD, Luedi MM, Schefold JC. COVID-19-associated acute respiratory distress syndrome (CARDS): current knowledge on pathophysiology and ICU treatment – A narrative review. Best Pract Res Clin Anaesthesiol. 2021;35(3):351–68. 10.1016/j.bpa.2020.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wada M, Lokugamage KG, Nakagawa K, Narayanan K, Makino S. Interplay between coronavirus, a cytoplasmic RNA virus, and nonsense-m ediated mRNA decay pathway. Proc Natl Acad Sci USA. 2018;115(43):E10157–66. 10.1073/pnas.1811675115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Banday AR, Stanifer ML, Florez-Vargas O, Onabajo OO, Papenberg BW, Zahoor MA, et al. Genetic regulation of OAS1 nonsense-mediated decay underlies associati on with COVID-19 hospitalization in patients of European and African ancestries. Nat Genet. 2022;54(8):1103–16. 10.1038/s41588-022-01113-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Leon K, Ott M. An ‘arms race’ between the Nonsense-mediated mRNA decay pathway and viral infections. Semin Cell Dev Biol. 2021;111:101–7. 10.1016/j.semcdb.2020.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.van der Klugt T, van Gent M. The dynamic interactions between virus infections and nonsense-mediated decay. Hum Mol Genet. 2025. 10.1093/hmg/ddae151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ahmed MR, Du Z. Molecular interaction of Nonsense-Mediated mRNA decay with viruses. Viruses. 2023;15(4):816. 10.3390/v15040816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Mallick M, Boehm V, Xue G, Blackstone M, Gehring Niels H, Chakrabarti S. Modulation of UPF1 catalytic activity upon interaction of SARS-CoV-2 nucleocapsid protein with factors involved in nonsense mediated-mRNA decay. Nucleic Acids Res. 2024;52(21):13325–39. 10.1093/nar/gkae829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.May JP, Yuan X, Sawicki E, Simon AE. RNA virus evasion of nonsense-mediated decay. PLoS Pathog. 2018;14(11):e1007459. 10.1371/journal.ppat.1007459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Colombo M, Karousis ED, Bourquin J, Bruggmann R, Mühlemann O. Transcriptome-wide identification of NMD-targeted human mRNAs reveals extensive redundancy between SMG6- and SMG7-mediated degradation pathways. RNA. 2017;23(2):189–201. 10.1261/rna.059055.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Nickless A, Bailis JM, You Z. Control of gene expression through the nonsense-mediated RNA decay pathway. Cell Biosci. 2017;7:26. 10.1186/s13578-017-0153-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Thorne LG, Bouhaddou M, Reuschl A-K, Zuliani-Alvarez L, Polacco B, Pelin A, et al. Evolution of enhanced innate immune evasion by SARS-CoV-2. Nature. 2022;602(7897):487–95. 10.1038/s41586-021-04352-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kishor A, Fritz SE, Hogg JR. Nonsense-mediated mRNA decay: the challenge of telling right from wrong in a complex transcriptome. WIREs RNA. 2019;10(6):e1548. 10.1002/wrna.1548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Banoun H. Evolution of SARS-CoV-2: review of mutations, role of the host immune system. Nephron. 2021;145(4):392–403. 10.1159/000515417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Singh J, Pandit P, McArthur AG, Banerjee A, Mossman K. Evolutionary trajectory of SARS-CoV-2 and emerging variants. Virol J. 2021;18(1):166. 10.1186/s12985-021-01633-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Khan S, Yahiro T, Kimitsuki K, Hashimoto T, Matsuura K, Yano S, et al. Exploring the replication and pathogenic characteristics of alpha, delta, and Omicron variants of SARS-CoV-2. Int J Mol Sci. 2024;25(23):12641. 10.3390/ijms252312641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Perrin-Vidoz L. The nonsense-mediated mRNA decay pathway triggers degradation of most BRCA1 mRNAs bearing premature termination codons. Hum Mol Genet. 2002;11(23):2805–14. 10.1093/hmg/11.23.2805. [DOI] [PubMed] [Google Scholar]
- 22.Inglis AJ, Guna A, Gálvez-Merchán Á, Pal A, Esantsi TK, Keys HR, et al. Coupled protein quality control during nonsense-mediated mRNA decay. J Cell Sci. 2023;136(10):jcs261216. 10.1242/jcs.261216 [DOI] [PMC free article] [PubMed]
- 23.Pawlicka K, Kalathiya U, Alfaro J. Nonsense-Mediated mRNA decay: pathologies and the potential for novel therapeutics. Cancers. 2020;12(3):765. 10.3390/cancers12030765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Mehta P, Chattopadhyay P, Ravi V, Tarai B, Budhiraja S, Pandey R. SARS-CoV-2 infection severity and mortality is modulated by repeat-mediated regulation of alternative splicing. Microbiol Spectr. 2023;11(5):e0135123. 10.1128/spectrum.01351-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sun L, Mailliot J, Schaffitzel C. Nonsense-Mediated mRNA decay factor functions in human health and disease. Biomedicines. 2023;11(3):722. 10.3390/biomedicines11030722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Theunissen MTM, van den Elsen RM, House TL, Crittenden B, van Doorn PA, van der Ploeg AT, et al. The impact of COVID-19 infection, the pandemic and its associated control measures on patients with Pompe disease. J Neurol. 2024;271(1):32–45. 10.1007/s00415-023-11999-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Peterson DR, Baran AM, Bhattacharya S, Branche AR, Croft DP, Corbett AM, et al. Gene expression risk scores for COVID-19 illness severity. J Infect Dis. 2023;227(3):322–31. 10.1093/infdis/jiab568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hidayat MT, Prasetyo EN. Modified spin Column-Based RNA extraction methods of Staphylococcus Au reus using PureLink® RNA mini kit and basic laboratory instrument. IJMLST. 2021;3(2):73–80. 10.33086/ijmlst.v3i2.1863. [Google Scholar]
- 29.Ntagereka PB, Oyola SO, Baenyi SP, Rono GK, Birindwa AB, Shukuru DW, et al. Whole-genome sequencing of SARS-CoV-2 reveals diverse mutations in Circulating alpha and delta variants during the first, second, and third waves of COVID-19 in South kivu, East of the Democratic Republic of the congo. Int J Infect Dis. 2022;122:136–43. 10.1016/j.ijid.2022.05.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Leggett RM, Ramirez-Gonzalez RH, Clavijo BJ, Waite D, Davey RP. Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics. Front Genet. 2013;4:288. 10.3389/fgene.2013.00288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Guo L, Huo H. An efficient Burrows–Wheeler transform-based aligner for short read mapping. Comput Biol Chem. 2024;110:108050. 10.1016/j.compbiolchem.2024.108050. [DOI] [PubMed] [Google Scholar]
- 32.Bassano I, Ramachandran VK, Khalifa MS, Lilley CJ, Brown MR, van Aerle R, et al. Evaluation of variant calling algorithms for wastewater-based epidemiology using mixed populations of SARS-CoV-2 variants in synthetic and wastewater samples. Microb Genomics. 2023;9(4). 10.1099/mgen.0.000933. [DOI] [PMC free article] [PubMed]
- 33.Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, del Angel G, Levy-Moonshine A, et al. From FastQ data to High‐Confidence variant calls: the genome analysis toolkit best practices pipeline. CP Bioinf. 2013;43(1):1101–033. 10.1002/0471250953.bi1110s43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Iqbal N, Rafiq M, Masooma, Tareen S, Ahmad M, Nawaz F, et al. The SARS-CoV-2 differential genomic adaptation in response to varying UVindex reveals potential genomic resources for better COVID-19 diagnosis and prevention. Front Microbiol. 2022;13:922393. 10.3389/fmicb.2022.922393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Jung A, Droit L, Febles B, Fronick C, Cook L, Handley SA, et al. Tracking the prevalence and emergence of SARS-CoV-2 variants of concern using a regional genomic surveillance program. Microbiol Spectr. 2024;12(8):e0422523. 10.1128/spectrum.04225-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lejeune F. Nonsense-mediated mRNA decay at the crossroads of many cellular pathways. BMB Rep. 2017;50(4):175–. 10.5483/bmbrep.2017.50.4.015. 85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yi Z, Sanjeev M, Singh G. The branched nature of the Nonsense-Mediated mRNA decay pathway. Trends Genet. 2021;37(2):143–. 10.1016/j.tig.2020.08.010. 59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Balistreri G, Bognanni C, Mühlemann O. Virus escape and manipulation of cellular Nonsense-Mediated mRNA decay. Viruses. 2017;9(1):24. 10.3390/v9010024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Karousis ED, Gypas F, Zavolan M, Mühlemann O. Nanopore sequencing reveals endogenous NMD-targeted isoforms in human cells. Genome Biol. 2021;22(1):223. 10.1186/s13059-021-02439-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Long S. SARS-CoV-2 subgenomic rnas: characterization, utility, and perspective s. Viruses. 2021;13(10):1923. 10.3390/v13101923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Malone B, Urakova N, Snijder EJ, Campbell EA. Structures and functions of coronavirus replication–transcription complexes and their relevance for SARS-CoV-2 drug design. Nat Rev Mol Cell Biol. 2022;23(1):21–39. 10.1038/s41580-021-00432-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bakhshandeh B, Jahanafrooz Z, Abbasi A, Goli MB, Sadeghi M, Mottaqi MS, et al. Mutations in SARS-CoV-2; consequences in structure, function, and pathogenicity of the virus. Microb Pathog. 2021;154:104831. 10.1016/j.micpath.2021.104831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nakano K, Ando T, Yamagishi M, Yokoyama K, Ishida T, Ohsugi T, et al. Viral interference with host mRNA surveillance, the nonsense-mediated mRNA decay (NMD) pathway, through a new function of HTLV-1 rex: implications for retroviral replication. Microbes Infect. 2013;15(6–7):491–505. 10.1016/j.micinf.2013.03.006. [DOI] [PubMed] [Google Scholar]
- 44.Lorgeoux R-P, Guo F, Liang C. From promoting to inhibiting: diverse roles of helicases in HIV-1 replication. Retrovirology. 2012;9(1):79. 10.1186/1742-4690-9-79. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ramage HR, Kumar GR, Verschueren E, Johnson JR, Von Dollen J, Johnson T, et al. A combined proteomics/genomics approach links hepatitis C virus infection with nonsense-mediated mRNA decay. Mol Cell. 2015;57(2):329–40. 10.1016/j.molcel.2014.12.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Lam J-Y, Yuen C-K, Ip JD, Wong W-M, To KK-W, Yuen K-Y, et al. Loss of orf3b in the Circulating SARS-CoV-2 strains. Emerg Microbes Infections. 2020;9(1):2685–96. 10.1080/22221751.2020.1852892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chatterjee S, Bhattacharya M, Nag S, Dhama K, Chakraborty C. A detailed overview of SARS-CoV-2 omicron: its Sub-Variants, mutations and pathophysiology, clinical characteristics, immunological landscape, immune escape, and therapies. Viruses. 2023;15(1):157. 10.3390/v15010167 [DOI] [PMC free article] [PubMed]
- 48.Zabidi NZ, Liew HL, Farouk IA, Puniyamurti A, Yip AJW, Wijesinghe VN, et al. Evolution of SARS-CoV-2 variants: implications on immune escape, vaccination, therapeutic and diagnostic strategies. Viruses. 2023;15(4):944. 10.3390/v15040944 [DOI] [PMC free article] [PubMed]
- 49.Singh MS, Pyati A, Rubi RD, Subramanian R, Muley VY, Ansari MA, et al. Systems-wide view of host-pathogen interactions across COVID-19 severities using integrated omics analysis. iScience. 2024;27(3):109087. 10.1016/j.isci.2024.109087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cruz Cisneros MC, Anderson EJ, Hampton BK, Parotti B, Sarkar S, Taft-Benz S, et al. Host genetic variation impacts SARS-CoV-2 vaccination response in the diversity. Outbred Mouse Popul. 2024;12(1):103. 10.3390/vaccines12010103 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The complete genome sequences of SARS-CoV-2 generated in this study (n=129) have been deposited in the GISAID database (Global Initiative on Sharing All Influenza Data) under the accession numbers listed in **Supplementary Table 2**. These data are publicly available to support global efforts in monitoring viral evolution and facilitate ongoing scientific collaboration. The submission follows the GISAID data-sharing framework as described by Shu and McCauley in 2017 **.**.



