Abstract
Mycobacterium tuberculosis (Mtb) adhesin proteins are promising candidates for subunit vaccine design. Multi-epitope Mtb vaccine and diagnostic candidates were designed using immunoinformatic tools. The antigenic potential of 26 adhesin proteins were determined using VaxiJen 2.0. The truncated heat shock protein 70 (tnHSP70), 19 kDa antigen lipoprotein (lpqH), Mtb curli pili (MTP), and Phosphate transport protein S1 (PstS1) were selected based on the number of known epitopes on the Immune Epitope Database (IEDB). B- and T-cell epitopes were identified using BepiPred2.0, ABCpred, SVMTriP, and IEDB, respectively. Population coverage was analysed using prominent South African specific alleles on the IEDB. The allergenicity, physicochemical characteristics and tertiary structure of the tri-fusion proteins were determined. The in silico immune simulation was performed using C-ImmSim. Three truncated sequences, with predicted B and T cell epitopes, and without allergenicity or signal peptides were linked by three glycine-serine residues, resulting in the stable, hydrophilic molecules, tnlpqH-tnPstS1-tnHSP70 (64,86 kDa) and tnMTP-tnPstS1-tnHSP70 (63,96 kDa). Restriction endonuclease recognition sequences incorporated at the N- and C-terminal ends of each construct, facilitated virtual cloning using Snapgene, into pGEX6P-1, resulting in novel, highly immunogenic vaccine candidates (0,912–0,985). Future studies will involve the cloning, recombinant protein expression and purification of these constructs for downstream applications.
Keywords: Tuberculosis, Mycobacterium tuberculosis, Multi-epitope vaccine, Adhesin proteins, Immunoinformatics
Highlights
Article Highlights:
-
•
Mycobacterium tuberculosis (Mtb) adhesin proteins are lucrative targets as TB diagnostic biomarkers as well as preventative vaccine candidates.
-
•
The fusion of multiple epitope regions from two or more adhesin proteins can improve the immunogenicity of the candidate, triggering a heightened response or improve the serodiagnostic potential.
-
•
Immunoinformatic tools were utilized to select epitope regions from three Mtb adhesin proteins: 19-kDa lipoprotein (lpqH), Phosphate transport Phosphate transport protein S1 (PstS1) and Mycobacterium tuberculosis curli pili (MTP).
-
•
The C-terminal region of the heat shock protein 70 (HSP70) was incorporated into the design of a trifusion protein construct using the above adhesin proteins since it possesses adjuvant potential and epitope regions.
-
•
Two novel trifusion protein constructs were generated, tnLPH and tnMPH, which contained truncated regions of lpqH, PstS1 and HSP70; and MTP, PstS1 and HSP70, respectively.
-
•
Both trifusion constructs were non allergens that were hydrophilic in nature and thermostable with a good population coverage both globally and within South Africa.
-
•
The predicted structures were of good quality. In silico immune simulation revealed that both constructs elicited both an adaptive and innate immune response.
1. Introduction
Despite the past coronavirus pandemic, tuberculosis (TB) remains a global epidemic [1]. An estimated 10,6 million people were infected with TB in the year 2021 whilst a further ±1,5 million people lost their lives to this dreaded disease [2]. Africa accounts for 23 % of the global TB infections, with South Africa listed as a high-burden TB country. Despite the availability of an array of first- and second-line anti-TB drugs, the escalating drug resistance and the lack of rapid, accurate point of care diagnostic tests and effective vaccines have resulted in high disease burden [3,4].
Although TB has been present for centuries, only one vaccine, the Bacillus-Calmette Guerin (BCG) vaccine, which was initially administered in 1921, remains approved for the prevention of this disease [5]. The BCG is an attenuated derivative of Mycobacterium bovis, in which the RD1 region was deleted [6]. The use of BCG in developing countries with a high prevalence of TB is strongly encouraged, despite its short-lived protection. BCG provides protection against adolescent, but not adult, pulmonary TB (pTB), as exposure to environmental mycobacteria renders the vaccine ineffective in the latter. Since inception, six strains of the BCG vaccine with phenotypic variations have been produced, that could potentially account for the variable protection observed in some population groups [7].
The temporary protection offered by the BCG and the potential risk of it becoming virulent, has created the need for a new and improved TB vaccine candidate that is effective on all age groups and populations [8]. Three approaches can be taken during vaccine development [[9], [10], [11], [12]]: 1) an improvement of the current version of BCG vaccine which is safe and long-lasting; 2) a prime-boost strategy wherein the viral vectored or protein adjuvant is administered at a later stage as a booster dose; 3) immunotherapeutic vaccines which reduce the duration of TB therapy. The WHO prioritized the new vaccine candidates based on two strategies: 1) safe, effective, and affordable TB vaccine for adolescents and adults; 2) a vaccine for neonates and infants with improved safety and efficacy in comparison to the BCG [13]. Presently, the vaccine pipeline has approximately 29 new TB vaccine candidates in various phases of trials. Only six of these are in phase III, including three live attenuated mycobacterial vaccines: BCG travel vaccine MTBVAC and the VPM1002; one protein adjuvant vaccine: GamTBvac; and one inactivated mycobacterial vaccine: Immunvac (MIP) [[14], [15], [16]]. These trials attempt to measure either the prevention of infection (POI) against Mtb, or prevention of acquiring TB disease (POD), or prevention of recurrent (POR) TB disease. Vaccine candidates can be divided into whole cell vaccines and subunit vaccines [11]. Whole cell vaccines include live mycobacterial vaccines derived from live attenuated Mtb strains, M. bovis BCG or recombinant BCG and killed mycobacterial vaccines that could be formulated from other saprophytic mycobacterial species or Mtb [17]. Subunit vaccines contain Mtb antigens expressed as recombinant proteins formulated with different adjuvants [18]. These proteins are expressed by recombinant vectors that are used as vehicles for the administration of antigens. There are currently 9 subunit vaccine candidates with only one (GamTBvac) in the final phase.
Mycobacterium tuberculosis has distinct phases of growth which may be associated with active mycobacterial replication, persistence and dormancy [19]. Biomarkers are measurable indicators of the state of disease progression. Adhesin proteins (Table 1) display biomarker potential as they may be associated with active bacterial replication [[20], [21], [22]]. Subunit vaccines comprising Mtb proteins are lucrative candidates for vaccine design [23,24]. Adhesin proteins such as those belonging to the ESX-1 secretion system (Esx) and antigen 85 (Ag85) family, culture filtrate protein 10 (CFP-10) and heparin-binding hemagglutinin adhesin (HBHA) are highly immunogenic and have previously displayed protection in animal models [25,26]. Protection against TB requires a cell-mediated immune response which involves multiple components such as cluster of differentiation 4 and 8 (CD4+ and CD8+) T cells [27]. A multi-stage TB vaccine which comprises a fusion of adhesin proteins which are known to elicit an immune response during the various stages of TB is crucial to the control of the disease. Therefore, the selection of the adhesin protein within the fusion construct is critical. Adhesin proteins involved in the generation of an immune response during the early stages of TB infection are pivotal for the prevention and establishment of disease and should therefore be highly favoured in the design of a fusion construct [28].
Table 1.
Adhesin | Rv Number | Host Receptor | Reference |
---|---|---|---|
Alanine proline-rich antigen (Apa) | Rv1860 | Macrophages | [29] |
19-kDa Antigen (lpqH) | Rv3763 | Monocytes and macrophages | [30] |
Antigen 85 complex (Ag85) | Rv0129c, Rv1886c, | Fibronectin and elastin | [31] |
Rv3803c, and Rv3804c | |||
Cpn60.2 molecular chaperone (Cpn60.2) | Rv0440 | CD43 on Macrophages | [32] |
Curli pili (MTP) | Rv3312A | Laminin, macrophages and epithelial cells | [33] |
Early secreted antigen (ESAT-6) | Rv3874/Rv3875 | Laminin | [34] |
Glutamine synthetase A1 | Rv2220 | Fibronectin | [35] |
Glyceraldehyde-3phosphate dehydrogenase (GAPDH) | Rv1436 | Fibronectin | [36] |
Heparin-binding hemagglutinin adhesin (HBHA) | Rv0475 | Epithelial cells | [37] |
L,D-transpeptidase | Rv0309 | Fibronectin and laminin | [38] |
Malate synthase (MS) | Rv1837c | Fibronectin, laminin, and epithelial cells | [39] |
Membrane protein | Rv2599 | Collagen, fibronectin and laminin | [38] |
Molecular chaperone DnaK (HSP70) | Rv0350 | Macrophages | [32] |
Mycobacterium cell entry-1 protein | Rv0169 | Epithelial cells | [40] |
N-acetylmuromoyl-L-alanine amidase | Rv3717 | Fibronectin and laminin | [38] |
PE-PGRS proteins | Rv1759c, Rv1818c | Fibronectin and macrophages | [41] |
PstS-1 (38-kDa antigen) | Rv0934 | Macrophages | [30] |
Type IV pili | Rv3654c-Rv3660c | Macrophage and epithelial cells | [42] |
2. Methodology
2.1. Mycobacterium tuberculosis adhesin protein sequences
A list of 26 Mtb adhesin proteins (Table 1) were identified in a review by Ramsugit et al. (2016) [22]. All information pertaining to the adhesin proteins were obtained from the Tuberculist GenoList Pasteur Database (http://genolist.pasteur.fr/TubercuList/). This includes protein and DNA sequence, gene and protein size, location, Rv number and function.
2.2. Determination of vaccine potential of adhesin proteins
The protein sequences of each of the adhesins were input onto the VaxiJen 2.0 online tool (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) [43]. This tool provides the antigenic potential of each sequence which is identified as a probable antigen if the bacterial threshold of 0.4 is exceeded. This tool was used to rank the 26 Mtb adhesin proteins.
2.3. Identification of the number of epitopes on adhesin proteins
The top ten ranked adhesin proteins mostly likely to generate an immune response were analysed on the Immune Epitope Database (IEDB) (https://www.iedb.org), an online tool used to enumerate and identify the epitope regions present on the protein [44]. The following filters were applied: organism: Mycobacterium tuberculosis (ID:1773); Antigen name; include Positive assays; Host: Homo sapiens (human); Disease data: infectious disease.
A literature search was conducted on adhesin proteins which displayed multiple epitope regions as identified by the IEDB, to aid in the selection of the proteins for the fusion molecule. Adhesins which induced cytokine and antibody responses detected in patient sera, were selected for further analysis. These included Mycobacterium tuberculosis curli pili (MTP); Phosphate-specific transporter protein PstS 1 (PstS1); 19 kDa Lipoprotein (lpqH) and Heat-shock protein 70 (HSP70). For the purposes of this study, the truncated HSP70 protein (tnHSP70), amino acids 359–610, was used for further analysis owing to its adjuvant properties.
2.4. Prediction of B-cell epitopes on protein sequences
B-cells are involved in the neutralization of pathogenic molecules using specific secreted or surfaced expressed receptors. Several online tools were explored to identify B-cell epitopes within the protein sequences of the selected adhesins. These include BepiPred2.0 (https://services.healthtech.dtu.dk/service.php?BepiPred-2.0), ABCpred server (https://webs.iiitd.edu.in/raghava/abcpred/ABC_submission.html) and SVMTriP (http://sysbio.unl.edu/SVMTriP/prediction.php). BepiPred2.0 uses a random forest algorithm trained on new epitopes annotated from antibody-antigen protein structures [45]. The ABCpred server is based on recurrent machine-based techniques using fixed length patterns to predict B-cell epitopes on an antigenic sequence [46]. The SVMTriP tool employs super vector machine (SVM) in combination with tri-peptide similarity and propensity scores (SVMTriP) to better predict linear epitope regions [47].
2.5. Prediction of signal peptides within protein sequence
The SignalP 6.0 server (https://services.healthtech.dtu.dk/service.php?SignalP) was used to identify the presence of signal peptides on the protein sequences [48]. The server is able to predict the five types of signal peptides and their cleavage locations.
2.6. Prediction of T-cell epitopes
The Mtb adhesin proteins, tnHSP70, lpqH, MTP and PstS1 protein sequences lacking the signal peptide sequence were subjected to the MHCI and MHCII epitope prediction using the IEDB (http://tools.iedb.org/mhci/and http://tools.iedb.org/mhcii/) [49]. For the prediction of the MHC I binding sites, 54 HLA-A, HLA-B and HLA-C reference alleles were selected [50]. For the prediction of the MHC II binding sites, the reference set of 7 HLA-DR alleles (HLA-DRB1∗03:01, HLA-DRB1∗07:01, HLA-DRB1∗15:01, HLA-DRB3∗01:01, HLA-DRB3∗02:02, HLA-DRB4∗01:01, HLA-DRB5∗03:01) with a default length of 15-mer were selected [51,52].
2.7. Population coverage for MCH class I and II alleles
The population coverage for MHC I and MHC II Mtb adhesin protein epitopes were determined using the IEDB analysis resource tool (http://tools.iedb.org/population/) [49]. A combination of MHCI and MHCII alleles which are prominent to the South African population were selected [53]. These include HLA-A∗30:01, HLA-A∗02:01, HLA-A∗30:01, HLA-B∗42:01, HLA-B∗58:01, HLA-B∗58:02, HLA-B∗15:01, HLA-B∗58:01, HLA-B∗58:02, HLA-C∗06:02, HLA-DPB1∗13:01, HLA-DQB1∗06:01, HLA-DRB1∗11:01, HLA-DRB1∗13:01, HLA-A∗01:01, HLA-A∗02:01, HLA-B∗07:02, HLA-C∗07:01, HLA-C∗07:02, HLA-DRB1∗03:01.
2.8. Prediction of allergenicity of protein sequences
The potential of the selected proteins to cause an allergic response was determined using AlgPred (https://webs.iiitd.edu.in/raghava/algpred/submission.html) [54] and AllerTop Pro 2.0 https://www.ddg-pharmfac.net/AllerTOP/feedback.py) [55].
2.9. Prediction of secondary and tertiary structure, and physicochemical characteristics of fusion proteins
The ProtParam server was used to predict the physicochemical characteristics of the tri-fusion proteins (https://web.expasy.org/cgi-bin/protparam/protparam) [56]. The parameters evaluated included the number of amino acids, molecular weight, theoretical isoelectric point (pI), extinction coefficient, estimated half-life in vitro and in vivo, instability index, aliphatic index and Grand average of hydropathicity (GRAVY).
The D-I-TASSER (Deep learning-based Iterative Threading ASSEmbly Refinement) online tool (https://zhanggroup.org/D-I-TASSER/), an extension of the I-TASSER, was used to predict the tertiary structure and function of the tri-fusion protein based on the submission of the amino acid sequence whilst utilizing deep convolutional neural-network based distance-map predictions guiding the fragment assembly simulations of the I-TASSER [57,58]. In addition, the server provides an estimated accuracy of the predicted models including TM-score. The tool also predicts the secondary structure, solvent accessibility, enzyme classification and ligand-binding sites.
2.10. Refinement and validation of predicted tertiary structures
The first model of each trifusion protein construct predicted by the D-I-TASSER server was subjected to refinement using the web-based Galaxy Refine server (https://galaxy.seoklab.org) [59]. The first refined model generated was validated on the SAVESv6.0 server (https://saves.mbi.ucla.edu/) using the PROCHECK, ERRAT and Ramachandran verification tools.
2.11. In silico immune simulation
To characterize the immune response induced by the tri-fusion protein vaccine candidates, in silico immune simulation was performed using the C-ImmSim server (https://kraken.iac.rm.cnr.it/C-IMMSIM/) [60]. The server predicts immune interactions using position-specific scoring matrices derived from machine learning techniques for peptide prediction. It concurrently simulates three compartments representing three separate anatomical regions found in mammals: (i) the bone marrow, where hematopoietic stem cells were simulated to produce new lymphocytes and myeloid cells; (ii) the thymus, where naive T cells were selected to avoid autoimmunity; and (iii) the lymphatic organ such as lymph nodes. Generally, 2–3 doses within four week interval is recommended for most vaccines. Hence, three doses of 1000 vaccine units without lipopolysaccharide (LPS) were tested over four weeks in the immune simulation of these candidates. The parameters were as follows: three injections at time-step 1, 84, and 168, respectively; random seed: 12345; simulation volume of 10 and 1000 simulation steps [61].
2.12. In silico cloning of the tri-fusion proteins
Sequences were designed to incorporate BamH1 and Xho1 and EcoR1 and Xho1 restriction endonuclease recognition sequences at the N- and C-terminal ends of tri-fusion constructs tnlpqH-tnPstS1-tnHSP70 and tnMTP-tnPstS1-tnHSP70, respectively. This enables cloning into the pGEX6P-1 vector via the BamH1 and Xho1 and EcoR1 and Xho1 restriction sites using the Snapgene software tool.
3. Results
3.1. Determination of vaccine potential and the identification of number of epitopes on adhesin proteins
The known Mtb adhesin proteins were ranked according to the vaccine potential using the VaxiJen online tool (Table 2). The VaxiJen tool identified any sequence that exceeds the threshold of 0.4 as a probable antigen. Of the 26 adhesin proteins (0,3676–2,0053), the top ten (0,5914–2,0053) were selected for further evaluation. The IEDB was used to identify the presence of epitope regions within the top ten adhesin protein sequences. Only four adhesin proteins: 19 kDa antigen lipoprotein (lpqH), Phosphate transport protein S 1 (PstS1), Heat-shock protein 70 (dnaK/HSP70) and the heparin-binding hemagglutinin (HBHA) adhesin protein, returned results with identified epitope regions. The lpqH, PstS1 and HSP70 adhesin proteins were selected for further analysis. The MTP adhesin protein which did not display any identified epitopes on the IEDB, but was selected since previous studies [[42], [62], [63]] on this protein displayed promising results as a diagnostic biomarker and was detected in patient serum samples indicating that it was capable of inducing an immune response. The C-terminal region of the HSP70 possesses natural adjuvant properties [[64], [65], [66], [67]]. In addition, heat shock proteins such as HSP70 act as molecular chaperones aiding in the protein folding process which could prove beneficial when fused to other proteins [68]. Therefore, the tnHSP70 was utilized in the construction of the tri-fusion proteins in this study.
Table 2.
Rank | Adhesin Protein | Accession No. | Probability of Antigen (VaxiJen) | No. of epitopes (IEDB) |
---|---|---|---|---|
1 | Wag22_ PE_PGRS | Rv1759c | 2,0053 | No data |
2 | PE-PGRS33 | Rv1818c | 1,6972 | No data |
3 | 19 kDa Antigen | Rv3763 | 1,15 | 14 |
4 | mtp | Rv3312A | 0,7609 | No data |
5 | PstS-1 | Rv0934 | 0,7312 | 48 |
6 | DnaK/Hsp70 | Rv0350 | 0,6417 | 69 |
7 | N-acetylmuromoyl-L-alanine amidase | Rv3717 | 0,6327 | No data |
8 | HupB | Rv2986c | 0,6089 | No data |
9 | HBHA | Rv0475 | 0,5986 | 3 |
10 | Type IV pili | Rv3655c | 0,5914 | No data |
11 | Membrane protein | Rv2599 | 0,5883 | – |
12 | Ag85B | Rv1886c | 0,5842 | – |
13 | Type IV pili | Rv3654c | 0,5748 | – |
14 | MCE1A | Rv0169 | 0,5666 | – |
15 | Protein Kinase D | Rv0931c | 0,5634 | – |
16 | ESAT-6 | Rv3875 | 0,5577 | – |
17 | Cpn60,2 | Rv0440 | 0,5421 | – |
18 | Ag85A | Rv3804c | 0,5259 | – |
19 | Apa | Rv1860 | 0,5233 | – |
20 | GlnA1 | Rv2220 | 0,5087 | – |
21 | L,D Transpeptidase | Rv0309 | 0,5044 | – |
22 | Ag85C | Rv0129c | 0,4795 | – |
23 | MPT51/Ag85C | Rv3803c | 0,4515 | – |
24 | GAPDH | Rv1436 | 0,4123 | – |
25 | Type IV pili | Rv3656c | 0,3738 | – |
26 | Malate Synthase | Rv1837c | 0,3676 | – |
N.B: (-)Denotes that this adhesin protein was not investigated for this study and therefore data (if available) was not obtained.
3.2. Prediction of B-cell epitopes on protein sequences
The sequences of the four selected adhesin proteins; tnHSP70, lpqH, MTP and PstS1 were further investigated for the presence of other B-cell epitope regions using online immunoinformatic tools. Fig. S1 displays the epitope regions identified for each of the adhesin proteins using the BepiPred 2.0 Tool. Any amino acid sequence exceeding the threshold of 0.5 was considered an epitope. The structural predictions of these epitope regions including their accessibility are noted based on the intensity of the gradients. Most of the epitopes on these proteins can be located on the coil and are exposed. The tnHSP70 adhesin protein contains three regions of continuous B-cell epitopes which are predominantly located on the coil region and are mostly buried. The lpqH protein contains five regions of continuous B-cell epitopes which are predominantly exposed on the coil, whilst few epitopes are located on the sheet and are buried. The MTP protein contains three regions of continuous B-cell epitopes, all of which are located on the coil and are either exposed or buried. Approximately 10 continuous B-cell epitope regions are located on the PstS1 protein with the majority being located on the coil where they are exposed, whilst other epitopes can be found on both the helix and sheet and are either buried or exposed.
The adhesin proteins were also subjected to other immunoinformatic tools such as the SvM TRiP and ABCpred to further identify B-cell epitopes which are displayed in Table 3. In addition to new epitope sequences, common regions were located among each of the sequences with the different tools used. The epitope regions identified by these tools were selected for the design of the tri-fusion protein constructs for use as a multi-epitope vaccine candidate and diagnostic biomarker. A continuous sequence encompassing all the epitopes was selected for ease of cloning rather than random epitope regions (Table S1). Two tri-fusion protein constructs were designed to contain truncated (tn) regions of each protein; tnlpqH-tnPstS1-tnHSP70 (tnLPH) and tnMTP-tnPstS1-tnHSP70 (tnMPH) (Fig. 1).
Table 3.
B Cell Epitope Prediction Tool | Predicted Epitope Sequence for each Protein and Score (if available) |
|||
---|---|---|---|---|
tnHSP70 | lpqH | MTP | PstS1 | |
BepiPred | EVKDVLLLDVTPLSLGIETKGGVMTRLIERNTTIPTKRSETFTTADDNQPSVQIQVYQGEREIAAHNKLLGSF | SSNKSTTGSGETTTAAGTTASPGAASG | AAQTAPVPDY | GASDAYLSEGDMAAHK |
GKDQNVT | AWGPNWDPYTCHDDFHRDSDGPDHSRDYPGP | MNIALAISAQQVNYNLPGVSEHLKLNGKVLAA | ||
DGNPPEVKSVGLGNVN | GPVLDDPGAAPPPPAAG | IKTWDDPQIAALNPGVNL | ||
QIEVTFDIDANGIVHVTAKDKGTGKENTIRIQEGSGLSKEDIDRMIKDAEAHAEEDRKRREEADVRNQAETLVYQ | YTSGTGQGNASATK | SKQDPEGWGKSPGFGTTVDFPAVPGALGENGNGG | ||
GVDMANPMSPVNKS | QASQRGL | |||
SSGNFLLP | ||||
KEQREAEGGSKVPEDTLNKVDAAVAEAKAALGGSDISAIKSAMEKLGQESQALGQAIYEAAQAASQATGAAHPGGEPGGAHP | FASKTPANQAISMIDGPAPD | |||
NRQKDAATAQ | ||||
SFLD | ||||
HFQP | ||||
ABCpred | EAHAEEDRKRREEADV (0.91) | VTGSVVCTTAAGNVNAIGGA (0.83) | YYWCPGQPFDPAWGP (0.94) | DFPAVPGALGENGNGGMVTG (0.86) |
RGIPQIEVTFDIDANG (0.90) | TTASPGAASGPKVVIDGKDQ (0.82) | QSAAQTAPVPDYYWCP (0.93) | LHRSDGSGDTFLFTQYLSKQ (0.85) | |
TKRSETFTTADDNQPS (0.86) | TGIAAVLTDGNPPEVKSVGL (0.81) | HRDSDGPDHSRDYPGP (0.91) | LSEGDMAAHKGLMNIALAIS (0.82) | |
AHPGGEPGGAHPGSAD(0.86) | GSGETTTAAGTTASPGAASG (0.80) | GPILEGPVLDDPGAAP (0.87) | IINYEYAIVNNRQKDAATAQ (0.81) | |
KGGVMTRLIERNTTIP (0.86) | KVVIDGKDQNVTGSVVCTTA(0.80) | AACILATGVAGLGVGA (0.85) | LGNSSGNFLLPDAQSIQAAA (0.81) | |
DRMIKDAEAHAEEDRK (0.86) | TLGYTSGTGQGNASATKDGS (0.79) | GPNWDPYTCHDDFHRD (0.83) | YIGISFLDQASQRGLGEAQL (0.81) | |
GLSKEDIDRMIKDAE (0.83) | TGQGNASATKDGSHYKITGT (0,76) | DDPGAAPPPPAAGGGA (0.82) | NGNGGMVTGCAETPGCVAYI (0.81) | |
KVPEDTLNKVDAAVAE (0.82) | AAGNVNAIGGAATGIAAVLT (0,74) | PDHSRDYPGPILEGPV (0.82) | EHLKLNGKVLAAMYQGTIKT (0.80) | |
GNPPEVKSVGLGNVNGVTLG (0.55) | TGVAGLGVGAQSAAQT (0.73) | GCAETPGCVAYIGISFLDQA (0.79) | ||
ABCpred | FELTGIPPAPRGIPQI (0.80) | RFACRTLMLAACILAT (0.54) | QIAALNPGVNLPGTAVVPLH (0.78) | |
EAEGGSKVPEDTLNKV (0.80) | NYNLPGVSEHLKLNGKVLAA (0.78) | |||
IERNTTIPTKRSETFT (0.79) | LHWAITDGNKASFLDQVHFQ (0.78) | |||
GQAIYEAAQAASQATG (0.79) | AGFASKTPANQAISMIDGPA (0.77) | |||
GQESQALGQAIYEAAQ (0.78) | LPGTAVVPLHRSDGSGDTFL (0.76) | |||
SLGIETKGGVMTRLIE (0.78) | GTIKTWDDPQIAALNPGVNL (0.76) | |||
TFDIDANGIVHVTAKD (0.77) | FLFTQYLSKQDPEGWGKSPG (0.74) | |||
HVTAKDKGTGKENTIR (0.77) | VNNRQKDAATAQTLQAFLHW (0.72) | |||
QAASQATGAAHPGGEP (0.75) | PDAQSIQAAAAGFASKTPAN (0.72) | |||
GSDISAIKSAMEKLGQ (0.74) | QASQRGLGEAQLGNSSGNFL (0.72) | |||
RKRREEADVRNQAETL (0.71) | DGPAPDGYPIINYEYAIVNN (0.68) | |||
SVQIQVYQGEREIAAH (0.69) | LMNIALAISAQQVNYNLPGV (0.67) | |||
SAMEKLGQESQALGQA (0.68) | QDPEGWGKSPGFGTTVDFPA (0.66) | |||
FTTADDNQPSVQIQVY (0.67) | KVLAAMYQGTIKTWDDPQIA (0.61) | |||
ETLVYQTEKFVKEQRE (0.65) | ||||
YQGEREIAAHNKLLGS (0.63) | ||||
SVM TRiP | HAEEDRKRREEADVRNQAET (1.000) | SGETTTAAGTTASPGAASGP (1.00) | DMAAHKGLMNIALAISAQQV (1.000) | |
DISAIKSAMEKLGQESQALG (0.563) | DGKDQNVTGSVVCTTAAGNV (0.998) | PLHRSDGSGDTFLFTQYLSK (0.998) | ||
IAAHNKLLGSFELTGIPPAP (0.435) | DGNPPEVKSVGLGNVNGVTL (0.997) | SQRGLGEAQLGNSSGNFLLP (0.990) | ||
AEGGSKVPEDTLNKVDAAVA (0.353) | LPGVSEHLKLNGKVLAAMYQ (0.351) | |||
DIDANGIVHVTAKDKGTGKE (0.353) | QAAAAGFASKTPANQAISMI (0.294) | |||
QAIYEAAQAASQATGAAHPG (0.203) |
3.3. Prediction of signal peptides within protein sequence
The presence of signal peptides on proteins presents issues with downstream applications especially when attempting to clone the gene into another host [69]. The SignalP 6.0 online tool was used to identify the presence of these peptide regions within the selected adhesin proteins. All three proteins contained a signal peptide (Fig. 2). The tnHSP70 protein did not contain any signal peptide sequences (Fig. 2D). Both the lpqH and PstS1 adhesin proteins contain a lipoprotein signal peptide (Sec/SPII) with a probability of 1. The cleavage sites can be located between positions 21 and 22 of the lpqH adhesin protein (Fig. 3A) and at positions 23 and 24 on the PstS1 adhesin protein (Fig. 2C). The MTP protein contains a signal peptide (Sec/SPI) with a probability of 0.9979 with a cleavage site located between positions 31 and 32 (Fig. 2B). For the design of the tri-fusion protein construct, these sequences were excluded and their truncated sequences were used for further analysis (tnlpqH, tnMTP and tnPstS1).
3.4. Prediction of T-cell epitopes within protein sequence
T cells are able to recognize epitopes that are presented by the major histocompatibility molecules (MHC) which are either expressed on the surfaces of nucleated cells or on the surfaces of specialized antigen-presenting cells (APCs) [70]. The IEDB tool was used to determine the T cell epitopes present on each of the adhesin protein sequences. Table 4 displays the epitopes sequences binding to either MHC I or MHC II. These epitope regions were included in the trifusion protein construct sequences displayed on Table S1.
Table 4.
Adhesin Protein | MHC I |
MHC II |
||||
---|---|---|---|---|---|---|
Allele | Epitope | Rank | Allele | Epitope | Rank | |
tnHSP70 | HLA-B∗40:01 | REIAAHNKL | 0.990 | HLA-DRB3∗01:0 | EVTFDIDANGIVHVT | 0.11 |
HLA-A∗68:01 | ETKGGVMTR | 0.982 | HLA-DRB3∗01:0 | IEVTFDIDANGIVHV | 0.13 | |
HLA-B∗07:02 | APRGIPQIEV | 0.915 | HLA-DRB3∗02:0 | TRLIERNTTIPTKRS | 0.08 | |
HLA-A∗03:01 | RIQEGSGLSK | 0.913 | HLA-DRB3∗01:0 | QIEVTFDIDANGIVH | 0.13 | |
HLA-A∗02:03 | RLIERNTTI | 0.900 | HLA-DRB5∗01:0 | ANGIVHVTAKDKGTG | 0.13 | |
lpqH | HLA-A∗11:01 | ATKDGSHYK | 0.960 | HLA-DRB1∗03:01 | ASGPKVVIDGKDQNV | 2.50 |
HLA-A∗30:01 | ATKDGSHYK | 0.946 | HLA-DRB1∗03:01 | SGPKVVIDGKDQNVT | 2.50 | |
HLA-A∗26:01 | NVNGVTLGY | 0.928 | HLA-DRB1∗03:01 | AASGPKVVIDGKDQN | 2.70 | |
HLA-A∗03:01 | ATKDGSHYK | 0.872 | HLA-DRB1∗03:01 | GPKVVIDGKDQNVTG | 2.70 | |
HLA-A∗68:02 | TTAAGNVNI | 0.815 | HLA-DRB1∗03:01 | PKVVIDGKDQNVTGS | 2.70 | |
PstS1 | HLA-B∗57:01 | QTLQAFLHW | 0.996 | HLA-DRB5∗01:01 | INYEYAIVNNRQKDA | 0.60 |
HLA-B∗58:01 | QTLQAFLHW | 0.993 | HLA-DRB5∗01:01 | IINYEYAIVNNRQKD | 0.66 | |
HLA-B∗57:01 | LSKQDPEGW | 0.980 | HLA-DRB5∗01:01 | NYEYAIVNNRQKDAA | 0.66 | |
HLA-B∗07:02 | TPASSPVTL | 0.978 | HLA-DRB5∗01:01 | PIINYEYAIVNNRQK | 0.66 | |
HLA-B∗58:01 | LSKQDPEGW | 0.975 | HLA-DRB5∗01:01 | YEYAIVNNRQKDAAT | 0.71 | |
MTP | HLA-B∗57:01 | QTAPVPDYYW | 0.994 | HLA-DRB3∗01:01 | APVPDYYWCPGQPFD | 5.80 |
HLA-B∗58:01 | QTAPVPDYYW | 0.990 | HLA-DRB3∗01:01 | PVPDYYWCPGQPFDP | 5.90 | |
HLA-A∗01:01 | QTAPVPDYY | 0.977 | HLA-DRB3∗01:01 | VPDYYWCPGQPFDPA | 6.00 | |
HLA-A∗26:01 | QTAPVPDYY | 0.827 | HLA-DRB3∗01:01 | PDYYWCPGQPFDPAW | 6.20 | |
HLA-A∗30:02 | QTAPVPDYY | 0.805 | HLA-DRB3∗01:01 | TAPVPDYYWCPGQPF | 6.80 |
3.5. Population coverage for MCH class I and II alleles
The IEDB population coverage tool was used to investigate the population coverage of a combination of classes of the trifusion protein constructs using the alleles prominent to the South African population. This predicted a population of 91,81 % for both tnLPH and tnMPH with an average number of epitope hits of 1,98. Using the same alleles, a world population coverage was also performed for a combination of both the classes. Both tnLPH and tnMPH displayed a coverage of 90,95 % with an average number of epitope hits of 1,97.
3.6. Prediction of allergenicity of protein sequences
Two online tools were used to determine if the truncated epitope regions of the selected adhesin proteins could induce an allergic response in the host. Allergenicity is the ability of the antigens to induce an abnormal immune response. The allergenicity results of the truncated proteins are listed in Table S2. All the truncated regions of the adhesin protein and the tri-fusion constructs, tnLPH and tnMPH, do not contain any experimental Immunoglobulin E (IgE) epitopes. The IgE is an allergen-specific antibody. The Motif Alignment and Search Tool (MAST) tool searches biological sequence databases for the sequences that contain one or more of a group of known motifs [54]. According to the output, all proteins are non-allergens since there were no matches to any motifs. The Support Vector Machine (SVM) can detect allergens based on either the amino acid sequence or the dipeptide composition of the protein. Based on amino acid composition, both the tri-fusion protein constructs and the tnlpqH, tnPstS1 proteins were identified as probable allergens whilst the tnMTP and tnHSP70 were identified as an allergen and potential allergen, respectively. Using the dipeptide composition, tnlpqH, tnPstS1 and tnLPH were identified as potential allergens, whereas the tnMTP was identified as a non-allergen and the tnHSP70 and tnMPH tri-fusion construct were identified as allergens. The BLAST results indicated that all individual truncated proteins were non-allergens, whereas a sequence present in the tri-fusion protein constructs was identified as an allergen. According to the hybrid approach, which considers the results from each of the preceding methods, all proteins were non-allergens. The AllerTop 2.0 online tool which is based on the amino acid composition of the protein and its structure-activity relationships (QSAR), identified all proteins as probable non-allergens, except tnMTP which was identified as a probable allergen.
3.7. Prediction of secondary and tertiary structure of fusion protein and its physicochemical characteristics
The physicochemical characteristics such as the amino acid composition, theoretical pI, protein half-life, aliphatic and instability index and the grand average of hydropathicity (GRAVY) of each of the tri-fusion proteins, determined by the ProtParam ExPasy tool, are displayed in Table 5. The instability index of both protein constructs was below 40 whilst the GRAVY index of the tri-fusion protein was calculated to be −0.29 and-0.406 for tnLPH and tnMPH, respectively.
Table 5.
Protein Parameters | tnLPH | tnMPH |
---|---|---|
Number of amino acids | 636 | 617 |
Molecular weight | 64865.80 | 63958.62 |
Theoretical pI | 4.77 | 4.63 |
Total number of negatively charged residues (Asp + Glu) | 73 | 79 |
Total number of positively charged residues (Asp + Glu) | 45 | 43 |
Formula | C2809H4458N798O942S12 | C2793H4336N782O917S13 |
Total number of atoms | 9019 | 8841 |
Extinction coefficients all pairs of Cys residues form cystines all Cys residues are reduced | 35995 | 55600 |
35870 | 55350 | |
Estimated half-life | 30 h (mammalian reticulocytes, in vitro | 1 h (mammalian reticulocytes, in vitro |
>20 h (yeast, in vivo) | 30 min (yeast, in vivo) | |
>10 h. (E. coli, in vivo) | >10 h. (E. coli, in vivo) | |
Instability Index | 27.52 | 34.23 |
Aliphatic index | 76.79 | 70.97 |
Grand average of hydropathicity (GRAVY) | −0.29 | −0.406 |
The tertiary structure of tnLPH and tnMPH was predicted by the D-I-TASSER server and 5 models with TM-scores were obtained. A template modelling score (TM-score) is between 0 and 1 and is proportional to the relation of the prediction. Fig. 3A and B displays the first model of each of the trifusion proteins which displayed the highest TM-score. Model 1 of tnLPH had a TM-score of 0.65 (0.48–0.65) and the first model of tnMPH had a TM-score 0.51 (0.49–0.51).
3.8. Tertiary structure refinement and validation
The first model of the D-I-TASSER prediction was subjected to refinement and 5 models were generated by the Galaxy Refine server with the first model being highly favoured (Fig. 3C & D). Each model displays scores for the global distance test (GDT-HA), root mean square deviation (RMSD), MolProbity, Clash Score, Poor rotamers and Rama favoured score (Figs. S2A and B) . In comparison to the initial model of the tnLPH, the selected refined model displayed a reduced MolProbity (from 3.262 to 2.424) and Clash score (100.2–22.0) with an increased Rama favourability (74.8–91.5) and presence of poor rotamers (0–1.3). The selected refined tnMPH model displayed a reduced MolProbity (3.068–2.135) and Clash score (65.9–15.8) and an increased Rama favourability (76.4–93.3) and presence of poor rotamers (0–0.7). Both models displayed a GDT-HA (0.9367–0.9404) and RMSD (0.461–0.498) score indicating that the refined models closely resembled the native trifusion models.
The refined models were then subjected to validation on the SAVESv6.0 server. The ERRAT data suggests good overall quality of the refined models. The Ramachandran plot, generated by the PROCHECK programme, graphically displays the energetically allowed and forbidden regions for the dihedral angles. The tnLPH has 87,7% of residues in the favoured regions; 10% in the additional allowed regions, 0.8% in the generously allowed regions; and 1.4% in the disallowed regions. The tnMPH has 89.1% of residues in the favoured regions; 8.5 % in the additional allowed regions, 1 % in the generously allowed regions; and 1.4% in the disallowed regions.
3.9. In silico immune simulation
The in silico immune simulation indicated that both tri-fusion protein candidates were able to elicit an immune response. This was characterized by the heightened immunoglobulin M (IgM) and immunoglobulin G (IgG) concentration in response to the antigen when injected at all three timepoints (Fig. 4A and B). Furthermore, the antigen was depleted whilst the IgG concentration was maintained over a prolonged period, indicating that the immune system was able to eradicate the antigen from the system. In addition, there was an increase in natural killer (NK) cells, dendritic cell (DC), macrophages (MA) and epithelial cells (EP) (Figs. S3 and S4: I, J, K and L). Three peaks corresponding to the timepoint of injections were observed in B- cell population and T-helper (TH) population (Figs. S3 and S4:A and D). The cytokine profile induced by the immunization of tnLPH and tnMPH were analysed (Fig. 4C and D). Both candidates generated high levels of interferon gamma (IFN-g), transforming growth factor beta (TGF-b), interleukin-10 (IL-10) and interleukin-12 (IL-12) where three peaks were observed. The D levels remained low indicating that these candidates did not pose a danger to the immune system [60].
3.10. In silico cloning of the tri-fusion proteins
Fig. 1 displays the order in which the trifusion protein constructs will be arranged. The DNA sequences of each of the selected proteins are highlighted in Table S1. Restriction endonucleases were included onto the N- and C-terminal ends of tri-fusion constructs tnlpqH-tnPstS1-tnHSP70 and tnMTP-tnPstS1-tnHSP70, respectively. Fig. 5 displays the recombinant vector map of the trifusion protein constructs, tnLPH and tnMPH, respectively. The Snapgene tool was further used to confirm that the incorporation of the restriction sites on the proposed protein constructs did not alter the reading frame. In addition, the PreScission Protease site remained intact which could be later be exploited for the cleavage of the glutathione-S-transferase (GST) tag.
4. Discussion
Diagnostic tests that are based on multiple, instead of a single antigen, have demonstrated improved sensitivity [[71], [72], [73], [74]]. Recombinant subunit proteins constructed from many TB antigens have become suitable vaccine candidates owing to their potent immune response [[75], [76], [77], [78], [79]]. The epitope regions present on an antigen are responsible for the interaction with the host cells, triggering an immune response [80]. The removal of non-epitope regions from an antigen and the fusion of several epitope regions from the same, or different antigens, may greatly enhance the sensitivity of a diagnostic assay [81]. In addition, the fused epitopes may stimulate a heightened protection owing to the accessibility of these regions for the interaction with immune cells [64]. Adhesin proteins known to elicit an immune response are suitable candidates for the design of a multi-epitope fusion construct [82,83]. In silico and in vivo studies have demonstrated that the Heparin-binding hemagglutinin adhesin (HBHA) and the Antigen 85 Complex are well-studied adhesin proteins that have shown great potential as both diagnostic and vaccine candidates [[84], [85], [86], [87], [88], [89]]. An immunodiagnostic test containing the Mtb HBHA induced interferon-gamma-release assay (IGRA) differentiated between active TB infection and latent TB infection but with reduced sensitivities in HIV infected patients [89]. A mucosal vaccine containing the fusion of Ag85B-Acr-HBHA which is absorbed on the surface of the inactivated Bacillus subtilis spores displayed reduced bacterial infection in the lungs and spleen of mice but not in non-human primates (NHP) when serving as a booster to the current vaccine, BCG [87]. A trial in South African adolescents (Aeras C-040-404) with BCG revaccination and vaccination with H4:IC31, a recombinant subunit vaccine comprising adhesins Ag85B and TB10.4, displayed efficacy in preventing sustained QuantiFERON (QFT) conversion [90,91]. The study showed highest vaccine-induced CD4+ T-cell response rates were for those recognizing the Ag85B component. A virus-like particle (VLP) consisting of a fusion of the HBHA and MTP epitopes (LV20) was expressed in the Baculovirus-insect cell expression system [86]. When coupled with an adjuvant such as the DDA and Poly I:C (DP), this vaccine demonstrated both a cell-mediated and humoral response. Therefore, the fusion of many immunodominant adhesin proteins make for a more enhanced vaccine candidate.
In this study, two tri-fusion protein constructs comprising multiple epitopes from 3 immunodominant adhesin proteins were designed for use as potential serodiagnostic and vaccine candidates.
4.1. Four immunodominant Mtb adhesin proteins were selected
The VaxiJen software enabled the ranking of 26 known Mtb adhesin proteins based on their ability to act as potential antigens. Any antigen exceeding the bacterial threshold of 0.4 [43] was considered as a potential antigen. The suitability of the top ten ranked adhesin proteins as candidates for the tri-fusion protein construct was determined from published reports. The presence of epitope regions was determined using the IEDB, a database with previously studied experimental data on antibody and T cell epitopes. Six of the ten adhesin proteins did not return a positive result since no experimental data was available on the database at the time of analysis. This included the highly ranked Wag22_PE_PGRS and the PE_PGRS33 adhesin protein.
Therefore, despite being ranked 1 and 2, the adhesin proteins belonging to the PE_PGRS family were not selected owing to their inability to be expressed and purified outside of their host with ease [92,93], and the lack of available experimental data on the IEDB [94]. The HSP70, PstS1, 19 kDa lipoprotein and the HBHA adhesin proteins demonstrated the presence of known epitopes on the IEDB, ranging from 4 to 60 epitope regions, with the HBHA displaying the lowest number. The MTP adhesin protein was selected, despite not yielding any saved epitope regions, since previous studies conducted by Alteri et al., 2007; Naidoo et al., 2018; Pillay et al., 2024 (unpublished data) demonstrated the presence of anti-MTP antibodies in patient serum samples [63,42,95]; indicating that MTP is capable of eliciting host immune response. The adhesin proteins tnHSP70, PstS1, 19 kDa and MTP were considered suitable candidates for use in the tri-fusion protein construct as discussed below.
4.1.1. Heat-shock protein 70 (HSP70)
Heat-shock proteins (hsp) have been shown to possess anti-inflammatory properties and function as immune modulators. Mycobacterial heat-shock proteins such as the 70kDa display an intense helper effect in cellular and humoral responses when coupled to other peptides in the absence of an adjuvant [[96], [97], [98]]. The HSP70 adhesin is known to induce the early production of IgG. The C-terminal domain of the Mtb HSP70 from amino acids 359–610 was shown to be a highly potent adjuvant in comparison to the amino acids located in the N-terminal domain when investigating the humoral response of the Hepatitis B vaccine [99]. The early secretory protein 6 (ESAT-6) fused to the C-terminal of the HSP70 stimulated Th1 mediated immunity and was shown to retain immunogenicity when fused together [66]. The use of the HSP70 as a fusion protein resulted in protein expression within both the soluble fraction and inclusion body, with the latter displaying a greater yield [66]. Since HSP70 is a molecular chaperone, it can assist in the refolding of proteins and prevent aggregation formation during stressful conditions [100]. It is anticipated that the use of the C-terminal of the Hsp70359-610 will not only enhance the immune response attained by the tri-fusion protein vaccine candidate but will also promote the expression of the recombinant protein within the soluble fraction. Further analysis revealed that this region contains several B- and T-cell epitope sequences.
4.1.2. 19 kDa lpqH lipoprotein
Lipoproteins are involved in the virulence and immunoregulatory processes of TB pathogenesis. The lpqH adhesin protein interacts with the mannose receptor to promote the phagocytosis of the Mtb bacilli. In addition to inducing a T cell mediated immunity, it has been shown to down-regulate the antigen presentation to T cells [[101], [102], [103]]. The stage-specific expression promotes such adhesin proteins as suitable vaccine candidates. The DNA vaccine constructed with lpqH did not provide protection in mice against TB [104,105]. In contrast, the expression of the lpqH in strains such as Mycobacterium smegmatis and Mycobacterium vaccae demonstrated a reduced protection in mice when challenged with Mtb [104]. This indicates that the use of a live form of mycobacteria displays reduced vaccine efficacy and should therefore not be considered for testing the lpqH. A DNA vaccine comprising lpqH fused to a marked protein of the autophagosome, a microtubule associated protein light chain-3 (CL3), lowered bacterial load in the spleen and lungs of challenged mice [106]. This indicates an improved Th1 response and protection when two or more proteins are fused in the construction of vaccine candidates. In animals, lpqH is regarded as a plasma biomarker which can differentiate between paratuberculosis positive animals or paratuberculosis vaccinated cows from the TB positive cows [107].
4.1.3. Phosphate transport protein 1 (PstS1)
Another lipoprotein, PstS1, a phosphate-binding transporter protein, is a glycosylated protein with mannose residues located at the N-terminal region [108]. The study of the three PstS proteins (PstS1, PstS2 and PstS3) as DNA vaccine candidates demonstrated that the PstS1 did not confer a protective effect, whilst the PstS3 induced the highest cytokine secretion [109]. However, PstS1 displayed a strong antibody response suggesting that it may be used as a serodiagnostic biomarker [30]. It has also been shown to differentiate between active and latent TB and is most likely associated with active TB [110]. The use of a truncated PstS1 antigen was shown to improve the diagnostic efficiency of the assay [111]. The removal of several amino acid sequences from both the N- and C-terminal domains unmasked the epitopes, leading to greater exposure for antibody-antigen interaction. The authors further showed that the fusion of the truncated PstS1 to the EchA1 (Rv0222) proteins displayed an improved serodiagnosis of TB [112].
4.2. Two trifusion protein constructs with multi-epitope regions lacking a signal peptide with good population coverage
Common B-cell epitope regions (Table 3) predicted by the three online tools were incorporated into the design of the tri-fusion protein. T-cell epitope regions were identified using IEDB where reference alleles for both the MHC I and MHC II binding sites were selected. The epitope regions were also included in the tri-fusion protein constructs. Since cloning was to be performed in-house, a continuous region of the epitopes on the gene sequence was used. Three adhesin proteins, lpqH, PstS1 and MTP were identified by the signal peptide prediction tool to contain a signal peptide at the N-terminal domain. For the design of the tri-fusion protein construct, the signal peptides of these proteins were removed to optimize the expression of the recombinant protein [113]. The truncated gene sequence encoding the adhesin proteins that were used in the design of the tri-fusion protein are shown in Table S1 with the schematic of the design of the trifusion constructs shown in Fig. 2. The population coverage of both the tnLPH and tnMPH trifusion protein constructs in the South African population was 91,81 % for a combination of MHC alleles with an average of 1,98 epitopes recognized by the population. In addition, a world coverage of 90,95 % was obtained. This indicates that the potential vaccine candidates can be recognized by a vast number of people and can therefore elicit an immune response.
4.3. AllerTop 2.0 and the hybrid approach indicate that both trifusion protein constructs are non-allergens
The ability of the antigens to induce an abnormal immune response was predicted using AlgPred and AllerTop 2.0 (Table S2). The AllerTop server predicts the allergenicity of a protein based on the physicochemical characteristics of the protein sequence [55,114]. AlgPred integrates the results of various methods to predict an allergen [54], however, the allergenicity results of the truncated proteins obtained were contradictory in each of the methods. According to the IgE epitope mapping, Motif Alignment and Search Tool (MAST), all the truncated regions of the adhesin protein and the tri-fusion constructs, tnLPH and tnMPH are non-allergens. The MAST tool displayed the highest sensitivity of 93.94% with a specificity of 33.34%, whilst the highest specificity of 98.14% with a sensitivity of 17.47% was achieved by the IgE mapping. However, The Support Vector Machine (SVM) based on amino acid composition which has an accuracy of 85.02% predicted that both the tri-fusion protein constructs and the tnHSP70, tnlpqH, tnPstS1 and tnHSP70 proteins are probable allergens, whilst the tnMTP is identified as an allergen. Using the dipeptide composition with an accuracy of 85%, the tnHSP70, tnlpqH, tnPstS1 and tnLPH were identified as potential allergens, whereas the tnMTP was identified as a non-allergen and the tnMPH tri-fusion construct was identified as an allergen. The BLAST results predicted that all individual truncated proteins were non-allergens but a sequence within the tri-fusion protein constructs was identified as an allergen. However, the Hybrid Approach predicted that all proteins were non-allergens. This approach is based on the combination of two or more methods and can therefore be considered highly likely. The AllerTop 2.0 online tool predicted that all proteins are non-allergens except for tnMTP, which was identified as a probable antigen. The AllerTop 2.0 tool has been shown to display the highest sensitivity (94%) when compared to other tools whilst the AlgPred showed the highest specificity (100%) [114]. Therefore, all results were taken into consideration for finalizing the design and use of the tri-fusion protein construct.
4.4. Hydrophilic, thermo-stable tri-fusion protein constructs identified by ExPasy ProtParam with a confident prediction of tertiary structure by D-I-TASSER
The physicochemical properties of each tri-fusion protein were computed by ExPasy ProtParam tool. These included the amino acid composition, molecular weight, theoretical isoelectric points (pI), extinction coefficient, instability index, aliphatic index, grand average of hydropathicity (GRAVY), and total number of negatively and positively charged residues (Table 5). The theoretical pI was 4.77 and 4.63, for tnLPH and tnMPH, respectively. The theoretical pI is the pH at which a particular molecule or surface carries no net electrical charge, and it is useful for understanding the protein charge stability. The extinction coefficient is defined as a measurement of how strongly a protein absorbs light at 280 nm and may aid in the study of the interactions of protein-protein and protein-ligand [115]. The extinction coefficients for tnLPH and tnMPH were determined to be 35870 and 55350, respectively. A high extinction coefficient is associated with the high concentration of cysteine residues which implies a more stable protein structure [116,117]. The instability index is an estimation of the stability of a protein experimentally, where a value smaller than 40 is considered as stable. The instability index values for tnLPH and tnMPH is 27.52 and 34.23, respectively, indicating that both tri-fusion proteins are stable. The half-life of tnLPH and tnMPH were estimated at 30 h and 1 h in mammalian reticulocytes respectively. This implies that the tnLPH may actually be able to trigger a better immune response in vivo rather than the tnMPH since it will persist longer within the host. The aliphatic index of a protein is defined as the relative volume occupied by aliphatic side chains, alanine, valine, isoleucine and leucine. The aliphatic index for tnLPH and tnMPH is 76.79 and 70.97, respectively, indicating that the tri-fusion protein is thermo-stable over a wide temperature range. The GRAVY value was calculated as the sum of hydropathy values of all of the amino acids divided by the number of residues in the sequence. A low GRAVY value indicates the possibility of the protein being a hydrophilic rather than hydrophobic protein. The GRAVY index of the tri-fusion protein was calculated to be −0.29 and-0.406 for tnLPH and tnMPH, respectively. This implies that the tri-fusion protein may be hydrophilic in nature and maybe more amenable to purification strategies.
The D-I-TASSER online software was used to predict the tertiary structure of both trifusion constructs which proposed 5 models with varying TM-scores. TM-score is a more accurate measure of global similarity between two structures in comparison to the RMSD [58]. The TM-score closest to 1 implies a perfect match between the protein structures and the first models for both constructs displayed the highest TM-score (Fig. 3A and B).
This models were subjected to refinement to optimize the predicted structures. The Galaxy Refine server generated five models with the first model being the most favoured. The Galaxy Refine displays six factors relating to the original model submission, including the GDT-HA, RMSD, MolProbity, Clash score, Poor Rotamers and Rama favoured [59]. The GDT-HA and RMSD pertains to the similarity of the two structures and this scores indicated that the refined model and the original model were highly similar. The MolProbity score should lie between −4 and 2. The MolProbity score for the original models of both proteins were >3 and the reduction observed is an indication of a refined model with a closer to normal structure. This is further confirmed by the reduction in clash scores observed [118]. The poor rotamer score refers to the geometry of the amino acid side chains and its presence in the poor favoured category. Despite an increase in comparison to the original model, the refined models are within the accepted range since a value greater than 1.5% is a cause for concern. The Rama favourability of these models is linked to the proposed statistical distribution of the combinations of the residues on the backbone dihedral angles. This is graphically illustrated by the Ramachandran plots generated by the PROCHECK validation server [119,120]. The refined tnLPH and tnMPH models contain 87.7–89.1% of residues in the favoured regions; 10–8.5% in the additional allowed regions, 0.8–1% in the generously allowed regions; and 1.4% in the disallowed regions. The residues in disallowed region should not exceed 2% as this would indicate steric hindrance between the side chain and the main atoms resulting in a poor protein structure. Since both constructs are below this value, the refined structure is of good quality. The ERRAT program verifies protein structure that is determined by crystallography and provides an overall measure of quality and resolution of structure [121]. A good model should display an overall quality of 95% but anything above 50% is accepted. The overall quality of the tnLPH and tnMPH was 76.51% and 85.25%, respectively. Collectively, the refinement and validation results indicate that the predicted structures for the tnLPH and tnMPH trifusion proteins are good. The in silico cloning of both trifusion constructs into the pGEX6P-1 vector indicated that the reading frame of the sequence is not altered nor does its incorporation affect the PreScission Protease sequence on the vector.
4.5. In silico immune simulation shows that both trifusion constructs are immunogenic
In silico analysis of the tri-fusion constructs indicated that the vaccine candidates may be immunogenic since the design was based on B and T cell epitopes which are needed to stimulate both a humoral and adaptive immune response. This is further evidenced by the in silico immune simulation which revealed that both the tnLPH and tnMPH elicited both an adaptive and innate immune response characterized by heightened concentrations of TH cells and B-cells during immunization. Furthermore, both candidates stimulated the production of the IFN-g, TGF-b IL-10 and IL-12 cytokines which are essential for the protection of the host (Fig. 4C and D). The presence of immunoglobulins remained after the eradication of the antigen (Fig. 4A and B). The use of a GGGGS linker sequence between each of the truncated adhesin protein will provide accessibility and flexibility to the epitope regions. The analysis of the physicochemical characteristics of the tri-fusion construct indicates that both candidates when expressed, should be hydrophilic and stable but this is yet to be experimentally confirmed. Therefore, the present findings should be followed up with the synthesis of the proposed trifusion constructs, cloned into a suitable vector and transformed into an appropriate bacterial host such as the Escherichia coli BL21. The recombinant protein should be expressed and purified in sufficient quantity for the immunization of mice to further validate the findings of the in silico analysis. Furthermore, the recombinant proteins can be evaluated in an enzyme-linked immunosorbent assay (ELISA) to determine the diagnostic potential of these two tri-fusion proteins.
5. Conclusions
The use of in silico immunoinformatic tools has been shown to be vital to the generation of potential vaccine candidates, diagnostic biomarkers and therapeutic drug targets. It has greatly assisted in the control of coronavirus pandemic and has extensively been used for the design of new TB vaccine candidates. Two potential subunit vaccines comprising of truncated epitope regions of immunodominant Mtb adhesin proteins were designed. The phosphate-binding transport protein PstS1, C-terminal region of heat-shock protein 70, and either the Mycobacterium tuberculosis curli pili or the 19 kDa lipoprotein, were incorporated in the each of the trifusion protein constructs. The analysis revealed that the potential vaccine candidates are immunogenic, thermostable, hydrophilic and are structures of good quality. However, these claims are yet to be validated by in vitro and in vivo experiments.
Ethical considerations
This study was approved by the Biomedical Research Ethics Committee (BREC) of the University of KwaZulu-Natal (BREC/00001444/2020).
CRediT authorship contribution statement
Koobashnee Pillay: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing. Thamsanqa E. Chiliza: Writing – review & editing. Sibusiso Senzani: Writing – review & editing. Balakrishna Pillay: Conceptualization, Writing – review & editing. Manormoney Pillay: Conceptualization, Funding acquisition, Supervision, Writing – review & editing.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgements
This research was funded by the project running costs of MP's UKZN Research Flagship “Afrocentric Precision Approach to Control Health Epidemics (APACHE)” grant award as Co-PI. KP was supported by the National Research Foundation (NRF)- German Academic Exchange Service (DAAD) PhD Scholarship (grant ID 117866).
This research was presented at the University of KwaZulu-Natal's College of Health Science School of Laboratory Medicine and Medical Sciences Research Day on the October 4, 2023 in Westville, South Africa.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.heliyon.2024.e37536.
Appendix A. Supplementary data
The following is the Supplementary data to this article:
References
- 1.Dheda K., Perumal T., Moultrie H., Perumal R., Esmail A., Scott A.J., et al. The intersecting pandemics of tuberculosis and COVID-19: population-level and patient-level impact, clinical presentation, and corrective interventions. The Lancet Respiratory. 2022;10:603–622. doi: 10.1016/S2213-2600(22)00092-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.World Health Organization Global tuberculosis report 2022. 2022. https://www.who.int/teams/global-tuberculosis-programme/tb-reports/global-tuberculosis-report-2022
- 3.Voss G., Casimiro D., Neyrolles O., Williams A., Kaufmann S.H.E., McShane H., et al. Progress and challenges in TB vaccine development. F1000Res. 2018;7:199. doi: 10.12688/f1000research.13588.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Mutavhatsindi H., van der Spuy G.D., Malherbe S.T., Sutherland J.S., Geluk A., Mayanja-Kizza H., et al. Validation and optimization of host immunological bio-signatures for a point-of-care test for TB disease. Front. Immunol. 2021;12:194. doi: 10.3389/FIMMU.2021.607827/BIBTEX. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Luca S., Mihaescu T. vol. 8. 2013. (Maedica-a Journal of Clinical Medicine History of BCG Vaccine). [PMC free article] [PubMed] [Google Scholar]
- 6.Mahairas G.G., Sabo P.J., Hickey M.J., Singh D.C., Stover C.K. Molecular analysis of genetic differences between Mycobacterium bovis BCG and virulent M. bovis. J. Bacteriol. 1996;178:1274. doi: 10.1128/JB.178.5.1274-1282.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Keyser A., Troudt J.L.M., Taylor J.L., Izzo A.A. BCG sub-strains induce variable protection against virulent pulmonary Mycobacterium tuberculosis infection, with the capacity to drive Th2 immunity. Vaccine. 2011;29:9308. doi: 10.1016/J.VACCINE.2011.10.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nemes E., Geldenhuys H., Rozot V., Rutkowski K.T., Ratangee F., Bilek N., et al. Prevention of M. Tuberculosis infection with H4:IC31 vaccine or BCG revaccination. N. Engl. J. Med. 2018;379 doi: 10.1056/NEJMoa1714021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.White R.G., Hanekom W.A., Vekemans J., Harris R.C. The way forward for tuberculosis vaccines. 2019. [DOI] [PubMed]
- 10.Kaufmann S.H.E., Dockrell H.M., Drager N., Ho M.M., McShane H., Neyrolles O., et al. TBVAC2020: advancing tuberculosis vaccines from discovery to clinical development. Front. Immunol. 2017;8:1203. doi: 10.3389/fimmu.2017.01203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Voss G., Casimiro D., Neyrolles O., Williams A., Kaufmann S.H.E., Mcshane H., et al. Progress and challenges in TB vaccine development. F1000 Research. 2018;7:1–14. doi: 10.12688/f1000research.13588.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhu B., Dockrell H.M., Ottenhoff T.H.M., Evans T.G., Zhang Y. Tuberculosis vaccines: opportunities and challenges. Respirology. 2018 doi: 10.1111/resp.13245. [DOI] [PubMed] [Google Scholar]
- 13.World Health Organization WHO preferred product characteristics for new tuberculosis vaccines department of immunization. Vaccines And Biologicals. 2018 [Google Scholar]
- 14.TB Vaccine Clinical Pipeline - Working Group on New TB Vaccines n.d. https://newtbvaccines.org/tb-vaccine-pipeline/(accessed January 6, 2024).
- 15.Martin C., Aguilo N., Marinova D., Gonzalo-Asensio J. Update on TB vaccine pipeline. Appl. Sci. 2020;10 doi: 10.3390/app10072632. [DOI] [Google Scholar]
- 16.Pipeline of vaccines - TBVI n.d. https://www.tbvi.eu/what-we-do/pipeline-of-vaccines/(accessed September 17, 2022).
- 17.Scriba T.J., Kaufmann S.H.E., Lambert P.H., Sanicas M., Martin C., Neyrolles O. Vaccination against tuberculosis with whole-cell mycobacterial vaccines. J. Infect. Dis. 2016;214:659–664. doi: 10.1093/INFDIS/JIW228. [DOI] [PubMed] [Google Scholar]
- 18.Duong V.T., Skwarczynski M., Toth I. Towards the development of subunit vaccines against tuberculosis: the key role of adjuvant. Tuberculosis. 2023;139 doi: 10.1016/J.TUBE.2023.102307. [DOI] [PubMed] [Google Scholar]
- 19.Gengenbacher M., Kaufmann S.H.E. Mycobacterium tuberculosis: success through dormancy. FEMS Microbiol. Rev. 2012;36:514. doi: 10.1111/J.1574-6976.2012.00331.X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wright K., Plain K., Purdie A., Saunders B.M., De Silva K. Biomarkers for detecting resilience against mycobacterial disease in animals. Infect. Immun. 2020;88 doi: 10.1128/IAI.00401-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Govender V.S., Ramsugit S., Pillay M. Mycobacterium tuberculosis adhesins: potential biomarkers as anti-tuberculosis therapeutic and diagnostic targets. Microbiology (N. Y.) 2014;160:1821–1831. doi: 10.1099/mic.0.082206-0. [DOI] [PubMed] [Google Scholar]
- 22.Ramsugit S., Pillay M. Identification of Mycobacterium tuberculosis adherence-mediating components: a review of key methods to confirm adhesin function. Iran J Basic Med Sci. 2016;19:579–584. [PMC free article] [PubMed] [Google Scholar]
- 23.Xin Q., Niu H., Li Z., Zhang G., Hu L., Wang B., et al. Subunit vaccine consisting of multi-stage antigens has high protective efficacy against Mycobacterium tuberculosis infection in mice. PLoS One. 2013;8 doi: 10.1371/JOURNAL.PONE.0072745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bellini C., Horváti K. Recent advances in the development of protein- and peptide-based subunit vaccines against tuberculosis. Cells. 2020;9 doi: 10.3390/CELLS9122673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yuan W., Dong N., Zhang L., Liu J., Lin S., Xiang Z., et al. Immunogenicity and protective efficacy of a tuberculosis DNA vaccine expressing a fusion protein of Ag85B-Esat6-HspX in mice. Vaccine. 2012;30:2490–2497. doi: 10.1016/j.vaccine.2011.06.029. [DOI] [PubMed] [Google Scholar]
- 26.Fulkerson J., Brennan M., Velmurugan K., Locht C. 2012. Recombinant Mycobacterium Encodinga Heparin-Binding Hemagglutinin (Hbha) Fusion. US 2018/0305416 A1. [Google Scholar]
- 27.de Martino M., Lodi L., Galli L., Chiappini E. Immune response to Mycobacterium tuberculosis: a narrative review. Front Pediatr. 2019;7 doi: 10.3389/FPED.2019.00350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Squeglia F., Ruggiero A., De Simone A., Berisio R. A structural overview of mycobacterial adhesins: key biomarkers for diagnostics and therapeutics. Protein Sci. 2018;27:369–380. doi: 10.1002/pro.3346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ragas A., Roussel L., Puzo G., Rivière M. The Mycobacterium tuberculosis cell-surface glycoprotein apa as a potential adhesin to colonize target cells via the innate immune system pulmonary C-type lectin surfactant protein A. J. Biol. Chem. 2007;282:5133–5142. doi: 10.1074/jbc.M610183200. [DOI] [PubMed] [Google Scholar]
- 30.Esparza M., Palomares B., García T., Espinosa P., Zenteno E., Mancilla R. PstS-1, the 38-kDa Mycobacterium tuberculosis glycoprotein, is an adhesin, which binds the macrophage mannose receptor and promotes phagocytosis. Scand. J. Immunol. 2015;81:46–55. doi: 10.1111/sji.12249. [DOI] [PubMed] [Google Scholar]
- 31.Ronning D.R., Vissa V., Besra G.S., Belisle J.T., Sacchettini J.C. Mycobacterium tuberculosis antigen 85A and 85C structures confirm binding orientation and conserved substrate specificity. J. Biol. Chem. 2004;279:36771–36777. doi: 10.1074/jbc.M400811200. [DOI] [PubMed] [Google Scholar]
- 32.Hickey T., Ziltener H.J., Speert D.P., Stokes R.W. Mycobacterium tuberculosis employs Cpn60. 2 as an adhesin that binds CD43 on the macrophage surface. Cell Microbiol. 2010;12:1634–1647. doi: 10.1111/j.1462-5822.2010.01496.x. [DOI] [PubMed] [Google Scholar]
- 33.Naidoo N., Pillay M. Bacterial pili, with emphasis on Mycobacterium tuberculosis curli pili: potential biomarkers for point-of care tests and therapeutics. Biomarkers. 2017;22:93–105. doi: 10.1080/1354750X.2016.1252960. [DOI] [PubMed] [Google Scholar]
- 34.Renshaw P.S., Lightbody K.L., Veverka V., Muskett F.W., Kelly G., Frenkiel T.A., et al. Structure and function of the complex formed by the tuberculosis virulence factors CFP-10 and ESAT-6. EMBO J. 2005;24:2491–2498. doi: 10.1038/sj.emboj.7600732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Xolalpa W., Vallecillo A.J., Lara M., Mendoza-Hernandez G., Comini M., Spallek R., et al. Identification of novel bacterial plasminogen-binding proteins in the human pathogen Mycobacterium tuberculosis. Proteomics. 2007;7:3332–3341. doi: 10.1002/pmic.200600876. [DOI] [PubMed] [Google Scholar]
- 36.Pancholi V., Chhatwal G.S. Housekeeping enzymes as virulence factors for pathogens. International Journal of Medical Microbiology. 2003;293:391–401. doi: 10.1078/1438-4221-00283. [DOI] [PubMed] [Google Scholar]
- 37.Menozzi F.D., Bischoff R., Fort E., Brennan M.J., Locht C. vol. 95. Proceedings of the National Academy of Sciences; 1998. pp. 12625–12630. (Molecular Characterization of the Mycobacterial Heparin-Binding Hemagglutinin, a Mycobacterial Adhesin). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kumar S., Puniya B.L., Parween S., Nahar P., Ramachandran S. Identification of novel adhesins of M. tuberculosis H37Rv using integrated approach of multiple computational algorithms and experimental analysis. PLoS One. 2013;8 doi: 10.1371/journal.pone.0069790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kinhikar A.G., Vargas D., Li H., Mahaffey S.B., Hinds L., Belisle J.T., et al. Mycobacterium tuberculosis malate synthase is a laminin†binding adhesin. Mol. Microbiol. 2006;60:999–1013. doi: 10.1111/j.1365-2958.2006.05151.x. [DOI] [PubMed] [Google Scholar]
- 40.Chitale S., Ehrt S., Kawamura I., Fujimura T., Shimono N., Anand N., et al. Recombinant Mycobacterium tuberculosis protein associated with mammalian cell entry. Cell Microbiol. 2001;3:247–254. doi: 10.1046/j.1462-5822.2001.00110.x. [DOI] [PubMed] [Google Scholar]
- 41.Singh M., Martens A., Campuzano J., Amador A., Mondragón- Palomino M., Laclette J.P., et al. The PE-PGRS glycine-rich proteins of Mycobacterium tuberculosis : a new family of fibronectin-binding proteins? Microbiology (N. Y.) 1999;145:3487–3495. doi: 10.1099/00221287-145-12-3487. [DOI] [PubMed] [Google Scholar]
- 42.Alteri C.J., Xicohténcatl-Cortes J., Hess S., Caballero-Olin G., Girón J.A., Friedman R.L. Mycobacterium tuberculosis produces pili during human infection. Proc Natl Acad Sci U S A. 2007;104:5145–5150. doi: 10.1073/PNAS.0602304104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Doytchinova I.A., Flower D.R. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinf. 2007;8:1–7. doi: 10.1186/1471-2105-8-4/TABLES/2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Vita R., Mahajan S., Overton J.A., Dhanda S.K., Martini S., Cantrell J.R., et al. The immune epitope database (IEDB): 2018 update. Nucleic Acids Res. 2019;47:D339–D343. doi: 10.1093/NAR/GKY1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jespersen M.C., Peters B., Nielsen M., Marcatili P. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017;45:W24. doi: 10.1093/NAR/GKX346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Saha S., Raghava G.P.S. Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins. 2006;65:40–48. doi: 10.1002/PROT.21078. [DOI] [PubMed] [Google Scholar]
- 47.Yao B., Zhang L., Liang S., Zhang C. SVMTriP: a method to predict antigenic epitopes using support vector machine to integrate tri-peptide similarity and propensity. PLoS One. 2012;7 doi: 10.1371/JOURNAL.PONE.0045152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Teufel F., Almagro Armenteros J.J., Johansen A.R., Gíslason M.H., Pihl S.I., Tsirigos K.D., et al. SignalP 6.0 predicts all five types of signal peptides using protein language models. Nat. Biotechnol. 2022;40(40):1023–1025. doi: 10.1038/s41587-021-01156-3. 7 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dhanda S.K., Mahajan S., Paul S., Yan Z., Kim H., Jespersen M.C., et al. IEDB-AR: immune epitope database––analysis resource in 2019. Nucleic Acids Res. 2019;47:W502–W506. doi: 10.1093/nar/gkz452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Reynisson B., Alvarez B., Paul S., Peters B., Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020;48:W449–W454. doi: 10.1093/nar/gkaa379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Wang P., Sidney J., Dow C., Mothé B., Sette A., Peters B. A systematic assessment of MHC class II peptide binding predictions and evaluation of a consensus approach. PLoS Comput. Biol. 2008;4 doi: 10.1371/JOURNAL.PCBI.1000048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wang P., Sidney J., Kim Y., Sette A., Lund O., Nielsen M., et al. Peptide binding predictions for HLA DR, DP and DQ molecules. BMC Bioinf. 2010;11:1–12. doi: 10.1186/1471-2105-11-568/TABLES/5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Tshabalala M., Mellet J., Pepper M.S. Human leukocyte antigen diversity: a southern african perspective. J Immunol Res. 2015;2015 doi: 10.1155/2015/746151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Saha S., Raghava G.P.S. AlgPred: prediction of allergenic proteins and mapping of IgE epitopes. Nucleic Acids Res. 2006;34 doi: 10.1093/NAR/GKL343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Dimitrov I., Bangov I., Flower D.R., Doytchinova I. AllerTOP v.2—a server for in silico prediction of allergens. J. Mol. Model. 2014;20 doi: 10.1007/S00894-014-2278-5. [DOI] [PubMed] [Google Scholar]
- 56.Gasteiger E., Hoogland C., Gattiker A., Duvaud S., Wilkins M.R., Appel R.D., et al. Protein identification and analysis tools on the ExPASy server. The Proteomics Protocols Handbook. 2005:571–607. doi: 10.1385/1-59259-890-0. 571. [DOI] [Google Scholar]
- 57.Zhang Y. I-TASSER server for protein 3D structure prediction. BMC Bioinf. 2008;9:1–8. doi: 10.1186/1471-2105-9-40/FIGURES/4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Zheng W., Li Y., Zhang C., Zhou X., Pearce R., Bell E.W., et al. Protein structure prediction using deep learning distance and hydrogen‐bonding restraints in CASP14. Proteins: Struct., Funct., Bioinf. 2021;89:1734–1751. doi: 10.1002/prot.26193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Heo L., Park H., Seok C. GalaxyRefine: protein structure refinement driven by side-chain repacking. Nucleic Acids Res. 2013;41:W384–W388. doi: 10.1093/nar/gkt458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rapin N., Lund O., Bernaschi M., Castiglione F. Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system. PLoS One. 2010;5 doi: 10.1371/JOURNAL.PONE.0009862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Al Tbeishat H. Novel in Silico mRNA vaccine design exploiting proteins of M. tuberculosis that modulates host immune responses by inducing epigenetic modifications. Sci. Rep. 2022;12 doi: 10.1038/s41598-022-08506-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Naidoo N., Ramsugit S., Pillay M. Mycobacterium tuberculosis pili (MTP), a putative biomarker for a tuberculosis diagnostic test. Tuberculosis. 2014;94:338–345. doi: 10.1016/j.tube.2014.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Naidoo N., Pillay B., Bubb M., Pym A., Chiliza T., Naidoo K., et al. Evaluation of a synthetic peptide for the detection of anti-Mycobacterium tuberculosis curli pili IgG antibodies in patients with pulmonary tuberculosis. Tuberculosis. 2018;109:80–84. doi: 10.1016/j.tube.2018.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Junqueira-Kipnis A.P., Marques Neto L.M., Kipnis A. Role of fused Mycobacterium tuberculosis immunogens and adjuvants in modern tuberculosis vaccines. Front. Immunol. 2014;5 doi: 10.3389/FIMMU.2014.00188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Fu F., Tian H., Li X., Lang Y., Tong G., Liu S., et al. C-terminal heat shock protein 70 of Mycobacterium tuberculosis as a molecular adjuvant for DNA vaccination with the porcine circovirus type 2 ORF2 (capsid) gene in mice. Vet. J. 2013;195:244–247. doi: 10.1016/J.TVJL.2012.06.005. [DOI] [PubMed] [Google Scholar]
- 66.Tebianian M., Hoseini A.Z., Ebrahimi S.M., Memarnejadian A., Mokarram A.R., Mahdavi M., et al. Cloning, expression, and immunogenicity of novel fusion protein of Mycobacterium tuberculosis based on ESAT-6 and truncated C-terminal fragment of HSP70. Biologicals. 2011;39:143–148. doi: 10.1016/j.biologicals.2011.02.002. [DOI] [PubMed] [Google Scholar]
- 67.Lehner T, Wang Y, Whittall T, Mcgowan E, Kelly CG, Singh M. Heat Shock Proteins and Modulation of Cellular Function Functional Domains of HSP70 Stimulate Generation of Cytokines and Chemokines, Maturation of Dendritic Cells and Adjuvanticity. n.d. [DOI] [PubMed]
- 68.Ellis J. Proteins as molecular chaperones. Nature. 1987;328:378–379. doi: 10.1038/328378A0. [DOI] [PubMed] [Google Scholar]
- 69.Freudl R. Signal peptides for recombinant protein secretion in bacterial expression systems. Microb. Cell Factories. 2018;17:52. doi: 10.1186/S12934-018-0901-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Schaap-Johansen A.L., Vujović M., Borch A., Hadrup S.R., Marcatili P. T cell epitope prediction and its application to immunotherapy. Front. Immunol. 2021;12:2994. doi: 10.3389/FIMMU.2021.712488/BIBTEX. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Dai Z., Liu Z., Xiu B., Yang X., Zhao P., Zhang X., et al. A multiple-antigen detection assay for tuberculosis diagnosis based on broadly reactive polyclonal antibodies. Iran J Basic Med Sci. 2017;20:360. doi: 10.22038/IJBMS.2017.8575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pathakumari B., Prabhavathi M., Anbarasu D., Paramanandhan P., Raja A. Dynamic IgG antibody response to immunodominant antigens of M. tuberculosis for active TB diagnosis in high endemic settings. Clin. Chim. Acta. 2016;461:25–33. doi: 10.1016/J.CCA.2016.06.033. [DOI] [PubMed] [Google Scholar]
- 73.Jaganath D., Rajan J., Yoon C., Ravindran R., Andama A., Asege L., et al. Evaluation of multi-antigen serological screening for active tuberculosis among people living with HIV. PLoS One. 2020;15 doi: 10.1371/JOURNAL.PONE.0234130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Burbelo P.D., Keller J., Wagner J., Klimavicz J.S., Bayat A., Rhodes C.S., et al. Serological diagnosis of pulmonary Mycobacterium tuberculosis infection by LIPS using a multiple antigen mixture. BMC Microbiol. 2015;15 doi: 10.1186/s12866-015-0545-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Fan X., Zhao X., Wang R., Li M., Luan X., Wang R., et al. A novel multistage antigens ERA005f confer protection against Mycobacterium tuberculosis by driving Th-1 and Th-17 type T cell immune responses. Front. Immunol. 2023;14 doi: 10.3389/FIMMU.2023.1276887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Gong W., Liang Y., Mi J., Jia Z., Xue Y., Wang J., et al. Peptides-based vaccine MP3RT induced protective immunity against Mycobacterium tuberculosis infection in a humanized mouse model. Front. Immunol. 2021;12 doi: 10.3389/fimmu.2021.666290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Cheng P., Xue Y., Wang J., Jia Z., Wang L., Gong W. Evaluation of the consistence between the results of immunoinformatics predictions and real-world animal experiments of a new tuberculosis vaccine MP3RT. Front. Cell. Infect. Microbiol. 2022;12 doi: 10.3389/fcimb.2022.1047306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Cheng P., Jiang F., Wang G., Wang J., Xue Y., Wang L., et al. Bioinformatics analysis and consistency verification of a novel tuberculosis vaccine candidate HP13138PB. Front. Immunol. 2023;14 doi: 10.3389/fimmu.2023.1102578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Jiang F., Peng C., Cheng P., Wang J., Lian J., Gong W. PP19128R, a multiepitope vaccine designed to prevent latent tuberculosis infection, induced immune responses in silico and in vitro assays. Vaccines (Basel) 2023;11:856. doi: 10.3390/vaccines11040856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Andersen P., Andersen P. Host responses and antigens involved in protective immunity to Mycobacterium tuberculosis. Scand. J. Immunol. 1997;45:115–131. doi: 10.1046/J.1365-3083.1997.D01-380.X. [DOI] [PubMed] [Google Scholar]
- 81.Zhou J., Chen J., Peng Y., Xie Y., Xiao Y. A promising tool in serological diagnosis: current research progress of antigenic epitopes in infectious diseases. Pathogens. 2022;11:1095. doi: 10.3390/PATHOGENS11101095. 2022;11:1095. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Gong W., Pan C., Cheng P., Wang J., Zhao G., Wu X. Peptide-based vaccines for tuberculosis. Front. Immunol. 2022;13 doi: 10.3389/FIMMU.2022.830497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Vinod V., Vijayrajratnam S., Vasudevan A.K., Biswas R. The cell surface adhesins of Mycobacterium tuberculosis. Microbiol. Res. 2020;232 doi: 10.1016/j.micres.2019.126392. [DOI] [PubMed] [Google Scholar]
- 84.Andongma B.T., Huang Y., Chen F., Tang Q., Yang M., Chou S.-H., et al. In silico design of a promiscuous chimeric multi-epitope vaccine against Mycobacterium tuberculosis. Comput. Struct. Biotechnol. J. 2023;21:991–1004. doi: 10.1016/j.csbj.2023.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Keikha M., Karbalaei M., Ghazvini K. In silico design of multi-epitope ESAT-6:Ag85b:fcγ2a fusion protein as a novel candidate for tuberculosis vaccine. Arch Clin Infect Dis. 2020;15 doi: 10.5812/archcid.90449. [DOI] [Google Scholar]
- 86.Wang J., Xie T., Ullah I., Mi Y., Li X., Gong Y., et al. A VLP-based vaccine displaying HBHA and MTP antigens of Mycobacterium tuberculosis induces protective immune responses in M. tuberculosis H37Ra infected mice. Vaccines (Basel) 2023;11 doi: 10.3390/vaccines11050941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.White A.D., Tran A.C., Sibley L., Sarfas C., Morrison A.L., Lawrence S., et al. Spore-FP1 tuberculosis mucosal vaccine candidate is highly protective in Guinea pigs but fails to improve on BCG-conferred protection in non-human primates. Front. Immunol. 2023;14 doi: 10.3389/fimmu.2023.1246826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Forouharmehr A. Whole proteome screening to develop a potent epitope-based vaccine against Coxiella burnetii: a reverse vaccinology approach. J. Biomol. Struct. Dyn. 2024:1–13. doi: 10.1080/07391102.2024.2326198. [DOI] [PubMed] [Google Scholar]
- 89.Tang J., Huang Y., Cai Z., Ma Y. Mycobacterial heparin-binding hemagglutinin (HBHA)-induced interferon-γ release assay (IGRA) for discrimination of latent and active tuberculosis: a systematic review and meta-analysis. PLoS One. 2021;16 doi: 10.1371/journal.pone.0254571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Nemes E., Geldenhuys H., Rozot V., Rutkowski K.T., Ratangee F., Bilek N., et al. Prevention of M. tuberculosis infection with H4:IC31 vaccine or BCG revaccination. N. Engl. J. Med. 2018;379:138–149. doi: 10.1056/NEJMoa1714021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Lai R., Ogunsola A.F., Rakib T., Behar S.M. Key advances in vaccine development for tuberculosis-success and challenges. NPJ Vaccines. 2023;8:158. doi: 10.1038/s41541-023-00750-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Guo F., Wei J., Song Y., Li B., Qian Z., Wang X., et al. Immunological effects of the PE/PPE family proteins of Mycobacterium tuberculosis and related vaccines. Front. Immunol. 2023;14 doi: 10.3389/FIMMU.2023.1255920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Strong M., Sawaya M.R., Wang S., Phillips M., Cascio D., Eisenberg D. Toward the structural genomics of complexes: crystal structure of a PE/PPE protein complex from Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 2006;103:8060–8065. doi: 10.1073/PNAS.0602606103/SUPPL_FILE/02606FIG9.JPG. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Fleri W., Vaughan K., Salimi N., Vita R., Peters B., Sette A. The immune epitope database: how data are entered and retrieved. J Immunol Res. 2017;2017 doi: 10.1155/2017/5974574. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Pillay K., Coetzer T., Connolly C., Pillay B., Chiliza T., Naidoo K., et al. 2024. Diversity of IgG Antibody Response to Mycobacterium tuberculosis Curli Pili (MTP) in HIV-Positive and Negative Patients from Different Geographical Regions in Sub-saharan Africa. [Google Scholar]
- 96.Jiang J., Xie D., Zhang W., Xiao G., Wen J. 2013. Fusion of Hsp70 to Mage-A1 Enhances the Potency of Vaccine-specific Immune Responses. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Kim W.S., Kim J.-S., Kim H.M., Kwon K.W., Eum S.-Y., Shin S.J. Comparison of immunogenicity and vaccine efficacy between heat-shock proteins, HSP70 and GrpE, in the DnaK operon of Mycobacterium tuberculosis. Sci. Rep. 2018;8 doi: 10.1038/s41598-018-32799-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Kim W.S., Kim J.-S., Kim H.M., Kwon K.W., Eum S.-Y., Shin S.J. Comparison of immunogenicity and vaccine efficacy between heat-shock proteins, HSP70 and GrpE, in the DnaK operon of Mycobacterium tuberculosis. Sci. Rep. 2018;8 doi: 10.1038/s41598-018-32799-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Li X., Yang X., Li L., Liu H., Liu J. A truncated C-terminal fragment of Mycobacterium tuberculosis HSP70 gene enhanced potency of HBV DNA vaccine. Vaccine. 2006;24:3321–3331. doi: 10.1016/J.VACCINE.2006.01.012. [DOI] [PubMed] [Google Scholar]
- 100.Ghazaei C. Role and mechanism of the Hsp70 molecular chaperone machines in bacterial pathogens. J. Med. Microbiol. 2017;66:259–265. doi: 10.1099/JMM.0.000429/CITE/REFWORKS. [DOI] [PubMed] [Google Scholar]
- 101.Diaz-Silvestre H., Espinosa-Cueto P., Sanchez-Gonzalez A., Esparza-Ceron M.A., Pereira-Suarez A.L., Bernal-Fernandez G., et al. The 19-kDa antigen of Mycobacterium tuberculosis is a major adhesin that binds the mannose receptor of THP-1 monocytic cells and promotes phagocytosis of mycobacteria. Microb. Pathog. 2005;39:97–107. doi: 10.1016/J.MICPATH.2005.06.002. [DOI] [PubMed] [Google Scholar]
- 102.Noss E.H., Pai R.K., Sellati T.J., Radolf J.D., Belisle J., Golenbock D.T., et al. Toll-like receptor 2-dependent inhibition of macrophage class II MHC expression and antigen processing by 19-kDa lipoprotein of Mycobacterium tuberculosis. J. Immunol. 2001;167:910–918. doi: 10.4049/JIMMUNOL.167.2.910. [DOI] [PubMed] [Google Scholar]
- 103.Sánchez A., Espinosa P., García T., Mancilla R. The 19 kDa Mycobacterium tuberculosis lipoprotein (LpqH) induces macrophage apoptosis through extrinsic and intrinsic pathways: a role for the mitochondrial apoptosis-inducing factor. Clin. Dev. Immunol. 2012;2012:11. doi: 10.1155/2012/950503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Yeremeev V.V., Lyadova I.V., Nikonenko B.V., Apt A.S., Abou-Zeid C., Inwald J., et al. The 19-kD antigen and protective immunity in a murine model of tuberculosis. Clin. Exp. Immunol. 2000;120:274. doi: 10.1046/J.1365-2249.2000.01212.X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Post F.A., Manca C., Neyrolles O., Ryffel B., Young D.B., Kaplan G. Mycobacterium tuberculosis 19-kilodalton lipoprotein inhibits Mycobacterium smegmatis-induced cytokine production by human macrophages in vitro. Infect. Immun. 2001;69:1433–1439. doi: 10.1128/IAI.69.3.1433-1439.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Hu D., Wu J., Zhang R., Chen L., Chen Z., Wang X., et al. Autophagy-targeted vaccine of LC3-LpqH DNA and its protective immunity in a murine model of tuberculosis. Vaccine. 2014;32:2308–2314. doi: 10.1016/j.vaccine.2014.02.069. [DOI] [PubMed] [Google Scholar]
- 107.Palacios A., Sampedro L., Sevilla I.A., Molina E., Gil D., Azkargorta M., et al. Mycobacterium tuberculosis extracellular vesicle-associated lipoprotein LpqH as a potential biomarker to distinguish paratuberculosis infection or vaccination from tuberculosis infection. BMC Vet. Res. 2019;15 doi: 10.1186/S12917-019-1941-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Wehmeier S., Varghese A.S., Gurcha S.S., Tissot B., Panico M., Hitchen P., et al. Glycosylation of the phosphate binding protein, PstS, in Streptomyces coelicolor by a pathway that resembles protein O-mannosylation in eukaryotes. Mol. Microbiol. 2009;71:421–433. doi: 10.1111/J.1365-2958.2008.06536.X. [DOI] [PubMed] [Google Scholar]
- 109.Tanghe A., Lefevre P., Denis O., D'Souza S., Braibant M., Lozes E., et al. Immunogenicity and protective efficacy of tuberculosis DNA vaccines encoding putative phosphate transport receptors. J. Immunol. 1999;162:1113–1119. doi: 10.4049/JIMMUNOL.162.2.1113. [DOI] [PubMed] [Google Scholar]
- 110.Davidow A., Kanaujia G.V., Shi L., Kaviar J., Guo X.D., Sung N., et al. Antibody profiles characteristic of Mycobacterium tuberculosis infection state. Infect. Immun. 2005;73:6846. doi: 10.1128/IAI.73.10.6846-6851.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Khurshid S., Khalid R., Afzal M., Waheed Akhtar M. 2013. Truncation of PstS1 Antigen of Mycobacterium tuberculosis Improves Diagnostic Efficiency. [DOI] [PubMed] [Google Scholar]
- 112.Khurshid S., Afzal M., Khalid R., Khan I.H., Waheed Akhtar M. Improving sensitivity for serodiagnosis of tuberculosis using TB16.3-echA1 fusion protein. Tuberculosis. 2014;94:519–524. doi: 10.1016/j.tube.2014.06.006. [DOI] [PubMed] [Google Scholar]
- 113.Peng C., Shi C., Cao X., Li Y., Liu F., Lu F. Factors influencing recombinant protein secretion efficiency in gram-positive bacteria: signal peptide and beyond. Front. Bioeng. Biotechnol. 2019;7 doi: 10.3389/FBIOE.2019.00139/BIBTEX. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Dimitrov I., Flower D.R., Doytchinova I. AllerTOP--a server for in silico prediction of allergens. BMC Bioinf. 2013;14(Suppl 6) doi: 10.1186/1471-2105-14-S6-S4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Gill S.C., von Hippel P.H. Calculation of protein extinction coefficients from amino acid sequence data. Anal. Biochem. 1989;182:319–326. doi: 10.1016/0003-2697(89)90602-7. [DOI] [PubMed] [Google Scholar]
- 116.Meitzler J.L., Hinde S., Bánfi B., Nauseef W.M., De Montellano P.R.O. Conserved cysteine residues provide a protein-protein interaction surface in dual oxidase (DUOX) proteins. J. Biol. Chem. 2013;288:7147. doi: 10.1074/JBC.M112.414797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Bhopatkar A.A., Uversky V.N., Rangachari V. Granulins modulate liquid–liquid phase separation and aggregation of the prion-like C-terminal domain of the neurodegeneration-associated protein TDP-43. J. Biol. Chem. 2020;295:2506. doi: 10.1074/JBC.RA119.011501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Gront D., Kmiecik S., Blaszczyk M., Ekonomiuk D., Koliński A. Optimization of protein models. WIREs Computational Molecular Science. 2012;2:479–493. doi: 10.1002/wcms.1090. [DOI] [Google Scholar]
- 119.RomanA Laskowski, JantoonC Rullmann, MalcolmW MacArthur, Kaptein R., Thornton JanetM. AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR. 1996;8 doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
- 120.Aarthy M., Singh S.K. Advances in Protein Molecular and Structural Biology Methods. Elsevier; 2022. Envisaging the conformational space of proteins by coupling machine learning and molecular dynamics; pp. 467–475. [DOI] [Google Scholar]
- 121.Colovos C., Yeates T.O. Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci. 1993;2:1511–1519. doi: 10.1002/pro.5560020916. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.