ABSTRACT
Klebsiella pneumoniae is an opportunistic Gram-negative bacterium that has become a leading causative agent of nosocomial infections, mainly infecting patients with immunosuppressive diseases. Capsular (K) serotypes K1, K2, K47, and K64 are commonly associated with higher virulence (hypervirulent Klebsiella pneumoniae), and more threateningly, isolates belonging to the last two K serotypes are also frequently associated with resistance to carbapenem (hypervirulent carbapenem-resistant Klebsiella pneumoniae). The prevalence of these isolates has posed significant threats to human health, and there are no appropriate therapies available against them. Therefore, in this study, a method combining immunoinformatics and pangenome analysis was applied for contriving a multiepitope subunit vaccine against these four threatening serotypes. To obtain cross-protection, 12 predicted conserved antigens were screened from the core genome of 274 complete Klebsiella pneumoniae genomes (KL1, KL2, KL47, and KL64), from which the epitopes of T and B cells were extracted for vaccine construction. In addition, the immunological properties, the interaction with Toll-like receptors, and the stability in a simulative humoral environment were evaluated by immunoinformatics methods, molecular docking, and molecular dynamics simulation. All of these evaluations indicated the potency of this constructed vaccine to be an effective therapeutic agent. Lastly, the cDNA of the designed vaccine was optimized and ligated to pET-28a(+) for expression vector construction. Overall, our research provides a newly cross-protective control strategy against these troublesome pathogens and paves the way for the development of a safe and effective vaccine.
IMPORTANCE Klebsiella pneumoniae is an opportunistic Gram-negative bacterium that has become a leading causative agent of nosocomial infections. Among the numerous capsular serotypes, K1, K2, K47, and K64 are commonly associated with higher virulence (hypervirulent K. pneumoniae). More threateningly, the last two serotypes are frequently associated with resistance to carbapenem (hypervirulent carbapenem-resistant K. pneumoniae). However, there is currently no therapeutic agent or vaccine specifically against these isolates. Therefore, development of a vaccine against these pathogens is very essential. In this study, for the first time, a method combining pangenome analysis, reverse vaccinology, and immunoinformatics was applied for contriving a multiepitope subunit vaccine against K. pneumoniae isolates of K1, K2, K47, and K64. Also, the immunological properties of the constructed vaccine were evaluated and its high potency was revealed. Overall, our research will pave the way for the vaccine development against these four threatening capsular serotypes of K. pneumoniae.
KEYWORDS: Klebsiella pneumoniae, hypervirulent, carbapenem resistant, pangenome analysis, immunoinformatics, multiepitope subunit vaccine
INTRODUCTION
Klebsiella pneumoniae is a member of the family Enterobacteriaceae, a Gram-negative bacterium, and a common causative agent of community-acquired and nosocomial infections. Immunosuppressed patients, especially those in intensive care units (ICUs), are susceptible to this pathogen. Patients infected with this pathogen could be induced to develop numerous pathological characteristics, including pneumonia, bacteremia, endocarditis, meningitis, and cellulitis (1). Among the numerous capsular (K) types, K1 and K2 are often associated with highly virulent strains (2, 3), and these strains are also known as hypervirulent K. pneumoniae (hvKP). To the best of our knowledge, the isolates belonging to K types K1 and K2 alone could compose over 70% to 80% of isolates from liver abscesses (4), and they constitute almost all the isolates from meningitis or complications of endophthalmitis (5–7). Unlike classic K. pneumoniae (cKP), hvKP can develop metastatic spread and multisite infection (8). More threateningly, K47 and K64 isolates are frequently associated with resistance to carbapenem (9), which poses great challenges to antimicrobial therapy (9–11); such isolates are known as hypervirulent carbapenem-resistant K. pneumoniae (hv-CRKP). Thus, a broad-spectrum therapeutic agent needs to be designed urgently and developed against K. pneumoniae isolates of serotypes K1, K2, K47, and K64.
Vaccine treatment may be a better choice than antibiotic therapy, especially in the face of hv-CRKP isolates. Although traditional whole-cell K. pneumoniae vaccines have been considered a promising method to prevent respiratory and urinary infections by K. pneumoniae, the potential toxicity and limited protections among serotypes limit their widespread use (12). Other than that, capsular polysaccharides (CPS) and lipopolysaccharides (LPS) of K. pneumoniae with higher immunogenicity and better surface exposure make them attractive vaccine components; however, various capsular serotypes among K. pneumoniae (13–15) and potential high LPS toxicity limit the development of vaccines based on CPS and LPS.
Subunit vaccine based on certain proteins and with more flexible formulations is considered an ideal therapeutic strategy and has the potential to provide protection across different serotypes. However, the effectiveness of this type of vaccine exclusively depends on the selection of antigen proteins. Traditional antigen selection using empirical screening methods or immunoproteomics is laborious and expensive (16, 17). A rapid, more rational, comprehensive, and high-throughput approach is urgently needed.
Reverse vaccinology (RV) is an in silico method that uses only bacterial genome sequence, without any culture or empirical screening to identify protective antigens. This technology has been applied to conquering many pathogens (18–22). However, RV analysis based on the genome of a single strain or the DNA sequence of a plasmid hinders the development of vaccines that provide cross-serotype protection.
In this study, using 274 complete genomes of K. pneumoniae (containing the K serotypes KL1, KL2, KL47, and KL64), a method combining pangenome analysis and RV analysis (so-called pan-RV analysis), was used to screen the protective antigens from the core genome. Then, to conserve the immune resources of the body and make the immune response intensive, the epitopes of T and B cells were extracted from the conserved antigens to construct the multiepitope subunit vaccine. Subsequently, a series of immunological properties of the designed vaccine were evaluated, including allergenicity, antigenicity, various physicochemical properties, and immune simulation. The interactions between the structure of the contrived vaccine and two Toll-like receptors (TLRs) were elucidated by molecular docking. Also, molecular dynamics simulation was conducted to assess the stability of this multiepitope subunit K. pneumoniae vaccine in a humoral environment. Finally, codon optimization and cloning in silico were performed to ensure the efficiency of expression of the constructed vaccine in an Escherichia coli expression system. Overall, the results demonstrate the efficacy and reliability of the multiepitope subunit vaccine construct, which holds a strong rationale for further wet-lab validation for vaccine development tackling infections caused by the four threatening serotypes.
RESULTS
Pangenome analysis of complete genomes of Klebsiella pneumoniae.
Utilizing the software Prokka, a total of 274 complete genomes of K. pneumoniae were annotated, the FASTA format files of all the genomes were transformed to GFF3 format files, and these files were used to perform pangenome analysis using the program Roary. The results of pangenome analysis revealed that a total of 30,742 genes were identified among these 274 genomes; 2,519 genes were identified as the core genes. A whole-genome phylogenetic tree and a matrix with the presence and absence of core and accessory genes are shown in Fig. 1c. The pangenome of K. pneumoniae was open (shown in Fig. 1a), which means that as the number of strains analyzed increases, the total number of genes in the pangenomic pool could increase with no limit. And on the contrary, the number of conserved genes decreased and tended to be stable (shown in Fig. 1b).
RV analysis for protein prioritization.
The core proteomes identified from the pangenome analysis were subjected to reverse vaccinology (RV) analysis to prioritize the proteins, which would be used for epitope extraction in the downstream pipeline.
(i) Prediction of subcellular localization.
Subcellular localization screening of the core proteomes using PSORTb (version 3.0) showed that 1,258 proteins were cytoplasmic, 601 proteins were cytoplasmic membrane, 85 were periplasmic, 4 were extracellular, 39 were outer membrane, and 532 were of unknown localization (Fig. 2). The proteins with extracellular secretion and located at the outer membrane were selected for next analysis.
(ii) Antigenicity prediction.
The proteins identified in the upstream region were scored by VaxiJen (version 2.0). A total of 40 proteins (see Table S3 in the supplemental material) were identified as the potential antigens (with the default threshold of 0.4), of which 12 proteins (Table 1) with a score of >0.7 were screened out. These proteins were considered the prioritized proteins and were used for epitope extraction to construct the multiepitope K. pneumoniae vaccine. In addition, the serotype coverage of these 12 proteins in isolates belonging to serotypes other than the four serotypes studied is listed in Table S4.
TABLE 1.
RefSeq IDa | UniProt ID of RefSeq | Protein | Gene | VaxiJen score | No. of amino acids | Subcellular location | Annotation |
---|---|---|---|---|---|---|---|
WP_004144576.1 | A0A0H3GIV3 | PHOE | phoE | 0.7708 | 350 | Outer membrane | Outer membrane pore protein E |
WP_002895068.1 | A0A0H3GQC1 | PAL | pal_2 | 0.9239 | 174 | Outer membrane | Peptidoglycan-associated protein |
Not available | J2LUK0 | FEPA | fepA | 0.7065 | 748 | Outer membrane | Outer membrane receptor FepA |
WP_002901634.1 | A0A377ZI05 | OMPW | ompW | 0.7549 | 212 | Outer membrane | Outer membrane protein W |
WP_004217144.1 | J2X9A3 | FIU | fiu | 0.7079 | 772 | Outer membrane | Catecholate siderophore receptor Fiu |
WP_002907749.1 | A0A663AU05 | SLYB | slyB | 0.983 | 155 | Outer membrane | Outer membrane lipoprotein SlyB |
WP_002908860.1 | A0A0H3GU43 | LPP | lpp | 0.7585 | 78 | Outer membrane | Major outer membrane lipoprotein Lpp |
WP_002911596.1 | W1DR23 | OMPN | ompN_1 | 0.7072 | 381 | Outer membrane | Outer membrane protein N |
WP_004149542.1 | J2DJ81 | NLPD | nlpD_2 | 0.7571 | 376 | Outer membrane | Lipoprotein NlpD |
WP_002916050.1 | W1DS11 | KDGM | kdgM | 0.8903 | 231 | Outer membrane | Oligogalacturonate-specific porin protein KdgM |
WP_015959089.1 | A6TF12 | DAMX | damX | 0.8546 | 428 | Outer membrane | Cell division protein DamX |
WP_002921917.1 | A0A663BKZ3 | YIAD | yiaD | 0.8886 | 220 | Outer membrane | Putative lipoprotein YiaD |
ID, identifier.
HTL epitope prediction.
The helper T-lymphocyte (HTL) epitopes were identified using the IEDB server for three HLA supertypes, including HLA-DR (HLA-DRB1*01:01, HLA-DRB1*03:01, HLA-DRB1*04:01, HLA-DRB1*04:05, HLA-DRB1*07:01, HLA-DRB1*08:02, HLA-DRB1*09:01, HLA-DRB1*11:01, HLA-DRB1*12:01, HLA-DRB1*13:02, HLA-DRB1*15:01, HLA-DRB3*01:01, HLA-DRB3*02:02, HLA-DRB4*01:01,and HLA-DRB5*01:01), HLA-DQ (HLA-DQA1*05:01/DQB1*02:01, HLA-DQA1*05:01/DQB1*03:01, HLA-DQA1*03:01/DQB1*03:02, HLA-DQA1*04:01/DQB1*04:02, HLA-DQA1*01:01/DQB1*05:01, and HLA-DQA1*01:02/DQB1*06:02), and HLA-DP (HLA-DPA1*02:01/DPB1*01:01, HLA-DPA1*01:03/DPB1*02:01, HLA-DPA1*01:03/DPB1*04:01, HLA-DPA1*03:01/DPB1*04:02, HLA-DPA1*02:01/DPB1*05:01, and HLA-DPA1*02:01/DPB1*14:01). All the predicted T-cell epitopes are listed in Table S5. The epitopes with the lowest percentile rank (only the epitopes with percentile rank of <1 were sorted) and a 50% inhibitory concentration (IC50) value of <50 nM in each HLA supertype of each prioritized protein were considered HTL epitopes in our analysis pipeline. A total of 20 epitopes were screened out for our multiepitope vaccine construction (listed in Table 2).
TABLE 2.
Protein | Epitope | Allele | Percentile rank | IC50 (nM) |
---|---|---|---|---|
PHOE | MMGFVASTATQAAEV | HLA-DRB1*09:01 | 0.11 | 8.9 |
PAL_2 | QNNIVYFDLDKYDIR | HLA-DQA1*01:01/DQB1*05:01 | 0.62 | 47.1 |
DKYDIRSDFAAMLDA | HLA-DRB3*01:01 | 0.05 | 18 | |
FEPA | FNVPFFWLADQTLTL | HLA-DQA1*05:01/DQB1*02:01 | 0.73 | 39.2 |
IPGIRFDYHNQFGSN | HLA-DRB3*01:01 | 0.05 | 20.6 | |
OMPW | GINYTTFFNEDFNDT | HLA-DPA1*01:03/DPB1*02:01 | 0.85 | 30.5 |
VRPYVGAGINYTTFF | HLA-DQA1*05:01/DQB1*03:01 | 0.19 | 13.4 | |
RLDPWVFMFSAGYRF | HLA-DRB1*07:01 | 0.27 | 5.8 | |
FIU | LCLGASPAAGIAAEN | HLA-DQA1*05:01/DQB1*03:01 | 0.39 | 18.3 |
GGGVRYVGSLRRGSD | HLA-DRB5*01:01 | 0.37 | 3.3 | |
LPP1 | None | |||
SLYB | SNAIGAIGGAVLGGF | HLA-DQA1*05:01/DQB1*03:01 | 0.15 | 18.3 |
SVTYGTIVHTRAVQI | HLA-DRB1*07:01 | 0.07 | 2.9 | |
OMPN_1 | KRKVLALMVPALLMA | HLA-DRB1*01:01 | 0.16 | 34.2 |
NLPD_2 | APVSSAGGAASSSTN | HLA-DQA1*05:01/DQB1*03:01 | 0.45 | 19.2 |
AQPIQPMQTQTIQPA | HLA-DRB4*01:01 | 0.08 | 33 | |
KDGM | HLHAQYSFDNGFYVA | HLA-DRB3*01:01 | 0.12 | 5.7 |
DAMX | PAATAAAAAPAAKTG | HLA-DQA1*05:01/DQB1*03:01 | 0.02 | 6 |
LDKYVVYETSRNGQP | HLA-DRB1*04:05 | 0.65 | 30.9 | |
YIAD | GKGALIGAAAGAALG | HLA-DQA1*05:01/DQB1*03:01 | 0.02 | 13.9 |
GDNIVLNMPNNVTFD | HLA-DRB1*13:02 | 0.01 | 1.3 |
B-cell epitope prediction.
The ABCpred and BCPred servers were used to identify the B-cell epitopes. The predicted epitopes were compared, and the overlaps between the predictions of the two servers were used for multiepitope vaccine construction. The epitope identification threshold of ABCpred is set to 0.5, and epitopes screened out by the BCPred server with classifier specificity set to 75%. All the predicted B-cell epitopes are listed in Table S6. The epitope (16-mer) that overlapped between the two servers with the highest score (based on the score obtained by the ABCpred server) in each protein was selected (Table 3) for vaccine construction.
TABLE 3.
Protein | Epitope | Location in the protein (residue) | ABCpred score |
---|---|---|---|
PHOE | SEFSGNKTESDSSQKT | 79 | 0.87 |
PAL_2 | PVMAIAACSSNKNASN | 15 | 0.93 |
FEPA | QGNIYAGDTQYSNGNL | 273 | 0.92 |
OMPW | TATVRPTEGSDNVLGS | 33 | 0.9 |
FIU | RYHPGEPRTFMLTANV | 744 | 0.94 |
SLYB | IGAIGGAVLGGFLGNT | 62 | 0.89 |
LPP1 | None | ||
OMPN_1 | AGSGEGTNNGGKRKLA | 179 | 0.87 |
NLPD_2 | SGMLITPPPSGVKSAP | 49 | 0.94 |
KDGM | TVEAKWRSGGDNGSQP | 56 | 0.92 |
DAMX | PQAVAKTPVESKPVQP | 257 | 0.91 |
YIAD | RTTGMGPANPIASNST | 187 | 0.86 |
Multiepitope subunit vaccine design.
The HTL epitopes and B-cell epitopes analyzed upstream were fused with linker sequences. The HTL epitopes were linked by a GPGPG linker, the B-cell epitopes were linked by a KK linker, and the cholera toxin subunit B (CTB) was employed as an adjuvant which was linked to the N terminus of the construct with the help of an EAAAK linker. The amino acid sequence of our developed multiepitope K. pneumoniae vaccine is as follows: MMGFVASTATQAAEVGPGPGQNNIVYFDLDKYDIRGPGPGDKYDIRSDFAAMLDAGPGPGFNVPFFWLADQTLTLGPGPGIPGIRFDYHNQFGSNGPGPGGINYTTFFNEDFNDTGPGPGVRPYVGAGINYTTFFGPGPGRLDPWVFMFSAGYRFGPGPGLCLGASPAAGIAAENGPGPGGGGVRYVGSLRRGSDGPGPGSNAIGAIGGAVLGGFGPGPGSVTYGTIVHTRAVQIGPGPGKRKVLALMVPALLMAGPGPGAPVSSAGGAASSSTNGPGPGAQPIQPMQTQTIQPAGPGPGHLHAQYSFDNGFYVAGPGPGPAATAAAAAPAAKTGGPGPGLDKYVVYETSRNGQPGPGPGGKGALIGAAAGAALGGPGPGGDNIVLNMPNNVTFDGPGPGSEFSGNKTESDSSQKTKKIGAIGGAVLGGFLGNTKKPVMAIAACSSNKNASNKKRTTGMGPANPIASNSTKKSGMLITPPPSGVKSAPKKTATVRPTEGSDNVLGSKKAGSGEGTNNGGKRKLAKKTVEAKWRSGGDNGSQPKKPQAVAKTPVESKPVQPKKRYHPGEPRTFMLTANVKKQGNIYAGDTQYSNGNLEAAAKMIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAIITFKNGAIFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAISMAN.
Prediction of allergenicity, antigenicity, and various physicochemical properties.
The allergenicity of the vaccine was evaluated using the AllergenFP (version 1.0) and AllerTOP (version 2.0) servers; the results of both servers indicated that the sequence of vaccine corresponds to a probable nonallergen. The antigenicity of the vaccine was revalidated using the VaxiJen server, and the score of antigenicity obtained by the server was 0.9223 (a protein with a score above 0.4 is predicted as a probable antigen).
Various physicochemical properties were assessed utilizing the web server ProtPara; the results of assessment are as follows. Our designed vaccine consists of 725 amino acids with 44 negatively charged residues and 69 positively charged residues, and its molecular weight is 73.9 kDa. The theoretical pI of vaccine protein is predicted to be 9.62. The vaccine contains 10,322 atoms, and its chemical formula is written as C3277H5117N921O987S20. The estimated half-life is 30 h in vitro, and it could reach more than 20 h and 10 h in yeast (in vivo) and in Escherichia coli (in vivo), respectively. The instability index (II) was computed to be 30.53, classifying the vaccine as a stable protein. The aliphatic index was calculated to be 62.68, suggesting its relatively higher thermostability. The grand average of hydropathicity (GRAVY) value of the vaccine is −0.336, indicating the hydrophilicity of the protein.
Immune simulations in silico.
The C-ImmSim server was utilized for performing immune simulation and generating the immune response profile of the designed vaccine. Three injections were conducted at 4-week intervals, and a total of 200 days of immunological data were recorded.
The simulation analysis showed that the antibody (Ab) titer had two steep rises after two later administrations (Fig. 3a), indicating that a strong humoral immune response was generated. Compared with the primary immune response, the secondary immune response was largely augmented, with more active B cells (Fig. 3b and c), helper T cells (Fig. 3d and e), and cytotoxic T cells (Fig. 3f). Furthermore, the increased concentration of dendritic cells (Fig. 3g) and macrophages (Fig. 3h) indicated a good antigen representation by these antigen-presenting cells (APCs). Importantly, the concentration of cytokines and interleukins is recorded in Fig. 3i, showing the relatively higher levels after the second injection. Overall, our designed vaccine has the potential ability to induce high levels of Ab, activated B cells and T cells, cytokines, and APCs against the pathogen.
Prediction, refinement, and quality assessment of the 3D structure of the developed multiepitope subunit vaccine.
The three-dimensional (3D) structure of the designed vaccine was constructed using the Phyre 2 server, and subsequently, the initial structure was refined by the Galaxyrefine server. A model with the highest Rama favored score (80.5) was adopted among the five generated models as the final structure for downstream analysis, and the structure of this model was visualized by PyMOL (Fig. 4a).
A model with an ideal state should have more residues in the Ramachandran-favored region and have fewer in the outlier region and rotamer region; the Ramachandran plot of the final structure of our developed vaccine revealed that there were 80.50% of residues in the Ramachandran-favored region, only 3.73% in the outlier region, and 0.95% in the rotamer region (Fig. 4b). The MolProbity and Clash scores were 2.12 and 7.06, respectively. Furthermore, the overall quality and potential error of the final structure were evaluated by ProSA, and the Z-score was calculated to be −3.7, the point indicating that the vaccine protein fell with the range of experimental structures (Fig. 4c).
Molecular dynamics simulation of the multiepitope subunit vaccine.
The molecular dynamics of the final vaccine protein was simulated using GROMACS to predict its stability in the biological environment. The OPLS-AA (Optimized Potential for Liquid Simulation - All Atom) force field was applied, and the vaccine protein was placed in the center of the cubic box with a total of 34,020 water molecules added (Fig. 5a). To ensure that the system was electrically neutral, 25 chloride ions were added into the box (Fig. 5a). Subsequently, energy minimization was performed toward the solvated and electroneutral system, using the steepest-descent minimization algorithm; the system energy was eventually reduced to around −2,000,000 KJ mol−1 (Fig. 5b). After that, NVT ensemble (Fig. 5c) and NPT ensemble (Fig. 5d) were conducted with settings of 300K and 1 atm, respectively. The root mean square deviation (RMSD) of the vaccine showed that the RMSD value was kept around 0.4 nm after 6 ns (Fig. 5e), indicating that the conformation of our designed vaccine was stable. The results of root medium square fluctuation (RMSF) (Fig. 6f) indicated that residues 300 to 400 and 500 to 725 have relatively lower structure flexibility, with a lower RMSF value, while the region of residues 400 to 500 has higher flexibility, with a higher RMSF value.
Molecular docking of the multiepitope subunit vaccine structure with Toll-like receptors.
The ClusPro server was utilized for performing molecular docking of the designed vaccine with TLR2 and TLR4, respectively. For each docking, a total of 30 clusters were generated by the server; the cluster with the lowest energy score was considered the docking result. The lowest energy score of the vaccine-TLR2 docking cluster was −1,315.4, while that of the vaccine-TLR4 docking cluster was −1,264.1. The results of molecular docking of vaccine with TLR2 and TLR4 were visualized by PyMOL (Fig. 6a and c).
To further understand the interaction between vaccine and the TLRs, the PDBsum tool was used to reveal the residues and binding forces of receptor-ligand interactions. In the docking of vaccine and TLR2 (Fig. 6b), the number of interface residues of vaccine and TLR was 28; the interface areas of vaccine and TLR were 1,270 Å2 and 1,329 Å2, respectively. As for binding force, 5 salt bridges, 17 hydrogen bonds, and 166 nonbonded contacts were detected. In the docking of vaccine and TLR4 (Fig. 6d), the number of interface residues of the vaccine was 38, and that of TLR4 was 30. The interface areas of vaccine and TLR were 1,606 Å2 and 1,724 Å2, respectively. A total of 9 salt bridges, 23 hydrogen bonds, and 233 nonbonded contacts were found in the vaccine-TLR4 complex. The results indicate that the designed vaccine has high affinity with these two types of TLRs.
Codon adaption and in silico cloning.
Using the JCat server, cDNA of our designed vaccine was generated and codon adaption was performed to improve the cDNA sequence (Text S7). The codon adaptation index (CAI) value and the GC content of the improved sequence were 1.0 and 54.53%, respectively, indicating high-level expression of our designed vaccine in E. coli K-12. Vector construction and visualization were performed utilizing SnapGene software; the vaccine sequence was inserted (Fig. 7, marked with red color) between the restriction sites BamHI and HindIII.
DISCUSSION
Klebsiella pneumoniae is an opportunistic pathogen causing both community-acquired and nosocomial infections. K. pneumoniae isolates of capsular serotypes K1, K2, K47, and K64 are frequently identified as members of either of the families hvKP and hv-CRKP, which have posed significant threats to human health (2–4, 9). Vaccination is a good choice for its ability to elicit strong immune responses against drug-resistant isolates. However, conventional vaccines have many limitations; for example, the vaccines based on whole cells and LPS are limited by potential toxicity, and those of others based on CPS have poor protective ability of cross serotypes. Meanwhile, the process of traditional vaccine development is expensive, laborious, and time-consuming. Promisingly, new technologies in the field of immunoinformatics are already accelerating the development of vaccines by efficient in silico screening of ideal epitope candidates in a target antigen that has the potential to evoke both humoral and cellular immune responses (23). During the past few years, these technologies have also facilitated the development of epitope-based vaccines, and the immunoinformatics approach has been applied widely to design multiepitope vaccines against various pathogens (24, 25). In contrast to traditional vaccines, multiepitope vaccines have many competing advantages, such as lower reactivity, more intensive immune responses, refined stability, and improved solubility (26).
However, the majority of current multiepitope vaccines developed using immunoinformatics are based on empirical protective proteins, which limits their potential to provide broad-spectrum protection. Therefore, in this study, for the first time, an approach combining immunoinformatics and pangenome analysis was employed to develop a multiepitope subunit vaccine against K1, K2, K47, and K64 K. pneumoniae. To obtain cross-protectivity, pangenome-based reverse vaccinology (pan-RV) analysis was performed first, and a total of 12 proteins predicted as protective antigens were screened from the core genome of 274 K. pneumoniae isolates (covering the four K serotypes KL1, KL2, KL47, and KL64). Interestingly, some of these proteins have been previously confirmed as the immunological effectors among various pathogens, such as PHOE (27, 28), PAL (29, 30), NLPD (31), LPP (32), OMPW (33–35), OMPN (36, 37), and DAMX (38). More importantly, SLYB, YIAD, KDGM, FIU, and FEPA, identified in this study, may be new antigens with potential protection for cross-serotype K. pneumoniae infections. Meanwhile, more meaningfully, the results regarding the coverage of these proteins in strains other than these four serotypes suggest that they may have a more broad-spectrum protective potential. Twenty T-cell epitopes and 11 B-cell epitopes were predicted and extracted from these 12 antigen proteins, linked by the linker, and cholera toxin subunit B (CTB), with the ability to improve the antigenicity (39), was added to the N terminus to construct the vaccine.
After completing the construction of the vaccine, a series of properties were evaluated. The constructed vaccine was predicted as a nonallergen with high immunogenicity. The results of ProtParam showed that the molecular weight of the designed vaccine is 73.9 kDa, indicating that the vaccine protein is easy to express and purify. The instability index of 30.53 illustrates its stability, the aliphatic index of 62.68 indicates its higher thermostability, and the protein also has better hydrophilicity, with a GRAVY value of −0.336 (40). The immune simulations performed by C-ImmSim revealed that this constructed vaccine has the potential to elicit both humoral and cellular immune responses and induce high levels of cytokines and APCs against the pathogen. Further, the 3D structure of our developed vaccine was predicted using Phyre 2 and refined by the Galaxyrefine server. A Ramachandran plot of the final vaccine structure showed 80.5% of residues to be in the favored region, while only 3.73% were in the outlier region. The Z-score generated by ProSA is −3.7; both two indexes mentioned above indicate the high quality of the predicted 3D structure. Next, molecular dynamics simulation of this structure further proved its stability in the humoral environment, with a lower RMSD value. Additionally, considering the important role of TLR2 and TLR4 in both host-pathogen interactions (infections by K. pneumoniae could make the TLRs in human airway epithelial cells overexpressed, mainly TLR2 and TLR4 [41]) and activation of the innate immune response, molecular docking of constructed vaccine with these two Toll-like receptors was performed. The results of docking confirmed that our designed vaccine has strong interaction with TLR2 and TLR4. Finally, the cDNA of the designed vaccine was optimized for adaption for E. coli and ligated with pET-28a(+) to construct an expression vector.
Overall, the vaccine designed in this study is based on the concept of being able to provide cross-protection and was evaluated in silico to have ideal immunological properties. Although wet-lab validation is required to determine its safety and effectiveness, our research will pave the way for the vaccine development against these four threatening serotypes of KP isolates.
Conclusion.
Currently, the prevalence of Klebsiella pneumoniae, especially associated with high virulence and high drug resistance, has posed significant threats to human health. In this study, a method combining pangenome analysis and immunoinformatics was employed for contriving a multiepitope subunit vaccine against the four threatening serotypes of K. pneumoniae. This designed vaccine contains multiple epitopes of T and B cells from the conserved protective antigens based on pangenome analysis. The results of immunoinformatics analysis and molecular docking indicated that it has good immunological properties and high affinity with immunoreceptors (TLR2 and TLR4). In addition, high stability in the biological environment was confirmed using molecular dynamics simulation. Finally, the cDNA of the designed vaccine was optimized and the expression vector was constructed. Despite the further wet-lab validation is required to determine its safety and effectiveness, our in silico research will pave the way for the vaccine development against these four threatening serotypes of K. pneumoniae.
MATERIALS AND METHODS
The workflow of our developed multiepitope KP vaccine based on immunoinformatics is illustrated in Fig. 8.
Genome retrieval and pangenome analysis.
A total of 274 complete genomes of K. pneumoniae isolates (human isolates) encompassing K serotypes KL1, KL2, KL47, and KL64 were retrieved from the NCBI. The information for these genomes is listed in Table S1.
Using Prokka (version 1.14) (42), the coding DNA sequences (CDSs) of all the genomes were annotated, and GFF3 format files of all genome sequences were generated for pangenome analysis. Two hundred seventy-four proteomes were analyzed by Roary (version 3.13.0) (43) for identification of the core proteomes with the default settings (genes that are found in 99% to 100% of isolates were considered core genes).
Reverse vaccinology analysis for protein prioritization.
(i) Prediction of subcellular localization. The core proteomes identified by pangenome analysis were uploaded to the PSORTb server (version 3.0.2; http://www.psort.org/psortb/) (44) for prediction of subcellular localization; all the settings were default and chose “Negative” in the option of “Choose Gram stain” for submission. Only the proteins predicted to be located in the outer membrane or those predicted as extracellular proteins could be screened out for the next analysis.
(ii) Prediction of antigenicity.
The amino acid sequences of proteins filtered in previous steps were uploaded to VaxiJen (version 2.0) (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) (45) to perform antigenicity prediction. This prediction method, based on physicochemical properties of proteins, does not recourse to sequence alignment, the precision rate of which ranges from 70% to 89%. In order to maximize screening efficiency, proteins with a VaxiJen score greater than 0.7 were considered the prioritized proteins for epitope extraction.
In addition, serotype coverage of these prioritized proteins was evaluated among the K. pneumoniae isolates outside the four serotypes (accession numbers of 613 K. pneumoniae genomes are listed in Table S2).
Helper T-lymphocyte (HTL) epitope prediction.
An online tool (MHC-II Binding Predictions) (46) in the Immune Epitope Database server (IEDB; http://www.iedb.org/) were employed for prediction of major histocompatibility complex class II (MHC-II) binding epitopes. Proteins prioritized using pan-RV strategy on core genome were uploaded to the server with binding predictions for full HLA reference set using the IEDB-recommended 2.22 prediction method. This method uses the consensus approach, combining NN-align, SMM-align, CombLib, and Sturniolo if any corresponding predictor is available for the molecule; otherwise, NetMHCIIpan is used (47–51). Prediction results were evaluated by the percentile rank and IC50, peptides with a small-numbered percentile rank and IC50 value of <50 nM were considered high affinity and used for vaccine construction.
B-cell epitope prediction.
The ABCpred (http://crdd.osdd.net/raghava/abcpred/) and BCPRED (http://ailab.ist.psu.edu/bcpred/) servers were used for the prediction of B-cell epitopes of the prioritized proteins. The ABCpred server, based on an artificial neural network (ANN), is able to predict B-cell epitopes with 65.93% accuracy (52); default settings were applied and B-cell epitopes with scores over 0.51 were identified. The BCPred server is based on a support vector machine (SVM) algorithm using string kernels for B-cell epitope prediction (53). The same proteins as uploaded to the ABCpred server were uploaded to the BCPred server; all settings were default except for changing epitope length to 16 to make it consistent with ABCpred analysis. The overlaps (based on the results from the ABCpred server) between the predictions of the two servers were used for multiepitope vaccine construction.
Multiepitope subunit vaccine design.
The MHC-II binding epitopes and B-cell epitopes screened in the upstream analysis were used for multiepitope subunit vaccine design. To enable effective separation of epitopes in vivo and to avoid the possibility of junctional epitope formation, the HTL epitopes were linked by the GPGPG linker, and B-cell epitopes were linked by the KK linker (54). Cholera toxin subunit B (CTB), retrieved from UniProt (https://www.uniprot.org/uniprot/P01556), was employed as an adjuvant to attach the N-terminal end of the vaccine using the EAAAK linker. CTB is the nontoxic portion of cholera toxin; it has affinity with the monosialotetrahexosylganglioside (GM1) that is widely distributed on various cell types, such as gut epithelial cells, macrophages, dendritic cells, and B cells, which enables it to be better exposed with the immune system (39).
Prediction of allergenicity, antigenicity, and various physicochemical properties.
Proteins with allergenicity could induce a harmful immune response, and a vaccine itself should be nonallergic. Two bioinformatics tools—AllergenFP version 1.0 and AllerTOP version 2.0—were used for allergenicity prediction. AllergenFP utilizes a novel alignment-free descriptor-based fingerprint approach to identify allergens and nonallergens, while AllerTOP bases on a K-nearest neighbor algorithm with 85.3% accuracy at 5-fold cross-validation (55, 56).
The amino acid sequence of the multiepitope vaccine was uploaded to VaxiJen version 2.0 (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) to evaluate the antigenicity of the designed vaccine.
To assess the physicochemical properties of the vaccine, the web server ProtParam (https://web.expasy.org/protparam/) was employed (57), and the parameters, including the molecular weight, theoretical pI, amino acid composition, atomic composition, extinction coefficient, estimated half-life, instability index, aliphatic index, and grand average of hydropathicity (GRAVY) were computed. Molecular weight and theoretical pI are calculated from the input sequence; the amino acid and atomic compositions are self-explanatory. The extinction coefficient indicates light absorption of a protein at a certain wavelength; it is estimated by amino acid composition. In vivo half-life evaluation of proteins relied on the principle of “N-end rule” (58). The instability index estimates the stability of protein in a test tube; the protein is considered stable when its instability index in smaller than 40. The aliphatic index reflects the relative volume occupied by aliphatic side chains in a certain protein, which is regarded as a positive factor associated with thermostability of globular proteins. The GRAVY value for a protein is calculated as the sum of hydropathy values (59), indicating the hydrophobic nature of the protein.
Immune simulation.
In silico immune simulation was performed to predict immune response profile using the C-ImmSim server (https://kraken.iac.rm.cnr.it/C-IMMSIM/) (60). C-ImmSim is an agent-based simulator of the immune response that utilizes the Celada-Seiden model to simulate the mammalian immune system against constructive vaccine to generate the immune profiles, both humoral and cellular. All the settings remain default except for setting “Simulation Steps” to 600 and the injection interval to 1 month (three injections administered).
Finally, to avoid inducing autoimmunity, the homology between designed vaccine and human protein was checked using BLASTp online server (https://blast.ncbi.nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSearch&LINK_LOC=blasthome) (61). Ideally, the designed vaccine should have no homology with human proteins.
Prediction, refinement, and quality assessment of the 3D structure of the developed multiepitope subunit vaccine.
The Phyre 2 server (http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index) was used to predict the three-dimensional structure of the designed vaccine; all settings on the webpage remained default except for changing the modeling mode from “normal” to “intensive.” This optional “intensive mode” combining multiple template modeling with simplified ab initio folding simulation could create a complete full-length model of the uploaded sequence (62).
To improve the conformational structure of the predicted protein, the 3D structure of designed vaccine modeled by Phyre 2 server was further refined using the GalaxyRefine server (http://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE). After performing repeated structure perturbation and global structure relaxation by molecular dynamics simulation, a total of five models of the multiepitope vaccine were generated, and among them, structure perturbation was applied only to clusters of side chains in model 1, while more aggressive perturbation to secondary-structure elements and loops were applied in models 2 to 5 (63).
The PDB file of refined structure of our developed vaccine was uploaded to the SWISS-MODEL server (https://swissmodel.expasy.org/assess) (64) for tertiary-structure assessment. A Ramachandran plot was generated in the analysis results; this plot indicates the energetically favored regions for backbone dihedral angles against of amino acid residues in protein structure. In addition, due to MolProbity-based running, the most relevant scores were provided and the residues with low quality in the developed structure were easily identified. Then the ProSA web server (https://prosa.services.came.sbg.ac.at/prosa.php) was employed for protein structure validation. ProSA calculates an overall quality score for a certain protein structure, and a score that is outside the range characteristic for native proteins may indicate that errors exist in the structure (65).
Molecular dynamics simulation of the multiepitope subunit vaccine.
In order to further assess the stability of our developed vaccine in a biological environment, a command line-based software, GROMACS (version 2020) (66), was used. The parameter settings and analysis steps are as follows. (i) The PDB file of the designed vaccine containing only protein atoms was input into a GROMACS module called pdbgmx. The OPLS-AA (Optimized Potential for Liquid Simulation - All Atom) force field was selected and a gro file compatible with it was generated. (ii) The box type was defined as a cube and the distance from vaccine protein to box edge was set to >1.0 nm using the editconf module. Then the solvate module was used to fill the box with water. And after that, the genion tool was employed for adding ions to keep the system electrically neutral. (iii) Energy minimization was performed to ensure that the system had no steric clashes or inappropriate geometry using the steepest-descent minimization algorithm, with which the energy of the system will rapidly decrease to below 1,000 KJ mol−1 nm−1 within 5,000 maximum steps. (iv) Two phases of equilibration (NVT and NPT) were conducted to equilibrate the solvent and ions around the vaccine protein. The first phase was performed under the NVT ensemble with the restrained system heated to 300K under constant volume conditions in 100 ps. After reaching the set temperature, the second phase was imposed under the NPT ensemble using the Parrinello-Rahman pressure coupling method with a 105-Pa pressure setting; the simulation steps were same as that in NVT ensemble with 100 ps. (v) Molecular dynamics simulation was carried out toward the final system for 10 ns, with coordinates recorded every 10 ps.
Molecular docking of the multiepitope subunit vaccine structure with Toll-like receptors.
To evaluate the binding affinity between our contrived vaccine and human Toll-like receptors (TLR), TLR2 and TLR4 were separately docked with vaccine protein. The structure of TLR2 was obtained from PDB code 2Z7X, while that of TLR4 was retrieved from PDB code 3FXI. A web-based tool called ClusPro (version 2.0) (https://cluspro.bu.edu/) (67) was utilized for molecular docking; this tool uses three computational steps for protein-protein docking. First, rigid body docking is performed by sampling billions of conformations. Then the root mean square deviation (RMSD) of the 1,000 clusters with the lowest energy structures is calculated to match the most likely model of the complex. Finally, the selected model is applied with energy minimization to refine the structure, and the top 10 docking results are listed separately. The 3D structures of Toll-like receptors and the vaccine-TLR complex generated from molecular docking were visualized using PyMOL software (version 2.4.0). PDBsum (68) was used to map the amino acid residues that interacted between the contrived vaccine and TLRs.
Codon adaption and in silico cloning.
A JAVA-based tool called JCat (http://www.jcat.de/Start.jsp) (69) was used for codon optimization to adapt the codon usage by the prokaryotic engineered bacteria (Escherichia coli). The amino acid sequence of the designed vaccine was pasted into the input window, the type of pasted sequence was set to “Protein sequence,” and the prokaryotic organism was chosen as Escherichia coli K-12. The vaccine-adapted DNA sequence was ligated in the multiple-cloning site (MCS) of E. coli plasmid pET-28a(+) to express the vaccine protein using SnapGene (version 4.2.4) (https://www.snapgene.com/).
ACKNOWLEDGMENTS
This work was supported by grants from the Key Research and Development Plan of Jiangsu Province (BE2019304), the Guidance Foundation of the Sanya Institute of Nanjing Agriculture University (NAUSY-MS12), the Key Research Project of Jiangsu Commission of Health (ZD2021037), and the National Key R&D Program of China (2018YFC1602500).
Footnotes
Supplemental material is available online only.
Contributor Information
Zhongming Tan, Email: jstzm@jscdc.cn.
Wei Zhang, Email: vszw@njau.edu.cn.
Florence Claude Doucet-Populaire, University Paris-Saclay, AP-HP Hôpital Antoine Béclère, Service de Microbiologie, Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS.
REFERENCES
- 1.Tumbarello M, Trecarichi EM, De Rosa FG, Giannella M, Giacobbe DR, Bassetti M, Losito AR, Bartoletti M, Del Bono V, Corcione S, Maiuro G, Tedeschi S, Celani L, Cardellino CS, Spanu T, Marchese A, Ambretti S, Cauda R, Viscoli C, Viale P, Isgri S, ISGRI-SITA (Italian Study Group on Resistant Infections of the Società Italiana Terapia Antinfettiva). 2015. Infections caused by KPC-producing Klebsiella pneumoniae: differences in therapy and mortality in a multicentre study. J Antimicrob Chemother 70:2133–2143. doi: 10.1093/jac/dkv086. [DOI] [PubMed] [Google Scholar]
- 2.Wang JH, Liu YC, Lee SS, Yen MY, Chen YS, Wang JH, Wann SR, Lin HH. 1998. Primary liver abscess due to Klebsiella pneumoniae in Taiwan. Clin Infect Dis 26:1434–1438. doi: 10.1086/516369. [DOI] [PubMed] [Google Scholar]
- 3.Yu WL, Ko WC, Cheng KC, Lee CC, Lai CC, Chuang YC. 2008. Comparison of prevalence of virulence factors for Klebsiella pneumoniae liver abscesses between isolates with capsular K1/K2 and non-K1/K2 serotypes. Diagn Microbiol Infect Dis 62:1–6. doi: 10.1016/j.diagmicrobio.2008.04.007. [DOI] [PubMed] [Google Scholar]
- 4.Wang TC, Lin JC, Chang JC, Hiaso YW, Wang CH, Chiu SK, Fung CP, Chang FY, Siu LK. 2021. Virulence among different types of hypervirulent Klebsiella pneumoniae with multi-locus sequence type (MLST)-11, serotype K1 or K2 strains. Gut Pathog 13:40. doi: 10.1186/s13099-021-00439-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fung CP, Chang FY, Lee SC, Hu BS, Kuo BI, Liu CY, Ho M, Siu LK. 2002. A global emerging disease of Klebsiella pneumoniae liver abscess: is serotype K1 an important factor for complicated endophthalmitis? Gut 50:420–424. doi: 10.1136/gut.50.3.420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Merlet A, Cazanave C, Dutronc H, de Barbeyrac B, Brisse S, Dupon M. 2012. Primary liver abscess due to CC23-K1 virulent clone of Klebsiella pneumoniae in France. Clin Microbiol Infect 18:E338–E339. doi: 10.1111/j.1469-0691.2012.03953.x. [DOI] [PubMed] [Google Scholar]
- 7.Shon AS, Bajwa RP, Russo TA. 2013. Hypervirulent (hypermucoviscous) Klebsiella pneumoniae: a new and dangerous breed. Virulence 4:107–118. doi: 10.4161/viru.22718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Marr CM, Russo TA. 2019. Hypervirulent Klebsiella pneumoniae: a new public health threat. Expert Rev Anti Infect Ther 17:71–73. doi: 10.1080/14787210.2019.1555470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yang Q, Jia X, Zhou M, Zhang H, Yang W, Kudinha T, Xu Y. 2020. Emergence of ST11-K47 and ST11-K64 hypervirulent carbapenem-resistant Klebsiella pneumoniae in bacterial liver abscesses from China: a molecular, biological, and epidemiological study. Emerg Microbes Infect 9:320–331. doi: 10.1080/22221751.2020.1721334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gu D, Dong N, Zheng Z, Lin D, Huang M, Wang L, Chan EW, Shu L, Yu J, Zhang R, Chen S. 2018. A fatal outbreak of ST11 carbapenem-resistant hypervirulent Klebsiella pneumoniae in a Chinese hospital: a molecular epidemiological study. Lancet Infect Dis 18:37–46. doi: 10.1016/S1473-3099(17)30489-9. [DOI] [PubMed] [Google Scholar]
- 11.Huang YH, Chou SH, Liang SW, Ni CE, Lin YT, Huang YW, Yang TC. 2018. Emergence of an XDR and carbapenemase-producing hypervirulent Klebsiella pneumoniae strain in Taiwan. J Antimicrob Chemother 73:2039–2046. doi: 10.1093/jac/dky164. [DOI] [PubMed] [Google Scholar]
- 12.Assoni L, Girardello R, Converso TR, Darrieux M. 2021. Current stage in the development of Klebsiella pneumoniae vaccines. Infect Dis Ther 10:2157–2175. doi: 10.1007/s40121-021-00533-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Pletz MW, Uebele J, Götz K, Hagel S, Bekeredjian-Ding I. 2016. Vaccines against major ICU pathogens: where do we stand? Curr Opin Crit Care 22:470–476. doi: 10.1097/MCC.0000000000000338. [DOI] [PubMed] [Google Scholar]
- 14.Choi M, Hegerle N, Nkeze J, Sen S, Jamindar S, Nasrin S, Sen S, Permala-Booth J, Sinclair J, Tapia MD, Johnson JK, Mamadou S, Thaden JT, Fowler VG, Jr, Aguilar A, Teran E, Decre D, Morel F, Krogfelt KA, Brauner A, Protonotariou E, Christaki E, Shindo Y, Lin YT, Kwa AL, Shakoor S, Singh-Moodley A, Perovic O, Jacobs J, Lunguya O, Simon R, Cross AS, Tennant SM. 2020. The diversity of lipopolysaccharide (O) and capsular polysaccharide (K) antigens of invasive Klebsiella pneumoniae in a multi-country collection. Front Microbiol 11:1249. doi: 10.3389/fmicb.2020.01249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Follador R, Heinz E, Wyres KL, Ellington MJ, Kowarik M, Holt KE, Thomson NR. 2016. The diversity of Klebsiella pneumoniae surface polysaccharides. Microb Genom 2:e000073. doi: 10.1099/mgen.0.000073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bao Y, Zhai Z, Wang S, Ma J, Zhang W, Lu C. 2013. Chaperonin GroEL: a novel phylogenetically conserved protein with strong immunoreactivity of avian pathogenic Escherichia coli isolates from duck identified by immunoproteomics. Vaccine 31:2947–2953. doi: 10.1016/j.vaccine.2013.04.042. [DOI] [PubMed] [Google Scholar]
- 17.Hisham Y, Ashhab Y. 2018. Identification of cross-protective potential antigens against pathogenic Brucella spp. through combining pan-genome analysis with reverse vaccinology. J Immunol Res 2018:1474517. doi: 10.1155/2018/1474517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pizza M, Scarlato V, Masignani V, Giuliani MM, Arico B, Comanducci M, Jennings GT, Baldi L, Bartolini E, Capecchi B, Galeotti CL, Luzzi E, Manetti R, Marchetti E, Mora M, Nuti S, Ratti G, Santini L, Savino S, Scarselli M, Storni E, Zuo P, Broeker M, Hundt E, Knapp B, Blair E, Mason T, Tettelin H, Hood DW, Jeffries AC, Saunders NJ, Granoff DM, Venter JC, Moxon ER, Grandi G, Rappuoli R. 2000. Identification of vaccine candidates against serogroup B meningococcus by whole-genome sequencing. Science 287:1816–1820. doi: 10.1126/science.287.5459.1816. [DOI] [PubMed] [Google Scholar]
- 19.Wizemann TM, Heinrichs JH, Adamou JE, Erwin AL, Kunsch C, Choi GH, Barash SC, Rosen CA, Masure HR, Tuomanen E, Gayle A, Brewah YA, Walsh W, Barren P, Lathigra R, Hanson M, Langermann S, Johnson S, Koenig S. 2001. Use of a whole genome approach to identify vaccine molecules affording protection against Streptococcus pneumoniae infection. Infect Immun 69:1593–1598. doi: 10.1128/IAI.69.3.1593-1598.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Chakravarti DN, Fiske MJ, Fletcher LD, Zagursky RJ. 2000. Application of genomics and proteomics for identification of bacterial gene products as potential vaccine candidates. Vaccine 19:601–612. doi: 10.1016/S0264-410X(00)00256-5. [DOI] [PubMed] [Google Scholar]
- 21.Montigiani S, Falugi F, Scarselli M, Finco O, Petracca R, Galli G, Mariani M, Manetti R, Agnusdei M, Cevenini R, Donati M, Nogarotto R, Norais N, Garaguso I, Nuti S, Saletti G, Rosa D, Ratti G, Grandi G. 2002. Genomic approach for analysis of surface proteins in Chlamydia pneumoniae. Infect Immun 70:368–379. doi: 10.1128/IAI.70.1.368-379.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ariel N, Zvi A, Grosfeld H, Gat O, Inbar Y, Velan B, Cohen S, Shafferman A. 2002. Search for potential vaccine candidate open reading frames in the Bacillus anthracis virulence plasmid pXO1: in silico and in vitro screening. Infect Immun 70:6817–6827. doi: 10.1128/IAI.70.12.6817-6827.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Adu-Bobie J, Capecchi B, Serruto D, Rappuoli R, Pizza M. 2003. Two years into reverse vaccinology. Vaccine 21:605–610. doi: 10.1016/S0264-410X(02)00566-2. [DOI] [PubMed] [Google Scholar]
- 24.Nezafat N, Eslami M, Negahdaripour M, Rahbar MR, Ghasemi Y. 2017. Designing an efficient multi-epitope oral vaccine against Helicobacter pylori using immunoinformatics and structural vaccinology approaches. Mol Biosyst 13:699–713. doi: 10.1039/c6mb00772d. [DOI] [PubMed] [Google Scholar]
- 25.Chatterjee N, Ojha R, Khatoon N, Prajapati VK. 2018. Scrutinizing Mycobacterium tuberculosis membrane and secretory proteins to formulate multiepitope subunit vaccine against pulmonary tuberculosis by utilizing immunoinformatic approaches. Int J Biol Macromol 118:180–188. doi: 10.1016/j.ijbiomac.2018.06.080. [DOI] [PubMed] [Google Scholar]
- 26.Abdulla F, Adhikari UK, Uddin MK. 2019. Exploring T & B-cell epitopes and designing multi-epitope subunit vaccine targeting integration step of HIV-1 lifecycle using immunoinformatics approach. Microb Pathog 137:103791. doi: 10.1016/j.micpath.2019.103791. [DOI] [PubMed] [Google Scholar]
- 27.Tommassen J, Agterberg M, Janssen R, Spierings G. 1993. Use of the enterobacterial outer membrane protein PhoE in the development of new vaccines and DNA probes. Zentralbl Bakteriol 278:396–406. doi: 10.1016/S0934-8840(11)80856-X. [DOI] [PubMed] [Google Scholar]
- 28.Janssen R, Wauben M, van der Zee R, Tommassen J. 1994. Immunogenicity of a mycobacterial T-cell epitope expressed in outer membrane protein PhoE of Escherichia coli. Vaccine 12:406–409. doi: 10.1016/0264-410X(94)90115-5. [DOI] [PubMed] [Google Scholar]
- 29.Park S, Kirthika P, Jawalagatti V, Senevirathne A, Lee JH. 2021. Salmonella delivered Lawsonia intracellularis novel epitope-fusion vaccines enhance immunogenicity and confers protection against Lawsonia intracellularis in mice. Vet Microbiol 263:109264. doi: 10.1016/j.vetmic.2021.109264. [DOI] [PubMed] [Google Scholar]
- 30.Hsieh PF, Liu JY, Pan YJ, Wu MC, Lin TL, Huang YT, Wang JT. 2013. Klebsiella pneumoniae peptidoglycan-associated lipoprotein and murein lipoprotein contribute to serum resistance, antiphagocytosis, and proinflammatory cytokine stimulation. J Infect Dis 208:1580–1589. doi: 10.1093/infdis/jit384. [DOI] [PubMed] [Google Scholar]
- 31.Leow CY, Kazi A, Hisyam Ismail CMK, Chuah C, Lim BH, Leow CH, Banga Singh KK. 2020. Reverse vaccinology approach for the identification and characterization of outer membrane proteins of Shigella flexneri as potential cellular- and antibody-dependent vaccine candidates. Clin Exp Vaccine Res 9:15–25. doi: 10.7774/cevr.2020.9.1.15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kopparapu PK, Deshmukh M, Hu Z, Mohammad M, Maugeri M, Götz F, Valadi H, Jin T. 2021. Lipoproteins are responsible for the pro-inflammatory property of Staphylococcus aureus extracellular vesicles. Int J Mol Sci 22:7099. doi: 10.3390/ijms22137099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kurupati P, Teh BK, Kumarasinghe G, Poh CL. 2006. Identification of vaccine candidate antigens of an ESBL producing Klebsiella pneumoniae clinical strain by immunoproteome analysis. Proteomics 6:836–844. doi: 10.1002/pmic.200500214. [DOI] [PubMed] [Google Scholar]
- 34.Maiti B, Shetty M, Shekar M, Karunasagar I, Karunasagar I. 2012. Evaluation of two outer membrane proteins, Aha1 and OmpW of Aeromonas hydrophila as vaccine candidate for common carp. Vet Immunol Immunopathol 149:298–301. doi: 10.1016/j.vetimm.2012.07.013. [DOI] [PubMed] [Google Scholar]
- 35.Casey WT, Spink N, Cia F, Collins C, Romano M, Berisio R, Bancroft GJ, McClean S. 2016. Identification of an OmpW homologue in Burkholderia pseudomallei, a protective vaccine antigen against melioidosis. Vaccine 34:2616–2621. doi: 10.1016/j.vaccine.2016.03.088. [DOI] [PubMed] [Google Scholar]
- 36.Yang Q, Pan YL, Wang KY, Wang J, He Y, Wang EL, Liu T, Yi G, Chen DF, Huang XL. 2016. OmpN, outer membrane proteins of Edwardsiella ictaluri are potential vaccine candidates for channel catfish (Ictalurus punctatus). Mol Immunol 78:1–8. doi: 10.1016/j.molimm.2016.08.011. [DOI] [PubMed] [Google Scholar]
- 37.Pang HY, Li Y, Wu ZH, Jian JC, Lu YS, Cai SH. 2010. Immunoproteomic analysis and identification of novel immunogenic proteins from Vibrio harveyi. J Appl Microbiol 109:1800–1809. doi: 10.1111/j.1365-2672.2010.04808.x. [DOI] [PubMed] [Google Scholar]
- 38.Li L, Song M, Peng B, Peng XX, Li H. 2020. Identification and innate immunity mechanism of protective immunogens from extracellular proteins of Edwardsiella tarda. Fish Shellfish Immunol 97:41–45. doi: 10.1016/j.fsi.2019.12.020. [DOI] [PubMed] [Google Scholar]
- 39.Stratmann T. 2015. Cholera toxin subunit B as adjuvant—an accelerator in protective immunity and a break in autoimmunity. Vaccines (Basel) 3:579–596. doi: 10.3390/vaccines3030579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ali M, Pandey RK, Khatoon N, Narula A, Mishra A, Prajapati VK. 2017. Exploring dengue genome to construct a multi-epitope based subunit vaccine by utilizing immunoinformatics approach to battle against dengue infection. Sci Rep 7:9232. doi: 10.1038/s41598-017-09199-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Regueiro V, Moranta D, Campos MA, Margareto J, Garmendia J, Bengoechea JA. 2009. Klebsiella pneumoniae increases the levels of Toll-like receptors 2 and 4 in human airway epithelial cells. Infect Immun 77:714–724. doi: 10.1128/IAI.00852-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 43.Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MT, Fookes M, Falush D, Keane JA, Parkhill J. 2015. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31:3691–3693. doi: 10.1093/bioinformatics/btv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yu NY, Wagner JR, Laird MR, Melli G, Rey S, Lo R, Dao P, Sahinalp SC, Ester M, Foster LJ, Brinkman FS. 2010. PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes. Bioinformatics 26:1608–1615. doi: 10.1093/bioinformatics/btq249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Doytchinova IA, Flower DR. 2007. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics 8:4. doi: 10.1186/1471-2105-8-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wang P, Sidney J, Dow C, Mothe B, Sette A, Peters B. 2008. A systematic assessment of MHC class II peptide binding predictions and evaluation of a consensus approach. PLoS Comput Biol 4:e1000048. doi: 10.1371/journal.pcbi.1000048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Nielsen M, Lund O. 2009. NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction. BMC Bioinformatics 10:296. doi: 10.1186/1471-2105-10-296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Nielsen M, Lundegaard C, Lund O. 2007. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method. BMC Bioinformatics 8:238. doi: 10.1186/1471-2105-8-238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Sidney J, Assarsson E, Moore C, Ngo S, Pinilla C, Sette A, Peters B. 2008. Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide libraries. Immunome Res 4:2. doi: 10.1186/1745-7580-4-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Sturniolo T, Bono E, Ding J, Raddrizzani L, Tuereci O, Sahin U, Braxenthaler M, Gallazzi F, Protti MP, Sinigaglia F, Hammer J. 1999. Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices. Nat Biotechnol 17:555–561. doi: 10.1038/9858. [DOI] [PubMed] [Google Scholar]
- 51.Andreatta M, Karosiene E, Rasmussen M, Stryhn A, Buus S, Nielsen M. 2015. Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification. Immunogenetics 67:641–650. doi: 10.1007/s00251-015-0873-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Saha S, Raghava GP. 2006. Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins 65:40–48. doi: 10.1002/prot.21078. [DOI] [PubMed] [Google Scholar]
- 53.El-Manzalawy Y, Dobbs D, Honavar V. 2008. Predicting linear B-cell epitopes using string kernels. J Mol Recognit 21:243–255. doi: 10.1002/jmr.893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Mahapatra SR, Dey J, Kaur T, Sarangi R, Bajoria AA, Kushwaha GS, Misra N, Suar M. 2021. Immunoinformatics and molecular docking studies reveal a novel multi-epitope peptide vaccine against pneumonia infection. Vaccine 39:6221–6237. doi: 10.1016/j.vaccine.2021.09.025. [DOI] [PubMed] [Google Scholar]
- 55.Dimitrov I, Naneva L, Doytchinova I, Bangov I. 2014. AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 30:846–851. doi: 10.1093/bioinformatics/btt619. [DOI] [PubMed] [Google Scholar]
- 56.Dimitrov I, Bangov I, Flower DR, Doytchinova I. 2014. AllerTOP v.2—a server for in silico prediction of allergens. J Mol Model 20:2278. doi: 10.1007/s00894-014-2278-5. [DOI] [PubMed] [Google Scholar]
- 57.Gasteiger E, Hoogland C, Gattiker A, Duval S, Wilkins M, Appel R, Bairoch A. 2005. Protein identification and analysis tools on the ExPASy server; the proteomics protocols handbook. Humana Press, Totowa, NJ. [Google Scholar]
- 58.Varshavsky A. 1997. The N-end rule pathway of protein degradation. Genes Cells 2:13–28. doi: 10.1046/j.1365-2443.1997.1020301.x. [DOI] [PubMed] [Google Scholar]
- 59.Kyte J, Doolittle RF. 1982. A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
- 60.Rapin N, Lund O, Bernaschi M, Castiglione F. 2010. Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system. PLoS One 5:e9862. doi: 10.1371/journal.pone.0009862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Gonzalez-Pech RA, Stephens TG, Chan CX. 2019. Commonly misunderstood parameters of NCBI BLAST and important considerations for users. Bioinformatics 35:2697–2698. doi: 10.1093/bioinformatics/bty1018. [DOI] [PubMed] [Google Scholar]
- 62.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJ. 2015. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ko J, Park H, Heo L, Seok C. 2012. GalaxyWEB server for protein structure prediction and refinement. Nucleic Acids Res 40:W294–W297. doi: 10.1093/nar/gks493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. 2018. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wiederstein M, Sippl MJ. 2007. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res 35:W407–W410. doi: 10.1093/nar/gkm290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Abraham MJ, Murtola T, Schulz R, Páll S, Smith JC, Hess B, Lindahl E. 2015. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2:19–25. doi: 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
- 67.Kozakov D, Hall DR, Xia B, Porter KA, Padhorny D, Yueh C, Beglov D, Vajda S. 2017. The ClusPro web server for protein-protein docking. Nat Protoc 12:255–278. doi: 10.1038/nprot.2016.169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Laskowski RA. 2001. PDBsum: summaries and analyses of PDB structures. Nucleic Acids Res 29:221–222. doi: 10.1093/nar/29.1.221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Grote A, Hiller K, Scheer M, Münch R, Nörtemann B, Hempel DC, Jahn D. 2005. JCat: a novel tool to adapt codon usage of a target gene to its potential expression host. Nucleic Acids Res 33:W526–W531. doi: 10.1093/nar/gki376. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.