Skip to main content
Pathogens and Global Health logoLink to Pathogens and Global Health
. 2022 May 12;117(2):134–151. doi: 10.1080/20477724.2022.2072456

Recent trends in next generation immunoinformatics harnessed for universal coronavirus vaccine design

Chin Peng Lim a,b, Boon Hui Kok b, Hui Ting Lim b, Candy Chuah c, Badarulhisam Abdul Rahman d, Abu Bakar Abdul Majeed e, Michelle Wykes f, Chiuan Herng Leow b, Chiuan Yee Leow a,
PMCID: PMC9970233  PMID: 35550001

ABSTRACT

The ongoing pandemic of coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has globally devastated public health, the economies of many countries and quality of life universally. The recent emergence of immune-escaped variants and scenario of vaccinated individuals being infected has raised the global concerns about the effectiveness of the current available vaccines in transmission control and disease prevention. Given the high rate mutation of SARS-CoV-2, an efficacious vaccine targeting against multiple variants that contains virus-specific epitopes is desperately needed. An immunoinformatics approach is gaining traction in vaccine design and development due to the significant reduction in time and cost of immunogenicity studies and increasing reliability of the generated results. It can underpin the development of novel therapeutic methods and accelerate the design and production of peptide vaccines for infectious diseases. Structural proteins, particularly spike protein (S), along with other proteins have been studied intensively as promising coronavirus vaccine targets. Numbers of promising online immunological databases, tools and web servers have widely been employed for the design and development of next generation COVID-19 vaccines. This review highlights the role of immunoinformatics in identifying immunogenic peptides as potential vaccine targets, involving databases, and prediction and characterization of epitopes which can be harnessed for designing future coronavirus vaccines.

KEYWORDS: Human coronavirus, immunoinformatics, peptide vaccine, vaccine target, database, characterization, epitope prediction

Introduction

History of human coronaviruses and their evolution

In December 2019, a novel contagious coronavirus was first identified in Wuhan, China. This virus (initially known as Wuhan coronavirus) turned out to be the third member of the class of coronaviruses to affect global health following Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) reported in 2002 [1] and Middle East Respiratory Syndrome coronavirus (MERS-CoV) reported in 2012 [2]. The Wuhan coronavirus was later officially designated as Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) given the fact that this virus is genetically related to SARS-CoV.

The receptor-binding domain (RBD) of the spike protein from SARS-CoV-2 is significantly similar to the structure of the spike protein of SARS-CoV [3,4]. However, the spike protein of SARS-CoV-2 possesses a higher affinity to angiotensin converting enzyme 2 (ACE2) than that of SARS-CoV [5]. The disease caused by SARS-CoV-2 was later designated as Coronavirus Disease 2019 (COVID-19) by World Health Organization (WHO). COVID-19 is continuing to spread around the world. As of 23 March 2022, there were more than 474 million confirmed cases and more than 6.1 million deaths across almost 200 countries (mortality rate of over 1%). COVID-19 has shown severity that surpasses previous infections in terms of the propensity to spread readily and cause severe symptoms in the elderly and individuals with morbidities like diabetes and cardiovascular diseases [6,7]. According to standard operating procedures (SOPs) issued by the health authority, treatment concentrates typically on managing symptoms and providing respiratory support [8].

Coronaviruses are a group of enveloped viruses with single-stranded and positive-sense RNA genomes made up of at least six open reading frames (ORFs) [9]. Coronaviruses are well-known for their rapid mutation rate. Since 2019, the genomic sequence of SARS-CoV-2 has mutated repeatedly, giving rise to a significant number of variants reported worldwide [4]. Currently, coronaviruses are only proven to be linked with respiratory diseases in humans. In fact, a total of six human coronaviruses have been described, generally classified into two main groups, pathogenic strain and non-pathogenic strain. In the mid 1960s, two non-pathogenic strains were first reported, termed HCoV-229E [10] and HCoV-OC43 [11]. In November 2002, a respiratory disease first occurred in China followed by a rapid transmission to other countries, ultimately causing more than 8,000 confirmed cases, with a mortality rate of 9.6% [9]. The etiologic agent was recognized as SARS-CoV, a member of betacoronavirus. This was the first report of a coronavirus able to jump the species barrier from bats to humans and cause serious disease, indicating its potential for zoonotic transmission [12,13]. Soon enough, two other non-pathogenic strains, HCoV-NL63 [14] and HCoV-HKU1 [15], were discovered in 2004 and 2005, respectively. A decade later, in 2012, a SARS-like virus emerged and circulated in Saudi Arabia through the manifestation of respiratory disease [16]. Despite the relatively limited transmission among humans, this pathogenic strain, termed MERS-CoV led to two major outbreaks including much later one that happened in South Korea. Overall, the number of cases exceeded 2,000 globally and the mortality rate was as high as 35% [17]. Contrary to both SARS-CoV and SARS-CoV-2 which bind to the human ACE2 receptor, MERS-CoV instead attaches to dipeptidyl-peptidase 4 (DPP4) of the host cells to mediate the infection. Basically, HCoV-NL63 and HCoV-229E were initially detected in bats, whereas HCoV-OC43 and HCoV-HKU1 possibly originated in rodents. The coronaviruses that are phylogenetically related to SARS-CoV and MERS-CoV were found in diverse bat species [18].

Next generation vaccine development based on reverse vaccinology

Immunity plays an essential role in protecting humans against pathogenic infections. The stimulation of protective immunity is usually dependent on immune response against only a small part of foreign proteins in the pathogen, while immunity to the remaining proteins may even induce undesirable allergenic responses or adverse effects to the functions of T-cells and B-cells [19]. With the advances in next-generation sequencing technologies, the amount of genomic data generated that are relevant to immunological research has been substantial. With the aid of a sophisticated immunoinformatics approach, in silico analysis of genomic data can be used to identify potential vaccine targets [20]. This innovative approach can be adopted to design novel subunit vaccines with one or more proteins from the pathogen(s) as required [21–23]. There are several advantages to using immunogenic epitope-based peptides as components of vaccines, including the omission of infectious substances, significantly-reduced risk of cross-reactivity with host tissues, high capacity to perform chemical modification on the products, rapid manufacturing in large scale and relatively long-term storage [24]. In principle, an epitope-based peptide vaccine consists of short antigenic peptides that are able to induce both cellular and humoral immune responses. The length of peptides selected are usually 20 to 30 amino acids from pathogen proteins that carry the highest antigenicity [25–27]. This type of vaccine depends on the major histocompatibility complex (MHC) presentation of the introduced peptides to the CD4 and CD8 T-cells. However, the vast diversity of human MHC loci needs to be taken into consideration in vaccine design to achieve effective immunity in a heterogenous population [28]. Moreover, the efficacy of a vaccine largely relies on the T-cell response to the selected epitopes and targeted antigens. Also needing consideration, is that peptides tend to be biologically active for a short period of time due to enzymatic degradation and that some short peptides may not be recognized by immune cells such as dendritic cells and macrophages, and thus do not elicit considerable immune response [24]. Hence, co-administration of adjuvants is one of the methods used to enhance the immune response [29]. Alternatively, immune response can be intensified when the desired antigenic peptides are linked to larger carrier proteins to improve their stability [30–32]. There is more evidences indicate that multi-component peptide vaccines are able to induce stronger response than a single component peptide vaccine [33].

Immunoinformatics approach

Over the past decades, immunology has been advancing and improving through online resources and software, leading to the establishment of a branch of bioinformatics known as immunoinformatics [34–36]. Immunoinformatics encompasses computational techniques and resources used to study the immune functions. This includes the structural, functional and regulatory analysis of sequence-based or molecular data, modelling of immunologic data and immunologic databases as well as simulation of wet-laboratory experiments [37,38]. Sophisticated tools are employed to depict the immune processes at the molecular, cellular or system level and the development of novel research applications and therapy [36]. In reverse vaccinology, immunoinformatics is utilized in the mapping of potential T-cell and B-cell epitopes via studies and design of predictive algorithms. Antigenic regions and binding sites of the target proteins can thus be identified by analysing pathogen genomes [39,40]. T-cell and B-cell epitopes can also be linked together to form an immunogenic peptide molecule [41]. The immunoinformatics approach is gaining huge interest in various research fields due to the capability in time and cost reduction in immunogenicity studies and increasing reliability in terms of the results generated. Accurate prediction of new epitopes encourages the development of therapeutic methods and accelerate the design and production of peptide vaccines for infectious diseases [42]. Furthermore, the advancement of next-generation sequencing (NGS) has made sequencing of the complete genome of pathogens and humans possible, fast-tracking the development of methods used to predict immunogenic epitopes. The resulting vaccines can be universal and protect against all types of microorganisms as well as provide protection for all individuals regardless of the polymorphism of HLA alleles [27,43]. During the COVID-19 pandemic, many studies related to vaccine design using immunoinformatics approach have also been conducted against other major human pathogens, including Klebsiella pneumoniae, Staphylococcus aureus and Salmonella sp [44–49]., demonstrating once again the reliability and capability of immunoinformatics in designing effective vaccines. In this review, we will describe the vital roles of immunoinformatics approach in identifying immunogenic peptides as potential vaccine targets, involving databases, and prediction and characterization of epitopes. We use keywords such as ‘immunoinformatics’, ‘coronavirus’ and ‘vaccine’ and all seven existing human coronaviruses are included in the scope of this review.

Emergence of SARS-CoV-2 variant of concern (voc) strains

A substitution of aspartic acid with glycine at position 614 (D614G) found in the spike protein of SARS-CoV-2 was firstly discovered in February 2020. Scientists soon discovered that this mutation created a variant which was associated with enhanced infectivity. Within a few months, this variant dominated over the ancestral strain first identified in China, circulating extensively worldwide [50]. To date, five novel variants emerged notably as Variant of Concern (VOC), which are Alpha (UK variant B.1.1.7), Beta (South African variant B.1.351), Gamma (Brazilian variant P.1), Delta (Indian variant B.1.617.2) and Omicron (South African variant B.1.1.529). Needless to say, all of these variants contain D614G substitution in their S protein region. First of all, Alpha variant (B.1.1.7) was initially identified in the United Kingdom. Numerous mutations were observed in the regions including ORF1ab polyprotein, S protein, ORF8 protein and N protein. The S protein had several substitutions as well as deletions at positions 69 and 70 [51]. Substitution P681H is adjacent to the furin cleavage site, which is essential for viral infection [52] while deletion ∆69-70 increases viral infectivity and is associated with immune escape in immunocompromised patients [53]. It is noteworthy that ORF8 protein was truncated due to the presence of a stop codon. According to a preprint [54], Covaxin from Bharat Biotech is effective against Alpha variant.

The Beta variant (B.1.351) contains mutations in ORF1ab polyprotein, S protein, E protein and N protein. This variant was discovered in South Africa with an E484K substitution detected in the spike protein. This particular mutation has exhibited the ability to affect antibody binding [55] and reduce the efficacy of existing vaccines such as Pfizer/BioNTech vaccine, Moderna vaccine, AstraZeneca-Oxford vaccine [56] and Sinovac vaccine [57]. The next variant of concern was the Gamma variant (P.1), initially found in Brazil which has several mutations in ORF1ab polyprotein, S protein and N protein. This variant shares similarities with the B.1.351 variant, such as E484K and K417N/T mutations [58]. These three variants have a N501Y mutation in the spike protein, to be precise, within the RBD. This mutation is shown to alter the conformation of RBD, leading to a higher transmission of viruses. Mutations of K417N/T, E484K, and N501Y in the spike protein result in reduced neutralizing activity by antibodies from convalescent and mRNA vaccine-elicited sera [59]. The Delta variant (B.1.617.2) was first reported in India and it seemed that the mutations are basically different from the three other variants. In addition to ORF1ab protein, S protein and N protein, substitutional mutations took place in M protein, ORF3a protein and ORF7 protein. The mutation of L452R in the S protein reduces the neutralization efficacy by antibodies [60]. While being unique in this variant, T478K is in close proximity to the key mutation of E484K found in other variants such as Beta (B.1.351). Studies have inferred that mutation of P681R showed a certain degree of linkage to more viral loads and increased transmission [61].

Finally, Omicron variant (B.1.1.529) was first reported in South Africa. Mutations are present in ORF1ab protein as well as all structural proteins. In S protein, the number of mutations is strikingly massive, that is 34 mutations, including three deletions and an insertion of three residues at position 214. Unprecedentedly, the RBD also accommodates as many as 15 mutations. For instance, although the glutamic acid at position 484 is substituted by alanine instead of lysine as observed in Beta and Gamma variant, Omicron variant has been reported to exhibit a similar escaping effect from neutralizing antibodies [62]. Q493R and Q498R introduce additional electrostatic interactions and S477N forms hydrogen bonding with ACE2 receptor, increasing the binding affinity towards the receptor [63]. Furthermore, a triple mutation is spotted at the furin cleavage site (H655Y, N679K, and P681H), which may imply an enhancement in the transmissibility [64]. Still, Omicron variant shares some mutations with other VOCs, such as ∆69-70 and N501Y. Table 1 contains all notable mutations observed in these variants [65].

Table 1.

List of non-synonymous mutations in the Variants of Concern (VOCs) (B.1.1.7, B.1.351, P.1, B.1.617.2 and B.1.1.529)

Region Non-synonymous mutations
 
B.1.1.7 (α) B.1.351 (β) P.1 (γ) B.1.617.2 (δ) B.1.1.529 (ο)
ORF1a polyprotein T1001I
A1708D
I2230T
∆3675-3677
Q57H
T256I
K1655N
K3353R
∆3675-3677
S253P
S1188L
K1795Q
∆3675-3677
- K856R
∆2083
L2084I
A2710T
T3255I
P3395H
∆3674-3676
I3758V
ORF1b polyprotein P314L P314L P314L
E1264D
P314L
P1000L
P314L
I1566V
S protein ∆69-70
∆144
N501Y
A570D
D614G
P681H
T716I
S982A
D1118H
D80A
D215G
∆241-243
K417N
E484K
N501Y
D614G
A701V
L18F
T20N
P26S
D138Y
R190S
K417T
E484K
N501Y
D614G
H655Y
T1027I
V1176F
T19R
∆156-157
R158G
L452R
T478K
D614G
P681R
D950N
A67V
∆69-70
T95I
∆142-144
Y145D
∆211
L212I
ins214EPE
G339D
S371L
S373P
S375F
K417N
N440K
G446S
S477N
T478K
E484A
Q493R
G496S
Q498R
N501Y
Y505H
T547K
D614G
H655Y
N679K
P681H
N764K
D796Y
N856K
Q954H
N969K
L981F
M protein - - - I82T D3G
Q19E
A63T
E protein - P71L - - T9I
N protein D3L
R203K
G204R
S235F
T205I P80R
R203K
G204R
D63G
R203M
D377Y
P13L
∆31-33
R203K
G204R
ORF3a protein - Q57H - S26L -
ORF7a protein - - - V82A
T120I
-
ORF8 protein Q27stop
R52I
Y73C
- E92K - -

Vaccine targets identified by immunoinformatics analysis

In order to obtain promising vaccine targets, the identification of conserved regions across variant strains is particularly essential. The proteins that are specific to the target pathogen as well as those that are released or upregulated during the interaction with the host are suited to be vaccine candidates [66]. The potential of the non-conservedproteins to be a therapeutic target relies on the functional importance of the genes and respective proteins in the growth and survival of the pathogen [67,68]. Furthermore, in terms of the immunological mechanism, induction of antibodies as effector molecules is usually needed, suggesting that chosen vaccine protein has to be immunogenic enough to stimulate protective B-cell responses. Prior to this, pattern recognition or augmented dendritic cell activation is important for the optimization of T-helper cell response [69,70]. The best vaccination strategies include both neutralizing antibodies and cytotoxic T-cells [69]. Antigens consisting of multiple epitopes are better than a single epitope. In fact, T-cell epitope content is a significant marker to study the overall immunogenic potential [66].

Structural proteins for human coronavirus vaccine development

Structural proteins of coronaviruses have been gaining attention in the process of designing and developing vaccines. Coronaviruses contain four types of structural proteins, including spike glycoprotein (S) forming into bulky peplomers in the viral envelope, membrane glycoprotein (M), internal nucleocapsid protein (N) and transmembrane envelope protein (E). Some species contain another protein with hemagglutination and esterase functions (HE).

S protein binds to one another naturally into a homo-trimer, resembling Class I membrane fusion protein [71]. S protein comprises two subunits. The S1 subunit is subdivided into N-terminal domain (NTD) and C-terminal domain (CTD). The RBD is located in the CTD. On the other hand, S2 subunit contains the basic elements responsible for membrane fusion, that are, fusion peptide (FP), heptad repeats (HR), membrane proximal external region (MPER) and transmembrane domain (TM). Full-length S protein, S1 subunit, RBD, NTD and FP have been investigated as potential targets of vaccines. The RBD is the most studied due to the fact that it attaches straight to the ACE2 receptor on the host cells. RBD immunization induced antibodies are capable of blocking this interaction and thus effectively minimize the infection [72]. To elaborate, the RBD is conserved relative to the S1 subunit, plus many neutralizing epitopes are found within this domain, making it an excellent candidate for inclusion in a vaccine [73]. N protein is located inside the virus, where it surrounds the viral RNA and forms a helical nucleocapsid. This protein, the most abundant protein in coronavirus, is relatively more conserved than S protein [74]. It is highly antigenic and suitable as a marker for diagnostic purposes [72]. Studies have stated that E protein is a noticeable virulence factor [75]. Along with S and E proteins, M proteins are also found on the viral envelope and are the major component needed for initiating viral-budding. M protein is abundant on the viral envelope (surface) and is well conserved among different species [76].

Given its significance in viral tropism and entry, most studies targeted the S protein of coronavirus, either as the sole target or incorporated with other viral proteins (Table 2). The S1 protein binds to receptors (e.g. ACE2) on the host cell membrane followed by the fusion between virions and cell membranes facilitated by the S2 protein. According to Bhatnager et al. [77], fusion proteins determine the infectivity of coronavirus, making them promising therapeutic candidates. Thus, a multiepitope vaccine was developed from conserved parts of S2 domain to strengthen the immune response from both T-cells and B-cells while eliminating the regions that may cause undesirable side effects such as allergy. Furthermore, Jyotisha et al. [78] have worked on RBD in their in silico vaccine development since this region exhibits no significant homology with the human proteome, diminishing the possibility of autoimmunity. For the latter approach, a group of scientists, Rahman et al. [79] investigated on the development of three types of structural proteins namely S, E and M to produce chimeric peptide vaccines. Instead of the entire length of S protein, RBD and NTD were chosen given that these segments possess immunodominant neutralizing domains. Despite mutations on the RBD which could allow new strains to escape from specific antibodies, a combination of both proteins has proven to generate robust neutralizing antibodies and long-term protective immunity in animal models. E and M proteins can further boost the immunity to tackle viral evasion. Next, main protease, nsp12 and nsp13 which are highly conserved in coronaviruses, are selected with S protein in developing multiepitope vaccine [80]. Safavi et al. [81] did research in the design of a peptide vaccine containing immunogenic epitopes from non-structural proteins located in ORF1ab polyprotein and RBD from S protein. The primary aim was to create vaccines capable of stimulating neutralizing antibodies that inhibit the viral binding to the host cells [81].

Table 2.

List of vaccine candidates studied via immunoinformatics relating to coronaviruses.

Vaccine candidate Articles reviewed
Spike protein [131–135]
S2 domain [77]
RBD [78]
N protein [136]
E protein [137]
M protein [82]
N protein and S protein [138]
S protein and E protein [139]
S protein, M protein and E protein [79,140]
S protein, M protein and N protein [141]
S protein, E protein and N protein [142]
All structural proteins [143,144]
Whole proteome [145]
Subtractive proteomics [146]
3CL hydrolase (main protease/Mpro) [88]
nsp1 [89]
nsp4 [147]
S protein, E protein, N protein, M protein, selected ORFs (ORF3, ORF4a, ORF4b, ORF5), ORF1a, ORF1ab and ORF8b [148]
E protein, ORF3a protein, N protein, ORF7a protein and M protein [90]
S protein, main protease, nsp12, nsp13 [80]
N protein, M protein, ORF3a protein and S protein [149]
S protein and ORF1ab protein [81]
ORF1ab protein, S protein, M protein and N protein [150]
N protein, ORF3a protein and M protein [91]

There are a number of studies that have selected N protein as the target for potential vaccine candidates. In fact, this gene is relatively more conserved and mutationally stable. N proteins of many coronaviruses are highly immunogenic and are abundant during the course of the infection. N protein can also lead to long-lasting immune response without causing any side effect, especially from T-cells. Being the most abundant protein, M protein appears to carry a pivotal role in intracellular budding [82]. High immunogenicity and multifunctional properties of E protein have made itself a promising vaccine target. Significantly, this protein may be directly associated with virulence as indicated by the deletion of this protein led to increased expression of apoptotic markers and elevated virus-induced inflammation in the infected cells [83]. Moreover, as observed from the accumulation of E protein at the ERGIC, this indicates its role in viral assembly and budding [84,85]. Despite minor portion is incorporated eventually into the viral envelope, it has been found that E protein is also participating in virion release [86,87]. On the other hand, many research studies suggest that all structural glycoproteins are considerably involved in viral pathogenesis. A multi-epitope vaccine should thus be designed, to mainly focus on disrupting the life-cycle of coronaviruses.

Non-Structural proteins and accessory proteins for human coronavirus vaccine development

Other proteins chosen for vaccine development includes 3CL hydrolase, nsp1, ORF3a protein and ORF7a protein. 3CL hydrolase is vital for proteolytic maturation of the virus. Short peptides were extracted from this protein as potential epitopes for both CTLs and HTLs in the design of a multiepitope vaccine [88]. The nsp1 may be a major virulence factor for coronaviruses as seen from its functions. It blocks host gene expression by preventing the translation involving 40S ribosomal subunit as well as degrades mRNA of the host cells. In infected cells, the expression of the IFN genes and the host antiviral signalling pathways were impeded [89]. E protein, ORF3a protein, N protein, ORF7a protein and M protein showed a remarkable linkage with the structural integrity and functionality of the virus [90]. ORF3a protein, N protein and M protein are important in the virus replication and function [91]. Figure 1 illustrates the open reading frames (ORFs) in coronavirus genome and a schematic coronavirus structure labelled with structural proteins.

Figure 1.

Figure 1.

Open reading frames (ORFs) composed of non-structural proteins, structural proteins and accessory proteins (from left to right) in coronavirus genome and schematic virion structure of coronavirus. (Note: SARS-CoV-2 genome was used as the template to represent the basic genome organization.) Coronaviruses contain four types of structural proteins, including spike glycoprotein (S) forming into bulky peplomers in the viral envelope, membrane glycoprotein (M), internal nucleocapsid protein (N) and transmembrane envelope protein (E). some species contain another protein with hemagglutination and esterase functions (HE). all non-structural proteins are translated from ORF1ab region (nsp1-16) while the mumber of accessory proteins varies among different coronaviruses. in the case of SARS-CoV-2, the accessory proteins include ORF3a protein, ORF6 protein, ORF7a protein, ORF7b protein, ORF8 protein and ORF10 protein.

Genomics, proteomics and immunomics database

Both molecular biology and immunology produce large amounts of data that have to be stored in general-purpose and designated immunological databases. General-purpose biological databases contain annotated entries of biological sequences. These entries typically contain the sequence, a short description, the source organism, a list of structural or functional features and literature references [37]. A list of databases is available to extract raw data such as protein sequences and structural information as stated in Table 3.

Table 3.

List of databases.

The Allele Frequency Net Database (AFND) is an open resource storing the frequency data on the polymorphisms of immune-related genes such as human leukocyte antigen (HLA) system, killer-cell immunoglobulin-like receptors (KIR), MHC class I chain-related genes (MIC) and cytokine gene polymorphisms. AFND collects information and data from four main sources, which are, (i) peer-reviewed publications, (ii) analysed populations by International HLA and Immunogenetics Workshops (IHWSs), (iii) individual submissions and (iv) short publication reports (SPR) in Human Immunology Journal. The allele, gene, genotype or haplotype frequencies for the abovementioned loci can be searched in this database [92]. AutoPeptiDB is another database, composed of 103 high-resolution peptide-protein complexes. The binding affinity of short peptides and proteins is calculated based on the data arising from these complexes in this database. The peptide–protein interactions being considered, generally take place in cellular activities such as signal transduction, protein transport, antigen binding and enzyme-substrate inhibition. There are no two protein monomers that share more than 70% sequence identity in this dataset [93].

GenBank serves as a public database of genetic sequences, focusing on the expansion and dissemination of information. The repository relies on the submissions of sequence data from authors and whole-genome shotgun (WGS) as well as high-throughput data from sequencing centres and issued patents from The U.S. Patent and Trademark Office. GenBank is a partner of the International Nucleotide Sequence Database Collaboration (INSDC) along with European Nucleotide Archive (ENA) and Data Bank of Japan (DDBJ) in which data exchange is done on a daily basis so that a systematic collection of sequence information is accessible worldwide. GenBank also collects and stores amino acid sequences from databases like SWISS-PROT, Protein Research Foundation (PRF) and Protein Data Bank (PDB) [94]. GISAID has gained its reputation as a trustworthy means for international sharing of all influenza virus data including genetic sequence and metadata [95]. In response to the COVID-19 pandemic, related data have also been shared via this public domain recently. InterPro is a unified resource resulting from the integration of protein signature databases including PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The major application is annotation and functional classification of uncharacterized sequences. Based on sequence positions and protein coverage, protein signatures that fall into the same family or functional domain are grouped into single entry with respective annotation and cross-references [96–98].

Protein Data Bank (PDB) contains structural data of biological macromolecules, typically from X-ray crystallography followed by nuclear magnetic resonance (NMR) and theoretical modelling. The primary data generated are 3D coordinates, source organism, sequence, chemical structures of cofactors and prosthetic groups, names of all components found in structure, qualitative structural information and literature citations. Besides, detailed descriptions of experimental methods for structure determination are included. PDB entries are linked to more than 10 other databases and go through multiple times of review and validation in order to ensure that the description on the 3D structure of the macromolecule of PDB entries in the specified experimental state is reliable and accurate [99]. Pfam is a database of protein families and domains built for the analysis of novel genomes, metagenomes and experimental work on certain proteins and systems. Every Pfam family has a seed alignment of a representative set of sequences. A profile hidden Markov models (HMM) is then automatically constructed from the seed alignment and searched against a sequence database known as pfamseq. All sequence regions that fulfil the family-specific curated threshold are aligned to the profile HMM for full alignment. Pfam entries are manually annotated with functional information from the literature [100]. The UniProt Knowledgebase (UniProt) functions as the central database of protein sequences with functional annotations.The UniProt Knowledgebase has two sections: ‘UniProt/TrEMBL’ comprising computationally analysed records and ‘UniProt/Swiss-Prot’ containing manually annotated records resulting from literature retrieval and curator-evaluated computational analysis. Cross references to external resources are possible in UniProt, for example DNA sequence entries in the INSDC databases, 2D PAGE and 3D protein structure databases, protein domain and family characterization databases, post-translational modification (PTM) databases, species-specific data collections, variant databases and disease databases [101]. The Virus Pathogen Database and Analysis Resource (ViPR) is a repository for viruses that are classified into Category A-C priority pathogens. It keeps record of the information for virus families that are pathogenic specifically to humans including Arenaviridae, Bunyaviridae, Caliciviridae, Coronaviridae, Filoviridae, Flaviviridae, Hepeviridae, Herpesviridae, Paramyxoviridae, Picornaviridae, Poxviridae, Reoviridae, Rhabdoviridae and Togaviridae. Basically, ViPR integrates viral data and related information from three sources including public databases such as GenBank, direct submissions from authors and data derived through computational approach [102]. Finally, as the name implies, viruSITE is a genomic resource specifically for all virus taxa, which gathers data from numerous resources. For instance, genome sequences are retrieved from the reference genomes stored in the NCBI RefSeq database. The data are downloaded, processed and deposited into the database using in-house scripts. It is also a versatile genome search engine and integrated suite of tools for visualization and analysis of sequences between viral genomes [103].

It is noteworthy to know that a few other online platforms were established for coronavirus-related research including CoronaVIR, Coronavirus Database (CoVdb), COVIEdb, CoVIDep, CoV3D and DBCOVP. CoronaVIR collects and stores SARS-CoV-2 information consisting of genomics, diagnosis and therapeutics to serve the purpose of suggesting potential candidates for the development of effective diagnostic tools and vaccines [104]. CoVdb also accommodates gene information of coronavirus, but not limited to SARS-CoV-2 only, such as gene function, subcellular localization, topology, protein structure and evolutionary biology [105]. COVIDep analyses and identifies T-cell and B-cell epitopes based on the immunological data of SARS-CoV and genetic data of SARS-CoV-2. The locations of the identified epitopes are indicated on the primary structure of SARS-CoV-2 proteins as well as graphical display of analysed sequences and genetic variation present on SARS-CoV-2 proteins [106]. Similarly, COVIEdb predicts potential T-cell and B-cell epitopes for SARS-CoV, MERS-CoV and SARS-CoV-2 through their protein sequences. Ultimately, pan-coronavirus vaccine and drug development are facilitated for future coronavirus outbreak [107]. CoV3D regularly updates structural datasets of coronaviruses and enables structural viewing of single and multiple complexes, which then can be utilized for analysis, modelling and structural design. This tool helps the users in understanding antibody recognition on S protein, polymorphisms and glycosylation [108]. DBCOVP is a repository of structural proteins from the genomes of betacoronavirus genera (SARS-CoV, MERS-CoV and SARS-CoV-2) with detailed information such as sequence-structural properties, predicted conserved T-cell and B-cell epitopes, and tertiary structure of epitope-HLA binding-complexes [109].

Analysis and characterization of antigenic protein candidates which can prime T-Cells and B-cells

An epitope is the part of an antigen that is recognized by the adaptive immune system. It binds to specific receptors including antibodies, MHC molecules and T-cell receptors [28]. The binding portion of an antibody is termed a paratope. Epitopes can be either continuous or discontinuous. A continuous or linear epitope is a relatively short (usually 5–6) amino acid sequences recognized by the paratope of a corresponding antibody. In contrast, a discontinuous epitope consists of non-adjacent segments of amino acids, not necessarily from one chain, which form a specific 3D structure, which can also be recognized by antibodies. Since discontinuous epitope arises from a specific 3D fold, it is also known as conformational epitope. Notably, epitopes recognized by B-cell epitopes may contain lipids, nucleic acids or carbohydrates, giving resultant antibodies a vast repertoire while T-cell epitopes are usually peptide fragments. The investigation, identification and development of epitopes are crucial in promoting the advancement of diagnostics and therapeutics [110].

Common parameters that are considered when selecting protein vaccine candidates include antigenicity, allergenicity and physicochemical properties. Under this category, altogether 34 different tools can be employed (Table 4). VaxiJen is a server developed for alignment-independent recognition and prediction of protective antigens of bacterial, viral and tumour origin. The server contains models derived by ACC pre-processing of amino acids properties for protein classification and quantitative structure–activity relationships (QSAR) studies of peptides with different lengths [111]. The ExPASy server contains a variety of databases and analysis tools, focusing on proteins and proteomics. On the other hand, ProtParam is an extensively-used tool that is able to compute physicochemical properties from a protein sequence, without requiring any additional information. These properties include the molecular weight, theoretical pI, amino acid composition, atomic composition, extinction coefficient, estimated half-life, instability index, aliphatic index, and grand average of hydropathicity index (GRAVY). The amino acid and atomic compositions are self-explanatory. The query protein can either be specified as a Swiss-Prot/TrEMBL accession number or as a raw sequence [112]. Finally, similar to VaxiJen, ACC pre-processing is applied to sets of known allergens with different origins and routes of exposure in AllerTop. From here, alignment-independent models for allergen recognition are established based on the chemical properties of amino acid sequences. A mirror set of non-allergens is compiled from the same species. Five machine learning methods including discriminant analysis by partial least squares (DA-PLS), logistic regression (LR), decision tree (DT), naïve Bayes (NB) and k nearest neighbours (kNN) are utilized for the differentiation between allergens and non-allergens. The high sensitivity is seen by the ability to identify new allergens which are structurally diverse in comparison to existing allergens [113]. Overall, these tools can be used to identify potential new vaccine candidates.

Table 4.

List of servers for analysis and characterization of protein candidates.

T-Cell epitope prediction tools

Decades of studies on MHC-binding motifs and experimental characterization of MHC binders and epitopes [114,115] have progressively boosted the establishment of an information base for predicting peptide binding. This vast amount of information ultimately contributes to the emergence of algorithms for predicting MHC-binding peptides. These developed prediction systems aim to accurately assign binding affinity of a query peptide to a specific MHC molecule. Because of high specificity, HLA-ligand (peptide) binding is the limiting step in the antigen processing pathway. In general, the coverage of the DQ and DP loci is lower than that of the DR locus [43]. Furthermore, determining the peptides that bind to a particular MHC molecule is essential in understanding immune responses before proceeding to the design of peptide-based vaccines and immunotherapies [116]. The peptides presented by MHC class I molecules are generally 8 to 11 amino acids long whereas the ones presented by MHC class II molecules are reported to be longer than 30 amino acids [114]. The prediction of MHC class II-binding peptides is more difficult than class I-peptides, owing to the unknown position of the binding core within the longer peptides [43]. Following in silico predictions, the selected peptides then undergo experimental validation as T-cell epitopes [117]. The prediction tools are grouped according to the type of MHC molecules encountered (Table 5; Figure 2).

Table 5.

list of servers for T-Cell epitope prediction.

Tool Web address Reference Articles reviewed
MHC I binding
CTLPred http://crdd.osdd.net/raghava/ctlpred/ [164] [77]
MHC-I Binding Predictions http://tools.iedb.org/mhci/ [119,165–172] [77,78,88,135–137,139,173]
MHC-NP http://tools.iedb.org/mhcnp/ [174] [136,139]
NetCTL v1.2 https://services.healthtech.dtu.dk/service.php?NetCTL-1.2 [118] [78,80,81,88,132,136,139,141–143,146,156]
NetCTLpan v1.1 https://services.healthtech.dtu.dk/service.php?NetCTLpan-1.1 [175] [81,82,139]
NETMHC 4.0 https://services.healthtech.dtu.dk/service.php?NetMHC-4.0 [165] [90]
NetMHCPAN https://services.healthtech.dtu.dk/service.php?NetMHCpan-4.1 [176] [81,141,145]
ProPred I http://crdd.osdd.net/raghava/propred1/ [177] [131,133,150]
Proteasomal cleavage/TAP transport/MHC class I combined predictor http://tools.iedb.org/processing/ [178] [79,138,139,148]
SYFPEITHI http://www.syfpeithi.de/bin/MHCServer.dll/EpitopePrediction.htm [115] [81]
MHC II binding
IFNepitope http://crdd.osdd.net/raghava/ifnepitope/ [121] [77–79,81,82,88,133,141,142,144,146,148,156]
IL-4 Pred http://crdd.osdd.net/raghava/il4pred/ [179] [156]
IL-10 Pred http://crdd.osdd.net/raghava/IL-10pred/ [180] [141,156]
MHC2Pred http://crdd.osdd.net/raghava/mhc2pred/ [181] [81]
MHC-II Binding Predictions http://tools.iedb.org/mhcii/ [120,169,182–185] [77–79,88,135,137,139,142,146,148,156,163]
NetMHCII 2.3 https://services.healthtech.dtu.dk/service.php?NetMHCII-2.3 [186] [80]
NetMHCII PAN 3.2 http://www.cbs.dtu.dk/services/NetMHCIIpan-3.2/ [186] [81,82,90,141,145]
ProPred http://crdd.osdd.net/raghava/propred/ [187] [131,133,150]
MHC I and II binding
MHCPred http://www.ddg-pharmfac.net/mhcpred/MHCPred/ [188] [81,90]
Rankpep http://imed.med.ucm.es/Tools/rankpep.html [189] [77,89,91]
TepiTool http://tools.iedb.org/tepitool/ [190] [81]

Figure 2.

Figure 2.

Classification of T-cell epitope prediction tools based on the type of MHC molecules. tools such as MHC-I binding predictions, NetCTL and NetMHCPAN are designed for MHC class I molecules, while IFNepitope, MHC-II binding predictions and NetMHCIIPAN are the examples of prediction tools for MHC class II molecules. meanwhile, there are in silico tools developed for the prediction of T-cell epitopes that can recognized by both MHC class I and II molecules, for example, rankpep and tepitool.

NetCTL is a web-based tool used to predict human MHC Class I CTL epitopes found in proteins. It combines the predictions of proteasomal cleavage, TAP transport efficiency, and MHC class I affinity. In this case, datasets within this tool are expanded to improve the tool performance. The epitopes predicted are restricted to the A26 and B39 supertypes [118]. NetMHCpan uses both peptide and primary HLA sequence as inputs for artificial neural network-driven (ANN-driven) predictions to generate data while considering all HLA specificities. It is pan-specific, which means that it predicts the affinity of interaction of query peptide with human HLA-A or HLA-B molecule. At population level, it provides the broadest possible population coverage in vaccine design. The ability to handle HLA molecule as well as the availability of specific data for a particular HLA haplotype has lead toward individualized immunotherapy and diagnostics [119]. NetMHCIIpan is ANN-trained on quantitative peptide-binding data covering MHC class II molecules. The identification of the binding core is greatly improved using a network alignment procedure namely offset correction. This procedure integrates the predictions of different neural networks to enhance core register identification without affecting binding affinity. The prediction values are interpreted in the form of IC50 binding affinity. Precise identification of the peptide-binding register leads to the characterization of the MHC class II binding pocket, which is crucial for the discovery and design of peptides that are MHC-recognized [120]. IFNepitope is a web server developed using PHP, Perl, HTML and Java scripts. It is useful for the development of subunit vaccines. Basically, this server is composed of three modules, Predict, Design and Scan. The Predict module allows users to screen peptide library for suitable IFN-γ inducing epitopes. The Design module identifies minimum mutations required in a peptide to produce an IFN-γ inducing epitope. All possible peptides with single-residue mutation are generated and IFN-γ inducing epitopes are predicted from these mutated peptides. Finally, the Scan module predicts regions in an antigen that can activate IFN-γ [121].

B-Cell epitope prediction tools

In contrast to T-cell epitopes, prediction of B-cell epitopes is significantly different. Potential B-cell epitopes have to also engage helper T-cell response and are not necessarily continuous in sequence as described earlier. Folding of proteins allows spatial proximity of amino acids that may be far apart in terms of sequence. The prediction of discontinuous B-cell epitopes is therefore more sophisticated and difficult than that of continuous ones mainly due to machine learning-based methods require continuous sequence data. Importantly, approximately 85% of documented B-cell epitopes are continuous in sequence [122], at some point, can thus be predicted by methods similar to those of T-cell epitope prediction. The working hypothesis applied to existing B-cell epitope predictors is that some amino acids have a higher likelihood of being part of a B-cell epitope [43]. Moreover, a limited understanding of the characteristics of B-cell epitopes hinders the performance of prediction methods. A considerable number of predicted epitopes are subsequently found to be false positives or false negatives. Hence, it is essential to retrieve a comprehensive description of the epitope, for example, its sequence composition and structural characteristics [122]. Most of the prediction tools are designed for linear epitopes instead of conformational epitopes (Table 6; Figure 3). The application of other prediction tools is not restricted to either T-cell or B-cell (Table 7).

Table 6.

list of servers for B-Cell epitope prediction.

Figure 3.

Figure 3.

Classification of B-cell epitope prediction tools based on the type of epitopes. most of the prediction tools are designed for linear epitopes instead of conformational epitopes. antibody epitope prediction, ABCPred and BepiPred are some of the prediction tools for linear B-cell epitope. to predict conformational B-cell epitopes, tools such as ElliPro and DiscoTope can be used.

Table 7.

List of servers for epitope prediction.

Tool Web address Reference Articles reviewed
Antigenic peptides prediction tool http://imed.med.ucm.es/Tools/antigenic.pl [195] [135]
CPORT https://milou.science.uu.nl/services/CPORT/ [198] [79]
Epitope Cluster Analysis http://tools.iedb.org/cluster/ [199] [138]
NetChop‐3.1 https://services.healthtech.dtu.dk/service.php?NetChop-y3.1 [201] [81,139]

Bepipred combines the prediction of a HMM and the propensity scale by Parker et al. [123]. It has high accuracy for predicting B-cell epitopes and better performance than other methods when tested on the validation data set. This improved version is trained only on epitope data derived from crystal structures, presumably increasing the quality and predictive power in contrast to the prediction tools trained on linear peptides tested for antibody recognition [124]. ElliPro is a web-based tool for the prediction of conformational epitopes from protein sequence or structure. It implements a modified version of Thorntons method [125], MODELLER program and Jmol viewer to run the prediction and visualization of antibody epitopes in protein sequences and structures. ElliPro considers protein structure as an ellipsoid and calculates protrusion indexes for protein residues outside of the ellipsoid [126]. DiscoTope is a structure-based prediction method for residues located in discontinuous B‐cell epitopes [127]. DiscoTope uses a combination of epitope features such as amino acid statistics, residue solvent accessibility, spatial distribution, intermolecular contacts and surface exposure. It is trained on a compiled data set of discontinuous epitopes from 76 X‐ray protein structures of antibody-antigen complexes. Antibody binding studies and site‐directed mutagenesis are then applied to categorize predicted epitope residues and validate binding. Furthermore, analysis of the local conformations of epitope residues is essential because these conformations must be preserved for discontinuous epitopes [128]. IgPred server relates an antigenic amino acid sequence to its potency to generate systemic (IgG), allergic (IgE) and mucosal (IgA) type of antibody response. Amino acid and dipeptide composition are input features. IgA epitopes are labelled as positive set while the rest of the epitopes are negative set. The prediction of a peptide inducing more than one class of antibodies is possible. If a particular peptide has equal score for two models, it will be assigned to both classes given that the score for both classes is more than the threshold [129].

Conclusion

The emergence of immune-escaped variants of the SARS-CoV-2 has raised concern about the effectiveness of current COVID-19 vaccines rolled out worldwide. Notably, incidents about prior infected individuals as well as vaccinated individuals being tested positive with COVID-19 and some of them even being hospitalized are constantly reported, especially during the severe waves attributed to Delta and Omicron. Instead of scaling up the vaccination rate all over the world, there is a pressing need to develop and redesign more protective vaccines for future prevention. In light of this, a variety of immune system-related databases, prediction software and modelling tools, informatics and computational infrastructure for connecting computer modelling and wet-lab experimentation are critically required to convert high-throughput immune data into effective next-generation vaccines. Immunoinformatics is a promising approach to accelerate the design of novel, universal and efficacious next-generation vaccines and their delivery to pre-clinical studies. With the advances of computational technologies, immunoinformatics is no longer a theoretical approach in dealing with the diagnostics and therapeutics purposes. Its cost-effectiveness and increasing reliability have pushed the development of the field of immunology further than ever. The application of immunoinformatics has been expanding to cover nearly all aspects of immunology including genomics and proteomics associated with the immune system. Therefore, it would be necessary to regularly monitor the development and improvement of immunoinformatics tools and related databases in terms of algorithms and technology so that the deviation between actual and estimated data can be reduced. In this case, close collaboration between immunoinformatics experts and experimental immunologists is vital to nourish the advancement of immunoinformatics, at the same time, to improve its effectiveness and accuracy, for efficient data interpretation. It is noteworthy that immunoinformatics is merely the calculation and prediction of results via in silico tools. Databases are established on the basis the data have already been tested through real experiments. The quality of the prediction largely depends on the quality of data and, the sophistication and precision of the algorithms used. Slight errors in the sequence data generated will affect the efficiency of downstream analysis [130].

Funding Statement

This work was supported by the Malaysia Ministry of Higher Education [203.CIPPM.6711968]; Malaysia Ministry of Higher Education [203/PPSP/6171200]; Universiti Sains Malaysia [1001.CIPPM.8011078].

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • [1].Peng G-W, He J-F, Lin, J-Y, et al. Epidemiological study on severe acute respiratory syndrome in Guangdong province. Zhonghua liuxingbingxue zazhi. 2003;24(5): 350–352 [PubMed] [Google Scholar]
  • [2].Zaki AM, van Boheemen S, Bestebroer TM, et al. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N Engl J Med. 2012;367(19):1814–1820. [DOI] [PubMed] [Google Scholar]
  • [3].Zhou P, Yang X-L, Wang X-G, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579(7798):270–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Lu R, Zhao X, Li J, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395(10224):565–574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Wrapp D, Wang N, Corbett KS, et al. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367(6483):1260–1263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Guan WJ, Ni ZY, Hu Y, et al. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382(18):1708–1720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Guo YR, Cao QD, Hong ZS, et al. The origin, transmission and clinical therapies on coronavirus disease 2019 (COVID-19) outbreak - an update on the status. Mil Med Res. 2020;7(1):11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Zhang L, Liu Y.. Potential interventions for novel coronavirus in China: a systematic review. J Med Virol. 2020;92(5):479–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nature Rev Microbiol. 2009;7(6):439–450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Hamre D, Procknow JJ. A new virus isolated from the human respiratory tract. Pro Soc Exp Biol Med. 1966;121(1):190–193. [DOI] [PubMed] [Google Scholar]
  • [11].Almeida JD, Tyrrell DAJ. The morphology of three previously uncharacterized human respiratory viruses that grow in organ culture. J Gen Virol. 1967;1(2):175–178. [DOI] [PubMed] [Google Scholar]
  • [12].van der Hoek L. Human coronaviruses: what do they cause? Antivir Ther. 2007;12(4):651–658. [PubMed] [Google Scholar]
  • [13].Li W, Shi Z, Yu M, et al. Bats are natural reservoirs of SARS-like coronaviruses. Science. 2005;310(5748):676. [DOI] [PubMed] [Google Scholar]
  • [14].van der Hoek L, Pyrc K, Jebbink MF, et al. Identification of a new human coronavirus. Nat Med. 2004;10(4):368–373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Woo PCY, Lau SKP, and Chu CM, et al. Characterization and complete genome sequence of a novel coronavirus, Coronavirus HKU1 from patients with pneumonia. Journal of Virology. 2005;79(2):884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].de Groot RJ, Baker SC, Baric RS, et al. Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus study group. J Virol. 2013;87(14):7790–7792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Chafekar A, Fielding BC. MERS-CoV: understanding the latest human coronavirus threat. Viruses. 2018;10(2):93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Cui J, Li F, Shi Z-L. Origin and evolution of pathogenic coronaviruses. Nat Rev Microbiol. 2019;17(3):181–192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Mohsenzadegan M, Saebi F, Yazdani M, et al. Autoantibody against new gene expressed in prostate protein is traceable in prostate cancer patients. Biomark Med. 2018;12(10):1125–1138. [DOI] [PubMed] [Google Scholar]
  • [20].Pizza M, Scarlato V, Masignani V, et al. Identification of vaccine candidates against serogroup B meningococcus by whole-genome sequencing. Science. 2000;287(5459):1816–1820. [DOI] [PubMed] [Google Scholar]
  • [21].Petrovsky N, Aguilar JC. Vaccine adjuvants: current state and future trends. Immunol Cell Biol. 2004;82(5):488–496. [DOI] [PubMed] [Google Scholar]
  • [22].Thompson AL, Staats HF. Cytokines: the future of intranasal vaccine adjuvants. Clin Dev Immunol. 2011, 2011;289597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Sesardic D. Synthetic peptide vaccines. J Med Microbiol. 1993;39(4):241–242. [DOI] [PubMed] [Google Scholar]
  • [24].Fujita Y, Taguchi H. Current status of multiple antigen-presenting peptide vaccine systems: application of organic and inorganic nanoparticles. Chem Cent J. 2011;5(1):48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Bijker MS, Melief CJM, Offringa R, et al. Design and development of synthetic peptide vaccines: past, present and future. Expert Rev Vaccines. 2007;6(4):591–603. [DOI] [PubMed] [Google Scholar]
  • [26].Lin SY-H, Cheng C-W, Su EC-Y. Prediction of B-cell epitopes using evolutionary information and propensity scales. BMC Bioinformatics. 2013;14(2):S10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Toussaint NC, Maman Y, Kohlbacher O, et al. Universal peptide vaccines – optimal peptide vaccine design based on viral sequence conservation. Vaccine. 2011;29(47):8745–8753. [DOI] [PubMed] [Google Scholar]
  • [28].Purcell AW, McCluskey J, Rossjohn J. More than one reason to rethink the use of peptides in vaccine design. Nat Rev Drug Discov. 2007;6(5):404–414. [DOI] [PubMed] [Google Scholar]
  • [29].Heegaard PMH, Dedieu L, Johnson N, et al. Adjuvants and delivery systems in veterinary vaccinology: current state and future developments. Arch Virol. 2011;156(2):183–202. [DOI] [PubMed] [Google Scholar]
  • [30].Schutze MP, Leclerc C, Jolivet M, et al. Carrier-induced epitopic suppression, a major issue for future synthetic vaccines. J Immunol. 1985;135(4):2319. [PubMed] [Google Scholar]
  • [31].Di John D, Torres J, Murillo J, et al. Effect of priming with carrier on response to conjugate vaccine. Lancet. 1989;334(8677):1415–1418. [DOI] [PubMed] [Google Scholar]
  • [32].Herzenberg LA, Tokuhisa T, Herzenberg LA. Carrier-priming leads to hapten-specific suppression. Nature. 1980;285(5767):664–667. [DOI] [PubMed] [Google Scholar]
  • [33].Carvalho TF, Haddad JPA, Paixão TA, et al. Meta-analysis and advancement of Brucellosis vaccinology. PLOS ONE. 2016;11(11): e0166582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Gardy JL, Lynn DJ, Brinkman FSL, et al. Enabling a systems biology approach to immunology: focus on innate immunity. Trends Immunol. 2009;30(6):249–262. [DOI] [PubMed] [Google Scholar]
  • [35].Tomar N, De RK. Immunoinformatics: an integrated scenario. Immunology. 2010;131(2):153–168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [36].Brusic V, Petrovsky N. Immunoinformatics and its relevance to understanding human immune disease. Expert Rev Clin Immunol. 2005;1(1):145–157. [DOI] [PubMed] [Google Scholar]
  • [37].Kazi A, Chuah C, Majeed ABA, et al. Current progress of immunoinformatics approach harnessed for cellular- and antibody-dependent vaccine design. Pathog Glob Health. 2018;112(3):123–131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Davies MN, Flower DR. Harnessing bioinformatics to discover new vaccines. Drug Discov Today. 2007;12(9):389–395. [DOI] [PubMed] [Google Scholar]
  • [39].De Groot AS, Berzofsky JA. From genome to vaccine — new immunoinformatics tools for vaccine design. Methods. 2004;4(34):425–428. [DOI] [PubMed] [Google Scholar]
  • [40].Hemmati M, Raoufi E, Fallahi H. Predicting candidate epitopes on ebola virus for possible vaccine development. In: S.I O, editor. Advances in Ebola control. London, UK: InTechOpen; 2017. p. 98. [Google Scholar]
  • [41].Raoufi E, Hemmati M, Eftekhari S, et al. Epitope prediction by novel immunoinformatics approach: a state-of-the-art review. Int J Pept Res Ther. 2020;26(2):1155–1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Backert L, Kohlbacher O. Immunoinformatics and epitope prediction in the age of genomic medicine. Genome Med. 2015;7(1):119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Mahapatra SR, Sahoo S, Dehury B, et al. Designing an efficient multi-epitope vaccine displaying interactions with diverse HLA molecules for an efficient humoral and cellular immune response to prevent COVID-19 infection. Expert Rev Vaccines. 2020;19(9):871–885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Dey J, Mahapatra SR, Singh P, et al. B and T cell epitope-based peptides predicted from clumping factor protein of Staphylococcus aureus as vaccine targets. Microb Pathog. 2021;160:105171. [DOI] [PubMed] [Google Scholar]
  • [45].Mahapatra SR, Dey J, Kaur T, et al. Immunoinformatics and molecular docking studies reveal a novel multi-epitope peptide vaccine against pneumonia infection. Vaccine. 2021;39(42):6221–6237. [DOI] [PubMed] [Google Scholar]
  • [46].Chatterjee R, Sahoo P, Mahapatra SR, et al. Development of a conserved chimeric vaccine for induction of strong immune response against Staphylococcus aureus using immunoinformatics approaches. Vaccines (Basel). 2021;9(9):1038 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Mahapatra SR, Dey J, Kushwaha GS, et al. Immunoinformatic approach employing modeling and simulation to design a novel vaccine construct targeting MDR efflux pumps to confer wide protection against typhoidal Salmonella serovars. J Biomol Struct Dyn. 2021; Online ahead of print . [DOI] [PubMed] [Google Scholar]
  • [48].Dey J, Mahapatra SR, Lata S, et al. Exploring Klebsiella pneumoniae capsule polysaccharide proteins to design multiepitope subunit vaccine to fight against pneumonia. Expert Rev Vaccines. 2022;21(4):569–587. [DOI] [PubMed] [Google Scholar]
  • [49].Korber B, Fischer WM, Gnanakaran S, et al. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell. 2020;182(4):812–827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50].Galloway SE, Paul P, MacCannell DR, et al. Emergence of SARS-CoV-2 B.1.1.7 lineage - united states, december 29, 2020-january 12, 2021. Morbidity and mortality weekly report (MMWR). 2021;70(3): 95–99 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [51].Hoffmann M, Kleine-Weber H, Pöhlmann S. A multibasic cleavage site in the spike protein of SARS-CoV-2 is essential for infection of human lung cells. Mol Cell. 2020;78(4):779–784. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [52].Kemp SA, Collier DA, Datir RP, et al. SARS-CoV-2 evolution during treatment of chronic infection. Nature. 2021;592(7853):277–282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [53].Sapkal GN, Yadav PD, Ella R, et al. Neutralization of UK-variant VUI-202012/01 with COVAXIN vaccinated human serum. bioRxiv. 2021; Preprint . [Google Scholar]
  • [54].Greaney AJ, Loes AN, Crawford KHD, et al. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe. 2021;29(3):463–476.e6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [55].Bian L, Gao F, Zhang J, et al. Effects of SARS-CoV-2 variants on vaccine efficacy and response strategies. Expert Rev Vaccines. 2021;20(4):365–373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [56].Chen Y, Shen H, Huang R, et al. Serum neutralising activity against SARS-CoV-2 variants elicited by CoronaVac. Lancet Infect Dis. 2021;21(8):1071–1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [57].Maggi F, Novazzi F, Genoni A, et al. Imported SARS-CoV-2 variant P.1 in traveler returning from Brazil to Italy. Emerg Infect Dis. 2021;27(4):1249–1251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [58].Wang Z, Schmidt F, Weisblum Y, et al. mRNA vaccine-elicited antibodies to SARS-CoV-2 and circulating variants. Nature. 2021;592(7855):616–622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].McCallum M, Bassi J, Marco AD. SARS-CoV-2 immune evasion by the B.1.427/B.1.429 variant of concern , et al. Science. 2021;373(6555):648–654 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [60].Johnson BA, Xie X, Kalveram B. Furin cleavage site is key to SARS-CoV-2 pathogenesis , et al. bioRxiv. 2020; Preprint. [Google Scholar]
  • [61].Cao Y, Wang J, Jian F, et al. Omicron escapes the majority of existing SARS-CoV-2 neutralizing antibodies. Nature. 2022;602(7898):657–663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].McCallum M, Czudnochowski N, Rosen Laura E, et al. Structural basis of SARS-CoV-2 Omicron immune evasion and receptor engagement. Science. 2022;375(6583):864–868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [63].Peacock TP, Goldhill DH, Zhou J, et al. The furin cleavage site in the SARS-CoV-2 spike protein is required for transmission in ferrets. Nat Microbiol. 2021;6(7):899–909. [DOI] [PubMed] [Google Scholar]
  • [64].Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Global Challenges. 2017;1(1):33–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [65].Terry FE, Moise L, Martin RF, et al. Time for T? Immunoinformatics addresses vaccine design for neglected tropical and emerging infectious diseases. Expert Rev Vaccines. 2015;14(1):21–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [66].Agüero F, Al-Lazikani B, Aslett M, et al. Genomic-scale prioritization of drug targets: the TDR targets database. Nat Rev Drug Discov. 2008;7(11):900–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [67].Doyle MA, Gasser RB, Woodcroft BJ, et al. Drug target prediction and prioritization: using orthology to predict essentiality in parasite genomes. BMC Genomics. 2010;11(1):222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Spohn G, Bachmann MF. Exploiting viral properties for the rational design of modern vaccines. Expert Rev Vaccines. 2008;7(1):43–54. [DOI] [PubMed] [Google Scholar]
  • [69].Bachmann MF, Dyer MR. Therapeutic vaccination for chronic diseases: a new class of drugs in sight. Nat Rev Drug Discov. 2004;3(1):81–88. [DOI] [PubMed] [Google Scholar]
  • [70].Bosch Berend J, van der Zee R, de Haan Cornelis AM, et al. The coronavirus spike protein is a class I virus fusion protein: structural and functional characterization of the fusion core complex. J Virol. 2003;77(16):8801–8811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [71].Zhang J, Zeng H, Gu J, et al. Progress and prospects on vaccine development against SARS-CoV-2. Vaccines (Basel). 2020;8(2):153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [72].Jiang S, He Y, Liu S. SARS vaccine development. Emerg Infect Dis. 2005;11(7):1016–1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [73].Ong E, Wang H, Wong MU, et al. Vaxign-ML: supervised machine learning reverse vaccinology model for improved prediction of bacterial protective antigens. Bioinformatics. 2020;36(10):3185–3191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Nieto-Torres JL, DeDiego ML, Verdiá-Báguena C, et al. Severe acute respiratory syndrome coronavirus envelope protein ion channel activity promotes virus fitness and pathogenesis. PLoS Pathog. 2014;10(5):e1004077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [75].Neuman BW, Kiss G, Kunding AH, et al. A structural analysis of M protein in coronavirus assembly and morphology. J Struct Biol. 2011;174(1):11–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [76].Bhatnager R, Bhasin M, Arora J, et al. Epitope based peptide vaccine against SARS-COV2: an immune-informatics approach. J Biomol Struct Dyn. 2021;39(15):5690–5705. [DOI] [PubMed] [Google Scholar]
  • [77].Singh JS, Qureshi IA. Multi-epitope vaccine against SARS-CoV-2 applying immunoinformatics and molecular dynamics simulation approaches. J Biomol Struct Dyn. 2022;40(7):2917–2933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Rahman MS, Hoque MN, Islam MR, et al. Epitope-based chimeric peptide vaccine design against S, M and E proteins of SARS-CoV-2, the etiologic agent of COVID-19 pandemic: an in silico approach. PeerJ. 2020;8:e9572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [79].Rehman HM, Mirza MU, Saleem M, et al. A putative prophylactic solution for COVID-19: development of novel multiepitope vaccine candidate against SARS‐COV‐2 by comprehensive immunoinformatic and molecular modelling approach. Biology (Basel). 2020;9(9):296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [80].Safavi A, Kefayat A, Mahdevar E, et al. Exploring the out of sight antigens of SARS-CoV-2 to design a candidate multi-epitope vaccine by utilizing immunoinformatics approaches. Vaccine. 2020;38(48):7612–7628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [81].Ayyagari VS, TC V, et al. Design of a multi-epitope-based vaccine targeting M-protein of SARS-CoV-2: an immunoinformatics approach. J Biomol Struct Dyn. 2022;40(7):2963–2977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [82].DeDiego ML, Nieto-Torres JL, Jiménez-Guardeño JM, et al. Severe acute respiratory syndrome coronavirus envelope protein regulates cell stress response and apoptosis. PLoS Pathog. 2011;7(10):e1002315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [83].Corse E, Machamer CE. The cytoplasmic tails of infectious bronchitis virus E and M proteins mediate their interaction. Virology. 2003;312(1):25–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [84].Hogue BG, Machamer CE. Coronavirus Structural Proteins and Virus Assembly . . Nidoviruses (Wiley; ). 2007;179–200. [Google Scholar]
  • [85].Ye Y, Hogue Brenda G. Role of the coronavirus E viroporin protein transmembrane domain in virus assembly. J Virol. 2007;81(7):3597–3607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [86].Ruch Travis R, Machamer Carolyn E. The hydrophobic domain of infectious bronchitis virus E protein alters the host secretory pathway and is important for release of infectious virus. J Virol. 2011;85(2):675–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [87].Jakhar R, Kaushik S, Gakhar SK. 3CL hydrolase-based multiepitope peptide vaccine against SARS-CoV-2 using immunoinformatics. J Med Virol. 2020;92(10):2114–2123. [DOI] [PubMed] [Google Scholar]
  • [88].Chaudhuri A. Comparative analysis of non structural protein 1 of SARS-CoV2 with SARS-CoV1 and MERS-CoV: An in silico study. J Mol Struct . 2021;1243:130854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [89].Joshi A, Joshi BC, M.a.-u M, et al. Epitope based vaccine prediction for SARS-COV-2 by deploying immuno-informatics approach. Inf Med Unlocked. 2020;19: 100338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [90].Enayatkhani M, Hasaniazad M, Faezi S, et al. Reverse vaccinology approach to design a novel multi-epitope vaccine candidate against COVID-19: an in silico study. J Biomol Struct Dyn. 2021;39(8):2857–2872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [91].Gonzalez-Galarza FF, McCabe A, Eduardo J M.d S, et al. Allele frequency net database (AFND) 2020 update: gold-standard data classification, open access genotype data and new query tools. Nucleic Acids Res. 2020;48(1):D783–D788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [92].London N, Movshovitz-Attias D, Schueler-Furman O. The structural basis of peptide-protein binding strategies. Structure. 2010;18(2):188–199. [DOI] [PubMed] [Google Scholar]
  • [93].Benson DA, Cavanaugh M, Clark K, et al. GenBank. Nucleic Acids Res. 2017;45(D1:D37–D42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [94].Shu Y, McCauley J. GISAID: global initiative on sharing all influenza data - from vision to reality. Euro surveillance: European communicable disease bulletin. 2017;22(13):30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [95].Hunter S, Apweiler R, Attwood TK, et al. InterPro: the integrative protein signature database. Nucleic Acids Res. 2009;37(Database issue):D211–D215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [96].Mulder NJ, Apweiler R, Attwood TK, et al. New developments in the InterPro database. Nucleic Acids Res. 2007;35:D224–D228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [97].Apweiler R, Attwood TK, Bairoch A, et al. The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 2001;29(1):37–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [98].Berman HM, Westbrook J, Feng Z, et al. The protein data bank. Nucleic Acids Res. 2000;28(1):235–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [99].Finn RD, Bateman A, Clements J, et al. Pfam: the protein families database. Nucleic Acids Res. 2014;42(Database issue):D222–D230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [100].Bairoch A, Apweiler R, Wu CH, et al. The universal protein resource (uniprot). Nucleic Acids Res. 2005;33(1):D154–D159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [101].Pickett BE, Sadat EL, Zhang Y, et al. ViPR: an open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012;40(Database issue):D593–D598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [102].Stano M, Beke G, Klucar L. viruSITE-Integrated database for viral genomics. Database. 2016:2016:baw162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [103].Patiyal S, Kaur D, Kaur H, et al. A web-based platform on Coronavirus Disease-19 to maintain predicted diagnostic, drug. and vaccine candidates. Monoclonal Antibodies in Immunodiagnosis and Immunotherapy. 2020;39(6):204–216. [DOI] [PubMed] [Google Scholar]
  • [104].Zhu Z, Meng K, Liu G, et al. A database resource and online analysis tools for coronaviruses on a historical and global scale. Database. 2020;2020:70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [105].Ahmed SF, Quadeer AA, McKay MR. COVIDep: a web-based platform for real-time reporting of vaccine target recommendations for SARS-CoV-2. Nat Protoc. 2020;15(7):2141–2142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [106].Wu J, Chen W, Zhou J, et al. COVIEdb: a database for potential immune epitopes of coronaviruses. Front Pharmacol. 2020;11:572249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [107].Gowthaman R, Guest JD, Yin R, et al. CoV3D: a database of high resolution coronavirus protein structures. Nucleic Acids Res. 2021;49(D1:D282–D287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [108].Sahoo S, Mahapatra SR, Parida BK, et al. DBCOVP: a database of coronavirus virulent glycoproteins. Comput Biol Med. 2021;129:104131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [109].Fleri W, Salimi N, Vita R, et al. Immune Epitope Database and analysis resource. In: Ratcliffe MJH, editors. Encyclopedia of immunobiology. Academic Press: Oxford; 2016. p. 220–224. [Google Scholar]
  • [110].Doytchinova IA, Flower DR. VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines. BMC Bioinformatics. 2007;8(1):4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [111].Gasteiger E, Hoogland C, Gattiker A, et al. Protein Identification and analysis tools on the expasy server. In: Walker JMeditors. The proteomics protocols handbook. Totowa NJ: Humana Press; 2005. p. 571–607. [Google Scholar]
  • [112].Dimitrov I, Bangov I, Flower DR, et al. AllerTOP v.2–A server for in silico prediction of allergens. J Mol Model. 2014;20(6):2278. [DOI] [PubMed] [Google Scholar]
  • [113].Brusic V, Rudy G, Harrison LC. MHCPEP, a database of MHC-binding peptides. Nucleic Acids Res. 1998;26(1):368–371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [114].Rammensee HG, Bachmann J, Emmerich NPN, et al. SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics. 1999;50(3–4):213–219. [DOI] [PubMed] [Google Scholar]
  • [115].Berzofsky JA, Ahlers JD, Belyakov IM. Strategies for designing and optimizing new generation vaccines. Nat Rev Immunol. 2001;1(3):209–219. [DOI] [PubMed] [Google Scholar]
  • [116].Brusic V, Bajic VB, Petrovsky N. Computational methods for prediction of T-cell epitopes— a framework for modelling, testing, and applications. Methods. 2004;34(4):436–443. [DOI] [PubMed] [Google Scholar]
  • [117].Larsen MV, Lundegaard C, Lamberth K, et al. Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction. BMC Bioinformatics. 2007;8:424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [118].Nielsen M, Lundegaard C, Blicher T, et al. NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence. PLOS ONE. 2007e796–e796; 28 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [119].Andreatta M, Karosiene E, Rasmussen M, et al. Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification. Immunogenetics. 2015;67(11–12):641–650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [120].Dhanda SK, Vir P, Raghava GPS. Designing of interferon-gamma inducing MHC Class-II binders. Biol Direct. 2013;8:30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [121].Kringelum JV, Nielsen M, Padkjær SB, et al. Structural analysis of B-cell epitopes in antibody: protein complexes. Mol Immunol. 2013;53(1–2):24–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [122].Parker JM, Guo D, Hodges RS. New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites. Biochemistry. 1986;25(19):5425–5432. [DOI] [PubMed] [Google Scholar]
  • [123].Jespersen MC, Peters B, Nielsen M, et al. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017;45(W1:W24–W29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [124].Thornton JM, Edwards MS, Taylor WR, et al. Location of ‘continuous’ antigenic determinants in the protruding regions of proteins. EMBO J. 1986;5(2):409–413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [125].Ponomarenko J, Bui -H-H, Li W, et al. ElliPro: a new structure-based tool for the prediction of antibody epitopes. BMC Bioinformatics. 2008;9:514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [126].Kringelum JV, Lundegaard C, Lund O, et al. Reliable B cell epitope predictions: impacts of method development and improved benchmarking. PLoS Comput Biol. 2012;8(12):e1002829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [127].Haste Andersen P, Nielsen M, Lund O. Prediction of residues in discontinuous B-cell epitopes using protein 3D structures. Protein Sci. 2006;15(11):2558–2567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [128].Gupta S, Ansari HR, Gautam A, et al. Identification of B-cell epitopes in an antigen for inducing specific class of antibodies. Biol Direct. 2013;8():27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [129].Bahrami AA, Payandeh Z, Khalili S, et al. Immunoinformatics: in silico approaches and computational design of a multi-epitope, immunogenic protein. Int Rev Immunol. 2019;38(6):307–322. [DOI] [PubMed] [Google Scholar]
  • [130].Tahir Ul Qamar M, S S, U.a A, et al. Epitope‐based peptide vaccine design and target site depiction against middle east respiratory syndrome Coronavirus: an immune-informatics study. J Transl Med. 2019;17(1):362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [131].Panda PK, Arul MN, Patel P, et al. Structure-based drug designing and immunoinformatics approach for SARS-CoV-2. Sci Adv. 2020;6(28):eabb8097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [132].Sanami S, Zandi M, Pourhossein B, et al. Design of a multi-epitope vaccine against SARS-CoV-2 using immunoinformatics approach. Int J Biol Macromol. 2020;164:871–883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [133].Ghorbani A, Zare F, Sazegari S, et al. Development of a novel platform of virus-like particle (VLP)-based vaccine against COVID-19 by exposing epitopes: an immunoinformatics approach. New Microbes New Infect. 2020;38:100786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [134].Rahman N, Ali F, Basharat Z, et al. Vaccine design from the ensemble of surface glycoprotein epitopes of SARS-CoV-2: an immunoinformatics approach. Vaccines (Basel). 2020;8(3):423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [135].Rakib A, Sami SA, Islam MA, et al. Epitope-based immunoinformatics approach on nucleocapsid protein of severe acute respiratory syndrome-Coronavirus-2. Molecules. 2020;25(21):5088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [136].Tilocca B, Soggiu A, Sanguinetti M, et al. Immunoinformatic analysis of the SARS-CoV-2 envelope protein as a strategy to assess cross-protection against COVID-19. Microbes Infect. 2020;22(4):182–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [137].Cun Y, Li C, Shi L, et al. COVID-19 coronavirus vaccine T cell epitope prediction analysis based on distributions of HLA class I loci (HLA-A, -B, -C) across global populations. Hum Vaccin Immunother. 2021;17(4):1097–1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [138].Ibrahim HS, Kafi SK. A computational vaccine designing approach for MERS-CoV infections. In: Tomar N, editor. Immunoinformatics. New York NY: Springer US; 2020. p. 39–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [139].Tahir Ul Qamar M, F S, S A, et al. Reverse vaccinology assisted designing of multiepitope-based subunit vaccine against SARS-CoV-2. Infect Dis Poverty. 2020;9(1):132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [140].Khairkhah N, Aghasadeghi MR, Namvar A, et al. Design of novel multiepitope constructs-based peptide vaccine against the structural S, N and M proteins of human COVID-19 using immunoinformatics analysis. PLOS ONE. 2020;15(10):e0240577. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [141].Kumar A, Kumar P, Saumya KU, et al. Exploring the SARS-CoV-2 structural proteins for multi-epitope vaccine development: an in-silico approach. Expert Rev Vaccines. 2020;19(9):887–898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [142].Waqas M, Haider A, Sufyan M, et al. Determine the potential epitope based peptide vaccine against novel SARS-CoV-2 targeting structural proteins using immunoinformatics approaches. Front Mol Biosci. 2020;7:227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [143].Akhand MRN, Azim KF, Hoque SF, et al. Genome based evolutionary lineage of SARS-CoV-2 towards the development of novel chimeric vaccine. Infect Genet Evol. 2020;85:104517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [144].Shi J, Zhang J, Li S, et al. Epitope-based vaccine target screening against highly pathogenic MERS-CoV: an in silico approach applied to emerging infectious diseases. PLOS ONE. 2015;10(12):e0144475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [145].Khan S, Shaker B, Ahmad S, et al. Towards a novel peptide vaccine for middle east respiratory syndrome coronavirus and its possible use against pandemic COVID-19. 2Journal of Molecular Liquids 2020; 114706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [146].Basu A, Sarkar A, Maulik U. Strategies for vaccine design for Corona virus using Immunoinformatics techniques. bioRxiv. 2020;Preprint . [Google Scholar]
  • [147].Srivastava S, Kamthania M, Singh S, et al. Structural basis of development of multi-epitope vaccine against Middle East respiratory syndrome using in silico approach. Infect Drug Resist. 2018;11:2377–2391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [148].Sarkar B, Ullah MA, Johora FT, et al. Immunoinformatics-guided designing of epitope-based subunit vaccines against the SARS Coronavirus-2 (SARS-CoV-2). Immunobiology. 2020;225(3):151955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [149].Sahoo B, Kant K, Rai NK, et al. Identification of T-cell epitopes in proteins of novel human coronavirus, SARS-Cov-2 for vaccine development. Int J Appl Biol Pharm. 2020;11(2): 37–45 [Google Scholar]
  • [150].The Uniprot C. UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res. 2019;47(D1:D506–D515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [151].Saha S, Raghava GP. AlgPred: prediction of allergenic proteins and mapping of IgE epitopes. Nucleic Acids Res. 2006;34:W202–W209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [152].Maurer-Stroh S, Krutz NL, Kern PS, et al. AllerCatPro-Prediction of protein allergenicity potential from the protein sequence. Bioinformatics. 2019;35(17):3020–3027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [153].Dimitrov I, Naneva L, Doytchinova I, et al. AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics. 2014;30(6):846–851. [DOI] [PubMed] [Google Scholar]
  • [154].Muh HC, Tong JC, Tammi MT. AllerHunter: a SVM-pairwise system for assessment of allergenicity and allergic cross-reactivity in proteins. PLOS ONE. 2009;4(6):e5861. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [155].Samad A, Ahammad F, Nain Z, et al. Designing a multi-epitope vaccine against SARS-CoV-2: an immunoinformatics approach. J Biomol Struct Dyn. 2022; 40(1):14–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [156].Cheng J, Randall AZ, Sweredoski MJ, et al. SCRATCH: a protein structure and structural feature prediction server. Nucleic Acids Res. 2005;33(2):W72–W76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [157].Calis JJ, Maybeno M, Greenbaum JA, et al. Properties of MHC class I presented peptides that enhance immunogenicity. PLoS Comput Biol. 2013;9(10): e1003266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [158].Bui -H-H, Sidney J, Li W, et al. Development of an epitope conservancy analysis tool to facilitate the design of epitope-based diagnostics and vaccines. BMC Bioinformatics. 2007;8(1):361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [159].Chen VB, Arendall WB 3rd, Headd JJ, et al. MolProbity: all-atom structure validation for macromolecular crystallography. Biol Crystallogr. 2010;66(1):12–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [160].Hebditch M, Carballo-Amador MA, Charonis S, et al. Protein-Sol: a web tool for predicting protein solubility from sequence. Bioinformatics. 2017;33(19):3098–3100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [161].Gupta S, Kapoor P, Chaudhary K, et al. In silico approach for predicting toxicity of peptides and proteins. PLOS ONE. 2013;8(9):e73957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [162].Banerjee A, Santra D, Maiti S. Energetics and IC50 based epitope screening in SARS CoV-2 (COVID 19) spike protein by immunoinformatic analysis implicating for a suitable vaccine development. J Transl Med. 2020;18(1):281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [163].Bhasin M, Raghava GP. Prediction of CTL epitopes using QM, SVM and ANN techniques. Vaccine. 2004;22(23–24):3195–3204. [DOI] [PubMed] [Google Scholar]
  • [164].Andreatta M, Nielsen M. Gapped sequence alignment using artificial neural networks: application to the MHC Class I system. Bioinformatics. 2016;32(4):511–517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [165].Karosiene E, Lundegaard C, Lund O, et al. NetMHCcons: a consensus method for the major histocompatibility complex class I predictions. Immunogenetics. 2012;64(3):177–186. [DOI] [PubMed] [Google Scholar]
  • [166].Zhang H, Lund O, Nielsen M. The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding. Bioinformatics. 2009;25(10):1293–1299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [167].Rasmussen M, Fenoy E, Harndahl M, et al. Pan-specific prediction of peptide-MHC Class I complex stability, a correlate of T cell immunogenicity. J Iimmunol. 2016;197(4):1517–1524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [168].Sidney J, Assarsson E, Moore C, et al. Quantitative peptide binding motifs for 19 human and mouse MHC Class I molecules derived using positional scanning combinatorial peptide libraries. Immunome Res. 2008;4:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [169].Moutaftsi M, Peters B, Pasquetto V, et al. A consensus epitope prediction approach identifies the breadth of murine TCD8+-cell responses to vaccinia virus. Nat Biotechnol. 2006;24(7):817–819. [DOI] [PubMed] [Google Scholar]
  • [170].Peters B, Sette A. Generating quantitative models describing the sequence specificity of biological processes with the stabilized matrix method. BMC Bioinformatics. 2005;6:132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [171].Kim Y, Sidney J, Pinilla C, et al. Derivation of an amino acid similarity matrix for peptide: MHC binding and its application as a Bayesian prior. BMC Bioinformatics. 2009;10:394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [172].Dai Y, Chen H, Zhuang S, et al. Immunodominant regions prediction of nucleocapsid protein for SARS-CoV-2 early diagnosis: a bioinformatics and immunoinformatics study. Pathog Glob Health. 2020;1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [173].Giguère S, Drouin A, Lacoste A, et al. MHC-NP: predicting peptides naturally processed by the MHC. J Immunol Methods. 2013;400-401:30–36. [DOI] [PubMed] [Google Scholar]
  • [174].Stranzl T, Larsen MV, Lundegaard C, et al. NetCTLpan: pan-specific MHC Class I pathway epitope predictions. Immunogenetics. 2010;62(6):357–368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [175].Jurtz V, Paul S, Andreatta M, et al. NetMHCpan-4.0: improved peptide-MHC Class I Interaction predictions integrating eluted ligand and peptide binding affinity data. J Immunol. 2017;199(9):3360–3368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [176].Singh H, Raghava GPS. ProPred1: prediction of promiscuous MHC Class-I binding sites. Bioinformatics. 2003;19(8):1009–1014. [DOI] [PubMed] [Google Scholar]
  • [177].Larsen M, Lundegaard C, Lamberth K, et al. An integrative approach to CTL epitope prediction: a combined algorithm integrating MHC class I binding, TAP transport efficiency, and proteasomal cleavage predictions. Eur J Immunol. 2005;35:2295–2303. [DOI] [PubMed] [Google Scholar]
  • [178].Dhanda SK, Gupta S, Vir P, et al. Prediction of IL4 Inducing Peptides. Clin Dev Immunol. 2013;2013:263952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [179].Nagpal G, Usmani SS, Dhanda SK, et al. Computer-aided designing of immunosuppressive peptides based on IL-10 inducing potential. Sci Rep. 2017;7:42851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [180].Lata S, Bhasin M, Raghava GP. Application of machine learning techniques in predicting MHC binders. Methods Mol Biol. 2007;409:201–215. [DOI] [PubMed] [Google Scholar]
  • [181].Nielsen M, Andreatta M. NNAlign: a platform to construct and evaluate artificial neural network models of receptor–ligand interactions. Nucleic Acids Res. 2017;45(W1:W344–W349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [182].Sturniolo T, Bono E, Ding J, et al. Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices. Nat Biotechnol. 1999;17(6):555–561. [DOI] [PubMed] [Google Scholar]
  • [183].Nielsen M, Lundegaard C, Lund O. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method. BMC Bioinformatics. 2007;8(1):238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [184].Wang P, Sidney J, Dow C, et al. A systematic assessment of MHC Class II peptide binding predictions and evaluation of a consensus approach. PLoS Comput Biol. 2008;4(4):e1000048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [185].Jensen KK, Andreatta M, Marcatili P, et al. Improved methods for predicting peptide binding affinity to MHC class II molecules. Immunology. 2018;154(3):394–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [186].Singh H, Raghava GPS. ProPred: prediction of HLA-DR binding sites. Bioinformatics. 2001;17(12):1236–1237. [DOI] [PubMed] [Google Scholar]
  • [187].Guan P, Doytchinova IA, Zygouri C, et al. MHCPred: a server for quantitative prediction of peptide-MHC binding. Nucleic Acids Res. 2003;31(13):3621–3624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [188].Reche PA, Glutting JP, Reinherz EL. Prediction of MHC class I binding peptides using profile motifs. Hum Immunol. 2002;63(9):701–709. [DOI] [PubMed] [Google Scholar]
  • [189].Paul S, Sidney J, Sette A, et al. Tepitool: a pipeline for computational prediction of T cell epitope candidates. Curr Protoc Immunol. 2016;114:18.19.1–18.19.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [190].Saha S, Raghava GP. Prediction of continuous B-cell epitopes in an antigen using recurrent neural network. Proteins. 2006;65(1):40–48. [DOI] [PubMed] [Google Scholar]
  • [191].Chou PY, Fasman GD. Prediction of the secondary structure of proteins from their amino acid sequence. Adv Enzymol Relat Areas Mol Biol. 1978;47:45–148. [DOI] [PubMed] [Google Scholar]
  • [192].Emini EA, Hughes JV, Perlow DS, et al. Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide. J Virol. 1985;55(3):836–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [193].Karplus PA, Schulz GE. Prediction of chain flexibility in proteins. Naturwissenschaften. 1985;72(4):212–213. [Google Scholar]
  • [194].Kolaskar AS, Tongaonkar PC. A semi-empirical method for prediction of antigenic determinants on protein antigens. FEBS Lett. 1990;276(1–2):172–174. [DOI] [PubMed] [Google Scholar]
  • [195].El-Manzalawy Y, Dobbs D, Honavar V. Predicting linear B-cell epitopes using string kernels. J Mol Recognit. 2008;21(4):243–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [196].El-Manzalawy Y, Dobbs D, Honavar V. Predicting flexible length linear B-cell epitopes. Computational Systems Bioinformatics Conference. 2008;7:121–132. [PMC free article] [PubMed] [Google Scholar]
  • [197].de Vries SJ, Bonvin AMJJ. CPORT: a consensus interface predictor and its performance in prediction-driven docking with HADDOCK. PLOS ONE. 2011;6(3):e17695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [198].Dhanda SK, Vaughan K, Schulten V, et al. Development of a novel clustering tool for linear peptide sequences. Immunology. 2018;155(3):331–345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [199].Nielsen M, Lundegaard C, Lund O, et al. The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage. Immunogenetics. 2005;57(1–2):33–41. [DOI] [PubMed] [Google Scholar]

Articles from Pathogens and Global Health are provided here courtesy of Taylor & Francis

RESOURCES