Skip to main content
Virology Journal logoLink to Virology Journal
. 2011 Feb 8;8:55. doi: 10.1186/1743-422X-8-55

Promiscuous prediction and conservancy analysis of CTL binding epitopes of HCV 3a viral proteome from Punjab Pakistan: an In Silico Approach

Abida Shehzadi 1,, Shahid ur Rehman 1, Muhammad Idrees 2,
PMCID: PMC3042956  PMID: 21303499

Abstract

Background

HCV is a positive sense RNA virus affecting approximately 180 million people world wide and about 10 million Pakistani populations. HCV genotype 3a is the major cause of infection in Pakistani population. One of the major problems of HCV infection especially in the developing countries that limits the limits the antiviral therapy is the long term treatment, high dosage and side effects. Studies of antigenic epitopes of viral sequences of a specific origin can provide an effective way to overcome the mutation rate and to determine the promiscuous binders to be used for epitope based subunit vaccine design. An in silico approach was applied for the analysis of entire HCV proteome of Pakistani origin, aimed to identify the viral epitopes and their conservancy in HCV genotypes 1, 2 and 3 of diverse origin.

Results

Immunoinformatic tools were applied for the predictive analysis of HCV 3a antigenic epitopes of Pakistani origin. All the predicted epitopes were then subjected for their conservancy analysis in HCV genotypes 1, 2 and 3 of diverse origin (worldwide). Using freely available web servers, 150 MHC II epitopes were predicted as promiscuous binders against 51 subjected alleles. E2 protein represented the 20% of all the predicted MHC II epitopes. 75.33% of the predicted MHC II epitopes were (77-100%) conserve in genotype 3; 47.33% and 40.66% in genotype 1 and 2 respectively. 69 MHC I epitopes were predicted as promiscuous binders against 47 subjected alleles. NS4b represented 26% of all the MHC I predicted epitopes. Significantly higher epitope conservancy was represented by genotype 3 i.e. 78.26% and 21.05% for genotype 1 and 2.

Conclusions

The study revealed comprehensive catalogue of potential HCV derived CTL epitopes from viral proteome of Pakistan origin. A considerable number of predicted epitopes were found to be conserved in different HCV genotype. However, the number of conserved epitopes in HCV genotype 3 was significantly higher in contrast to its conservancy in HCV genotype 1 and 2. Despite of the lower conservancy in genotype 1 and 2, all the predicted epitopes have important implications in diagnostics as well as CTL-based rational vaccine design, effective for most population of the world and especially the Pakistani Population.

Background

Family Flaviviridae comprises small enveloped pathogens classified in three genera: Flavivirus, Pestivirus, and Hepacivirus. Members of these genera cause various diseases in humans and other animals such as birds, horses and pigs. The only genera Flavivirus contain more than 70 members including Hepatitis C Virus (HCV), Dengue virus, West Nile virus and tick-borne encephalitis virus [1-3].

HCV is a positive sense RNA virus affecting approximately 180 million people world wide and rate of infection in Pakistani population is about 10 million [4,5]. HCV genome contributes about 9400 nucleotides that encode single polyprotein of approximately 3010 to 3033 amino acids in length [6]. This single polyprotein is processed by viral as well as host proteases into three structural proteins (i.e. core, E1 and E2) and four non-structural proteins (i.e. NS2, NS3, NS4, and NS5A) [7].

HCV mainly spreads via blood supply, reuse of glass syringes and needles, unsterilized medical equipment, use of tooth brushes of HCV patients, etc [7] and causes of acute and chronic infections [8]. Clinical demonstrations of acute Hepatitus C Viral infection include Jaundice, Fever, Myalgia, Fatigue, Lethargy, Increased ALT, Anorexia and Fulminant hepatic failure [7]. About 80% of HCV infected individuals develop chronic infections [9]. Chronic liver infections develop chronic hepatitis, cirrhosis and hepatocellular carcinoma within a period of 10, 20 and 30 years respectively followed by viral infection [10,11]. Out of 70-80% chronically infected individuals, 20% develop cirrhosis and 1-5% individuals suffer from final stage of liver diseases [12]. Hepatic steatosis is the accumulation of lipids in hepatocytes and is reported for the cause of cirrhosis [13] with the more severe cases being reported in patients infected with HCV genotype 3a [14]. The prevelance of steatosis in Pakistani population is about 61.5-65.5% compared with 32.8-81.2% in western countries [15]. The percentage of males infected with HCV chronic liver stage is higher then females with the age of patients between 40-50 years [5].

HCV is classified into six genotypes each heaving various subtypes [16-18]. These genotypes are distributed differently in various parts of the world with the genetic variance between them is about one third. The genotypes 1, 2 and 3 have world wide distribution. But the significant differences are observed in subtype distribution. Subtype 1a is mostly found in North America and Europe followed by 2b and 3a. Subtype 1b is frequently found in South East Europe and Tunisia and 2c in North Italy. Genotype 4 is mainly restricted to Middle East and Central Africa and genotype 5 in South Africa. Genotype 6 is distributed throughout South East Asia and also being isolated from Hong Kong and Vietnam [17]. The most frequent HCV genotypic distribution in Pakistan is 3a [49.05%] followed by 3b [17.66%] [19]. The knowledge of HCV distribution is crucial for treatment therapy and vaccination because of its predictive value in terms of response to antiviral therapy and vaccination. Effective responses to antiviral therapy are normally associated with genotype 2 and 3 in comparison to any other genotype [20].

HCV replicates at about 1012 new HCV viruses/day. Replication is carried out by RNA dependent RNA polymerase. RNA polymerase lacks the "proofreading" ability that ensures the high mutation rate of about 8-18 mutations in genomic RNA/year [21,20]. Such a high mutation rate limits the treatment therapy and vaccination. The current treatment therapy for HCV is INF alpha along with ribavirin limited to about 50% population [22]. Although the response rate is not much deterring, but high dosage, long-term treatment and side effects limits the usage [23,21]. There is the possibility that after next few years, new antiviral agents such as inhibitors of the viral protease, helices or polymerase will further improve the response rate of the current therapeutic agents. However, antiviral therapy is not affordable in most developing countries, where the prevalence of HCV is generally the highest. Thus, given the huge reservoir of HCV worldwide, the development of an effective vaccine may be the cheapest way to control disease associated with HCV infection.

Development of an effective HCV vaccine requires understanding of immune response. Viral immune response is associated with Major Histocompatabiliy complex protein (MHC) and T lymphocytes/T cell. MHC are classified into 2 broad categories, MHC I and MHC II [24]. MHC initially recognizes the viral antigenic epitopes and presents to T lymphocytes for degradation. MHC I presents the antigenic epitopes to CD8+ T cells and MHC II presents to CD4+ T cells for viral degradation [25,26]. CD8 T cells also referred to as cytotoxic T cells (CTL or Tc), limit viral infections by initial recognizing and their subsequent killing infected cells and secreting cytokines. CD4 T referred to as helper cells or Th cells and provides growth factors and signals for generation and maintenance of CD8 T cells [27]. T cells recognize the antigens only when they are associated with MHC, surface glycoprotein exposed on surface of all vertebrate cells. The selection of T cell epitopes is also important because these are linear and hence easy to synthesize.

A particular vaccine developed against HCV can't be effective for Pakistani population because of variations in HCV genomic sequences and distribution with regard to geographical area. Since a large number of Pakistani population is infected by HCV3a and number of patients enrolled in public and hospitals is increasing day by day. So there is a current need to develop a vaccine against HCV in particular to HCV3a that will cover approximately maximum Pakistani population. The current vaccines are DNA vaccine, Peptide vaccine and epitopic vaccines. Epitopes are the small antigenic segments of viral proteins and causes infections in the host. Epitopic vaccines provide more potent and controlled immune response and eliminates the potential lethal effects of the use of whole viral proteins [28]. Promiscuous epitopes (epitopes capable of binding maximum number of HLA alleles) may overcome the population coverage. Secondly the conserved epitopes reduces antigen escape associated with the viral mutation [29]. So the present study was designed for the prediction of promiscuous epitopes and to analyze their conservancy in general population. Any mutation in the peptide/epitope will lower the conservancy, so it was hypothesized to analyze the pI value of the mutated amino acid residue, that if remain in the range as was in original epitope provides the likeliness of that particular epitope to be used for epitopic vaccine design having an effective control over viral mutation, immune response with minimum side effects.

Methods

Sequence Retrieval and Analysis

The sequence of fully sequenced HCV 3a genome and protein of Pakistani origin was retrieved from NCBI [GU294484]. The number of individual bases in the genome i.e. the number of adenine; cytosine, guanine and thymine were calculated from DDBJ database. The molecular weight of proteins, percentage of highly repeated amino acid and the least repeated amino acid in the viral protein was calculated by using sequence and search analysis tool at PIR database (http://pir.georgetown.edu/).

Epitope Prediction

Promiscuous epitopes of HCV 3a viral proteins were predicted for HLA I and HLA II binding alleles using freely available immunoinformatics tools such as ProPred I, and ProPred respectively. In comparison to other epitope prediction tools, Propred 1 and Propred cover maximum number of Human Leukocyte antigens i.e. HLA and being used for epitopic prediction for HBV and tuberculosis. ProPred1 allows the user to predict antigenic apitopes for 47 MHC I alleles and ProPred allows epitopes prediction for 51 MHC II alleles. Predictions through these tools can be carried out at various thresholds from 1 to 10%. The algorithms designed for the working of these tools are based on linear coefficients of matrices. Maximum of the matrics were retrieved from BIMASS where the score of each peptide is calculated in multiplication and/or sum up manner. For example the score of following peptide "PACDPGRAA" can be calculated by using following equation:

Score = P(1) × A(2) × C(3) × D(4) × P(5) × G(6) × R(7) × A(8) × A(9)

Score = P(1) + A(2) + C(3) + D(4) + P(5) + G(6) + R(7) + A(8) + A(9)

Where P (1) is score of P at position 1.

Only the promiscuous epitopes with score higher than the chosen threshold score were assigned as predicted epitopes for the selected HLA alleles [30]. For the following study the default threshold i.e. 4% was used where the sensitivity and specificity are nearly the same for most of the HLA alleles available in ProPred1 and ProPred server. Moreover, MHC I alleles were predicted by keeping the proteosome and immunoproteosome filters on at 5% threshold because most of the MHC binders having a proteosomal cleavage site at C-terminal have higher likelihood to be T-cell epitopes [31]. The predicted promiscuous epitopes were positioned in the table in a decreasing order of their score.

Epitope Conservancy Analysis

All the predicted epitopes of HCV 3a proteins of Pakistani origin were subjected for worldwide conservancy analysis among HCV genotype 1, 2 and 3. 5 sequences against each HCV protein (used for epitope prediction) were retrieved from NCBI randomly. The predicted epitopes of HCV 3a (Pakistani origin) along with 5 selected sequences of individual genotypes (genotype 1, 2 and 3; one at a time) were submitted to epitope conservancy analysis tool available at IEDB database (http://tools.immuneepitope.org/tools/conservancy/iedb_input). All the epitopes having 77-100% conservancy were selected while rejecting the epitopes having variation at the anchor residues. The anchor residues in the predicted epitopes were highlighted by making it bold. The epitopes that were 100% conserved in the selected proteins of the 3 viral genotypes 1, 2 and 3 were also fully bold. Epitopes with 88/77% conservancy were with single or double amino acid variation respectively and to highlight them bold format was used in the conservancy column against each genotype.

Asteric sign (*) indicates that one out of five selected sequences either does not respond to epitope conservancy or have conservancy lower then 77%. Double asteric sign (**) indicates that only one sequence responds for 77-100% conservancy to the selected epitope.

Validation of varied amino acids using pI value

The Peptides with single or double amino acid variation were analyzed for their hydropathic characteristics or pI value [32]. The pI gives the information that the varied residue retained the amino acid group or diverted from its normal group in a particular peptide under consideration and thus provides information to be used or their rejection. All the varied amino acid residue with diverted group (with considerable change of pI value) were separated from other using superscript "D" for single variation and "DD" for diverted group for doubly varied residues. The superscript "D" in doubly varied residues of particular peptides represents the partial variation i.e. one of the varied residue retained the amino acid group while other residue shifted the amino acid group by a considerable change of pI value.

Results

HCV 3a genome of Pakistani origin comprises 9474 bp with GC content 2622 and 2700 respectively. The GC contents are 12.35% higher then AT contents. The genome encodes a polyprotein that subsequently get fragmented into structural and non structural protein of obvious molecular weight. The envelope protein E2 comprises highest moleculat weight 38755.3 KDa (Table 1). Leucine (L) a neutral nonpolar amino acid residue has the highest percent of repetition (13.1%) in E2 protein. The least repeated residue of E2 is a basic polar Lysine (K) (1.4%). The shortest segment viral protein is NS4a (5751.69 KDa molecular weight) comprising 54 amino acid residues. Leucine (L) and Valine (V) have highest percentage of repetition (14.8) and Histidine (H), Methionine (M), Threonine (T) and Tryptophan (W) are the least repeated amino acid residues (1.9%). The molecular weight of other viral proteins and percent repetition of their amino acid residue for were listed in Table 1. The percentage of amino acid residues gives an out look for their pI value and their probability of incidence in the antigenic epitopes.

Table 1.

It comprises the data of HCV genome size, Proteins, Molecular weight and %age of highly repeated and least repeated amino acid residues in individual bases

Bases No. Proteins aa Number Mol. Wt. Highly repeated aa % of repetition Least repeated aa % of repetition
Total bp 9474 Capsid 114 12985.8 R 18.4 C/F 0.9

A 1974 Core 75 7638.88 L 16 E/K/M/Y 1.3

C 2700 E1 190 20643.9 V 11.1 E 1.1

G 2622 E2 350 38755.3 L 13.1 K 1.4

T 2178 NS3 149 15423.6 A/G 11.4 N 0.7

NS4a 54 5751.69 L/V 14.8 H/M/T/W 1.9

NS4b 194 20167.5 A 13.4 C 0.4

NS5a-1a 62 6700.72 G 14.5 E/D 1.6

NS5a-1b 101 11224.6 P 11.9 K/W 1

F (Phenylalanine), I (Isoleucine), L (Leucine), M (Methionine), V (Valine), W (Tryptophan) and Y (Tyrosine) were mainly the anchor residues for MHC II predicted epitopes and are nonpolar in nature. Total 150 epitopes were predicted against 51 alleles of MHC II (Table 2). The highest number of epitopes was represented by E2 protein comprising 20% of all MHC II predicted epitopes. VFLLNPCGL, FVILVFLLL, WHINSTVLH, FNLLDVPKA, LELINTHGS, VQYLYGVGS are the promiscuous binders of 45-50 MHC II alleles. E2 is followed by NS2 and NS4B proteins representing 14.66% of the predicted MHC II epitopes. In case of NS2 VRAHVLVRL, VILLTSLLY and VRLCMFVRS are the best binders both in term of score and the HLA allele coverage (50-51 MHC II alleles). FFNILGGWV, VNLLPAILS and VVNLLPAIL are the best binders of NS4b protein both in terms of HLA coverage (41 HLA coverage for the first epitope and 51 for the next 2 epitopes) and binding efficiency. LVVGVICAA, FNILGGWVA, WQKLEAFWH, IQYLAGLST and VVGVICAAL are also the epitopes of good quality covering 31 to 35 HLA alleles available in ProPred. For the NS5a_1a only three epitopes (MRLAGPRTC, FISCQKGYK and VVSTRCPCG) were predicted as promiscuous binders with the binding score higher then the selected threshold. Out of these three epitopes MRLAGPRTC is capable of binding all the HLA alleles available in ProPred server while FISCQKGYK and VVSTRCPCG bind 22 and 25 HLA alleles respectively. The predicted promiscuous binders against other proteins were also summarized in table 2.

Table 2.

Predicted HLA II epitopes HCV Proteins of Pakistani origin and their conservancy in Genotype 1, 2 and 3 worldwide

Epitope start Position Predicted T-cell epitopes HLA alleles HCV genotype 1 HCV Genotype 2 HCV Genotype 3
Capsid

43 LGVRATRKA 23 LGVRATRKTD LGVRATRKT LGVRATRKTD

36 LPRRGPRLG 15 LPRRGPRLG LPRRGPRLG LPRRGPRLG

106 WGPNDPRRR 16 WGPTDPRRR WGPTDPRHRD WGPNDPRRR

34 YVLPRRGPR 24 YLLPRRGPR YLLPRRGPR YVLPRRGPR

21 VKFPGGGQI 8 VKFPGGGQI * VKFPGGGQI VKFPGGGQI

35 VLPRRGPRL 9 VLPRRGPRL

45 VRATRKASE 25 VRATRKTSED VRATRKTSED VRATRKTSED

30 VGGVYVLPR 39 VGGVYLLPR * VGGVYLLPR VGGVYVLPR

15 IRRPQDVKF 6 IRRPQDVKF

95 WLLSPRGSR 28 WLLSPRGSR WLLSPRGSR WLLSPRGSR

29 IVGGVYVLP 3 IVGGVYLLP * IVGGVYLLP IVGGVYVLP

82 WPLYGNEGC 10 WPLYGNEGC WPLYGNEGC WPLYGNEGC

85 YGNEGCGWA 11 YGNEGCGWA YGNEGCGWA * YGNEGCGWA

33 VYVLPRRGP 1 VYLLPRRGP VYLLPRRGP VYVLPRRGP

Core

61 FLLALLSCL 50 FLLALLSCL FLLALLSCI FLLALLSCL

64 LALLSCLIH 45 LALLSCLTVDD LALLSCLIH *

15 FADLMGYIP 41 FADLMGYIP FADLMGYIP FADLMGYIP

24 LVGAPVGGV 44 LVGAPLGGA LVGAPVGGV *

63 LLALLSCLI 36 LLALLSCLTD LLALLSCITD LLALLSCLI *

62 FLLALLSCL 24 FLLALLSCL FLLALLSCI FLLALLSCL *

32 VARALAHGV 10 VARALAHGV VARALAHGV

21 YIPLVGAPV 28 YIPLVGAPL YIPVVGAPL YIPLVGAPV

19 MGYIPLVGA 26 MGYIPLVGA MGYIPVVGA MGYIPLVGA

E1

58 YVGATTASI 41 YVGATTASI *

140 MVVAHILRL 39 MVVAHILRL*

2 WRNTSGLYV 27 WRNTSGLYV

138 VGMVVAHIL 28

56 VKYVGATTA 21 VRYVGATTAD *

9 YVLTNARSN 31 YVLTNDCSNDD

161 WGVLAGLAY 15 WGVLAGMAY WGVVFGLAY WGILAGLAY

93 FLVGQAFTF 11 FLVGQLFTF FLVGQAFTF

181 IIMVMFSGV 91 IIMVMFSGV

130 MMMNWSPAV 35 MMMNWSPTAD MMMNWSPAM

134 WSPAVGMVV 6 WSPAMGMVV *

132 MNWSPAVGM 14 MNWSPAMGM *

169 YYTMQGNWA 18 YYSMQGNWA

47 WTPMTPTVA 21 WTPVTPTVA *

172 MQGNWAKVA 25 MVGNWAKVLD MQGAWAKVID WTPVTPTVAD *

145 ILRLPQTLF 19 ILRLPQTLF

E2

122 MLPHHRPVV 3

151 VFLLNPCGL 48

337 WEFVILVFL 4 WEFIVLVFL

339 FVILVFLLL 46 FIVLVFLLL

35 WHINSTVLH 41

342 LVFLLLADA LLFLLLADA LLFLLLADA LVFLLLADA

100 VLLAYAPRP 50

198 FRPLLPHRL 47

218 VRLGALVDT 12

62 FNLLDVPKA 45

26 LELINTHGS 46

57 FYYHKFNLL 12 FYYHKFNSSD FYYHKFNSTDD

83 VGPLDRCQH 26

58 YYHKFNLLD 24

286 LLHSTTELA 17 LLHSTTEWA LLHSTTELA

129 VVVGTTDPK 14 VVVGTTDKLDD VVVGTTDRLDD * VVVGTTDAK

320 VQYLYGVGS 46 VQYLYGVGS VQYLYGVGS

159 LLVVGGLGG 14

293 LAILPCSFT 7 LAILPCSFT

335 LKWEFVILV 4 LKWEFIVLV

322 YLYGVGSGM 5 YLYGVGSSID YLYGVGSGM

300 FTPMPALST 17 FTPMPALST

245 FYTVQGEDV 4

18 IVRGPEQRL 26

100 VLLAYAPRP 4

257 VWHRFTAAC 19 VEHRLTAACD *

206 LLQETSRGH 8

1 YITGGTAAR 8

267 WTRGERCDI 10 WTRGERCEI

310 IHLHQNIVD 11 IHLHQNIVD * IHLHQNIVD

NS2

101 VRAHVLVRL 51 VRAHVLVRL

62 VILLTSLLY 50 VILLTSLLY *

73 LVFDIAKLL 24 LVFDITKLLD * LVFDITKLLD * LIFDITKLLD

153 LKDLAVATE 7 LKDLAVATE *

113 FVRSVTGGK 37

130 VGRWFNTYL 11 VGRWFNTYL *

123 FQMAILSVG 31 FQMIILHVGD

137 YLYDHLAPM 21 YLYDHLAPM

74 VFDIAKLLIA 23 VFDITKLLLA D * VFDITKLLLAD

107 LVRLCMFVR 36 LVRLCMLVR

108 VRLCMFVRS 51 VRLCMLVRS *

89 YFVRAHVLV 33 YFVRAHVLV

11 ILVLFGFFT 15

37 YAICRCESA 18 IINGLPVSAD YTICRCESAD *

33 WWNQYAICR 8 WWNQYTICRD

185 ILCGLPVSA 10 IINGLPVSA * IINGLPVSA ILCGLPVSA

145 MQHWAAAGL 18 MQHWAAAGL

50 VPPLLARGS 21 VPSLLARGSD *

88 LYLIQAAIT 35 LYLIQTAITD *

158 VATEPVIFS 14 VAVEPVVFSD VAVEPVVFS VATEPVIFS

37 YAICRCESA 19 YTICRCESAD *

175 WGADTAACG 11 WGADTAACG * WGADTAACG * WGADTAACG

NS3

4 VQVLSTATQ 46 VQIVSTATQ VQVLSSVTQD

43 LQMYTNVDQ 42

129 VCTRGVAKA 21 VCTRGVAKA VCARGVAKSDD *

24 WTVYHGAGS 13 WTVYHGAGT WTVYHGAGN

84 VIPARRRGD 18 VIPVRRRGD *

138 LQFIPVETL 45

140 FIPVETLST 43 FIPVENLGTD

6 VLSTATQTF 19 IVSTATQTF

53 LVGWPAPPG 29 LVGWPAPQG LVGWPSPPGD

27 YHGAGSRTL 22 YHGAGTRTI

14 FLGTTLGGV 10

77 LVTREADVI 25 LVTRHADVI D * LVTRNADVID *

98 LSPRPLACL 12 LSPRPLSTLD

124 IFRAAVCTR 44

NS4a

23 VVIVGHIEL 43 VVIVGRIILDD VVIVGRIVLDD VVIVGHIEL

3 WVLLGGVLAA 43 WVLVGGVLAA WVLVGGVLAA WVLLGGVLAA

4 VLLGGVLAAL 40 VLVGGVLAAL VLVGGVLAAL VLLGGVLAAL

38 VPDKEVLYQ 11 VPDKEVLYQ *

24 VIVGHIELG 8 VIVGHIELG

10 LAALAAYCLS 8 LAALAAYCLT * LAALAAYCLS LAALAAYCLS

16 YCLSVGCVV 6 YCLSTGCVVD YCLSVGCVV

26 VGHIELGGK 9 VGHIELGGK

25 IVGHIELGG 29 IVGHIELGG

20 VGCVVIVGH 15 VGCVVIVGH

9 VLAALAAYC 9 VLAALAAYC* VLAALAAYC VLAALAAYC

29 IELGGKPAL 14 IELGGKPAL

NS4b

81 FFNILGGWV 41 FFNILGGWV

153 VNLLPAILS 51 VNLLPAILS VNLLPAILS VNLLPAILS

152 VVNLLPAIL 51

39 WNFVSGIQY 16 WNFISGIQY WNFISGIQY WNFVSGIQY

165 LVVGVICAA 35 LVVGVVCAA LVVGVVCAA LVVGVICAA

82 FNILGGWVA 32 FNILGGWVA FNILGGWVA FNILGGWVA

81 FFNILGGWV 5 FFNILGGWV

63 LMAFAASVT 9 LMAFTASITD LMAFTAAVTDD LMAFTASVTD

27 WQKLEAFWH 35 WQKLEVFWAD WQKLEAFWH *

167 VGVICAALL 11 VGVVCAAIL VGVVCAAIL VGVICAAIL

45 IQYLAGLST 35 IQYLAGLST IQYLAGLST IQYLAGLST

64 MAFAASVTS 23 MAFTASITS MAFTAAVTSDD MAFTASVTSD

84 ILGGWVATH 24 ILGGWVAAQDD ILGGWVAAQDD ILGGWVATH

103 VVSGLAGAA 10 VGAGLAGAAD VVSGLAGAA

166 VVGVICAAL 31 VVGVVCAAI VVGVVCAAI VGVICAAIL

85 LGGWVATHL 3 LGGWVAAQLDD LGGWVAAQLDD LGGWVATHL

60 VASLMAFAA 15 VASLMAFTAD

41 FVSGIQYLA 8 FISGIQYLA FISGIQYLA FVSGIQYLA

139 FKIMGGELP 21 FKIMSGEVPD FKIMGGEFP *

9 LQRATQQQA 14 LQRATQQQA *

122 LDILAGYGA 6 LDILAGYGA *

104 VSGLAGAAI 3 VSGLAGAAI

NS5a_1a

39 MRLAGPRTC 51 MRIVGPRTC * FISCQKGYRD * MRLAGPRTC*

3 FISCQKGYK 22 FFSCQRGYKDD * FISCQKGYK *

19 VVSTRCPCG 25 VMSTRCPCG *

NS5a_1b

73 LLRDEITFV 20 LLRDEVTFQD* LLRDEVTFQ D ** LLRDEITFV *

16 WRVAANSYV 33 WRVAAEEYVDD * WRVAASEYVD WRVAANSYV

55 FTEVDGVRL 4 FTELDGVRL* FTEVDGVRL ** FTEVDGVRL

80 FVVGLNSYA 25 FVVGLNSYA *

32 FHYITGATE 16 FHYITGATE

61 VRLHRYAPP 27 VRLHRYAPA* VRLHRYAPP *

87 YAIGSQLPC 20 YVVGSQLPC * VRLHRYAPAD ** YAIGSQLPC *

23 YVEVRRVGD 14 YVEVTRVGDD * YVEVTRVGDD ** YVEVRRVGD

Bold amino acid residues in T-cell Epitope column indicates the anchor residues

Bold individual amino acid residues in HCV Genotype 1, 2 and 3 columns indicated the variation in peptide in comparison to the predicted epitope

*Indicates that one of the protein sequence selected for epitope conservancy either does not respond or have conservancy lower then 70%

** Indicates that only one of the protein sequence from selected sequences respond to epitope conservancy

D Indicates that amino acid residue in case of single/double variation diverted their group compared to primary epitope using pI value

DD Indicates that both amino acid residues in case of double variation diverted their group compared to primary epitope using pI value

Total 69 epitopes were predicted as promiscuous epitopes for MHC I alleles. The anchor residues in case of MHCI are quite varying both in amino acid residues and also in their nature. Mostly represented anchor residues are neutral nonpolar and neutral polar. However, quite small percentage of anchor residues were also acidic polar and basic polar in nature. The highest number of MHC I binding epitopes were represented by NS4b protein comprising 26% of all MHC I predicted epitopes. NFVSGIQYL epitope of NS4b is the best promiscuous binder of highest binding score. NS4b is followed by NS2, E2 and NS3 proteins representing 20.28% (NS2 epitopes) and 11.59% (for E2 and NS3). In case of NS2, 14 promiscuous epitopes were predicted with varying binding efficiency. GSRDGVILL, DGVILLTSL, WAAAGLKDL and LQVWVPPLL are the good binders both in term of score and the HLA allele coverage (21, 28, 27 and 28 alleles respectively). E2 predicted epitopes covers 20 to 28 HLA alleles except the PLLHSTTEL epitope that covers only 11 HLA alleles but with highest binding efficiency. NS3 epitopes covers 8 to 25 HLA alleles and were also ranked on the basis of their binding efficiency predicted by the score. The least represented epitopes were by NS5a_1a. It comprises only one epitope (HVKNGSMRL) as predicted promiscuous binders for 16 MHC I binding alleles. The promiscuous binders of MHC I for other proteins were also predicted and summarized in table 3.

Table 3.

Predicted HLA I epitopes HCV Proteins of Pakistani origin and their conservancy in Genotype 1, 2 and 3 worldwide

Epitope start Position Predicted T-cell epitopes HLA alleles HCV genotype 1 HCV Genotype 2 HCV Genotype 3
Capsid

38 RRGPRLGVR 9 RRGPRLGVR RRGPRLGVR RRGPRLGVR

35 VLPRRGPRL 25 VLPRRGPRL

Core

55 PGCSFSIFL 8 PGCSFSIFL PGCSFSIFL PGCSFSIFL *

41 RALEDGINF 20 RVLEDGVNF ** RALEDGINF *

7 VIDTLTCGF 15 VIDTLTCGF VIDTITCGF * VIDTLTCGF *

35 ALAHGVRAL 24 ALAHGVRVL ALAHGVRVL ALAHGVRAL *

24 LVGAPVGGV 18 LVGAPLGGA * LVGAPVGGV *

26 GAPVGGVAR 9 GAPLGGAAR * GAPLGGVAR GAPVGGVAR *

E1

135 SPAVGMVVA 14 SPAMGMVVA *

86 GDVCGAVFL 19 GDLCGSVFLD GDVCGAVMI GDMCGAVFL *

144 HILRLPQTL 22 HILRLPQTL *

156 IAGAHWGVL 27 IAGAHWGVL IAGAHWGIL

64 ASIRGHVDL 25 ASIRSHVDLD

E2

285 PLLHSTTEL 11 PLLHSTTEWD PLLHSTTEL

305 ALSTGLIHL 25 ALTTGLIHL ALSTGLIHL

227 CSFTPMPAL 20 CSFTTLPALD * CSFTPMPAL

71 QQLQAHHFL 27

157 CGLLVVGGL 28

212 RGHIQPVRL 24

6 TAARGGQGL 25

157 CGLLVVGGL 28

NS2

172 VITWGADTA 6 VITWGADTA *

75 FDIAKLLIA 12 FDITKLLLAD FDITKLLLAD * FDITKLLIAD

70 YPSLVFDIA 15 YPSLIFDITDD

57 GSRDGVILL 21 GGRDAVILLD ** GGRDAVILLD * GSRDGVILL

60 DGVILLTSL 26 DGVILLTSL *

148 WAAAGLKDL 27 WAASGLRDLDD** WAAAGLKDL *

50 VPPLLARGS 11 VPPLLARGS *

46 LQVWVPPLL 28 LHVWVPPLNDD LHVWVPPLNDD * LQVWVPPLL *

117 VTGGKYFQM 16 VVGGKYFQMD *

65 LTSLLYPSL 23 LTSLLYPSL

6 TLGAGILVL 48 TLGAGVLVL *

73 LVFDIAKLL 31 LVFDITKLLD LVFDITKLLD * LIFDITKLLD

145 MQHWAAAGL 26 MQHWAAAGL

178 DTAACGDIL 21 DTAACGDII DTAACGDIID * DTAACGDIL

NS3

119 GHVAGIFRA 8 GHAVGIFRA * GHVVGLFRA *

27 YHGAGSRTL 14 YHGAGTRTI YHGAGNKTLD

128 AVCTRGVAK 8 AVCTRGVAK AVCTRGVAK *

57 PAPPGAKSL 11 PAPQGARSLDD * PSPPGTKSLDD

98 LSPRPLACL 25 LSPRPLSTLD

95 ASLLSPRPL 24

130 CTRGVAKAL 20 CTRGVAKAV

7 LSTATQTFL 24

NS4a

3 WVLLGGVLA 11 WVLVGGVLA WVLVGGVLA WVLLGGVLA

23 VVIVGHIEL 21 VVIVGRIILDD VVIVGRIVLDD VVIVGHIEL

10 LAALAAYCL 24 LAALAAYCL * LAALAAYCL LAALAAYCL

5 LLGGVLAAL 27 LVGGVLAAL LVGGVLAAL LLGGVLAAL

NS4b

96 PQSSSAFVV 6 PQSSSAFVV

40 NFVSGIQYL 30 NFISGIQYL NFISGIQYL NFVSGIQYL

81 FFNILGGWV 13 FFNILGGWV

46 QYLAGLSTL 17 QYLAGLSTL QYLAGLSTL QYLAGLSTL

102 FVVSGLAGA 9 FVGAGLAGAD FVVSGLAGA

54 LPGNPAVAS 14 LPGNPAIAS LPGNPAIAS LPGNPAVAS

161 SPGALVVGV 14 SPGALVVGV SPGALVVGV SPGALVVGV

141 IMGGELPNA 7 IMGGEFPTAD *

164 ALVVGVICA 11 ALVVGVVCA ALVVGVVCA ALVVGVICA

117 LGRVLLDIL 22 LGKVLVDILD* LGKVLVDILD LGKVLLDILD *

59 AVASLMAFA 9 AIASLMAFTD AIASLMAFTD AVASLMAFTD

152 VVNLLPAIL 15

113 GIGLGRVLL 24 GIGLGKVLLD *

56 GNPAVASLM 12 GNPAIASLM GNPAIASLM GNPAVASLM

52 STLPGNPAV 21 STLPGNPAI STLPGNPAV STLPGNPAV

85 LGGWVATHL 26 LGGWVAAQLDD LGGWVAAQLDD LGGWVATHL

145 ELPNAEDVV 11

99 SSAFVVSGL 22 SSAFVVSGL

NS5a_1a

33 HVKNGSMRL 16 HVKNGSMRI * HVKNGSMRI ** HVKNGSMRL

NS5a_1b

49 VPAAEFFTE 6 VPAPEFFTE * VPAPEFFTE ** VPAAEFFTE

79 TFVVGLNSY 10 TFQVGLNQYD * TFTVGLNSFD * TFTVGLNSYD *

76 DEITFVVGL 19 DEVTFQVGLD * DEVTFTVGLD* DEITFMVGL *

Bold amino acid residues in T-cell Epitope column indicates the anchor residues

Bold individual amino acid residues in HCV Genotype 1, 2 and 3 columns indicated the variation in peptide in comparison to the predicted epitope

*Indicates that one of the protein sequence selected for epitope conservancy either does not respond or have conservancy lower then 70%

** Indicates that only one of the protein sequence from selected sequences respond to epitope conservancy

D Indicates that amino acid residue in case of single/double variation diverted their group compared to primary epitope using pI value

DD Indicates that both amino acid residues in case of double variation diverted their group compared to primary epitope using pI value

Out of total 150 predicted MHC II epitopes, 75.33% were (77-100%) conserve in genotype 3 (Table 1) against the randomly selected viral proteins. Out of 75.33% conserved peptides of genotype 3, 71.68% peptides were 100% conserve while 22.12% peptides were having single residue variation (88% epitope conservancy). Only the 40% peptides of singly varied residues diverted their amino acid group and the pI value while 60% singly varied residues retained the amino acid group as was in the predicted epitope of HCV 3a proteins. 6.19% peptides comprised the 77% epitope conservancy because of double residue variation in the peptides of general population in contrast to predicted epitopes of HCV 3a of Pakistani origin. Out of 6.19%, doubly varied amino acid residues 42.85% peptides retained their amino acid group and nearly same pI value as in case of predicted epitope while 28.57% peptides were having partial group divertion and 28.57% (of doubly varied amino acid residues) peptides diverted their amino acid group because of considerable variation in the pI value. Similar data was also obtained for the HCV genotype 1 and 2 consisting 47.33% and 40.66% conservancy respectively. However, in contrast to genotype 3, only 23.94% predicted epitopes were 100% conserve in randomly selected sequences of genotype 1 and 22.95% in genotype 2. Their rate of single/double residue variation was also predicted and expressed as figure 1.

Figure 1.

Figure 1

A comparative analysis of HCV 3a Predictive epitopes against MHC II alleles and their conservancy analysis in Genotype 1, 2 and 3 worldwide.

Out of total 69 predicted MHC I epitopes, 78.26% were (77-100%) conserve in genotype 3 (Table 2) against the randomly selected viral proteins. Out of 78.26% conserved peptides of genotype 3, 72.22% peptides were 100% conserve while 22.22% peptides were having single residue variation (88% epitope conservancy). 40.66% peptides of singly varied residues retained the amino acid group as was in the predicted epitope of HCV 3a proteins while 58.33% singly varied residues diverted their amino acid group and the pI value. 5.5% peptides comprised the 77% epitope conservancy because of double residue variation in the peptides of general population in contrast to predicted epitopes of HCV 3a of Pakistani origin. Out of 5.5%, doubly varied amino acid residues 66.66% peptides were having partial group divertion and 33.33% (of doubly varied amino acid residues) peptides diverted their amino acid group because of considerable variation in the pI value. Similar data was also obtained for the HCV genotype 1 and 2 consisting 55.07% conservancy. However, in contrast to genotype 3, only 21.05% predicted epitopes were 100% conserve in randomly selected sequences of genotype 1 and 2. Their rate of single/double residue variation was also predicted and expressed as figure 2.

Figure 2.

Figure 2

A comparative analysis of HCV 3a Predictive epitopes predicted against MHC I and their conservancy analysis in Genotype 1, 2 and 3 worldwide.

Discussion

The modern technique for control of HCV infection is a vaccine preparation that can specifically induce antibody-mediated immunity. The rapid advancements in the computational methodologies and immunoinformatics/immuno-bioinformatics provide new strategies for the synthesis of antigen specific epitopic vaccine against infectious agents such as viruses and pathogens. Epitopic vaccine against HIV, malaria and tuberculosis provided promising results and supported the defensive and therapeutic uses of these vaccines [33]. Thus in the present study, a new systematic immunoinformatics approach was applied for the predicted antigenic epitopes of HCV 3a proteins of Pakistani origin followed by diversity and conservancy in other genotypes (1,2 and 3) in randomly selected HCV sequences from NCBI and mainly belong to Thailand, Cuba, UK, USA, China, Japan, France, Italy and Germany. The immunogenic epitopes identified were nanomers and could be used diagnostically to detect HCV specific CTL responses in the patients and after vaccination. A CTL based HCV vaccine might not efficient enough to prevent from infection but it might protect the body from the disease. The analysis showed that the minimal number of epitopes required to represent the complete anigenicity of the whole proteins are significantly smaller then required to represent full length proteins. The majority of the epitopes reported here had intermediate to high HLA binding affinity.

By the use of an efficient CTL based epitope delivery technology; the predicted epitopes could eventually become vaccines in their own or fused as polytopes. The design of the HCV vaccine using conserved epitopes can avoid viral mutation and thus provides more efficient results. The study shows that the predicted epitopes were highly conserved in HCV genotype 3 and also but less conserved in genotype 1 and 2 both for MHC I and MHC II. Moreover, to ensure the viral detection at all stages of its intracellular evolution we have used all the viral proteins. Therefore, the total number of predicted epitopes were also maximized in correspond to the number of covered proteins used for the analysis.

Abbreviations

HCV: hepatitis C virus; HLA: human leukocyte antigen; MHC: major histocompatability complex; CTL: cytotoxic T lymphocytes.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

AS and SH designed the study. AS performed the immunoinformatics analysis and drafted the manuscript. MI critically reviewed the manuscript. All authors have read and approved the final manuscript.

Contributor Information

Abida Shehzadi, Email: abida@cemb.edu.pk.

Shahid ur Rehman, Email: shahid926@gmail.com.

Muhammad Idrees, Email: idreeskhan@cemb.edu.pk.

References

  1. Qi R, Zhang L, Chi C. Biological characteristics of dengue virus and potential targets for drug design. Acta Biochim Biophys Sin. 2008;40:91–101. doi: 10.1111/j.1745-7270.2008.00382.x. [DOI] [PubMed] [Google Scholar]
  2. Blitvich JB, Fernandez-Salas I, Contreras-Cordero FJ, Marlenee LN, Gonzalez-Rojas IJ, Komar N, Gubler JD, Calisher HC, Beaty JB. Serologic Evidence of West Nile Virus Infection in Horses, Coahuila State, Mexico. Emerging Infectious Diseases. 2003;9(7):853–856. doi: 10.3201/eid0907.030166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Moennig V. Introduction to classical swine fever: virus, disease and control policy. Veterinary Microbiology. 2000;73:93–102. doi: 10.1016/S0378-1135(00)00137-1. [DOI] [PubMed] [Google Scholar]
  4. Stiffler DJ, Nguyen M, Sohn AJ, Liu C, Kaplan D, Seeger C. Focal Distribution of Hepatitis C Virus RNA in Infected Livers. PLoS ONE. 2009;4(8):1–7. doi: 10.1371/journal.pone.0006661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Raja SN, Janjua AK. Epidemiology of hepatitis C virus infection in Pakistan. J Microbiol Immunol Infect. 2008;41:4–8. [PubMed] [Google Scholar]
  6. Raja SN, Singh NN, Janjua AK, Najam-Ul-Haq R. Hepatitis C virus infection: An enigma continues. Medicine Today. 2006;4(3):93–101. [Google Scholar]
  7. Das RB, Kundu B, Khandapkar R, Sahni S. Geographical distribution of hepatitis C virus genotype in India. Indian J Pathol Microbiol. 2002;45(3):323–328. [PubMed] [Google Scholar]
  8. Rychłowska M, Bieńkowska-Szewczyk K. Hepatitis C- new developments in the studies of the viral life cycle. Acta Biochimica Polonica. 2007;54(4):703–715. [PubMed] [Google Scholar]
  9. Jobarteh Modou, Malfroy Marine, Peterson Ingrid, Jeng Adam, Ramu Sarge-Njie, Alabi Abraham, Peterson Kevin, Cotten Matt, Hall Andrew, Sarah Rowland-Jones, Whittle Hilton, Tedder Richard, Jaye Assan, Mendy Maimuna. Seroprevalence of hepatitis B and C virus in HIV-1 and HIV-2 infected Gambians. Virology Journal. 2010;7:230–239. doi: 10.1186/1743-422X-7-230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Re DV, Caggiari L, Vita DS, Mazzaro C, Lenzi M, Galli M, Monti G, Ferri C, Zignego LA, Gabrielli A, Sansonno D, Dammacco F, Libra M, Sacchi N, Talamini R, Spina M, Cannizzaro R, Guidoboni M, Dolcetti R. Genetic insights into the disease mechanisms of type II mixed cryoglobulinemia induced by hepatitis C virus. Digestive and Liver Disease. 2007;39(1):65–71. doi: 10.1016/s1590-8658(07)80014-4. [DOI] [PubMed] [Google Scholar]
  11. Wazir I, Wazir F, Javed M, Saeed M, Najeeb-ul-Haq Khan H. Effect of vitamin "e" supplements in herapy of chronic hepatitis c: a histological study. Gomal Journal of Medical Sciences. 2008;6(2):81–86. [Google Scholar]
  12. Jia Y, Wei L, Jiang D, Wang J, Cong X, Fei R. Antiviral action of interferon-α against hepatitis C virus replicon and its modification by interferon-γ and interleukin-8. J Gastroenterology and Hepatology. 2007;22:1278–1285. doi: 10.1111/j.1440-1746.2007.04957.x. [DOI] [PubMed] [Google Scholar]
  13. Yoon JE, Hu K. Hepatitis C Virus (HCV) Infection and Hepatic Steatosis. Int J Med Sci. 2006;3(2):53–56. doi: 10.7150/ijms.3.53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Minakari M, Sameni KF, Shalmani MH, Molaee M, Zali M. Hepatic Steatosis in Iranian Patients with Chronic Hepatitis C. Med Princ Pract. 2008;17:126–130. doi: 10.1159/000112966. [DOI] [PubMed] [Google Scholar]
  15. Zubair A, Jamal S, Mubarik A. Morphometric Analysis of Hepatic Steatosis in Chronic Hepatitis C Infection. The Saudi Journal of Gastroenterology. 2009;15(1):11–4. doi: 10.4103/1319-3767.45047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Ali Amjad, Habib Ahmad, Ali Ijaz, Sheema Khan, Gulshan Zaidi, Muhammad Idrees. Prevalence of active hepatitis c virus infection in district mansehra Pakistan. Virology Journal. 2010;7:334–338. doi: 10.1186/1743-422X-7-334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Ramia S, Eid-Fares J. Distribution of hepatitis C virus genotypes in the Middle East. International Journal of Infectious Diseases. 2006;10:272–277. doi: 10.1016/j.ijid.2005.07.008. [DOI] [PubMed] [Google Scholar]
  18. Ali Amjad, Habib Ahmed, Muhammad Idrees. Molecular epidemiology of Hepatitis C virus genotypes in Khyber Pakhtoonkhaw of Pakistan. Virology Journal. 2010;7:203–209. doi: 10.1186/1743-422X-7-203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Idrees M, Riazuddin S. Frequency distribution of hepatitis C virus genotypes in different geographical regions of Pakistan and their possible routes of transmission. BMC Infectious Diseases. 2008;8(69):1–9. doi: 10.1186/1471-2334-8-69. 69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Lauer G, Walker B. Hepatitis C V Irus Infection. N Engl J Med. 2001;345(1):41–52. doi: 10.1056/NEJM200107053450107. [DOI] [PubMed] [Google Scholar]
  21. Jawaid A, Khuwaja KA. Treatment and Vaccination for Hepatitis C: Present and Future. J Ayub Med Coll. 2008;20(1):129–133. Abbottabad. [PubMed] [Google Scholar]
  22. Pan Q, Henry DS, Metselaar JH, Scholte B, Kwekkeboom J, Tilanus WH, Janssen ALH, Laan WJL. Combined antiviral activity of interferon-α and RNA interference directed against epatitis without affecting vector delivery and gene silencing. J Mol Med. 2009;87:713–722. doi: 10.1007/s00109-009-0470-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gozlan J, Lacombe K, Gault E, Raguin G, Girard P. Complete cure of HBV-HDV co-infection after 24 weeks of combination therapy with pegylated interferon and ribavirin in a patient co-infected with HBV/HCV/HDV/HIVq. Journal of Hepatology. 2009;50:432–434. doi: 10.1016/j.jhep.2008.05.029. [DOI] [PubMed] [Google Scholar]
  24. Tan Lei, Lu Huijun, Dan Zhang, Mingyao Tian, Hu Bo, Wang Zhuoyue, Jin Ningyi. Protection against H1N1 influenza challenge by a DNA vaccine expressing H3/H1 subtype hemagglutinin combined with MHC class II-restricted epitopes. Virology Journal. 2010;7:363–376. doi: 10.1186/1743-422X-7-363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Reche AP, Reinherz LE. PEPVAC: a web server for multi-epitope vaccine development based on the prediction of supertypic MHC ligands. Nucleic Acids Research. 2005;33:138–142. doi: 10.1093/nar/gki357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Zhang LG, Khan MA, Srinivasan NK, August TJ, Brusic V. MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides. Nucleic Acids Research. 2005;33:172–179. doi: 10.1093/nar/gki452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Robinson LH, Amara RR. T cell vaccines for microbial infections. Nature Medicine Suppliment. 2005;11(4):25–32. doi: 10.1038/nm1212. [DOI] [PubMed] [Google Scholar]
  28. Groot DSA, Sbai H, Aubin SC, Mcmurry J, Martin W. Immuno-informatics: Mining genomes for vaccine components. Immunology and Cell Biology. 2002;80:255–269. doi: 10.1046/j.1440-1711.2002.01092.x. [DOI] [PubMed] [Google Scholar]
  29. Lund O, Nielsen M, Kesmir C, Petersen GA, Lundegaard C, Worning P, Sylvester-Hvid C, Lamberth K, Røder G, Justesen S, Buus S, Brunak S. Definition of supertypes for HLA molecules using clustering of specificity matrices. Immunogenetics. 2004;55(12):797–810. doi: 10.1007/s00251-004-0647-4. [DOI] [PubMed] [Google Scholar]
  30. Singh H, Raghava SPG. ProPred1: prediction of promiscuous MHC Class-I binding sites. Bioinformatics. 2003;19(8):1009–1014. doi: 10.1093/bioinformatics/btg108. [DOI] [PubMed] [Google Scholar]
  31. Somvanshi P, Singh V, Seth KP. In Silico Prediction of Epitopes in Virulence Proteins of Mycobacterium Tuberculosis H37Rv for Diagnostic and Subunit Vaccine Design. Journal of Proteomics & Bioinformatics. 2008;1(3):143–153. [Google Scholar]
  32. Kyte J, Doolittle FR. A simple method for displaying the hydropathic character of a protein. Journal of Molecular Biology. 1982;157(1):105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
  33. Khan MA, Miotto O, Heiny TA, Salmon J, Srinivasan NK, Nascimento MJE, Marques JETA, Brusic V, Tan WT, August TJ. A systematic bioinformatics approach for selection of epitope-based vaccine targets. Cellular Immunology. 2006;244:141–147. doi: 10.1016/j.cellimm.2007.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Virology Journal are provided here courtesy of BMC

RESOURCES