PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides

Ning Li; Juanjuan Kang; Lixu Jiang; Bifang He; Hao Lin; Jian Huang

doi:10.1155/2017/5761517

. 2017 Dec 27;2017:5761517. doi: 10.1155/2017/5761517

PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides

Ning Li ¹, Juanjuan Kang ¹, Lixu Jiang ¹, Bifang He ¹, Hao Lin ^1,², Jian Huang ^1,^2,^✉

PMCID: PMC5763211 PMID: 29445741

Abstract

Polystyrene surface-binding peptides (PSBPs) are useful as affinity tags to build a highly effective ELISA system. However, they are also a quite common type of target-unrelated peptides (TUPs) in the panning of phage-displayed random peptide library. As TUP, PSBP will mislead the analysis of panning results if not identified. Therefore, it is necessary to find a way to quickly and easily foretell if a peptide is likely to be a PSBP or not. In this paper, we describe PSBinder, a predictor based on SVM. To our knowledge, it is the first web server for predicting PSBP. The SVM model was built with the feature of optimized dipeptide composition and 87.02% (MCC = 0.74; AUC = 0.91) of peptides were correctly classified by fivefold cross-validation. PSBinder can be used to exclude highly possible PSBP from biopanning results or to find novel candidates for polystyrene affinity tags. Either way, it is valuable for biotechnology community.

1. Introduction

Phage display is a versatile and powerful technology to find ligands for any given target [1–3]. These targets can be a wide variety of substances, such as small molecules, proteins, glycan, cells, organs, and even whole organisms. In traditional phage display experiments, the 96-well plates or microplates are commonly used. Therefore, ligands which bind to polystyrene surface (PS) can appear in the biopanning results unintentionally. On one hand, a high affinity polystyrene surface-binding peptide (PSBP) can help to build a highly effective ELISA system and immobilize proteins or antibodies directly onto the polystyrene plates with minimal conformational changes [4–8]. On the other hand, PSBPs as the target-unrelated peptides (TUPs) are false positive results and may mislead the following experiments [9]. Therefore, it is important to identify if a peptide is likely to be a PSBP in the biopanning results as either the intended peptide or just a TUP.

It is not difficult to identify a PSBP experimentally [9]. However, experimental methods are not economical when dealing with a large quantity of peptides. To save money and time, computational methods for the prediction of PSBP are urgently needed. The machine learning-based approaches have been proved to be quite powerful in dealing with protein and peptide classification problems [10–13]. In this paper, we have proposed a novel PSBP predictor based on support vector machine (SVM) named PSBinder. It can be used to exclude the false positive peptides rapidly and effectively and obtain truly interesting peptides more accurately.

2. Materials and Methods

2.1. Datasets

We collected the training data from the BDB database released in Jan 2017, which is an information portal to biopanning data [14–16]. The training datasets consisted of the positive and negative datasets. As positive data, the PSBPs were collected from nine different phage display libraries. In order to ensure the comparability between the positive and the negative data, we randomly chose peptides obtained by panning against the same library with targets other than PS. For some libraries that do not have enough number of negative peptides, we collected the peptides in the same length from other libraries as an alternative.

The cysteine amino acids at both ends of the circular peptides were deleted. All peptides harboring ambiguous residues (“B”, “J”, “O”, “U”, “X,” and “Z”) or nonalphabetic characters were excluded. We compared each sequence in the negative dataset with the one in the positive dataset and deleted the identical sequences in negative dataset and replenished the peptides. To exclude possible PSBP crept in the negative data, we used the Generalized Jaccard similarity to keep the peptide sequence similarity of positive and negative data below 90% [17]. Eventually we constructed the negative and positive datasets and each had 104 peptides [4, 18–25]. The whole training dataset is freely available as supplementary online material (available here).

2.2. Features and Feature Selection

Extracting the rational features is an extremely significant step in constructing a well-behaved prediction model [26, 27]. Several kinds of typical features, such as single amino acid compositions (AACs) and dipeptide compositions (DPCs), amino acid physicochemical properties, and the pseudo-amino-acid composition, are widely used in developing classifiers for protein and peptide prediction. The classifiers based on these features have shown excellent performance [10, 28–32].

It is a wise method to count the amino acid frequencies of protein sequences to express the feature of protein sequences. We can distinguish different types of protein through the difference in the frequency distribution of amino acids between sequences. And this is also applicable for peptide sequences; we chose the AACs as the feature. In order to compensate for the lack of intrinsic link of the amino acid, we also import the DPCs. A peptide sequence can be composed of 20 amino acids (ACDEFGHIKLMNPQRSTVWY) at random in each position, so a peptide that contains L amino acids could be expressed as

\begin{matrix} β = (β_{1}, β_{2}, \dots, β_{L}) . \end{matrix}

(1)

β ₁, β₂, and β_L represent the first, the second, and the Lth amino acid of the peptide sequence β. And the definition of AAC and DPC is as follows:

\begin{matrix} A A C (i) = \frac{x_{i}}{\sum_{i = 1}^{20} x_{i}} \\ D P C (j) = \frac{y_{j}}{\sum_{j = 1}^{400} y_{j}}, \end{matrix}

(2)

where i stands for one of the 20 amino acids and j one of the 400 dipeptides. x_i denotes the number of residues of each type and y_j represents the number of dipeptides of each type in each sequence.

In order to build a prediction model with high efficiency, AAC and DPC were further screened to drop the irrelevant, redundant, and noisy features through fselect.py script supported by LIBSVM3.22 [33]. Feature selection was performed as follows. The feature was put into an initially null set in descending order by accuracy one by one and the accuracy of each set was calculated when an element was added in. When the prediction accuracy reached the highest value, we chose the set as the optimal feature subset. After the above procedures, we finally acquired the optimized AAC (OAAC) and the optimized DPC (ODPC).

2.3. Support Vector Machine

In machine learning methods, the support vector machine is a supervised learning model algorithm for regression analysis and prediction of data. The SVM has gained increasing popularity and also been extensively used in the field of bioinformatics [34–37]. We applied SVM to the analysis and prediction of PSBP. The SVM model was developed by using LIBSVM3.22 [33], which is an integrated software for support vector classification. The best error factor c and the kernel function variance g needed to build the model can be found by the software's built-in python script grid.py. In order to visualize the prediction results, the parameter b is set to 1 in the process of model training.

2.4. Prediction Assessment

N-fold cross-validation is often used to evaluate the predictive performance of statistical predictive models. The advantage of the N-fold cross-validation method is the simultaneous and repetitive use of randomly generated subsamples for training and verification. In this work, all established models were evaluated by using fivefold cross-validation, where the entire dataset was randomly divided into five groups, each containing an equal number of peptides. Four groups were used for training and the remaining one was used for testing. This process would be repeated five times. In such a way, each group was used as the test group once. Eventually the average prediction accuracy of five kinds of combination was calculated as the final accuracy of one model.

To evaluate the performance of the prediction models, we used four indicators: sensitivity (Sn), specificity (Sp), accuracy (Acc), and Matthews correlation coefficient (MCC).

\begin{matrix} S n = \frac{T P}{T P + F N}, \\ S p = \frac{T N}{T N + F P}, \\ A c c = \frac{T P + T N}{T P + F N + F P + T N}, \\ M C C \\ = \frac{T P \times T N - F P \times F N}{\sqrt{(T P + F P) (T P + F N) (T N + F P) (T N + F N)}} . \end{matrix}

(3)

In the above formulas, TP and TN represent the number of correctly predicted PSBPs and non-PSBPs, respectively and FP and FN represent the number of wrongly predicted PSBPs and non-PSBPs, respectively. MCC is one of the most robust parameters in any class predictive approach. A MCC equal to 1 is deemed to be the best prediction, whereas 0 is for a completely random prediction and −1 is an absolutely adverse prediction. In addition, the competence of the model is illustrated with the Receiver Operating Characteristic (ROC) curve. The area under the ROC curve (AUC) is used as the performance measure. For a perfect prediction, the maximum value of the AUC equals 1.0. For a random guess, the AUC equals 0.5.

2.5. Online Web Service

We used Perl to write the common gateway interface script for the web service. The feature extraction script was written by Python. The web service allows user to submit peptide sequences in FASTA format or as plain text. The result will be returned and displayed in a table after prediction.

3. Results

3.1. The Establishment of Prediction Model and Performance Evaluation

In this study, the positive dataset contains 104 peptide sequences, and the negative dataset is composed of 104 peptide sequences with the same length and almost the same source to the corresponding positive peptides. According to formula (2), each sequence of 420 features can be calculated. By filtering these redundant and high dimensional features, we finally obtained 9 OAAC and 146 ODPC. The model built with ODPC attains the maximum accuracy of 87.02% and an impressive MCC of about 0.74 (Table 1). These indicators show the excellent performance and strong generalization ability of the predictor.

Table 1.

Performances of SVM-based models trained with different features.

Feature	Sn (%)	Sp (%)	Acc (%)	MCC
Optimized amino acid composition (OAAC)	66.35	79.81	73.08	0.47
Optimized dipeptide composition (ODPC)	88.46	85.58	87.02	0.74

Open in a new tab

To more intuitively illustrate the efficiency of the predictor, we also used the ROC curve to graphically describe the performance of the predictor. Figure 1 is the ROC curve of the predictor constructed by the ODPC. The abscissa of the graph represents the false positive rate of the prediction model and the ordinate of the graph represents the true positive rate. In a rational situation, we expect a true positive rate equal to 1 and false positive rate equal to 0 and at this time the AUC is 1. The AUC area of our predictor is as high as 0.91, which demonstrates that the predictive performance of our predictor is pretty good.

The ROC curve graph of the prediction model based on ODPC.

3.2. Comparison with Other Machine Learning Methods

In order to prove that the prediction model based on SVM is better than the prediction model based on other machine learning methods, we used the ODPC to build predictive models based on Naive Bayes, Logistic Function, Random Forest, LibD3C [38], and Decision Tree J48, respectively, [39]. As the fivefold cross-validation results shown in Table 2, the average accuracy of the SVM model is approximately 3.82%, 5.95%, 9.12%, 11.06%, and 25.97% higher than that of Naive Bayes, Logistic Function, Random Forest, LibD3C, and Decision Tree J48 classifiers, respectively. This indicates a better performance of our SVM-based model.

Table 2.

The prediction performances of various machine learning methods.

Machine learning methods	Sn (%)	Sp (%)	Acc (%)	MCC
Support vector machine	88.46	85.58	87.02	0.74
Naive Bayes	83.70	82.70	83.20	0.66
Logistic Function	76.90	86.50	81.70	0.64
Random Forest	73.10	82.70	77.90	0.56
LibD3C	78.72	73.68	75.96	0.52
Decision Tree J48	48.10	74.00	61.05	0.23

Open in a new tab

3.3. Online Web Service

In order to facilitate its usage among relevant researchers, we integrated this tool with SAROTUP, which has been developed into a suite of web tools for identifying or predicting target-unrelated peptides. Users can directly access the PSBinder and get results at http://i.uestc.edu.cn/sarotup/cgi-bin/PSBinder.pl.

4. Discussion

In the published papers, the PS-binding motifs such as WXXW [19], FHXXW [21], and WXXWXXXW [23] had been found in many PSBPs. However, there are many PSBPs that do not have the typical motifs [23]. There are no tools capable of rationally predicting PSBP when peptides bear no such motifs. PSBinder was modeled by the dipeptide features, which successfully responds to these situations.

Our model was built with 146 features. The top three features are WG, WF, and WE. According to the analysis of amino acid composition, we found that the most frequently occurring amino acids were W, Y, and F. It indicates that the hydrophobic amino acids with the benzene ring may play an important role in binding polystyrene. And all the hydrophobic amino acids appear in our features. Thus, when a peptide has the amino acids with the benzene ring and is accompanied by many hydrophobic amino acids, it may be a PSBP.

In addition, after the completion of our predictor, a paper published very recently reported a PSBP with the sequence of VHWDFRQWWQPS [40]. As the paper reported, this sequence does not have typical PS-binding motifs. Since this peptide is not seen in the training datasets, we used it as an independent case test. PSBinder predicted this peptide as a PSBP (the probability is about 0.88), which agreed with the experimental result.

5. Conclusions

In this paper, we developed a predictor based on SVM to detect if a peptide is a PSBP. The model constructed by optimized dipeptide features had a good performance. The maximum accuracy of 87.02% was achieved with 0.74 MCC, 88.46% sensitivity, and 85.58% specificity, respectively. In addition, in order to facilitate its usage, the SVM-based model was implemented into an online web service called PSBinder. It is practical and freely available at http://i.uestc.edu.cn/sarotup/cgi-bin/PSBinder.pl. PSBinder would be a useful tool to predict PSBPs, whether as TUPs or intended peptides. It will help to speed up the experiment process and facilitate the development of biological products.

Acknowledgments

This work was supported by the National Natural Science Foundation of China [61571095] and the Fundamental Research Funds for the Central Universities of China [ZYGX2015Z006]. Thanks are due to Dr. Ratmir Derda for his outstanding contributions to the correction of the paper.

Disclosure

Jian Huang is the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Authors' Contributions

Jian Huang and Hao Lin conceived and designed the study, Ning Li performed the study, Ning Li and Juanjuan Kang analyzed the data, Lixu Jiang and Bifang He constructed the datasets, and Ning Li wrote the paper.

Supplementary Materials

The supplementary dataset consists of the positive and negative datasets. Each dataset contains 104 peptides which were collected from the BDB database [16]. The “positive.txt” is the positive dataset; the “negative.txt” is the negative dataset.

Click here for additional data file.^{(2.7KB, zip)}

References

1.He B., Mao C., Ru B., Han H., Zhou P., Huang J. Epitope mapping of metuximab on CD147 using phage display and molecular docking. Computational and Mathematical Methods in Medicine. 2013;2013:6. doi: 10.1155/2013/983829.983829 [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Huang J., Ru B., Dai P. Bioinformatics resources and tools for phage display. Molecules. 2011;16(1):694–709. doi: 10.3390/molecules16010694. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Zhang Y., He B., Liu K., et al. A novel peptide specifically binding to VEGF receptor suppresses angiogenesis in vitro and in vivo. Signal Transduction and Targeted Therapy. 2017;2:p. 17010. doi: 10.1038/sigtrans.2017.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Feng B., Dai Y., Wang L., Tao N., Huang S., Zeng H. A novel affinity ligand for polystyrene surface from a phage display random library and its application in anti-HIV-1 ELISA system. Biologicals. 2009;37(1):48–54. doi: 10.1016/j.biologicals.2008.10.002. [DOI] [PubMed] [Google Scholar]
5.Kumada Y., Hamasaki K., Shiritani Y., Ohse T., Kishimoto M. Efficient immobilization of a ligand antibody with high antigen-binding activity by use of a polystyrene-binding peptide and an intelligent microtiter plate. Journal of Biotechnology. 2009;142(2):135–141. doi: 10.1016/j.jbiotec.2009.03.011. [DOI] [PubMed] [Google Scholar]
6.Niveleau A., Bruno C., Drouet E., Brebant R., Sergeant A., Troalen F. Grafting peptides onto polystyrene microplates for ELISA. Journal of Immunological Methods. 1995;182(2):227–234. doi: 10.1016/0022-1759(95)00053-D. [DOI] [PubMed] [Google Scholar]
7.Tang J.-B., Sun X.-F., Yang H.-M., et al. Well-oriented ZZ-PS-tag with high Fc-binding onto polystyrene surface for controlled immobilization of capture antibodies. Analytica Chimica Acta. 2013;776:74–78. doi: 10.1016/j.aca.2013.03.017. [DOI] [PubMed] [Google Scholar]
8.Kumada Y., Hamasaki K., Shiritani Y., et al. Direct immobilization of functional single-chain variable fragment antibodies (scFvs) onto a polystyrene plate by genetic fusion of a polystyrene-binding peptide (PS-tag) Analytical and Bioanalytical Chemistry. 2009;395(3):759–765. doi: 10.1007/s00216-009-2999-y. [DOI] [PubMed] [Google Scholar]
9.Bakhshinejad B., Sadeghizadeh M. A polystyrene binding target-unrelated peptide isolated in the screening of phage display library. Analytical Biochemistry. 2016;512:120–128. doi: 10.1016/j.ab.2016.08.013. [DOI] [PubMed] [Google Scholar]
10.He B., Kang J., Ru B., Ding H., Zhou P., Huang J. SABinder: A Web Service for Predicting Streptavidin-Binding Peptides. BioMed Research International. 2016;2016:8. doi: 10.1155/2016/9175143.9175143 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Song L., Li D., Zeng X., Wu Y., Guo L., Zou Q. nDNA-prot: identification of DNA-binding proteins based on unbalanced classification. BMC Bioinformatics. 2014;15(1, article 298) doi: 10.1186/1471-2105-15-298. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Feng P. M., Chen W., Lin H., Chou K. iHSP-PseRAAAC: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition. Analytical Biochemistry. 2013;442(1):118–125. doi: 10.1016/j.ab.2013.05.024. [DOI] [PubMed] [Google Scholar]
13.Tang H., Zou P., Zhang C., Chen R., Chen W., Lin H. Identification of apolipoprotein using feature selection technique. Scientific Reports. 2016;6 doi: 10.1038/srep30441.30441 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Ru B., Huang J., Dai P., et al. MimoDB: a new repository for mimotope data derived from phage display technology. Molecules. 2010;15(11):8279–8288. doi: 10.3390/molecules15118279. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Huang J., Ru B., Zhu P., et al. MimoDB 2.0: a mimotope database and beyond. Nucleic Acids Research. 2012;40(1):D271–D277. doi: 10.1093/nar/gkr922. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.He B., Chai G., Duan Y., et al. BDB: biopanning data bank. Nucleic Acids Research. 2016;44:D1127–D1132. doi: 10.1093/nar/gkv1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Pan L., Lei Y., Wang C., Xie J. Method on entity identification using similarity measure based on weight of Jaccard. Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University. 2009;33(6):141–145. [Google Scholar]
18.Kenan D. J., Walsh E. B., Meyers S. R., et al. Peptide-PEG Amphiphiles as Cytophobic Coatings for Mammalian and Bacterial Cells. Chemistry & Biology. 2006;13(7):695–700. doi: 10.1016/j.chembiol.2006.06.013. [DOI] [PubMed] [Google Scholar]
19.Adey N. B., Mataragnon A. H., Rider J. E., Carter J. M., Kay B. K. Characterization of phage that bind plastic from phage-displayed random peptide libraries. Gene. 1995;156(1):27–31. doi: 10.1016/0378-1119(95)00058-E. [DOI] [PubMed] [Google Scholar]
20.Sakiyama T., Ueno S., Imamura K., Nakanishi K. Use of a novel affinity tag selected with a bacterial random peptide library for improving activity retention of glutathione S-transferase adsorbed on a polystyrene surface. Journal of Molecular Catalysis B: Enzymatic. 2004;28(4-6):207–214. doi: 10.1016/j.molcatb.2003.12.019. [DOI] [Google Scholar]
21.Anni H., Nikolaeva O., Israel Y. Selection of phage-display library peptides recognizing ethanol targets on proteins. Alcohol. 2001;25(3):201–209. doi: 10.1016/S0741-8329(01)00164-1. [DOI] [PubMed] [Google Scholar]
22.Menendez A., Scott J. K. The nature of target-unrelated peptides recovered in the screening of phage-displayed random peptide libraries with antibodies. Analytical Biochemistry. 2005;336(2):145–157. doi: 10.1016/j.ab.2004.09.048. [DOI] [PubMed] [Google Scholar]
23.Gebhardt K., Lauvrak V., Babaie E., Eijsink V., Lindqvist B. H. Adhesive peptides selected by phage display: characterization, applications and similarities with fibrinogen. Peptide Research. 1996;9(6):269–278. [PubMed] [Google Scholar]
24.Serizawa T., Techawanitchai P., Matsuno H. Isolation of peptides that can recognize syndiotactic polystyrene. ChemBioChem. 2007;8(9):989–993. doi: 10.1002/cbic.200700133. [DOI] [PubMed] [Google Scholar]
25.Kumada Y., Tokunaga Y., Imanaka H., et al. Screening and characterization of affinity peptide tags specific to polystyrene supports for the orientated immobilization of proteins. Biotechnology Progress. 2006;22(2):401–405. doi: 10.1021/bp050331l. [DOI] [PubMed] [Google Scholar]
26.Ru B., 'T Hoen P. A. C., Nie F., Lin H., Guo F.-B., Huang J. PhD7Faster: predicting clones propagating faster from the Ph.D.-7 phage display peptide library. Journal of Bioinformatics and Computational Biology. 2014;12(1) doi: 10.1142/S021972001450005X.1450005 [DOI] [PubMed] [Google Scholar]
27.Huang J., Ru B., Li S., Lin H., Guo F.-B. SAROTUP: scanner and reporter of target-unrelated peptides. Journal of Biomedicine and Biotechnology. 2010;2010:7. doi: 10.1155/2010/101932.101932 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Lin C., Zou Y., Qin J., et al. Hierarchical classification of protein folds using a novel ensemble classifier. PLoS ONE. 2013;8(2) doi: 10.1371/journal.pone.0056499.e56499 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Chou K.-C. Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins: Structure, Function, and Genetics. 2001;43(3):246–255. doi: 10.1002/prot.1035. [DOI] [PubMed] [Google Scholar]
30.Wei L., Xing P., Su R., Shi G., Ma Z. S., Zou Q. CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency. Journal of Proteome Research. 2017;16(5):2044–2053. doi: 10.1021/acs.jproteome.7b00019. [DOI] [PubMed] [Google Scholar]
31.Fan G. L., Li Q. Z. Discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of Chou's pseudo amino acid composition. Journal of Theoretical Biology. 2013;334:45–51. doi: 10.1016/j.jtbi.2013.06.003. [DOI] [PubMed] [Google Scholar]
32.Fan G., Li Q. Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou's pseudo amino acid composition. Journal of Theoretical Biology. 2012;304:88–95. doi: 10.1016/j.jtbi.2012.03.017. [DOI] [PubMed] [Google Scholar]
33.Chang C., Lin C. LIBSVM: a Library for support vector machines. ACM Transactions on Intelligent Systems and Technology. 2011;2(3, article 27) doi: 10.1145/1961189.1961199. [DOI] [Google Scholar]
34.Guo S. H., Deng E. Z., Xu L. Q. iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition. Bioinformatics. 2014;30(11):1522–1529. doi: 10.1093/bioinformatics/btu083. [DOI] [PubMed] [Google Scholar]
35.Lin H., Deng E., Ding H., Chen W., Chou K. iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition. Nucleic Acids Research. 2014;42(21):12961–12972. doi: 10.1093/nar/gku1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Chen W., Feng P., Ding H., Lin H., Chou K.-C. IRNA-Methyl: Identifying N6-methyladenosine sites using pseudo nucleotide composition. Analytical Biochemistry. 2015;490:26–33. doi: 10.1016/j.ab.2015.08.021. [DOI] [PubMed] [Google Scholar]
37.Liu B., Wang X., Zou Q., Dong Q., Chen Q. Protein remote homology detection by combining Chou's pseudo amino acid composition and profile-based protein representation. Molecular Informatics. 2013;32(9-10):775–782. doi: 10.1002/minf.201300084. [DOI] [PubMed] [Google Scholar]
38.Lin C., Chen W., Qiu C., Wu Y., Krishnan S., Zou Q. LibD3C: ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing. 2014;123:424–435. doi: 10.1016/j.neucom.2013.08.004. [DOI] [Google Scholar]
39.Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I. H. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter. 2009;11(1):10–18. doi: 10.1145/1656274.1656278. [DOI] [Google Scholar]
40.Qiang X., Sun K., Xing L., et al. Discovery of a polystyrene binding peptide isolated from phage display library and its application in peptide immobilization. Scientific Reports. 2017;7(1):p. 2673. doi: 10.1038/s41598-017-02891-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Click here for additional data file.^{(2.7KB, zip)}

[B1] 1.He B., Mao C., Ru B., Han H., Zhou P., Huang J. Epitope mapping of metuximab on CD147 using phage display and molecular docking. Computational and Mathematical Methods in Medicine. 2013;2013:6. doi: 10.1155/2013/983829.983829 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] 2.Huang J., Ru B., Dai P. Bioinformatics resources and tools for phage display. Molecules. 2011;16(1):694–709. doi: 10.3390/molecules16010694. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3.Zhang Y., He B., Liu K., et al. A novel peptide specifically binding to VEGF receptor suppresses angiogenesis in vitro and in vivo. Signal Transduction and Targeted Therapy. 2017;2:p. 17010. doi: 10.1038/sigtrans.2017.10. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] 4.Feng B., Dai Y., Wang L., Tao N., Huang S., Zeng H. A novel affinity ligand for polystyrene surface from a phage display random library and its application in anti-HIV-1 ELISA system. Biologicals. 2009;37(1):48–54. doi: 10.1016/j.biologicals.2008.10.002. [DOI] [PubMed] [Google Scholar]

[B5] 5.Kumada Y., Hamasaki K., Shiritani Y., Ohse T., Kishimoto M. Efficient immobilization of a ligand antibody with high antigen-binding activity by use of a polystyrene-binding peptide and an intelligent microtiter plate. Journal of Biotechnology. 2009;142(2):135–141. doi: 10.1016/j.jbiotec.2009.03.011. [DOI] [PubMed] [Google Scholar]

[B6] 6.Niveleau A., Bruno C., Drouet E., Brebant R., Sergeant A., Troalen F. Grafting peptides onto polystyrene microplates for ELISA. Journal of Immunological Methods. 1995;182(2):227–234. doi: 10.1016/0022-1759(95)00053-D. [DOI] [PubMed] [Google Scholar]

[B7] 7.Tang J.-B., Sun X.-F., Yang H.-M., et al. Well-oriented ZZ-PS-tag with high Fc-binding onto polystyrene surface for controlled immobilization of capture antibodies. Analytica Chimica Acta. 2013;776:74–78. doi: 10.1016/j.aca.2013.03.017. [DOI] [PubMed] [Google Scholar]

[B8] 8.Kumada Y., Hamasaki K., Shiritani Y., et al. Direct immobilization of functional single-chain variable fragment antibodies (scFvs) onto a polystyrene plate by genetic fusion of a polystyrene-binding peptide (PS-tag) Analytical and Bioanalytical Chemistry. 2009;395(3):759–765. doi: 10.1007/s00216-009-2999-y. [DOI] [PubMed] [Google Scholar]

[B9] 9.Bakhshinejad B., Sadeghizadeh M. A polystyrene binding target-unrelated peptide isolated in the screening of phage display library. Analytical Biochemistry. 2016;512:120–128. doi: 10.1016/j.ab.2016.08.013. [DOI] [PubMed] [Google Scholar]

[B10] 10.He B., Kang J., Ru B., Ding H., Zhou P., Huang J. SABinder: A Web Service for Predicting Streptavidin-Binding Peptides. BioMed Research International. 2016;2016:8. doi: 10.1155/2016/9175143.9175143 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11.Song L., Li D., Zeng X., Wu Y., Guo L., Zou Q. nDNA-prot: identification of DNA-binding proteins based on unbalanced classification. BMC Bioinformatics. 2014;15(1, article 298) doi: 10.1186/1471-2105-15-298. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] 12.Feng P. M., Chen W., Lin H., Chou K. iHSP-PseRAAAC: identifying the heat shock protein families using pseudo reduced amino acid alphabet composition. Analytical Biochemistry. 2013;442(1):118–125. doi: 10.1016/j.ab.2013.05.024. [DOI] [PubMed] [Google Scholar]

[B13] 13.Tang H., Zou P., Zhang C., Chen R., Chen W., Lin H. Identification of apolipoprotein using feature selection technique. Scientific Reports. 2016;6 doi: 10.1038/srep30441.30441 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] 14.Ru B., Huang J., Dai P., et al. MimoDB: a new repository for mimotope data derived from phage display technology. Molecules. 2010;15(11):8279–8288. doi: 10.3390/molecules15118279. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15.Huang J., Ru B., Zhu P., et al. MimoDB 2.0: a mimotope database and beyond. Nucleic Acids Research. 2012;40(1):D271–D277. doi: 10.1093/nar/gkr922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16.He B., Chai G., Duan Y., et al. BDB: biopanning data bank. Nucleic Acids Research. 2016;44:D1127–D1132. doi: 10.1093/nar/gkv1100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] 17.Pan L., Lei Y., Wang C., Xie J. Method on entity identification using similarity measure based on weight of Jaccard. Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University. 2009;33(6):141–145. [Google Scholar]

[B18] 18.Kenan D. J., Walsh E. B., Meyers S. R., et al. Peptide-PEG Amphiphiles as Cytophobic Coatings for Mammalian and Bacterial Cells. Chemistry & Biology. 2006;13(7):695–700. doi: 10.1016/j.chembiol.2006.06.013. [DOI] [PubMed] [Google Scholar]

[B19] 19.Adey N. B., Mataragnon A. H., Rider J. E., Carter J. M., Kay B. K. Characterization of phage that bind plastic from phage-displayed random peptide libraries. Gene. 1995;156(1):27–31. doi: 10.1016/0378-1119(95)00058-E. [DOI] [PubMed] [Google Scholar]

[B20] 20.Sakiyama T., Ueno S., Imamura K., Nakanishi K. Use of a novel affinity tag selected with a bacterial random peptide library for improving activity retention of glutathione S-transferase adsorbed on a polystyrene surface. Journal of Molecular Catalysis B: Enzymatic. 2004;28(4-6):207–214. doi: 10.1016/j.molcatb.2003.12.019. [DOI] [Google Scholar]

[B21] 21.Anni H., Nikolaeva O., Israel Y. Selection of phage-display library peptides recognizing ethanol targets on proteins. Alcohol. 2001;25(3):201–209. doi: 10.1016/S0741-8329(01)00164-1. [DOI] [PubMed] [Google Scholar]

[B22] 22.Menendez A., Scott J. K. The nature of target-unrelated peptides recovered in the screening of phage-displayed random peptide libraries with antibodies. Analytical Biochemistry. 2005;336(2):145–157. doi: 10.1016/j.ab.2004.09.048. [DOI] [PubMed] [Google Scholar]

[B23] 23.Gebhardt K., Lauvrak V., Babaie E., Eijsink V., Lindqvist B. H. Adhesive peptides selected by phage display: characterization, applications and similarities with fibrinogen. Peptide Research. 1996;9(6):269–278. [PubMed] [Google Scholar]

[B24] 24.Serizawa T., Techawanitchai P., Matsuno H. Isolation of peptides that can recognize syndiotactic polystyrene. ChemBioChem. 2007;8(9):989–993. doi: 10.1002/cbic.200700133. [DOI] [PubMed] [Google Scholar]

[B25] 25.Kumada Y., Tokunaga Y., Imanaka H., et al. Screening and characterization of affinity peptide tags specific to polystyrene supports for the orientated immobilization of proteins. Biotechnology Progress. 2006;22(2):401–405. doi: 10.1021/bp050331l. [DOI] [PubMed] [Google Scholar]

[B26] 26.Ru B., 'T Hoen P. A. C., Nie F., Lin H., Guo F.-B., Huang J. PhD7Faster: predicting clones propagating faster from the Ph.D.-7 phage display peptide library. Journal of Bioinformatics and Computational Biology. 2014;12(1) doi: 10.1142/S021972001450005X.1450005 [DOI] [PubMed] [Google Scholar]

[B27] 27.Huang J., Ru B., Li S., Lin H., Guo F.-B. SAROTUP: scanner and reporter of target-unrelated peptides. Journal of Biomedicine and Biotechnology. 2010;2010:7. doi: 10.1155/2010/101932.101932 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28.Lin C., Zou Y., Qin J., et al. Hierarchical classification of protein folds using a novel ensemble classifier. PLoS ONE. 2013;8(2) doi: 10.1371/journal.pone.0056499.e56499 [DOI] [PMC free article] [PubMed] [Google Scholar]

[B29] 29.Chou K.-C. Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins: Structure, Function, and Genetics. 2001;43(3):246–255. doi: 10.1002/prot.1035. [DOI] [PubMed] [Google Scholar]

[B30] 30.Wei L., Xing P., Su R., Shi G., Ma Z. S., Zou Q. CPPred-RF: A Sequence-based Predictor for Identifying Cell-Penetrating Peptides and Their Uptake Efficiency. Journal of Proteome Research. 2017;16(5):2044–2053. doi: 10.1021/acs.jproteome.7b00019. [DOI] [PubMed] [Google Scholar]

[B31] 31.Fan G. L., Li Q. Z. Discriminating bioluminescent proteins by incorporating average chemical shift and evolutionary information into the general form of Chou's pseudo amino acid composition. Journal of Theoretical Biology. 2013;334:45–51. doi: 10.1016/j.jtbi.2013.06.003. [DOI] [PubMed] [Google Scholar]

[B32] 32.Fan G., Li Q. Predict mycobacterial proteins subcellular locations by incorporating pseudo-average chemical shift into the general form of Chou's pseudo amino acid composition. Journal of Theoretical Biology. 2012;304:88–95. doi: 10.1016/j.jtbi.2012.03.017. [DOI] [PubMed] [Google Scholar]

[B33] 33.Chang C., Lin C. LIBSVM: a Library for support vector machines. ACM Transactions on Intelligent Systems and Technology. 2011;2(3, article 27) doi: 10.1145/1961189.1961199. [DOI] [Google Scholar]

[B34] 34.Guo S. H., Deng E. Z., Xu L. Q. iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition. Bioinformatics. 2014;30(11):1522–1529. doi: 10.1093/bioinformatics/btu083. [DOI] [PubMed] [Google Scholar]

[B35] 35.Lin H., Deng E., Ding H., Chen W., Chou K. iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition. Nucleic Acids Research. 2014;42(21):12961–12972. doi: 10.1093/nar/gku1019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] 36.Chen W., Feng P., Ding H., Lin H., Chou K.-C. IRNA-Methyl: Identifying N6-methyladenosine sites using pseudo nucleotide composition. Analytical Biochemistry. 2015;490:26–33. doi: 10.1016/j.ab.2015.08.021. [DOI] [PubMed] [Google Scholar]

[B37] 37.Liu B., Wang X., Zou Q., Dong Q., Chen Q. Protein remote homology detection by combining Chou's pseudo amino acid composition and profile-based protein representation. Molecular Informatics. 2013;32(9-10):775–782. doi: 10.1002/minf.201300084. [DOI] [PubMed] [Google Scholar]

[B38] 38.Lin C., Chen W., Qiu C., Wu Y., Krishnan S., Zou Q. LibD3C: ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing. 2014;123:424–435. doi: 10.1016/j.neucom.2013.08.004. [DOI] [Google Scholar]

[B39] 39.Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I. H. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsletter. 2009;11(1):10–18. doi: 10.1145/1656274.1656278. [DOI] [Google Scholar]

[B40] 40.Qiang X., Sun K., Xing L., et al. Discovery of a polystyrene binding peptide isolated from phage display library and its application in peptide immobilization. Scientific Reports. 2017;7(1):p. 2673. doi: 10.1038/s41598-017-02891-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides

Ning Li

Juanjuan Kang

Lixu Jiang

Bifang He

Hao Lin

Jian Huang

Abstract

1. Introduction

2. Materials and Methods

2.1. Datasets

2.2. Features and Feature Selection

2.3. Support Vector Machine

2.4. Prediction Assessment

2.5. Online Web Service

3. Results

3.1. The Establishment of Prediction Model and Performance Evaluation

Table 1.

Figure 1.

3.2. Comparison with Other Machine Learning Methods

Table 2.

3.3. Online Web Service

4. Discussion

5. Conclusions

Acknowledgments

Disclosure

Conflicts of Interest

Authors' Contributions

Supplementary Materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

PSBinder: A Web Service for Predicting Polystyrene Surface-Binding Peptides

Ning Li

Juanjuan Kang

Lixu Jiang

Bifang He

Hao Lin

Jian Huang

Abstract

1. Introduction

2. Materials and Methods

2.1. Datasets

2.2. Features and Feature Selection

2.3. Support Vector Machine

2.4. Prediction Assessment

2.5. Online Web Service

3. Results

3.1. The Establishment of Prediction Model and Performance Evaluation

Table 1.

Figure 1.

3.2. Comparison with Other Machine Learning Methods

Table 2.

3.3. Online Web Service

4. Discussion

5. Conclusions

Acknowledgments

Disclosure

Conflicts of Interest

Authors' Contributions

Supplementary Materials

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases