Abstract
In vivo phage display is widely used for identification of organ- or disease-specific homing peptides. However, the current in vivo phage biopanning approaches fail to assess biodistribution of specific peptide phages across tissues during the screen, thus necessitating laborious and time-consuming post-screening validation studies on individual peptide phages. Here, we adopted bioinformatics tools used for RNA sequencing for analysis of high-throughput sequencing (HTS) data to estimate the representation of individual peptides during biopanning in vivo. The data from in vivo phage screen were analyzed using differential binding—relative representation of each peptide in the target organ versus in a panel of control organs. Application of this approach in a model study using low-diversity peptide T7 phage library with spiked-in brain homing phage demonstrated brain-specific differential binding of brain homing phage and resulted in identification of novel lung- and brain-specific homing peptides. Our study provides a broadly applicable approach to streamline in vivo peptide phage biopanning and to increase its reproducibility and success rate.
Graphical Abstract
Graphical Abstract.
In vivo phage display using differential binding approach.
INTRODUCTION
The ability to present foreign peptides as fusions to the nucleocapsid proteins on the surface of bacteriophage was first described by Smith in 1985 (1). Since then, application of the engineered phages for de novo peptide ligand discovery has become an invaluable and widely applied tool (2). Pasqualini and Ruoslahti introduced in vivo phage display in 1996 (3) and adopted the method for identification of peptides that target tumor vasculature in 1998 (4). During in vivo biopanning, after short circulation (typically 10 min to 1 h), free phages and phages expressing weakly binding peptide sequences are removed by perfusion. Target tissues, e.g. xenograft tumors, or specific organs, are then collected and tissue-bound phages are rescued by amplification for use in subsequent round of selection (5).
A number of tumor-homing peptides have been identified using in vivo phage display. The power of in vivo display is particularly well illustrated by identification of a novel class of tumor-homing peptides, tumor penetrating peptides that use a complex multistep mechanism, very hard to envision to be designed rationally, for tumor-specific homing and vascular exit. The prototypic tumor penetrating peptide, iRGD (amino acid sequence: CRGDKGPDC) contains a RGD motif that binds the integrins and a cryptic R/KXXR/K C-end rule (CendR) motif that is activated by proteolytic cleavage (6). C-terminally exposed CendR motif binds to neuropilin-1 (NRP-1) to trigger endocytic transport and tumor penetration pathway named CendR pathway (7). The use of CendR pathway can improve transport conjugated and coadministered therapeutic compounds into tumors in vivo (8). iRGD peptide is clinically developed by CEND Therapeutics Inc. (La Jolla, CA, USA) in phase I clinical trial on pancreatic cancer patients as an add-on treatment in combination with Abraxane and gemcitabine (clinical trial identifier: NCT03517176). Other vascular-homing peptides cilengitide (sequence: RGDfV) (9), SFITGv6 (sequence: GRCTFRGDLMQLCYPD) (10), CNGRC (sequence: CNGRC) (11), tumor extracellular matrix (ECM) homing peptides DAG (sequence: CDAGRKQKC) (12), ZD2 (sequence: CTVRTSADC) (13), PL1 (sequence: PPRRGLIKLKTS) (14) and PL3 (sequence: AGRGRLVR) (15), IP3 (sequence: CKRDLSRRC) (16), CAR (sequence: CARSKNKDC) (17) and M2 tumor-associated macrophage targeting UNO peptide (18) have been selected for possible therapeutic use and reviewed recently (19). Moreover, healthy organs can be specifically targeted with homing peptides. Examples are brain homing peptide CAGALCY (20,21) and prototypic CendR peptide RPARPAR that upon intravenous administration accumulates in lungs and heart (7).
High-throughput sequencing (HTS) of peptide-encoding region of the phage genome can be used to gain insight into the evolution of the peptide landscape during biopanning. HTS has proven particularly useful for binding peptide identification during ‘low noise’ selections on purified individual target proteins (22). However, HTS-based identification of in vivo homing peptides remains challenging. One reason is that the peptides that accumulate in intended target tissue can also home to other sites, including healthy organs. Such off-target binding will only be noted when biodistribution of individual phages is assessed. Another complicating factor, especially when using in vivo phage display with lysogenic filamentous phages that rely on bacterial cellular machinery for export, is the enrichment of parasitic peptide sequences that increase phage fitness and selection against phages displaying peptides that reduce phage amplification. During M13 phage amplification, representation of different peptide phages in a pool can be skewed 10- to 100-fold (23). Two approaches can be used to reduce the amplification bias—either avoiding the amplification altogether or performing the amplification in a way that each phage has the same amplification ratio (22,24). Bioinformatics tools have been developed to prospectively identify parasitic peptide sequences, such as SAROTUP (25) and PhD7Faster (26).
In recent years, a number of studies have been published on HTS-based phage display. Hurwitz et al. developed motif search algorithm that breaks 12-mer peptide sequences into smaller fragments and aligns them with other sequenced fragments using MAFFT software (27). Another motif search algorithm for mapping consensus sequences within the phage display-derived HTS datasets was developed by Rebollo et al. (28). Biopanning Data Bank (BDB) is a large-scale effort to combine together datasets acquired from phage display experiments with >95 next-generation phage display datasets and biopanning data from 1540 published articles (29). PHage Analysis for Selective Targeted PEPtides (PHASTpep) has introduced HTS data analysis approach of phage pools resulting both from positive and negative selections. HTS data can be further sorted based on their abundance in each pool (30).
Here, we present a statistical analysis-based approach for HTS data mining to determine differential binding of peptide phages across multiple organs during in vivo biopanning. Our approach eliminates parasitic peptide sequences in the selected peptide phage pool and minimizes false leads, thereby reducing the number of individual peptide phages that need to be used for in vivo biodistribution studies.
MATERIALS AND METHODS
In vivo phage display
For in vivo biopanning, we used a model low-diversity (105) CX7C T7 phage peptide library with spiked brain-homing peptide (CAGALCY (20,21)) as a positive control. The library was purified by PEG8000 precipitation and filtered through 0.2 μm syringe filter. For first biopanning round, 100 μl of 5 × 109 plaque forming units (pfu) CX7C T7 library with spiked-in 5 × 104 pfu CAGALCY in phosphate-buffered saline (PBS) was injected intravenously (i.v.) in Balb/C mice. Copy number of CAGALCY phages, 5 × 104 pfu, was similar to that of individual peptide-phages in the CX7C library, as 105 different phages from naïve CX7C library were amplified and 5 × 109 phages were used per injection. The input diversity used is in the same range of a typical peptide phage library after ex vivo preselection step often included prior to in vivo biopanning. After 30 min, the animals were anesthetized and perfused with 10–15 ml of PBS containing 1% bovine serum albumin (BSA) (PBS–BSA). The target and control organs (brain, lung, liver, kidney, skeletal muscle) were collected, transferred in 1 ml of Lysogeny broth with 1% of Nonoxynol-40 ( LB–1% NP-40), and homogenized on ice using handheld homogenizer with replaceable plastic tips. Phages in tissue lysates were rescued and amplified in Escherichia coli strain BLT5403 in semi-solid media, purified by PEG8000 precipitation and filtered through 0.2 μm filter. For second round of selection, three brain samples or three lung samples from first round were mixed and 5 × 109 pfu in 100 μl were used for injection (n = 3 both for lung selection and brain selection) using the same procedure as in first cycle. Tissues were dissected and processed identically to first cycle. Final libraries were amplified, purified by PEG8000 precipitation and used for sequencing.
The animal procedures were approved by the Committee of Animal Experimentation, Estonian Ministry of Agriculture in accordance with Estonian legislation and Directive 2010/63/EU of the European Parliament and of the Council of 22 September 2010 on the protection of animals used for scientific purposes (permits #48 and #159).
Determination of phage displayed peptide sequences
Peptide-encoding region of bacteriophage genome was amplified by PCR using Phusion Green Hot Start II High-Fidelity DNA Polymerase (Thermo Scientific, F537L) (reaction volume: 25 μl, list of primers added as Supplementary Table S1). Cycling conditions: denaturation at 98°C for 30 s, followed by 25 amplification cycles (10 s at 98°C, 21 s at 72°C), and final elongation (72°C for 5 min). Polymerase chain reaction (PCR) products were purified using AMPure XP Bead Based Next-Generation Sequencing Cleanup system (Beckmann Coulter, A63881) using 1.8 μl of beads per 1 μl of PCR products. Purified PCR products were quantified using Agilent Bioanalyzer 2100 Instrument using the High sensitivity DNA Kit (Agilent, 5067-4626). Ion Torrent Emulsion PCR and enrichment steps were performed using Ion PGM HiQ View OT2 kit (Life technologies, A29900). HTS was performed using Ion Torrent™ Personal Genome Machine™ (ION-PGM) using Ion PGM HiQ View sequencing kit (Life technologies, A30044) and Ion 316v2 chips (Life Technologies, 448149). The FASTQ sequence files were converted to text files and translated using in-house developed python scripts.
Peptide data mining
edgeR (31) was used for normalization of peptide count (Input data available as Supplementary Tables S2–S4) in the samples and for statistical analyses. Peptides with <5 reads present in <2 replicate samples were not used for further analysis. Three biological replicates of peptides present in brain or lung samples versus input library were compared for peptide representation analysis. Three biological replicates of peptides present in brain or lung samples versus control organs (liver, kidney, muscle, brain or lung) were compared for differential binding analysis. False discovery rate (FDR) cutoff of 0.05 and log FC cutoff of 2 were used as thresholds for statistical significance. UpSetR (32) package was used to create visualizations of the number of peptides in multiple-organ comparisons. Packages from tidyverse were used to create other sequencing analysis-related graphics (33).
Peptide phage cloning
Nucleotide sequences coding for BH1 (CLNSNKTNC), BH2 (CWRENKAKC) and LH1 (CGPNRRGDC) peptides were cloned into phage genomic DNA to be expressed on the phage surface as C-terminal fusions to capsid protein. The cloning was performed using complementary oligonucleotides (AATTCTTGCGGCCCGAACCGCCGCGGCGATTGCTA and AGCTTAGCAATCGCCGCGGCGGTTCGGGCCGCAAG for LH1, AATTCTTGCCTGAACA GCAACAAAACCAACTGCTA and AGCTTAGCAGTTGGTTTTGTTGCTGTTCAGGCAAG for BH1, AATTCTTGCTGGCGCGAAAACAAAGCGAAATGCTA and AGCTTAGCATTTC GCTTTGTTTTCGCGCCAGCAAG for BH2). Complementary oligonucleotides were diluted in milli-Q (mQ) water at 100 nM, heated at 95°C for 5 min and slowly cooled to room temperature for annealing. The cloning was performed according to the manufacturer's protocol using the T7Select® 415-1 Cloning Kit (Millipore, 70015-3) with 1 μl of annealed oligonucleotides.
Intra-animal validation of candidate phages
HTS play-off technique allows comparative internally controlled auditioning of tissue selectivity of systemic candidate peptide phages. Phages expressing a BH1, BH2, LH1, CAGALCY or G7 (CGGGGGGGC) peptides were amplified in E. coli strain BLT5403, purified by PEG8000 precipitation, followed by cesium chloride gradient centrifugation, and dialyzed against PBS. The phages were mixed in equimolar ratio and 1 × 1010 pfu of phage mix was injected i.v. into Balb/C mice (n = 3). After 30 min, the animals were anesthetized and perfused with PBS–BSA. The target and control organs (brain, lung, liver, kidney, skeletal muscle) were collected, the tissues were homogenized and phages in tissue lysates were amplified and purified by PEG8000 precipitation. The distribution of phages between target and control organs was determined by HTS. Fold change was calculated as follows:
![]() |
where
peporgan = % of reads coming from the peptide of interest in particular organ sequencing dataset;
pepinput = % of reads coming from the peptide of interest in input library sequencing dataset;
G7organ = % of reads coming from G7 peptide in particular organ sequencing dataset;
G7input = % of reads coming from G7 peptide in input library sequencing dataset.
Inter-animal validation of candidate phages
Phage biodistribution studies by titering are widely used to study the biodistribution of the single peptide phages in individual animals. For titering-based biodistribution studies, 5 × 109 pfu of phage expressing BH1, BH2, LH1, CAGALCY or G7 peptide was injected i.v. into Balb/C mice (n = 3). After 60 min, the animals were anesthetized and perfused with PBS–BSA. The target and control organs (brain, lung, liver, kidney, skeletal muscle) were collected, homogenized, and the amount of phage in tissue lysates was determined by titering using E. coli strain BLT5615. Fold change was calculated as follows:
![]() |
where
n = number of animals per group (n = 3 in this case);
peporgan = pfu/mg titer from peptide coding phage of interest in particular organ;
G7organ = pfu/mg titer from G7 coding phage in the particular organ.
Database search for target-unrelated peptides
All sequences that were identified using differential binding approach were searched in SAROTUP (http://i.uestc.edu.cn/sarotup3/) (25) to identify target-unrelated peptides (TUPScan: http://i.uestc.edu.cn/sarotup3/cgi-bin/TUPScan.pl), peptides containing sequences known to cause phages to grow faster (PhDFaster 2.0: http://i.uestc.edu.cn/sarotup3/cgi-bin/PhD7Faster.pl) and already known sequences in BDB (MimoScan: http://i.uestc.edu.cn/sarotup3/cgi-bin/MimoScan.pl).
Synthesis and functionalization of silver nanoparticles
For peptide biodistribution visualization purpose, Ag nanoparticle labeled peptides were used. The synthesis and surface functionalization of silver nanoparticles (AgNPs) was based on previously published methods (21,34,35) with minor modifications. Biotin-Ahx-LGDPNS-CAGALCY-OH (CAGALCY), biotin-Ahx-CLNSNKTNC-OH (BH1), biotin-Ahx-CWRENKAKC-OH (BH2) (TAG Copenhagen A/S, Frederiksberg, Denmark) were used as targeting moieties. Briefly, ultrapure mQ water (0.5 l; resistivity 18 MΩ cm−1) in a foil-wrapped flask cleaned with a piranha solution (H2SO4/H2O2) was heated to 65°C with stirring. For isotopic barcoding, 107AgNO3 or 109AgNO3 (50.4 mg; Isoflex USA, San Fransisco, CA, USA) pre-dissolved in 1 ml of mQ water were added to the flask. Next, trisodium citrate hydrate (200 mg; #25114, Sigma-Aldrich Co., LLC) and tannic acid (1.2 mg; #403040, Sigma-Aldrich Co., LLC) were pre-dissolved in mQ water (20 ml) and added to the vessel. The solution was heated at 65°C for 3 min, followed by stirring on a hot plate at 250°C in the dark for 27 min.
NeutrAvidin (NA; #31055, Thermo Scientific Inc., Washington, USA) was modified with an OPSS–PEG(5k)–SCM linker (OPSS; JenKem Technology USA Inc., Texas, USA) according to the procedure described by Braun et al. (34). Subsequently, NeutrAvidin-OPSS (3.9 ml, 2.9 mg/ml) was added to the core AgNPs (500 ml). After 2 min, 4-morpholineethanesulfonic acid hemisodium salt (5 ml, 0.5 M in mQ water; #M0164, Sigma-Aldrich Co., LLC) was added. The pH of the solution was adjusted to 6.0 with 1 N NaOH, and the solution was incubated in a water bath at 37°C for 24 h. The solution was brought to room temperature (RT) and 10× phosphate buffered saline (50 ml; PBS; Naxo OÜ, Tartumaa, Estonia) was added, followed by Tween-20 (250 μl; #P9416, Sigma-Aldrich Co., LLC). The solution was centrifuged at 17 500 × g for 20 min (4°C), the supernatant was removed, and the particles were resuspended in 0.005% Tween-20 in PBS (PBST). Next, tris(2-carboxyethyl)phosphine hydrochloride solution (TCEP; #646547, Sigma-Aldrich Co., LLC) was added to a final concentration of 1 mM, followed by incubation at RT for 30-min. Then, lipoic acid–PEG(1k)–NH2 (#PG2-AMLA-1k, Nanocs Inc., New York, USA) was added to a final concentration of 5 μM, and the mixture was incubated at RT for 2.5 h. The solution was centrifuged at 17 500 × g at 4°C for 20 min, the supernatant was removed and the particles were resuspended in PBST to half of the initial volume. The AgNP solution was filtered through a 0.45 μm filter and stored at 4°C in dark.
CF555 dye (#92130, Biotium Inc., California, USA) with an NHS group was coupled to the NH2 groups on the AgNPs. CF555-NHS (5 μl, 2 mM) in dimethyl sulfoxide (DMSO; Sigma-Aldrich Co., LLC) was added to AgNPs (500 μl), followed by overnight incubation at 4°C. The particles were washed three times by centrifugation at 17 500 × g at 4°C for 20 min, followed by resuspension of the particles in PBST. Next, biotinylated peptides were coupled to the particles by adding peptide (10 μl, 2 mM in mQ water) to AgNPs (500 μl), followed by incubation at RT for 30 min. The AgNPs were washed, 0.2 μm filtered and stored at 4°C in the dark.
Mapping of AgNP biodistribution using laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS)
The 107AgNPs (functionalized with a peptide (CAGALCY, BH1 or BH2) and control 109AgNPs (blocked with biotin) were mixed in equimolar ratio and i.v. injected into mice. In total, 200 μl of AgNPs (OD415 = 100) was injected per mouse. After 3 h of circulation, the mice were anesthetized and perfused with PBS. The brain and liver were collected, frozen in OCT, sectioned at 30 μm on Superfrost+ slides, and dried in a vacuum desiccator.
Laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) analysis of the tissue samples was performed using Agilent 8800 ICP-MS/MS coupled to a Cetac LSX-213 G2+ laser ablation unit equipped with HelEx II ablation cell and connected using ARIS (Aerosol Rapid Introduction System) sample introduction system. The system was tuned using NIST 610 glass. Ablation was performed as line scans using 65 μm square spot, scan speed of 260 μm/s, 20 Hz and fluence of 13.5 J/cm2. Five parallel ablations with 65 μm spacing were performed on each tissue. ICP-MS was operated in single quad mode. Data collection was performed in TRA mode with dwell times of 9.5 ms on mass 13C and 14 ms on mass 107Ag and 109Ag corresponding to a total duty cycle of 50ms. Data reduction was performed using Iolite v3.62. Median with MAD error 2 SD outlier reject was used for data selection.
RESULTS
Identification of candidate brain and lung homing peptides using in vivo peptide phage display with low complexity library
Low-diversity naïve CX7C T7 phage peptide library with spiked-in brain-homing peptide CAGALCY was used as the starting library for in vivo phage display screens. For round 1 of biopanning, the starting library was injected into mice (n = 3) and after 30 min of circulation, the mice were perfused to remove free and weakly bound background phages. Target (lung, brain) and control organs (liver, kidney, skeletal muscle) were collected, phages from these organs were amplified and the samples were subjected to Ion-Torrent HTS. Starting library was sequenced in three technical replicates. Phage pools recovered from lung and brain were used for two separate second biopanning rounds (n = 3 for each). After round 2, all control and target organs were collected, and phage pools were sequenced (Figure 1). Sequencing was also performed for input libraries.
Figure 1.
Flow of in vivo phage biopanning. In round 1, 100 μl of 5 × 109 pfu collapsed CX7C T7 phage library and 5 × 104 pfu of CAGALCY coding phage were injected in mice (n = 3). Target organs (lung and brain) and control organs (kidney, liver and muscle) were collected and phage pools contained in target organs were amplified. For round 2, 5 × 109 pfu of amplified libraries were injected in mice for further selection of lung (n = 3) or brain (n = 3) selective phages. Phage pools isolated from each organ at each round were subjected to HTS.
The sequences of peptides expressed on the surface of phages recovered from target and control tissues were derived from the DNA sequencing data. The representation of reliable peptides, defined as peptides present in all replicates, was determined for three technical replicates of round 1 input library and for three biological replicates of phages amplified from organs (Figure 2). The number of reliable peptides for input libraries is the same as the depth of sequencing since these samples were prepared as technical replicates from one input sample. At the sequencing depth used, reliable peptides in round 1 input library made up ∼30% of all peptides. Phage pools recovered from brain displayed smaller fraction of reliable peptides (∼10%). Percentage of reliable peptides in brain was the lowest, while in other organs percentage of reliable peptides from rounds 1 and 2 brain samples were comparable to the one in technical replicates. Round 2 lung samples contained a smaller fraction of reliable peptides.
Figure 2.
Percentage of reliable peptides during in vivo selection rounds 1 and 2. The graphs represent percentage of reliable peptides (defined as being present in n = 3 replicates for each sample) in mice dosed with naïve library (A), round 1 brain-selected library (B) and round 1 lung-selected library (C) . Input libraries represent technical variability between technical replicates (n = 3) whereas organ-rescued libraries represent biological variability (n = 3).
Differential binding approach identifies organ-selective peptides
Organ selectivity of peptides was determined using differential binding approach, with statistical comparisons performed for each control organ (lung, kidney, liver, and muscle in the case the brain is the target organ) and for the target organ (brain) by using log FC >2 and FDR <0.05 as defined cutoffs. Each peptide that appeared brain-selective based on these criteria, was analyzed further. Peptides from each comparison (e.g. lung–brain, kidney–brain, liver–brain and muscle–brain) were then cross-checked with all identified peptides in other comparisons. Target organ selective peptides were defined as being present in all comparisons. Peptides that were selective only when compared to one or two control organs were discarded (Figure 3).
Figure 3.
Differential binding analysis. HTS data from target organ (brain or lung) were compared to all control organs individually. Using edgeR, log FC >2 and FDR <0.05 were applied as cutoff to identify peptides that are statistically signifficantly over-represented in brain. Peptides from each individual comparison were cross-compared to identify hits that appear in all comparisons as brain selective.
Landscapes of peptides displayed on phages recovered from brain samples in round 1 of biopanning were compared to those in the kidney, liver, muscle and lung. This analysis yielded a single brain-selective peptide when compared against the muscle sample. In round 2 of selection, 17 peptides were selective for the brain compared to the muscle, 9 compared to the liver, 7 compared to the kidney and 1 peptide (CAGALCY, known brain homing peptide spiked in the starting library) compared to the lung. CAGALCY was brain selective when compared to all control organs and six peptides appeared brain-selective when compared against kidney, liver and muscle (Figure 4A). Based on log FC difference against control organs, BH1 (CLNSNKTNC) and BH2 (CWRENKAKC) were selected for individual testing as the most promising candidate brain homing peptides (Supplementary Table S5).
Figure 4.
Multiorgan comparison of selected phage pools to identify organ specific homing peptides. Statistically significant (FDR < 0.05) brain homing (A) and lung homing (B) peptides compared against each control organ (log FC > 2, FDR < 0.05) and number of peptides that overlap between different comparisons. Only one peptide (CAGALCY) is brain selective and three peptides are lung selective when compared to all other control organs (Supplementary Table S5 and S6).
Differential binding analysis for lung-selective peptides in mice dosed with naïve library (round 1) yielded no lung selective peptides when compared to any other control organ. In contrast, round 2 yielded 18 lung selective peptides when the dataset was compared against the kidney, 16 peptides when compared against the liver, 6 when compared against the brain and 5 when compared against the muscle. Three peptides were present in all comparisons as lung-selective peptides (Figure 4B). Notably, among 20 peptides that were lung selective compared to at least one organ, 8 contained the CX5RGDC motif and 7 ended with C-terminal arginine residue (Supplementary Table S6). All three of the identified lung-specific peptides contained the RGDC motif. LH1 (CGPNRRGDC) was selected for further testing, as this peptide had the highest log FC difference toward lung when compared to the brain (log FC = 4.3) and liver (log FC = 3.9).
Enrichment analysis is the most commonly used approach to identify peptides in phage display. We next analyzed enrichment of reliable peptides in target organs in round 1 versus round 2. Theoretically, the peptides that are most enriched, are most likely to reach the target and possess desired binding properties. We used gating log FC >2 and FDR < 0.05 between round 1 input library and phages recovered from the target organs in rounds 1 and 2. As a threshold, we decided to apply the criteria used for differential binding analysis. After first round of selection, this approach resulted in identification of four peptides that are enriched in the lung and no brain-enriched peptides. After round 2, 16 peptides were found enriched in the brain (Supplementary Table S7) and 37 peptides in the lung (Supplementary Table S8).
One of four peptides enriched in the lungs after round 1 was an R-ending peptide, KTGARKR; the rest of sequences contained KLAALE phage vector-derived sequence. After round 2, among the 16 peptides enriched in the brain, CAGALCY was the most enriched sequence, followed by CVRLNKVRC (EN-2) and CTVTNKVRC (EN-3). EN-2 sequence in differential binding dataset shows selectivity only against the muscle and EN-3 sequence is selective for the brain when compared against the liver, kidney and muscle. Both BH1 and BH2 sequences that were identified using differential binding approach were present on the list as number 8 and number 4 accordingly. Of 37 peptides identified through enrichment analysis in lung, LH1 was 16th most enriched sequence.
Selected peptide phages show expected in vivo tropism in playoff analysis
In an earlier study, we developed a playoff analysis for the validation of candidate homing peptides (5). The method is based on medium-throughput internally controlled auditioning of mixtures of selected peptide phages. An equimolar mix of CAGALCY, BH1, BH2, LH1 and G7 control phages was i.v. injected into mice (n = 3). Control organs (kidney, liver, muscle) and target organs (brain, lung) were collected and phage pools in each organ were sequenced using HTS. The input mixture was also sequenced. After normalization against representation in the input, fold change over G7 phage was calculated for each peptide phage in each organ (Figure 5).
Figure 5.
Intra-animal validation using in vivo peptide phage playoff and HTS. Individual peptide phages (G7, CAGALCY, BH1, BH2, LH1) were mixed at equimolar ratio and injected in BalbC mice (n = 3). Results were expressed as fold change over G7 control phage. CAGALCY (A), BH1 (B), BH2 (C), LH1 (D). Lower and upper hinges correspond to first and third quartiles, whiskers represent 1.5*IQR (inter-quartile range), middle line represents the median.
The phage displaying known brain-homing peptide CAGALCY showed the highest fold-change over the G7 control phage (Figure 5A), both in brain homing and selectivity. Also, BH1 and BH2 showed preferential accumulation in the brain (Figure 5B and C) and LH1 showed tropism for the lungs (Figure 5D).
In vivo homing of individual peptide phages
Biodistribution of peptide phages may be affected by other phages in the mix, with the bystander effect best characterized for tumor penetrating CendR peptides (36). Therefore, we decided to study biodistribution of individual candidate organ specific peptide phages. Individual phage clones displaying CAGALCY, BH1, BH2 and LH1 were i.v. injected in mice (n = 3 for each peptide sequence). After 30 min circulation, the organs (brain, lung, liver, kidney, muscle) were collected and phage titer in each organ was determined. For each peptide phage, tissue distribution was expressed as fold change over control G7 phage (Figure 6).
Figure 6.
Individual peptide-phage testing. Individual peptide displaying phages (CAGALCY, BH1, BH2, LH1, G7) were injected in mice (n = 3 for each peptide). Organs (brain, lung, liver, kidney, muscle) were collected and phages bound to the organ were titered. Fold change (y-axis) was calculated over the titer of G7 phage. CAGALCY (A) homes to brain. BH1(B) and BH2 (C) are selective toward brain when compared to other organs except lung. LH1 (D) shows selectivity toward lung. Lower and upper hinges correspond to first and third quartiles, whiskers represent 1.5IQR (inter-quartile range), middle line represents the median.
In the brain, CAGALCY-displaying phages showed ∼65-fold overrepresentation compared to G7 phage, and the phages displaying newly identified brain homing peptides BH1 and BH2 showed ∼20-fold overrepresentation. Second highest difference for BH1 and BH2 was observed in the lung. This was expected as both sequences did not show brain selectivity when compared to lung using differential binding approach.
Finally, we investigated whether BH1, BH2 or LH1 could be target-unrelated peptides, i.e. peptides that are enriched for reasons other than target recognition. TUPScan did not identify peptides BH1, BH2 or LH1 as target-unrelated peptide sequences. Furthermore, PhDFaster did not detect peptides BH1, BH2, LH1 as sequences that could cause phages to be amplified faster than other clones. MimoScan confirmed that BH1, BH2 and LH1 have not been previously identified in other phage display datasets that are included in BDB.
Brain-selective peptide functionalization increases delivery of AgNPs to the brain
Effect of brain homing peptides on biodistribution of synthetic nanoparticles was tested using peptide-coated isotopically barcoded AgNPs. Biotinylated CAGALCY, BH1 and BH2 peptides were coated on neutravidin-107AgNPs and biotin-blocked neutravidin109AgNPs were used as a control. Equimolar cocktail of targeted and control 107/109AgNPs was injected i.v. in mice; after 3 h circulation, animals were sacrificed, tissues isolated and AgNPs mapped in brain and liver sections using LA-ICP-MS. For all peptide-AgNPs, the liver ratio of 107/109AgNPs (∼1) was similar to the ratio of the particles in the input mixture. In the brain, the ratiometric 107/109AgNP analysis was performed by quantitation of 107Ag and 109Ag in five laser ablation paths per cryosection (Figure 7). The 107/109AgNP ratio was 1.62 (95% CI 1.62 ± 0.0719) for the mice injected with CAGALCY-107AgNPs, 1,18 (95% CI 1.18 ± 0.0331) for BH1–107AgNPs and 1.04 (95% CI 1.04 ± 0.027) for BH2–107AgNPs.
Figure 7.
Ratiometric in vivo auditioning of brain selectivity of peptide AgNPs. CAGALCY (first column), BH1 (second column), BH2 (third column) were coated on 107AgNPs, mixed with biotin-blocked 109AgNPs, injected i.v. to mice. Brains were isolated after 3 h circulation, 107/109Ag ratio (y-axis in each graph) was determined using LA-ICP-MS over five parallel ablation paths (shown as individual graphs; x-axis represents the ratio over the length, expressed as acquisition time, of the ablation line). Mean 107/109Ag ratio was 1.62 (95% CI 1.55–1.69) for CAGALCY, 1.18 (95% CI 1.15–1.21) for BH1 and 1.04 (95% CI 1.01–1.07) for BH2.
DISCUSSION
In vivo peptide phage display is a powerful technology that allows unbiased agnostic mapping of systemically accessible vascular heterogeneity in diseased and normal tissues and development of peptide affinity ligands for precision delivery of drugs, imaging agents and nanoparticles. However, the technology is laborious, and subject to biases and artifacts due to low stringency and reproducibility, and lack of validated tools for identification of the most avid binders in selected phage pools. Availability of powerful and affordable HTS platforms has made it feasible to determine homing peptide phage landscapes in target and control organs. In the current study, we developed an approach for identification of target-selective homing peptides based solely on in vivo display HTS data. Application of this approach enables faster and more reliable homing peptide discovery than provided by the current methods. A study using a peptide T7 phage library with spiked-in brain homing phage demonstrated brain-specific differential binding of brain homing phage and resulted in identification of novel lung and brain selective homing peptides.
Our HTS-based screening addresses the main shortcoming in HTS-based phage biopanning—excessive reliance on enrichment of peptide phages at the target site. Enrichment alone does not provide full indication of selectivity of the peptide, as representation of peptide phages at the target site is also influenced by nonspecific promiscuous interactions and peptide-dependent variation in the phage amplification rate (30). Our approach takes into account the statistically weighted enrichment for each peptide in the pool in the context of representation across target and nontarget sites. The method was inspired by published analysis of RNA sequencing data (37) and availability of computational tools such as edgeR that allow the use of quasi-likelihood F-tests and correct the results taking into account multiple comparisons (31,38).
To validate our approach to in vivo homing peptide identification, we decided to perform a model screen focused on the enrichment and tissue distribution analyses of a known brain-selective peptide, CAGALCY (20,21), spiked into a low-complexity peptide phage library. We showed that by using the differential binding approach we can not only select for CAGALCY as a brain homing peptide, which would be possible by conventional enrichment analysis, but also profile its biodistribution and selectivity across organs. The diversity of the peptide library used in the model screen, ∼1 × 105 pfu, is ∼4 orders of magnitude lower than the diversity of the naïve CX7C library. This is similar to typical diversity of the phage library after preselection ex vivo, or on cultured cells, steps often used to enrich for binding peptides prior to in vivo biopanning (5). Remarkably, even with the low-complexity peptide library used here, the differential binding approach resulted in identification of several lung- and brain-selective homing peptides. Individual phage homing and playoff studies confirmed that newly identified BH1, BH2 and LH1 peptide phages possess the intended systemic tropism. Complementary in vivo ratiometric studies with isotopically barcoded AgNPs confirmed the brain homing of CAGALCY-AgNPs (21) and showed that surface BH1 and BH2 peptide functionalization had a positive effect on AgNP brain delivery.
Similar strategies for novel peptide identification using phage screens that take advantage of statistical methods have already proven effective for HTS data in in vitro settings (30). HTS-based peptide differential profiling can be used for systematic identification of homing peptides and possibly other phage-displayed targeting ligands (e.g. single-chain antibodies, affibodies). Importantly, the differential binding approach developed here avoids the bias introduced during a phage amplification step between phage display rounds—one of the main drawbacks of the phage display (39).
It should be possible to develop affinity ligands targeting vascular ZIP codes—systemically accessible vascular heterogeneity across normal and diseased tissues (40) following a similar process. Variations on the theme to be explored in follow-up studies include additional tissue fractionation steps for profiling of cell type- and ECM-specific peptides in normal organs and in pathological conditions such as tumors.
In summary, differential binding-based HTS in vivo phage display developed in the current study is likely to render identification of vascular targeting peptides more reliable and consistent facilitating progress toward development of affinity-guided smart drugs and contrast agents.
DATA AVAILABILITY
R script for differential binding analysis and generation of figures is available at https://github.com/KarlisPleiko/diff_phage.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Prof. Erkki Ruoslahti for constructive criticism of the manuscript. Graphic abstract, Figure 1 and Figure 3 were created using BioRender.com. K. Pleiko was supported by PhD research scholarship administered by the University of Latvia Foundation and funded by ‘Mikrotīkls’ Ltd and ERASMUS scholarship; A. Tobi was supported by a Smart Specialization PhD Scholarship of the European Regional Development Fund as a part of the Estonian Research and Development and Innovation Strategy 2014–2020.
Author contributions: K.P., K.P. and M.H. designed and carried out experiments, analyzed the data, composed the figures and wrote the manuscript. U.R. designed the differential binding experiments. K.K. performed sequencing. P.P. performed LA-ICP-MS analysis. A.T. prepared 107AgNPs and 109AgNPs and performed the peptide functionalization of the AgNPs. T.T. conceived and supervised the project, analyzed the data and wrote the manuscript.
Contributor Information
Karlis Pleiko, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia; Faculty of Medicine, University of Latvia, Riga, LV-1586, Latvia.
Kristina Põšnograjeva, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia.
Maarja Haugas, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia.
Päärn Paiste, Department of Geology, University of Tartu, Ravila 14A, 50411 Tartu, Estonia.
Allan Tobi, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia.
Kaarel Kurm, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia.
Una Riekstina, Faculty of Medicine, University of Latvia, Riga, LV-1586, Latvia.
Tambet Teesalu, Laboratory of Precision- and Nanomedicine, Institute of Biomedicine and Translational Medicine, University of Tartu, Tartu, 50411 Tartu, Estonia; Cancer Research Center, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA; Center for Nanomedicine and Department of Cell, Molecular and Developmental Biology, University of California, Santa Barbara, Santa Barbara, CA 93106, USA.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
European Regional Development Fund [2014–2020.4.01.15-0012]; EMBO Installation [2344 to T.T.]; European Research Council grant GLIOGUIDE from European Regional Development Fund [780915 to T.T.]; Wellcome Trust International Fellowship [WT095077MA to T.T.]; Norway Grants [EMP181 to T.T.]; Estonian Research Council [PRG230, EAG79 to T.T.]. Funding for open access charge: Wellcome Trust International Fellowship.
Conflict of interest statement. None declared.
REFERENCES
- 1. Smith G.P. Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface. Science. 1985; 228:1315–1317. [DOI] [PubMed] [Google Scholar]
- 2. Henninot A., Collins J.C., Nuss J.M.. The current state of peptide drug discovery: back to the future. J. Med. Chem. 2018; 61:1382–1414. [DOI] [PubMed] [Google Scholar]
- 3. Pasqualini R., Ruoslahti E.. Organ targeting in vivo using phage display peptide libraries. Nature. 1996; 380:364–366. [DOI] [PubMed] [Google Scholar]
- 4. Arap W., Pasqualini R., Ruoslahti E.. Cancer treatment by targeted drug delivery to tumor vasculature in a mouse model. Science. 1998; 279:377–380. [DOI] [PubMed] [Google Scholar]
- 5. Teesalu T., Sugahara K.N., Ruoslahti E.. Mapping of vascular ZIP codes by phage display. Methods in Enzymology. 2012; 503:Academic Press Inc; 35–56. [DOI] [PubMed] [Google Scholar]
- 6. Sugahara K.N., Teesalu T., Karmali P.P., Kotamraju V.R., Agemy L., Girard O.M., Hanahan D., Mattrey R.F., Ruoslahti E.. Tissue-penetrating delivery of compounds and nanoparticles into tumors. Cancer Cell. 2009; 16:510–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Teesalu T., Sugahara K.N., Kotamraju V.R., Ruoslahti E.. C-end rule peptides mediate neuropilin-1-dependent cell, vascular, and tissue penetration. Proc. Natl. Acad. Sci. U.S.A. 2009; 106:16157–16162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Ruoslahti E. Tumor penetrating peptides for improved drug delivery. Adv. Drug Deliv. Rev. 2017; 110–111:3–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Aumailley M., Gurrath M., Müller G., Calvete J., Timpl R., Kessler H.. Arg-Gly-Asp constrained with cyclic pentapeptides. FEBS Lett. 1991; 29:50–54. [DOI] [PubMed] [Google Scholar]
- 10. Altmann A., Sauter M., Roesch S., Mier W., Warta R., Debus J., Dyckhoff G., Herold-Mende C., Haberkorn U.. Identification of a novel ITGαvβ6-binding peptide using protein separation and phage display. Clin. Cancer Res. 2017; 23:4170–4180. [DOI] [PubMed] [Google Scholar]
- 11. Pasqualini R., Koivunen E., Kain R., Lahdenranta J., Sakamoto M., Stryhn A., Ashmun R.A., Shapiro L.H., Arap W., Ruoslahti E.. Aminopeptidase N is a receptor for tumor-homing peptides and a target for inhibiting angiogenesis. Cancer Res. 2000; 60:722–727. [PMC free article] [PubMed] [Google Scholar]
- 12. Mann A.P., Scodeller P., Hussain S., Braun G.B., Mölder T., Toome K., Ambasudhan R., Teesalu T., Lipton S.A., Ruoslahti E.. Identification of a peptide recognizing cerebrovascular changes in mouse models of Alzheimer's disease. Nat. Commun. 2017; 8:1403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Han Z., Zhou Z., Shi X., Wang J., Wu X., Sun D., Chen Y., Zhu H., Magi-Galluzzi C., Lu Z.R.. EDB fibronectin specific peptide for prostate cancer targeting. Bioconjug. Chem. 2015; 26:830–838. [DOI] [PubMed] [Google Scholar]
- 14. Lingasamy P., Tobi A., Haugas M., Hunt H., Paiste P., Asser T., Rätsep T., Kotamraju V.R., Bjerkvig R., Teesalu T.. Bi-specific tenascin-C and fibronectin targeted peptide for solid tumor delivery. Biomaterials. 2019; 219:119373. [DOI] [PubMed] [Google Scholar]
- 15. Lingasamy P., Tobi A., Kurm K., Kopanchuk S., Sudakov A., Salumäe M., Rätsep T., Asser T., Bjerkvig R., Teesalu T.. Tumor-penetrating peptide for systemic targeting of Tenascin-C. Sci. Rep. 2020; 10:5809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Ikemoto H., Lingasamy P., Willmore A.A., Hunt H., Kurm K., Tammik O., Scodeller P., Simón-Gracia L., Kotamraju V.R., Lowy A.M.et al.. Hyaluronan-binding peptide for targeting peritoneal carcinomatosis. Tumour Biol. 2017; 39:1010428317701628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Urakami T., Järvinen T.A.H., Toba M., Sawada J., Ambalavanan N., Mann D., McMurtry I., Oka M., Ruoslahti E., Komatsu M.. Peptide-directed highly selective targeting of pulmonary arterial hypertension. Am. J. Pathol. 2011; 178:2489–2495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Scodeller P., Simón-Gracia L., Kopanchuk S., Tobi A., Kilk K., Säälik P., Kurm K., Squadrito M.L., Kotamraju V.R., Rinken A.et al.. Precision targeting of tumor macrophages with a CD206 binding peptide. Sci. Rep. 2017; 7:14655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Scodeller P., Asciutto E.K.. Targeting tumors using peptides. Molecules. 2020; 25:808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Fan X., Venegas R., Fey R., Van Der Heyde H., Bernard M.A., Lazarides E., Woods C.M.. An in vivo approach to structure activity relationship analysis of peptide ligands. Pharm. Res. 2007; 24:868–879. [DOI] [PubMed] [Google Scholar]
- 21. Toome K., Willmore A.M.A., Paiste P., Tobi A., Sugahara K.N., Kirsimäe K., Ruoslahti E., Braun G.B., Teesalu T.. Ratiometric in vivo auditioning of targeted silver nanoparticles. Nanoscale. 2017; 9:10094–10100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Matochko W.L., Cory Li S., Tang S.K.Y., Derda R.. Prospective identification of parasitic sequences in phage display screens. Nucleic Acids Res. 2014; 42:1784–1798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Matochko W.L., Chu K., Jin B., Lee S.W., Whitesides G.M., Derda R.. Deep sequencing analysis of phage libraries using Illumina platform. Methods. 2012; 58:47–55. [DOI] [PubMed] [Google Scholar]
- 24. Matochko W.L., Ng S., Jafari M.R., Romaniuk J., Tang S.K.Y., Derda R.. Uniform amplification of phage display libraries in monodisperse emulsions. Methods. 2012; 58:18–27. [DOI] [PubMed] [Google Scholar]
- 25. He B., Chen H., Li N., Huang J.. Sarotup: a suite of tools for finding potential target-unrelated peptides from phage display data. Int. J. Biol. Sci. 2019; 15:1452–1459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. He B., Chen H., Huang J.. PHD7Faster 2.0: predicting clones propagating faster from the Ph.D.-7 phage display library by coupling PseAAC and tripeptide composition. PeerJ. 2019; e7131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Hurwitz A.M., Huang W., Estes M.K., Atmar R.L., Palzkill T.. Deep sequencing of phage-displayed peptide libraries reveals sequence motif that detects norovirus. Protein Eng. Des. Sel. 2017; 30:129–139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Rebollo I.R., Sabisz M., Baeriswyl V., Heinis C.. Identification of target-binding peptide motifs by high-throughput sequencing of phage-selected peptides. Nucleic Acids Res. 2014; 42:e169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. He B., Jiang L., Duan Y., Chai G., Fang Y., Kang J., Yu M., Li N., Tang Z., Yao P.et al.. Biopanning data bank 2018: hugging next generation phage display. Database. 2018; 2018:doi:10.1093/database/bay032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Brinton L.T., Bauknight D.K., Dasa S.S.K., Kelly K.A.. PHASTpep: analysis software for discovery of cell-selective peptides via phage display and next-generation sequencing. PLoS One. 2016; 11:e0155244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Robinson M., McCarthy D., Smyth G.K.. edgeR: differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26:139–140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Conway J.R., Lex A., Gehlenborg N.. UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics. 2017; 33:2938–2940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Wickham H., Averick M., Bryan J., Chang W., McGowan L., François R., Grolemund G., Hayes A., Henry L., Hester J.et al.. Welcome to the Tidyverse. J. Open Source Softw. 2019; 4:1686. [Google Scholar]
- 34. Braun G.B., Friman T., Pang H.B., Pallaoro A., De Mendoza T.H., Willmore A.M.A., Kotamraju V.R., Mann A.P., She Z.G., Sugahara K.N.et al.. Etchable plasmonic nanoparticle probes to image and quantify cellular internalization. Nat. Mater. 2014; 13:904–911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Tobi A., Willmore A.A., Kilk K., Sidorenko V., Braun G.B., Soomets U., Sugahara K.N., Ruoslahti E., Teesalu T.. Silver nanocarriers targeted with a CendR peptide potentiate the cytotoxic activity of an anticancer drug. Adv. Ther. 2020; 2000097:https://doi.org/10.1002/adtp.202000097. [Google Scholar]
- 36. Sugahara K.N., Teesalu T., Karmali P.P., Kotamraju V.R., Agemy L., Greenwald D.R., Ruoslahti E.. Coadministration of a tumor-penetrating peptide enhances the efficacy of cancer drugs. Science. 2010; 328:1031–1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Wang Z., Gerstein M., Snyder M.. RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 2009; 10:57–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Lun A.T.L., Chen Y., Smyth G.K.. It's DE-licious: a recipe for differential expression analyses of RNA-seq experiments using quasi-likelihood methods in edgeR. Methods in Molecular Biology. 2016; 1418:Humana Press Inc; 391–416. [DOI] [PubMed] [Google Scholar]
- 39. Derda R., Tang S.K.Y., Li S.C., Ng S., Matochko W., Jafari M.R.. Diversity of phage-displayed libraries of peptides during panning and amplification. Molecules. 2011; 16:1776–1803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Arap W., Kolonin M.G., Trepel M., Lahdenranta J., Cardó-Vila M., Giordano R.J., Mintz P.J., Ardelt P.U., Yao V.J., Vidal C.I.et al.. Steps toward mapping the human vasculature by phage display. Nat. Med. 2002; 8:121–127. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
R script for differential binding analysis and generation of figures is available at https://github.com/KarlisPleiko/diff_phage.