Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Mar 16.
Published in final edited form as: ACS Chem Biol. 2018 Jan 9;13(3):527–532. doi: 10.1021/acschembio.7b00520

Discovery of a 29-amino-acid reactive abiotic peptide for selective cysteine arylation

Ethan D Evans 1, Bradley L Pentelute 1,*
PMCID: PMC5856623  NIHMSID: NIHMS934611  PMID: 29283243

1. Materials

Chemicals and enzymes

Pentafluorophenyl sulfide was purchased from Santa Cruz Biotechnology (Dallas, TX). 1,4-Dithio-DL-threitol (DTT), 1-[Bis(dimethylamino)methylene]-1H-1,2,3-triazolo[4,5-b]pyridinium 3-oxid hexafluorophosphate (HATU), Fmoc-L-Ala-OH, Fmoc-L-Cys(trt)-OH, Fmoc-L-Asp(tBu)-OH, Fmoc-L-Glu(tBu)-OH, Fmoc-L-Phe-OH, Fmoc-Gly-OH, Fmoc-L-His(Boc)-OH, Fmoc-L-Ile-OH, Fmoc-L-Lys(Boc)-OH, Fmoc-L-Leu-OH, Fmoc-L-Met-OH, Fmoc-L-Asn(Trt)-OH, Fmoc-L-Pro-OH, Fmoc-L-Gln(Trt)-OH, Fmoc-L-Arg(Pbf)-OH, Fmoc-L-Ser(tBu)-OH, Fmoc-L-Thr(tBu)-OH, Fmoc-L-Val-OH, Fmoc-L-Trp(Boc)-OH, Fmoc-L-Tyr(tBu)-OH, Fmoc-L-Lys(biotin)-OH, 2-chlorotrityl chloride resin were purchased from Chem-Impex International (Wood Dale, IL). H-rink-amide chemmatrix Hyr resin was obtained from PCAS BioMatrix, Inc (Quebec, Canada). (7-Azabenzotriazol-1-yloxy)tripyrrolidinophosphonium hexafluorophosphate (PyAOP) was purchased from P3 BioSystems (Lousiville, KY). Tris(2-carboxyethyl)phosphine hydrochloride was purchased from Hampton Research (Aliso Viejo, CA). Carboxy-PEG12-thiol was purchased from Thermo Fisher Scientific. SuperScript II reverse transcriptase and RNase OUT were purchased from Invitrogen (Carlsbad, CA), while Taq polymerase and T7 RNA polymerase (and their associated buffers) were obtained from New England Biolabs (Ipswich, MA). Flexi rabbit reticulocyte lysate along with rNTPs and dNTPs were purchased from Promega (Madison, WI). N, N-dimethylformamide (DMF), acetonitrile (ACN), diethyl ether were purchased from VWR (Radnor, PA). Trifluoroacetic acid (TFA) was obtained from Sigma-Aldrich. Other chemicals listed were purchased from either Sigma-Aldrich or VWR and used as received.

2. Methods

2.1. Liquid chromatography-mass spectrometry (LCMS)

For the remainder of this manuscript, solvent A will refer to water with 0.1% (v/v) TFA, while B will be acetonitrile with 0.1% (v/v) TFA. TIC refers to total ion current in the LCMS chromatogram. The majority of LCMS chromatograms and mass spectra were obtained using an Agilent 6520 ESI-Q-TOF mass spectrometer using method 1 unless otherwise noted (MS/MS analysis was conducted on an Agilent 6550 iFunnel Q-TOF mass spectrometer). Software used for LCMS analysis was the Agilent MassHunter package and deconvolution was performed using maximum entropy.

  • Method 1:

    • LC method: 0–2 minutes 5% B, 2–11 minutes 5–65% B linear ramp, 11–12 minutes 65% B, 0.8mL/min flow rate.

    • Column: Zorbax 300SB C3 column (2.1 × 150mm, 5µm), 40°C

    • MS parameters: positive electrospray ionization (ESI).

  • Method 2:

    • LC method: 0–3 minutes 5% B, 3–17 minutes 5–95% B linear ramp, 17–18 minutes 95% B, 0.8mL/min flow rate.

    • Column: Zorbax 300SB C18 column (2.1 × 150mm, 5µm), 40°C

    • MS parameters: positive ESI

  • Method 3:

    • LC method: 0–3 minutes 5% B, 3–15 minutes 5–80% B linear ramp, 15–16 minutes 80% B, 0.8mL/min flow rate.

    • Column: Zorbax 300SB C18 column (2.1 × 150mm, 5µm), 40°C

    • MS parameters: positive ESI, MS off at 11 minutes

2.2. Peptide synthesis, cleavage and purification

Peptides were synthesized using an automated flow peptide synthesizer built in house1 on a 0.09mmol scale using Fmoc-SPPS chemistry on H-rink amide chemmatrix Hyr resin. General synthesis was performed at 90°C using the following protocol with a 80mL/min flow rate: 15s amino acid coupling (0.14M HATU, 0.2M amino acid, 10% (v/v) N,N-diisopropylethylamine (DIEA), 4.8mL total), 38s 12mL DMF wash, 34s 11.2mL 20% piperidine (v/v) in DMF deprotection with a final 38s 12.8mL DMF wash. Deviations from this protocol included: Arg and Phe couplings that were completed using PyAOP instead of HATU and the HHHHHHRL sequence found on MP01-full that was synthesized using a 40mL/min coupling at 70°C with all other parameters the same. Following synthesis, peptides were cleaved from the resin and side-chain deprotected using a mixture of 94% TFA, 2.5% (v/v) 1,2-ethanedithiol (EDT), 2.5% (v/v) water and 1% (v/v) triisopropylsilane for 7 minutes at 60°C. Peptides were there triturated three times using cold diethyl ether. The resulting precipitate was then dissolved in 50% A: 50% B and lyophilized.

Crude peptides were then dissolved in the minimal amount of 95% A: 5% B and purified by reverse phase (RP) HPLC using an Agilent Zorbax C3 column (21.2 × 250 mm, 7µm) using a linear gradient from 95% A: 5% B to 55% A: 45% B over 120 minutes at a flow rate of 7mL/min. Fractions were analyzed for purify by RP-LCMS using method 1.

2.3. Capture Agent Synthesis

Conjugation of carboxy-PEG-thiol to pentafluorophenyl sulfide

A solution consisting of 5mM carboxy-(PEG)12-thiol, 500mM pentafluorophenyl sulfide, 20mM triphenylphosphine and 230mM DIEA in acetonitrile was vortexed and left at room temperature for 4 hours. The reaction was then diluted with 10.6× volume of 95% A: 5% B, solid phase extracted and lyophilized. The resulting material was analyzed by LCMS (Figure S1, method 2). This product will later be referred to as mCA (modified CA)

Conjugation of CT-Peg-pfp sulf to Lys(bio)-2-chloro trityl choloride resin

17.7mg of 2-chlorotrityl chloride resin (0.6–0.7mmol/g) was reacted with 51.4mg Fmoc-L-Lys(biotin)-COOH in 1mL DMF with 71µl DIEA. The solution was sparged with argon and left overnight. The resin was washed with DMF, DCM and dried. The Fmoc group was removed with 500µl of a 20% piperidine in DMF solution for 30 minutes at room temperature followed by DMF washes. 20.43mg of pentafluorophenyl sulfide-PEG-COOH was coupled to the 32.3mg of dried lysine attached resin with 490µl DMF, 0.4M HATU and 98µl DIEA. This was left for 2 hours at room temperature and then washed and dried in vacuo. The capture agent was cleaved from the resin with a two hour, room temperature treatment of 95% TFA, 2.5% water, 2.5% TIPS; the cleavage cocktail was evaporated and then 4mL of 50% A:50% B was added and the resulting solution lyophilized. Crude mass obtained = 17.5mg (Figure S2, method 2).

The capture agent was purified on a C3 with the following method: 10 minutes at 95%A:5%B, 30 minutes of a 1% B increase per minutes to 65%A:35% B, followed by a 150 min, 0.25%B per minute gradient to 35%A :65% B. Fractions were analyzed by LCMS, pure fractions were combined, lyophilized and the final material was characterized by LCMS (Figure S3, method 1).

Major impurity

Seen in figure S3 is a small peak to the left of the primary peak, this peak is believed to be an oxidized capture agent (oxidation in the biotin moiety) as its mass is 16 Da greater than that for the CA (see LCMS/MS analysis below). This impurity peak is seen in kinetics time courses and elutes immediately prior to the CA.

Capture Agent NMR

A resynthesized stock of the CA was diluted in DMSO-d6 and analyzed by NMR using either 282 or 500MHz.

19F NMR (282 MHz, DMSO-d6) δ -135.46 (d, J = 24.4 Hz), -135.97 (dd, J = 26.2, 10.7 Hz), -136.67 (dd, J = 27.3, 9.8 Hz), -153.35 (t, J = 22.5 Hz), -163.58 (t, J = 22.8 Hz).

13C NMR (150MHz, DMSO) δ 173.73, 171.87, 170.17, 162.76, 158.13 (d, J = 31.4 Hz), 147.90, 147.20 (d, J = 14.9 Hz), 146.28, 145.62, 145.52, 142.89, 141.18, 137.33 (d, J = 247.0 Hz), 117.23 (t, J = 20.6 Hz), 111.27 – 108.75 (m), 107.20 – 104.61 (m), 99.56, 69.94, 69.81, 69.75, 69.72, 69.67, 69.60, 69.53, 66.77, 61.08, 59.25, 55.45, 51.71, 39.87, 38.20, 35.82, 35.24, 33.80, 30.87, 28.81, 28.26, 28.07, 25.35, 22.87.

1H NMR (500 MHz, DMSO-d6) δ 12.54 (s, 1H), 8.09 (d, J = 7.8 Hz, 1H), 7.76 (t, J = 5.6 Hz, 1H), 6.43 (s, 1H), 6.37 (s, 1H), 4.30 (dd, J = 7.8, 4.9 Hz, 1H), 4.14 (ddd, J = 12.1, 7.9, 4.9 Hz, 2H), 2.53 – 2.47 (m, 1H), 3.65 – 3.32 (m, 52H), 3.16 (t, J = 5.9 Hz, 2H), 3.13 – 3.05 (m, 1H), 3.05 – 2.93 (m, 2H), 2.82 (dd, J = 12.4, 5.0 Hz, 1H), 2.57 (d, J = 12.4 Hz, 1H), 2.46 – 2.27 (m, 2H), 2.04 (t, J = 7.4 Hz, 2H), 1.78 – 1.13 (m, 11H).

* For the peaks at 3.65-3.32 and 1.78-1.13 we believe there to be water and grease contaminants respectively, thus throwing off the integrated value of protons.

2.4. mRNA Display

Library Design

The library was designed to display a 30mer random peptide with the 14th–17th amino acids being doped as 40–50% FCPF with the following 188mer DNA sequence:

5’ – TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS WWZ WYZ XXZ WWZ NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS GGC TCC GGT AGC TTA GGC CAC CAT CAC CAT CAC CAC CGG CTA TAG GTA GCT AG – 3’

The doped FCPF was created with the following A:T:G:C ratios during DNA synthesis: W-(1:7:1:1), X-(1:1:1:7), Y-(1:1:7:1) while the G:C ratio for Z was (1:9). For this selection the following primers and oligonucleotides were purchased and used:

Librarya: 5’ – TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS WWZ WYZ XXZ WWZ NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS NNS GGC TCC GGT AGC TTA GGC – 3’

F1b: 5’ – TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA – 3’

R1b: 5’ – CTA GCT ACC TAT AGC CGG TGG TGA TGG TGA TGG TGG CCT AAG CTA CCG GAG CC – 3’

RTb: 5’ – TTT TTT TTT TTT TTT GTG ATG GTG ATG GTG GCC TAA – 3’

Psoralen Oligoa: 5’ – Psoralen C6-(uag ccg gug)2’-OMe-AAA AAA AAA AAA AAA-2x Spacer9-ACC-Puromycin – 3’

Oligos were purchased from either the Keck Oligonucleotide Synthesis facility at Yalea (New Haven, CT) or Integrated DNA Technologiesb (Coralville, IA)

Selection round protocols

The following is the complete selection protocol with parts from the manuscript methods repeated for clarity.

Initial library construction

The single stranded DNA library was converted to the desired length dsDNA library in 10mL of PCR reaction (6 cycles of 30s 52°C annealing, 1 min 72°C extension, 30s 95°C denaturing): 25nM library, 1µM F1 and R1 primers, 0.2mM dNTPs, 1x thermo pol buffer and 2.5U/µl Taq in individual 100µl total. The library was then phenol/chloroform extracted, 1-butanol concentrated and ethanol precipitated.

1st round transcription

A reaction containing 50nM dsDNA template, 1mM ATP, CTP, UTP, GTP, T7 buffer (1x), 0.4U/µl RNase OUT and 3U/µl T7 polymerase (carrying forward ~ 7.6×1013 sequences) was left at 37°C for ~15 hours. This was then purified by 6% denaturing PAGE and passively eluted. The eluted RNA was concentrated with 1-butanol and ethanol precipitated.

1st round crosslinking

Crosslinking was performed with the following reaction: 3µM RNA, 7.5µM psoralen oligo, 1x XL buffer (100mM KCl, 1mM spermidine, 1mM EDTA, 20mM HEPES pH 7.5) as previously described.2,3 The mixture was heated to 70°C for 5 minutes, cooled to RT slowly, then place on ice for >1 min. Then 100µl reactions were crosslinked in individual wells of a 96 well plate at 4°C with 365nm light for 20 minutes. All samples were then combined, concentrated with 1-butanol and run on 6% denaturing purification gel. For the first round, ~1.28×10−8 moles of RNA were input into the crosslinking reaction; thus, assuming a 2% recovery between crosslinking and input into the selection step along with a 3x oversampling, this amount would produce roughly 5.2×1013 unique sequences for the first round.

1st round translation

A bulk translation was performed using the following salt optimized mixture: 28nM XL-RNA, 12.5µM amino acid mixture without met (AA-met), 12.5µM AA-leu, 3.5mM DTT, ~1mM Mg(OAc)2, 140mM KCl, 0.2U/µl RNase OUT and 40% rabbit reticulocyte lysate. This was left at 30°C for 1.5 hours, subsequently salts were added to give ~50mM Mg2+ and 550mM K+, the reaction was then left for 42 minutes at room temperature, and finally placed in a −20°C freezer for 14 hours.

1st round oligo dT purification

6x, 1mL suspensions of oligo d(T)25 magnetic beads (New England Biolabs), were used for purification. The total round 1 translation was split evenly and the same protocol was performed for each of the six bead slurries. The translation mixture was combined with ~6x of dT binding buffer (20mM tris pH 7.5, 500mM NaCl, 1mM EDTA, 0.1% tween 20), added to a bead sample and rocked at room temperature for 1.5 hours. The beads were then washed 1x with 15 mL binding buffer, 3× 10mL wash buffer (20mM tris pH 7.5, 500mM NaCl, 1mM EDTA) and 1× 10mL low salt buffer (20mM tris pH 7.5, 200mM NaCl, 1mM EDTA, each ‘wash’ incorporated a 15 min incubation). Finally, 1mL of 20mM tris (pH 7.5) was added per bead suspension and all six were combined. This final mixture was placed at 65°C for 4 minutes upon which time the supernatant was removed. Then 5mL of 10mM tris buffer was added to the beads and the heating protocol was repeated. The two supernatants were combined and the concentration of RNA was determined by uv-vis spectrophotometry. The solution was filtered through a 0.22µm filter, concentrated on a 10K Amicon Ultra centrifugal filter (EMD Millipore) and ethanol precipitated.

1st round reverse transcription

Reverse transcription was performed with the following conditions: 0.5mM dNTPs, 1.5µM RT primer, 10mM DTT, 1× 1st strand buffer, 2U/µl RNase OUT, 5U/µl SSII and the suspended mRNA-peptide. The RNA and primer were heated together at 65°C for 5 min first, then cooled to room temperature and finally placed on ice. Then the rest of the components were added and the mixture was incubated at 42°C for 55 min.

1st round Ni-NTA

2mL of Ni-NTA agarose bead slurry were combined with the reverse transcription reaction along with 12mL of Ni-NTA binding buffer (100mM NaH2PO4, 6M Guan HCl, 0.2% triton X-100, 5mM β-mercaptoethanol, pH 8) and rocked at 4°C for 1 hour. The resin was washed with 3× 10mL wash buffer (100mM NaH2PO4, 0.2% triton X-100, 5mM β-mercaptoethanol, 300mM NaCl). Then 1mL aliquots of elution buffer (50mM NaH2PO4, 300mM NaCl, 5mM β-mercaptoethanol, 250mM imidazole) were added 8 times, each with a 5 min incubation. The elutions were combined, concentrated on a 10K filter and ethanol precipitated.

1st round selection

The pellet was diluted in the round one selection mixture (1mL total) containing: 1x selection buffer (25mM HEPES-KOH pH 7.5, 100mM NaCl, 5mM CaCl2, 5mM MgCl2, 0.01% triton X-100), and 50µM capture agent at ~80nM RNA-peptide – this was termed the non-reduced library. This reacted 15 hours at room temperature during which time a sample for qPCR was removed (for round 1 input cDNA). The reaction was washed on a 10K filter until the concentration of free capture agent was ~0.12µM in 550µl. The concentrated selection mixture was added to ~1mg of pre-blocked (1x selection buffer and 2mg/mL yeast tRNA (Roche, Switzerland)) Pierce streptavidin magnetic beads and rotated at room temperature for 1 hour. The supernatant was removed and the beads were washed twice with 200µl, 1x selection buffer, these two washes were then combined with the first supernatant (giving a total volume of ~950µl) to which 50µl of 1mM capture agent and 2µl of 1M DTT (giving ~2mM) was added. This new, reduced reaction was left at room temperature for 18 hours – likewise an ‘input’ cDNA sample was removed for qPCR. Following concentration and capture agent dilution, the reduced library was similarly pulled down with ~0.8mg streptavidin beads. To elute both the reduced and non-reduced libraries from the beads following the initial pulldown, 1mL washes of 1x selection buffer were performed ten times, then the cDNA was eluted 4x with 100µl of 10mM tris pH 7.45 at 95°C for 3 min each and combined. This cDNA was used for the ‘output’ of round one.

1st round PCR

Standard PCR conditions were used (30s at 95°C, 30s at 58°C, and 35s at 72°C) and both libraries were amplified for 16 rounds using the F1 and R1 primers. The mixture was then phenol and chloroform extracted, 1-butanol concentrated and ethanol precipitated. The pellet was then diluted in 10mM tris, 50mM NaCl and quantified by native PAGE densitometry.

Round 2

Selection steps through Ni-NTA purification were performed in a similar manner to the first round for both the reduced and non-reduced selections. However, for all steps, scaled down reaction sizes were used as it was no longer necessary to carry the entire volume of each step through. This round diverged from the previous one in the selection step. The precipitated libraries were suspended in 1x selection buffer with or without 2mM DTT and a sample of round 2 ‘input’ cDNA was removed. These mixtures were then added to 0.15mg blocked streptavidin magnetic beads and incubated for 1 hour. The supernatant was removed and combined with the supernatants of four washes of the beads (all using 1x selection buffer), to this, capture agent was added, giving a 50µM final concentration and ~2.8µM RNA-peptide. The resulting mixture was left for 1 hour at room temperature. The capture agent concentration was then reduced using a 10K filter. For the pull down, 1mg streptavidin magnetics beads were washed, blocked and finally the selection mixture was added and incubated at RT for 1 hour. The beads were then washed 6x 1mL of 10mM tris at RT, then 4x 1mL 10mM tris with a 1 min incubation at 40°C. cDNA was eluted 4x, 50µl 10mM tris at 95°C and then PCR amplified.

Round 3

Round three proceeded similarly to the previous rounds through the oligo dT purification step. Subsequently, excess salts from the dT purification were removed on a 10K concentrator, and the entire mixture for each selection was spun to 40µl. To this was added 20µl 5x selection buffer, 35µl water and 5µl 1mM capture agent, this mixture sat 30 min at room temperature (the ‘reduced’ library selection step did not have any DTT). This mixture was washed four times with water on a 10K filter; however, before the third spin, the entire solution (plus water wash) was removed from the filter and heated to 65°C for 2 min (to help remove excess capture agent) and then spun.

After the selection step a standard reverse transcription and Ni-NTA purification (both scaled appropriately) were performed and the final solution was then filtered until the imidazole was ~0.131µM. Here an ‘input’ cDNA sample was removed from the non-reduced library. The reduced library was reselected in the same final volume, concentration and time as before but with 2mM DTT, an ‘input’ cDNA sample was also removed. This ‘redo’ selection step was spun on a 10K concentrator to remove excess capture agent. Then both libraries were added to 0.2mg of prewashed and blocked streptavidin magnetic beads and incubated for 1 hour. These beads were then washed 5× 1mL at RT, 5× 1mL with 1 min at 40°C then eluted 4×, 50µl 3 min elutions at 95°C. The two libraries were then PCR amplified.

Round 4

The round four transcription was performed with 70nM template for 5 hours. This was then gel purified, crosslinked and translated. A standard oligo dT purification was done and then the samples were concentrated. Next a solution of 1x selection buffer, 100µM capture agent and ±3mM DTT (depending on the library) was created. After 30 minutes, excess capture agent was removed and a standard reverse transcription and Ni-NTA purified were performed. Following this, the solutions were spin filtered until there was only ~0.12µM imidazole. ‘Input’ samples for qPCR were then taken. Next 0.25mg streptavidin magnetic beads were added and incubated for 1 hour. The beads were then washed 3x 1mL of 10mM tris and 50mM NaCl at room temperature, 7× 1mL with a 1 min 42°C incubation. Finally four elutions of 50µl at 95°C were performed, and the cDNA was PCR amplified.

Round 5

Round five followed a scaled down version of round one until the selection step. The libraries were suspended in 1x selection buffer, and the reduced selection received 2mM TCEP (instead of DTT), these mixtures were then added to prewashed beads for a negative selection and left for 15 min at room temperature. The supernatant was again incubated with a fresh batch of blocked beads for 15 minutes. The supernatant was removed, and both resins were washed with 1x buffer which was then combined with the original supernatant to which was added capture agent to a final concentration of 50µM; this reacted for 30 minutes. Following excess capture agent removal on a spin filter, round ‘input’ qPCR samples were removed. The remaining solution was added to 1mg of washed and blocked streptavidin beads for 1 hour. The beads were then washed 4x 1mL at room temperature, 6x 1mL at 42°C with 1 min incubation, the cDNA was eluted and PCR amplified like normal.

2.5. qPCR analysis

Quantitative PCR was performed at MIT’s BioMicroCenter on a Light Cycler 480 II Real-Time PCR machine. To create a standard curve for each round, a sample of known concentration, reverse transcribed RNA was diluted to give a range of DNA concentrations (~4 orders of magnitude). PCR mixes were composed of 1µM primers, 50% (2x) SYBR Green PCR Master Mix (Applied Biosystems, Foster City, CA), DNA template and water. Each reaction was split into three wells for triplicate measurements of Cp values, which were then averaged. Selection round yields were determined using the Cp values from samples of the selection step input and cDNA elution. A yield for each round was determined based off the known volumes of each step and the standard curve correlating Cp and standard DNA concentration, these data were then plotted per round (Figure S7).

2.6. HTS analysis

MiSeq (2x150bp) Illumina sequencing was performed. The FASTQ data was analyzed via custom python scripts that first combined pairs and filtered out DNA sequences possessing less than 85% Q30 Phred scores with ambiguous bases being determined by the higher Q-score base of the pair. Sequences were then translated into amino acid sequences, filtered again based off length and the presence of part of the C terminal fixed region. Sequences were then analyzed according to frequency and size of Levenshtein families with an edit distance less than five.

2.7. Kinetics and LCMS analysis

5µl kinetics time point samples were made to capture the initial reaction rate and quenched with the addition of 95µl of 49.75% H20, 49.75% Acetonitrile and 0.5% TFA. Time points within the linear range of the instrument were used for kinetics analysis. A second order kinetics rate constant (k2) was extracted by fitting the data to the following equation:

k2(A0B0)t=ln(B0AtA0Bt)

For this, A0 refers to the initial capture agent concentration, At is its concentration at the given time point (for kinetics data, this was obtained from the values of Bt as the TIC values of At were outside the linear range of the LCMS, B0 is the initial peptide concentration and Bt signifies the peptide concentration when the sample was taken. The rate constant is an average of three measurements at different concentration (here error is estimated from the deviation in the three point estimates) while for the rest of the sequences it is estimated from a single MP concentration (for these samples, error bars represent the error determined from a linear regression fit).

2.8. Protein Expression and purification

Full length MP01 was appended to the sequence of Sortase A (with a TEV cleavage site between the two) and then placed into a pET-SUMO vector (Thermo Fisher) following factory protocols. Expression was performed in 1L cultures (30µg/mL kanamycin) of appropriately transformed E. Coli BL21(DE3), after an O.D. value of 0.5 was obtained at 37°C, the cultures were cooled to 16°C and induced with the addition of 0.2mM IPTG. Expression was conducted for 5.5 hours at 16°C followed by cell pelleting with 10 minute centrifugation at 7,000 RPM. The cell pellet was suspended in 25mL of Ni-NTA binding buffer (50mM Tris pH 8.1 150mM NaCl) with one protease inhibitor cocktail tablet (Roche Diagnostics, Switzerland), 20mg lysozyme (Calbiochem) and ~2mg DNase I (Sigma-Aldrich). Following sonication the cellular debris was removed by centrifugation at 17,000 RPM for 30 minutes. The supernatant was loaded directly onto a 5mL HisTrap FF crude Ni-NTA column (GE Healthcare, UK), following binding the column was washed with 25mL Ni-NTA binding buffer, 25mL Ni-NTA washing buffer (50mM Tris pH 8.1, 150mM NaCl, 500mM imidazole) and eluted with 10mL of Ni-NTA elution buffer (50mM Tris pH 8.1, 150mM NaCl, 500mM imidazole). The protein was then desalted on a HiPrep 26/10 Desalting Column (GE Healthcare, UK). Following concentrating, the SUMO group was removed by addition of 30µg of SUMO protease per mg of protein at 4°C overnight.

MP01-SrtA was further purified by anion exchange chromatography after being exchanged into buffer A (20mM HEPES pH 8.5, 1mM DTT). MP01-SrtA was loaded onto a HiTrap Q HP column (GE Healthcare, UK) in buffer A and eluted with buffer B (buffer A + 2M NaCl) during a 400mL linear gradient from 0–30% B. Fractions for the protein were combined and concentrated on a 10K spin filter and buffer exchanged into 0.5x selection buffer. The concentration were determined spectrophotometrically using 280nm light and extinction coefficients obtained from the ProtParam tool on web.expasy.org and a sample was taken for LCMS analysis (Figures S4 and S5). The protein was aliquoted and flash frozen in liquid nitrogen.

graphic file with name nihms934611u1.jpg

2.9. Labeling site determination

To determine the location of labeling for MP01 a reaction of 0.3mM MP, 0.3mM CA, 1x selection buffer, 5mM TCEP pH 7.4 and reacted 24 hours. Next the peptide was digested with 0.2 mg/mL trypsin and chymotrypsin for 24 hours at 37°C. Fragments were then analyzed by LCMS/MS on an Agilent 6550 iFunnel Q-TOF mass spectrometer. Similarly, the CA by itself was analyzed with the same LCMS/MS protocol.

4. Experimental data, tables and figures

4.1. Sequence and mass of peptides used

All peptides were chemically synthesized as C-terminal amides using a rink linker as described.

4.2. HTS Levenshtein clusters, library heat maps, FCPF analysis and statistics

Levenshtein clusters (edit distance = 5), showing sequence, red point mutants relative to the cluster parent sequence and number of times each peptide appeared in the sequencing.

graphic file with name nihms934611u2a.jpg

graphic file with name nihms934611u2b.jpg

4.3. Kinetics chromatograms and plots

Aliquots were quenched at select time points with 19x (by volume) of a mixture with 49.75% water, 49.75% acetonitrile, 0.5% trifluoroacetic acid and analyzed by LCMS. For all except the GCPG control, the LC method was method 1.

Note:

For the following chromatograms in this section, the CA elutes at ~10m (on LC method 1) in all and is not labeled after the first two kinetics TIC traces. Likewise, the +16 Da impurity from the CA elutes immediately before it and is likewise not labeled for the majority of the traces.

4.4. Determination of labeling location

4.5. Analysis of truncated MP01

4.6. Analysis of MP01 with urea

4.7. MP-SrtA labeling, TEV cleavage and SrtA reactions

Supplementary Material

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

RESOURCES