Abstract
DNA is organized into nucleosomes, composed of 147 base pairs of DNA wrapped around an octamer of histone proteins including H2A, H2B, H3, and H4. Histones are critical regulators of many nuclear processes, including transcription, DNA damage repair, and higher order chromatin structure. Much of their function is mediated through extensive and dynamic post-translational modification (PTM) by nuclear enzymes. Histone PTMs are thought to form a code, where combinations of PTMs are responsible for specific biological functions. Here, we present protocols to identify and quantify histone PTMs using nano-flow liquid chromatography (nLC) coupled to mass spectrometry (MS). We first describe how to purify histones and prepare them for MS. We then describe three MS platforms for histone PTM analysis, including bottom up, middle down, and top down approaches, and explain the relative benefits and pitfalls of each approach. We also include tips to increase the throughput of large experiments.
Keywords: Histone, post-translational modifications, epigenetics, mass spectrometry, bottom up, middle down, top down
1. Introduction
DNA must be highly organized and tightly regulated within the nucleus to maintain proper gene expression. The cell accomplishes this task by organizing DNA into a protein-DNA complex called chromatin. Within chromatin, DNA is contained in nucleosomes, which are composed of 147 base pairs of DNA wrapped around an octamer of histone proteins with two copies of each core histone-H2A, H2B, H3, and H4 (Luger, Mäder, Richmond, Sargent, & Richmond, 1997). Linker histone H1 can bind the free DNA that exists between nucleosomes. Due to their intimate association with DNA, histones are major regulators of chromatin structure and function. Histone proteins are extensively and dynamically post-translationally modified on specific residues by a myriad of enzymes in the nucleus. These post-translational modifications (PTMs) mediate histone function by directly altering the chemistry of the surrounding chromatin or through the action of other proteins that can bind these modifications. A growing body of research supports the hypothesis that PTMs form a “histone code” and can act in tandem to illicit a specific biological response (Jenuwein & Allis, 2001). Histone PTM profiles are critical to maintain nuclear stability, and aberrant regulation of histone PTMs is implicated in many diseases including cancer. As such, the ability to identify and quantify histone PTMs in biological systems is vital for understanding nuclear processes and how disease states may arise.
Histone PTM analysis had traditionally been accomplished using antibody-based approaches such as western blots, chromatin immunoprecipitation (ChIP), and deep sequencing. These methods have been instrumental in elucidating the roles of many histone PTMs but suffer from several critical drawbacks (Britton, Gonzales-Cope, Zee, & Garcia, 2011). For example, many antibodies are not entirely site-specific and can cross-react with similar modifications on different residues. A similar issue is epitope occlusion, where a PTM on a nearby residue can block interaction with an antibody. Perhaps the biggest drawback is that these methods require previous knowledge of the modification and are therefore unable to identify novel PTMs (Rothbart et al., 2015).
Mass spectrometry (MS) is an unbiased and quantitative method to comprehensively analyze histone PTM profiles. One of the greatest advantages of MS is that it can identify novel PTMs and can also measure the co-occurrence of PTMs on a single peptide. As such, MS has emerged as a critical tool for characterization of histone modifications. There are three major MS approaches, namely bottom-up, middle-down, and top-down MS, each of which is useful for specific applications (Figure 1). Bottom-up MS involves digestion of a protein sample into small peptides (5–15 amino acid residues) followed by online separation by reversed phase chromatography coupled to tandem MS. This method is very robust and sensitive. One major drawback, however, is that the co-occurrence between PTMs located on different peptides cannot be measured. Top-down MS, on the other hand, is performed by directly analyzing intact proteins and, as such, preserves complete connectivity between PTMs. However, this method is much less sensitive than bottom-up MS and thus has much larger sample requirement. Furthermore, the data analysis is much more challenging due to the large complexity of the tandem mass spectra, which sometimes results in the impossibility of discriminating proteoforms when co-fragmented. Using chip-based infusion, rather than injection with a syringe, can reduce the sample requirement for top-down experiments that do not include online chromatographic separation. Middle-down MS offers a compromise between these two methods and is performed by digesting proteins into large peptides (about 30–60 amino acid residues). Analyzing large peptides allows for connectivity of many PTM locations, but offers better sensitivity and simpler data analysis than the top-down approach.
Bottom-up MS is the most commonly used MS approach for histone PTM analysis as it is technically more facile than the other approaches and does not require specialized equipment such as a 2D HPLC or chip-based electrospray ionization. Furthermore, small peptides generated in bottom-up MS are fragmented by collision induced dissociation (CID) or high-energy collision dissociation (HCD), which is available in most commercial instruments. Large peptides or intact histone proteins, however, result in high charge state analytes when electrospray ionized and therefore do not fragment well with CID or HCD. Electron transfer dissociation (ETD) fragmentation is highly efficient for highly charged peptides or proteins and it is thus the fragmentation technique of choice for middle-down and top-down approaches.
In this chapter, we present the protocol to isolate histone proteins from cells and prepare the protein for MS analysis. We also outline how to perform bottom-up, middle-down and top-down MS to identify and characterize histone PTMs. Tips to increase the throughput of experiments are also included.
2. Histone Extraction from Cells
Histones are among the most basic proteins in the cell, and as such, can be easily purified using an acid extraction. Here, we describe how to first isolate nuclei from cells and then perform an acid extraction to obtain purified histones. The whole protocol takes one day.
2.1 Materials and Buffer Recipes
0.25% Trypsin
Phosphate Buffered Saline (PBS): 137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4•2H2O, 2 mM KH2PO4
Nuclei Isolation Buffer (NIB): 15 mM Tris-HCl (pH 7.5), 60 mM KCl, 15 mM NaCl, 5 mM MgCl2, 1mM CaCl2, 250 mM sucrose.
1M DTT
200 mM AEBSF
2.5 μM microcystin
5 M sodium butyrate
2.2 Cell Harvest
2.2.1 Cell Harvest-Tissue Samples
Rinse tissue sample with cold PBS (4°C). This can be stored at −80°C after flash freezing or used immediately.
Use a razor blade to cut the tissue into small pieces, as small as possible.
Record the approximate volume of the tissue sample.
2.2.2 Cell Harvest-Cell Cultures
For adherent cells, detach the cells by covering the plate with a thin layer of trypsin 0.25% Trypsin for 5 minutes at room temperature. Move cells to a centrifuge tube and spin at 300 rcf for 5 minutes. For cells grown in suspension, centrifuge cells at 300 rcf for 5 minutes. Aspirate the trypsin or pour it off into bleach (10% final concentration).
Wash with PBS (10X volume). Centrifuge cells at 600 rcf for 5 minutes and remove PBS.
Repeat step 2. Pellets can be stored at −80°C or used immediately.
2.3 Nuclei Isolation
If cell pellets were frozen, thaw them on ice.
Estimate the amount of NIB buffer that will be needed, which is approximately 50X the total volume of cell pellets. Chill the buffer on ice.
Add inhibitors to the NIB buffer to the final concentrations of 1 mM DTT, 500 μM AEBSF, 5 nM microcystin, and 10 mM sodium butyrate. For 50mL buffer, add 50 μL 1 M DTT, 125 μL of 200mM AEBSF, 100 μL of 2.5 μM microcystin, and 100 μL of 5 M sodium butyrate. Inhibitors will degrade over time, so prepare NIB buffer + inhibitors freshly for each experiment and store inhibitor stocks at −20°C.
Move 25% of the NIB buffer + inhibitors to a new beaker and add 10% NP-40 alternative to a final concentration of 0.3%. For 50 mL, add 1.5 mL of 10% NP-40 alternative.
Resuspend cell pellets in 10X volume of NIB with NP-40 and homogenize by gentle pipetting (cultured cells) or douncing (tissue samples).
Incubate on ice for 5 minutes to lyse outer cell membranes. Nuclei isolation efficiency can be approximated using Trypan Blue staining.
Centrifuge at 4°C for 5 minutes at 1000 rcf. The supernatant contains the cytoplasm, and can be reserved if desired. Otherwise, discard the supernatant. The pellet contains nuclei and should be smaller than the original cell pellet.
Wash the pellet by resuspending it in NIB buffer + inhibitors without NP-40 alternative (10X volume of cell pellet).
Centrifuge cells at 4°C at 1000 rcf for 5 minutes.
Repeat steps 8 and 9 until no NP-40 remains. Usually, two washes are sufficient. NP-40 forms bubbles when resuspending, and so lack of bubbles indicates successful removal of NP-40.
Nuclei can be stored in NIB + inhibitors + 5% glycerol at −80°C after freezing in liquid nitrogen or used immediately for acid extraction.
2.4 Acid Extraction
Gently vortex the nuclei pellet and slowly add 0.4 N H2SO4 (5X volume of pellet, i.e. add 5 mL H2SO4 to a 1 mL pellet).
Incubate at 4°C with intermittent mixing or on a rotator for one hour up to overnight. We recommend incubation of 2 hours for pellets larger than 500 μL or 4 hours for pellets smaller than 500 μL. Longer incubation can result in extraction of other basic proteins besides histones.
Centrifuge extracts at 3400 rcf for 5 minutes. The supernatant contains the histone proteins and the pellet contains other proteins.
Transfer the supernatant to a new tube.
Repeat steps 3 and 4 to remove any traces of the pellet.
Gently add 100% TCA to the supernatant to a final concentration of 20% (i.e. add 1/4 volume of the supernatant).
Incubate on ice for at least 1 hour without agitation. For most samples, 1 hour is sufficient. We recommend overnight incubation for small samples (<50 μL pellet).
Centrifuge at 3400 rcf for 5 minutes. The histones will form a film around the bottom of the tube. Other proteins and non-protein material will form a white pellet at the bottom of the tube, which cannot be solubilized.
Carefully aspirate the supernatant, avoiding the protein film on the side of the tube.
Wash the protein with acetone + 0.1% HCl by pipetting gently down the side of the tube. Use a glass pipettor as acetone will dissolve plastic pipette tips.
Centrifuge at 3400 rcf for 5 minutes. Aspirate acetone.
Repeat steps 10 and 11 using 100% acetone two times.
Allow remaining acetone to evaporate by leaving the tubes open for 30 minutes up to overnight.
Resuspend the histone film in ddH2O. The volume will depend on the size of the tube and pellet, but generally 100 μL is sufficient for pellets in a 1.5 mL microcentrifuge tube.
Centrifuge at 3400 rcf for 2 minutes.
Move supernatant to new tube, being careful to avoid the pellet, which can be discarded.
Measure the protein concentration using a Bradford assay or another method. Histones can be stored in ddH2O at −80°C. If samples are dilute, concentrate them in a vacuum centrifuge.
If doing bottom-up MS, continue with section 3. If doing middle-down or top-down MS, continue with section 4 then 5 or 6, respectively.
3. Bottom-Up Mass Spectrometry
Bottom-up MS is the most commonly used MS platform for proteomics. In bottom-up MS, proteins are digested into small peptides (5–15 amino acid residues) with trypsin, which are then separated with online reversed-phase high-performance liquid chromatography (RP-HPLC) and analyzed via tandem MS (MS/MS). Histones are among the most basic proteins in the cell and contain a large number of lysine and arginine residues. Therefore, digestion with trypsin results in peptides that are too small to be retained by RP chromatography. To overcome this issue, histone proteins can be chemically derivatized on the ξ-amino groups of unmodified or mono-methylated lysine residues (Garcia et al., 2007). This derivatization prevents trypsin cleavage after lysine residues, allowing the enzyme to cleave only after arginine residues thus generating longer peptides. We recommend using propionic anhydride as the derivatization reagent due to its high efficiency (Sidoli, Yuan, et al., 2015). After digestion with trypsin, a second round of derivatization is performed to modify the amino groups of the newly generated N-termini. This increases the hydrophobicity of the peptides and allows for better interaction with RP columns. Propionylation and trypsinization can be performed either in microcentrifuge tubes or in 96-well plates to reduce sample preparation time if a large number of samples are being processed.
3.1 Materials and Buffer Recipes
100 mM ammonium bicarbonate, pH 8.5
Ammonium Hydroxide (NH4OH)
Propionic anhydride
Isopropanol
pH paper
Trypsin
Glacial acetic acid
Vacuum centrifuge
C18 disc (3M Empore)
Methanol (MeOH)
Wash buffer: 0.1% acetic acid in water
Elution buffer: 75% acetonitrile, 5% acetic acid, 20% water
Buffer A: 0.1% formic acid in water (all MS grade solvents)
Buffer B: 0.1% formic acid in 75% acetonitrile/25% water (all MS grade solvents)
3.2 Derivatization and Digestion
Dry samples down in a vacuum centrifuge and resuspend in 20 μL ammonium bicarbonate, pH 8.5. Ensure that the pH of the samples is between 7 and 9 using pH paper. If they are too basic, add some glacial acetic acid. If they are too acidic, add some powdered ammonium bicarbonate. If using the plate format, transfer samples to a 96 well plate.
Prepare the propionylation reagent by combining propionic anhydride and isopropanol in a 1:3 ratio. The propionic anhydride in the reagent will begin to dissociate into propionic acid so it is important to use it immediately after preparation. If processing samples on a plate, make new reagent for each plate. If processing samples in microcentrifuge tubes, make new reagent every three to five samples.
Add 10 μL of the propionylation reagent to the sample and vortex briefly. The pH will drop to 4–6 due to the propionic acid.
Immediately add 3–7 μL NH4OH to bring the pH up to ~8, checking with pH paper. Do not allow the sample to go above pH 10 as propionylation of other residues such as serine can occur at high pH.
Incubate the samples at 37°C for 15 minutes.
Dry down samples to less than 5 μL in a vacuum centrifuge.
Repeat steps 1–6 one more time.
Resuspend samples in 50 to 100 μL of ammonium bicarbonate.
Add trypsin in a 1:20 protease:histone ratio and incubate at 37°C for at least 6 hours up to overnight.
Quench trypsin by freezing at −80°C or adding glacial acetic acid to lower pH to ~4.
Dry samples to 20 μL or less. If dried to less than 20 μL, add more ammonium bicarbonate to a final volume of 20 μL.
Repeat steps 2–6 twice.
3.3 Desalting
Desalting is performed on home-made C18 columns called stage-tips.
Cut off the last centimeter of a P1000 tip to make the opening bigger. Use this pipet tip to punch out a piece of C18 material from the disc. If desalting more than 25 μg of protein, use 2 discs. Ensure that there is no space between the discs.
Use a piece of fused silica capillary (or any other long, thin item) to firmly push the C18 material out of the P1000 pipet tip into a P200 pipet tip. Ensure that the C18 disc is firmly positioned in the tip, but do not push hard enough to puncture the material.
Place the column in a microcentrifuge adaptor inside a 1.5 mL or 2 mL microcentrifuge tube.
Activate the resin by adding 50 μL methanol to the column. Push the methanol through using compressed air or by spinning in a centrifuge at 500g for 30 to 60 seconds.
Repeat step 4 once. After this point, keep the disc wet at all times.
Equilibrate the column by adding 200 μL washing buffer to the column, and push the solution through as described in step 5. Note that the collection tube may need to be emptied.
Repeat step 6 once.
Add wash buffer to samples to a final volume of approximately 200 μL. The pH should be below 4.0 (check with pH paper). If needed, adjust the pH using glacial acetic acid.
Add the sample to the stage-tip and centrifuge for 2–5 minutes at 200g until the sample passes through the column.
Wash the column by adding 50 μL wash buffer to the stage-tip and centrifuge at 500g for 30 to 60 seconds until the buffer passes through.
Repeat step 10 once.
Remove the collection tube and replace with a clean 1.5 mL microcentrifuge tube.
Add 75 μL elution buffer to the column and centrifuge at 200 g for 2–5 minutes.
Repeat step 13 once.
Dry down sample in a vacuum centrifuge to less than 5 μL. Desalted samples can be stored at −80°C.
Resuspend samples to approximately 1 μg/μL in buffer A for MS analysis.
3.4 Online RP-HPLC and MS Acquisition
We present protocols to perform bottom-up MS using nano-electrospray ionization (nano-ESI) on a hybrid LTQ-Orbitrap (i.e. Thermo Orbitrap Elite).
MS data can be acquired in two different modes: data dependent acquisition (DDA) or data independent acquisition (DIA). DDA is the traditional method used in histone PTM analysis, where a high-resolution full MS scan is acquired followed by low-resolution MS/MS acquisition of the topN (e.g. 10) most abundant peaks from the full MS scan. Peptide abundance is quantified by integrating the area of the extracted ion chromatogram at the MS1 level. One issue with DDA acquisition is that it cannot accurately quantify co-eluting isobaric peptides (different peptides that have the same mass) because they cannot be discriminated at the MS1 level. To overcome this limitation, co-eluting isobaric peptides must be targeted across their elution profile and quantified based on the fragment ion intensities.
DIA acquisition is performed by acquiring a high-resolution full MS scan followed by a series of high-resolution MS/MS scans covering a larger m/z window (i.e. 50) that step across the desired m/z range (Gillet et al., 2012). All ions present in the m/z window get fragmented and detected together, and so the size of the window and complexity of the sample will dictate the quality of the MS/MS spectra and consequently identification. DIA methodology has been optimized for histone PTM analysis also performing low resolution MS/MS scans, and 50 m/z windows were found to be a good balance for allowing sufficient identification and cycle speed (Sidoli, Simithy, Karch, Kulej, & Garcia, 2015). Peptide identifications should be obtained before the DIA experiment using a spectral library generated with DDA data. In other words, DIA is not the recommended acquisition method to identify unknown peptides. On the other hand, one major benefit of DIA is that MS/MS spectra are obtained for all peptides across the entire elution peak. Thus, DIA provides higher confidence in determining the correct chromatographic peak for a given peptide, as it allows for extracted ion chromatography for both precursor and fragment ions, which will appear as co-eluting peak profiles. The abundance of the peptide is still quantified by calculating the area under the curve for the parent ion, as the precursor ion signal is always more intense than any fragment mass ion. Moreover, DIA eliminates the need for targeting of co-eluting isobaric species, as all analytes are already fragmented at each MS scan cycle, and allows for future data mining. DIA methods have been gaining popularity due to these advantages.
Fit the HPLC with a C18 column (75 μM inner diameter, 10–20 cm in length), either purchased commercially or packed in-house.
Load 1–2 μg histone peptides onto the column with an autosampler.
Program the HPLC gradient: 0 to 30% buffer B in 30 min, 30 to 100% buffer B in 5 minutes, 100% buffer B for 8 minutes. For HPLC systems that do not automatically equilibrate the column before sample loading, include equilibration steps in your gradient: 100 to 0% buffer B in 1 minute and 0% buffer B for 10 minutes. Set the flow rate to 250–300 nL/min.
-
Program a method to acquire and record MS data.
-
DDA: Composed of three segments. All full MS scans are obtained in the Orbitrap, and all MS/MS scans are obtained in the ion trap.
Segment 1: 14 minutes. One full MS scan followed by CID MS/MS of the top seven most abundant ions (in DDA mode).
Segment 2: 27 minutes. One full MS scan followed by CID fragmentation of isobaric species [for e.g. human and mouse H3 9–17aa 1 acetyl (528.296 m/z), H3 18–26aa 1 acetyl (570.841 m/z), H4 4–17 1 acetyl (768.947 m/z), H4 4–17 2 acetyls (761.939 m/z), H4 4–17aa 3 acetyls (754.930 m/z)] and DDA of the top 5 most abundant ions.
Segment 3: 19 minutes. One full MS scan followed by DDA of the top ten most abundant ions.
NOTE: The time and length of each segment will vary depending on the exact elution times of the targeted species, which can vary between columns. For first time users, run one sample to determine the elution time for the targeted species and modify the method, if needed, to allow for accurate targeting of the desired species.
-
DIA: Set up method according to Table 1.
Note: The full MS1 scan is performed twice within the same duty cycle to allow for a more resolved definition of the precursor peak profile. This is not necessary for MS2 ions, as these are commonly not used for quantification, but only for increasing the confidence of the selected chromatographic peak and, in specific cases, to discriminate isobaric forms. The differences in MS2 scan range, in particular for the low mass range, are due to the intrinsic limitations of the ion trap, which cannot hold for scanning fragment ions smaller than ~1/3 of the isolated precursor mass.
-
Table 1.
Scan | Scan | Detector | Scan Range | MS2 scan range |
---|---|---|---|---|
1 | MS1 | Orbitrap | 300–1100 | N/A |
2 | MS2 | Ion Trap | 300–350 | 120–1500 |
3 | MS2 | Ion Trap | 350–400 | 120–1500 |
4 | MS2 | Ion Trap | 400–450 | 120–1500 |
5 | MS2 | Ion Trap | 450–500 | 130–1500 |
6 | MS2 | Ion Trap | 500–550 | 140–1500 |
7 | MS2 | Ion Trap | 550–600 | 155–1500 |
8 | MS2 | Ion Trap | 600–650 | 170–1500 |
9 | MS2 | Ion Trap | 650–700 | 185–1500 |
10 | MS1 | Orbitrap | 300–1100 | N/A |
11 | MS2 | Ion Trap | 700–750 | 195–1500 |
12 | MS2 | Ion Trap | 750–800 | 210–1500 |
13 | MS2 | Ion Trap | 800–850 | 225–1500 |
14 | MS2 | Ion Trap | 850–900 | 240–1500 |
15 | MS2 | Ion Trap | 900–950 | 250–1500 |
16 | MS2 | Ion Trap | 950–1000 | 265–1500 |
17 | MS2 | Ion Trap | 1000–1050 | 280–1500 |
18 | MS2 | Ion Trap | 1050–1100 | 295–1500 |
3.5 Data Analysis
Histone PTM analysis is a computationally challenging process. Given that histones are highly modified proteins, each peptide can contain several PTM acceptor sites, resulting in a large number of possible modified forms for a given peptide. For example, the H3 9–17 peptide can contain mono-, di-, or tri-methylation (me1/me2/me3) on K9 and K14 as well as acetylation (ac) on K9, for a total of 10 possible forms of the peptide. Some isobaric peptides are also generated by the derivatization process, further complicating analysis. For example, the H3 9–17 peptide has two sets of isobaric peptides: unmodified and K9me1K14ac ([M + 2H]2+ = 535.304 m/z) and K9me3K14ac and K9me2 ([M + 2H]2+ = 521.306 m/z). Many of these isobaric species arise because the mass of a propionyl group is the same as the mass of an acetyl group and mono-methyl group (56.026 Da). Fortunately, many of these peptides elute at different times and therefore do not require MS/MS targeting across their elution profiles in DDA experiments (i.e. K9me3K14ac and K9acK14ac). This is because the modifications impart different hydrophobicities, causing the peptides to elute at different times (Fig. 2).
Some isobaric species, however, co-elute and cannot be discriminated based on retention time. A prime example is the H4 4–17 peptide that contains four lysine residues that can be acetylated, resulting in many isobaric species. The di-acetylated peptide is the most complicated example, as there are six isobaric forms that co-elute. These peptides can be differentiated by collecting MS/MS spectra across their elution and using the elution profiles of unique fragment ions to define the ion chromatogram (Fig. 3A).
In both DDA and DIA experiments, the relative abundances of histone peptides must be calculated by extracting the raw peak area of all modified and unmodified forms for a given peptide sequence. The relative abundance is obtained by dividing the total area of a single peptide isoform (including all charge states) by the total area of that peptide sequence in all of its modified forms. We developed a software tool to automate this type of data analysis for both DDA and DIA experiments (Yuan et al., 2015).
3.5.1. Software based peak area extraction and abundance calculation
EpiProfile is a freely-available Matlab-based automated tool to quantify histone PTM profiles in DDA experiments (Yuan et al., 2015). The software reads raw data and provides a table of quantified histone peptides, layouts (MS1 elution profiles), and annotated MS/MS spectra used for identification. EpiProfile can quantify co-eluting isobaric peptides (if they were targeted in the MS method) as well as isotopically labeled peptides (i.e. labeled by SILAC).
EpiProfile uses previous knowledge about relative elution times to aid in quantification and identification of histone peptides. It uses this information, as well as mass information, to perform extracted ion chromatography (XIC) of each unique peptide. EpiProfile then calculates the area under the curve of the XIC, which is then used to calculate the relative abundance of each peptide.
Open the “params.txt” file in EpiProfile and specify the file path containing your raw files. Other options can be specified as well, such as isotopic labeling and mass tolerance.
Open Matlab and specify the file path containing EpiProfile.
Enter “EpiProfile” in the command window to start the program.
-
The program will run and the results will be stored in the same file path that was specified in the “params.txt” file. Several result files and folders will be provided:
An excel file called “histone_ratios.xls.” This file contains the retention time (RT), area, and relative abundance of each peptide for all raw files (Fig. 3B).
A folder called “histone_layouts.” This contains all of the XICs used for area calculations, including all of the XICs for fragment ions (i.e. Fig. 2). The “details” folder in this directory contains XICs for co-eluting isobaric species (i.e. Fig. 3A).
-
The layout files can be used to manually validate that the program chose the correct peak to quantify.
Note: EpiProfile, although it is the software we recommend for histone analysis, is not the only one available. Manual quantification can be performed using the Xcalibur QualBrowser (for Thermo instruments), which is mostly used to visualize raw files. The mass of the monoisotopic peak can be entered to obtain the XIC, and the area under the curve function automatically calculates the peak area. Moreover, the free software Skyline can be adopted for the purpose (MacLean et al., 2010). Skyline is optimized to extract precursor and fragment XIC upon a pre-compiled list of peptides, and thus it is definitely more automated than the manual quantification procedure. However, it does not include unique features of EpiProfile such as retention time based peak calling and automatic calculation of ratios for co-eluting isobaric peptides.
4. Offline fractionation of histone species
4.1 Materials and Buffer Recipes
Offline Buffer A: 5% Acetonitrile, 0.2% trifluoroacetic acid (TFA) in water (all HPLC-grade)
Offline Buffer B: 95% acetonitrile, 0.188% TFA in water (all HPLC-grade)
5 μm C18 column (size depending on application)
4.2 Histone variant purification
Attach a C18 column to the offline HPLC and set the flow rate according to the size of the column. Generally, a flow rate of 0.2 mL/min is used for a 2.1mm column, 0.8 mL/min for a 4.6mm column, and 2.5 mL/min for a 10mm column. Allow the column to equilibrate in buffer A for 10X column volumes.
Load the sample. Generally, 100 to 200 μg histone should be loaded for a 4.6mm column.
Run the following gradient: 30 to 60% buffer B over 100 minutes, 60 to 100% buffer B over 20 minutes, 100 to 30% buffer B over 10 minutes. The elution profile is shown in Fig 4.
Set up a fraction collector to collect samples in 1 minute intervals from 15 minutes to 80 minutes.
Transfer the desired fractions to centrifuge tubes and dry to completion in a vacuum centrifuge to remove organic solvent and TFA. Fractions corresponding to the same histone can be pooled before drying. Dried histones can be stored at room temperature. If reconstituted, store at −80°C.
5. Middle-Down Mass Spectrometry
Middle-down MS is a different proteomics strategy valuable for quantifying combinatorial histone PTMs. In middle-down MS, proteases that generate long peptides are employed to allow for greater connectivity between PTM sites than bottom-up MS. It is important to use a protease that does not cleave in the tail domain as this is where most PTMs are catalyzed. Endoproteinase GluC (in bicarbonate buffers) is recommended for histone H3, H4 and canonical H2A, as it generates N-terminal peptides of 50, 53 and 41 amino acid residues, respectively. Endoproteinase AspN is a valid alternative, as it cleaves all canonical histones providing middle-down sized N-terminal peptides; specifically, 71 amino acid residues for H2A, 76 for H3, 24 for H4 and 51 for H2B. Because the peptides are longer and trypsin is not being used, derivatization of the protein is unnecessary. The high charge states occupied by histone tail peptides are incompatible with CID fragmentation. ETD fragmentation, on the other hand, generally results in high coverage of the histone tail peptides.
Middle-down histone tail peptides do not separate well by RP chromatography. Weak cation exchange hydrophilic interaction liquid chromatography (WCX-HILIC) is currently the stationary phase of choice for this application, as it employs a hydrophilic stationary phase containing negatively charged residues (i.e. glutamic acid), which is the ideal binding pocket for basic hydrophilic polypeptides (Jung et al., 2013; Sidoli et al., 2014; Young et al., 2009). The HILIC separation occurs by decreasing the amount of organic buffer in the solvent so that more hydrophobic species elute first. The WCX separation is accomplished by incorporating a pH gradient into the mobile phase. As the pH decreases during the gradient, the resin becomes increasingly protonated, which removes the charge on the resin and abolishes the cation exchange interaction.
One drawback of middle-down MS compared to bottom-up MS is reduced sensitivity. Larger peptides are electrospray ionized in multiple charge states, which reduces the signal for any given charge state compared to smaller peptides. Similarly, larger peptides can have a larger number of modified forms, which dilutes the signal for any single form. This limitation can be partially alleviated by using larger amounts of starting material. Fractionation of histones before MS analysis is the most effective method to increase sensitivity because it reduces the complexity of the sample, and thus allows for loading more material of a single histone variant. Another caveat of middle-down MS is that the data analysis for middle-down experiments is more complicated and less automated compared to bottom-up MS, although few software tools have been developed for the purpose (DiMaggio, Young, Baliban, Garcia, & Floudas, 2009; Sidoli et al., 2014).
5.1 Materials and Buffer Recipes
100 mM ammonium acetate, pH 4.0
10 mM Tris-HCl, pH 7.5
GluC
AspN
Offline Buffer A: 5% acetonitrile, 0.2% trifluoroacetic acid (TFA) in water (all HPLC-grade)
Offline Buffer B: 95% acetonitrile, 0.188% TFA in water (all HPLC-grade)
Online Buffer A: 75% acetonitrile, 20 mM propionic acid, pH: 6.0 (generated using ammonium hydroxide)
Online Buffer B: 15% acetonitrile, 0.2% formic acid
WCX-HILIC column: 75 μm ID, 15 cm length, 1.7 μm (diameter) 1000 Å (porosity) particle size is recommended. The 3 μm (diameter) 1500 Å (porosity) particle size can also be used
5.2 Digestion
5.2.1. GluC
Resuspend sample in 100 mM ammionium acetate to a final concentration of 0.5 μg/μL.
Dilute the GluC enzyme to 0.2 μg/μL in the same buffer.
Add GluC to a final concentration of 1:10 GluC:histone.
Incubate for 6 hours at room temperature.
Block digestion by adding 1% formic acid.
5.2.2. AspN (alternative to GluC)
Resuspend enzyme to 40 ng/μL in 10 mM Tris-HCl, pH 7.5.
Resuspend sample in the same buffer and add enzyme at 1:100 enzyme:histone by weight.
Digest at 37°C for 6 hours.
Block digestion by adding 1% formic acid.
5.3 WCX-HILIC and MS
Middle-down has been optimized by different research laboratories (Jung et al., 2013; Sidoli et al., 2014; Young et al., 2009). Currently, the most automated platform includes an RP trap column for sample loading and a WXC-HILIC analytical column for the gradient (Fig. 5). This setup allows for sample loading in aqueous conditions and separation using WCX-HILIC (Sidoli et al., 2014).
Program the HPLC method as follows: from 0 to 55% buffer B in 1 min, from 55 to 85% B in 160 min and from 85 to 100% in 5 min. If the HPLC is not programmed for automated column equilibration before sample loading then include this part in the method: switch the valve in position load (Fig. 5B), from 100 to 0% B in 1 min and isocratic flow at 0% B for 10 min. The flow rate of the analysis should be 250–300 nL/min
Program the MS acquisition method to perform MS/MS data dependent acquisition of the 5–8 most abundant precursor masses. Do not apply dynamic exclusion, as this increases the number of isobaric forms quantified. The full MS scan range should be 450–750 m/z, as this is the region of the most intense charge states for histone polypeptides. If only histone H3 is analyzed, the window can be narrowed to 660–720 m/z, in order to include only charge state 8+.
Program the MS/MS acquisition to be performed with ETD at a resolution of ~30,000. The reaction time should be around 20 ms for polypeptides with 8–10 charges. Include 3 microscans to improve the quality of the MS/MS spectra acquired, as ETD spectra are overall less reproducible than CID.
Load ~2 μg of sample onto the HPLC trap column. The sample can be loaded as is from the digestion step (section 5.2.1 or 5.2.2)
Run the HPLC-MS/MS method as programmed. Since the sample is loaded onto a trap column it does not require prior desalting, as the salts will be eluted during the trap loading and thus they will not be sprayed into the MS
5.4 Data Analysis
Identification and quantification of middle-down peptides is a more challenging process than bottom-up. This is because each precursor mass might correspond to hundreds of isobarically modified peptides. Currently, the most comprehensive algorithm to quantify histone peptides employs mixed integer linear optimization (DiMaggio et al., 2009). However, this software is currently not in a user-friendly format and it is not flexible through unexpected modification sites. Thus, we developed a workflow that includes database searching and quantification based on virtual histograms. Briefly, the obtained LC-MS/MS result file is searched using Mascot (Matrix Science), which provides identification and spectrum intensities, and then filtered with the freely available isoScale slim (http://middle-down.github.io/Software) (Sidoli et al., 2014).
Collect all raw files and submit them to a deconvolution tool. MS/MS spectra ions should be all singly charged before Mascot database searching. We recommend Xtract as deconvolution algorithm if Thermo Scientific instrument is used (e.g. LTQ-Orbitrap), although it is important to highlight that utilizing Thermo instruments is not mandatory.
-
Perform database searching using a sequence database containing only histones; large databases increase dramatically searching time. Search parameters should be as following: MS mass tolerance: 2.2 Da, to include possible errors of the deconvolution algorithm in isotopic recognition. MS/MS mass tolerance: 0.01 Da. Enzyme: GluC (or AspN) with no missed cleavages. Sample preparation does not generate static modifications on the peptide, while variable modifications can be chosen as desired.
Note: recommended variable modifications are mono- and dimethylation (KR), trimethylation (K), acetylation (K) and, optionally, phosphorylation (ST).
Export Mascot results in .csv, including the following information to the file: all Query level information, all the default information (these last are already selected by default)
Import the .csv file in isoScale slim. Set the tolerance for the search (recommended: 30 ppm) and the type of fragmentation adopted (in this case ETD). The result table contains only confidently assigned combinatorial PTMs. A peptide is defined as confident if all the modifications are uniquely validated by site determining ions, which are ions that unambiguously confirm the localization of a modification site. For instance, in order to verify that a methylation is on H3K27 the software requires at least one ion proving that the modification is not either on H3R26 or H3K36. The output table contains the list of peptides that passed the site determining ions validation and their MS/MS ion intensity. isoScale also quantifies co-fragmented isobaric species, as soon as all these species pass the confidence threshold, in a similar manner as EpiProfile performs for bottom-up peptides (section 3.5.1).
-
The output table contains duplicates (peptides with the same sequence and PTM combination). Remove them by using the “Remove duplicates” option in Excel
Note: from the relative abundance of the combinatorial marks it is possible to extract the relative abundance of single marks simply by summing all relative abundances of peptides that contain the given PTM. To estimate which histone marks tend to co-exist with each other with high or low frequency it is possible to calculate the interplay score (Jung et al., 2013; Schwämmle, Aspalter, Sidoli, & Jensen, 2014). This score is calculated as:
where Ixy is the interplay score between the marks X and Y, Fxy is the co-existence frequency (or relative abundance) of the two marks and Fx or Fy are the frequencies of the single marks in the dataset. Basically, Fxy is the observed co-existence frequency, while (Fx * Fy) is the theoretical co-existence frequency, calculated based on the relative abundance of single PTMs. This calculation provides a score of how much two marks tend to co-exist or be mutually exclusive on the same histone protein. Positive values indicate tendency to coexist higher than if the two marks were completely independent from each other, while negative values indicate the opposite. This score has been used to predict cross-talk between histone marks (Schwämmle et al., 2014).
6. Top-Down Mass Spectrometry
Top-down MS is performed by fragmenting intact protein using ETD fragmentation. The biggest advantage of top-down MS over the other methods is that it provides a global view of the intact protein sequence. However, this advantage is accompanied by the caveat that it is also the least sensitive method because the histone proteins will occupy the maximal number of charge states and modified forms as compared to bottom-up and middle-down strategies. Furthermore, the data analysis is not only more difficult, but also currently prohibitive in specific cases. For instance, while the bottom-up sample preparation can produce peptides with up to 6 isobaric forms, an intact histone has potentially hundreds of thousands of combinatorial forms with the same identical precursor mass. The exponential number can be explained considering the many modifiable sites (e.g. K and R) and the permutations of isobaric modified forms (e.g. me3 is equivalent to me1me2, which is equivalent to me1me1me1, which is equivalent to the same combination on other sites etc.). Therefore, a single MS/MS spectrum might contain an impressive number of isobaric proteoforms, of which at the moment we can only scrape its surface.
Generally, fractionation is performed before conducting a top-down experiment to reduce the complexity of the sample. Histone species can be fractioned as described in section 4. Usually, histone proteins are directly infused into the MS in top-down analysis, but online RP separation can also be conducted if desired (Contrepois, Ezan, Mann, & Fenaille, 2010; Eliuk, Maltby, Panning, & Burlingame, 2010; Tian et al., 2010). Direct infusion using a syringe requires a large amount of sample (about 20–30 μg for 10 minute injection), so using a chip-based infusion system, such as an Advion Nanomate, is strongly recommended, as the amount of sample required can be as low as 5 μg for a 30 minute injection. In these systems, sample is picked up by a small tip and delivered to a chip containing nano-ESI nozzles. Voltage is applied to the chip to enable ionization into the MS. Very small sample volumes can be stably sprayed for long periods of time using this system (10 μL of sample lasts for about 30 to 40 minutes).
6.1 Materials and Buffer Recipes
Sample buffer: 75% acetonitrile/25% 0.8% formic acid adjusted to pH 2.5 with trifluoroacetic acid (TFA)
Chip-based nano-ESI autosampler such as Advion Nanomate
6.2 Top-down MS using direct infusion
The acquisition method in top-down MS will vary depending on the exact application. Most top-down experiments aim to characterize a specific modified form of histone protein. In this case,
If sample contains salt, desalt according to section 3.3 using C8 resin in place of C18 resin. One disc of C8 resin can bind approximately 10 μg of intact histone protein. Use multiple discs if the sample contains more than 10 μg of protein. If offline fractionation was used, there is no need to desalt the sample.
Resuspend histone protein in sample buffer to a concentration of approximately 1 μg/μL.
Infuse sample in a chip-based direct infusion system and adjust parameters to achieve stable spray. Generally, stable spray can be achieved using a capillary temperature of 150°C, voltage between 1.7–2.5 kV, and a gas pressure between 0.3–0.4 psi. Capillary temperatures above 170°C can damage the chip and are not recommended. These parameters will need to be adjusted for each run before data acquisition and may also need adjustment as data is being acquired. A sudden complete loss of signal usually indicates that the nozzle has become clogged, in which case another nozzle should be selected.
Run an MS acquisition method to collect data. The method will depend largely on the application. To analyze a specific modified form of a histone protein, set up an MS1 scan that spans the m/z range of that form but excludes other species (i.e. Fig. 6A). Perform a data-dependent MS/MS on the top isotope of the distribution with a large number of microscans (10–20 should provide a clean spectra depending on the quality of signal and spray). Generally, a 3–5 minute acquisition time is adequate to obtain a high-quality MS/MS spectrum that can be identified during data analysis. If characterization of more than one species is desired, set up the method to take MS1 scans corresponding to the other species followed by data-dependent MS2 scans.
6.3 Data Analysis
Top-down data requires deconvolution before analysis, where each multiply-charged ion is reduced to its singly-charged monoisotopic mass. This process facilitates data analysis as most programs cannot analyze MS/MS spectra containing highly-charged fragment ions. The deconvoluted data can then be searched using Mascot (Matrix Science) software.
Convert files into mzxml format using any of a number of available programs for this purpose
Deconvolute data using any of a number of available programs for this purpose. Examples include the Xtract module from Thermo Xcalibur software and MS-Deconv (freely available at http://bix.ucsd.edu/projects/msdeconv/).
-
Use proper software to search the deconvoluted data. The traditional Mascot license can analyze intact molecules up to 16kDa, making it suitable for intact histone proteins. Other software tools can be used, such as ProSightPC (Thermo) or the freely available MS-Align+ (Liu et al., 2012). Use a database containing all of the histone protein sequences, including variants. Select the modifications of interest as variable PTMs.
Note: manual validation of the results will be necessary, as all software are not optimized for heavily modified proteins such as histones. In order to achieve a confident localization for a given PTM it is necessary to have specific fragment ions between two possible modifiable sites, confirming on which amino acid residue the modification is localized (Fig. 6B/C). Software will provide the “most probable” localization of the PTMs, but they rarely provide a score describing whether another localization for a given modification is equally probable. isoScale slim is applicable for this purpose, as previously illustrated for middle-down (section 5.4). However, it demands result files from Mascot.
References
- Britton LMP, Gonzales-Cope M, Zee BM, Garcia BA. Breaking the histone code with quantitative mass spectrometry. Expert Review of Proteomics. 2011;8(5):631– 643. doi: 10.1586/epr.11.47. http://doi.org/10.1586/epr.11.47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Contrepois K, Ezan E, Mann C, Fenaille F. Ultra-High Performance Liquid Chromatography–Mass Spectrometry for the Fast Profiling of Histone Post-Translational Modifications. Journal of Proteome Research. 2010;9(10):5501–5509. doi: 10.1021/pr100497a. http://doi.org/10.1021/pr100497a. [DOI] [PubMed] [Google Scholar]
- DiMaggio PA, Young NL, Baliban RC, Garcia BA, Floudas CA. A Mixed Integer Linear Optimization Framework for the Identification and Quantification of Targeted Post-translational Modifications of Highly Modified Proteins Using Multiplexed Electron Transfer Dissociation Tandem Mass Spectrometry. Molecular & Cellular Proteomics: MCP. 2009;8(11):2527–2543. doi: 10.1074/mcp.M900144-MCP200. http://doi.org/10.1074/mcp.M900144-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eliuk SM, Maltby D, Panning B, Burlingame AL. High Resolution Electron Transfer Dissociation Studies of Unfractionated Intact Histones from Murine Embryonic Stem Cells Using On-line Capillary LC Separation. Molecular & Cellular Proteomics: MCP. 2010;9(5):824–837. doi: 10.1074/mcp.M900569-MCP200. http://doi.org/10.1074/mcp.M900569-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garcia BA, Mollah S, Ueberheide BM, Busby SA, Muratore TL, Shabanowitz J, Hunt DF. Chemical derivatization of histones for facilitated analysis by mass spectrometry. Nature Protocols. 2007;2(4):933–938. doi: 10.1038/nprot.2007.106. http://doi.org/10.1038/nprot.2007.106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gillet LC, Navarro P, Tate S, Röst H, Selevsek N, Reiter L, … Aebersold R. Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis. Molecular & Cellular Proteomics. 2012;11(6) doi: 10.1074/mcp.O111.016717. O111.016717. http://doi.org/10.1074/mcp.O111.016717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jenuwein T, Allis CD. Translating the Histone Code. Science. 2001;293(5532):1074–1080. doi: 10.1126/science.1063127. http://doi.org/10.1126/science.1063127. [DOI] [PubMed] [Google Scholar]
- Jung HR, Sidoli S, Haldbo S, Sprenger RR, Schwämmle V, Pasini D, … Jensen ON. Precision Mapping of Coexisting Modifications in Histone H3 Tails from Embryonic Stem Cells by ETD-MS/MS. Analytical Chemistry. 2013;85(17):8232–8239. doi: 10.1021/ac401299w. http://doi.org/10.1021/ac401299w. [DOI] [PubMed] [Google Scholar]
- Liu X, Sirotkin Y, Shen Y, Anderson G, Tsai YS, Ting YS, … Pevzner PA. Protein Identification Using Top-Down Spectra. Molecular & Cellular Proteomics: MCP. 2012;11(6) doi: 10.1074/mcp.M111.008524. http://doi.org/10.1074/mcp.M111.008524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luger K, Mäder AW, Richmond RK, Sargent DF, Richmond TJ. Crystal structure of the nucleosome core particle at 2.8 Å resolution. Nature. 1997;389(6648):251–260. doi: 10.1038/38444. http://doi.org/10.1038/38444. [DOI] [PubMed] [Google Scholar]
- MacLean B, Tomazela DM, Shulman N, Chambers M, Finney GL, Frewen B, … MacCoss MJ. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010;26(7):966–968. doi: 10.1093/bioinformatics/btq054. http://doi.org/10.1093/bioinformatics/btq054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rothbart SB, Dickson BM, Raab JR, Grzybowski AT, Krajewski K, Guo AH, … Strahl BD. An Interactive Database for the Assessment of Histone Antibody Specificity. Molecular Cell. 2015;59(3):502–511. doi: 10.1016/j.molcel.2015.06.022. http://doi.org/10.1016/j.molcel.2015.06.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwämmle V, Aspalter CM, Sidoli S, Jensen ON. Large Scale Analysis of Co-existing Post-translational Modifications in Histone Tails Reveals Global Fine Structure of Cross-talk. Molecular & Cellular Proteomics: MCP. 2014;13(7):1855–1865. doi: 10.1074/mcp.O113.036335. http://doi.org/10.1074/mcp.O113.036335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sidoli S, Schwämmle V, Ruminowicz C, Hansen TA, Wu X, Helin K, Jensen ON. Middle-down hybrid chromatography/tandem mass spectrometry workflow for characterization of combinatorial post-translational modifications in histones. PROTEOMICS. 2014;14(19):2200–2211. doi: 10.1002/pmic.201400084. http://doi.org/10.1002/pmic.201400084. [DOI] [PubMed] [Google Scholar]
- Sidoli S, Simithy J, Karch KR, Kulej K, Garcia BA. Low resolution data-independent acquisition in an LTQ-Orbitrap allows for simplified and fully untargeted analysis of histone modifications. Analytical Chemistry. 2015 doi: 10.1021/acs.analchem.5b03009. http://doi.org/10.1021/acs.analchem.5b03009. [DOI] [PMC free article] [PubMed]
- Sidoli S, Yuan Z-F, Lin S, Karch K, Wang X, Bhanu N, … Garcia BA. Drawbacks in the use of unconventional hydrophobic anhydrides for histone derivatization in bottom-up proteomics PTM analysis. Proteomics. 2015 doi: 10.1002/pmic.201400483. http://doi.org/10.1002/pmic.201400483. [DOI] [PMC free article] [PubMed]
- Tian Z, Zhao R, Tolić N, Moore RJ, Stenoien DL, Robinson EW, … Paša-Tolić L. Two-dimensional liquid chromatography system for online top-down mass spectrometry. Proteomics. 2010;10(20):3610–3620. doi: 10.1002/pmic.201000367. http://doi.org/10.1002/pmic.201000367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young NL, DiMaggio PA, Plazas-Mayorca MD, Baliban RC, Floudas CA, Garcia BA. High Throughput Characterization of Combinatorial Histone Codes. Molecular & Cellular Proteomics. 2009;8(10):2266–2284. doi: 10.1074/mcp.M900238-MCP200. http://doi.org/10.1074/mcp.M900238-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan ZF, Lin S, Molden RC, Cao XJ, Bhanu NV, Wang X, … Garcia BA. EpiProfile Quantifies Histone Peptides With Modifications by Extracting Retention Time and Intensity in High-resolution Mass Spectra. Molecular & Cellular Proteomics: MCP. 2015;14(6):1696–1707. doi: 10.1074/mcp.M114.046011. http://doi.org/10.1074/mcp.M114.046011. [DOI] [PMC free article] [PubMed] [Google Scholar]