Abstract
Loss-of-function mutations in the secreted enzyme ADAMTS7 (a disintegrin and metalloproteinase with thrombospondin motifs 7) are associated with protection for coronary artery disease. ADAMTS7 catalytic inhibition has been proposed as a therapeutic strategy for treating coronary artery disease; however, the lack of an endogenous substrate has hindered the development of activity-based biomarkers. To identify ADAMTS7 extracellular substrates and their cleavage sites relevant to vascular disease, we used TAILS (terminal amine isotopic labeling of substrates), a method for identifying protease-generated neo–N termini. We compared the secreted proteome of vascular smooth muscle and endothelial cells expressing either full-length mouse ADAMTS7 WT, catalytic mutant ADAMTS7 E373Q, or a control luciferase adenovirus. Significantly enriched N-terminal cleavage sites in ADAMTS7 WT samples were compared to the negative control conditions and filtered for stringency, resulting in catalogs of high confidence candidate ADAMTS7 cleavage sites from our three independent TAILS experiments. Within the overlap of these discovery sets, we identified 24 unique cleavage sites from 16 protein substrates, including cleavage sites in EFEMP1 (EGF-containing fibulin-like extracellular matrix protein 1/Fibulin-3). The ADAMTS7 TAILS preference for EFEMP1 cleavage at the amino acids 123.124 over the adjacent 124.125 site was validated using both endogenous EFEMP1 and purified EFEMP1 in a binary in vitro cleavage assay. Collectively, our TAILS discovery experiments have uncovered hundreds of potential substrates and cleavage sites to explore disease-related biological substrates and facilitate activity-based ADAMTS7 biomarker development.
Keywords: ADAMTS, ADAMTS7, Biomarkers, Coronary artery disease, Degradomics, EFEMP1, Mass spectrometry, Protease, Substrate identification, TAILS proteomics
Abbreviations: ACN, acetonitrile; ADAMTS7, A disintegrin and metalloproteinase with thrombospondin motifs 7; CS-GAG, chondroitin sulfate glycosaminoglycan; FA, formic acid; HUVEC, human umbilical vein endothelial cells; SMC, smooth muscle cells; TAILS, terminal amine isotopic labeling of substrates; TMT, tandem mass tag
Graphical Abstract
Highlights
-
•
Examination of ADAMTS7 substrates from vascular smooth muscle and endothelial cells.
-
•
Terminal amine isotopic labeling of substrates (TAILS) with active and dead enzyme.
-
•
Activity-dependent cleavage sites identified and compared from independent studies.
-
•
Verification of adjacent cleavage sites in new ADAMTS7 substrate EFEMP1/Fibulin-3.
In Brief
Catalytic inhibition of the secreted enzyme ADAMTS7 (a disintegrin and metalloproteinase with thrombospondin motifs 7) has been proposed as a therapeutic target for coronary artery disease. However, the substrates of ADAMTS7 are not well understood, impeding the understanding of disease biology and preventing the development of activity-based clinical biomarkers. In this study, we provide three discovery sets identifying new ADAMTS7 substrates. We report hundreds of new substrate cleavage sites, including 24 unique cleavage sites found in all ADAMTS7 TAILS experiments.
ADAMTS7 (a disintegrin and metalloproteinase with thrombospondin motifs 7) is a large, secreted protease with poorly defined substrates. Variants at the ADAMTS7 locus were identified in population-based genome wide association studies for coronary artery disease, with a risk haplotype including a coding variant in the prodomain, rs3825807 p.Ser214Pro (1, 2, 3). The Ser214 risk variant was shown to increase prodomain processing and maturation, correlating with an increase in COMP (Cartilage oligomeric matrix protein) degradation and vascular smooth muscle cell migration (4). Additionally, the Ser214 risk allele was associated with an unstable atherosclerotic plaque phenotype and an increase in secondary cardiac events (5, 6, 7, 8). Mouse KO studies have shown that Adamts7 loss of function significantly reduces vascular smooth muscle cell-mediated neointimal formation in arterial wire injury models (9, 10). These findings were supported by Adamts7 knockdown experiments in a rat carotid artery balloon injury model of restenosis (11, 12). Conversely, ADAMTS7 overexpression in the rat balloon injury model increased the rate of neointima formation, presumably through an increase in extracellular matrix degradation to allow for vascular smooth muscle cell migration at the site of vascular injury. ADAMTS7, and its paralog ADAMTS12, are defined as COMP proteases that utilize their carboxyl-terminal regions for substrate recognition (13, 14). Based on evidence from gel-based cleavage assays, the carboxyl-terminal region of ADAMTS7 was shown to be necessary for substrate cleavage, demonstrating a requirement of the full-length ADAMTS7 protease (15). However, the ADAMTS7 substrate cleavage sites for COMP and several other reported substrates from gel-based assays have not been identified. Furthermore, the lack of an endogenous substrate cleavage site for ADAMTS7 has hindered the development of activity-based biomarkers for clinical development.
ADAMTS7 belongs to a family of 19 secreted zinc metalloproteinases with a shared organization of a signal peptide, prodomain, metalloproteinase, disintegrin, thrombospondin, cysteine-rich, and spacer domains (16). Additionally, ADAMTS7 has a total of eight thrombospondin type I repeats and a highly glycosylated mucin domain with a chondroitin sulfate glycosaminoglycan (CS-GAG) attachment that set ADAMTS7 and ADAMTS12 apart from their family members (17, 18). Consequently, the CS-GAG–modified ADAMTS7 is both an extracellular protease and a proteoglycan. Other ADAMTS family members have known substrates and are associated with human disease (19). ADAMTS13 loss-of-function mutations or autoimmune function blocking antibodies to ADAMTS13 result in the clotting disorder thrombotic thrombocytopenic purpura. During hemostasis, von Willebrand factor is precisely cleaved by ADAMTS13, using an elaborate substrate recognition system involving the disintegrin, cysteine-rich, and spacer domains (20). Loss-of-function mutations in the procollagen I N-proteinase ADAMTS2 result in the connective tissue disorder dermatosparaxis Ehlers-Danlos syndrome. ADAMTS4 and ADAMTS5 (also known as aggrecanase-1 and aggrecanase-2) degrade the extracellular matrix proteins aggrecan and versican at defined cleavage sites. Antibodies raised to the aggrecan neo-epitopes are in use for clinical development of osteoarthritis therapeutics targeting ADAMTS5 enzymatic function (21, 22).
TAILS proteomics is an unbiased method for identifying cleavage sites from full-length substrates in their natural context (23). In this protocol, the primary amine groups from samples with and without protease activity are labeled and blocked by isobaric tags. N-terminally labeled peptides with higher ratios in the enzyme-treated samples represent neo N-termini or candidate cleavage sites for the enzyme. The TAILS method has been applied for the secreted metalloproteinases MMP2 and MMP9 (24) and for the procollagen I N-proteinases ADAMTS2, ADAMTS3, and ADAMTS14 (25, 26) to identify additional substrates and substrate cleavage sites. Recently TAILS was applied to ADAMTS7 using one active and one inactive condition from a truncated ADAMTS7 to identify substrates from a human fibroblast cell line, including a candidate site in LTBP4 (Latent-transforming growth factor beta-binding protein 4) (27).
Here, we present the results from TAILS experiments performed in human coronary artery smooth muscle cells and vascular endothelial cells using full-length WT and catalytic mutant ADAMTS7. In three independent experiments, we identified 24 unique cleavage sites from 16 protein substrates, including cleavage sites in EFEMP1 (EGF-containing fibulin-like extracellular matrix protein 1/Fibulin-3).
Experimental Procedures
ADAMTS7 Expression, Cell Culture, and Preparation of the Secreted Proteins
Full-length mouse Adamts7 1-1857 (Uniprot Q68SA9, WT or E373Q (EQ) catalytic mutant) with a carboxyl-terminal 3xFLAG tag was used to generate custom adenoviruses as previously described (28). Adeno-CMV-Luciferase (Luc) from Vector Biolabs was used as a negative control. Human Coronary Artery Smooth Muscle Cells were obtained from Lonza and expanded in Lonza SmGM-2 medium (containing phenol red) in 15-cm tissue culture plates. Twenty-four hours before medium collection, near confluent plates were transduced with 50 MOI adenovirus and switched to Lonza Basal Spectrum Mill (SM) medium (containing phenol red) with volumes of 20 ml per 15-cm plate. TAILS was also performed in Human umbilical vein endothelial cells (HUVEC) from Lifeline Cell Technology grown in VascuLife Medium (containing no phenol red) and transduced in Lifeline Basal EC (+bFGF2 10 ng/ml, containing no phenol red). Expression of the flag-tagged ADAMTS7 was confirmed from the medium using Western blot and detected using the anti-Flag M2-HRP antibody (Sigma, A8592). Collected medium was processed by adding protease inhibitors (1 mM EDTA and 1 mM PMSF) and clarified by 500g × 5 min spin at 4C to pellet cell debris. Cell medium supernatants were then passed through a 0.22 μm filter and kept chilled. Processed medium was then concentrated 50 to 100x using 3 kDa Centricon Plus-70 filter unit, spun at 3,500g × 60 min at 4C. Approximately, 10% of the processed, concentrated medium was set aside for total proteome analysis of the secretome. The remaining 90% of the processed, concentrated medium was buffer exchanged into 50 mM Hepes pH 8.0, 150 mM NaCl (2 × 50 ml, removing any phenol red from the original medium) using the 3 kDa Centricon Plus-70 filter unit and concentrated to >2 mg/ml to serve as input for the TAILS experiment. Samples for secretome proteome and TAILS were stored at −80 °C until further use.
For the first experiment, referred to as smooth muscle cell (SMC1), 120 ml of medium from 6 × 15 cm dishes was pooled from each condition (Luc, WT, EQ) and processed to produce three separate inputs for the TAILS experiment. Luc, WT, and EQ inputs for the SMC1 experiment were each divided into triplicate 400 μg samples to generate technical replicate TAILS data. For the second experiment, referred to as SMC2, medium was processed separately in triplicate for each condition (Luc1, Luc2, Luc3, WT1, WT2, WT3, EQ1, EQ2, EQ3) using 60 ml of medium from 3 × 15 cm dishes for each process replicate. Inputs from the SMC2 experiment used process replicates of 350 μg each to generate TAILS data. The third TAILS experiment from HUVEC cell line used the same strategy as the SMC2 experiment, with 40 ml of medium from 2 × 15 cm dishes for each process replicate and used 200 μg each to generate TAILS data. Inputs for process replicate experiments were restricted to the lowest concentration from the nine parallel samples for SMC2 or HUVEC.
TAILS Sample Preparation for LC-MS/MS Analysis
Samples were prepared according to the TAILS protocol from the Overall lab (23) with some modifications. For each experiment, nine samples in 50 mM Hepes pH 8.0, 150 mM NaCl were first denatured with guanidinium chloride (Sigma, G4505) to reach a final concentration of 2.5 M guanidinium chloride and 250 mM Hepes and incubated at 65 °C for 15 min. Reduction was achieved through addition of tris(2-carboxylethyl)phosphine at a final concentration of 20 nM tris(2-carboxylethyl)phosphine and incubated at 65 °C for 45 min. Alkylation reaction was performed with the addition of iodoacetamide (Sigma, A3221) to a final concentration of 10 mM iodoacetamide for 15 min at room temperature in the dark. Following denaturation, reduction, and alkylation steps, 10 percent of each of the nine samples was removed and combined to serve as a pooled reference. Isobaric labeling at protein level with tandem mass tag (TMT)10 (ThermoFisher, 90113) reagent was performed at the ratio of 10:1 TMT:protein in 50% dimethyl sulfoxide, shaking at 850 rpm. TMT channels were randomly assigned to the nine samples, and TMT 131 was assigned to the pooled reference sample created by pooling equal amount from all nine samples. A second round of TMT labeling was performed sequentially for 30 min more to increase labeling efficiency. Labeling reactions were quenched with 100 mM ammonium bicarbonate for 15 min at room temperature. Five percent of each reaction was removed to assess isobaric labeling efficiencies. 10 TMT labeled samples were then combined and precipitated using eight volumes of cold acetone and one volume of cold methanol in Beckman BK357001 tubes and stored at −80 °C for 3 h. Mixed samples as well as 10 labeling efficiency samples were then centrifuged in a JA-17 rotor at 14,000g for 20 min. Supernatants were discarded and the samples were washed twice with 20 ml of ice-cold methanol to remove residual guanidinium chloride before trypsin digestion. Pellets were air dried (with SpeedVac briefly when needed), resuspended with 50 mM NaOH, and adjusted to 1 mg/ml protein in 50 mM Hepes pH 8.0. SMC1 experiment was digested solely with sequencing grade Trypsin (Promega, V5113) at a ratio of 1:50 protease to protein at 37 °C overnight. For SMC2 and HUVEC experiments, half of the pooled sample was digested with trypsin and the other half was digested with sequencing grade AspN (Promega, V1621) at 37 °C overnight and then mixed together after quenching the reactions the next day. Five percent of each pooled digestion reaction (preTAILS) was removed to assess negative selection efficiencies. Digested samples were adjusted to pH 6 to 7 and were enriched for TMT-blocked N-termini using a hyperbranched polyglycerol aldehyde polymer from the Kizhakkedathu lab, University of British Columbia (Flintbox). Hyperbranched polyglycerol aldehyde polymer was washed with water and added at 5-fold excess to the digested protein with sodium cyanoborohydride (20 mM final concentration) and incubated at 37 °C overnight. Polymer and polymer-bound peptides were retained in 3 kDa Amicon column and the flow-through representing labeled peptides was collected for LC-MS/MS analysis. 10 percent of the flow-through (postTAILS) was removed to assess negative selection efficiencies. Enrichment efficiency was evaluated by comparing preTAILS and postTAILS isobaric labeling efficiencies for each experiment.
Remaining flow-through was desalted on a 30 mg Oasis HLB cartridge. After sample cleanup, the flow-through was separated using basic reverse-phase chromatography on a 2.1 × 250 mm Zorbax 300 extend-c18 column with a 60 min gradient using 20 mM ammonium formate/2% acetonitrile (ACN) pH 10 as buffer A and 20 mM ammonium formate/90% ACN pH 10 as buffer B. The sample was separated into 96 fractions and concatenated down to 12 by combining every 13th fraction. The 12 fractions were dried in the SpeedVac, reconstituted in 9 μl of 3%/0.1% ACN/formic acid (FA), and 4 μl of it was analyzed by LC-MS/MS
Secretome Proteome Sample Preparation for LC-MS/MS Analysis
Sample processing for analysis of the total secretome was performed as previously described (29). Briefly, samples were reduced, alkylated, and LysC/trypsin digested followed by TMT labeling of 40 μg aliquots for each. Similar to the TAILS experiment, channels were assigned randomly and channel 131 was used for the pooled peptide reference consisting of equal amounts of all nine samples. Following 1 h incubation of the samples with TMT reagents, a small aliquot was taken for assessing labeling efficiency. After confirming over 95% labeling efficiency, the reactions were quenched, and the samples were mixed and desalted on 30 mg Oasis HLB cartridge. Sample was then fractionated as described above and the resulting 12 fractions were reconstituted in 30 μl of 3%/0.1% ACN/formic acid, and 1μl of it was analyzed by LC-MS/MS.
LC-MS/MS Analysis
The TAILS and proteome fractions were separated on a Proxeon nanoLC using 3%/0.1% ACN/FA for buffer A and 90%/0.1% ACN/FA for buffer B on a 27 cm 75 μm ID picofrit column packed in-house with Reprosil C18-AQ 1.9 μm beads (Dr Maisch GmbH)C with a 90 min gradient consisting of 6 to 20% buffer B in 62 min, 20 to 35% B for 22 min, 35 to 60% B for 9 min, 60 to 90% B for 1 min followed by a hold at 90% B for 5 min. Online LC-MS/MS was performed on a Thermo Q-Exactive Plus mass spectrometer. The mass spectrometry (MS) method consisted of a full MS scan at 70,000 resolution and an automated gain control target of 3e6 from 300 to 1800 m/z followed by MS2 scans collected at 35,000 resolution with an automatic gain control target of 5e4 with a maximum injection time of 120 ms and a dynamic exclusion of 20 s. The isolation window used for MS2 acquisition was 0.7 m/z, and 12 most abundant precursor ions were fragmented with a normalized collision energy) of 29 optimized for TMT10 data collection.
Peptide/Protein Quantification and Statistical Analysis
Both TAILS and proteome datasets were processed using Spectrum Mill (https://proteomics.broadinstitute.org/, v7.07). The raw MS files were extracted and searched against the Uniprot human database downloaded on December 28th 2017 with the mouse ADAMTS7 sequence appended containing 65,070 protein entrees. Allowed miscleavage events were set to 4, and parent and fragment mass tolerance of 20 was used for all the searches. For TAILS datasets in the SMC1 experiment, the search was done in 2 rounds while in SMC2 and HUVEC experiments with the split AspN/Trypsin digest in a total of 4 rounds. The first search using a nonspecific N-termini semi tryptic cleavage motif (K.; R.) was done with TMT10 as a fixed modification of peptide N-termini and lysine side chains and acetylated protein N-termini as a variable modification to identify all TMT-labeled peptides and acetylated N-terminal peptides of proteins. Peptide spectra not identified in this first search were searched again adding acetylated peptide N-termini to the fixed modifications to identify any peptides that are acetylated but are not the N-termini of the protein. This process was then repeated using a nonspecific N-termini semi-AspN (.D;.E) cleavage motif for SMC2 and HUVEC experiments. Proteome datasets were searched once with trypsin cleavage and TMT10 as fixed modification. Using the SM autovalidation module peptide spectrum matches for individual spectra were confidently assigned by applying target-decoy based false discovery rate (FDR) estimation to achieve <1.0% FDR at the peptide spectrum match, peptide, and protein levels. For the proteome datasets, additional protein level autovalidation was performed with the criteria of 0% FDR at the protein level and a minimum protein score of 13. The SM protein score is the sum of the scores of distinct peptides. A distinct peptide is the single highest scoring instance of a peptide detected through an MS/MS spectrum. Minimum protein score threshold is used to eliminate proteins with low scoring peptides so that each identified protein reported in the study is comprised of multiple peptides, unless a single excellent scoring peptide was the sole match.
Distinct peptide and protein level exports were generated in Spectrum Mill for TAILS and proteome datasets, respectively. Log2 TMT reporter ion ratio of every channel to the pooled reference channel was used for the quantification and further statistical analysis. TMT10 reporter ion intensities were corrected for isotopic impurities in the Spectrum Mill protein/peptide summary module using the afRICA correction method, which implements determinant calculations according to Cramer's Rule and correction factors obtained from the reagent manufacturer’s certificate of analysis (https://www.thermofisher.com/order/catalog/product/90406) for lot numbers UA280170 and TE270748 for SMC and HUVEC datasets, respectively (30). In TAILS datasets, Log2 TMT ratios of each sample were normalized to the median ratio of the quantified natural N-termini peptides for that sample. This was done by filtering for TMT-labeled peptides at the N-termini with a start amino acid number of 1 or 2 and finding the median ratio of these peptides in each channel/sample. Then, this median value was subtracted from the Log2 TMT ratios of all peptides (23). This median normalization centers distribution of Log2 TMT ratio for each sample around zero in order to nullify the effect of differential protein loading and/or systematic MS variation. Although this differs from the median of all observed peptides by only about 5%, using only the terminal peptides for normalization is more suitable because the levels of these peptides should be the same in all the sample groups. A moderated two-sample t test was used to compare the different groups together. Significant regulation was assessed using Benjamini Hochberg corrected p-values, referred as adjusted p-values below. Correlation plots and heatmaps of regulated peptides/proteins was performed in Protigy (Proteomics Toolset for Integrated Data Analysis, Broad Institute).
Experimental Design and Statistical Rationale
Three separate TAILS experiments have been performed, 2 in SMC and 1 in HUVEC cell lines. Each experiment contained Adeno-CMV-Luciferase (Luc) sample group as negative control and full-length mouse Adamts7 1-1857 WT and E373Q (EQ) catalytic mutant as the experimental sample groups for comparison, each group in triplicates. SMC1 experiment consisted of technical replicates whereas SMC2 and HUVEC experiments consisted of full process replicates. Moderated two-sample t test was used for comparing the groups in the datasets generated. For the SMC1 technical replicate experiment, a Benjamini Hochberg adjusted p-value <0.01 cut off for significant hits was applied while an adjusted p-value <0.05 cut off for significant hits was used for the SMC2 and HUVEC process replicate experiments.
Candidate Prioritization
From the TAILS datasets, the adjusted p-value (adj.p.val) and log Fold Change (logFC) values were used to identify substrate cleavage sites enriched for ADAMTS7 proteolytic activity. Results plotting logFC and −log10 (adj.p.val) data points were visualized in Volcano plots generated by Prism 9. Initial filtering of significant hits from both the adj.p.Val.mWT.over.mEQ and adj.p.Val.mWT.over.Luc identified regulated peptides associated with ADAMTS7 protease activity. No logFC cut off was applied, but positive logFC.mWT.over.mEQ values predicted candidate cleavage sites. In some cases, the same substrate cleavage site was identified from multiple peptides representing missed cleaved and fully cleaved at the C-termini versions of the peptide. To collapse the discovery set in to a single unique site, the geneSymbol field was concatenated with the StartAA field to generate a cleavage site identifier. Additional filtration of ADAMTS7 autocatalytic sites with removal of significant hits from adj.p.Val.mEQ.over.Luc (associated with overexpression of the catalytically inactive mutant) was performed to generate a high confidence discovery set for each TAILS experiment. Overlap analysis between the SMC1, SMC2, and HUVEC TAILS experiments was performed to identify consistently regulated peptides as candidate ADAMTS7 substrate cleavage sites. Venn diagrams were made manually in Adobe Illustrator. Analysis of cleavage site positions −4 to +4 was performed with Weblogo (31) and iceLogo (32) to generate logo consensus and cleavage site heatmaps factoring in the natural abundance of each amino acid in the human proteome.
EFEMP1 Substrate Validation Experiments
Cultured HUVEC with high endogenous EFEMP1 (Fibulin-3) expression were chosen for cell-based validation experiments of TAILS identified cleavage sites. Medium from HUVEC transduced with 50 MOI Ad-Luc, Ad-mADAMTS7 WT, or Ad-mADAMTS7 EQ was collected in serum-free conditions (Lifeline Basal EC +bFGF2 10 ng/ml) and concentrated using 3 kDa Amicon spin columns. Ten percent of the concentrated medium was run on a 4 to 20% Mini-PROTEAN gel (Bio-Rad) and analyzed by Western blot to detect the carboxyl region of EFEMP1/Fibulin-3 (antigen 140-209, Novus, NBP2-57871) or ADAMTS7-Flag using M2-HRP. Remaining concentrated medium was run on a parallel gel for Coomassie blue staining to collect size specific bands. Gel slices were digested with trypsin or chymotrypsin for mass spectrometry analysis of semi-trypsin and semi-chymotrypsin peptides (Whitehead Institute Proteomics Core Facility). Unique EFEMP1 nontryptic or nonchymotryptic sites identified more than once from the combined gel slices were compared across Luc, WT, and EQ samples. The number of unique mass spectrometry identified peptides representing a potential cleavage site, and the combined area for these peptides was used as a semi-quantitative method to identify and compare the proportion of EFEMP1 123.124 and EFEMP1 124.125 cleavage sites. To validate EFEMP1 cleavage in a binary assay, purified recombinant human HA-tagged EFEMP1/Fibulin-3 (R&D, 8416-FB) provided in PBS was dialyzed into TBS pH 8.0 to prevent precipitation with the CaCl2 in the assay buffer. Purified mouse Adamts7 WT S3A 3xFlag contained a 250 kDa full-length form and 150 kDa truncated Flag tagged form enriched in later size-exclusion chromatography fractions as previously described (28). For the mouse ADAMTS7 S3A constructs, the conserved Ser991 CS-GAG attachment site and adjacent Ser989 and Ser993 from the SGSGS motif were mutated to alanine (S989A/S991A/S993A or AGAGA) to facilitate protein purification. Purified mouse ADAMTS7 S3A proteins (0.5 μg) were incubated with HA-EFEMP1/Fibulin-3 (1.0 μg) at 37 °C in an assay buffer containing 50 mM Tris pH 8.0, 150 mM NaCl, 5 mM CaCl2, 10 μM ZnCl2, and 0.004% Bridj35. The carboxyl region of EFEMP1/Fibulin-3 was detected with Novus antibody NBP-57581, and the amino terminal HA-tag was detected with anti-HA antibody (Cell Signaling, C29F4). Coomassie-stained gel slices were submitted for mass spectrometry analysis of semi-trypsin and semi-chymotrypsin peptides (Whitehead Institute Proteomics Core Facility). Analysis was performed similar to the HUVEC validation experiment to identify and compare the proportion of EFEMP1 1223.124 and EFEMP1 124.125 cleavage sites.
Results
ADAMTS7 TAILS Experimental Design
All TAILS experiments were constructed to allow for comparison of the active WT protease in triplicate with either of the two negative control groups (supplemental Fig. S1). Control groups contained either the expression of a nonspecific luciferase or an inactive EQ catalytic mutant protease with a glutamate to glutamine substitution in the HExxH metalloproteinase active site (28). We used three different experimental designs to perform TAILS discovery experiments in human coronary artery smooth muscle cells and HUVEC (supplemental Fig. S1, A–D). In the first experiment, referred to as SMC1, medium was pooled from one of three different conditions (Ad-Luc control, Ad-mADAMTS7 WT, or Ad-mADAMTS7 EQ) and then split into three technical replicates for each condition to minimize biological variation. In the second experiment, referred to as SMC2, input medium from each triplicate condition was processed separately. In the third experiment, process replicates from adenovirus-transduced HUVEC were used to identify ADAMTS7 substrate cleavage sites from secreted factors and extracellular matrix proteins originating from vascular endothelial cells (supplemental Fig. S1, E and F). For all three experiments, the total proteome of the secretome was also measured by quantitative proteomics.
Greater than 80% of the postTAILS identified spectra contained an isobaric label to enable quantification and allowed for quantitative comparison of 8818 peptides from 3152 proteins, 10,964 peptides from 3579 proteins, and 13,276 peptides from 3826 proteins in SMC1, SMC2, and HUVEC experiments, respectively (supplemental Table S1). Total secretome proteome analysis resulted in 1847 in SMC1, 1808 in SMC2, and 2031 in HUVEC fully quantitated proteins.
Statistical Analysis of the Datasets
TMT reporter ion ratio of each sample to the pooled reference was used for the quantitative analysis. Data was normalized using median ratio of TMT-labeled natural N-termini peptides prior to application of moderated two-sample t test for comparison of the three groups for each TAILS experiment (supplemental Table S2). As expected, correlation of the technical replicates in SMC1 experiment was higher than the process replicates in SMC2 and HUVEC experiments (supplemental Fig. S2, A–C). Therefore, we used multiple hypothesis tested adjusted p-value (adj. p-val) threshold of 0.01 for the SMC1 experiment and 0.05 for the SMC2 and HUVEC experiments. Results of the two-sample t test are shown as volcano plots in supplemental Fig. S3, A–C. Differentially regulated peptides in WT/EQ, WT/Luc, and EQ/Luc comparisons passing the significance threshold for each experiment were predominantly in the positive log fold change (logFC) side of each TAILS volcano plot. Peptides identified as mouse ADAMTS7 are colored in red for each plot. Significantly regulated peptides in the WT/EQ and WT/Luc comparisons with positive logFC values represent neo-N-termini potentially associated with ADAMTS7 catalytic activity. In contrast, the EQ/Luc significant and positive logFC neo-N-termini are more likely to be artifacts from adenoviral expression of a full-length catalytically inactive protein. A cluster analysis of the significantly regulated TAILS peptides (supplemental Fig. S2, D–F) revealed that the Luc and EQ negative controls clustered together and away from the active WT condition.
Secretome proteome datasets followed the same trends as the TAILS datasets with the highest correlation of replicates observed in SMC1 experiment (supplemental Fig. S4, A–C). Normalized TMT-labeled secretomes were analyzed similar to the TAILS experiment and a moderated two-sample t test was used to compare the three groups for each total secretome experiment (supplemental Table S3). Cluster analysis of the secretome proteins showed more similarity in the secreted proteomes between the WT- and EQ-expressing samples than the Luc control sample (supplemental Fig. S4, D–F). Therefore, the secretomes clustered not with ADAMTS7 activity, as with the TAILS experiments, but with ADAMTS7 expression regardless of catalytic activity. Comparative secretome analysis of the WT/EQ, WT/Luc, and EQ/Luc conditions was performed using the same significance thresholds from the corresponding TAILS experiments and visualized using volcano plots (supplemental Fig. S3, D–F). The position of mouse ADAMTS7 protein was noted in each volcano plot with a red diamond. As expected, mouse ADAMTS7 was significantly upregulated in WT/Luc and EQ/Luc comparisons, with a logFC range of 2.5 to 3 for SMC experiments and a logFC range of 8 to 9 for the HUVEC experiment. In contrast, there was no significant expression difference in the WT/EQ secretome comparisons for mouse ADAMTS7, indicating a similar level of WT and EQ protein abundance. Next, we filtered the significantly regulated proteins to logFC >1 (greater than 2-fold upregulated) or <-1 (down regulated more than 2-fold) and looked for commonalities in the WT/EQ, WT/Luc, and EQ/Luc comparisons using a Venn diagram for each experiment (supplemental Fig. S5, A–C). Using these criteria, very few proteins were associated purely with proteolytic activity from the WT/EQ secretomes. Furthermore, there were no commonalities between the three secretome experiments with the exception of mouse ADAMTS7 in the overlap between WT/Luc and EQ/Luc.
Examination of ADAMTS7 Autocleavage Events
Next, we focused on the regulated peptides from mouse ADAMTS7 in the TAILS experiments. Of the significantly regulated peptides for the WT/EQ comparisons associated with ADAMTS7 activity from each experiment, around 17% corresponded to mouse ADAMTS7 peptides and nearly all were upregulated in the WT sample (supplemental Fig. S3, A–C). The WT/EQ ADAMTS7 peptides that were significantly enriched were also found in the WT/Luc comparisons associated with ADAMTS7 activity. Therefore, these ADAMTS7 neo N-termini from the WT/EQ comparison were likely due to cis or trans autocleavage events. On the other hand, there were EQ/Luc-regulated peptides that were also present in the WT/Luc comparisons, possibly representing proteolysis from other cellular proteases. Consistent with this hypothesis, the total number of WT/EQ and EQ/Luc significant ADAMTS7 logFC+ peptides added up to roughly the number of WT/Luc logFC+ peptides (supplemental Table S4). WT/EQ significantly enriched peptides filtered against EQ/Luc significant peptides resulted in our list of ADAMTS7 autocleavage sites from each TAILS experiment (supplemental Table S5).
We compared the locations of the prospective autocleavage events from each TAILS experiment and found that a third to nearly half of the unique sites were located in the prodomain (residues 21-220). Only 11 of the ADAMTS7 sites were common to all experiments, with 7 in the prodomain, 3 in the spacer domain, and 1 in the PLAC domain (Fig. 1A). The most abundant prospective autocleavage sites based on spectral total intensity were found at 170.171 (HAQP↓HVVY) in the prodomain, 1061.1062 (SYSG↓FEEP) in the mucin domain, and 1600.1601 (EDCE↓LVEP) in the PLAC domain (Fig. 1B and supplemental Table S5). It is notable that the predominant TAILS-identified ADAMTS7 autocleavage site is identical to the mucin domain autocleavage product we previously reported (28) and serves to validate our approach for identifying ADAMTS7-dependent cleavage sites.
ADAMTS7 Substrate Cleavage Sites Identified by TAILS
To generate a high confidence list of substrates from each TAILS experiment, we applied a series of requirements and filters to the significant hits from the WT/EQ comparisons (see Experimental Procedures). First, we excluded any ADAMTS7 sites or any sites with a logFC less than zero from the WT/EQ significant hits (supplemental Table S4). Second, we performed the same exclusions to the WT/Luc significant hits as a separate comparison for ADAMTS7 function and used the filtered overlap from the WT/EQ and WT/Luc as a stringent constraint for ADAMTS7 catalytic activity. Lastly, we removed any sites that were significantly upregulated in the EQ/Luc comparisons or any duplicate identifications from multiple peptides to generate a unique list of high confidence substrate cleavage sites. Volcano plots with the mouse ADAMTS7 peptides removed from the dataset show the high confidence substrate cleavage site regulated peptides (labeled in green) within the upper right quadrant (Fig. 2, A–C). Histograms illustrate the overlap of significantly regulated unique cleavage sites from each of the comparisons and display a similar trend for the independent TAILS experiments (Fig. 3, A, C, and E). It is noteworthy that the WT/EQ high confidence–regulated peptides on average were more significant and had higher logFC values than WT/EQ-only regulated peptides, which lacked independent verification within each TAILS dataset (supplemental Fig. S6).
From the original number of WT/EQ significantly regulated peptides, 66% of the SMC1, 67% of the SMC2, and 60% of the HUVEC significant hits passed all these criteria (supplemental Tables S4 and S6). Next, we compared the substrate logo and heatmaps of high confidence substrate cleavage sites from the SMC and HUVEC TAILS experiments. Based on the iceLogo plots, AA↓L at the P2, P1, and P1′ positions was most commonly observed (Fig. 3, B, D, and F); however, each of these amino acids was present at no more than 30% in their respective positions (supplemental Fig. S7). Although a strict consensus was not apparent from our individual high confidence datasets, similarities were evident in candidate substrate heatmaps and amino acid frequency plots from different TAILS experiments (supplemental Fig. S8).
TAILS Discovery Set Overlap Analysis
By virtue of having multiple independent TAILS discovery experiments for ADAMTS7, we compared the list of high confidence substrate cleavage sites to emphasize those that were found more than once. From the comparison of substrate cleavage sites from SMC1 (207 identified solely by trypsin digests) and SMC2 (210 identified from half trypsin or half AspN digests), 66 were identical resulting in an overlap of 32% of the identified sites (Fig. 4A). Although both SMC samples were digested with trypsin, 13 of the 66 unique sites were identified by AspN-regulated peptides from the SMC2 TAILS experiment rather than the comparable semi-tryptic peptide from the SMC1 TAILS experiment, demonstrating that multiple peptides can identify the same cleavage site. Sorting the overlapping SMC cleavage sites into common genes identified multiple cleavage sites in several candidate substrates, including eight for COL1A2 (Collagen type I alpha-2 chain), six for FN1 (Fibronectin), four for HSPG2 (basement membrane-specific heparan sulfate proteoglycan core protein/perlecan) and four for LOX (protein-lysine 6-oxidase). Comparison of the candidate sites from SMC2 and HUVEC (127, also identified from half trypsin or half AspN digests) resulted in 44 identical cleavage sites, including additional sites in FN1, HSPG2, and LOX not present in the SMC1 TAILS high confidence cleavage site list (Fig. 4B). Of these 44 shared sites, the 20 that were unique to the SMC2/HUVEC comparison were predominantly from the AspN digestion (15 semi-AspN peptides and 5 purely semi-tryptic peptides), which may explain why these sites were not identified in the SMC1 trypsin only digestion TAILS. In total, there were 91 unique sites identified multiple times in our TAILS discovery experiments for ADAMTS7 substrates (supplemental Table S6). When the individual sites are grouped into their corresponding gene, 48 potential substrates emerged (Fig. 4C). The most unique cleavage sites from multiple datasets were identified from FN1, including four unique cleavage sites from significantly regulated peptides in all TAILS experiments. Furthermore, both FN1 and LOX are amongt the coronary artery disease risk loci categorized as nonlipid–vascular remodeling pathway genes along with ADAMTS7 (33).
Analyzing the overlap between all three TAILS discovery sets identified 24 unique cleavage sites encoded by 16 different genes (Table 1). Most ADAMTS7 candidate substrates from this list were primarily localized in the extracellular region, with the exception of proteins with defined roles in the cytoskeletal (FLNA and MAP4) and nuclear (SERBP1) intracellular regions. The 24 cleavage sites were found in a variety of substrate protein domains and were commonly found in N-terminal regions or unstructured linker regions. Remarkably, some of the unique cleavage sites were found at adjacent positions in the same candidate substrate, as was the case with EFEMP1 and MAP4. In both cases, the logFC ratios favored the more N-terminal cleavage site, which may indicate either an initial preference for the first site or a sequential cleavage event while the enzyme remained associated with the cleaved substrate. By analyzing the overlap between independent TAILS discovery sets, we have identified reproducible substrate cleavage sites which we used to prioritize validation experiments to confirm ADAMTS7 protease specificity.
Table 1.
TAILS_id | SMC1 logFC WT/EQ | SMC1 total intensity | SMC2 logFC WT/EQ | SMC2 total intensity | HUVEC logFC WT/EQ | HUVEC total intensity | Cleavage site | Substrate location |
---|---|---|---|---|---|---|---|---|
AEBP1_37 | 3.41 | 4.54E+07 | 3.48 | 8.64E+07 | 4.34 | 5.26E+06 | EIEE↓FLEG | N-terminal region |
COL18A1_1505 | 1.42 | 1.68E+09 | 1.47 | 1.11E+10 | 3.88 | 1.34E+11 | EVAA↓LQPP | NC1 hinge region |
EFEMP1_124 | 4.43 | 3.19E+09 | 3.27 | 2.58E+10 | 3.31 | 2.92E+11 | ASAA↓AVAG | Within atypical EGF1 |
EFEMP1_125 | 2.25 | 2.03E+09 | 2.42 | 7.46E+09 | 2.45 | 9.33E+10 | SAAA↓VAGP | Within atypical EGF1 |
FBN1_29 | 4.49 | 1.26E+10 | 3.57 | 2.40E+10 | 3.91 | 1.85E+09 | ADAN↓LEAG | N-term/propeptide |
FLNA_1285 | 5.06 | 5.65E+08 | 5.72 | 1.38E+09 | 3.81 | 2.92E+09 | DARA↓LTQT | Filiamin 11 |
FN1_36 | 3.34 | 4.40E+10 | 3.69 | 1.54E+11 | 3.45 | 1.76E+11 | QAQQ↓MVQP | N-terminal region |
FN1_40 | 4.27 | 3.67E+09 | 2.66 | 3.29E+09 | 2.96 | 1.37E+10 | MVQP↓QSPV | N-terminal region |
FN1_1143 | 0.86 | 1.36E+08 | 2.8 | 9.55E+08 | 3.68 | 4.67E+09 | VVSG↓LTPG | Fibronectin type-III 6 |
FN1_1656 | 1.52 | 4.82E+09 | 1.86 | 2.00E+10 | 1.63 | 2.23E+10 | KWLP↓SSSP | Fibronectin type-III 12 |
HSPG2_275 | 2.49 | 2.35E+07 | 2.1 | 1.16E+08 | 0.77 | 2.67E+07 | APQP↓LLPG | Linker (LDL-A1/A2) |
HSPG2_1863 | 2.03 | 1.64E+08 | 1.9 | 1.42E+08 | 2.53 | 4.95E+08 | ASGT↓LSAP | Ig-like C2-type 3 |
HSPG2_2331 | 2.04 | 7.64E+07 | 1.91 | 2.66E+08 | 1.73 | 2.45E+09 | GTQG↓ANLA | Ig-like C2-type 8 |
IGFBP7_101 | 3.69 | 5.68E+07 | 2.57 | 4.97E+08 | 1.89 | 1.12E+08 | KAGA↓AAGG | IGFBP N-term region |
LOX_124 | 3.62 | 1.65E+08 | 3.45 | 1.64E+09 | 3.35 | 8.74E+07 | TARH↓WFQA | N-term/propeptide |
LOXL2_ 37 | 3.28 | 5.20E+09 | 3.3 | 9.09E+09 | 3.48 | 1.61E+09 | YPEY↓FQQP | N-terminal region |
LTBP1_393 | 3.89 | 9.30E+08 | 6.3 | 6.93E+09 | 3.33 | 9.19E+08 | AADT↓LTAT | Linker (EGF1/2) |
LTBP3_239 | 2.62 | 1.25E+08 | 1.59 | 8.70E+08 | 1.57 | 5.41E+08 | PPEA↓SVQV | Linker (EGF1/TB1) |
MAP4_815 | 1.19 | 5.58E+08 | 1.7 | 5.60E+08 | 2.51 | 6.08E+08 | TTAA↓AVAS | Pro-rich linker |
MAP4_816 | 1.62 | 1.57E+08 | 2.51 | 8.83E+08 | 2.05 | 2.73E+08 | TAAA↓VAST | Pro-rich linker |
NID2_291 | 4.04 | 2.94E+09 | 3.16 | 6.67E+09 | 2.9 | 5.96E+08 | LSAA↓HSSV | Linker (NIDO/EGF1) |
SERBP1_60 | 2.1 | 2.09E+08 | 1.62 | 2.49E+08 | 0.94 | 2.20E+08 | QAAA↓QTNS | N-terminal region |
SERBP1_116 | 2.55 | 4.56E+08 | 3.25 | 7.27E+09 | 2.44 | 4.39E+09 | RPDQ↓QLQG | N-terminal region |
SRGN_59 | 4.33 | 2.70E+09 | 3.66 | 2.86E+09 | 1.17 | 2.32E+08 | PMFE↓LLPG | N-term (after disulfide) |
High confidence ADAMTS7 cleavage sites consistently detected in all TAILS discovery datasets.
Fold change enrichment with ADAMTS7 activity (logFC WT/EQ) and TAILS peptide ID total intensity displaying relative abundance in each TAILS experiment is shown for the 24 candidate sites.
Sequence context for each cleavage site is shown (P4 through P4′) and location within the target candidate protein.
Validation of ADAMTS7 Preference for EFEMP1 Cleavage at Adjacent Sites
EFEMP1 is a secreted extracellular matrix protein with multiple EGF domains and carboxyl-terminal Fibulin domain. The first EGF domain is atypical and contains an extended linker region with documented sensitivity to proteases and is also the location of candidate ADAMTS7 123.124 and 124.125 cleavage sites (Fig. 5A). By comparison, a previous TAILS experiment with ADAMTS3 reported EFEMP1 cleavage at 122.123 and 123.124, although no EFEMP1 cleavage sites were identified in TAILS experiments for paralogs ADAMTS2 or ADAMTS14 (25). MMP3 and MMP7 are reported to cleave EFEMP1 at 124.125, and notably, there was an observed background cleavage at 123.124 within experiments from the same study (34). Data from our TAILS experiments consistently showed a higher logFC and total intensities for the 123.124 site, predicting 2 to 3 fold more cleavage at 123.124 (ASAA↓AVAG) than the adjacent 124.125 (SAAA↓VAGP) site (Table 1). To verify that EFEMP1 is a substrate for ADAMTS7 and to compare the relative amount of cleavage at the 123.124 and 124.125 sites, we performed two key validation experiments.
We first examined the specificity of ADAMTS7 cleavage to endogenously expressed EFEMP1. HUVEC express higher levels of EFEMP1 than SMC, as shown by higher total intensities of EFEMP1 spectra from the secretome experiments (supplemental Table S3). Similar to the initial HUVEC discovery experiment, we expressed full-length mouse ADAMTS7 WT and EQ using adenovirus and examined the concentrated medium. As expected, ADAMTS7 WT migrated at two bands matching the observed autocleavage event in the mucin domain (Fig. 5B). Using an antibody to an EFEMP1 epitope c-terminal to the candidate ADAMTS7 cleavage sites, we detected a single lower mobility band consistent with cleavage within the atypical first EGF domain. MS analysis of these bands following digestion with trypsin or chymotrypsin identified semi-tryptic and semi-chymotryptic peptides indicating cleavage at 123.124 and 124.125 sites from the Ad-ADAMTS7 WT sample. Importantly, these sites were not detected in the Luc or EQ samples while our methods were sensitive enough to identify cleavage outside of the first EGF repeat in all samples, which were presumably independent of ADAMTS7 activity. Using the area for each of these observed spectra cumulatively, we assigned a relative abundance to each of these detected cleavage products and observed a preference for 123.124 over 124.125 (Fig. 5C).
Next, we analyzed the site and preference of EFEMP1 cleavage by ADAMTS7 in a binary in vitro system. We obtained commercially purified full-length epitope tagged HA-EFEMP1 and combined it with our purified full-length mouse ADAMTS7 S3A WT or ADAMTS7 S3A EQ for 4 h at 37 °C. The engineered ADAMTS7 S3A constructs alter the CS-GAG attachment consensus from “SGSGS” to “AGAGA” to prevent GAG-modification and allow for full-length ADAMTS7 purification as previously described (28). Mobility of EFEMP1 was detected by Western blot using samples in reducing or nonreducing conditions. Full-length HA-EFEMP1 was detected at the expected position, and a lower band was detected matching the predicted HA-tagged amino terminus after cleavage by ADAMTS7 in the atypical first EGF domain (Fig. 5D). The antibody to the EFEMP1 carboxyl terminus provided a stronger signal under nonreducing than reducing conditions. Here, we could detect an abundant lower band at 40 kDa consistent with ADAMTS7 cleavage under reducing and nonreducing conditions. A faint band at this position was also present in the no enzyme and ADAMTS7 EQ control reactions under nonreducing conditions raising the possibility of background cleavage from the commercial EFEMP1. We performed a second experiment with digestion for 12 h and examined the products by Coomassie staining (Fig. 5E). In the WT-treated condition, we could detect two lower bands matching to the amino and carboxyl-terminal fragments of EFEMP1, consistent with ADAMTS7 cleavage restricted to the atypical first EGF domain. Corresponding bands from this gel were analyzed by mass spectrometry to investigate the relative abundance of these events. Here, we identified the suspected background cleavage under conditions with no enzyme or with ADAMTS7 S3A EQ treatment, however, these occurrences were relatively minor compared to the activity stimulated by ADAMTS7 S3A WT (Figs. 5F and S9). From this binary in vitro system, we were able to demonstrate ADAMTS7-mediated cleavage of EFEMP1 in the atypical first EGF domain and confirm a preference for 123.124 over 124.125 cleavage.
In total, our detection and quantitation of both endogenous and purified EFEMP1 are consistent with the findings from three independent TAILS discovery experiments to identify new ADAMTS7 substrates.
Discussion
In this study, we performed TMT-TAILS to identify substrates for ADAMTS7 from the secretomes of vascular smooth muscle and endothelial cells. Each of the three independent TAILS experiments identified previously unknown candidate substrate cleavage sites associated with ADAMTS7 activity. To confirm our findings, we presented the validation of two adjacent cleavage sites 123.124 (ASAA↓AVAG) and 124.125 (SAAA↓VAGP) within the atypical first EGF repeat of EFEMP1.
EFEMP1, commonly known as Fibulin-3, is a secreted extracellular matrix protein highly expressed in the vasculature in a pattern overlapping with ADAMTS7 (GTEx Portal V8). Targeted mutation of mouse Efemp1 resulted in a viable KO mouse with abnormal connective tissue due to impaired elastogenesis, including a propensity for hernias and early aging phenotypes (35). Similar connective tissue disorders were found in a human patient with EFEMP1-truncating mutations (36). In contrast, a recurrent R345W gain-of-function mutation in the central region of EFEMP1 results autosomal dominant Doyne honeycomb retinal dystrophy (37). Thus, there are no readily apparent correlations between the known EFEMP1 gain- or loss-of-function phenotypes and the atheroprotection conferred from ADAMTS7 loss of function. While Adamts7 KO mice do not have a reported connective tissue disorder, a recent report describes abnormal collagen fibrillogenesis in Adamts7/Adamts12 double KO mice resulting in tendon heterotopic ossification (38). Compensation and substrate redundancies between the paralogs ADAMTS7 and ADAMTS12 may explain the lack of overlap from the individual enzyme loss-of-function phenotypes and the diseases associated with mutations in the candidate substrates.
Although it is presently unclear if ADAMTS7 regulated EFEMP1 cleavage will impact vascular phenotypes, experimental evidence shows that EFEMP1 124.125 cleavage by MMP3/MMP7 may alter interacting binding partners (34). Within the family of fibulin proteins, EFEMP1/Fibulin-3 is similar in structure to EFEMP2/Fibulin-4 and FBLN5/Fibulin-5 (37), however, neither fibulin-4 nor fibulin-5 were identified as ADAMTS7 candidate substrates from our experiments. In the related family member FBLN2/Fibulin-2, adjacent sites 258.259 (TAAA↓ALGP) and 259.260 (AAAA↓LGPP) in the N-terminal cysteine-free region were identified as candidate sites in both of the SMC TAILS experiments. Cleavage by ADAMTS7 at this location would release the FBLN2 N-terminal cysteine-rich domain with reported roles in FBLN2 oligomerization (39). Further work will be needed to understand the consequence of ADAMTS7 cleavage of the N-terminal regions of fibulins 2 and 3 in disease biology.
In the active form of full-length mouse ADAMTS7, we consistently observed a lower band at 150 kDa in the medium from SMC, HUVEC, and HEK293 fibroblasts. Amino terminal sequencing identified the WT 150 kDa band beginning at phenylalanine 1062 (FEEPHPDL) (28). In this study, TAILS experiments digested with AspN identified significantly regulated peptides in the WT/EQ comparison to support a predominant ADAMTS7 autocleavage event at 1061.1062 (SYGS↓FEEP) nearby the conserved Ser991 CS-GAG attachment site in the mucin domain. Removal of the amino acids 1062 to 1657 would preserve a GAG-modified ADAMTS7 that lacks a carboxyl terminus normally thought to be required for substrate recognition, potentially changing the exosite substrate specificity for ADAMTS7. Additionally, the mucin domain GAG modification itself may contribute to ADAMTS7 substrate recognition and/or specificity. However, we note in our validation experiments with EFEMP1, the absence of the GAG modification in the purified ADAMTS7 S3A construct did not alter the apparent preference or specificity for the 123.124 and 124.125 cleavage sites.
The mouse ADAMTS7 autocleavage site is adjacent to one of the few highly conserved regions within the mucin domain and is partially conserved in human ADAMTS7 1080.1081 (SYGP↓SEEP), although we were unable to confirm autocleavage for WT human ADAMTS7. Cleavage at this position was not identified in a TAILS experiment using a human ADAMTS7 truncated after the mucin domain, lacking the carboxyl-terminal TSR repeats 5 to 8 and PLAC domains (27). From this study, they identified autocleavage events in the spacer domain for human ADAMTS7 (729.730 RIQE↓VAEA and 732.733 EVAE↓AANF) with confirmed amino terminal sequencing of the latter site. Within our TAILS datasets using mouse ADAMTS7, we identified analogous autocleavage sites in the spacer domain at 714.715 (LIEE↓VAEA) for all TAILS experiments and 717.718 (EVAE↓AANF) in the SMC2 and HUVEC TAILS experiments (supplemental Table S5). Analysis of the WT/EQ significantly enriched peptides revealed multiple sites in the prodomain consistent with autocleavage events, with the most abundant site at 170.171 (HAQP↓HVVY). Although this site is entirely conserved in human ADAMTS7, it was not identified in the previous TAILS study (27). The ADAMTS7 prodomain contains a cysteine switch motif, which acts to maintain enzyme latency through interactions with the Zinc metal in the active site (40). The mouse ADAMTS7 prodomain is processed by furin protease at 60.61 and 220.221, with only the second furin processing site removing the inhibitory cysteine switch at Cys194 (17). Although mouse ADAMTS7 cleavage at 170.171 would retain the cysteine switch to the catalytic domain, it is possible this may affect the progressive zymogen processing by furin. Further work will be needed to understand how the secreted ADAMTS7 maintains latency and if autocleavage plays a role independent of, or in concert with, furin processing in the prodomain.
Analysis of P4 through P4′ positions from ADAMTS7 autocleavage and candidate substrate cleavage sites from the TAILS experiments suggests that ADAMTS7 is able to process substrates in a variety of contexts (Fig. 3). Visually from the iceLogo plots, a preference for Alanine in the P1 position and Leucine in the P1′ was the most common, however, many candidate cleavage sites identified in multiple TAILS experiments did not conform to the consensus logo plots. ADAMTS7 TAILS substrate specificity was notably similar to those reported for MMP2 TAILS experiments showing a preference for PAA↓L in the P3 through P1′ positions without an absolute requirement for any of these residues (24). Compared to the preferences from other ADAMTS family members, there appear to be more similarities with the procollagen N-proteinases ADAMTS2/3/14 preference for A/P↓Q (25) than the aggrecanases ADAMTS4/5 preference for E↓A/L (41). In peptide-based enzymatic assays, cleavage of aggrecan (TEGE↓ARGS or TAQE↓AGEG) or versican (EAAE↓ARRG) has been reported for ADAMTS1, ADAMTS4, ADAMTS5, ADAMTS8, ADAMTS9, and ADAMTS20 (42, 43, 44, 45, 46, 47, 48). Although ADAMTS7 activity generated many candidate E↓A cleavage sites within our TAILS datasets, we did not detect significantly regulated peptides for aggrecan or versican at these sites to suggest similar cleavage events. This is consistent with the initial characterization of ADAMTS7 reporting no activity for aggrecan or versican using neo-epitope antibodies for the common aggrecanase cleavage sites (17). In the previous ADAMTS7 TAILS study, a P1 glutamate preference was observed in the latent-transforming growth factor beta-binding proteins 3 and 4, specifically at LTBP3 220.221 (ISAE↓VQAP) and LTPBP4 229.230 (HPQE↓ASVV) (27). In our study, the LTPBP4 229.230 site was not significantly regulated in the TAILS datasets, but nearby LTBP4 233.234 (ASVV↓VHQV) was significantly regulated in the SMC1 dataset. The previously reported LTBP3 220.221 site was significant in the HUVEC TAILS experiment but not in the SMC TAILS datasets. Within this region, LTPBP3 238.239 (PPEA↓SVQV) was identified in all three TAILS experiments, and alignment of LTBP3 and LTBP4 brings this cleavage site very close to the reported LTBP4 229.230 (HPQE↓ASVV) site. Collectively, these examples from our TAILS experiments demonstrate that ADAMTS7 exhibits broad specificity for the amino acids at the site of cleavage. Nonetheless, the repeated discovery of the same diverse cleavage sites suggests that substrate specificity remains a feature of ADAMTS7 and likely relies on other nonenzymatic domains for substrate recognition.
Within our list of candidate substrate cleavage sites identified in more than one TAILS experiment, we observed that while some candidate substrates displayed cleavage within a specific region of the protein, others fell into a different category where multiple identified cleavage sites were present throughout the protein in multiple domains. This was the case for fibronectin and HSPG2/perlecan for all the TAILS experiments and for COL1A2 from the SMC TAILS experiments. One possibility is that ADAMTS7 normally associates with these proteins in the extracellular matrix and under our overexpression conditions, opportunistically cleaves these substrates in a less regulated manner. In contrast, other identified substrates displayed a more restricted pattern of cleavage confined to a particular region, which may suggest a more regulated interaction and cleavage process. This appears to be the case with EFEMP1 and may hold true for other candidate substrates such as LOX, which displayed a string of candidate sites in the prodomain at positions 122.123, 123.124, 124.125, and 125.126 from multiple TAILS experiments. The candidate site that was present in all three TAILS experiments was at LOX 123.124 (TARH↓WFQA) upstream from the zymogen processing site at 162.163 by the procollagen C-proteinase BMP1. The ADAMTS7-mediated prodomain LOX cleavage sites are distinct in location from those identified from the ADAMTS2/3/14 TAILS experiments or from the reported LOX catalytic domain cleavage by ADAMTS2/14 (25, 49). The LOX prodomain is essential for secretion and assists with substrate interaction. Following BMP1 cleavage, the LOX prodomain also has the ability to act as a bioactive product with tumor suppressor function independent from LOX enzymatic domain (50). Therefore, ADAMTS7 cleavage of the LOX prodomain may impact multiple functions of the pro-LOX zymogen association with substrates, pro-LOX zymogen processing, or the modification/inactivation of the bioactive free LOX propeptide. Another consequence of ADAMTS7-mediated LOX prodomain cleavage could be the secondary activation of LOX enzymatic function. From our independent TAILS experiments, we identified multiple instances of cleavage sites preceding the enzymatic domains of ADAM9, LOX, LOXL1, and LOXL2 (supplemental Table S6). Therefore, we cannot rule out the possibility of secondary proteolytic cleavage events due to ADAMTS7-mediated enzyme activation, similar to the reported meprin β activation of ADAM10 through prodomain cleavage (51). In the cases of EFEMP1 and LOX, the ADAMTS7 cleavage events are at adjacent amino acid positions which may produce similar biological effects, however, these phenomena may complicate the development of neo-epitope specific antibodies to a single defined cleavage site similar to the reagents developed for the aggrecanases ADAMTS4 and ADAMTS5.
In addition to inactivating structural and bioactive extracellular matrix factors, ADAMTS7 TAILS candidate cleavage sites have the potential to produce known bioactive products from unique cleavage sites in the hinge regions of COL18A1 (Collagen type XVIII alpha-1) and CTGF (Connective Tissue Growth Factor). Endostatin and endostatin-like fragments with anti-angiogenic properties originate from the carboxyl-terminal region of COL18A1 following MMP/elastase/cathepsin cleavage within the hinge region (amino acids 1502–1571). The ADAMTS7 candidate cleavage site COL18A1 1504.1505 (EVAA↓LQPP) found in all three TAILS experiments is located near the beginning of the hinge region, upstream from the first known MMP cleavage site at 1511.1512 by MMP7 (52). Our findings suggest that ADAMTS7 is capable of producing an endostatin-like fragment with similar anti-angiogenic activities. Additional significantly upregulated peptides corresponding to nearby cleavage at 1501.1502 (TDNE↓VAAL) and 1503.1504 (NEVA↓ALQP) were present in the HUVEC dataset and may correlate with the increase in COL18A1 protein levels detected in the HUVEC secretome. In fact, COL18A1 was one of the few examples where a protein significantly upregulated in the WT secretome, but not in the Luc or EQ secretomes, consistent with COL18A1 upregulation in response to ADAMTS7 catalytic activity in HUVEC. This may represent a feed forward circuit with ADAMTS7 activity stimulating both the upregulation and cleavage of COL18A1, resulting in an endostatin-like matrikine.
CTGF, also known as CCN2, is a secreted multidomain matricellular protein with a central proteolytically sensitive hinge region (amino acids 168–197). It has been previously shown that cleavage in the hinge domain at 180.181 (PALA↓AYRL) by MMPs generates a bioactive carboxyl-terminal fragment containing the TSR and cysteine-rich CT domains (53). CTGF is highly expressed in the HUVEC cell line and specifically in the HUVEC TAILS experiment, we identified significantly regulated peptides representing CTGF hinge region cleavage sites at 172.173 (PKDQ↓TVVG), 173.174 (KDQT↓VVGP), 177.178 (VVGP↓ALAA), and 186.187 (RLED↓TFGP). Although multiple protease cleavage sites have been reported for the CTGF hinge region (54, 55, 56), very few known sites overlap with the ADAMTS7 TAILS candidates, with the exception of the 186.187 site identified in an ADAMTS3 TAILS study (25). CTGF was previously identified as a potential substrate for ADAMTS7 through a yeast two-hybrid screen, demonstrating a requirement for the ADAMTS7 mucin, TSR5-8 and PLAC domains for interaction with the CTGF amino terminal region (57). Furthermore, it was shown in an in vitro cleavage assay that the ADAMTS7 catalytic domain processed CTGF, producing bands compatible with cleavage in the hinge region. A similar binding interaction between CTGF and the paralog ADAMTS12 was mapped to the mucin and TSR5-8 regions of ADAMTS12, along with the evidence of CTGF processing from cotransfected cells (58). Our ADAMTS7 TAILS study provides further evidence for a connection between ADAMTS7 and CTGF from an unbiased proteomic method. Based on the data from our HUVEC TAILS experiment, a cleavage site preference of 172.173 or 186.187 in the CTGF-hinge region would be predicted based on logFC values and total spectra intensities. Further work will be needed to validate these and other candidates identified from our TAILS discovery sets.
Although ADAMTS7 is characterized as a COMP protease, we were unable to identify significantly regulated peptides consistent with an ADAMTS7 candidate cleavage site. COMP protein and peptides were identified in each of the TAILS and secretome experiments, however, the total peptide coverage ranged from 18 to 23% indicating that a significant portion of COMP was not captured and quantitated in our TAILS experiments. In the case of COMP, this may be due to a limitation in the expression level of the substrate of interest in our cell lines and the depth of amino acid coverage in our experiments. Another reported ADAMTS7 substrate is thrombospondin 1 (10) and both the SMC and HUVEC cell lines expressed much higher levels of this protein resulting in 71 to 78% total protein coverage. Within THBS1, two regulated peptides were identified in separate experiments (629.630 in HUVEC and 971.972 in SMC1), however, these were not consistently identified in the other TAILS datasets. Although our coverage for THBS1 was much higher than that for COMP, we cannot definitively rule out the possibility that a true ADAMTS7 cleavage site was missed in our TAILS experiments.
Achieving appropriate coverage and depth for a given substrate is a challenge for any unbiased degradomics technique and we attempted to improve these traits in our successive TAILS experiments. For instance, in the first TAILS experiment, we digested the TMT-labeled peptides with only trypsin, limiting the identification of candidate sites to peptides greater than five residues proceeding a tryptic R. or K. site that could be identified through LC-MS/MS. To improve the peptide coverage for ADAMTS7 substrates in the following TAILS experiments, we analyzed both trypsin and AspN-digested products. Additionally, we analyzed the peptides with a relaxed AspN condition to capture both .D or .E cleavage events to increase the number of identifiable spectra. Compared to an analysis of our datasets with a strict AspN (.D only) consensus, application of relaxed AspN condition further increased the number of quantified TMT-labeled peptides by 23% for the SMC2 and 17% for the HUVEC TAILS experiments. Including additional enzymes such as chymotrypsin with cleavage sites distinct from trypsin and AspN would likely increase the depth and coverage even further to capture additional ADAMTS7 candidate cleavage sites.
Based on the diverse cleavage site specificity from ADAMTS7 in our TAILS experiments, our experimental design likely benefited from using the full-length protein with the carboxyl-terminal substrate interaction domains. Additional ADAMTS7 site specificity could be investigated using a PICS (Proteomic Identification of protease Cleavage Sites)-based strategy, utilizing a library of short peptides predigested with a specific enzyme like trypsin or LysC (41), however, this may not reliably identify endogenous substrate cleavage sites driven by exosite specificity. Consistent with our observations of ADAMTS7 cleavage site specificity, a broad specificity for the ADAMTS7 enzyme was observed in a library of internally quenched fluorogenic peptides where nearly half were appreciably cleaved (59).
In closing, this study represents the most comprehensive list of ADAMTS7 candidate substrates from the secreted factors of vascular smooth muscle and endothelial cells. Our method using 10 isobaric tags for a single TAILS experiment shows the power of performing an individual condition in triplicate and subtractive methods to select regulated peptides consistently associated with protease activity. And finally, we show the virtue of performing multiple TAILS experiments to prioritize candidate validation experiments from the list of overlapping discovery sets.
Data Availability
The original mass spectra and the protein sequence database used for searches have been deposited in the public proteomics repository MassIVE (http://massive.ucsd.edu/) and are accessible at ftp://massive.ucsd.edu/MSV000088508/.
supplemental Table S1 – TAILS experiment quantitation statistics.
supplemental Table S2 – TAILS datasets for the SMC1, SMC2, and HUVEC experiments.
supplemental Table S3 – Secretome datasets for the SMC1, SMC2, and HUVEC experiments.
supplemental Table S4 – TAILS-regulated peptide summary.
supplemental Table S5 – Annotated ADAMTS7 autocleavage sites from each TAILS experiment.
supplemental Table S6 –TAILS high confidence candidate substrate cleavage sites for each experiment and annotated overlap analysis of cleavage sites identified in multiple TAILS experiments.
Supplemental data
This article contains supplemental data.
Conflicts of interest
B. T. M., N. H. E., and Y. X. are named inventors on patent applications relating to ADAMTS7 assays and compounds. B. T. M., A. A., and N. H. E are named inventors on a patent application relating to ADAMTS7 biomarkers. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Acknowledgments
This work was supported by a research grant from Bayer AG within the Cardiovascular Disease Initiative at the Broad Institute. Dr Ellinor is supported by the National Institutes of Health (1RO1HL092577, K24HL105780) and the American Heart Association (18SFRN34110082). We thank Eric Spooner at the Whitehead Institute Proteomics Core Facility for assistance with mass spec protein identification and Michael Berne at Tufts University Core Facility for assistance with protein sequencing. This work was funded by a collaboration between Bayer AG and the Broad Institute.
Author contributions
B. T. M., H. K., C. C. M., A. A., H. S., N. H. E., V. K. K., S. A. C., and P. T. E. conceptualization; B. T. M., H. K., C. C. M., B. B., and K. R. C. formal analysis; B. T. M., C. C. M., A. A., D. L., K. B., N. R. P., and N. H. E. investigation; C. C. M., N. R. P., N. H. E., Dylan Laprise, and Y. X. resources; H. K. and K. R. C. data curation; B. T. M. visualization; B. T. M. writing–original draft; B. T. M., H. K., C. C. M., A. A., K. R. C., V. K. K., S. A. C. and P. T. E. writing–review and editing.
Funding and additional information
P. T. E has received sponsored research support from Bayer AG and IBM Health and he has consulted for Bayer AG, Novartis, MyoKardia, and Quest Diagnostics.
Supplemental Data
References
- 1.Reilly M.P., Li M., He J., Ferguson J.F., Stylianou I.M., Mehta N.N., Burnett M.S., Devaney J.M., Knouff C.W., Thompson J.R., Horne B.D., Stewart A.F., Assimes T.L., Wild P.S., Allayee H., et al. Identification of ADAMTS7 as a novel locus for coronary atherosclerosis and association of ABO with myocardial infarction in the presence of coronary atherosclerosis: Two genome-wide association studies. Lancet. 2011;377:383–392. doi: 10.1016/S0140-6736(10)61996-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Schunkert H., Konig I.R., Kathiresan S., Reilly M.P., Assimes T.L., Holm H., Preuss M., Stewart A.F., Barbalic M., Gieger C., Absher D., Aherrahrou Z., Allayee H., Altshuler D., Anand S.S., et al. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nat. Genet. 2011;43:333–338. doi: 10.1038/ng.784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Coronary Artery Disease Genetics, C. A genome-wide association study in Europeans and South Asians identifies five new loci for coronary artery disease. Nat. Genet. 2011;43:339–344. doi: 10.1038/ng.782. [DOI] [PubMed] [Google Scholar]
- 4.Pu X., Xiao Q., Kiechl S., Chan K., Ng F.L., Gor S., Poston R.N., Fang C., Patel A., Senver E.C., Shaw-Hawkins S., Willeit J., Liu C., Zhu J., Tucker A.T., et al. ADAMTS7 cleavage and vascular smooth muscle cell migration is affected by a coronary-artery-disease-associated variant. Am. J. Hum. Genet. 2013;92:366–374. doi: 10.1016/j.ajhg.2013.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pereira A., Palma Dos Reis R., Rodrigues R., Sousa A.C., Gomes S., Borges S., Ornelas I., Freitas A.I., Guerra G., Henriques E., Rodrigues M., Freitas S., Freitas C., Brehm A., Pereira D., et al. Association of ADAMTS7 gene polymorphism with cardiovascular survival in coronary artery disease. Physiol. Genomics. 2016;48:810–815. doi: 10.1152/physiolgenomics.00059.2016. [DOI] [PubMed] [Google Scholar]
- 6.Chan K., Pu X., Sandesara P., Poston R.N., Simpson I.A., Quyyumi A.A., Ye S., Patel R.S. Genetic Variation at the ADAMTS7 locus is associated with reduced severity of coronary artery disease. J. Am. Heart Assoc. 2017;6 doi: 10.1161/JAHA.117.006928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bengtsson E., Hultman K., Duner P., Asciutto G., Almgren P., Orho-Melander M., Melander O., Nilsson J., Hultgardh-Nilsson A., Goncalves I. ADAMTS-7 is associated with a high-risk plaque phenotype in human atherosclerosis. Sci. Rep. 2017;7:3753. doi: 10.1038/s41598-017-03573-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li H.W., Shen M., Gao P.Y., Li Z.R., Cao J.L., Zhang W.L., Sui B.B., Wang Y.X., Wang Y.J. Association between ADAMTS7 polymorphism and carotid artery plaque vulnerability. Medicine (Baltimore) 2019;98 doi: 10.1097/MD.0000000000017438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bauer R.C., Tohyama J., Cui J., Cheng L., Yang J., Zhang X., Ou K., Paschos G.K., Zheng X.L., Parmacek M.S., Rader D.J., Reilly M.P. Knockout of Adamts7, a novel coronary artery disease locus in humans, reduces atherosclerosis in mice. Circulation. 2015;131:1202–1213. doi: 10.1161/CIRCULATIONAHA.114.012669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kessler T., Zhang L., Liu Z., Yin X., Huang Y., Wang Y., Fu Y., Mayr M., Ge Q., Xu Q., Zhu Y., Wang X., Schmidt K., de Wit C., Erdmann J., et al. ADAMTS-7 inhibits re-endothelialization of injured arteries and promotes vascular remodeling through cleavage of thrombospondin-1. Circulation. 2015;131:1191–1201. doi: 10.1161/CIRCULATIONAHA.114.014072. [DOI] [PubMed] [Google Scholar]
- 11.Wang L., Zheng J., Bai X., Liu B., Liu C.J., Xu Q., Zhu Y., Wang N., Kong W., Wang X. ADAMTS-7 mediates vascular smooth muscle cell migration and neointima formation in balloon-injured rat arteries. Circ. Res. 2009;104:688–698. doi: 10.1161/CIRCRESAHA.108.188425. [DOI] [PubMed] [Google Scholar]
- 12.Zhang L., Yu F., Wang L., Zheng J., Du Y., Huang Y., Liu B., Wang X., Kong W. ADAMTS-7 promotes vascular smooth muscle cells proliferation in vitro and in vivo. Sci. China Life Sci. 2015;58:674–681. doi: 10.1007/s11427-015-4843-2. [DOI] [PubMed] [Google Scholar]
- 13.Liu C.J., Kong W., Ilalov K., Yu S., Xu K., Prazak L., Fajardo M., Sehgal B., Di Cesare P.E. ADAMTS-7: A metalloproteinase that directly binds to and degrades cartilage oligomeric matrix protein. FASEB J. 2006;20:988–990. doi: 10.1096/fj.05-3877fje. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Liu C.J., Kong W., Xu K., Luan Y., Ilalov K., Sehgal B., Yu S., Howell R.D., Di Cesare P.E. ADAMTS-12 associates with and degrades cartilage oligomeric matrix protein. J. Biol. Chem. 2006;281:15800–15808. doi: 10.1074/jbc.M513433200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bai X.H., Wang D.W., Kong L., Zhang Y., Luan Y., Kobayashi T., Kronenberg H.M., Yu X.P., Liu C.J. ADAMTS-7, a direct target of PTHrP, adversely regulates endochondral bone growth by associating with and inactivating GEP growth factor. Mol. Cell Biol. 2009;29:4201–4219. doi: 10.1128/MCB.00056-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kelwick R., Desanlis I., Wheeler G.N., Edwards D.R. The ADAMTS (A disintegrin and metalloproteinase with thrombospondin motifs) family. Genome Biol. 2015;16:113. doi: 10.1186/s13059-015-0676-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Somerville R.P., Longpre J.M., Apel E.D., Lewis R.M., Wang L.W., Sanes J.R., Leduc R., Apte S.S. ADAMTS7B, the full-length product of the ADAMTS7 gene, is a chondroitin sulfate proteoglycan containing a mucin domain. J. Biol. Chem. 2004;279:35159–35175. doi: 10.1074/jbc.M402380200. [DOI] [PubMed] [Google Scholar]
- 18.Cal S., Arguelles J.M., Fernandez P.L., Lopez-Otin C. Identification, characterization, and intracellular processing of ADAM-TS12, a novel human disintegrin with a complex structural organization involving multiple thrombospondin-1 repeats. J. Biol. Chem. 2001;276:17932–17940. doi: 10.1074/jbc.M100534200. [DOI] [PubMed] [Google Scholar]
- 19.Mead T.J., Apte S.S. ADAMTS proteins in human disorders. Matrix Biol. 2018;71-72:225–239. doi: 10.1016/j.matbio.2018.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Petri A., Kim H.J., Xu Y., de Groot R., Li C., Vandenbulcke A., Vanhoorelbeke K., Emsley J., Crawley J.T.B. Crystal structure and substrate-induced activation of ADAMTS13. Nat. Commun. 2019;10:3781. doi: 10.1038/s41467-019-11474-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Santamaria S. ADAMTS-5: A difficult teenager turning 20. Int. J. Exp. Pathol. 2020;101:4–20. doi: 10.1111/iep.12344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Brebion F., Gosmini R., Deprez P., Varin M., Peixoto C., Alvey L., Jary H., Bienvenu N., Triballeau N., Blanque R., Cottereaux C., Christophe T., Vandervoort N., Mollat P., Touitou R., et al. Discovery of GLPG1972/S201086, a potent, selective, and orally bioavailable ADAMTS-5 inhibitor for the treatment of osteoarthritis. J. Med. Chem. 2021;64:2937–2952. doi: 10.1021/acs.jmedchem.0c02008. [DOI] [PubMed] [Google Scholar]
- 23.Kleifeld O., Doucet A., Prudova A., auf dem Keller U., Gioia M., Kizhakkedathu J.N., Overall C.M. Identifying and quantifying proteolytic events and the natural N terminome by terminal amine isotopic labeling of substrates. Nat. Protoc. 2011;6:1578–1611. doi: 10.1038/nprot.2011.382. [DOI] [PubMed] [Google Scholar]
- 24.Prudova A., auf dem Keller U., Butler G.S., Overall C.M. Multiplex N-terminome analysis of MMP-2 and MMP-9 substrate degradomes by iTRAQ-TAILS quantitative proteomics. Mol. Cell Proteomics. 2010;9:894–911. doi: 10.1074/mcp.M000050-MCP201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bekhouche M., Leduc C., Dupont L., Janssen L., Delolme F., Vadon-Le Goff S., Smargiasso N., Baiwir D., Mazzucchelli G., Zanella-Cleon I., Dubail J., De Pauw E., Nusgens B., Hulmes D.J., Moali C., et al. Determination of the substrate repertoire of ADAMTS2, 3, and 14 significantly broadens their functions and identifies extracellular matrix organization and TGF-beta signaling as primary targets. FASEB J. 2016;30:1741–1756. doi: 10.1096/fj.15-279869. [DOI] [PubMed] [Google Scholar]
- 26.Leduc C., Dupont L., Joannes L., Monseur C., Baiwir D., Mazzucchelli G., Deroanne C., Colige A., Bekhouche M. In vivo N-terminomics highlights novel functions of ADAMTS2 and ADAMTS14 in skin collagen matrix building. Front. Mol. Biosci. 2021;8:643178. doi: 10.3389/fmolb.2021.643178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Colige A., Monseur C., Crawley J.T.B., Santamaria S., de Groot R. Proteomic discovery of substrates of the cardiovascular protease ADAMTS7. J. Biol. Chem. 2019;294:8037–8045. doi: 10.1074/jbc.RA119.007492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mizoguchi T., MacDonald B.T., Bhandary B., Popp N.R., Laprise D., Arduini A., Lai D., Zhu Q.M., Xing Y., Kaushik V.K., Kathiresan S., Ellinor P.T. Coronary disease association with ADAMTS7 is due to protease activity. Circ. Res. 2021;129:458–470. doi: 10.1161/CIRCRESAHA.121.319163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Keshishian H., Burgess M.W., Specht H., Wallace L., Clauser K.R., Gillette M.A., Carr S.A. Quantitative, multiplexed workflow for deep analysis of human blood plasma and biomarker discovery by mass spectrometry. Nat. Protoc. 2017;12:1683–1701. doi: 10.1038/nprot.2017.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shadforth I.P., Dunkley T.P., Lilley K.S., Bessant C. i-Tracker: For quantitative proteomics using iTRAQ. BMC Genomics. 2005;6:145. doi: 10.1186/1471-2164-6-145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Crooks G.E., Hon G., Chandonia J.M., Brenner S.E. WebLogo: A sequence logo generator. Genome Res. 2004;14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Colaert N., Helsens K., Martens L., Vandekerckhove J., Gevaert K. Improved visualization of protein consensus sequences by iceLogo. Nat. Methods. 2009;6:786–787. doi: 10.1038/nmeth1109-786. [DOI] [PubMed] [Google Scholar]
- 33.Erdmann J., Kessler T., Munoz Venegas L., Schunkert H. A decade of genome-wide association studies for coronary artery disease: The challenges ahead. Cardiovasc. Res. 2018;114:1241–1257. doi: 10.1093/cvr/cvy084. [DOI] [PubMed] [Google Scholar]
- 34.Djokic J., Fagotto-Kaufmann C., Bartels R., Nelea V., Reinhardt D.P. Fibulin-3, -4, and -5 are highly susceptible to proteolysis, interact with cells and heparin, and form multimers. J. Biol. Chem. 2013;288:22821–22835. doi: 10.1074/jbc.M112.439158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.McLaughlin P.J., Bakall B., Choi J., Liu Z., Sasaki T., Davis E.C., Marmorstein A.D., Marmorstein L.Y. Lack of fibulin-3 causes early aging and herniation, but not macular degeneration in mice. Hum. Mol. Genet. 2007;16:3059–3070. doi: 10.1093/hmg/ddm264. [DOI] [PubMed] [Google Scholar]
- 36.Driver S.G.W., Jackson M.R., Richter K., Tomlinson P., Brockway B., Halliday B.J., Markie D.M., Robertson S.P., Wade E.M. Biallelic variants in EFEMP1 in a man with a pronounced connective tissue phenotype. Eur. J. Hum. Genet. 2020;28:445–452. doi: 10.1038/s41431-019-0546-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Timpl R., Sasaki T., Kostka G., Chu M.L. Fibulins: A versatile family of extracellular matrix proteins. Nat. Rev. Mol. Cell Biol. 2003;4:479–489. doi: 10.1038/nrm1130. [DOI] [PubMed] [Google Scholar]
- 38.Mead T.J., McCulloch D.R., Ho J.C., Du Y., Adams S.M., Birk D.E., Apte S.S. The metalloproteinase-proteoglycans ADAMTS7 and ADAMTS12 provide an innate, tendon-specific protective mechanism against heterotopic ossification. JCI Insight. 2018;3 doi: 10.1172/jci.insight.92941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Sasaki T., Mann K., Wiedemann H., Gohring W., Lustig A., Engel J., Chu M.L., Timpl R. Dimer model for the microfibrillar protein fibulin-2 and identification of the connecting disulfide bridge. EMBO J. 1997;16:3035–3043. doi: 10.1093/emboj/16.11.3035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Cerda-Costa N., Gomis-Ruth F.X. Architecture and function of metallopeptidase catalytic domains. Protein Sci. 2014;23:123–144. doi: 10.1002/pro.2400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Biniossek M.L., Niemer M., Maksimchuk K., Mayer B., Fuchs J., Huesgen P.F., McCafferty D.G., Turk B., Fritz G., Mayer J., Haecker G., Mach L., Schilling O. Identification of protease specificity by combining proteome-derived peptide libraries and quantitative proteomics. Mol. Cell Proteomics. 2016;15:2515–2524. doi: 10.1074/mcp.O115.056671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Rodriguez-Manzaneque J.C., Westling J., Thai S.N., Luque A., Knauper V., Murphy G., Sandy J.D., Iruela-Arispe M.L. ADAMTS1 cleaves aggrecan at multiple sites and is differentially inhibited by metalloproteinase inhibitors. Biochem. Biophys. Res. Commun. 2002;293:501–508. doi: 10.1016/S0006-291X(02)00254-1. [DOI] [PubMed] [Google Scholar]
- 43.Collins-Racie L.A., Flannery C.R., Zeng W., Corcoran C., Annis-Freeman B., Agostino M.J., Arai M., DiBlasio-Smith E., Dorner A.J., Georgiadis K.E., Jin M., Tan X.Y., Morris E.A., LaVallie E.R. ADAMTS-8 exhibits aggrecanase activity and is expressed in human articular cartilage. Matrix Biol. 2004;23:219–230. doi: 10.1016/j.matbio.2004.05.004. [DOI] [PubMed] [Google Scholar]
- 44.Westling J., Fosang A.J., Last K., Thompson V.P., Tomkinson K.N., Hebert T., McDonagh T., Collins-Racie L.A., LaVallie E.R., Morris E.A., Sandy J.D. ADAMTS4 cleaves at the aggrecanase site (Glu373-Ala374) and secondarily at the matrix metalloproteinase site (Asn341-Phe342) in the aggrecan interglobular domain. J. Biol. Chem. 2002;277:16059–16066. doi: 10.1074/jbc.M108607200. [DOI] [PubMed] [Google Scholar]
- 45.Somerville R.P., Longpre J.M., Jungers K.A., Engle J.M., Ross M., Evanko S., Wight T.N., Leduc R., Apte S.S. Characterization of ADAMTS-9 and ADAMTS-20 as a distinct ADAMTS subfamily related to Caenorhabditis elegans GON-1. J. Biol. Chem. 2003;278:9503–9513. doi: 10.1074/jbc.M211009200. [DOI] [PubMed] [Google Scholar]
- 46.Silver D.L., Hou L., Somerville R., Young M.E., Apte S.S., Pavan W.J. The secreted metalloprotease ADAMTS20 is required for melanoblast survival. PLoS Genet. 2008;4 doi: 10.1371/journal.pgen.1000003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.McCulloch D.R., Nelson C.M., Dixon L.J., Silver D.L., Wylie J.D., Lindner V., Sasaki T., Cooley M.A., Argraves W.S., Apte S.S. ADAMTS metalloproteases generate active versican fragments that regulate interdigital web regression. Dev. Cell. 2009;17:687–698. doi: 10.1016/j.devcel.2009.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Sandy J.D., Westling J., Kenagy R.D., Iruela-Arispe M.L., Verscharen C., Rodriguez-Mazaneque J.C., Zimmermann D.R., Lemire J.M., Fischer J.W., Wight T.N., Clowes A.W. Versican V1 proteolysis in human aorta in vivo occurs at the Glu441-Ala442 bond, a site that is cleaved by recombinant ADAMTS-1 and ADAMTS-4. J. Biol. Chem. 2001;276:13372–13378. doi: 10.1074/jbc.M009737200. [DOI] [PubMed] [Google Scholar]
- 49.Rosell-Garcia T., Paradela A., Bravo G., Dupont L., Bekhouche M., Colige A., Rodriguez-Pascual F. Differential cleavage of lysyl oxidase by the metalloproteinases BMP1 and ADAMTS2/14 regulates collagen binding through a tyrosine sulfate domain. J. Biol. Chem. 2019;294:11087–11100. doi: 10.1074/jbc.RA119.007806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Trackman P.C. Functional importance of lysyl oxidase family propeptide regions. J. Cell Commun. Signal. 2018;12:45–53. doi: 10.1007/s12079-017-0424-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Jefferson T., Auf dem Keller U., Bellac C., Metz V.V., Broder C., Hedrich J., Ohler A., Maier W., Magdolen V., Sterchi E., Bond J.S., Jayakumar A., Traupe H., Chalaris A., Rose-John S., et al. The substrate degradome of meprin metalloproteases reveals an unexpected proteolytic link between meprin beta and ADAM10. Cell Mol. Life Sci. 2013;70:309–333. doi: 10.1007/s00018-012-1106-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Heljasvaara R., Aikio M., Ruotsalainen H., Pihlajaniemi T. Collagen XVIII in tissue homeostasis and dysregulation - lessons learned from model organisms and human patients. Matrix Biol. 2017;57-58:55–75. doi: 10.1016/j.matbio.2016.10.002. [DOI] [PubMed] [Google Scholar]
- 53.Kaasboll O.J., Gadicherla A.K., Wang J.H., Monsen V.T., Hagelin E.M.V., Dong M.Q., Attramadal H. Connective tissue growth factor (CCN2) is a matricellular preproprotein controlled by proteolytic activation. J. Biol. Chem. 2018;293:17953–17970. doi: 10.1074/jbc.RA118.004559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hashimoto G., Inoki I., Fujii Y., Aoki T., Ikeda E., Okada Y. Matrix metalloproteinases cleave connective tissue growth factor and reactivate angiogenic activity of vascular endothelial growth factor 165. J. Biol. Chem. 2002;277:36288–36295. doi: 10.1074/jbc.M201674200. [DOI] [PubMed] [Google Scholar]
- 55.de Winter P., Leoni P., Abraham D. Connective tissue growth factor: Structure-function relationships of a mosaic, multifunctional protein. Growth Factors. 2008;26:80–91. doi: 10.1080/08977190802025602. [DOI] [PubMed] [Google Scholar]
- 56.Butler G.S., Connor A.R., Sounni N.E., Eckhard U., Morrison C.J., Noel A., Overall C.M. Degradomic and yeast 2-hybrid inactive catalytic domain substrate trapping identifies new membrane-type 1 matrix metalloproteinase (MMP14) substrates: CCN3 (Nov) and CCN5 (WISP2) Matrix Biol. 2017;59:23–38. doi: 10.1016/j.matbio.2016.07.006. [DOI] [PubMed] [Google Scholar]
- 57.Pi L., Jorgensen M., Oh S.H., Protopapadakis Y., Gjymishka A., Brown A., Robinson P., Liu C., Scott E.W., Schultz G.S., Petersen B.E. A disintegrin and metalloprotease with thrombospondin type I motif 7: A new protease for connective tissue growth factor in hepatic progenitor/oval cell niche. Am. J. Pathol. 2015;185:1552–1563. doi: 10.1016/j.ajpath.2015.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wei J.L., Fu W., Hettinghouse A., He W.J., Lipson K.E., Liu C.J. Role of ADAMTS-12 in protecting against inflammatory arthritis in mice by interacting with and inactivating proinflammatory connective tissue growth factor. Arthritis Rheumatol. 2018;70:1745–1756. doi: 10.1002/art.40552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Muia J., Sadler J.E., Zhu J., Abdi M., Westfield L. The atherogenic metalloprotease ADAMTS7 secretion requires propeptide and substrate screening suggests broad specificity. Arterioscler Thromb. Vasc. Biol. 2019;39:A306. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original mass spectra and the protein sequence database used for searches have been deposited in the public proteomics repository MassIVE (http://massive.ucsd.edu/) and are accessible at ftp://massive.ucsd.edu/MSV000088508/.
supplemental Table S1 – TAILS experiment quantitation statistics.
supplemental Table S2 – TAILS datasets for the SMC1, SMC2, and HUVEC experiments.
supplemental Table S3 – Secretome datasets for the SMC1, SMC2, and HUVEC experiments.
supplemental Table S4 – TAILS-regulated peptide summary.
supplemental Table S5 – Annotated ADAMTS7 autocleavage sites from each TAILS experiment.
supplemental Table S6 –TAILS high confidence candidate substrate cleavage sites for each experiment and annotated overlap analysis of cleavage sites identified in multiple TAILS experiments.