Skip to main content
Frontiers in Bioinformatics logoLink to Frontiers in Bioinformatics
. 2021 Aug 30;1:723482. doi: 10.3389/fbinf.2021.723482

Protein Complex Organization Imposes Constraints on Proteome Dysregulation in Cancer

Gökçe Senger 1, Martin H Schaefer 1,*
PMCID: PMC9580999  PMID: 36303728

Abstract

Protein assembly is a highly dynamic process and proteins can interact in different ways and stoichiometries within a complex. The importance of maintaining protein stoichiometry for complex function and avoiding aggregation of orphan subunits has been demonstrated. However, how exactly the organization of proteins into complexes constrains differential protein abundance in extreme cellular conditions like cancer, where a lot of protein abundance changes occur, has not been systematically investigated. To study this, we collected proteomic data made available by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) to quantify proteomic changes during carcinogenesis and systematically tested five interaction types in complexes to investigate which of these features impact on protein abundance correlation patterns in cancer. We found that higher than expected fraction of protein complex subunits does not show changes in their abundances compared to those in the normal samples. Furthermore, we found that the way proteins interact in complexes indeed constrains their co-abundance patterns. Our results highlight the role of the interactions between the proteins and the need of cancer cells to deal with aberrant changes in protein abundance.

Keywords: protein-protein interactions, cancer, protein complexes, abundance dysregulation, stoichiometry

Introduction

Cancer is a system that is characterized by a large number of somatic molecular alterations. Many of these alterations appear on the level of the transcriptome and proteome of tumor cells. Among other factors, copy number alterations (CNAs), gain or losses in DNA copies, can cause these imbalances in mRNA expression and protein abundances (Mitelman et al., 2007; Beroukhim et al., 2010). However, CNAs are not linearly translated into transcriptome and proteome levels (Liu et al., 2016). Indeed, multi-omics studies estimated that nearly 40% genes are under compensatory post-transcriptional control in response to CNAs (Sousa et al., 2019). Interestingly, those buffered proteins are significantly enriched in protein complexes (Stingele et al., 2012; Gonçalves et al., 2017; Ishikawa et al., 2017; Ryan et al., 2017). Furthermore, it has been shown that members of the same complex have strong co-regulation in protein abundances (Dephoure et al., 2014; Kustatscher et al., 2017; Sousa et al., 2019). Together these results suggest that protein-protein interactions (PPIs) and their organization into complexes limit the dysregulation of protein abundance; however, to which degree complex organization is important and how this buffering is achieved is poorly understood.

Proteins are interacting with each other as complexes to govern cellular functions. Protein complexes are not static units but assemble and disassemble dynamically. This process relies on cellular abundances and localization of proteins, and binding affinities between proteins (Nooren and Thornton, 2003a). For example, quantitative proteomics analysis on different cell lines has shown that stable (core) complex subunits of the anaphase promoting complex/cyclosome (APC/C) are associated with higher cellular abundances and unique stoichiometries while the KIAA1430 subunit, transient interactor of the APC/C, can bind with different stoichiometric ratios (Hein et al., 2015). Furthermore, it has been demonstrated that protein stoichiometries vary depending on the environment such as different tissues and cell types (Ori et al., 2016; Nusinow et al., 2020), which highlights the dynamic organization of proteins in a context-specific manner. Since the pioneering work of Jones and Thornton (Jones and Thornton, 1996), many studies have focused on formalizing PPIs based on binding affinity, composition and stability of the complex, resulting in three main classes; homo- and hetero-oligomeric interactions, non-obligate and obligate interactions, and transient and permanent interactions (Mintseris and Weng, 2003; Nooren and Thornton, 2003a; Acuner Ozbabacan et al., 2011). Moreover, some studies have focused on indirect classification in which functional associations, co-expression, and genetic interactions are considered rather than physical interactions (Eisenberg et al., 2000; Chatr-Aryamontri et al., 2013; Franceschini et al., 2013). However, we still lack a consistent formalization of these partly redundant definitions of interaction types within complexes.

Here, we performed two related tasks: 1) developing a definition and systematic characterization of protein interaction types in complexes; 2) asking how those different interaction types result in (co-)abundance rules between the complex subunits in particular in cellular conditions like cancer that are characterized by dysregulation of gene expression. To answer these questions, we first systematically categorized protein interactions by using structural data, proteomics measurements in different cellular conditions, interaction data, and literature information and then, with the integration of cancer proteomics data, tested how each category impacts on protein abundance correlation patterns in cancer. The outcomes emphasize the role of the interactions between the proteins and their role in complexes in constraining differential protein abundances in cancer.

Materials and Methods

Code for data analysis is provided as part of the PPIs_Data-Code repository and can be found at https://github.com/SengerG/PPIs_Data-Code.

Data Processing

Proteomics data used in this publication were generated by the Clinical Proteomic Tumor Analysis Consortium (NCI/NIH). TMT-based log-transformed proteomics data for the following CPTAC cohorts (for which proteomic profiling was done for tumor and normal adjacent tissue samples and processed data is available) were obtained: Colon (COAD) from Vasaikar et al. (2019) (8,067 proteins for 96 tumor and 96 matched normal samples), HBV-related hepatocellular carcinoma (HCC) from Gao et al. (2019) (6,478 proteins for 159 tumor and 159 matched normal samples) and lung (LUAD) from Gillette et al. (2020) (10,699 proteins for 110 tumors and 101 matched normal samples). For the ones for which the confirmatory or discovery study is available, only the representative study was considered. For the COAD cohort, proteomics data were further filtered by excluding proteins that were quantified in less than 50% of the samples, leaving a total number of 6,554 proteins. For the LUAD cohort, proteins mapped to the same gene symbol were merged by taking the mean of the log-transformed TMT values, leaving a total number of 10,316 proteins.

Proteomics data for the available The Cancer Genome Atlas (TCGA) projects were directly obtained from the CPTAC consortium, comprising 3 cohorts, spectral counts for colorectal (COREAD) (Cancer Genome Atlas Network, 2012a; Zhang et al., 2014), and relative abundances for ovarian (OV) (Cancer Genome Atlas Network, 2011; Zhang et al., 2016), and breast (BRCA) (Cancer Genome Atlas Network, 2012b; Mertins et al., 2016). Spectral counts for TCGA COREAD tumor samples were normalized by quantile normalization followed by log-transformation. For the replicated samples, the mean value was considered. This left us 5,561 proteins and 90 samples for COREAD, 7,169 proteins and 174 samples for OV, and 10,625 proteins and 105 samples for BRCA.

Statistical Analysis

Differential protein abundance analysis was performed by using Wilcoxon test to detect abundance changes between the tumor and matched normal samples and then p-values were subjected to multiple testing correction by using the Bonferroni correction. Log2 fold change (log2FC) for proteins was calculated as the median difference of log2 transformed TMT-values between tumor and normal samples. Proteins with absolute log2FC greater than 1 and adjusted p-value less than or equal to 0.05 were considered differentially abundant. Associations between the differentially abundant proteins and protein complex subunits, obtained from the CORUM database (Giurgiu et al., 2019), were tested by chi-square test. To create an abundance-matched background set of the same size, we binned proteins based on their abundance and replaced each complex subunit with a non-complex protein from the same bin. Standard deviations in protein abundances were calculated across tumor and normal samples separately and then compared by using t-test.

Protein abundance correlations, for all possible protein pairs among proteins covered by proteomics data, were calculated across tumor samples by using the Spearman method separately for all 6 CPTAC cohorts (COAD, HCC, LUAD, COREAD, OV, and BRCA). The difference between the distributions of correlations between members of the same complex and members of different complexes was tested by Wilcoxon test. To this end, correlations from different cohorts for each protein pair were pooled.

Calculation of Stoichiometric Ratio

Stoichiometry information for proteins was retrieved from the structural data available in the Protein Data Bank (PDB) (for 9,840 PDB entries) (Berman et al., 2000) in March 2020. Uniprot IDs were then converted to gene names and only the human complexes were considered for further analysis, which left us with 8,388 protein complexes comprising 3,075 proteins. For proteins within the same complex, the stoichiometric ratio was calculated as the number of chains of proteins relative to each other. Then, protein pairs were grouped as the ones involved in complexes with even (e.g., 1:1, 2:2, 4:4) and with uneven (e.g., 2:1, 1:2, 3:1) stoichiometric ratio (Supplementary Table S1). Protein pairs that participate in complexes sometimes with even and sometimes with uneven ratio were not included for further analyses.

Calculation of Co-occurrence Frequency

The known human protein complexes (n = 2,916) and their subunits (n = 3,664 unique proteins) were obtained from the CORUM database (Giurgiu et al., 2019) (Corum 3.0 current release, September 2018). For each protein pair found together in at least one protein complex, we first counted the number of complexes in which the two proteins were found together. To address a possible bias due to different tendencies of proteins to participate in complexes, we calculated the Jaccard index as a representative of the co-occurrence frequencies by dividing the number of complexes a protein pair is found together by the number of complexes in which at least one of them is found.

Defining Context-Specific and General Interactions

167,374 protein interactions were obtained from BioPlex Interactome (Huttlin et al., 2021) for the 293T and HCT116 human cell lines. Protein interactions that were detected by the baits targeted in both cell lines were considered, leaving a total number of 33,739 interactions detected in both cell lines (general interactions), and 89,330 interactions detected either in 293T or in HCT116 cell lines (context-specific interactions).

Defining Competitive and Cooperative Interactions

Protein interaction interface information was extracted from Interactome INSIDER (where interface residues are defined as the ones with a decrease in solvent-accessible surface area equal to or larger than 1.0 Å2 upon binding) (Meyer et al., 2018) covering 121,575 experimentally determined human binary interactions among 14,380 proteins. Uniprot IDs were converted to gene names. The binary interactions that have binding site information only for one protein were filtered out, which left us with 70,355 binary protein interactions. To determine if two proteins bind to their common partner in a competitive or cooperative manner, we calculated the Jaccard index (Eq. 1) as a measure of binding similarity between their corresponding interaction sites on the common partner. For the protein pairs that have more than one common partner, the one with the highest Jaccard index was considered for further analyses. Two proteins are classified as competitive if the Jaccard index is equal to or higher than 0.1 and cooperative otherwise.

Jaccard Index = Number of residues in ABNumber of residues in AB (1)

where A and B represent the corresponding interaction sites on the common partner for protein A and B.

Transient and Permanent Interactions

Transient and permanent PPIs were obtained from Block et al. (2006) (147 permanent and 198 transient interactions among 340 PDB complexes) and from Mintseris and Weng (2003) (209 transient interactions among 207 PDB complexes). After selecting for heterodimers and human complexes and removing the duplicated interactions predicted by both studies, 58 transient and 9 permanent interactions were left.

Statistical Analysis on Protein-Protein Interaction Types

For each comparison between groups in each class of PPIs, Wilcoxon test was applied.

For each protein pair covered by the structural data obtained from the PDB (Berman et al., 2000), we tested the relationship between proteins by using a linear regression model. In the model, the dependent variable was the protein abundances of the protein with smaller copy number across tumor samples. For the protein pairs with even stoichiometric ratio, the first protein was considered as the dependent variable. Only the relationships where the coefficient of the dependent variable was significantly different from zero (p-value < 0.05, linear regression model) were considered for the comparison of slopes between protein pairs with even and uneven stoichiometric ratio. Wilcoxon test was used for the comparison.

Results

Complex Members Are to a Certain Degree Protected From Abundance Changes

To catalog a representative set of protein abundance changes during carcinogenesis, we obtained proteome quantification data for tumor and normal adjacent tissue samples from the CPTAC consortium, comprising three cohorts; COAD (Vasaikar et al., 2019), HCC (Gao et al., 2019) and LUAD (Gillette et al., 2020) (Figure 1A). Then, differentially abundant proteins were detected between tumor and normal samples. We found a variation in the fraction of quantified proteins that showed significant abundant changes (adjusted p-value ≤ 0.05, Wilcoxon test, and absolute log2FC > 1); 457 out of 6,554 proteins, 481 out of 6,478 proteins and 3,971 out of 10,316 proteins for COAD, HCC, and LUAD, respectively. The majority of those differentially abundant proteins were down-regulated in tumor samples (∼90% for COAD and HCC, 60% for LUAD) (Supplementary Figure S1A).

FIGURE 1.

FIGURE 1

Protein abundance changes in cancer. (A) The total number of quantified proteins in each CPTAC cohort after filtering (LUAD, COAD, HCC, BRCA, OV, and COREAD). (B) The expected and observed number of differentially abundant proteins in complex subunits. Chi-square test was used to determine if the difference between expected and observed values is statistically significant (n.s: no statistical significance; ****: p < 0.0001). (C) The difference between the distribution of standard deviations of protein abundances in tumor and matched normal samples. Statistical significance was tested by t-test. (D) Spearman correlation coefficients of protein pairs across tumor samples. Pairwise correlations were calculated across tumor samples, separately in each cohort (LUAD, COAD, HCC, BRCA, OV, and COREAD) and then pooled. Wilcoxon test was used to test if two distributions are significantly different.

We next aimed to understand if complex subunits are to a lower degree affected by differential abundance changes when comparing tumors with healthy tissue. To do this, we performed an association test between the differentially abundant proteins, with up- and down-regulated proteins pooled, and protein complex subunits obtained from the CORUM database (Giurgiu et al., 2019). We observed a significant depletion of differentially abundant proteins in protein complex subunits for COAD and HCC (p < 0.0001, chi-square test) (Figure 1B). In order to avoid a potential bias due to the abundance differences between complex and non-complex proteins, we repeated the association test by using a size- and abundance-matched non-complex subunits as background. We could reproduce our previous result in COAD and HCC cohorts where we observed significant association. We hypothesize that complex subunits are protected from abundance changes as stoichiometric imbalances in protein complexes cause proteotoxicity and prevent proper functioning of complexes. Similar observations have been previously described for abundance changes triggered by focal genomic copy number alterations (Gonçalves et al., 2017).

For the LUAD cohort, we observed in relative terms a higher proportion of up-regulated proteins when compared to abundance changes in COAD and HCC, where around 90% of the differentially abundant proteins were down-regulated (Supplementary Figure S1A). A chi-square test was performed for up- and down-regulated proteins separately in the LUAD cohort where this comparison was feasible to assess differences in their overlap with the complex subunits. We observed that up-regulated proteins were significantly enriched in complex subunits (p = 6.28e-17, chi-square test) while a significant depletion was observed (consistent with the other cohorts) in the overlap between the down-regulated proteins and complex subunits (p = 8.42e-19, chi-square test) (Supplementary Figure S1B). The overall strong depletion of complex members among downregulated proteins suggests that complexes are protected from downregulation of their components as the lack of subunits will prevent the proper functioning of complexes, which might be more detrimental for the tumor than upregulation of components.

In summary, we observed that protein complex membership protects proteins from downregulation in cancer to a certain degree. Still, many protein abundance changes happen in cancer (Guang et al., 2019). Indeed, we observed a higher variation in protein abundances across tumor samples as compared to the matched normal samples in all of the 3 cohorts (p < 0.0001, t-test) (Figure 1C). We therefore decided to test if protein abundance correlation differs between complex partners vs. non-partners. To this end, we additionally included the CPTAC proteome data for the available TCGA projects (for which no matched normal samples were available and, hence, had been excluded from the differential abundance analysis): COREAD (including both colon and rectal tumors differently from the COAD cohort) (The Cancer Genome Atlas Network, 2012a; Zhang et al., 2014), OV (The Cancer Genome Atlas Network, 2011; Zhang et al., 2016) and BRCA (The Cancer Genome Atlas Network, 2012b; Mertins et al., 2016) cohorts. For all six cohorts (COAD, HCC, LUAD, COREAD, OV, and BRCA), we calculated protein level Spearman correlations for all possible pairs of proteins covered by CORUM complexes across tumor samples. We found that proteins involved in the same complex have significantly stronger protein abundance correlations (p < 2.2e-16, Wilcoxon test) (Figure 1D).

Together these observations suggest that protein complex organization constrains protein abundance changes in cancer, which is characterized by dysregulation of proteins, and has an impact on the strength of co-abundance patterns.

Protein-Protein Interaction Classifications

To further understand how the dynamics of protein complex formation and the interaction between their components affect co-regulation of abundance changes of complex subunits, we first systematically categorized interaction types in complexes into five classes based on protein structure, proteomics measurements in different cellular conditions, biochemical properties of the binding interfaces, and literature information: i) stoichiometric ratio between proteins; ii) co-occurrence frequency of proteins; iii) context-specific vs. general interactions; iv) competitive vs. cooperative interactions; and v) transient vs. permanent interactions (Table 1). Next, we tested which of the complex interaction types contributes to the strength of abundance correlations between co-complex members in cancer. To do this, we calculated protein level Spearman correlations for all possible pairs of quantified proteins across tumor samples separately for each cohort included in this study (COAD, HCC, LUAD, COREAD, OV, and BRCA).

TABLE 1.

Categories of PPIs. Five interaction types between proteins, their definition and the source from which the interaction information is obtained. The illustrations for each defined interaction type are in Figure 2.

Class Definition Source
Stoichiometric ratio between proteins Copy numbers of proteins within a complex relative to each other PDB Berman et al. (2000)
Co-occurrence frequency of proteins The number of complexes two proteins are found together CORUM database Giurgiu et al. (2019)
Context-specific vs. general interactions PPIs are classified into two groups based on their detection in different human cell lines (HEK293T and HCT116) BioPlex Interactome Huttlin et al. (2021)
Competitive vs. cooperative interactions The terms, “competitive” and “cooperative,” are used to define the relationships between two proteins that bind to the same protein Interactome Insider Meyer et al. (2018)
Transient vs. permanent interactions Based on the stability of a complex and physicochemical characteristics of protein interfaces, protein-protein interaction can be grouped as transient, formed temporarily, and permanent interactions, naturally formed (Nooren and Thornton, 2003a; Nooren and Thornton, 2003b) Block et al. (2006), Mintseris and Weng (2003)

Proteins interact with each other in varying stoichiometric ratios in protein complexes (Taggart et al., 2020). To test whether the stoichiometric ratio between co-complex members constrains abundance changes within a complex, we used structural data obtained from the PDB (Berman et al., 2000) for the available human heteromeric protein complexes (n = 8,388) and calculated the stoichiometric ratio between the proteins within the same complex. Then we grouped protein pairs as the ones with even stoichiometric ratio (e.g., 1:1, 2:2, 4:4) and those with uneven stoichiometric ratio (e.g., 2:1, 1:2, 3:1) and compared protein level correlations between these two groups in each cohort. We observed that protein pairs involved in complexes in an even stoichiometric ratio have significantly stronger correlations than other protein pairs (p-value < 0.0001, Wilcoxon test) in all cohorts (Figure 2A; Supplementary Figure S2A). We next tested if the stoichiometric ratio would have an impact on the steepness of the regression curve. We found, considering only the significant relationships (where coefficients are different from zero; p-value < 0.05, linear regression model), that positively correlated protein pairs with even stoichiometric ratio are associated with higher slopes (significant for 4 cohorts out of 6; p-value < 0.0001, Wilcoxon test) while their negatively correlated counterparts are associated with lower slopes (significant for 2 cohorts out of 6; p-value < 0.05, Wilcoxon test) (Supplementary Figure S3). These observations make sense as when a protein is upregulated its co-complex members that participate with a higher number of copies in the complex need to be upregulated to an even higher degree to fulfil the stoichiometric constraints.

FIGURE 2.

FIGURE 2

Protein-protein interaction types and protein level correlations between proteins across tumor samples in HCC. (A) Stoichiometric ratio between proteins. (B) Co-occurrence frequency of proteins. (C) Context-specific vs. general interactions. (D) Competitive vs. cooperative interactions. (E) Permanent vs. transient interactions. Wilcoxon test was performed to compare correlations between two groups in each class of PPIs.

Some protein pairs are found together in many complexes while others co-occur only in a few complexes. Thus, we asked if the frequency of the co-occurence of protein pairs in complexes has an effect on co-abundance changes in cancer. To do this, for each protein pair among CORUM protein complex subunits, we computed co-occurrence frequencies in complexes. Comparison of the co-occurrence frequencies between positively and negatively correlated proteins showed that positively correlated protein pairs tend to occur more often in the same complexes, consistently in each cohort (p-value < 0.0001, Wilcoxon test) (Figure 2B; Supplementary Figure S2B). This observation matches our expectation as frequent co-membership in complexes means a larger number of complexes will depend on the proper abundance ratios between the two proteins and this will increase the need for coregulation of the protein pair.

We further tested context-specific vs. general interactions. As a proxy for the interaction specificity, we included experimentally determined cell-line specific PPIs (Huttlin et al., 2021) and classified them as context-specific interactions if they were detected in only one of the two cell lines, and general interactions if present in both cell lines. We found that proteins interacting independently from the context are associated with stronger correlations than those interacting in a context-dependent manner in all cohorts (p-value < 0.0001, Wilcoxon test) (Figure 2C; Supplementary Figure S2C). The result makes sense as general interactions, by definition, are less prone to be affected by different cellular environments, and thus their co-abundance changes are expected to be more correlated.

While some proteins bind to few partners, some have multiple partners binding at similar (overlapping) interaction sites (Keskin and Nussinov, 2007). To assess whether competing for binding affects co-abundance changes in cancer, we first identified competitive and cooperative interactions, and then compared protein level correlations between them. To this end, we used experimentally determined human binary interactions curated from Interactome INSIDER (Meyer et al., 2018) and grouped them as competitive and cooperative interactions based on the similarity of their corresponding interaction sites on their shared partners. We observed that competitively interacting proteins have significantly higher correlations than cooperatively interacting proteins, consistently for 5 cohorts out of 6 (p-value < 0.05, Wilcoxon test) (Figure 2D; Supplementary Figure S2D). This can be robustly reproduced when comparative and cooperative interactions were grouped based on different binding similarity scores (Supplementary Figure S4). This observation surprised us as we expected weaker or negative correlations between competitively interacting proteins as those, by definition, should not participate in the same complex at the same time and hence an opposing expression pattern would be expected.

Based on stability, PPIs are classified as permanent and transient interactions. We obtained permanent and transient interactions estimated by machine learning algorithms based on physicochemical properties of PPIs from Minteris and Weng (2003) and Block et al. (2006), and then compared protein level correlations between those two groups. While not significant in each single cohort, we observed consistent trends: permanent interactions correspond to stronger correlations while relatively weaker, transient interactions were observed between proteins whose abundances are less dependent on each other, and this trend is statistically significant in COREAD, COAD, and HCC cohorts (p-value < 0.05, Wilcoxon test) (Figure 2E; Supplementary Figure S2E). This is expected as the transient interactions are more flexible for a change in binding partners during the assembly of complexes (Nooren and Thornton, 2003a).

Discussion

Cancer is characterized by many alterations including transcriptome and proteome dysregulation. There seems to be a larger number of dysregulated transcripts as compared to proteins (Stingele et al., 2012). This suggests compensatory mechanisms on translation level and raises the interesting question which of the changes in gene expression are tolerated by the cancer cell and which not. Our analysis reconfirms that complex members are to a certain degree protected from abundance changes that could mess with complex stoichiometry. In addition, we showed that the type of interaction within a protein complex constrains proteome abundance and dysregulation to maintain functional complex organization. To address this, we systematically categorized interaction types in complexes by integrating experimental measurements and computational predictions, and tested each category for its impact on abundance changes by using cancer proteomics data.

In most yet not in every case, our observations matched our expectations. However, in some comparisons we observed different degrees of the expected trend in different cohorts: e.g., for the observation that permanent interactions are associated with higher correlations (Figure 2E; Supplementary Figure S2E). This variation might be explained by the way the physiological conditions and local environment affect the stability of an interaction meaning that permanent interaction may become transient under certain conditions or vice versa (Nooren and Thornton, 2003a). Additionally, computational prediction of transient and permanent interactions might not fully capture how proteins interact in different local environments represented by different cancer types used in this study.

For proteins binding to the same proteins through overlapping interaction sites, competition is expected. We hypothesized that proteins competing with each other to bind their common partners will show weaker correlations in their protein abundances. However, we were surprised that the results were in the opposite direction of our expectation (Figure 2D; Supplementary Figure S2D). Those results could be reproduced over a range of thresholds for binding site overlap definition (Supplementary Figure S4). The reason could be that alternative regulatory mechanisms prevent competition between partners (Li et al., 2015) or that our computational estimates of competition vs. cooperative binding are not accurate. For example, biophysical properties of interaction sites of proteins (e.g., steric hindrance) can favor or prevent interactions between them and thus, the classification of proteins based on the overlap in the interaction interface might not represent competitive or cooperative binding in reality. In addition, it should be noted that those estimates are based on the measurement from in vitro experiments. The proteins classified as competing could have different localization or be expressed in different cells/tissues in vivo and in reality never meet.

The purpose of this study was to provide a proof of concept that interaction types within complexes affect co-abundance patterns in cancer. To this end, we picked only a limited set of studies to classify interactions or quantify protein abundances. Hence, the primary limitation of our study is the small number of representative examples for several types of interactions such as permanent vs. transient interactions where mining the literature revealed only a few instances. Additionally, for the class of context-specific vs. general interactions, we considered the proteomics measurements of two different cell lines. Thus, integrating a larger number of cell-line specific proteomics interaction datasets will potentially provide a more comprehensive understanding in the future.

Data Availability Statement

The datasets analyzed in this study are publicly available and can be found here: Proteomics data in the CPTAC (https://cptac-data-portal.georgetown.edu/), the known human protein complexes in the CORUM database (http://mips.helmholtz-muenchen.de/corum/#download), structural information of protein complexes in the PDB (rcsb.org), 121,575 experimentally determined human binary interactions in Interactome INSIDER (http://interactomeinsider.yulab.org/downloads.html), and cell-line specific PPIs in BioPlex Interactome (https://bioplex.hms.harvard.edu/interactions.php). The datasets generated for this study can be found as part of PPIs_Data-Code repository at (https://github.com/SengerG/PPIs_Data-Code).

Author Contributions

GS and MS designed the research. GS performed all analyses. MS supervised the study. GS and MS interpreted the data. GS and MS wrote the manuscript.

Funding

The work leading to this manuscript was supported by Fondazione AIRC, grant reference number MFAG 21791.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbinf.2021.723482/full#supplementary-material

References

  1. Acuner Ozbabacan S. E., Engin H. B., Gursoy A., Keskin O. (2011). Transient protein-protein interactions. Protein Eng. Des. Sel. 24 (9), 635–648. 10.1093/protein/gzr025 [DOI] [PubMed] [Google Scholar]
  2. Berman H. M., Battistuz T., Bhat T. N., Bluhm W. F., Bourne P. E., Burkhardt K., et al. (2000). The protein data bank. Acta Crystallogr. D Biol. Crystallogr. 58 (6 I), 899–907. 10.1107/S0907444902003451 [DOI] [PubMed] [Google Scholar]
  3. Beroukhim R., Mermel C. H., Porter D., Wei G., Raychaudhuri S., Donovan J., et al. (2010). The landscape of somatic copy-number alteration across human cancers. Nature 463 (7283), 899–905. 10.1038/nature08822 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Block P., Paern J., Hüllermeier E., Sanschagrin P., Sotriffer C. A., Klebe G. (2006). Physicochemical descriptors to discriminate protein-protein interactions in permanent and transient complexes selected by means of machine learning algorithms. Proteins 65 (3), 607–622. 10.1002/prot.21104 [DOI] [PubMed] [Google Scholar]
  5. Chatr-Aryamontri A., Breitkreutz B. J., Heinicke S., Boucher L., Winter A., Stark C., et al. (2013). The BioGRID interaction database: 2013 Update. Nucleic Acids Res. 41 (D1), D816–D823. 10.1093/nar/gks1158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Dephoure N., Hwang S., O'Sullivan C., Dodgson S. E., Gygi S. P., Amon A., et al. (2014). Quantitative proteomic analysis reveals posttranslational responses to aneuploidy in yeast. ELife 3 (July2014), e03023–27. 10.7554/eLife.03023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. The Cancer Genome Atlas Network (2012b). “Genome sequencing centres,” in Comprehensive molecular portraits of human breast tumours. Editor Koboldt D. (St Louis: Washington University; ). Nature . 490, 61–70. 10.1038/nature11412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. The Cancer Genome Atlas Network (2012a). “Genome Sequencing Center Baylor College of Medicine,” in Comprehensive molecular characterization of human colon and rectal cancer. Editor Muzny D. (St Louis: Washington University; ), 487, 330–337. Nature. 10.1038/nature11252 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Eisenberg D., Marcotte E. M., Xenarios I., Yeates T. O. (2000). Protein function in the post-genomic era. Nature 405 (6), 823–826. 10.1038/35015694 [DOI] [PubMed] [Google Scholar]
  10. Franceschini A., Szklarczyk D., Frankild S., Kuhn M., Simonovic M., Roth A., et al. (2013). STRING v9.1: Protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 41 (D1), D808–D815. 10.1093/nar/gks1094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gao Q., Zhu H., Dong L., Shi W., Chen R., Song Z., et al. (2019). Integrated Proteogenomic Characterization of HBV-Related Hepatocellular Carcinoma. Cell 179 (2), 561–e22. e22. 10.1016/j.cell.2019.08.052 [DOI] [PubMed] [Google Scholar]
  12. Gillette M. A., Satpathy S., Cao S., Dhanasekaran S. M., Vasaikar S. V., Krug K., et al. (2020). Proteogenomic Characterization Reveals Therapeutic Vulnerabilities in Lung Adenocarcinoma. Cell 182 (1), 200–e35. e35. 10.1016/j.cell.2020.06.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Giurgiu M., Reinhard J., Brauner B., Dunger-Kaltenbach I., Fobo G., Frishman G., et al. (2019). CORUM: the comprehensive resource of mammalian protein complexes-2019. Nucleic Acids Res. 47 (D1), D559–D563. 10.1093/nar/gky973 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gonçalves E., Fragoulis A., Garcia-Alonso L., Cramer T., Saez-Rodriguez J., Beltrao P. (2017). Widespread Post-transcriptional Attenuation of Genomic Copy-Number Variation in Cancer. Cell Syst. 5 (4), 386–e4. e4. 10.1016/j.cels.2017.08.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Guang M. H. Z., Kavanagh E. L., Dunne L. P., Dowling P., Zhang L., Lindsay S., et al. (2019). Targeting proteotoxic stress in cancer: A review of the role that protein quality control pathways play in oncogenesis. Cancers (Basel) 11 (1), 66. 10.3390/cancers11010066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hein M. Y., Hubner N. C., Poser I., Cox J., Nagaraj N., Toyoda Y., et al. (2015). A Human Interactome in Three Quantitative Dimensions Organized by Stoichiometries and Abundances. Cell 163 (3), 712–723. 10.1016/j.cell.2015.09.053 [DOI] [PubMed] [Google Scholar]
  17. Huttlin E. L., Bruckner R. J., Navarrete-Perea J., Cannon J. R., Baltier K., Gebreab F., et al. (2021). Dual proteome-scale networks reveal cell-specific remodeling of the human interactome. Cell 184 (11), 3022–e28. e28. 10.1016/j.cell.2021.04.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Ishikawa K., Makanae K., Iwasaki S., Ingolia N. T., Moriya H. (2017). Post-Translational Dosage Compensation Buffers Genetic Perturbations to Stoichiometry of Protein Complexes. Plos Genet. 13 (1), e1006554–22. 10.1371/journal.pgen.1006554 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Jones S., Thornton J. M. (1996). Principles of protein-protein interactions. Proc. Natl. Acad. Sci. U S A. 93 (1), 13–20. 10.1073/pnas.93.1.13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Keskin O., Nussinov R. (2007). Similar Binding Sites and Different Partners: Implications to Shared Proteins in Cellular Pathways. Structure 15 (3), 341–354. 10.1016/j.str.2007.01.007 [DOI] [PubMed] [Google Scholar]
  21. Kustatscher G., Grabowski P., Rappsilber J. (2017). Pervasive coexpression of spatially proximal genes is buffered at the protein level. Mol. Syst. Biol. 13 (8), 937. 10.15252/msb.20177548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Li H., Zhou Y., Zhang Z. (2015). Competition-cooperation relationship networks characterize the competition and cooperation between proteins. Sci. Rep. 5 (February), 1–13. 10.1038/srep11619 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Liu Y., Beyer A., Aebersold R. (2016). On the Dependency of Cellular Protein Levels on mRNA Abundance. Cell 165 (3), 535–550. 10.1016/j.cell.2016.03.014 [DOI] [PubMed] [Google Scholar]
  24. Mertins P., Mani D. R., Ruggles K. V., Gillette M. A., Clauser K. R., Wang P., et al. (2016). Proteogenomics connects somatic mutations to signalling in breast cancer. Nature 534 (7605), 55–62. 10.1038/nature18003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Meyer M. J., Beltrán J. F., Liang S., Fragoza R., Rumack A., Liang J., et al. (2018). Interactome INSIDER: a structural interactome browser for genomic studies. Nat. Methods 15 (2), 107–114. 10.1038/nmeth.4540 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Mintseris J., Weng Z. (2003). Atomic Contact Vectors in Protein-Protein Recognition. Proteins 53 (3), 629–639. 10.1002/prot.10432 [DOI] [PubMed] [Google Scholar]
  27. Mitelman F., Johansson B., Mertens F. (2007). The impact of translocations and gene fusions on cancer causation. Nat. Rev. Cancer 7 (4), 233–245. 10.1038/nrc2091 [DOI] [PubMed] [Google Scholar]
  28. Nooren I. M., Thornton J. M. (2003b). Structural characterisation and functional significance of transient protein-protein interactions. J. Mol. Biol. 325 (5), 991–1018. 10.1016/S0022-2836(02)01281-0 [DOI] [PubMed] [Google Scholar]
  29. Nooren I. M. A., Thornton J. M. (2003a). NEW EMBO MEMBER ’ S REVIEW Diversity of protein ± protein interactions. EMBO J. 22 (14), 3486–3492. 10.1093/emboj/cdg359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Nusinow D. P., Szpyt J., Ghandi M., Rose C. M., McDonald E. R., Kalocsay M., et al. (2020). Quantitative Proteomics of the Cancer Cell Line Encyclopedia. Cell 180 (2), 387–e16. e16. 10.1016/j.cell.2019.12.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Ori A., Iskar M., Buczak K., Kastritis P., Parca L., Andrés-Pons A., et al. (2016). Spatiotemporal variation of mammalian protein complex stoichiometries. Genome Biol. 17 (1), 47–15. 10.1186/s13059-016-0912-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ryan C. J., Kennedy S., Bajrami I., Matallanas D., Lord C. J. (2017). A Compendium of Co-regulated Protein Complexes in Breast Cancer Reveals Collateral Loss Events. Cel. Syst. 5 (4), 399–e5. e5. 10.1016/j.cels.2017.09.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Sousa A., Gonçalves E., Mirauta B., Ochoa D., Stegle O., Beltrao P. (2019). Multi-omics Characterization of Interaction-mediated Control of Human Protein Abundance levels. Mol. Cel. Proteomics 18 (8), S114–S125. 10.1074/mcp.RA118.001280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Stingele S., Stoehr G., Peplowska K., Cox J., Mann M., Storchova Z. (2012). Global analysis of genome, transcriptome and proteome reveals the response to aneuploidy in human cells. Mol. Syst. Biol. 8 (608), 608. 10.1038/msb.2012.40 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Taggart J. C., Zauber H., Selbach M., Li G. W., McShane E. (2020). Keeping the Proportions of Protein Complex Components in Check. Cel. Syst. 10 (2), 125–132. 10.1016/j.cels.2020.01.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. The Cancer Genome Atlas Research Network (2011). (Participants are arranged by area of contribution and then by institution.)., Disease working group and tissue source sites. et al.Integrated genomic analyses of ovarian carcinoma. Nature 474, 609–615. 10.1038/nature10166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Vasaikar S., Huang C., Wang X., Petyuk V. A., Savage S. R., Wen B., et al. (2019). Proteogenomic Analysis of Human Colon Cancer Reveals New Therapeutic Opportunities. Cell 177 (4), 1035–e19. e19. 10.1016/j.cell.2019.03.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zhang B., Wang J., Wang X., Zhu J., Liu Q., Shi Z., et al. (2014). Proteogenomic characterization of human colon and rectal cancer. Nature 513 (7518), 382–387. 10.1038/nature13438 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Zhang H., Liu T., Zhang Z., Payne S. H., Zhang B., McDermott J. E., et al. (2016). Integrated Proteogenomic Characterization of Human High-Grade Serous Ovarian Cancer. Cell 166 (3), 755–765. 10.1016/j.cell.2016.05.069 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The datasets analyzed in this study are publicly available and can be found here: Proteomics data in the CPTAC (https://cptac-data-portal.georgetown.edu/), the known human protein complexes in the CORUM database (http://mips.helmholtz-muenchen.de/corum/#download), structural information of protein complexes in the PDB (rcsb.org), 121,575 experimentally determined human binary interactions in Interactome INSIDER (http://interactomeinsider.yulab.org/downloads.html), and cell-line specific PPIs in BioPlex Interactome (https://bioplex.hms.harvard.edu/interactions.php). The datasets generated for this study can be found as part of PPIs_Data-Code repository at (https://github.com/SengerG/PPIs_Data-Code).


Articles from Frontiers in Bioinformatics are provided here courtesy of Frontiers Media SA

RESOURCES