Abstract
The anabolic response of bone to mechanical load is partially the result of osteocyte response to fluid flow-induced shear stress. Understanding signaling pathways activated in osteocytes exposed to fluid flow could identify novel signaling pathways involved in the response of bone to mechanical load. Bioinformatics allows for a unique perspective and provides key first steps in understanding these signaling pathways. We examined proteins encoded by genes differentially expressed in response to fluid flow in murine osteocytic MLO-Y4 cells. We considered structural and functional characteristics including putative intrinsic disorder, evolutionary conservation, interconnectedness in protein-protein interaction networks, and cellular localization. Our analysis suggests that proteins encoded by fluid flow activated genes have lower than expected conservation, are depleted in intrinsic disorder, maintain typical levels of connectivity for the murine proteome, and are found in the cytoplasm and extracellular space. Pathway analyses reveal that these proteins are associated with cellular response to stress, chemokine and cytokine activity, enzyme binding, and osteoclast differentiation. The lower than expected disorder of proteins encoded by flow activated genes suggests they are relatively specialized.
Introduction
Mechanical fluctuations in the extracellular environment, such as interstitial fluid flow-induced shear stress, induce cell perturbations including membrane deformations, cytoskeletal restructuring, conformational changes in transmembrane proteins, changes in the glycol calyx and movement of cilia. These cell responses to mechanical signals are translated throughout the cell and can result in alterations in gene expression across many tissues1.
In bone, osteocytes are well positioned to detect mechanical signals from loading and communicate them to downstream effector cells including osteoblasts, osteoclasts, and bone lining cells2. However, the signal transduction pathways involved in this process are only partially understood. A better understanding of signaling pathways activated by mechanical signals in vitro may lead to a better understanding of how bone adapts to load in vivo.
We have previously used gene microarrays, proteomics and RNA sequencing analysis (RNA-Seq) to identify increases in inflammatory C-X-C motif chemokines, including Ccl2, and the HIF-1α, IL-17, and AMPK signaling pathways induced by fluid flow3,4. However, these studies were limited as they did not examine functional and structural characteristics of proteins encoded by the corresponding differentially expressed genes and have focused only on genes with the highest signal. To address this we analyzed previously obtained RNA-seq data and gene micro array data3,4 using several high-throughput computational methods to quantify evolutionary conservation, interconnectedness, and putative intrinsic disorder of the proteins encoded by all differentially expressed genes. We also functionally characterized these proteins using pathway analysis and gene ontology. We analyzed which of these characteristics are unique to proteins encoded by genes affected by fluid flow by comparing them to a generic set of proteins from the mouse proteome.
Conservation quantifies the amount of evolutionary variation in a protein sequence across species. When measured at the protein level it reflects the abundance of functionally and structurally important residues that are typically retained across species5–7. Interconnectedness measures the number of interactions between proteins in protein-protein interaction (PPI) networks, and can predict, along with conservation, the extent to which this protein is essential8,9 and central to cellular processes10. Intrinsic disorder reflects the degree to which a protein lacks an ordered 3D structure under physiologic conditions11,12. While the levels of intrinsic disorder vary from protein to protein, disorder itself is ubiquitous. Recent estimates show that about 19% of amino acid residues are disordered in Eukaryotes13 and between 6% and 17% of proteins encoded by various genomes are entirely disordered14. Intrinsic disorder is implicated in a diverse range of cellular functions including transcription15,16, translation17, protein-protein interactions18,19, protein-RNA interactions20,21 and cell signaling19,22,23, to name a few. Functional characteristics that are specific to disorder include one-to-many binding, where one disordered region can interact with several structurally diverse partners; ability to fold upon binding; high accessibility to and regulation by post-translational modifications; and ability to implement cascade interactions24. At the same time, a high level of disorder is a substantial obstacle to solving protein structure25–27 and to performing rational drug design that relies on knowledge of protein structure28–30.
Gene ontology (GO) is a bioinformatics framework that defines biological processes, molecular functions and subcellular location of genes and proteins31,32. These annotations allow for the computational identification of functions and locations that are characteristic of a given group of genes or proteins. In a similar manner, pathway and network analysis can compare differentially expressed proteins and genes to known pathways to identify possible downstream effects.
We hypothesized that examining protein conservation, interconnectedness, intrinsic disorder, as well as pathway analysis, would provide insights to the function of osteocytic proteins encoded by genes differentially expressed in response to fluid flow-induced shear stress. This is the first study to examine evolutionary conservation, interconnectedness and putative intrinsic disorder in the context of mechanobiology.
Results
We comprehensively characterized several major structural and functional characteristics of osteocytic proteins encoded by genes differentially expressed due to exposure to fluid flow-induced shear stress. Figure 1 summarizes results focusing on evolutionary conservation, content of putative intrinsic disorder and interconnectedness. We compared the entire complement of proteins encoded by differentially expressed genes as well as proteins encoded by genes upregulated and downregulated against a generic set of murine proteins. Figure 1A reveals that proteins that correspond to the differentially expressed genes, and in particular proteins encoded by genes that are upregulated, have significantly lower than expected evolutionary conservation (p-value < 0.05). Proteins encoded by genes that are downregulated are also characterized by lower levels of conservation, however, the difference did not reach statistical significance when p-value considered significant was set at 0.05 (p-value = 0.18). Figure 1B reveals similar observations for the content of intrinsic disorder. That is, proteins encoded by differentially expressed and upregulated genes have significantly lower than expected disorder content (p-value < 0.05). Proteins encoded by downregulated genes also have lower disorder content, however, the difference did not reach statistical significance when p value considered significant was set at 0.05 (p-value = 0.32). The median disorder content across proteins encoded by differentially regulated genes equals 0.09 compared to the twice larger (0.18) content that is typical for murine proteins13. On the other hand, the degree of interconnectedness is not significantly different. Proteins encoded by differentially expressed genes, including both upregulated and downregulated subsets, have similar levels of PPIs (p-values > 0.6) compared to an overall set of murine proteins. The average number of PPIs in mouse listed in the metha resource equals 5.9733 and is, as expected, virtually identical to the number that we report for our randomly chosen subset of murine proteins (6.03), and only slightly higher than for the proteins encoded by differentially expressed genes (5.63).
For GO analyses, we collected the most frequent term for each path in the GO hierarchy to reduce redundancy between the terms and to include the most frequent terms. Figure 2 depicts a list of subcellular locations (black bars), molecular functions (light gray bars) and biological processes (dark gray bars) that are significantly enriched in the proteins encoded by differentially expressed genes (p-value < 0.05 and at least 50% increase compared to the expected value to ensure that magnitude of the difference is substantial). Our genes of interest are primarily located in the cytoplasm and extracellular space. The GO terms that have high enrichment and high counts (second number inside square brackets next to the GO term name in Fig. 2) suggests that they are involved in chemokine (GO:0008009 and 0070098) and cytokine (GO:0034097) activities, response and regulation of response to stress (GO:0006950 and 0080134), and cellular response to chemical stimulus and oxygen-containing compound (GO:0070887 and 1901700). Annotations of molecular functions (light gray bars) also suggest that they take part in enzyme and G-protein coupled receptor binding.
We further focused on several key functions related to chemokine and cytokine processes and response and regulation of response to stress. We visualized the distributions of the evolutionary conservation, putative intrinsic disorder and interconnectedness for these proteins encoded by differentially expressed genes and compared them to a generic set of murine proteins (Fig. 3). Consistent with the results in Fig. 1, conservation (Fig. 3A) and amount of intrinsic disorder (Fig. 3B) are significantly lower than expected in these three sets of proteins that correspond to the differentially expressed genes (p-value < 0.05, except for the difference in conservation for proteins with chemokine activity where p-value = 0.3). Moreover, the numbers of PPIs are similar to a generic murine protein (p-value = 0.99).
Ingenuity Pathway Analysis constructs causal networks from individual relationships detailed in curated literature34. Using these networks, downstream effects can be predicted based on our data. Casual network analysis of our RNA-seq data found a fluid flow-induced upregulation of proteins encoded by differentially expressed genes that are associated with the downregulation of osteoclast differentiation (Fig. 4). Shown in Fig. 4 are all the proteins that correspond to the differentially expressed genes in our dataset expected to influence osteoclast differentiation. NOS2, LPRC17 and PLK3R1 that are associated with inhibition of osteoclast differentiation were moderately upregulated; PTGS2 was also upregulated but would be predicted to increase osteoclast differentiation while TLR2 may effect osteoclast differentiation, IPA’s database did not have enough power to generate a prediction. NOS2, TLR2, and PTGS2 all play a role in the regulation of the inflammatory response which is consistent with our gene ontology analysis which found increase of cytokines and chemokines.
Discussion
Our results suggest that proteins encoded by genes that are differentially expressed in response to fluid flow have lower than expected levels of sequence conservation and putative intrinsic disorder while maintaining typical levels of PPIs. Given their relatively low conservation, they are likely to carry out more specialized (non-essential) functions compared to the functions of proteins encoded by essential genes that have older evolutionary origins and are more conserved34–36. Proteins encoded by these essential genes are typically involved in basic cellular functions including gene expression, metabolism, morphogenesis, cell division, proliferation and differentiation, DNA replication, repair and transcription, and embryonic development and they are essential to the survival of the organism34. At the same time, the levels of interconnectedness of the proteins encoded by differentially expressed genes that are comparable to a typical mouse protein suggests that they are functionally important even though they may not be essential. The significant depletion in the putative intrinsic disorder suggests that they are highly structured and substantially more structured than a typical murine protein. The main implication of this observation is that structures of the proteins encoded by differentially expressed genes should be relatively easy to produce25,37. These structures can be used to decipher molecular level details of their functions38 and they are necessary for virtual screening and design of novel therapeutics39,40. Taken together, our analyses suggest that genes involved in mechanotransduction are non-essential, functionally important, and suitable for structure-based rational drug design. We note that the above observations rely on the putative intrinsic disorder and an incomplete PPI network, both of which may adversely affect accuracy of our analysis. However, we designed the analysis to reduce these effects. We employed a consensus of five predictors of disorder to minimize the prediction error and we used a recently released database that collects PPIs from a comprehensive set of five sources to provide the currently most complete network. Additionally, there are limitations to our fluid flow model that are shared by several in vitro models. These include analyses of only a single bone cell type, the two dimensional nature of our fluid flow apparatus and the fact that the cells are not in contact with a physiologically relevant extracellular matrix.
In this study we used IPA to assess RNA-seq data. We opted to assess RNA-seq data, rather than microarray data, because we previously showed that RNA-Seq yielded greater magnitude fold-changes in expression level than did microarrays, resulting in a broader overall dynamic range3. Furthermore, RNA-seq, compared to microarray was capable of detecting approximately 3 times the gene products of previously utilized microarrays. Our IPA analyses predicted fluid flow-induced changes in cytokines consistent with a down regulation of osteoclastogenesis, which one would expect as a result of exposure to anabolic mechanical signals. Additionally, consistent with our results, osteocytes have been shown to downregulate osteoclast differentiation through effects on TGF-β, an upstream inducer of IL-17, a factor that increases osteoclastogenesis41–43. On the other hand, our previous work showed fluid flow-induced upregulation of pathways dependent on IL-174 and another study showed that mechanical load upregulates TGF-β in mesenchymal stem cells44. These findings emphasize the complexity of the bone microenvironment and suggest the effects of cytokines on bone are highly context dependent. Ultimately, cytokines can have either anabolic or catabolic effects on bone which has complicated efforts at treatment of metabolic bone disease45. This complexity is illustrated by IL-17 which can be upregulated in both catabolic and anabolic states. Phase II trials of IL-17 inhibitors such as secukinumab for rheumatoid arthritis have yielded disappointing results46. Thus, there is a need for further research in this area to characterize the bone microenvironment an how it is affected by mechanical load, especially as regards the pleiotropic effect of cytokines in bone. The complexity of microenvironment also suggest the therapeutic target potential of mechanotransduction pathways because we have shown them to be less essential while still functionally important when compared to the mouse proteome.
In summary, this is the first study to quantify the structural and functional characteristics such as intrinsic disorder, evolutionary conservation, and interconnectedness of the proteins encoded by differentially expressed genes involved in mechanotransduction. These results suggest that the osteocytic signaling pathways activated by fluid flow are non-essential and populated by highly structured proteins. Furthermore, extracellular inflammatory mediators are involved in the downregulation of osteoclast differentiation. The well-defined tertiary structure of proteins encoded by differentially expressed genes in this pathway suggests that protein modeling could be successful in developing a better understanding of the signaling molecules involved in the response of bone to mechanical load.
Methods
Collection of experimental data
In the original study3, murine osteocytic MLO-Y4 cells were cultured in normal growth medium (α-MEM [Invitrogen, Grand Island, NY] with 2.5% calf serum [Hyclone, Logan, UT], 2.5% fetal bovine serum [Lonza, Walkersville, MD], 1% Penicillin/Streptomycin) throughout all portions of the experiment. Cells were seeded 48 h prior to fluid flow on 75 × 38 × 1 mm glass slides coated with 300 μg/ml Type I Collagen (BD Biosciences, Bedford, MA) for 1 h and washed. Cell seeding density was 1.35 × 104 cells/cm2 so that upon flow exposure, cells were roughly 60% confluent and interconnected by dendritic processes. Cells were then exposed to 2 hours of sinusoidally oscillating fluid flow in parallel plate flow chambers, inducing 1 Pa (10 dynes/cm2) shear stress at 1 Hz. Paired sham controls were maintained in identical, static chambers. Triplicates of both flowed and static cells were collected following post-incubation in fresh medium for 2 hours.
Total RNA was isolated using Qiagen RNeasy Mini Kits, deep sequenced using Illumina HiSeq 2500, and processed computationally to identify genes that are differentially expressed between the samples exposed to the fluid flow and the controls4. Similar and complementary analysis was performed using DNA microarrays to identify further gene products that are characterized by differential abundance3. In both cases only the genes characterized by significant differences (p-values < 0.05) were selected. These data are available in the Gene Expression Omnibus with accession number GSE70667 for the RNA-Seq and GSE42874 for the microarrays.
Dataset
Our prior analyses revealed two sets of 55 differentially expressed genes based on the RNA-Seq and microarray experiments. After removing duplicates of the 6 genes that were in common between the two sets, we mapped 99 genes to their UniGene identifiers. The remaining 5 genes could not be mapped to these identifiers. Next, we searched the UniProt resource47,48 to map the UniGene identifiers to the corresponding mouse proteins. Once we removed duplicate proteins and protein fragments, we extracted 103 proteins encoded by the 99 differentially expressed genes that we were able to map into unique UniProt accession numbers. Among the 103 proteins encoded by the differentially expressed genes, 45 and 58 that are significantly upregulated or downregulated by fluid flow, respectively.
Analysis
The 103 proteins encoded by differentially expressed genes in response to fluid flow were subjected to a comprehensive bioinformatics analysis. We contrasted their structural and functional characteristics against the reference mouse proteome collected from UniProt (C57BL/6 J strain, proteome ID: UP000000589). Since the amount of intrinsic disorder in eukaryotic organisms was shown to depend on the protein length49, we randomly selected a matching number of mouse proteins to obtain the same distribution of lengths when compared to our 103 proteins encoded by differentially expressed genes. This ensures that our comparison of intrinsic disorder accommodates sequence-length bias; the same correction was made in several recent studies50,51. Equalization of sample size (103 proteins encoded by differentially expressed genes plus 103 proteins sampled from the reference proteome) also ensures that these data are suitable for robust statistical analysis. This is in contrast to using the whole mouse proteome of over 50 thousand proteins, which would result in over two orders of magnitude difference in sample size relative to proteins differentially expressed in response to fluid flow. The two sets have virtually identical distributions of protein length; p-value = 1 based on the two-sample Kolmogorov-Smirnov test. The difference in their median chain length, which is 412 and 416 for the random set of mouse proteins and our proteins encoded by differentially expressed genes, respectively, is not significant; p-value = 0.97 based on the Mann-Whitney test.
We estimated evolutionary conservation from multiple alignment profiles generated with HHblits against the UniProt-20 database52. These profiles were used to compute relative entropy from each amino acid53,54. An average per-amino acid conservation was used to quantify conservation for each proteins encoded by differentially expressed genes. We collected the mouse PPI network from the mentha resource that integrates data from five manually curated source databases33. With that integrative approach, the mentha resource offers arguably the most complete currently available PPI network in mouse. The degree of interconnectedness was quantified by its number of PPIs. We annotated putative intrinsic disorder using a majority-vote based consensus of five predictors: three versions of the Espritz method55 and two versions of the IUPred method56; These methods are characterized by complementary designs and were empirically demonstrated to offer strong predictive quality for the prediction of intrinsic disorder57–59. We used the consensus to minimize the prediction error, which is in line with an observation that this results in a better predictive quality when compared to the use of individual predictors60,61. Such consensuses are also implemented in the current databases of putative disorder: MobiDB62,63 and D2P2 64, and were utilized in many other studies13,17,21,50,51,65,66. We quantified amount of disorder with the disorder content, which is defined as a fraction of disordered amino acids in a given sequence. We measured statistical significance of differences in distributions of conservation, disorder and interconnectedness values between our proteins of interest and the mouse proteome using a two-sample Kolmogorov-Smirnov test (K-S test) and assumed that a given difference is significant at p-values < 0.05.
We performed functional analysis using GO with the PANTHER system67. PANTHER performs enrichment analysis of GO terms associated with our proteins encoded by differentially expressed genes against the expected frequency of these terms in the mouse proteome68. We executed this analysis separately for each of the three classes of GO terms: molecular functions, biological processes and cellular component (subcellular location). The pathway and causal network analyses was completed using the Ingenuity Pathway Analysis (IPA) tool69.
Acknowledgements
This work was supported by grants from the National Institutes of Health, National Institute of Arthritis and Musculoskeletal and Skin Diseases, R01AR068132-18 (HJD), a VCU School of Engineering Foundation Endowment (HJD), a Scholarship from the China Scholarship Council (FM), the VCU Qimonda Endowment (LK) and National Science Foundation grant 1617369 (LK).
Author Contributions
F.M. collected the data, performed analysis, summarized the results, and contributed to writing the manuscript, G.M. performed data analyses and contributed to writing the manuscript, L.K. conceptualized the study, performed analysis, summarized and analyzed results, and wrote the manuscript, H.D. collected original experimental data, conceptualized the study and wrote the manuscript.
Competing Interests
The authors declare no competing interests.
Footnotes
Fanchi Meng and Graeme F. Murray contributed equally to this work.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Lukasz Kurgan, Email: lkurgan@vcu.edu.
Henry J. Donahue, Email: hjdonahue@vcu.edu
References
- 1.Paluch EK, et al. Mechanotransduction: use the force(s) BMC Biol. 2015;13:47. doi: 10.1186/s12915-015-0150-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bonewald LF. The amazing osteocyte. J Bone Miner Res. 2011;26:229–238. doi: 10.1002/jbmr.320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Govey PM, et al. Integrative transcriptomic and proteomic analysis of osteocytic cells exposed to fluid flow reveals novel mechano-sensitive signaling pathways. J Biomech. 2014;47:1838–1845. doi: 10.1016/j.jbiomech.2014.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Govey PM, Kawasawa YI, Donahue HJ. Mapping the osteocytic cell response to fluid flow using RNA-Seq. Journal of biomechanics. 2015;48:4327–4332. doi: 10.1016/j.jbiomech.2015.10.045. [DOI] [PubMed] [Google Scholar]
- 5.Zhang T, et al. Analysis and prediction of RNA-binding residues using sequence, evolutionary conservation, and predicted secondary structure and solvent accessibility. Curr Protein Pept Sci. 2010;11:609–628. doi: 10.2174/138920310794109193. [DOI] [PubMed] [Google Scholar]
- 6.Chakrabarti S, Lanczycki CJ. Analysis and prediction of functionally important sites in proteins. Protein science: a publication of the Protein Society. 2007;16:4–13. doi: 10.1110/ps.062506407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Fang C, Noguchi T, Yamana H. Analysis of evolutionary conservation patterns and their influence on identifying protein functional sites. J Bioinform Comput Biol. 2014;12:1440003. doi: 10.1142/S0219720014400034. [DOI] [PubMed] [Google Scholar]
- 8.He XL, Zhang JZ. Why do hubs tend to be essential in protein networks? PLoS genetics. 2006;2:826–834. doi: 10.1371/journal.pgen.0020088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zotenko, E., Mestre, J., O’Leary, D. P. & Przytycka, T. M. Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality. Plos Comput Biol4, 10.1371/journal.pcbi.1000140 (2008). [DOI] [PMC free article] [PubMed]
- 10.Han JD, et al. Evidence for dynamically organized modularity in the yeast protein-protein interaction network. Nature. 2004;430:88–93. doi: 10.1038/nature02555. [DOI] [PubMed] [Google Scholar]
- 11.Dunker AK, et al. What’s in a name? Why these proteins are intrinsically disordered. Intrinsically Disordered Proteins. 2013;1:e24157. doi: 10.4161/idp.24157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.van der Lee R, et al. Classification of Intrinsically Disordered Regions and Proteins. Chemical Reviews. 2014;114:6589–6631. doi: 10.1021/cr400525m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Peng Z, et al. Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life. Cell Mol Life Sci. 2015;72:137–151. doi: 10.1007/s00018-014-1661-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tompa P. Intrinsically unstructured proteins. Trends in Biochemical Sciences. 2002;27:527–533. doi: 10.1016/S0968-0004(02)02169-2. [DOI] [PubMed] [Google Scholar]
- 15.Liu J, et al. Intrinsic disorder in transcription factors. Biochemistry. 2006;45:6873–6888. doi: 10.1021/bi0602718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fuxreiter M, et al. Malleable machines take shape in eukaryotic transcriptional regulation. Nat Chem Biol. 2008;4:728–737. doi: 10.1038/nchembio.127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Peng Z, et al. A creature with a hundred waggly tails: intrinsically disordered proteins in the ribosome. Cell Mol Life Sci. 2014;71:1477–1504. doi: 10.1007/s00018-013-1446-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Fuxreiter M, et al. Disordered proteinaceous machines. Chem Rev. 2014;114:6806–6843. doi: 10.1021/cr4007329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN. Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J. 2005;272:5129–5148. doi: 10.1111/j.1742-4658.2005.04948.x. [DOI] [PubMed] [Google Scholar]
- 20.Varadi M, Zsolyomi F, Guharoy M, Tompa P. Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins. PloS one. 2015;10:e0139731. doi: 10.1371/journal.pone.0139731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Wang C, Uversky VN, Kurgan L. Disordered nucleiome: Abundance of intrinsic disorder in the DNA- and RNA-binding proteins in 1121 species from Eukaryota, Bacteria and Archaea. Proteomics. 2016;16:1486–1498. doi: 10.1002/pmic.201500177. [DOI] [PubMed] [Google Scholar]
- 22.Xie H, et al. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J Proteome Res. 2007;6:1882–1898. doi: 10.1021/pr060392u. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Galea CA, Wang Y, Sivakolundu SG, Kriwacki RW. Regulation of cell division by intrinsically unstructured proteins: intrinsic flexibility, modularity, and signaling conduits. Biochemistry. 2008;47:7598–7609. doi: 10.1021/bi8006803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Uversky VN. A decade and a half of protein intrinsic disorder: biology still waits for physics. Protein science: a publication of the Protein Society. 2013;22:693–724. doi: 10.1002/pro.2261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Oldfield CJ, et al. Utilization of protein intrinsic disorder knowledge in structural proteomics. Biochim. Biophys. Acta. 2013;1834:487–498. doi: 10.1016/j.bbapap.2012.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Linding R, et al. Protein disorder prediction: implications for structural proteomics. Structure. 2003;11:1453–1459. doi: 10.1016/j.str.2003.10.002. [DOI] [PubMed] [Google Scholar]
- 27.Mizianty MJ, Kurgan L. Sequence-based prediction of protein crystallization, purification and production propensity. Bioinformatics (Oxford, England) 2011;27:i24–33. doi: 10.1093/bioinformatics/btr229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Jazayeri A, Dias JM, Marshall FH. From G Protein-coupled Receptor Structure Resolution to Rational Drug Design. J Biol Chem. 2015;290:19489–19495. doi: 10.1074/jbc.R115.668251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lounnas V, et al. Current progress in Structure-Based Rational Drug Design marks a new mindset in drug discovery. Comput Struct Biotechnol J. 2013;5:e201302011. doi: 10.5936/csbj.201302011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Gane PJ, Dean PM. Recent advances in structure-based rational drug design. Curr Opin Struct Biol. 2000;10:401–404. doi: 10.1016/S0959-440X(00)00105-6. [DOI] [PubMed] [Google Scholar]
- 31.Ashburner M, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gene Ontology C. Gene Ontology Consortium: going forward. Nucleic Acids Res. 2015;43:D1049–1056. doi: 10.1093/nar/gku1179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Calderone A, Castagnoli L, Cesareni G. Mentha: a resource for browsing integrated protein-interaction networks. Nature Methods. 2013;10:690–691. doi: 10.1038/nmeth.2561. [DOI] [PubMed] [Google Scholar]
- 34.Kabir M, Barradas A, Tzotzos GT, Hentges KE, Doig AJ. Properties of genes essential for mouse development. PLoS One. 2017;12:e0178273. doi: 10.1371/journal.pone.0178273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Jordan IK, Rogozin IB, Wolf YI, Koonin EV. Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res. 2002;12:962–968. doi: 10.1101/gr.87702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Luo H, Gao F, Lin Y. Evolutionary conservation analysis between the essential and nonessential genes in bacterial genomes. Sci. Rep. 2015;5:13210. doi: 10.1038/srep13210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Oldfield CJ, Ulrich EL, Cheng Y, Dunker AK, Markley JL. Addressing the intrinsic disorder bottleneck in structural proteomics. Proteins. 2005;59:444–453. doi: 10.1002/prot.20446. [DOI] [PubMed] [Google Scholar]
- 38.Teichmann SA, Murzin AG, Chothia C. Determination of protein function, evolution and interactions by structural genomics. Curr Opin Struct Biol. 2001;11:354–363. doi: 10.1016/S0959-440X(00)00215-3. [DOI] [PubMed] [Google Scholar]
- 39.Waszkowycz B. Structure-based approaches to drug design and virtual screening. Current opinion in drug discovery & development. 2002;5:407–413. [PubMed] [Google Scholar]
- 40.Rognan D. Structure-Based Approaches to Target Fishing and Ligand Profiling. Mol. Inform. 2010;29:176–187. doi: 10.1002/minf.200900081. [DOI] [PubMed] [Google Scholar]
- 41.Okamoto K, et al. Osteoimmunology: The Conceptual Framework Unifying the Immune and Skeletal Systems. Physiol Rev. 2017;97:1295–1349. doi: 10.1152/physrev.00036.2016. [DOI] [PubMed] [Google Scholar]
- 42.Gaffen SL. The role of interleukin-17 in the pathogenesis of rheumatoid arthritis. Current rheumatology reports. 2009;11:365–370. doi: 10.1007/s11926-009-0052-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Heino TJ, Hentunen TA, Vaananen HK. Osteocytes inhibit osteoclastic bone resorption through transforming growth factor-beta: enhancement by estrogen. J Cell Biochem. 2002;85:185–197. doi: 10.1002/jcb.10109. [DOI] [PubMed] [Google Scholar]
- 44.Gardner OF, Fahy N, Alini M, Stoddart MJ. Joint mimicking mechanical load activates TGFbeta1 in fibrin-poly(ester-urethane) scaffolds seeded with mesenchymal stem cells. J Tissue Eng Regen Med. 2016 doi: 10.1002/term.2210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jung SM, Kim KW, Yang CW, Park SH, Ju JH. Cytokine-mediated bone destruction in rheumatoid arthritis. Journal of immunology research. 2014;2014:263625. doi: 10.1155/2014/263625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Koenders MI, van den Berg WB. Secukinumab for rheumatology: development and its potential place in therapy. Drug design, development and therapy. 2016;10:2069–2080. doi: 10.2147/DDDT.S105263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Apweiler R, et al. UniProt: the Universal Protein knowledgebase. Nucleic Acids Res. 2004;32:D115–119. doi: 10.1093/nar/gkh131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.UniProt C. UniProt: a hub for protein information. Nucleic Acids Res. 2015;43:D204–212. doi: 10.1093/nar/gku989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Howell M, et al. Not That Rigid Midgets and Not So Flexible Giants: On the Abundance and Roles of Intrinsic Disorder in Short and Long Proteins. J Biol Syst. 2012;20:471–511. doi: 10.1142/S0218339012400086. [DOI] [Google Scholar]
- 50.Meng, F., Na, I., Kurgan, L. & Uversky, V. N. Compartmentalization and Functionality of Nuclear Disorder: Intrinsic Disorder and Protein-Protein Interactions in Intra-Nuclear Compartments. Int J Mol Sci17, 10.3390/ijms17010024 (2016). [DOI] [PMC free article] [PubMed]
- 51.Na I, Meng F, Kurgan L, Uversky VN. Autophagy-related intrinsically disordered proteins in intra-nuclear compartments. Mol Biosyst. 2016;12:2798–2817. doi: 10.1039/C6MB00069J. [DOI] [PubMed] [Google Scholar]
- 52.Remmert M, Biegert A, Hauser A, Söding J. HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nature Methods. 2012;9:173–175. doi: 10.1038/nmeth.1818. [DOI] [PubMed] [Google Scholar]
- 53.Wang K, Samudrala R. Incorporating background frequency improves entropy-based residue conservation measures. BMC bioinformatics. 2006;7:385. doi: 10.1186/1471-2105-7-385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Wang K, Samudrala R. Automated functional classification of experimental and predicted protein structures. BMC bioinformatics. 2006;7:278. doi: 10.1186/1471-2105-7-278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Walsh I, Martin AJ, Di Domenico T, Tosatto SC. ESpritz: accurate and fast prediction of protein disorder. Bioinformatics (Oxford, England) 2012;28:503–509. doi: 10.1093/bioinformatics/btr682. [DOI] [PubMed] [Google Scholar]
- 56.Dosztanyi Z, Csizmok V, Tompa P, Simon I. IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics (Oxford, England) 2005;21:3433–3434. doi: 10.1093/bioinformatics/bti541. [DOI] [PubMed] [Google Scholar]
- 57.Peng ZL, Kurgan L. Comprehensive comparative assessment of in-silico predictors of disordered regions. Curr Protein Pept Sci. 2012;13:6–18. doi: 10.2174/138920312799277938. [DOI] [PubMed] [Google Scholar]
- 58.Walsh I, et al. Comprehensive large-scale assessment of intrinsic protein disorder. Bioinformatics (Oxford, England) 2015;31:201–208. doi: 10.1093/bioinformatics/btu625. [DOI] [PubMed] [Google Scholar]
- 59.Meng F, Uversky VN. & Kurgan, L. Comprehensive review of methods for prediction of intrinsic disorder and its molecular functions. Cell Mol Life Sci. 2017 doi: 10.1007/s00018-017-2555-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Peng, Z. & Kurgan, L. On the complementarity of the consensus-based disorder prediction. Pac Symp Biocomput, 176–187 (2012). [PubMed]
- 61.Fan X. & Kurgan, L. Accurate prediction of disorder in protein chains with a comprehensive and empirically designed consensus. Journal of biomolecular structure & dynamics. 2014;32:448–464. doi: 10.1080/07391102.2013.775969. [DOI] [PubMed] [Google Scholar]
- 62.Di Domenico T, Walsh I, Martin AJM, Tosatto SCE. MobiDB: a comprehensive database of intrinsic protein disorder annotations. Bioinformatics (Oxford, England) 2012;28:2080–2081. doi: 10.1093/bioinformatics/bts327. [DOI] [PubMed] [Google Scholar]
- 63.Potenza E, Di Domenico T, Walsh I, Tosatto SC. MobiDB 2.0: an improved database of intrinsically disordered and mobile proteins. Nucleic Acids Res. 2015;43:D315–320. doi: 10.1093/nar/gku982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Oates ME, et al. D(2)P(2): database of disordered protein predictions. Nucleic Acids Res. 2013;41:D508–516. doi: 10.1093/nar/gks1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Hu G, Wu Z, Wang K, Uversky VN, Kurgan L. Untapped Potential of Disordered Proteins in Current Druggable Human Proteome. Curr Drug Targets. 2016;17:1198–1205. doi: 10.2174/1389450116666150722141119. [DOI] [PubMed] [Google Scholar]
- 66.Peng Z, Uversky VN, Kurgan L. Genes Encoding Intrinsic Disorder in Eukaryota Have High GC Content. Intrinsically Disordered Proteins. 2016;4:e1262225. doi: 10.1080/21690707.2016.1262225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Mi H, Muruganujan A, Thomas PD. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 2013;41:D377–386. doi: 10.1093/nar/gks1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Mi H, Muruganujan A, Casagrande JT, Thomas PD. Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 2013;8:1551–1566. doi: 10.1038/nprot.2013.092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Kramer A, Green J, Pollard J, Jr, Tugendreich S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics (Oxford, England) 2014;30:523–530. doi: 10.1093/bioinformatics/btt703. [DOI] [PMC free article] [PubMed] [Google Scholar]