Abstract
Background
Integrins, a family of transmembrane receptor proteins, play complex roles in cancer development and metastasis. These roles could be better delineated through machine learning of transcriptomic data to reveal relationships between integrin expression patterns and cancer.
Methods
We collected publicly available RNA-Seq integrin expression data from 8 healthy tissues and their corresponding tumors, along with data from metastatic breast cancer. We then used machine learning methods, including t-SNE visualization and Random Forest classification, to investigate changes in integrin expression patterns.
Results
Integrin expression varied across tissues and cancers, and between healthy and cancer samples from the same tissue, enabling the creation of models that classify samples by tissue or disease status. We identified the integrins whose expression was important to these classifiers. For example, ITGA7 was key to classification of breast samples by disease status. Analysis in breast tissue revealed that cancer rewires co-expression for most integrins, but the co-expression relationships of some integrins remain unchanged between healthy and cancer samples. Integrin expression in primary breast tumors differed from that in their metastases, with liver metastases notably showing reduced expression.
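As an illustration of what a co-expression comparison between healthy and cancer samples might look like, the sketch below computes pairwise Pearson correlations in each group and flags gene pairs whose correlation shifts strongly. This is not the method used in the study: the use of Pearson correlation, the function names (`pearson`, `coexpression`, `rewired_pairs`), the 0.5 threshold, and all expression values are assumptions made for the example.

```python
from itertools import combinations
from math import sqrt
from statistics import fmean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = fmean(x), fmean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def coexpression(samples, genes):
    """Correlation for every gene pair across a list of samples.

    `samples` is a list of dicts mapping gene name -> expression value.
    """
    columns = {g: [s[g] for s in samples] for g in genes}
    return {
        (a, b): pearson(columns[a], columns[b])
        for a, b in combinations(genes, 2)
    }


def rewired_pairs(healthy, cancer, genes, threshold=0.5):
    """Gene pairs whose correlation differs by more than `threshold`
    (an arbitrary, hypothetical cutoff) between the two groups."""
    r_healthy = coexpression(healthy, genes)
    r_cancer = coexpression(cancer, genes)
    return sorted(
        pair for pair in r_healthy
        if abs(r_healthy[pair] - r_cancer[pair]) > threshold
    )


# Toy, made-up expression values (NOT data from the study).
genes = ["ITGA7", "ITGB1", "ITGAV"]
healthy = [
    {"ITGA7": 1.0, "ITGB1": 2.0, "ITGAV": 5.0},
    {"ITGA7": 2.0, "ITGB1": 4.1, "ITGAV": 4.0},
    {"ITGA7": 3.0, "ITGB1": 5.9, "ITGAV": 3.2},
    {"ITGA7": 4.0, "ITGB1": 8.0, "ITGAV": 1.9},
]
cancer = [
    {"ITGA7": 1.0, "ITGB1": 8.0, "ITGAV": 5.1},
    {"ITGA7": 2.0, "ITGB1": 6.1, "ITGAV": 3.9},
    {"ITGA7": 3.0, "ITGB1": 3.9, "ITGAV": 3.0},
    {"ITGA7": 4.0, "ITGB1": 2.0, "ITGAV": 2.1},
]

changed = rewired_pairs(healthy, cancer, genes)
print(changed)
```

In this toy input, the ITGA7/ITGAV pair keeps nearly the same correlation in both groups and is therefore not flagged, while the two pairs involving ITGB1 flip sign and are reported as rewired.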
Conclusions
Integrin expression patterns vary widely across tissues and are greatly impacted by cancer. Machine learning of these patterns can effectively distinguish samples by tissue or disease status.
Full Text Availability
The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.
