Summary
Meta-learning is showing promise in recent genomic studies in oncology. Meta-learning can facilitate transfer learning and reduce the amount of data that is needed in a target domain by transferring knowledge from abundant genomic data in different source domains enabling the use of AI in data scarce scenarios.
Subject terms: Cancer, Computational science
Main
Transfer learning has emerged as a powerful technique in machine learning to train neural network models. In principle transfer learning proceeds in two steps. First, a pretraining stage where abundant labelled data from another domain or task is used to train the model, followed by a fine-tuning stage where the model is further trained in the target domain on the task of interest. Transfer learning enables few-shot learning in the target domain, in which a classifier must adapt to distinguish novel classes not seen during training given only a few examples (called shots) of these classes.1 This has been particularly effective in biomedical data with primarily successful examples in medical imaging.2 More recently, several advanced techniques have been proposed including multi-task learning and meta-learning. In multi-task learning, the model is pretrained on not one, but multiple tasks simultaneously—this encourages the model to learn richer internal representations of the data and improves generalisation in the target domain or task. A related approach called meta-learning can be thought of as “learning how to learn”. In “model agnostic meta learning” introduced in ref. 3, for example, the model is first trained on a distribution of tasks with the objective of encouraging model parameters that can be quickly adapted, using gradient descent, to any task drawn from the task distribution. Thus, the model can quickly learn a new task from a small amount of data in the target domain (Fig. 1).4
The use of meta-learning holds great promise to reduce the amount of biomedical data needed to train predictive models in the target domain of interest. In biomedicine this situation—having few data in the target domain—occurs frequently, as new technologies continue to emerge, such as single-cell DNA and RNA sequencing and spatial transcriptomics, while similar abundant source data is often available from different contexts, tissues, cells or generated from older related technologies. Meta-learning provides a solution for this situation by taking advantage of these source data sets from other organisms, institutes or cell types, and first training a model that is poised to learn a new task before touching the data in the target domain.
Meta-learning and few or zero-shot learning has been widely adopted in computer science applications. In particular, several successful applications have been reported using non-medical image data including multi-class classification of pictures given only a small number of examples for new classes,5 detection of objects with few examples,6 prediction of human motion7 and image segmentation.8 Despite such success, there have not been many successful use cases in biomedicine so far.
Within genomics, a few examples exist that take advantage of meta-learning. Here, I mention three key early adopters of meta-learning in genomics with diverse uses cases in the area of oncology. First, Brbic et al. showed a successful example of the use of meta-learning to predict existing and novel cell types based on single-cell RNA sequencing data.9 The proposed method, MARS, uses deep learning to learn an embedding of labelled cell types, and then uses this in the meta-learning setting to annotate new cell types also across tissues. Next, Ma et al. use meta-learning to model drug response for a singular patient.10 In this study, the investigators take advantage of the huge amounts of genetic perturbation data that has been generated in cell lines and translate this to benefit individual patients in different contexts with likely much less data available. Ma et al. show that meta-learning works in different contexts, transferring between tissues and different culture systems ranging from cell lines to patient-derived tumour cell cultures in vitro and patient-derived tumour xenografts in vivo, contexts where it is very costly to collect large volumes of data. In a final example, we have used meta-learning on gene expression profiles from RNA sequencing data to predict clinical outcome of cancer patients.4 We show that meta-learning outperforms regular transfer learning and direct learning when predicting survival outcome in a target cancer using data from 33 other cancer sites as source data, reducing the amount of data that is needed in the target domain.
The common theme in each of these applications is that meta-learning is successful in drastically reducing the amount of data that is needed in the target domain. Qiu et al. show that one order of magnitude less gene expression profiles is needed to train an optimal model predictive of clinical outcome using meta-learning.4 These success stories show highly promising applications particularly in the context of rare diseases. For example, paediatric oncology has lacked behind in terms of new discoveries compared with adult oncology, in most cases due to fact that cancers are far more common in adults than in children. Therefore, large scale efforts such as The Cancer Genome Atlas (TCGA) and The Clinical Proteomic Tumor Analysis Consortium (CPTAC) have focused on adult cancers and paediatric efforts such as the Child Brain Tumor Tissue Consortium (CBTTC) have more difficulty collecting large sets of samples. Meta-learning provides opportunities to take advantage of abundant adult data and transfer knowledge to solve questions in paediatric oncology.
One important area of further research is to determine when meta-learning will be successful. What metrics exist that can tell whether the source domain data is sufficient for the target of interest? Therefore, still much work has to be done beyond these studies to test meta-learning strategies across broader applications areas in genomics in oncology and also beyond.
Acknowledgements
Thanks to Dr. Yeping Lina Qiu and Dr. Pritam Mukherjee for suggestions and edits to this commentary.
Author contributions
O.G. performed literature review and wrote the manuscript.
Ethics approval and consent to participate
Not applicable.
Consent to publish
Not applicable.
Data availability
Not applicable.
Competing interests
The author declares no competing interests.
Funding information
O.G. is supported by the National Institute of Biomedical Imaging and Bioengineering of the National Institutes of Health under Award Number R56 EB020527. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Fei-Fei L, Fergus R, Perona P. One-shot learning of object categories. IEEE Trans. Pattern Anal. Mach. Intell. 2006;28:594–611. doi: 10.1109/TPAMI.2006.79. [DOI] [PubMed] [Google Scholar]
- 2.Raghu, M., Zhang, C., Kleinberg, J. & Bengio, S. Transfusion: understanding transfer learning for medical imaging. Preprint at https://arxiv.org/abs/1902.07208 (2019).
- 3.Finn, C., Abbeel, P. & Levine, S. Model-agnostic meta-learning for fast adaptation of deep networks. in International Conference on Machine Learning. (eds Precup, D. & Teh, Y. W.) 1126–1135 (PMLR, 2017).
- 4.Qiu YL, Zheng H, Devos A, Selby H, Gevaert O. A meta-learning approach for genomic survival analysis. Nat. Commun. 2020;11:6350. doi: 10.1038/s41467-020-20167-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Finn, C., Abbeel, P. & Levine, S. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. in Proceedings of the 34th International Conference on Machine Learning. Vol. 70 (eds Precup, D. & Teh, Y. W.) 1126–1135 (PMLR, International Convention Centre, Sydney, 2017).
- 6.Pérez-Rúa, J.-M., Zhu, X., Hospedales, T. M. & Xiang, T. Incremental few-shot object detection. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 13843–13852 (IEEE, Seattle, WA, 2020).
- 7.Liang-Yan, G., Wang, Y.-X., Ramanan, D. & Moura, J. M. Few-shot human motion prediction via meta-learning. in Proceedings of the European Conference on Computer Vision (ECCV). (eds Ferrari, V., Hebert, M., Sminchisescu, C. & Weiss, Y.) 432–450 (Springer, 2018).
- 8.Shaban, A., Bansal, S., Liu, Z., Essa, I. & Boots, B. One-shot learning for semantic segmentation. Preprint at https://arxiv.org/abs/1709.03410 (2017).
- 9.Brbic M, Zitnik M, Wang S, Pisco AO, Altman RB, Darmanis S, et al. MARS: discovering novel cell types across heterogeneous single-cell experiments. Nat. Methods. 2020;17:1200–1206. doi: 10.1038/s41592-020-00979-3. [DOI] [PubMed] [Google Scholar]
- 10.Ma, J., Fong, S. H., Luo, Y., Bakkenist, C. J., Shen, J. P., Mourragui, S. et al. Few-shot learning creates predictive models of drug response that translate from high-throughput screens to individual patients. Nat. Cancer2, 233–244 (2021). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Not applicable.