Skip to main content
Philosophical Transactions of the Royal Society B: Biological Sciences logoLink to Philosophical Transactions of the Royal Society B: Biological Sciences
. 2024 Mar 4;379(1900):20230043. doi: 10.1098/rstb.2023.0043

Stochastic processes in development and disease

Dagan Jenkins 1,, Jonathan R Chubb 3, Gabriel Galea 2
PMCID: PMC10909502  PMID: 38432319

1. Introduction

Stochastic processes impact upon many areas of biology and investigating them requires diverse approaches. In April 2023, a Hooke Discussion Meeting (Royal Society, London, UK) brought together scientists with backgrounds in genetics, developmental biology and cancer together with experts from more traditionally quantitative disciplines to combine viewpoints into moulding a sense of where the field is and where it might go. Sessions were organized around cell state and single-cell transcriptomics, mathematical modelling, genetic and epigenetic disease, and cellular signalling and canalization.

Much of the emphasis on stochastic processes in biology has been on gene expression, perhaps because the techniques to measure gene expression are accessible and widely used, rather than effects on gene expression being especially dominant compared to other aspects of cell physiology. Early work reporting seemingly random gene expression goes back at least to the 1950s [1], although the ‘field’ of stochastic gene expression had to wait until the early 2000s for a substantial surge in popularity [2]. A workshop on stochastic processes in development [3] was held in 2010, marrying then-new single-cell technology together with some new developmental phenomena that might have been explainable by stochastic gene expression as an underlying cause. The available technology included imaging methods to study gene expression dynamics in single cells and initial attempts to bring these approaches into both in vitro differentiation models and tissues. However, the subsequent development and extensive use of genome-wide single-cell approaches—primarily to measure single-cell transcriptomes, but more recently other features, such as chromatin state, RNA turnover and the spatial organization of transcript content in complex cell populations—suggested it would be timely to organize another meeting to discuss recent progress and current challenges in this area.

One persistent challenge that needed to be absorbed by the discussion was the fuzziness of nomenclature to describe different sources of stochasticity. The heterogeneity of gene expression in genetically identical cells exposed to the same conditions is often referred to as gene expression noise. A popular operational sub-division of noise into ‘intrinsic’ and ‘extrinsic’ noise [4] is still commonly used. Intrinsic noise describes the variation in the expression between two identical reporter genes within single cells, which is ascribed to the low numbers of molecules (e.g. transcription factors) with control functions on gene expression. By contrast, extrinsic noise reflects the variation in reporter expression between cells in otherwise identical conditions, which would represent, for example, different cell-cycle states, mitochondrial content or signalling histories. While extrinsic noise is generally perceived to have an underlying cause, the sheer complexity of distinguishing mechanisms involved means that extrinsic noise tends to be bundled into intrinsic noise, at least in modelling approaches. While the early experiments comparing reporters in microbes found clear separation of sources of variation, in more complex developmental and disease models, with many hidden variables, these definitions cannot be cleanly applied. The need for consistent use of definitions in different systems was debated and use of operational definitions without sufficient care can be misleading: intrinsic noise is often bundled together with transcriptional bursting—this may be convenient in modelling, but it ignores the reality that transcription is regulated by signalling. Along similar lines, it seems important to be clear whether a disease phenotype becomes penetrant because of molecular noise or some (albeit unknown) chemical or mechanical stress.

To generalize these examples, considering noise within biological systems—whether this be ‘real’ noise or a convenient approach to modelling complex deterministic processes—raises questions about where we are to find the clear causal objects. There is a tendency to look at the gene because there is so much data available at that level and discoveries of mutants for many genes that cause striking developmental and disease phenotypes anchored the gene-as-instructive view. However, variability among genetically identical individuals, ranging from cellular diversity within a multicellular organism to incomplete penetrance for complex morphological traits, forces us to acknowledge that genetic instructions interact with mechanisms and information within the wider system, such that for the most part, there is no privileged scale at which biological functions are determined [5].

2. Variable biological properties can influence biological function positively or negatively

A key challenge in studying probabilistic biological systems remains the identification of intrinsic noise when the measured output is often an emergent property assessed at different length scales and/or later timepoints with multiple potential causal objects. Robert Johnston and co-workers [6] illustrate this through elegant cell fate specification models, including rhodopsin choice by photoreceptors in the Drosophila eye. The proportion of cells which adopt rhodopsin 4 expression is largely consistent between individuals, but their position is variable. However, the stochastic distribution of rhodopsin 4 expression is deterministically controlled by prior expression of a transcription factor, spineless. Stochastic initiation of spineless expression acts as a binary switch, deterministically instructing differentiation. Thus, the stochastic positioning of rhodopsin 4-expressing cells is deterministically controlled by the binary expression of spineless, itself stochastically distributed by the action of as yet unknown earlier events.

A similar trans-scale problem is discussed in the pathological context of congenital malformations. Some congenital malformations correspond to a binary event earlier in development. For example, incomplete closure of the neural tube causes a portion of the central nervous system to remain exposed on the back of the embryo, causing neural tube defects. Even between isogenic individuals, these malformations are often ‘partially penetrant’—affecting some individuals but not others—and can be phenotypically variable, for example in the length of exposed neural tissue. Dagan Jenkins [7] discusses partial penetrance and phenotypic variability in the context of congenital limb malformations, proposing a genetic threshold model to explain their variable emergence. Limb malformations are a particularly tractable system to identify variable outcomes by comparing the left and right sides, which develop largely independently and often show different levels of pathology. Manuscripts in this special issue also discuss pathological contributions of stochastic processes related to cancer. Andrew Teschendorff [8] describes ‘quasi-stochastic’ DNA methylation changes which cells accrue during division and accumulate with age. He proposes an epigenetic ‘clock’ wherein accumulation of gene dysregulation, particularly suppression of tissue-specific transcription factors, increases cell plasticity and the probability of becoming cancerous. A review by Cristina Pina [9] considers stochastic gene expression as a promoter of cancer progression. She discusses a type of leukaemia in which loss of an epigenetic regulator increases transcriptional variability in leukaemic compared with pre-leukaemic cells. This variability can then allow exploration of different phenotypes making the cancer more robust and/or pernicious, for example, by allowing the persistence of rare cells when subject to drug treatment.

This epigenetic repression of gene expression noise suggests evolution of noise-buffering mechanisms. Carla Mulas [10] provides another example of transcriptional noise buffering: post-transcriptional modification of proteins to achieve rapid and coherent transitions in cell state during differentiation of stem cells, providing greater synchronicity than would be expected from the variable levels of their transcripts. Various presentations during the meeting also discussed buffering through redundancy; at the molecular level through feedback loops or complementary branches of gene regulatory networks, at the organelle level by sequestration of ‘excess’ proteins in phase separated compartments [11] and at the cellular level as exemplified by the dispensability of practically any individual cell during vertebrate morphogenesis. Effects at the level of the cell are considered by Pilar Guerrero & Ruben Perez-Carasco [12], who explore mechanical causes and consequences of biological variability, namely the effect of friction on cellular deformations, for example during mitosis.

Active buffering—or adaptation to functionally consequential biological noise—suggests that the sources of variability are either uncontrollable (e.g. ion diffusion rates) or potentially beneficial. James Locke and co-workers [13] review examples of beneficial stochastic variability in germination of isogenic seedlings as ‘bet-hedging’ against catastrophic environmental changes. Veronica van Heyningen [14] discusses further benefits of variability, such as stochastic DNA recombination which generates immunological diversity and the inherent need for genetic diversity to permit evolutionary adaptation. Also discussed in depth was the potential for noise to enable cellular plasticity—although likely a bad thing in a tumour, the potential for some random fluctuations may enhance the impetus for reversal of differentiation for a cell that finds itself in the wrong niche [15]. These and other examples emphasize the importance of quantifying variability in biological parameters as potentially beneficial or pathogenic.

3. Technology

One of the motivations for the meeting was the explosion in single-cell technologies in the past 10 years. Single-cell RNAseq is perhaps, in isolation, not especially good at measuring stochastic gene expression—the data are too noisy, and highly top-sliced, capturing accurate estimates of transcript abundance for only the most strongly expressed genes, although simulations indicate scRNAseq can certainly reflect the outputs of noisy transcription [16]. Rory Maizels [17] writes a thoughtful piece on how these technologies have been further developed and exploited to superimpose temporal information onto the data. Although data on specific genes in single cells may suffer from technical noise, aggregated information on similar cells is providing the potential to predict how differences (random or otherwise) between cells at one time point can map onto phenotypic differences later on in development (or cancer, infection, etc.). The inferred cell trajectories are only predictions, it would of course be useful to test these predictions, by following the individual cells in real time, while imaging their gene expression. In addition, with single-cell proteomics appearing on the horizon, we should also be concerned that much of the variability we observe at the transcript level may not be particularly strong at the protein level, at least in an initial survey [18]. Maizels finishes by implying that it is perhaps not the technology that is limiting us, but our own conceptual frameworks for defining causality.

4. Future outlook

An important outcome of the Hooke meeting and this special issue resulting from it is the raising of new questions. Stochastic effects have been invoked to explain processes ranging from the activity of individual ion channels to the emergence of cancer, yet the field is rarely brought together to compare notes. We hope that conversations started during this meeting will lead to rich interdisciplinary collaborations providing new insights into the causes and consequences of stochastic processes in biology.

Data accessibility

This article does not contain any additional data.

Declaration of AI use

We have not used AI-assisted technologies in creating this article.

Authors' contributions

D.J.: writing—original draft, writing—review and editing; J.C.: writing—original draft, writing—review and editing; G.G.: writing—original draft, writing—review and editing.

All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

Conflict of interest declaration

This theme issue was put together by the Guest Editor team under supervision from the journal's Editorial staff, following the Royal Society's ethical codes and best-practice guidelines. The Guest Editor team invited contributions and handled the review process. Individual Guest Editors were not involved in assessing papers where they had a personal, professional or financial conflict of interest with the authors or the research described. Independent reviewers assessed all papers. Invitation to contribute did not guarantee inclusion.

Funding

We received no funding for this study.

References

  • 1.Novick A, Weiner M. 1957. Enzyme induction as an all-or-none phenomenon. Proc. Natl Acad. Sci. USA 43, 553-566. ( 10.1073/pnas.43.7.553) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Raj A, van Oudenaarden A. 2008. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell 135, 216-226. ( 10.1016/j.cell.2008.09.050) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Oates AC. 2011. What's all the noise about developmental stochasticity? Development 138, 601-607. ( 10.1242/dev.059923) [DOI] [PubMed] [Google Scholar]
  • 4.Elowitz MB, Levine AJ, Siggia ED, Swain PS. 2002. Stochastic gene expression in a single cell. Science 297, 1183-1186. ( 10.1126/science.1070919) [DOI] [PubMed] [Google Scholar]
  • 5.Noble D. 2012. A theory of biological relativity: no privileged level of causation. Interface Focus 2, 55-64. ( 10.1098/rsfs.2011.0067) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ordway AJ, Helt RN, Johnston RJ Jr. 2024. Transcriptional priming and chromatin regulation during stochastic cell fate specification. Phil. Trans. R. Soc. B 379, 20230046. ( 10.1098/rstb.2023.0046) [DOI] [PubMed] [Google Scholar]
  • 7.Jenkins D. 2024. How do stochastic processes and genetic threshold effects explain incomplete penetrance and inform causal disease mechanisms? Phil. Trans. R. Soc. B 379, 20230045. ( 10.1098/rstb.2023.0045) [DOI] [PubMed] [Google Scholar]
  • 8.Teschendorff AE. 2024. On epigenetic stochasticity, entropy and cancer risk. Phil. Trans. R. Soc. B 379, 20230054. ( 10.1098/rstb.2023.0054) [DOI] [PubMed] [Google Scholar]
  • 9.Pina C. 2024. Contributions of transcriptional noise to leukaemia evolution: KAT2A as a case-study. Phil. Trans. R. Soc. B 379, 20230052. ( 10.1098/rstb.2023.0052) [DOI] [PubMed] [Google Scholar]
  • 10.Mulas C. 2024. Control of cell state transitions by post-transcriptional regulation. Phil. Trans. R. Soc. B 379, 20230050. ( 10.1098/rstb.2023.0050) [DOI] [PubMed] [Google Scholar]
  • 11.Klosin A, Oltsch F, Harmon T, Honigmann A, Julicher F, Hyman AA, Zechner C. 2020. Phase separation provides a mechanism to reduce noise in cells. Science 367, 464-468. ( 10.1126/science.aav6691) [DOI] [PubMed] [Google Scholar]
  • 12.Guerrero P, Perez-Carasco R. 2024. Choice of friction coefficient deeply affects tissue behaviour in stochastic epithelial vertex models. Phil. Trans. R. Soc. B 379, 20230051. ( 10.1098/rstb.2023.0051) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Abley K, Goswami R, Locke JCW. 2024. Bet-hedging and variability in plant development: seed germination and beyond. Phil. Trans. R. Soc. B 379, 20230048. ( 10.1098/rstb.2023.0048) [DOI] [PubMed] [Google Scholar]
  • 14.van Heyningen V. 2024. Stochasticity in genetics and gene regulation. Phil. Trans. R. Soc. B 379, 20230476. ( 10.1098/rstb.2023.0476) [DOI] [PubMed] [Google Scholar]
  • 15.Zalc A, Sinha R, Gulati GS, Wesche DJ, Daszczuk P, Swigut T, Weissman IL, Wysocka J. 2021. Reactivation of the pluripotency program precedes formation of the cranial neural crest. Science 371, eabb4776. ( 10.1126/science.abb4776) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Antolovic V, Miermont A, Corrigan AM, Chubb JR. 2017. Generation of single-cell transcript variability by repression. Curr. Biol. 27, 1811-1817. e1813. ( 10.1016/j.cub.2017.05.028) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Maizels RJ. 2024. A dynamical perspective: moving towards mechanism in single-cell transcriptomics. Phil. Trans. R. Soc. B 379, 20230049. ( 10.1098/rstb.2023.0049) [DOI] [PubMed] [Google Scholar]
  • 18.Brunner AD, et al. 2022. Ultra-high sensitivity mass spectrometry quantifies single-cell proteome changes upon perturbation. Mol. Syst. Biol. 18, e10798. ( 10.15252/msb.202110798) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This article does not contain any additional data.


Articles from Philosophical Transactions of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES