Skip to main content
PLOS One logoLink to PLOS One
. 2024 Mar 28;19(3):e0301022. doi: 10.1371/journal.pone.0301022

Stochastic modeling of a gene regulatory network driving B cell development in germinal centers

Alexey Koshkin 1,2, Ulysse Herbach 3, María Rodríguez Martínez 4, Olivier Gandrillon 1,2,*, Fabien Crauste 5,*
Editor: Nihad AM Al-Rashedi6
PMCID: PMC10977792  PMID: 38547073

Abstract

Germinal centers (GCs) are the key histological structures of the adaptive immune system, responsible for the development and selection of B cells producing high-affinity antibodies against antigens. Due to their level of complexity, unexpected malfunctioning may lead to a range of pathologies, including various malignant formations. One promising way to improve the understanding of malignant transformation is to study the underlying gene regulatory networks (GRNs) associated with cell development and differentiation. Evaluation and inference of the GRN structure from gene expression data is a challenging task in systems biology: recent achievements in single-cell (SC) transcriptomics allow the generation of SC gene expression data, which can be used to sharpen the knowledge on GRN structure. In order to understand whether a particular network of three key gene regulators (BCL6, IRF4, BLIMP1), influenced by two external stimuli signals (surface receptors BCR and CD40), is able to describe GC B cell differentiation, we used a stochastic model to fit SC transcriptomic data from a human lymphoid organ dataset. The model is defined mathematically as a piecewise-deterministic Markov process. We showed that after parameter tuning, the model qualitatively recapitulates mRNA distributions corresponding to GC and plasmablast stages of B cell differentiation. Thus, the model can assist in validating the GRN structure and, in the future, could lead to better understanding of the different types of dysfunction of the regulatory mechanisms.

Introduction

Adaptive immune response is a complex mechanism, relying on B and T lymphocytes, which protects the organism against a range of pathogens. Crucial elements of adaptive immune response, the germinal centers (GCs) are the structures in lymphoid organs where activated naive B cells are expanded (in a dark zone, DZ) and selected (in a light zone, LZ) and can have multiple exit fates, such as antibody production (plasmablasts and plasma cells, PB_PC), long term storage of antigen information (memory B cells, MC), or death via apoptosis [1, 2].

It is currently thought that B cell differentiation in GC is controlled by a small network of transcription factors (TFs) constituted by B-cell lymphoma 6 (BCL6), interferon regulatory factor 4 (IRF4) and PR domain zinc finger protein 1 (BLIMP1) [3]. BCL6 controls formation of GC, terminal differentiation of B cells and lymphomagenesis [4, 5]. BCL6 disturbance can be triggered by several mechanisms, including proteasome degradation by BCR, T-cell-mediated CD40-induced IRF4 repression of BCL6 [4, 6], or disruption of BCL6 autoregulation loop [4, 7]. Transcription factor IRF4 is involved in the termination of GC B cell differentiation, in immunoglobulin class switch recombination (CSR) and plasma cell development [8]. Impairment of IRF4 expression is tightly connected with the appearance of multiple malignancies [8]. BLIMP1 regulates pathways responsible for B cell lineage (e.g., PAX5) and GC proliferation and metabolism (e.g., MYC) [9, 10]. BLIMP1 is also involved in the induction of genes (e.g., XBP-1, ATF6, Ell2) facilitating antibody synthesis [1113]. These three TFs interact, through various activation/inhibition processes: IRF4 represses BCL6 and activates BLIMP1 [14] (hence being essential for GC maturation and B cell differentiation into plasmablast), BLIMP1 and BCL6 mutually repress each other [1518].

Martinez et al. [3] developed a deterministic kinetic ODE model capable of simulating normal and malignant GC exits using a GRN based on these three transcription factors. For the normal differentiation of GC B cells towards PB_PC stage, the kinetic ODE model fits microarray data at two steady-states: the first one associated with the GC stage of B cell differentiation (with high levels of BCL6 and low levels of IRF4 and BLIMP1), and the second one associated with PB_PC stage (with low levels of BCL6 and high levels of IRF4 and BLIMP1).

Recently, multiple protocols for SC RNA-seq data generation have been developed and used to answer various questions in biology [19, 20]. At the same time, different groups showed that gene transcription in eukaryotes is a discontinuous process and follows bursting kinetics [2124]. Such results suggest that the stochastic nature of gene expression at the single cell (SC) level can be partly responsible for the phenotype variation in living organisms [25]. Thus, by gaining access to a stochastic behavior of gene expression, the SC viewpoint may lead to further improvement of the understanding of the biological systems and their variability.

Nevertheless, stochastic modeling of GRNs using SC gene expression data is still in its early stage [26, 27] and has never been studied for GC B cells. Here, we apply a particular class of stochastic models combining deterministic dynamics and random jumps, called piecewise-deterministic Markov processes (PDMPs) [28], to the description of GC B cell differentiation. It is a two-state model of gene expression introduced in [29] that allows a description of the system’s dynamics at the promoter, transcription and translation levels for a given GRN. We apply this model to the GRN made of the three key genes, BCL6, IRF4 and BLIMP1, and simulate single B cell mRNA data [30]. We show that the model can qualitatively simulate the SC mRNA patterns for normal B cell differentiation at GC and PB_PC stages.

Materials and methods

Single-cell data

We used the B cell dataset from human lymphoid organs published by Milpied et al. [30]. The authors studied normal B cell subsets from germinal centers of the human spleen and tonsil and performed integrative SC analysis of gene expression. They used an adapted version of the integrative single-cell analysis protocol [31]. In short, the authors prepared cells for flow cytometry cell sorting. Then in every 96-well plate the authors sorted three to six ten-cell samples of the same phenotype as a single-cell. They performed multiplex qPCR analysis using the Biomark system (Fluidigm) with 96x96 microfluidic chips (fluidigm) and Taqmann assays (Thermofisher) [30]. They obtained results in the form of fixed fluorescence threshold to derive Ct values. We used Ct values to derive Expression threshold (Et) values: Et = 30 − Ct. When there was an unreliably low or undetected expression (Ct > 30), Et was set to zero [30]. Using SC gene expression analysis of a panel of 91 preselected genes and pseudotime analysis (based on the cartesian coordinates of SC on the first and second principal components of the PCA), the authors separated GC DZ cells, GC LZ cells, memory cells and PB_PC cells.

Here we focused on three genes, BCL6, IRF4 and BLIMP1. We selected the SC gene expression values for BCL6, IRF4 and BLIMP1 for GC DZ cells (317 SC) and for PB_PC (104 SC) (see Fig 5). The experimental dataset includes at the GC B cell stage 30 cells with zero BCL6 mRNA amount, 292 cells with zero IRF4 mRNA amount and 292 cells with zero BLIMP1 mRNA amount. For the end of the B cell differentiation (PB_PC), there were 25 cells with zero BCL6 mRNA amount, 79 cells with zero IRF4 and 5 cells with zero BLIMP1 mRNA amount.

Kinetic ODE model

Martinez et al. [3] derived an ODE model that simulates B cell differentiation from mature GC cells towards PB_PC. Dynamics of each protein (BCL6, IRF4 and BLIMP1) are defined by a production rate (μ), a degradation rate (λ), a dissociation constant (k) and a maximum transcription rate (σ). Dynamics are described by System (1)–(3), where p, b and r account for proteins BLIMP1, BCL6 and IRF4, respectively:

dpdt=μp+σpkb2kb2+b2+σpr2kr2+r2-λpp, (1)
dbdt=μb+σbkp2kp2+p2kb2kb2+b2kr2kr2+r2-(λb+BCR)b, (2)
drdt=μr+σrr2kr2+r2+CD40-λrr. (3)

In this model, CD40 and BCR act as stimuli on genes: BCR temporary represses BCL6 and CD40 temporary activates IRF4.

Stochastic model

The stochastic model that describes the coupled dynamics of gene i and the other genes of the GRN is defined by the series of equations:

{Ei(t):0kon,i(P1,P2,P3,Qs)1,1koff,i(P1,P2,P3,Qs)0,Mi(t)=s0,iEi(t)-d0,iMi(t),Pi(t)=s1,iMi(t)-d1,iPi(t), (4)

where Ei(t), Mi(t) and Pi(t) are, respectively, the activation status of the promoter, the quantity of mRNA and the quantity of proteins of gene i, for i ∈ {1, 2, 3}. Each index i refers to one of the gene in the GRN, either BCL6, IRF4, or BLIMP1 (see Table 1). For s ∈ {BCR, CD40}, Qs accounts for external stimuli intensity.

Table 1. Correspondence between gene names and model index.

Index Gene/Stimulus
1 BCL6
2 IRF4
3 BLIMP1

For each gene i, System (4) is defined by the promoter state switching rates kon,i (h−1) and koff,i (h−1), by a degradation rate of mRNA (d0,i, h−1), a protein degradation rate (d1,i, h−1), a transcription rate (s0,i, mRNA × h−1), a translation rate (s1,i, protein × mRNA−1 × h−1), and interaction parameters θw,i with either gene (w = 1, 2, 3) or stimulus (w = BCR, CD40). Interactions between genes are based on the assumption that kon,i and koff,i are functions of the proteins P1, P2, P3 and stimuli Qs. Parameter kon,i is given by:

kon,i(P1,P2,P3,Qs)=kon,imin+kon,imaxβiΦi(P1,P2,P3,Qs)1+βiΦi(P1,P2,P3,Qs) (5)

where

Φi(P1,P2,P3,Qs)=s=BCRCD401+eθs,iQs/Hs,i1+Qs/Hs,ij=131+eθj,i(Pj/Hj,i)γ1+(Pj/Hj,i)γ. (6)

Parameter Hj,i in (6) represents an interaction threshold for the protein j on gene i and Hs,i an interaction threshold for stimulus s on gene i, while in (5) βi is a scaling parameter. For defining koff,i, all θi,j values must be replaced by -θi,j in (6). The structure of System (4)–(6) for the particular network considered in this paper is illustrated in Fig 1.

Fig 1. Schematic representations of the GRN and the stochastic model.

Fig 1

A) Schematic representation of the three-gene GRN involved in B cell differentiation. It consists of BCL6 (gene 1), IRF4 (gene 2) and BLIMP1 (gene 3), and with stimuli BCR and CD40 acting on the network. The interaction ji between a regulating protein j and a target gene i is represented by the interaction parameter θj,i. B) Schematic representation of the associated stochastic model. A gene is represented by its promoter state (dashed rectangle), which can switch randomly from on to off (and vice versa), with rates kon,i (koff,i). When promoter state is on, mRNA molecules are continuously produced at s0,i rate. Proteins are constantly translated from mRNA at s1,i rate. Parameters d0,i and d1,i are degradation rates of mRNA and proteins. The interaction between a regulator gene j and a target gene i is defined by the dependence of both kon,i and koff,i on the protein level Pj and the interaction parameter θj,i. IRF4 gene exhibits an autoactivation loop (θ2,2). BCL6 gene exhibits an autorepression loop (θ1,1). Additionally, two external stimuli, BCR and CD40, act on the GRN.

A detailed derivation of the model is presented in the supplementary material of [29]. Starting from a simple biochemical model of gene expression, the authors described higher-order interactions and took into consideration possible auto-activations. After normalization and simplification steps, Herbach et al. [29] and Bonnaffoux et al. [32] described the promoter switching rates kon,i and koff,i in the form of (5) and (6) by introducing the scaling parameter βi. Following the approach in [32], the values of βi were computed when initializing the simulation, in order to set the values of parameters kon,i and koff,i to their initial values. Parameter γ was set to a default value, equal to 2, and values of kon,imax and koff,imax were estimated by the method of moments and bootstrapping as previously described by Bonnaffoux et al. [32]. These parameters were no longer modified through this study.

It can be noted that the promoter state evolution of gene i between times t and t + δt in System (4)–(6) is defined, for small δt, as a Bernoulli-distributed random variable [29, 32]:

Ei(t+δt)Bernoulli(πi(t)),

where probability πi(t), derived by solving the master equation [29, 33], is given by

πi(t)=Ei(t)e-δt(kon,i+koff,i)+kon,ikon,i+koff,i(1-e-δt(kon,i+koff,i)).

It follows that the promoter state of gene i averages to kon,i/(kon,i + koff,i) in the fast promoter regime (kon,i + koff,i ≫ 1/δt). This quantity will be used to reduce System (4)–(6) into an ordinary differential equation (ODE) system in Section.

Simulating the stochastic model

During B cell differentiation in GC, B cells first receive BCR signal, through follicular dendritic cells interaction, that represses BCL6. Then, B cells integrate CD40 signals, through T follicular helper, that activate IRF4 [3, 6, 34].

In order to simulate these interactions, we assumed that BCR was acting on BCL6 from 0h until 25h, and CD40 was acting on IRF4 from until 61h. Stimuli were implemented in three steps: first a linear increase (tBCR ∈ [0.5h; 1.5h]; tCD40 ∈ [35h; 36h]), then a stable stimulus (tBCR ∈ [1.5h; 24h]; tCD40 ∈ [36h; 60h]), finally a linear decrease (tBCR ∈ [24h; 25h]; tCD40 ∈ [60h; 61h]) (see S1 Fig).

In all simulations, the system evolves for 500h so it can reach a steady state before applying the stimuli (at time t = 0h). After the first stimulus (BCR) is applied, the system is simulated for an additional 500h. For each simulation, the amounts of mRNA counts have been collected every 0.5h.

Values of parameters defining the stochastic system (4)–(6) are given in Tables 2 to 5.

Table 2. Parameter set of the stochastic model (4)–(6) and reduced model (9) that are the same in all versions.

Parameter Version I, II, III
H 1,2 1
H 3,2 1
H 3,3 1
θ 1,1 -0.2
θ 1,2 0
θ 3,2 0
θ 1,3 -1
θ 3,3 0
s 0,BCL6 100
d 0,BCL6 0.05
d 0,IRF4 0.05
s 1,BCL6 100
s 1,IRF4 160
s 1,BLIMP1 40
d 1,BCL6 0.138
d 1,IRF4 0.173
d 1,BLIMP1 0.173
k off,init,BCL6 1
k off,init,IRF4 1
k off,init,BLIMP1 1

Version I—initial parameter set. Version II—parameter set obtained from the automatized approach. Version III—parameter set obtained after semi-manual tuning. Parameters are defined in the text.

Table 5. Parameter set of the stochastic model (4)–(6) and reduced model (9) that are equal between versions II and III.

Parameter Version I Version II, III
H 1,1 1 0.1
H 2,1 0.1 0.01
H 2,3 0.01 0.1
H 3,1 1 0.01
k on,init,BCL6 0.1 0.15
k on,init,IRF4 0.0017 0.007
k on,init,BLIMP1 0.1 0.001

Version I—initial parameter set. Version II—parameter set obtained from the automatized approach. Version III—parameter set obtained after semi-manual tuning. Parameters are defined in the text.

Table 3. Parameter set of the stochastic model (4)–(6) and reduced model (9) that are different between all versions.

Parameter Version I Version II Version III
H 1,3 0.1 1 0.01
H BCR,1 0.01 1 0.001
H CD40,2 1 0.001 1

Version I—initial parameter set. Version II—parameter set obtained from the automatized approach. Version III—parameter set obtained after semi-manual tuning. Parameters are defined in the text.

Table 4. Parameter set of the stochastic model (4)–(6) and reduced model (9) that are equal between versions I and II.

Parameter Version I, II Version III
H 2,2 0.01 0.1
θ 2,1 -100 -50
θ 2,2 5 11
θ 2,3 40 50
θ 3,1 -20 -0.5
θ BCR,1 -20 -200
θ CD40,2 40 10
d 0,BLIMP1 0.1733 0.007
s 0,IRF4 1 2.1
s 0,BLIMP1 1 100

Version I—initial parameter set. Version II—parameter set obtained from the automatized approach. Version III—parameter set obtained after semi-manual tuning. Parameters are defined in the text.

Model execution in a computational center

All models were established as part of the WASABI pipeline [32] and were implemented in Python 3. All computations were performed using the computational center of IN2P3 (Villeurbanne/France).

Tuning of the PDMP model

Parameters estimation for the ODE-reduced model

In Section, we use a reduced, deterministic version of System (4)–(6), namely System (9). Initial guess of each parameter has been chosen randomly in the same order of magnitude as in Bonnaffoux et al. [32]. Specifically, the initial value of kon for IRF4 (kon,init,IRF4) has been estimated by comparison with values of the kinetic model from Martinez et al. [3]. Initial values of kon for BCL6 and BLIMP1 were selected in the same order of magnitude as kon,init,IRF4.

Estimation of the parameters for the stochastic model: Automatized approach

After we have established the parameters for the reduced model (9), and we have shown that (9) has two steady states, we used these values as initial guess for the stochastic model (4)–(6). The goal was then to further tune parameter values so the stochastic model (4)–(6) fits the experimental SC data.

We investigated a possible effect of Hw,i values (w = 1, 2, 3, BCR, CD40), θj,i values and kon,init values on the quality of the fitting (for each parameter combination, simulation was performed for 200 SC).

First, let us mention that, based on the network depicted in Fig 1, there is no influence of BCL6 on IRF4, of BLIMP1 on IRF4 and there is no self-activity of BLIMP1 on itself, so parameters θ1,2, θ3,2 and θ3,3 are set to 0 while parameters H1,2, H3,2 and H3,3 are set to 1. Also, BCR acts only on BCL6 and CD40 on IRF4, so θBCR,2 = θBCR,3 = 0, θCD40,1 = θCD40,3 = 0, and only parameters θBCR,1 and θCD40,2 are non-zero.

We tested the values of interaction threshold H1,1, H1,3, H2,1, H2,3 and H3,1 within the set {0.01, 0.1, 1}, and the set {0.0001, 0.001, 0.1, 1, 100} for H2,2, HBCR,1, HCD40,2. We also tested the values of θ1,1, θ1,3, θ2,1, θ2,3 and θ3,1 by multiplying by a factor fθ ∈ {1, 5}, and by multiplying by a factor fθ ∈ {1, 10} for θ2,2, θBCR,1, θCD40,2.

In total we tested two different values of θj,i for 5 interactions (θ1,1, θ1,3, θ2,1, θ2,3, θ3,1), 2 values of θj,i for 3 interactions (θ2,2, θBCR,1, θCD40,2), 3 values of Hj,i for 5 interactions (H1,1, H1,3, H2,1, H2,3, H3,1), and 5 values of Hj,i for 3 interactions (HBCR,1, HCD40,2, H2,2), generating 25 × 23 × 35 × 53 ≈ 7.8 × 106 combinations of parameters. Parameters that do not appear in the previous list have not been tested.

During this automatized tuning procedure, we selected a set of parameter values that allows the system to provide the best fit of the experimental mRNA values for BCL6, IRF4 and BLIMP1 at the GC stage, based on a quality-of-fit criterion. This criterion was defined as a comparison between the average model-derived values (Υ) and the average experimental values (Ω), with an objective function (OF) to minimize for the set of genes G = {BCL6, IRF4, BLIMP1} and stages ST = {GC, PB_PC} defined by

OF=δ=1|G|δ=1|ST||Ωδ,δ-ϒδ,δΩδ,δ|. (7)

The quality-of-fit criterion is then

minPSOF, (8)

where PS is the set of parameter values from Tables 2 to 5.

Estimation of the parameters for the stochastic model: Semi-manual tuning

The automatized estimation procedure was followed by a semi-manual tuning of the parameters of the stochastic model (4)–(6) to improve the quality of the fit.

Values of candidate parameters have been tested in an interval of interest and the rest of the parameter values have been fixed at this stage. After model execution, model-simulated SC values of gene expression were collected. Then we selected the values of the parameters that provided the best qualitative fitting (see Eq (8)) of the experimental SC data. Ranges of tested values are summarised in Table 6.

Table 6. Parameters tested during the semi-manual tuning of the stochastic model.
Parameter Definition Tested values Selected value
θ 1,1 Interaction parameter [−200; −10−2] -0.2
θ 1,3 Interaction parameter [−200; −0.1] -1
θ 2,1 Interaction parameter [−200; −10−2] -50
θ 2,2 Interaction parameter [0.1; 200] 11
θ 2,3 Interaction parameter [0.1; 200] 50
θ 3,1 Interaction parameter [−200; −10−2] -0.5
θ BCR,1 Interaction parameter [−200; −0.1] -200
θ CD40,2 Interaction parameter [0.1; 200] 10
s 0,BCL6 Transcription rate [0.1; 625] 100
s 0,IRF4 Transcription rate [0.1; 625] 2.1
s 0,BLIMP1 Transcription rate [0.1; 625] 100
d 0,BCL6 Degradation rate of mRNA [10−3; 10] 0.05
d 0,IRF4 Degradation rate of mRNA [10−3; 10] 0.05
d 0,BLIMP1 Degradation rate of mRNA [10−3; 10] 0.007
s 1,BCL6 Translation rate [1; 1000] 100
s 1,IRF4 Translation rate [1; 1000] 160
s 1,BLIMP1 Translation rate [1; 1000] 40
d 1,BCL6 Degradation rate of protein [0.1; 10] 0.138
d 1,IRF4 Degradation rate of protein [0.1; 10] 0.173
d 1,BLIMP1 Degradation rate of protein [0.1; 10] 0.173
k on,init,BCL6 Initial value of kon,BCL6 [10−5; 10] 0.15
k on,init,IRF4 Initial value of kon,IRF4 [10−5; 10] 0.007
k on,init,BLIMP1 Initial value of kon,BLIMP1 [10−5; 10] 0.001

Evaluation of model variability using Kantorovich distance

To compare distributions and to evaluate model variability, we used the Kantorovich distance (KD, particular case of Wasserstein distance, WD), as defined by Baba et al. [35] and implemented in Python 3 by Bonnaffoux et al. [32].

Consider two discrete distributions p and q, defined on N bins of equal sizes, and denote by xk the center of the k-th bin. Then the Kantorovich distance between p and q is given by

KD=n=1N|k=1np(xk)-k=1nq(xk)|.

We chose WD because it suggested to be preferable over alternative methods such as Kullback-Leibler (KL) divergence or Jensen-Shannon (JS) divergence [36]. More specifically, WD does not require that distributions belong to the same probability space. At the same time, WD is more tractable and has higher performance compared to KL divergence [37]. JS divergence, in turn, does not provide a gradient for the distributions of non-overlapping domains, compared to WD [36]. Also, because WD is a metric and accounts both for the “cost” for the transfer (distance) and “the number of counts” to transfer, we selected its 1D case of WD (Kantorovich Distance, KD) for comparison of discrete experimental distributions versus model-derived distributions [38].

Results

Reduced model

In [3], Martinez et al. applied the kinetic ODE model (1)–(3) to the BCL6-IRF4-BLIMP1 GRN associated with B cell differentiation and successfully simulated GC B cell dynamics based on microarray data. Before using the complex, stochastic model (4)–(6) to fit SC data, we considered a reduced version of System (4)–(6) that can be compared to model (1)-(3), hence providing an initial guess for a key parameter of the model.

Since model (1)–(3) is deterministic, it is necessary to simplify the stochastic model (4)–(6) to perform a comparison of both models dynamics. We assume, in this section, that the stochastic process E(t) (promoter status) in (4)–(6) equals its mean value, 〈E(t)〉, given by kon/(kon + koff). System (4)–(6) then reduces to

{E(t)=kon(t)kon(t)+koff(t),dMdt=s0E(t)-d0M(t),dPdt=s1M(t)-d1P(t). (9)

Comparing mathematical formulations of systems (1)–(3) and (9), one can see that it is possible to identifiy an initial value of the promoter state E(t) for IRF4 gene in System (9) that will correspond to GC differentiation stage (S1 File). Indeed, after rewriting System (9) in terms of System (1)–(3), we obtained the candidate value of kon,init,IRF4 = 1.7 × 10−3. Using this value of kon,init,IRF4, System (9) successfully simulates two steady states for IRF4, i.e. it recapitulates the qualitative dynamics of System (1)–(3) (Fig 2).

Fig 2. Temporal evolution of mRNA counts generated by the reduced model (9).

Fig 2

Temporal evolutions of IRF4 (A), BCL6 (B) and BLIMP1 (C) (see Fig 1). BCR stimulus was applied from 0h until 25h and CD40 stimulus from 35h until 61h. Microarray gene expression dataset from GEO accession no. GSE12195 was used to estimate model’s parameters (see Tables 2 to 5, version I) and are shown as dots with error bars.

Before application of BCR and CD40 stimuli, the system is at a steady state (simulating GC B cell stage) that corresponds to a low amount of IRF4 and BLIMP1 and a high amount of BCL6 mRNA molecules. After application of both stimuli, the system has transitioned to a second steady state that corresponds to a high number of IRF4 and BLIMP1 mRNA molecules and a low number of BCL6 mRNA molecules. However, it can be noted that for the current parameter set (Tables 25, version I), System (9) underestimates the amount of IRF4 mRNA at both steady states (Fig 2).

Dynamics of System (9) shows the existence of two steady-states for the parameter set from Tables 25, version I. Notably, if we test a random value of kon,init,IRF4 in combination with the parameters from Tables 25, version I (S1 Table), System (9) has only one steady-state (S2 Fig). To our knowledge, there may be more than one set of parameter values associated with two steady states of System (9).

We showed that for the parameter set from Tables 25, version I, the reduced model (9) is capable to qualitatively recapitulating the expected behavior of GC B cell differentiation GRN (Fig 2). Due to the stochastic nature of gene expression, we are hereafter interested in evaluating how stochastic system (4)–(6) is capable of simulating this stochastic behavior in B cell differentiation in GC and recapitulates the shape of the mRNA distributions from the experimental SC dataset.

Stochastic modeling of B cell differentiation

Assessing the variability of the stochastic model

Due to the stochastic nature of the stochastic system (4)–(6), it is important to first evaluate the variability of the model-generated SC data, that is of model’s outputs. Indeed, when one repeatedly simulates a finite number of cells from the stochastic system (4)–(6) for the same parameter value set (Tables 25, version I), the resulting model-derived empirical distributions are slightly different between each run due to the stochasticity of the model. We investigated how strongly shapes of distributions of simulated SC mRNA molecules vary for different executions of model (4)–(6).

We evaluated the level of variability of model (4)–(6) using the Kantorovich distance (KD, see Section). We simulated 200 datasets, each containing 200 single cells, of System (4)–(6) with a fixed parameter set (see Tables 25, version I). We estimated the KD between pairs of simulated datasets (mRNA counts for three genes at GC and PB_PC stages for 200 simulated cells), and obtained a distribution of all KD that we call the model-to-model (m-t-m) distribution (Fig 3). Shapes of m-t-m distributions are different for each gene and stage of differentiation. For instance, for BLIMP1, long tails are observed. We can also notice that the mean value of IRF4 at GC stage is low compared to other genes.

Fig 3. Model-to-model distributions of KD for GC and PB_PC stages and the three genes, BCL6, IRF4, BLIMP1.

Fig 3

Model (4)–(6) was simulated with parameter values from Tables 25, version I. The violin plots show the shapes of the distributions, median value, interquartile range and 1.5x interquartile range of the KD values.

In order to get a more accurate evaluation of the variability in model’s outputs, we plotted distributions of the number of mRNA molecules (model’s outputs) for each node of the GRN with the highest m-t-m distribution at both GC and PB_PC stages (Fig 4). Qualitatively, no difference is detected in the shapes of model-generated distributions. For all 6 nodes, the shapes of distributions are remarkably similar.

Fig 4. Histograms of two model-generated mRNA counts of BCL6, IRF4 and BLIMP1 at GC and PB_PC stages with the highest KD.

Fig 4

The subgraphs A, C, E (resp., B, D, F) represent the relative frequency of cells (y-axis) for log2 (molecule+1) transformed values of BCL6, IRF4 and BLIMP1 (x-axis) at GC (resp., PB_PC) stage. Parameters from Tables 25, version I.

These results suggest that it may be sufficient to perform parameter tuning of the stochastic model (4)–(6) using only one simulation run for each parameter value set.

Initial estimation step based on an automatized approach

Variability of the stochastic model being assessed, and comparison of experimental data and a single model’s output in order to assess their closeness being validated, we now focus on the estimation of parameter values. Model (4)–(6) comprises 40 parameters, so we first apply a straightforward strategy, that we call automatized approach, which consists in solving the stochastic system (4)–(6) for a number of fixed parameter values and selecting the set of parameter values associated with the best fit (see Section Estimation of the parameters for the stochastic model: Automatized approach) of experimental data [30].

Approximately 8 × 106 combinations of parameter values have been tested (see Section Estimation of the parameters for the stochastic model: Automatized approach), then the best set of parameter values has been selected based on the quality of BCL6, IRF4 and BLIMP1 fitting at the GC and PB_PC stages (Eqs (7) and (8)).

Numbers of mRNA molecules estimated by the stochastic model (4)–(6) are in a similar range of magnitude as the experimental SC data (S3 Fig). However, the selected parameter values (Tables 25, version II) generate model-derived mRNA distributions that have sufficient overlap with experimental data for GC stage but insufficient overlap for PB_PC stage (S3 Fig). Indeed, distributions of numbers of mRNA molecules at PB_PC stage mostly underestimate the experimental SC data (S3B, S3D and S3F Fig).

Implementing an automatized approach for estimating parameter values helped to establish a set of parameter values that allows System (4)–(6) to correctly estimate the number of mRNA molecules for 3 out of 6 nodes of the GRN. In order to improve the quality of the fit, a more directed and sensitive tuning of the parameter set is then performed (see Section Estimation of the parameters for the stochastic model: Semi-manual tuning).

Generation of simulated distributions of mRNA counts describing B cell differentiation

Due to the complexity of the stochastic model (4)–(6) that is made of 40 parameters, it is important to identifiy which parameters should be targeted to improve the quality of fit. To do so, we rely on the properties of the GRN (Fig 1A). Thanks to the topological structure of the BCL6-IRF4-BLIMP1 GRN, where IRF4 activates BLIMP1 and autoactivates itself, we hypothesize that System (4)–(6) underestimates the experimental SC data at the PB_PC stage due to low values of the parameters responsible for IRF4 autoactivation (θ2,2, and to a lesser extent s0,IRF4) and BLIMP1 activation by IRF4 (θ2,3). Further, we improved the quality of the fit, in particular of BLIMP1 distribution, by focusing on BLIMP1-related interaction parameters (θ1,3, θ3,1).

Indeed, if IRF4 autoactivation reaction is not efficient enough, there are not enough IRF4 molecules to affect BCL6 and BLIMP1 activity at PB_PC stage. Because IRF4 activity is only impacted by its autoactivation loop, we first modulated values of the parameter related to this reaction (θ2,2). During preliminary tests, we noticed that this reaction is crucial for the transition from GC towards PB_PC stage and that when interaction parameter θ2,2 and transcription rate s0,IRF4 have low absolute values then the system cannot reach PB_PC stage, even after application of the stimuli. It can be explained by the insufficient amount of IRF4 molecules produced (S3C and S3D Fig). On the other hand, when parameters θ2,2 and s0,IRF4 have high values, model (4)–(6) transitions from GC towards PB_PC stage even before application of stimuli, exhibiting an abnormal behavior.

After comparison of the stochastic system (4)–(6) outputs for a range of different θ2,2 and s0,IRF4 values (described in Table 6), we selected the parameter set for which model (4)–(6) correctly fits the IRF4 experimental data at both GC and PB_PC stages. Such model-derived SC pattern is obtained using the values (θ2,2 = 11 and s0,IRF4 = 2.1 molecule.h−1).

We additionally performed simulations to improve the quality of the fitting of BLIMP1 and BCL6 distributions by testing parameters that are directly responsible for the balance between BLIMP1 and BCL6, such as interaction parameters θ1,3, θ3,1 and θ2,3. We also tested parameters which can influence BCL6 and BLIMP1 indirectly, such as transcription rates (s0,BCL6 and s0,BLIMP1), and degradation rates of mRNA (d0,BCL6, d0,IRF4 and d0,BLIMP1).

After comparison of the stochastic system (4)–(6) outputs, we selected the parameters which allow the model to have a qualitative fit of the experimental data for all nodes at GC and PB_PC stages (Fig 5, and Tables 25, version III). For this tuned parameter set, we see that the model (4)–(6) can have a good qualitative fitting of experimental data for all nodes. Results also show that for this parameter set (version III), the stochastic model (4)–(6) fits SC data at the GC stage for BCL6 (Fig 5A). The model-derived empirical distribution of BLIMP1 was capable of showing overlap with experimental data at the PB_PC stage (Fig 5F), but it overestimated the number of BLIMP1 mRNA molecules at the GC stage (Fig 5E).

Fig 5. Histograms of model-generated and experimental mRNA counts of BCL6, IRF4, BLIMP1 at GC and PB_PC stages.

Fig 5

The subgraphs A, C, E (resp., B, D, F) represent the relative frequency of cells (y-axis) for log2 (molecule+1) transformed values of BCL6, IRF4 and BLIMP1 (x-axis) at GC (resp., PB_PC) stage, compared between the model estimations at GC or PB_PC stage (grey) vs the experimental data from GC (blue) or PB_PC (green) B cells. Simulation of 200 single cells were used based on the parameter set, selected after semi-automatized parameter screening (see Tables 25, version III). Performed based on the dataset from Milpied et al. [30].

The current parameter set (Tables 25, version III) has difficulties to correctly evaluate the number of zero values. The model (4)–(6) tends to overestimate the number of BCL6 mRNA molecules at PB_PC stage, as well as the number of IRF4 mRNA molecules at GC stage and number of BLIMP1 mRNA molecules at GC stage (Fig 5). Nevertheless, this parameter set allowed the model to generate SC data with a similar level of magnitude of the amount of mRNA as experimentally observed.

Discussion

In this work, we applied a particular class of stochastic models combining deterministic dynamics and random jumps to the simulation of SC data from two stages of B cell differentiation in germinal centers.

We first defined a reduced model (9) whose dynamics were compared to the ones of the kinetic model (1)–(3) and we established an initial parameter value for the key parameter kon,init,IRF4. We then showed that for a given parameter set (Table 25, Version I), the reduced model (9) admits two steady states. Secondly, we evaluated the effect of stochasticity on multiple independent generations of the number of mRNA molecules by the stochastic model (4)–(6) and we confirmed that for the same parameter set there is no noticeable difference between each model-generated outputs for BCL6-IRF4-BLIMP1 GRNs (Fig 4). These results allow performing a combined parameter screening with the confidence that for each candidate parameter set, the algorithm needs to perform only one run of the model (4)–(6). Lastly, we showed that the model (4)–(6) can simulate distributions of the number of mRNA molecules for BCL6, IRF4, BLIMP1 at GC and PB_PC stages with the same order of magnitude as experimental data. However, as future scope of this work, a few strategies to improve the final parameter value set (Tables 25, version III) can be investigated.

Since in BCL6-IRF4-BLIMP1 GRN, IRF4 activity depends only on its autoactivation reaction, we have only succeeded, by writing the reduced model (9) in terms of the kinetic model (1)–(3), in estimating the value of kon,init,IRF4. It would be advantageous to additionally estimate the values of kon,init,BCL6 and kon,init,BLIMP1, using the same logic. However, because BLIMP1 depends on BLIMP1, IRF4 and BCL6 (see Eq (1)) and BCL6 depends on both IRF4 and BLIMP1 (Eq (2)), the rewriting of system (4)–(6) in terms of (1)–(3) would require additional calculations and simplifications.

The effect of mutual repression between BCL6 and BLIMP1 could be evaluated by performing a more extensive parameter value search. The current parameter value set (Tables 25, version III) makes model (4)–(6) overestimate the number of mRNA molecules of BLIMP1 at GC stage. Increasing BCL6 repression of BLIMP1 could potentially decrease the quantity of BLIMP1 at the GC stage.

The effect of the duration of the BCR and CD40 stimuli on the differentiation from GC B cells towards PB_PC could be investigated. Multiscale modeling of GCs performed by Tejero et al. [39] showed that CD40 signalling in combination with the asymmetric division of B cells results in a switch from memory B cells to plasmablasts. It would be relevant to evaluate a possible application of the stochastic model to study the effect of combined CD40 and BCR signaling with different intensities and durations at the SC level.

Additionally, one can evaluate the impact of including additional genes into the BCL6-IRF4-BLIMP1 GRN on the quality of data fitting by the stochastic model. One of the possible candidates to incorporation in the GRN is PAX5, which plays an important role in directing lymphoid progenitors towards B cell development [40]. PAX5 positively regulates IRF8 and BACH2, which indirectly positively regulate IRF4 and negatively regulate BLIMP1 at an early stage of B cell differentiation. During further development, BLIMP1 starts to repress PAX5, consequently decreasing the expression of IRF8 and BACH2. The correct orchestration of PAX5-IRF8-BACH2 during B cell differentiation is important for the successful differentiation towards antibody producing cells (PB_PC), while its malfunction can cause aberration in GC B cell development [41].

CD40 stimulation of B cells initiates NF-κB signaling which is associated with cellular proliferation. In B cells, NF-κB activates IRF4, negatively regulates BACH2, what leads to positive regulation of BLIMP1 and consecutive repression of BCL6 [4, 34].

Another important transcription factor in GC development is MYC, which regulates B cell proliferation [42] and the DZ B cell phenotype [43]. MYC indirectly activates the histone methyltransferase enhancer of zeste homologue 2 (EZH2), which is responsible for the repression of IRF4 and BLIMP1 [4447].

The transcription factors mentioned above are present in the SC RT-qPCR dataset from Milpied et al. [30] that we used and could be used to extend the current GRN. Inclusion of additional transcription factors may have both positive and negative effects on the application of model (4)–(6). On one side, it can increase the computational time and the number of parameters required for simulating System (4)–(6). On the other side, because the inclusion of transcription factors can more precisely describe the biological system it could improve the quality of the fitting. However, any inclusion of new nodes to GRN should be carefully evaluated and only essential transcription factors should be added. For instance, there are no advantages in adding a transcrption factor that would only have one downstream output. As an example, MYC activates E2F1 and further activates EZH2. For this reason, incorporation of the chain MYC-E2F1-EZH2 should have a similar outcome, as the incorporation of the simplified MYC-EZH2 reaction. This is expected because in the modeling, intermediate elements of one-to-one redundant reactions can be omitted without significant changes in the quality of the simulations.

To further continue our study, we could also use SC RNA-seq dataset from Milpied et. al [30]. The authors have produced SC RNA-seq dataset from GC B cells and analysed the similarities between SC RNA-seq and SC RT-qPCR dataset. Even though the gene-gene correlation levels were lower in SC RNA-seq compared to SC RT-qPCR, SC RNA-seq analysis confirmed the observation obtained by SC RT-qPCR [30]. From the stochastic modeling perspective, combining the data from SC RT-qPCR and SC RNA-seq should improve our understanding of the SC dataset variability and the quality of the fitting.

To summarise, the stochastic model (4)–(6) is capable of qualitatively simulating and depicting the stochasticity of experimental SC gene expression data of human B cell differentiation at the GC and PB_PC stages using a GRN made of three-key genes (BCL6, IRF4, BLIMP1). These results are encouraging, and suggest that our model may be used to test the different B cell exits from GC. Future steps may include testing of the model (4)–(6) on alternative SC datasets [4850] and investigating the malignant formations, by evaluating differences of the associated GRN compared to the normal B cell differentiation from GC towards PB_PC.

Supporting information

S1 Fig. Scheme of application of the stimuli Qs, where s ∈ {BCR, CD40}.

(TIF)

pone.0301022.s001.tif (241KB, tif)
S2 Fig. Absence of bistability in model (9).

(TIF)

pone.0301022.s002.tif (582.1KB, tif)
S3 Fig. Histograms of model-generated and experimental mRNA counts of BCL6, IRF4, BLIMP1 at GC and PB_PC stages.

(TIF)

pone.0301022.s003.tif (847.3KB, tif)
S1 File. Modeling.

This file introduces the methodology for reducing the stochastic model and estimating the initial activation rate.

(PDF)

pone.0301022.s004.pdf (107.7KB, pdf)
S1 Table. Parameters of system (9) with values accordingly to Bonnaffoux et al. [32].

(PDF)

pone.0301022.s005.pdf (76.7KB, pdf)

Acknowledgments

We thank Arnaud Bonnaffoux and Matteo Bouvier for their help with the WASABI framework and their critical reading of the manuscript. We thank the computational center of IN2P3 (Villeurbanne/France), especially Gino Marchetti and Renaud Vernet. We also thank Aurelien Pélissier and Elias Ventre for their scientific discussions.

Data Availability

All biological data are available as Supporting information in the original publications: - Fig 2: Martinez et al. (2012) Quantitative modeling of the terminal differentiation of B cells and mechanisms of lymphomagenesis. Proc Natl Acad Sci 09(7): 2672-2677. - Figs 4&5: Milpied et al. (2018) Human germinal center transcriptional programs are desynchronized in B cell lymphoma. Nat Immunol 19(9): 1013-1024.

Funding Statement

This work was supported by the COSMIC grant (www.cosmic-h2020.eu) which received funding from European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 765158. There was no additional external funding received for this study.

References

  • 1. Kurosaki T, Kometani K, Ise W. Memory B cells. Nat Rev Immunol 2015; 15(3): 149–159. doi: 10.1038/nri3802 [DOI] [PubMed] [Google Scholar]
  • 2. Mesin L, Ersching J, Victora GD. Germinal center B cell dynamics. Immunity 2016; 45(3): 471–482. doi: 10.1016/j.immuni.2016.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Martínez MR, Corradin A, Klein U, Álvarez MJ, Toffolo GM, di Camillo B, et al. Quantitative modeling of the terminal differentiation of B cells and mechanisms of lymphomagenesis. Proc Natl Acad Sci 2012; 109(7): 2672–2677. doi: 10.1073/pnas.1113019109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Saito M, Gao J, Basso K, Kitagawa Y, Smith PM, Bhagat G, et al. A signaling pathway mediating downregulation of BCL6 in germinal center B cells is blocked by BCL6 gene alterations in B cell lymphoma. Cancer Cell 2007; 12(3): 280–292. doi: 10.1016/j.ccr.2007.08.011 [DOI] [PubMed] [Google Scholar]
  • 5. Song S, Matthias PD. The transcriptional regulation of germinal center formation. Front Immunol 2018; 9: 2026. doi: 10.3389/fimmu.2018.02026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Mittrücker H-W, Matsuyama T, Grossman A, Kündig TM, Potter J, Shahinian A, et al. Requirement for the transcription factor LSIRF/IRF4 for mature B and T lymphocyte function. Science 1997; 275(5299): 540–543. doi: 10.1126/science.275.5299.540 [DOI] [PubMed] [Google Scholar]
  • 7. Pasqualucci L, Migliazza A, Basso K, Houldsworth J, Chaganti RSK, Dalla-Favera R. Mutations of the BCL6 proto-oncogene disrupt its negative autoregulation in diffuse large B-cell lymphoma. Blood 2003; 101(8): 2914–2923. doi: 10.1182/blood-2002-11-3387 [DOI] [PubMed] [Google Scholar]
  • 8. De Silva NS, Simonetti G, Heise N, Klein U. The diverse roles of IRF4 in late germinal center B-cell differentiation. Immunol Rev 2012; 247(1): 73–92. doi: 10.1111/j.1600-065X.2012.01113.x [DOI] [PubMed] [Google Scholar]
  • 9. Lin Y, Wong K-k, Calame K. Repression of c-myc transcription by Blimp-1, an inducer of terminal B cell differentiation. Science 1997; 276(5312): 596–599. doi: 10.1126/science.276.5312.596 [DOI] [PubMed] [Google Scholar]
  • 10. Nutt SL, Fairfax KA, Kallies A. BLIMP1 guides the fate of effector B and T cells. Nat Rev Immunol 2007; 7(12): 923–927. doi: 10.1038/nri2204 [DOI] [PubMed] [Google Scholar]
  • 11. Minnich M, Tagoh H, Bönelt P, Axelsson E, Fischer M, Cebolla B, et al. Multifunctional role of the transcription factor Blimp-1 in coordinating plasma cell differentiation. Nat Immunol 2016; 17(3): 331–343. doi: 10.1038/ni.3349 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Radtke D, Bannard O. Expression of the plasma cell transcriptional regulator Blimp-1 by dark zone germinal center B cells during periods of proliferation. Front Immunol 2019; 9: 3106. doi: 10.3389/fimmu.2018.03106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Tellier J, Shi W, Minnich M, Liao Y, Crawford S, Smyth GK, et al. Blimp-1 controls plasma cell function through the regulation of immunoglobulin secretion and the unfolded protein response. Nat Immunol 2016; 17(3): 323–330. doi: 10.1038/ni.3348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Sciammas R, Shaffer AL, Schatz JH, Zhao H, Staudt LM, Singh H. Graded expression of interferon regulatory factor-4 coordinates isotype switching with plasma cell differentiation. Immunity 2006; 25(2): 225–236. doi: 10.1016/j.immuni.2006.07.009 [DOI] [PubMed] [Google Scholar]
  • 15. Bereshchenko OR, Gu W, Dalla-Favera R. Acetylation inactivates the transcriptional repressor BCL6. Nat Genet 2002; 32(4): 606–613. doi: 10.1038/ng1018 [DOI] [PubMed] [Google Scholar]
  • 16. Pasqualucci L, Dominguez-Sola D, Chiarenza A, Fabbri G, Grunn A, Trifonov V, et al. Inactivating mutations of acetyltransferase genes in B-cell lymphoma. Nature 2011; 471(7337): 189–195. doi: 10.1038/nature09730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Shaffer AL, Lin K-I, Kuo TC, Yu X, Hurt EM, Rosenwald A, et al. Blimp-1 orchestrates plasma cell differentiation by extinguishing the mature B cell gene expression program. Immunity 2002; 17(1): 51–62. doi: 10.1016/S1074-7613(02)00335-7 [DOI] [PubMed] [Google Scholar]
  • 18. Shaffer AL, Yu X, He Y, Boldrick J, Chan EP, Staudt LM. BCL-6 represses genes that function in lymphocyte differentiation, inflammation, and cell cycle control. Immunity 2000; 13(2): 199–212. doi: 10.1016/S1074-7613(00)00020-0 [DOI] [PubMed] [Google Scholar]
  • 19. Chen G, Ning B, Shi T. Single-cell RNA-seq technologies and related computational data analysis. Front Genet 2019; 10: 317. doi: 10.3389/fgene.2019.00317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Nomura S. Single-cell genomics to understand disease pathogenesis. J Hum Genet 2021; 66(1): 75–84. doi: 10.1038/s10038-020-00844-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Chubb JR, Trcek T, Shenoy SM, Singer RH. Transcriptional pulsing of a developmental gene. Curr Biol 2006; 16(10): 1018–1025. doi: 10.1016/j.cub.2006.03.092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Raj A, Peskin CS, Tranchina D, Vargas DY, Tyagi S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol 2006; 4(10): e309. doi: 10.1371/journal.pbio.0040309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Raj A, Rifkin SA, Andersen E, Van Oudenaarden A. Variability in gene expression underlies incomplete penetrance. Nature 2010; 463(7283): 913–918. doi: 10.1038/nature08781 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Suter DM, Molina N, Gatfield D, Schneider K, Schibler U, Naef F. Mammalian genes are transcribed with widely different bursting kinetics. Science 2011; 332(6028): 472–474. doi: 10.1126/science.1198817 [DOI] [PubMed] [Google Scholar]
  • 25. Raj A, Van Oudenaarden A. Nature, nurture, or chance: stochastic gene expression and its consequences. Cell 2008; 135(2): 216–226. doi: 10.1016/j.cell.2008.09.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Dibaeinia P, Sinha S. Sergio: a single-cell expression simulator guided by gene regulatory networks. Cell Systems 2020; 11(3): 252–271. doi: 10.1016/j.cels.2020.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Gallivan CP, Ren H, Read EL. Analysis of single-cell gene pair coexpression landscapes by stochastic kinetic modeling reveals gene-pair interactions in development. Front Genet 2020; 10: 1387. doi: 10.3389/fgene.2019.01387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Davis MHA. Piecewise-deterministic Markov processes: a general class of non-diffusion stochastic models. J R Stat Soc Series B (Methodological) 1984; 46(3): 353–376. [Google Scholar]
  • 29. Herbach U, Bonnaffoux A, Espinasse T, Gandrillon O. Inferring gene regulatory networks from single-cell data: a mechanistic approach. BMC Syst Biol 2017; 11(1): 1–15. doi: 10.1186/s12918-017-0487-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Milpied P, Cervera-Marzal I, Mollichella M-L, Tesson B, Brisou G, Traverse-Glehen A, et al. Human germinal center transcriptional programs are desynchronized in B cell lymphoma. Nat Immunol 2018; 19(9): 1013–1024. doi: 10.1038/s41590-018-0181-4 [DOI] [PubMed] [Google Scholar]
  • 31. McHeyzer-Williams LJ, Milpied PJ, Okitsu SL, McHeyzer-Williams MG. Class-switched memory B cells remodel BCRs within secondary germinal centers. Nat Immunol 2015; 16(3): 296–305. doi: 10.1038/ni.3095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bonnaffoux A, Herbach U, Richard A, Guillemin A, Gonin-Giraud S, Gros P-A, et al. WASABI: a dynamic iterative framework for gene regulatory network inference. BMC Bioinformatics 2019; 20(1): 1–19. doi: 10.1186/s12859-019-2798-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Peccoud J, Ycart B. Markovian modeling of gene-product synthesis. Theor Popul Biol 1995; 48(2): 222–234. doi: 10.1006/tpbi.1995.1027 [DOI] [Google Scholar]
  • 34. Laidlaw BJ, Cyster JG. Transcriptional regulation of memory B cell differentiation. Nat Rev Immunol 2021; 21(4): 209–220. doi: 10.1038/s41577-020-00446-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Baba A, Komatsuzaki T. Construction of effective free energy landscape from single-molecule time series. Proc Natl Acad Sci 2007; 104(49): 19297–19302. doi: 10.1073/pnas.0704167104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kolouri S, Pope PE, Martin CE, Rohde GK. Sliced Wasserstein auto-encoders. In Proceedings of the International Conference on Learning Representations 2018.
  • 37.Rakotomamonjy A, Traoré A, Berar M, Flamary R, Courty N. Distance measure machines. arXiv:1803.00250v3 [Technical Report]. 2018.
  • 38. Peyré G, Cuturi M. Computational optimal transport: With applications to data science. Found Trends Mach Learn 2019; 11(5-6): 355–607. doi: 10.1561/2200000073 [DOI] [Google Scholar]
  • 39. Merino Tejero E, Lashgari D, García-Valiente R, Gao X, Crauste F, Robert PA, et al. Multiscale Modeling of Germinal Center Recapitulates the Temporal Transition From Memory B Cells to Plasma Cells Differentiation as Regulated by Antigen Affinity-Based Tfh Cell Help. Front Immunol 2020; 11. doi: 10.3389/fimmu.2020.620716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Medvedovic J, Ebert A, Tagoh H, Busslinger M. Pax5: a master regulator of B cell development and leukemogenesis. Adv Immunol 2011; 111: 179–206. [DOI] [PubMed] [Google Scholar]
  • 41. Nutt SL, Hodgkin PD, Tarlinton DM, Corcoran LM. The generation of antibody-secreting plasma cells. Nat Rev Immunol 2015; 15(3): 160–171. doi: 10.1038/nri3795 [DOI] [PubMed] [Google Scholar]
  • 42. Calado DP, Sasaki Y, Godinho SA, Pellerin A, Köchert K, Sleckman BP, et al. The cell-cycle regulator c-Myc is essential for the formation and maintenance of germinal centers. Nat Immunol 2012; 13(11): 1092–1100. doi: 10.1038/ni.2418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Tanaka H, Matsumura I, Ezoe S, Satoh Y, Sakamaki T, Albanese C, et al. E2F1 and c-Myc potentiate apoptosis through inhibition of NF-κB activity that facilitates MnSOD-mediated ROS elimination. Mol Cell 2002; 9(5): 1017–1029. doi: 10.1016/S1097-2765(02)00522-1 [DOI] [PubMed] [Google Scholar]
  • 44. Béguelin W, Rivas MA, Fernández MTC, Teater M, Purwada A, Redmond D, et al. EZH2 enables germinal centre formation through epigenetic silencing of CDKN1A and an Rb-E2F1 feedback loop. Nat Commun 2017; 8(1): 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Herviou L, Jourdan M, Martinez A-M, Cavalli G, Moreaux J. EZH2 is overexpressed in transitional preplasmablasts and is involved in human plasma cell differentiation. Leukemia 2019; 33(8): 2047–2060. doi: 10.1038/s41375-019-0392-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Raaphorst FM, van Kemenade FJ, Fieret E, Hamer KM, Satijn DPE, Otte AP, et al. Cutting edge: polycomb gene expression patterns reflect distinct B cell differentiation stages in human germinal centers. J Immunol 2000; 164(1): 1–4. doi: 10.4049/jimmunol.164.1.1 [DOI] [PubMed] [Google Scholar]
  • 47. Velichutina I, Shaknovich R, Geng H, Johnson NA, Gascoyne RD, Melnick AM, et al. EZH2-mediated epigenetic silencing in germinal center B cells contributes to proliferation and lymphomagenesis. Blood 2010; 116(24): 5247–5255. doi: 10.1182/blood-2010-04-280149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Andor N, Simonds EF, Czerwinski DK, Chen J, Grimes SM, Wood-Bouwens C, et al. Single-cell RNA-Seq of follicular lymphoma reveals malignant B-cell types and coexpression of T-cell immune checkpoints Blood 2019; 133(10); 1119–1129. doi: 10.1182/blood-2018-08-862292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Holmes AB, Corinaldesi C, Shen Q, Kumar R, Compagno N, Wang Z, et al. Single-cell analysis of germinal-center B cells informs on lymphoma cell of origin and outcome. J Exp Med 2020; 217(10). doi: 10.1084/jem.20200483 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. King HW, Orban N, Riches JC, Clear AJ, Warnes G, Teichmann SA, et al. Single-cell analysis of human B cell maturation predicts how antibody class switching shapes selection dynamics. Sci Immunol 2021; 6(56): eabe6291. doi: 10.1126/sciimmunol.abe6291 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Nihad AM Al-Rashedi

20 Dec 2023

PONE-D-23-18580Stochastic modeling of a gene regulatory network driving B cell development in germinal centersPLOS ONE

Dear Dr. Crauste,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Your manuscript, "Stochastic modeling of a gene regulatory network driving B cell development in germinal centers" (PONE-D-23-18580), has been assessed by our reviewers. Although it is of interest, we are unable to consider it for publication in its current form. The reviewers have raised a number of mains points that we believe would improve the manuscript and may allow a revised version to be published in PLOS ONE.

Please submit your revised manuscript by Feb 03 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf.

2. Thank you for stating in your Funding Statement:

“AK, UH, MRM, OG, FC. This work was supported by the COSMIC grant (www.cosmic-h2020.eu) which has received funding from European Union’s Horizon 2020 research and innovation program under the Marie Sklodowska-Curie grant agreement no. 765158.”

Please provide an amended statement that declares *all* the funding or sources of support (whether external or internal to your organization) received during this study, as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now.  Please also include the statement “There was no additional external funding received for this study.” in your updated Funding Statement.

Please include your amended Funding Statement within your cover letter. We will change the online submission form on your behalf.

3. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ.

4. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex.

5. We notice that your supplementary figures are uploaded with the file type 'Figure'. Please amend the file type to 'Supporting Information'. Please ensure that each Supporting Information file has a legend listed in the manuscript after the references list.

6. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I Don't Know

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript by Alexey Koshkin et al. deals with the investigation of the gene regulatory networks (GRNs) associated with germinal center (GC) cell development and differentiation based on public available single-cell (SC) transcriptomic data. Including three key gene regulators (BCL6, IRF4, BLIMP1), influenced by two external stimuli signals (surface receptors BCR and CD40), a model was established that qualitatively recapitulates mRNA distributions corresponding to GC and plasmablast stages of B cell differentiation, which can be used in validating the GRN in physiological and pathophysiological conditions.

The manuscript is written well and meets almost all criterias for publishing in PlosOne:

Comments:

The validation (testing) of the model on a test-data set (for example the sc-RNA data set from the same sample source but also an external data set would highly improve the quality of the paper.

Reviewer #2: In this paper the authors use stochastic modeling of a gene regulatory network to fit single cell expression data related to B cell differentiation in germinal centers. This is an important problem which can lead to better understanding of malignancies in B cells. The work is technically sound, and the paper is well-written with the mathematical descriptions and the figures doing a good job of clearly presenting the work to the readers. I would like to point out some revisions that are still needed.

- The benefit of using a stochastic model over a kinetic model, as also the improvements of using Version III over Version II are not fully clear without including a figure like Figure 2 each for the latter two parameter tune cases. For example, in version III PB_PC improves for two genes, however the GC performance becomes worse. So plots showing the mRNA counts like in Figure 2 would show how close the total model predictions are to the observed values.

- Some of the parameters depicted in 4-6 are not present later, for example beta_i, gamma, k^min, k^max. It needs to be clarified how they are replaced (e.g. maybe by k_init) or not used anymore.

- In Pages 6 and 7, line 146-154, the ranges for some parameters are not given, e.g. theta_{1,1}, H_{1,1}, etc. In line 152, aren't there 6 H_{j,i} interactions and 11 theta_{i,j} interactions? The values in Line 153 do not match, so that section needs to corrected so that the number of parameter combinations adds up.

- Why is CD40 stimuli upto 61 instead of 60?

- In Figures 4, 5, S3, some further details about the y axis scale are needed for the reader.

- In Figure 1 caption, k_{off,i} is not shown to be dependent on P_j and theta_ji in the text, only k_{on,i}, so those should match. Also autoactivation loop for BCL6 needs to be mentioned, as it is mentioned just for IRF4.

- Error in Line 363 (antibody producing, not antigen producing) should be corrected.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Mar 28;19(3):e0301022. doi: 10.1371/journal.pone.0301022.r002

Author response to Decision Letter 0


20 Feb 2024

All our responses to reviewers comments are detailed in the Response-to-Reviewers.pdf file attached to this submission.

Attachment

Submitted filename: Response-to-reviewers.pdf

pone.0301022.s006.pdf (88.4KB, pdf)

Decision Letter 1

Nihad AM Al-Rashedi

11 Mar 2024

Stochastic modeling of a gene regulatory network driving B cell development in germinal centers

PONE-D-23-18580R1

Dear Dr. Crauste,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at http://www.editorialmanager.com/pone/ and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript by Alexey Koshkin et al. deals with the investigation of the gene regulatory networks (GRNs) associated with germinal center (GC) cell development and differentiation based on public available single-cell (SC) transcriptomic data. Including three key gene regulators (BCL6, IRF4, BLIMP1), influenced by two external stimuli signals (surface receptors BCR and CD40), a model was established that qualitatively recapitulates mRNA distributions corresponding to GC and plasmablast stages of B cell differentiation, which can be used in validating the GRN in physiological and pathophysiological conditions.

The manuscript is written well and meets almost all criterias for publishing in PlosOne. My concern was

taken into account and the response was accepted by the reviewer. Therefore, the paper is fit for publication in "Plos ONE"

Reviewer #2: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Gourab Ghosh Roy

**********

Acceptance letter

Nihad AM Al-Rashedi

13 Mar 2024

PONE-D-23-18580R1

PLOS ONE

Dear Dr. Crauste,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Nihad A.M Al-Rashedi

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Scheme of application of the stimuli Qs, where s ∈ {BCR, CD40}.

    (TIF)

    pone.0301022.s001.tif (241KB, tif)
    S2 Fig. Absence of bistability in model (9).

    (TIF)

    pone.0301022.s002.tif (582.1KB, tif)
    S3 Fig. Histograms of model-generated and experimental mRNA counts of BCL6, IRF4, BLIMP1 at GC and PB_PC stages.

    (TIF)

    pone.0301022.s003.tif (847.3KB, tif)
    S1 File. Modeling.

    This file introduces the methodology for reducing the stochastic model and estimating the initial activation rate.

    (PDF)

    pone.0301022.s004.pdf (107.7KB, pdf)
    S1 Table. Parameters of system (9) with values accordingly to Bonnaffoux et al. [32].

    (PDF)

    pone.0301022.s005.pdf (76.7KB, pdf)
    Attachment

    Submitted filename: Response-to-reviewers.pdf

    pone.0301022.s006.pdf (88.4KB, pdf)

    Data Availability Statement

    All biological data are available as Supporting information in the original publications: - Fig 2: Martinez et al. (2012) Quantitative modeling of the terminal differentiation of B cells and mechanisms of lymphomagenesis. Proc Natl Acad Sci 09(7): 2672-2677. - Figs 4&5: Milpied et al. (2018) Human germinal center transcriptional programs are desynchronized in B cell lymphoma. Nat Immunol 19(9): 1013-1024.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES