Abstract
Replication errors influence mutations, and thus, lifetime cancer risk can be explained by the number of stem-cell divisions. Additionally, mutagens also affect cancer risk, for instance, high-dose radiation exposure increases lifetime cancer risk. However, the influence of low-dose radiation exposure is still unclear because this influence, if any, is very slight. We can assess the minimal influence of the mutagen by virtually comparing the states with and without mutagen using a mathematical model. Here, we constructed a mathematical model to assess the influence of replication errors and mutagens on cancer risk. In our model, replication errors occur with a certain probability during cell division. Mutagens cause mutations at a constant rate. Cell division is arrested when the number of cells reaches the capacity of the cell pool. When the number of cells decreases because of cell death or other reasons, cells resume division. It was assumed that the mutations of cancer driver genes occur stochastically with each mutation and that cancer occurs when the number of cancer driver gene mutations exceeds a certain threshold. We approximated the number of mutations caused by errors and mutagens. Then, we examined whether cancer registry data on cancer risk can be explained only through replication errors. Although the risk of leukemia was not fitted to the model, the risks of esophageal, liver, thyroid, pancreatic, colon, breast, and prostate cancers were explained only by replication errors. Even if the risk was explained by replication errors, the estimated parameters did not always agree with previously reported values. For example, the estimated number of cancer driver genes in lung cancer was larger than the previously reported values. This discrepancy can be partly resolved by assuming the influence of mutagen. First, the influence of mutagens was analyzed using various parameters. The model predicted that the influence of mutagens will appear earlier, when the turnover rate of the tissue is higher and fewer mutations of cancer driver genes were necessary for carcinogenesis. Next, the parameters of lung cancer were re-estimated assuming the influence of mutagens. The estimated parameters were closer to the previously reported values. than when considering only replication errors. Although it may be useful to explain cancer risk by replication errors, it would be biologically more plausible to consider mutagens in cancers in which the effects of mutagens are apparent.
Introduction
Nordling showed that age-dependent cancer death-rate becomes a straight line on the doubly logarithmic plane [1]. This is mathematically formularized and called the multistage theory of carcinogenesis [2]. This theory assumes that some successive mutations cause carcinogenesis, and each mutation arises with a certain probability depending on time. Then, the logarithm of the incidence rate increases linearly with the logarithm of the time unit, such as age. The slope of the line on the doubly logarithmic plane is used for estimating the number of mutations that are necessary for carcinogenesis. It was estimated that six or seven successive mutations are necessary for the mortality of cancer if the probability of mutation remains constant throughout life. Nowadays, it is known that the number of driver mutations in the tumor has a certain distribution, and it can be smaller than seven [3]. In addition, a different mathematical model showed that the risk of chronic myeloid leukemia could be explained by a single mutation [4]. Although molecular genetic data and the mathematical model have been updated, the fundamental concept that one or more mutations cause carcinogenesis is predominant. One of the important points of the multistage theory is that mutations arise depending on time. In other words, the number of mutations increases in an age-dependent manner with the number of cell divisions. Tomasetti and Vogelstein showed that cancer risk variation is explained by the number of stem-cell divisions [5]. They estimated the total number of stem-cell divisions and investigated its correlation with the lifetime risk of different types of cancers. They found that the risk of cancers and the number of cell divisions were strongly correlated, implying that cancer driver gene mutations are caused by cell divisions. Therefore, based on the multistage theory of carcinogenesis, not only the lifetime risk of cancer but also age-dependent cancer risk might be explained by replication errors. A study focused on cell division and cancer risk to explain the risk of colorectal cancer assuming that the rate of mutation is constant [6]. If the division rate of cells is constant, the assumption would be reasonable. However, the rate of cell division can change with age [5, 7]. Especially, cell division during the development of tissues might affect cancer risk [5]. Therefore, it is prudent to formularize assuming that the rate of cell proliferation varies with age.
Even if cancer risk is explained by the number of cell divisions, some mutagens such as ionizing radiation and smoking influence it. A study showed that the number of stem-cell divisions is not correlated with radiation- or smoking-associated cancer risk [8]. For example, a life-span study of atomic bomb survivors showed that the risk of cancer mortality increased significantly for some tissues such as the stomach, lung, liver, colon, breast, and gallbladder [9]. Stem and early progenitor cells are recognized as important target cells in radiation carcinogenesis, and ionizing radiation contributes to the carcinogenic process by adding a few mutations [10]. Since the accident at the Fukushima Daiichi Nuclear Power Plant, the carcinogenic effects of low-dose and low-dose-rate radiation have become a major public concern in Japan. Although evaluation of the biological effects of low dose and/or low dose-rate ionizing radiation has important implications in radiation protection, the magnitude of cancer risk associated with low dose and/or low dose-rate radiation exposure is still unclear [11]. One of the difficulties in estimating the influence of a very low dose is that its impact on cancer risk is, if any, very low. To discuss a low influence of mutagens, the baseline of cancer risk must be considered. As mentioned above, the influence of replication errors on cancer risk may be the baseline. Considering the relationship between replication error and cancer risk using a simple model and comparing it with actual data will help identify the baseline. Examining how mutagens affect this baseline will thus help us evaluate it, even if the effect of a mutagen is very small.
Here, we constructed a simple mathematical model to assess the influence of cell division and mutagens on cancer risk. The model assumed cell division phases of growth and stability. In the growth phase, stem cells increase exponentially until the number of cells reaches the capacity of the stem-cell pool. The stable phase starts after that. Stem cells divide only when the cells are decreased in the stable phase. In these processes, mutations can arise due to replication errors and mutagens. Cancer risk was assumed to depend on mutations. Next, we examined whether age-dependent cancer risk can be explained only by replication errors through a comparison of the model with cancer registry data. Finally, the influences of mutagens are discussed qualitatively by varying the parameters, and lung cancer data were reanalyzed.
Methods
Overview of the mathematical model
Cells generally do not proliferate indefinitely. Some studies assume that stem cells frequently divide until the tissue is fully developed, and the frequency of cell division decreases after development [5, 12]. We assumed that tissues exhibit growth and stable phases. During the growth phase, stem cells divide at a constant rate until the total cell number is the complete size of the stem-cell pool. Then, the growth phase ends, and the stable phase starts. In the stable phase, stem cells do not divide as long as the number of cells can be maintained. Stem cells can be removed from the cell pool as a result of cell death and/or differentiation; however, we did not distinguish this here. When cells are removed, the remaining cells undergo division with a constant growth rate until the total cell number reaches the complete size of the cell pool. We assumed that mutations are classified into errors and lesions. Errors correspond to DNA replication errors, which can arise during stem-cell division. We assumed errors to occur according to a Poisson distribution with parameter λ1 when a stem cell undergoes division. By contrast, lesions arise independently from cell division, thus reflecting effects of mutagens. We assumed that the number of lesions per cell per time depends on a Poisson distribution with parameter λ2. When λ3 is independent from time, there is a chronic influence of mutagens from the beginning. When discussing the temporary influence of mutagens, λ2 should be the function of time. In this paper, we assumed that λ2 is constant, and we examined the accumulation of errors and lesions. A schematic diagram is shown in Fig 1.
Fig 1. Schematic diagram of the model.
This process is referred to as growth phase, followed by a stable phase. In the stable phase, cells can divide only when their number decreases, such as when they are eliminated. Errors can occur during division, and lesions occur with a certain probability at any time, regardless of division.
Simulation using the Gillespie algorithm
As the number of stem cells is not constant, we used the Gillespie algorithm [13–15] to simulate changes in the numbers of mutations over time (S1 File). Let n{i,j} be the number of stem cells whose errors and lesions are i and j, respectively. We express the condition of the stem-cell pool as a set of n{i,j} as follows: , where I and J are maximum numbers of errors and lesions, respectively. Although these limitations are necessary for the simulation, we can ignore their influence by setting sufficiently large I and J. The condition of the tissue varies with stochastic events such as stem-cell division, formation of lesions, and cell removal. The chance of each event is proportional to its normalized form by the sum of the rates of all possible events [15]. First, we show the rate of each event in the growth phase. Stem cells divide at a rate r, and cell removal does not occur during the growth phase. As the maximum number of errors is I, the rate at which a stem cell carries i errors and j lesions and divides with k errors is
(1) |
(2) |
X→Y represents the rate of the changing the condition from X to Y. Cells can also suffer lesions without cell division. The rate at which a cell having i errors and j lesions and suffers l lesions is written as follows:
(3) |
(4) |
Then, the sum of the rates of all possible events in the growth phase, Γg, is
(5) |
Denoting the probability that condition X changes to Y as Pr[X→Y], we can write the probability of each event as follows:
(6) |
(7) |
(8) |
(9) |
Let NL be the capacity of the stem-cell pool. In the stable phase, cells do not divide when there are NL stem cells, and cells may be removed. We denote a{i,j} as the rate of cell division of a stem cell, which carries i errors and j lesions in the stable phase. As the division rate is 0 when there are NL cells,
(10) |
where N is the total cell number, which is defined as
(11) |
When the total cell number is smaller than NL, the stem cells divide at a rate of . Similar to the growth phase, the rate at which a stem cell, which carries i errors and j lesions, divides with k errors is
(12) |
(13) |
The rate of cell removal is
(14) |
where b{i,j} is the removal rate of a stem cell carrying i errors and j lesions.
As the mutations caused by lesions is the same as that during the growth phase the sum of the rates of all possible events in the stable phase, Γs, is
(15) |
Then, the probability of each event in the stable phase can be denoted as follows:
(16) |
(17) |
(18) |
(19) |
(20) |
Although a{i,j} and b{i,j} can depend on the number of mutations if mutations affect the character of cells, we assumed that those parameters are independent of the number of mutations, a = a{i,j}, b = b{i,j}, in the following analysis, for simplicity.
Multi-stage carcinogenesis model
Based on the multistage carcinogenesis theory, we assumed that cancer arises through successive mutations. We did not distinguish between mutations arising from errors or lesions and assumed that mutations of g cancer driver genes cause carcinogenesis. Strictly, accumulation of mutations should be considered for each stem cell, however, we considered the average number of mutations per cell, for simplicity. The total numbers of errors and lesions at time t in the tissue were calculated, respectively, as follows:
(21) |
(22) |
where the number of stem cells that have i errors and j lesions at time t is denoted as n{i,j}(t). The average number of mutations in a stem cell is
(23) |
where N(t) is the total number of stem cells at time t. The probability of having g or more mutations of cancer driver genes equals one minus the probability of the number of mutations smaller than g. Let p be the probability that one cancer driver gene mutation occurs per mutation. By using the cumulative distribution function of the binomial distribution, when there are m(t) mutations, the cumulative cancer risk is
(24) |
The second term is the probability that the number of cancer driver gene mutations is smaller than g. If m(t) is sufficiently high and p is sufficiently low, binomial distribution can be approximated using the Poisson distribution. Then, G can be described as follows:
(25) |
In the following, we use Eq (25) for simplifying the calculation.
Results
Simulation and approximation of the accumulation of errors and lesions
First, the accumulation of errors and lesions was simulated using the Gillespie algorithm. An example of the simulation is shown in Fig 2. The number of stem cells increases exponentially in the growth phase until it reaches NL (Fig 2A). When the number of stem cells reaches NL, the growth phase ends, and the stable phase starts. When stem cells are removed, cell division resumes (Fig 2A). The time series of the number of errors and lesions are shown in Fig 2B. Then, we approximate the time course of the accumulation of errors and lesions to discuss cancer risk.
Fig 2. Results of a simulation by the Gillespie algorithm.
(a) Time course of the number of cells. The label n{i,j} is the number of cells carrying i errors and j lesions. Total number of cells are denoted as N(t). (b) Time course of the total number of errors E(t) and lesions W(t). Parameter values are N0 = 10, NL = 100, r = 0.15, ad = 0.16, b = 0.01, λ1 = 0.01, and λ2 = 0.001. The results were output when reaching a certain number of divisions and were combined.
We consider the average accumulated mutations in the tissue. In the growth phase, stem cells increase with the growth rate r. Setting the initial stem-cell number to N0, the number of cell divisions at time t is d, and the following equation holds:
(26) |
Rearranging this equation, we can express d as the function of time as follows:
(27) |
As the average number of errors per cell division is λ1, the number of errors, E, in the growth phase is
(28) |
By contrast, the average number of lesions per stem cell per time is λ2. The number of stem cells at time t is N0 exp[rt] in the growth phase. These mean that the increase rate of W is λ2N0 exp[rt]. Considering there is no lesion in the initial state,
(29) |
From Eq (27), t can be written as the function of d as follows:
(30) |
Substituting Eq (30) into Eq (29) the number of lesions can be written as the function of d as follows:
(31) |
In the stable phase, the total number of cell divisions, d, is the summation of the total number of cell divisions in the growth and stable phases. The former is trivially calculated as NL−N0. To estimate the number of cell divisions in the stable phase, the waiting time until cell division needs to be taken into account. In the stable phase, stem cells do not divide when the number of stem cells is NL. The waiting time until cell division is the summation of waiting time until cell removal and cell division. As the rate of cell removal is b, the waiting time until cell removal is 1/bNL. In this analysis, we assume the blank of a stem cell in the stem-cell pool is filled before another cell is removed. Then, the waiting time until cell division is approximated as 1/a(NL−1). Therefore, the waiting time until cell division in the stable phase is calculated as follows:
(32) |
Letting dT be the number of cell divisions during T in the stable phase, the following equation holds:
(33) |
When growth phase ends at t′, the total number of cell divisions at t>t′ is
(34) |
As the growth phase ends when the total stem-cell number reaches NL, t′ can be calculated as follows:
(35) |
Substituting this equation into Eq (34), the total number of cell divisions d at t>t′ is
(36) |
Then, the number of errors, E, in the stable phase is
(37) |
Considering the number of lesions in the growth phase, it is convenient to express t as a function of d. From Eq (36), the relationship between d and t in the stable stage is
(38) |
Let W′ be the number of lesions at the end of the growth phase i.e. W at t = t′. From Eqs (31) and (35), W′ can be calculated as
(39) |
The number of lesions emerged in the stable phase is expressed as the product of NL, λ2, and the time elapsed since entering the stable phase. The number of lesions in the stable phase is
(40) |
Fig 3 shows a comparison between the results of simulations and approximations. The approximations explain the simulation well. Next, we discuss cancer risk using these approximations.
Fig 3. Comparison of simulation and analytical approximations.
The horizontal line indicates the number of cell divisions, and the vertical line indicates the number of errors and lesions. Dots are the average values of 50-fold simulations. Lines are the approximations in Eqs 28, 29, 37, and 40. Blue and orange lines show errors and lesions, respectively. Parameter values are N0 = 10, NL = 100, r = 0.15, ad = 0.16, b = 0.01, λ1 = 0.01, and λ2 = 0.001.
Fitting to cancer registry data
We consider the cumulative risk of cancer as mutations accumulate during the lifetime. If cancer risk is explained by stem-cell divisions, the risk should be explained only by replication errors. In other words, the presented model can fit cancer registry data of cancer risk even if λ2 = 0. We applied our model to cancer registry data of cumulative cancer incidence risk in 2015 obtained from the Cancer Information Service, National Cancer Center, Japan (Ministry of Health, Labour and Welfare, National Cancer Registry), accessible under https://ganjoho.jp/reg_stat/index.html. We assumed that a tissue originates from only a single stem cell and that the number of stem cells reaches NL at the age of 18, i.e., the growth phase continues until the age of 18. Therefore, the rate of cell division in the growth phase r is
(41) |
The dimension of r is cell divisions per year in this formula. When we assume that the cell division occurs immediately after a stem cell is removed, a≫b, the waiting time of cell division in the stable phase, as in Eq (32), is 1/bNL. This means that the proliferation rate in the stable phase equals the cell-removal rate b. The number of stem cells in the tissue and that caused by the divisions of each stem cell per year are estimated in previous studies [5, 7] (Table 1). As we assume λ2 = 0, we should estimate λ1, p, and g. Explicitly denoting d as a function of t as d(t), from Eqs (28) and (37), pm(t) = pλ1d(t) if λ2 = 0. Therefore, λ1p is regarded as a single parameter in Eq (25) when λ2 = 0. This parameter can be understood as the average number of cancer driver genes mutations per cell division. Therefore, we have to estimate the values of λ1p and g. Although λ1p is a real number, g is an integer. We varied the value of g and calculated the least-square value with λ1p. Then, we chose a parameter set of λ1p and g that made the least-square value minimum (S1 Fig). The curves of G(t) were fitted to cancer registry data of esophageal cancer (Fig 4A), leukemia (Fig 4B), liver cancer (Fig 4C), lung cancer (Fig 4D), thyroid cancer (Fig 4E), pancreatic cancer (Fig 4F), colon cancer (Fig 4G), breast cancer (Fig 4H), and prostate cancer (Fig 4I). The curve was not fitted to the risk of leukemia because the risk increases from a young age (Fig 4B). Except for leukemia, the curve was fitted to the data. This implies that age-dependent cancer risk can be explained well by replication errors. However, it should be noted that the estimated parameter values do not always agree with the values expected from the previously reported values, as elaborated in the Discussion. For example, g and λ1p of male lung cancer are very large. It is well known that smoking increases lung cancer risk. Therefore, it may be possible to reduce this discrepancy by considering the influence of mutagens.
Table 1. List of parameters for the fitting to cancer registry data shown in Fig 4.
Number of divisions of each stem cell per year (b) | Number of normal stem cells in tissue origin (NL) | Cell division rate in growth phase () | |
---|---|---|---|
Esophageal cancer | 33.2* | 6.6528 × 106* | 0.87 |
Leukemia | 12* | 1.35 × 108* | 1.04 |
Liver cancer | 0.91* | 3.01 × 109* | 1.21 |
Lung cancer | 0.07* | 1.22 × 109* | 1.16 |
Thyroid cancer | 0.087* | 6.5 × 107* | 1.00 |
Pancreatic cancer | 1* | 4.18 × 109* | 1.23 |
Colon cancer | 73* | 2.00 × 108* | 1.06 |
Breast cancer | 4.32** | 8.7 × 109** | 1.27 |
Prostate cancer | 2.99** | 2.1 × 108** | 1.06 |
Fig 4. Comparison of model and cancer registry data of age-dependent cancer risk.
Cancer risk G(t) was compared with the cumulative risk of (a) esophageal cancer, (b) leukemia, (c) liver cancer, (d) lung cancer, (e) thyroid cancer, (f) pancreatic cancer, (g) colon cancer, (h) breast cancer, and (i) prostate cancer. Blue and red lines are the cases of males and females, respectively. Dots indicate cancer registry data of cumulative risk of cancer for the age range of 5–85. Curves are the fitting of G(t) to data when λ2 = 0. Fitted parameters are shown in each figure. The other parameters values are summarized in Table 1.
Influence of lesions on carcinogenesis
Finally, we provide a qualitative hypothesis for the influence of lesions on cancer risk by analyzing the model using different parameters. We used risk difference (RD) as a quantitative measure for the influence of lesions [16]. Let G1(t) and G0(t) be the cancer risks at time t in the case of λ2 ≠ 0 and λ2 = 0, respectively. Then, RD at time t is defined as
(42) |
The other measure called, risk ratio (RR), which is defined as RR = G1/G0, is widely used in the field of public health [16]. We, however, consider only RD because G0 can be 0 in this theoretical analysis. Since the absolute value of RD depends on unknown biological parameters, such as sensitivity to mutagens, we assumed some parameter sets and obtained qualitative results. The typical shape of RD is shown in Fig 5. RD is very low at the beginning and after a sufficiently long time. Even if there are mutagens, the number of cancer driver gene mutations is not sufficient for carcinogenesis in the beginning. As a result, RD is low. By contrast, after a sufficiently long time, most cancers are caused by replication errors, and hence, RD is low. The largest values of RD are observed when t is moderate. When λ2 is large, the start of the increase is early, and the peak of RD is large (Fig 5A). This means that the greater the influence of mutagens, the earlier and greater the influence on RD. This result is intuitive. The biological features of the tissues might affect the influence of the mutagens. When the turnover of the stem cells is less frequent in the stable phase, i.e., b is low, the increase of RD is late, and the maximum value is large (Fig 5B). In this case, the accumulation of errors is slow as the cell division is less frequent. Then, cancer driver gene mutations are relatively unlikely to occur even if there are mutagens. However, as there are few cancers caused by errors, the maximum value of RD becomes large. The influence of the number of cancer genes is contrary to the turnover rate. If carcinogenesis needs fewer mutations, the increase of RD is faster, and the maximum value is smaller (Fig 5C). In this case, a relatively low number of mutations are sufficient for the complete mutation of all cancer driver genes; therefore, carcinogenesis occurs early because of additional mutations caused by mutagens. By contrast, the maximum value of RD is low because cancer arises relatively frequently without mutagens. These results suggest that if the turnover rate is low and/or the number of cancer driver gene mutations is large, the influence of mutagens emerges slower and larger. We re-analyzed the data on lung cancer to consider the influence of lesions (Fig 6). In this analysis, we assumed that lesions would accumulate after an age of 20 years. That is, λ2 = 0 before age 20, and λ2 has a constant value after that. This is because the smoking age in Japan is 20. As the fitting was already good even when considering replication errors only, it did not improve markedly. However, the estimated values were smaller than when considering replication errors only.
Fig 5. Time course of risk difference with various parameters.
Parameter values are N0 = 1, NL = 109, r = log[NL]/18, λ1 = 0.5, p = 0.2, (a) b = 0.1, g = 6, (b) λ2 = 0.01, g = 6, and (c) b = 1, λ2 = 0.1. The remaining parameters are shown in each figure.
Fig 6. Comparison of models with lesion and lung cancer data of age-dependent cancer risk.
Dots indicate cancer registry data of cumulative risk of cancer for the age range of 5–85. Curves are the fitting of G(t) when λ2 = 0 if the age is < 20, λ2 is a constant value if the age is > 20. The other parameters values are summarized in Table 1.
Discussion
We constructed a mathematical model to discuss the risk of cancer caused by the accumulation of mutations. In the model, a single stem cell divides up to a certain number, NL, and stops dividing as long as the total cell number can remain at NL. Cell division resumes when cells are removed. Mutations are caused by errors and lesions. Errors arise stochastically with cell division, and lesions arise without cell division. First, we simulated the accumulation of errors and lesions using the Gillespie algorithm (Fig 2). The accumulation of errors and lesions were approximated and compared with the simulation (Fig 3). As so many cell divisions occur in real tissue, calculation with the Gillespie algorithm takes a long time. Using these approximations, it is easier to compare with cancer registry data than using a simulation.
Next, we considered the probability of carcinogenesis based on the multistage model. Cancer risk is correlated to the number of stem-cell divisions [5]. This suggests that cancer risk might be explained by the accumulation of replication errors. In our model, this means that cancer risk is explained without lesions. We fitted our model to cancer registry data from the Cancer Information Service, National Cancer Center, Japan (Fig 4). Except for leukemia, our model fitted to the risks of cancers. This supports the hypothesis that the risks of some cancers are explained by replication errors. Leukemia was not explained by our model as the risk occurs at a very young age. The model assumes that cells divide vigorously at a younger age. If we try to explain the early risk of leukemia through replication errors, the risk in the stable phase becomes too large because hematopoietic stem cells are considered to divide even in adults. It is said that chronic myeloid leukemia can be explained by a single mutation when the mutated cells acquire reproductive advantages [4]. We assumed that mutations do not affect the biological characters of the cells in this study. Therefore, the influence of mutations on cells should be considered in future studies. We estimated 3–10 cancer driver gene mutations. Some studies assessed the numbers of cancer-related mutations (Table 2).
Table 2. Estimated parameters from fitting and related parameters from previous studies.
Estimated Number of cancer driver genes (g) | Average number of cancer driver genes mutations per cell division. (λ1p) | Number of driver mutations from previous studies | Number of cancer driver genes mutations per cell division estimated from [18–20] | |||
---|---|---|---|---|---|---|
Male | Female | Male | Female | |||
Leukemia | 2 | 2 | 0.0016 | 0.00014 | 1.94** | 0.000228, 0.000274 |
Lung cancer | 10 | 7 | 1.045 | 0.54 | 5.33**, 5.16** | 0.000228, 0.000274 |
Pancreatic cancer | 5 | 6 | 0.0226 | 0.0290 | 4.25* | 0.000228, 0.000274 |
Colon cancer | 5 | 5 | 0.00041 | 0.00037 | 3.59*, 3.74** | 0.000228, 0.000274 |
Breast cancer | - | 2 | - | 0.0018 | 3.32*, 1.76** | 0.000228, 0.000274 |
Vogelstein et al. [3] reviewed the distribution of the numbers of driver mutations in tumors. Pancreatic cancer, colorectal cancer, and breast cancer were associated with, on average, 4.25, 3.59, and 3.32 mutations, respectively. A different study analyzed 3,281 tumors from 12 cancer types and identified 127 genes that showed significant frequencies of mutations [17]. The average numbers of mutations in colon and rectal carcinoma, acute myeloid leukemia, lung adenocarcinoma, lung squamous cell carcinoma, and breast adenocarcinoma were 3.74, 1.94, 5.33, 5.16, and 1.76, respectively. The estimation of the present study showed that leukemia required 2 mutations, pancreatic cancer requires 5 or 6 mutations, colon cancer requires 5 mutations, and breast cancer requires 2 mutations. These estimations therefore deviate only slightly from the previously reported values. However, the estimated value of g = 10 or 7 for lung cancer is larger than the values. In addition, the estimated mutation rate λ1p of lung cancer is very large. The mutation rate is estimated at approximately 1.14 mutations per genome per division in healthy haematopoiesis and 1.37 per genome per division in brain development [18], which corresponds to λ1. Then, p can be interpreted as the proportion of cancer genes in the genome. Combined with the estimation that more than 1% of the genes contribute to human cancer [19] and the estimation that the protein-coding gene is 2% [20], p is considered to be approximately 0.01×0.02 = 0.0002. From these estimations, λ1p will be approximately 1.14×0.0002 = 0.000228 or 1.37×0.0002 = 0.000274. The estimated values of λ1p are too large in liver, lung, thyroid, pancreatic, breast, and colon cancer. This may suggest that biological processes other than replication errors affect the cancer risk. We discuss the biological processes that can affect the results in the last section.
The change in cancer risk caused by mutagens is an important problem in the field of oncology. For example, high dose and dose-rate radiation increase cancer risk [9]. However, the influence of very low dose and dose-rate radiation, which is important for radiation protection, is unclear [11]. We discussed the influence of lesions caused by mutagens on cancer risk using risk difference (RD) as an indicator (Fig 5). RD increased initially and then began to decrease. The decrease in the latter half is due to the assumption that carcinogenesis occurs even in the absence of mutagens because of the accumulation of replication errors. In reality, the decrease will be difficult to observe because life is limited. The timing of the start of RD increases depending on biological parameters. The increase is delayed when the cell-removal rate b is low (Fig 5B) and/or the number of cancer driver genes g is large (Fig 5C). The removal rate b can be regarded as the division rate if the removal of a stem cell is immediately compensated by the cell division of the remnant stem cells. For example, astrocytes and neural stem cells are the origins of glioblastoma and medulloblastoma, respectively. The number of divisions of those stem cell per year is estimated at 0 [5]. The influence of mutagens on these cancer risks might appear slowly. However, it is estimated that colonic stem cells divide 75-times per year [5]. The influence of mutagens might be observed relatively early in the colon. Recently, a hypothesis that radiation, especially high dose-rate radiation, might lead to an earlier onset of cancer has been advocated [21]. The time course of RD in this study focuses on cancer risk change with time. When risk G is fixed and solved with respect to t, the difference might be an indicator of the early onset of cancer.
In order to integrate the effects of mutagens, we focused on lung cancer, which showed a large difference between estimated parameters when only replication errors and previously reported values. In addition, the estimated the number of cancer driver genes in lung cancer also largely differed between sexes. This discrepancy will be partly due to incompleteness of the estimation, as discussed below, but can be partially resolved by assuming mutagen. It is known that smoking increases lung cancer risk (e.g., [22]). The data included smokers and non-smokers, with more male than female smokers. This may lead to a larger estimated value for the number of cancer driver genes in male data than in female data and the large difference between sexes. When we integrated the influence of mutagen, the fitting did not differ as much. However, the estimated value was smaller and closer to the previously reported values than in the case of replication error alone. The difference between sexes became slightly smaller. By considering the effects of mutagens in more detail, the effects of replication errors might be estimated more accurately.
The present analysis was very simple, and there are limitations that must be solved in the future to derive more accurate results. We did not consider population fluctuation when fitting the cancer registry data. In addition, fitting by the least-squares method was performed, but this assumes an age-independent normal distribution. These may have caused the differences between sexes in the parameter estimates. For more accurate estimates, it will be necessary to take into account demographics and different health conditions among generations. Moreover, this study ignored many biological processes. The end of growth phase was assumed at age 18, but the actual age may vary depending on the organ or tissue. Assuming a longer growth phase, estimated g and λ1p would be smaller. We assumed that properties of the cells were independent of errors and lesions. However, the acquisition of driver mutations may increase the growth rate, and mutations in DNA damage repair genes may increase the mutation rate. In addition, as the accumulation of mutations was treated as the average of the population in the multi-stage carcinogenesis model, polyclonality of cancer could not be expressed. Many cancers are known to be polyclonal [23]. It may be possible to incorporate changes in cell properties and cancer polyclonality by focusing on each cell, rather than on the population average, in the process of carcinogenesis. For example, if the properties of each cell are changed by mutations, a{i,j}, b{i,j}, and λ1 should depend on the number of errors i and that of lesions j. In the stable phase, the division rate was constant; however, it is known to decreases with age [7]. It is also known that the effect of radiation is more pronounced in younger people than in adults [9]. The relationship between division rates and mutagens will also be an important issue when considering radiation protection. For example, assuming that a{i,j} and/or λ2 are time-dependent, it may be possible to incorporate those phenomena. Moreover, we assumed that the interaction of stem cells does not affect the proliferation and removal of cells. However, cells exposed to mutagens such as ionizing radiations might be removed through stem-cell competition [24, 25]. If so, the removal rate of damaged cells may be larger than that of intact cells. These phenomena should be considered in the future to examine cancer risk in more detail.
Supporting information
The sum of the squares of the residuals for each g was minimized by λ1p. Then, a combination of g and λ1p showing the minimum of the sum of squares of the residuals was established. The order of the figures corresponds to that in Fig 5: (a) esophageal cancer, (b) leukemia, (c) liver cancer, (d) lung cancer, (e) thyroid cancer, (f) pancreatic cancer, (g) colon cancer, (h) breast cancer, and (i) prostate cancer. The upper and lower panels show the results of males and females, respectively.
(TIF)
This is a C language code for Gillespie algorithm. The default parameters correspond to Fig 2.
(C)
Acknowledgments
We thank Dr. Hiroshi Haeno for technical discussions on the mathematical model.
Data Availability
Statistical data used in the manuscript can be accessed from the "graph database" in "latest cancer statistics" of Cancer Information Service, National Cancer Center, Japan (National Cancer Registry, Ministry of Health, Labour and Welfare (https://ganjoho.jp/reg_stat/index.html). The direct URL to the "graph database" for each cancer and the values used there are shown below. “Graph database” of esophageal cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=4&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of esophageal cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=4&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of leukemia: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=27&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of leukemia: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=27&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of liver cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=8&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of liver cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=8&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of lung cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=12&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of lung cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=12&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of thyroid cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=24&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of thyroid cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=24&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of pancreatic cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=10&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of pancreatic cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=10&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of colon cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=6&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of colon cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=6&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of breast cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=14&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of breast cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=14&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 “Graph database” of prostate cancer: https://gdb.ganjoho.jp/graph_db/gdb1?dataType=30&graphId=106&totalTarget=40&_useLog=on&_stackedRaito=on&_showErrorBar=on&_useUnknownStage=on&smTypes=1&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&_smTypes=on&smType=20&year=2015&years=2015&_years=1&avgStep=&survivalAgeKbn=AAA&sexType=0&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&marumeAgeKbn=A85&stage=0&elapsedYears=5&_showBreastOnlyFemale=on&_showOnlyPrefectures=on&showGraph=Submit The values of the data of prostate cancer: https://gdb.ganjoho.jp/graph_db/gdb1?showData=&dataType=30&graphId=106&totalTarget=40&year=2015&years=2015&avgStep=&survivalAgeKbn=AAA&ageSybt=0&ageSt=009&ageEd=A85¤tAge=0&smTypes=1&smType=20&sexType=0&marumeAgeKbn=A85&stage=0&elapsedYears=5 The authors confirm that others would be able to access or request these data in the same manner as the authors. The authors did not have any special access or request privileges that others would not have.
Funding Statement
KU was supported by JSPS KAKENHI Grant Number JP 20K19972. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Nordling CO. A new theory on the cancer-inducing mechanism. British journal of cancer. 1953; 7(1): 68–72. doi: 10.1038/bjc.1953.8 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Armitage P, Doll R. The age distribution of cancer and a multi-stage theory of carcinogenesis. British journal of cancer. 1954; 8(1), 1–12. doi: 10.1038/bjc.1954.1 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Vogelstein B, Papadopoulos N, Velculescu VE, Zhou S, Diaz LA Jr, Kinzler KW. Cancer genome landscapes. Science. 2013; 339(6127); 1546–1558. doi: 10.1126/science.1235122 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Michor F, Iwasa Y, Nowak MA. The age incidence of chronic myeloid leukemia can be explained by a one-mutation model. Proc Natl Acad Sci U S A. 2006; 103(40), 14931–14934. doi: 10.1073/pnas.0607006103 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Tomasetti C, Vogelstein B. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science. 2015; 347(6217), 78–81. doi: 10.1126/science.1260825 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Calabrese P, Shibata D. A simple algebraic cancer equation: calculating how cancers may arise with normal mutation rates. BMC Cancer. 2010; 10(1), 1–12. doi: 10.1186/1471-2407-10-3 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tomasetti C, Poling J, Roberts NJ, London NR Jr, Pittman ME, Haffner MC, et al. Cell division rates decrease with age, providing a potential explanation for the age-dependent deceleration in cancer incidence. Proc Natl Acad Sci U S A. 2019; 116(41), 20482–20488. doi: 10.1073/pnas.1905722116 , PMCID: PMC6789572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Little MP, Hendry JH, Puskin JS. Lack of Correlation between Stem-Cell Proliferation and Radiation- or Smoking-Associated Cancer Risk. PLoS One. 2016; 11(3), e0150335. doi: 10.1371/journal.pone.0150335 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ozasa K, Shimizu Y, Suyama A, Kasagi F, Soda M, Grant EJ, et al. Studies of the mortality of atomic bomb survivors, Report 14, 1950–2003: an overview of cancer and noncancer diseases. Radiat Res. 2012; 177(3), 229–243. doi: 10.1667/rr2629.1 . [DOI] [PubMed] [Google Scholar]
- 10.Hendry JH, Niwa O, Barcellos-Hoff MH, Globus RK, Harrison JD, Martin MT, et al. ICRP Publication 131: Stem cell biology with respect to carcinogenesis aspects of radiological protection. Ann ICRP. 2016;45(1 Suppl):239–252. doi: 10.1177/0146645315621849 . [DOI] [PubMed] [Google Scholar]
- 11.United Nations Scientific Committee on the Effects of Ionizing Radiation, ANNEX B Epidemiological studies of cancer risk due to low-dose-rate radiation from environmental sources. In: UNSCEAR 2017 Report: Sources, effects and risks of ionizing radiation, with. New York: United Nations; 2018.
- 12.Wu S, Powers S, Zhu W, Hannun YA. Substantial contribution of extrinsic risk factors to cancer development. Nature. 2016; Nature, 529(7584), 43–47. doi: 10.1038/nature16166 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gillespie DT. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. Journal of computational physics. 1976; 22(4), 403–434. 10.1016/0021-9991(76)90041-3 [DOI] [Google Scholar]
- 14.Gillespie DT. Exact stochastic simulation of coupled chemical reactions. The journal of physical chemistry. 1977; 81(25), 2340–2361. 10.1021/j100540a008 [DOI] [Google Scholar]
- 15.Haeno H, Iwasa Y, Michor F. The evolution of two mutations during clonal expansion. Genetics. 2007; 177(4): 2209–2221. doi: 10.1534/genetics.107.078915 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Krickeberg K, Pham VT, Pham TMH. Epidemiology: Key to Prevention. 2012. New York, N.Y. Springer. [Google Scholar]
- 17.Kandoth C, McLellan MD, Vandin F, Ye K, Niu B, Lu C, et al. Mutational landscape and significance across 12 major cancer types. Nature. 2013; 17;502(7471):333–339. doi: 10.1038/nature12634 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Werner B, Case J, Williams MJ, Chkhaidze K, Temko D, Fernández-Mateos J, et al. Measuring single cell divisions in human tissues from multi-region sequencing data. Nat Commun. 2020; 25;11(1):1035. doi: 10.1038/s41467-020-14844-6 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Futreal PA, Coin L, Marshall M, Down T, Hubbard T, Wooster R, et al. A census of human cancer genes. Nat Rev Cancer. 2004; 4(3):177–83. doi: 10.1038/nrc1299 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature. 2004. Oct 21;431(7011):931–45. doi: 10.1038/nature03001 . [DOI] [PubMed] [Google Scholar]
- 21.Nakamura N. A hypothesis: radiation carcinogenesis may result from tissue injuries and subsequent recovery processes which can act as tumor promoters and lead to an earlier onset of cancer. Br J Radiol. 2020; 93(1115):20190843 doi: 10.1259/bjr.20190843 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Schwartz AG, Cote ML. Epidemiology of Lung Cancer. Adv Exp Med Biol. 2016; 893:21–41. doi: 10.1007/978-3-319-24223-1_2 . [DOI] [PubMed] [Google Scholar]
- 23.Parsons BL. Many different tumor types have polyclonal tumor origin: evidence and implications. Mutat Res. 2008; 659(3):232–47. doi: 10.1016/j.mrrev.2008.05.004 . [DOI] [PubMed] [Google Scholar]
- 24.Bondar T, Medzhitov R. p53-mediated hematopoietic stem and progenitor cell competition. Cell Stem Cell. 2010; 6(4):309–22. doi: 10.1016/j.stem.2010.03.002 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fujimichi Y, Otsuka K, Tomita M, Iwasaki T. An Efficient Intestinal Organoid System of Direct Sorting to Evaluate Stem Cell Competition in Vitro. Sci Rep. 2019;9(1):20297. doi: 10.1038/s41598-019-55824-1 . [DOI] [PMC free article] [PubMed] [Google Scholar]