Skip to main content
International Journal of Epidemiology logoLink to International Journal of Epidemiology
. 2014 Dec 12;44(4):1408–1421. doi: 10.1093/ije/dyu192

Understanding variation in disease risk: the elusive concept of frailty

Odd O Aalen 1,2,*, Morten Valberg 1, Tom Grotmol 2, Steinar Tretli 2
PMCID: PMC4588855  PMID: 25501685

Abstract

The concept of frailty plays a major role in the statistical field of survival analysis. Frailty variation refers to differences in risk between individuals which go beyond known or measured risk factors. In other words, frailty variation is unobserved heterogeneity. Although understanding frailty is of interest in its own right, the literature on survival analysis has demonstrated that existence of frailty variation can lead to surprising artefacts in statistical estimation that are important to examine. We present literature that demonstrates the presence and significance of frailty variation between individuals. We discuss the practical content of frailty variation, and show the link between frailty and biological concepts like (epi)genetics and heterogeneity in disease risk. There are numerous suggestions in the literature that a good deal of this variation may be due to randomness, in addition to genetic and/or environmental factors. Heterogeneity often manifests itself as clustering of cases in families more than would be expected by chance. We emphasize that apparently moderate familial relative risks can only be explained by strong underlying variation in disease risk between families and individuals. Finally, we highlight the potential impact of frailty variation in the interpretation of standard epidemiological measures such as hazard and incidence rates.

Keywords: Frailty, heterogeneity, random variation, epigenetics


Key Messages.

  • Variation in risk of disease often goes far beyond what is captured by measured risk factors.

  • Heterogeneity in risk may be established early in life, and stochastic variation in these processes could be a major contributor in this regard.

  • Even a moderate familial relative risk points to the existence of large variations in disease risk between families, and Individuals, across the population.

  • Failing to take into account the unobserved heterogeneity between individuals may lead to erroneous interpretations of standard epidemiological measures such as age-incidence curves and hazard ratios.

Introduction

In epidemiology and clinical science, it is often tacitly assumed that the risk of a certain disease is similar among individuals across the population at equal levels of known risk factors. It is often presumed, for instance, that all individuals are vulnerable to the same risk factors, and to the same degree. Differences between individuals tend to be ignored unless they can be expressed in terms of known risk factors such as known genetic properties. However, the fact of the matter is that individuals are generally highly dissimilar also for many unknown, or just partly known, reasons. Indeed, the extent of this heterogeneity is probably not generally appreciated.

Heterogeneity which is unknown, or not represented in available data, is often referred to as frailty, a quantity that varies between individuals. The term frailty comes from the statistical field of survival analysis, where there is a strong interest in this type of heterogeneity. Frailty is usually modelled by assuming that the hazard rate (baseline hazard) for an average individual, α(t), is multiplied by a frailty factor Z that renders the level for a specific individual, i.e. the individual hazard rate is;

λ(t)=Zα(t).

When we integrate out the variation in Z to get the (observable) population hazard rate, the resultant function is quite different from the baseline hazard α(t). A number of various distributions exists for the frailty Z.1,2 Note that the baseline hazard may be a function of observed individual covariates, e.g. through a Cox model, and that including a frailty term may improve the fit of such a model.3

A number of diseases exhibit incidence rates that peak at young ages, including cancers like childhood leukaemia and Hodgkin lymphoma, but also schizophrenia, which has recently been analysed from a frailty point of view.4 In several cases frailty variation is a reasonable explanation for an early peak in incidence, especially when the disease has a strong heritability, which is the case for diseases like schizophrenia and testicular cancer. The frailty approach has yielded particularly fruitful insights for testicular cancer.5–8 Furthermore, the so-called frailty models form a basis for the analysis of familial association in cancer incidence.7–9

One goal of the present paper is to point out the ubiquity of heterogeneity, or varying frailty. We shall also emphasize the role of stochasticity. Furthermore, we will discuss how important indications of frailty variation may be deduced from data on familial disease association. Indeed, moderate familial association implies surprisingly strong variation in risk between individual families. Although understanding frailty variation is important in its own right, it may also be essential in order to correctly interpret statistical analyses in epidemiological studies. Finally, we will cover an issue that has been pointed out in the statistical literature: that not taking into account unobserved frailty variation in statistical analyses may lead to misleading comparisons of hazard rates and incidence rates resulting in, among other things, an artificial cross-over effect.

Heterogeneity between individuals may be high

Individual variation in susceptibility

It is often obvious that disease risk is a fluid phenomenon, dependent on environmental and lifestyle risk factors, genes, age and the country of residence, among other things. For example, the risk of being diagnosed with colorectal cancer (CRC) varies widely across different countries; it has increased sharply (in fact more than tripled) in the past few decades in many industrialized countries, and it varies substantially between different countries worldwide. This means that the risk of CRC is not a given quantity, but rather something that varies widely. It would follow that the individual risk of CRC is also a fluid phenomenon; that it varies considerably between individuals even when the outer circumstances are similar. Furthermore, there are large differences in risk across regions (Figure 1).10 Therefore, it seems logical that the risk will not be homogeneous within regions (especially given the arbitrariness of many borders).11 In short, the variation in risk between regions strongly suggests a considerable variation within regions. This kind of variation has been clearly demonstrated for the USA with regard to the dependence on race of CRC incidence.12 However, it is highly likely that there are other variations based on both known and unknown risk factors. Large variations in the susceptibility to CRC have been estimated in Norway,13 and it has been estimated that only 12% of the US population is susceptible to colon cancer.14

Figure 1.

Figure 1.

Age-standardized rates (ASR) of colorectal cancer, standardized with respect to the world (W) population, in various regions for 2012. Picture constructed by Globocan10.

Some of the biomedical literature has indicated that there is a high degree of variation in individual cancer susceptibility, thereby supportingthe presence of frailty variation. An interesting paper is that of Balmain et al.15 where they indicate a strong variation in susceptibility to breast cancer. They studied a population without high-risk individuals (individuals with BRCA1 and BRCA2 mutations excluded), and still suggest a 40-fold difference in the risk of breast cancer between the top 20% and bottom 20% of the study population. Their model also suggests that more than 50% of cancers occur in the 12% of the population that is most susceptible. Peto and Mack16 reached similar conclusions by studying the relatives of breast cancer patients.They observed a very strong familial risk in monozygotic twins which could only be explained by a large individual variation in risk. They make the following statement: ‘Our most surprising conclusion is that a high proportion of all breast cancers, and perhaps the majority, arise in women at very high risk’. Also, the increased risk of a another breast malignancy after ductal carcinoma in situ of the breast, even after adjustment for type of treatment, points to a large variation between individuals in susceptibility to this disease.17

Regarding CRC, Win et al.18 suggest that ‘the risk of developing CRC varies approximately 20-fold between the people in the lowest quartile (average 1.25% lifetime risk of CRC) vs the highest quartile for familial risk profile (average 25% risk)’. Another study of colorectal cancer in DNA mismatch repair gene mutation carriers showed a U-formed distribution in risk distinguishing a high-risk group from a moderate risk-group.19

It is important to be aware that even for cancers with strong attributable risk factors, frailty remains a large component. In lung cancer, for instance, there are strong indications that some people are much more vulnerable to the damage inflicted by smoking than others, with just 10–15% of smokers developing lung cancer.20

Susceptibility may be established early in life

The notion of ‘early life programming’ has become popular. This idea was formulated in the Forsdahl-Barker hypothesis, which states that the risk of many diseases is strongly influenced by what happens very early in life, e.g. at or prior to birth. Forsdahl21 showed a strong correlation between mortality rates for arteriosclerotic heart disease in people aged 40–69 years, and infant mortality in the same birth cohorts within Norwegian counties. Earlier works include that of Ravelli et al.22who observed babies born to women who were pregnant during the Dutch famine. They found that the weight of these children later in life depended on which trimester the famine affected. Kermack et al. found an association between childhood and adult mortality for different birth cohorts.23,24 Barker et al. studied heart disease and found that areas in England that had the highest coronary heart disease mortality in the 1980s also had the highest child mortality rates 70 years earlier.25,26 Since then, many papers have been published indicating a relationship between early life conditions and disease risk later in life,27 including the recent paper by Eriksson et al.28 who assert in the title that ‘Boys live dangerously in the womb’.

Epigenetics is the study of changes in gene expression that are not caused by alterations in the nucleotide sequence of the genome, and examples of cellular mechanisms producing such changes are DNA methylation and histone modification. Epigenetic processes are involved in both mitotic and meiotic cell division, as discussed by Davey Smith.29 In the former, they ensure the transmission of cellular traits, essential for development from the pluripotent zygote to the formed organism. Although it is not clearly established what happens epigenetically during meiosis, these mechanisms clearly play a role in mediating transgenerational inheritance as addressed by Heard and Martienssen.30

Forsdahl-Barker type effects have been tied to such epigenetic alterations.31,32 Painter et al. showed that the children of those exposed to the Dutch famine in utero during World War II (WWII) were also at increased risk for ill health, which indicates that epigenetic effects in utero can even have transgenerational consequences.33 Studies from The Netherlands and Scandinavia have shown a decreased risk of testicular cancer for birth cohorts born during WWII compared to those born before and after.34,35

All these examples suggest that epigenetic alterations early in life may lead to a large degree of heterogeneity between adult individuals in a population.

Different types of variation

Figure 2 illustrates various ways in which frailty, Z, can be distributed between individuals. Panel 1 indicates a frailty that is quite similar across individuals, with some variation. Panel 2 shows a situation where most individuals have a relatively similar frailty, but there are some individuals who deviate quite a lot (the upper tail). This is expressed even more clearly in panel 3, where many individuals have a frailty close to zero whereas there are a number of individuals with high frailty. An even stronger variation is illustrated in panel 4, where most individuals have frailty close to zero but some individuals have a very high frailty. All these types of variation could actually occur. The examples given in this paper show that even the types of variation shown in panels 3 and 4 could be common. However, another issue is how frailty develops over time. One view is that there is a rather small variation in frailty at an early age, which increases over time as the result of the varying stresses of life. An alternative view argues that much of the variation in frailty between individuals is determined very early in life, maybe even prior to birth.

Figure 2.

Figure 2.

Various types of possible distributions of the frailty (unexplained risk), Z, at an early age. The panels illustrate: (1) small variation in frailty between individuals, (2) large group have moderate frailty, and a smaller group of individuals have a high frailty, (3) very skewed: many individuals have a low frailty and a small group have a high frailty, (4) most individuals have close to zero frailty and a few individuals have a high frailty.

Genetic variation and rare variants

Over past years, genome-wide association (GWA) studies have led to new discoveries about genes and pathways involved in common diseases and other complex traits.36 However, most of the associated single nucleotide polymorphisms (SNPs) have small effect sizes and the proportion of heritability is modest, which has led people to think that rare mutations might be responsible for many diseases.37 Rare gene combinations are difficult to discover in GWA studies, which may explain the apparent lack of genetic effects.38,39 Also, other authors explain how disease susceptibility may be an effect of common low-penetrance genes or rare gene combinations.40,41 On the other hand, there are examples of common SNPs explaining a large fraction of the heritability of complex traits in human populations, such as height, by considering all SNPs simultaneously.42

A simple polygenic model, where the risk (or liability) is a linear combination of a (possibly large) number of factors with no single factor dominating, will often be assumed to give a normally distributed risk like that in panel 1 of Figure 2. On the other hand, rare variants make it more likely that we will get skewed variations of the type seen in panels 3 and 4.41

Heterogeneity may be due to stochastic processes

Randomness and chaos

Heterogeneity, or varying frailty, between individuals may have a number of different explanations: environment, genetics or epigenetics; or it may be a purely stochastic phenomenon. There is a growing recognition in biology that both stochastic variation and chaotic variation are important. By stochastic variation we mean a purely random phenomenon, but it is well known that unpredictable variation may also be produced by deterministic mathematical relations if they are nonlinear; this is often termed chaos. This mathematical theory and its implication for biology and other fields are discussed in detail in a book by Strogatz.43 A characteristic of the chaos phenomenon is that small variations in starting conditions may yield very big differences in the end product; even simple nonlinear equations may have this effect. Hence, dynamic systems may develop in a very complex and unpredictable way. This is seen in many fields, like meteorology, physics and economics and is likely to play a considerable role in biology as well. The sensitive dependence on initial conditions is often popularized by the term ‘butterfly effect’, where one imagines that a butterfly flapping its wings may produce a hurricane on the other side of the world several weeks later. Although randomness and chaos are different mathematical phenomena, they are also related and a mixture of both might be present.Often they cannot be clearly distinguished. In general, random and chaotic variation is to be expected on purely mathematical principles.

Many studies point to large individual differences that do not have obvious explanations, where the above discussion could be relevant. Kirkwood and Finch44 show that even genetically identical (i.e. isogenic) worms have great variation in their lifetimes. They stress the random and unpredictable nature of cell damage that occurs with ageing. Epigenetic factors are also likely contributors to these time-dependent processes.

Le and Cheng45 studied the problem of why genetically identical cells in the body vary widely in their storage of fat, even when there is no difference in the expression of the particular genes that affect this storage. They found that the differences between cells were due to variation occurring in a cascade of events within an insulin-signalling pathway. These variations were slight at the beginning of the cascade, but led to very different results at the end. Possibly this is an example of chaos; cascade phenomena would be expected to be nonlinear with complex feedback dynamics.

In an interesting Nature letter, Frank and Nowak46 suggest a model where random mutations at a very young age can produce a developmental disposition to cancer.The idea is that during the gestational phase, stem cells may mutate and then multiply randomly with long-lasting effects. If the mutation rate is high enough, this initial random variation could be a dominant feature in later life.47,48

A classic paper by Gärtner, on the importance of apparently random variation in biology, was recently reprinted in the International Journal of Epidemiology with a series of commentaries that discuss the nature of this variation, whether it is due for example to epigenetic effects, and whether nonlinearity could be a source.49–56

The examples given here show that great individual variation in genetically identical organisms may simply be an accumulation of purely random variation combined with nonlinear dynamics.

Davey Smith offered a fascinating discussion of the importance of randomness in epidemiology.57 He asserted that epidemiology cannot capture the pervasive randomness which averages out at the population level. Our point here, though, is that when time is considered, there are tell-tale indications of random variation.

Epigenetic stochasticity

During recent years, there has been growing recognition that environmental exposure affects cancer susceptibility through epigenetic changes, in addition to the traditional gene-environment interactions that can promote mutations. This is particularly relevant in the developmental origins of health and disease hypothesis.58 Some authors argue for a paradigm shift, where the old view on the importance of DNA mutations is down-weighted and supplemented by the modern view of epigenetic modifications.59 There are, however, indications of an important stochastic component to these modifications, and it has even been speculated that the majority of important epigenetic changes may not be due to the environment, but to random events early in life.59 This might explain the large variation that is often observed between genetically identical individuals.60

Familial cancer risk points to large individual heterogeneity

For many diseases there is a familial association in risk more than can be explained by chance. A surprising and counter-intuitive issue is that even a moderate familial association points towards a large variation in risk between families. Hence, the existence of a familial association is another argument for the presence of considerable individual heterogeneity in risk.

There is generally a familial association when it comes to cancer risk. For example, in breast cancer some mutations in the genes BRCA1 and BRCA2 confer a very high risk in the specific family. But even in the absence of such ‘important’ genes, sizeable familial association is still observed. Johns and Houlston61 pointed out that having a first-degree relative with CRC is associated with more than a doubling of one’s risk for the disease, whereas the risk is increased more than 4-fold when one has two first-degree relatives with CRC. The risk of testicular cancer for a brother of a case is increased about 6-fold.7,8 Tumours of the nervous system also show a strong heritability (standardized incidence ratios around 2, but up to 27 for the rare multiplex families).62

Even familial risks that appear modest, like the relative risk of about 2 seen for relatives of breast or colon cancer patients, still imply a large variation in risk between individuals.62–64 This has also been pointed out by Moger et al.7 and by Aalen65 in a cardiovascular disease setting. In fact the variation in individual risk when even small familial risks are observed will typically be of the type in panels 3 and 4 of Figure 2. An interesting quote from Hopper63 stresses this surprising fact:

Even for a disease for which there is only what one might consider in epidemiological terms ‘modest’ familial aggregation (such as a 2-fold increased risk for close relatives of affected), people of the same age and sex must differ greatly in their familial risks of disease (e.g. a 20-fold or more difference in risk between the quarter of the population at lowest familial risk and the quarter of the population at greatest familial risk). This familial risk gradient is in addition to differences due to ‘non-familial’ environmental or lifestyle factors that are specific to individuals. Finding the causes of even a modest proportion of familial aggregation of a disease could be a major step in understanding the causes of the disease itself.

Let us consider a very simple situation: assume that the population is divided into two groups of equal size, and such that the probability of acquiring a specific disease is 1% in one group and 20% in the other. All the members of a given family belong to the same group, be it the high-risk or low-risk group. Consider that the familial relative risk is defined as follows: the conditional probability of developing the disease if a specific family member has acquired it, divided by the average risk of getting the disease. In our example the familial relative risk equals 1.82. Hence a relative risk of 20 at the individual level translates into a very modest familial risk, just as suggested by Hopper.63

Since familial relationships are important for disease risk, it is useful to use study designs that to some extent control for such relationships. Within-pair twin studies are useful in this regard.66

Statistical models for familial risk

In order to get a deeper understanding, one has to consider statistical models. The familial risk association depends on two conditions, namely the correlation between the risk factors within a family, and the variation in risk within the population associated with these factors. Assume that the risk depends exponentially on normally distributed risk factors with a correlation ρ, and that s denotes the relative risk associated with a change in the risk factor from mean –2 standard deviations (SD) to mean + 2SD. The familial relative risk, r, associated with a diseased sibling is given by:

r=exp{ρ(lns)2/16}, (1)

which is a special case of a more general formula given by Aalen.65 Assume for instance that ρ = 0.5 which is a very strong familial correlation. Then formula (1) as a function of s is plotted in Figure 3. One sees that even for s = 10, which represents a very strong effect of the risk factor, the value of r is still less than 1.2. Hence, for simple polygenetic inheritance at the risk factor level, the familial relative risk associated with even strong risk factors is very moderate.

Figure 3.

Figure 3.

The familial relative risk, r, associated with a diseased sibling as a function of s according to formula (1) in the text, where s denotes the relative risk associated with a change in a risk factor from mean minus two standard deviations to mean plus two standard deviations. The familial correlation, ρ, is set to 0.5. Based on normally distributed variation in risk.

In practice, familial association will have several sources, partly genetic and partly a shared environment or culture, or attitude toward various risk behaviours. It can be shown that known environmental influences contribute only very slightly to the observed familial risk association. However, measured risk factors could be poor surrogates for risk factors that are stronger, more strongly familial, and the effect could be somewhat prone to measurement error, for example. 64

Formula (1) presumes a normal distribution of the risk factor(s), which one would usually assume for simple polygenetic inheritance. Some skewness might be introduced, which might appear more realistic if some genes have a stronger effect than others, for example due to higher penetrance. To investigate the effect of introducing additional skewness in the distribution of the risk factor(s) into the model, we shall assume that two individuals have a common risk component which is gamma distributed with shape parameter δ. Following Aalen,65 the modified familial relative risk, rF, is given by:

rF={1+lnrδ2(δlnr)1/2}δ. (2)

Note that when the shape parameter δ goes to infinity, this expression will converge to r (because an infinite δ implies a normal distribution for the common component). Plots of formula (2) as a function of δ and r are given in Figure 4. The major deviation occurs for δ=1 which corresponds to an exponential distribution of the common familial risk. This represents a high degree of skewness (Figure 5). It means that members of a minority of families have a much higher risk than others. However, the familial relative risk still appears to be moderate. Figure 5 also includes an illustration of an even more skewed gamma distribution.

Figure 4.

Figure 4.

The modified familial relative risk, rF, associated with a diseased sibling as a function of r according to formula (2) in the text, where the two individuals in the family have a common risk component that is gamma distributed with shape parameter δ. Here r is the familial relative risk from formula (1), that is the familial relative risk without the skewness introduced by the common, gamma distributed component. rF is plotted for given values of δ. Note that δ implies rF=r.

Figure 5.

Figure 5.

Probability density for a random variable X, following either the exponential distribution (solid line) or the gamma distribution with shape parameter 0.5 (dashed line).

Similar results are presented in the work of Moger et al.7 where a totally different mathematical model also indicated that even a very skewed familial frailty distribution would result in very moderate familial relative risks. The paper presents the following useful formula:

R=CV2+1

where CV is the coefficient of variation of the probability of being susceptible, as it varies between families, and R is the relative risk of another member of the family acquiring the disease if there is already a case in the family. From the above formula it is seen that assuming, for example, R = 2 implies CV = 1. This means that the standard deviation equals the expectation. If the distribution comes from the gamma family, then it has to be an exponential distribution. If CV is greater than 1, then the shape parameter of the gamma distribution is less than 1, which yields an extremely skewed distribution (Figure 5). In fact the cases discussed here correspond to panels 3 and 4 of Figure 2.

The conclusion from this brief review of familial association is that a familial relative risk of 2 or above is a strong indication of wide variation in individual familial risk and of the existence of high risk groups of individuals.

The competing explanations: frailty selection vs biological mechanism

Frailty explanations of observed incidence rates will typically attribute certain findings to statistical selection effects. A disease where frailty is likely to play a role is testicular cancer. The age-incidence curve of this disease is typical of cancer forms originating in early (fetal) life, reaching a peak at a rather young age (approximately 30 years) and then declining sharply. A reasonable explanation for this observation is that some men are susceptible to acquiring testicular cancer, and do so relatively early. This leads to an increasing age-incidence rate at quite young ages. The subsequent declining incidence of testicular cancer with age is presumed to be due to high-risk individuals being selected out from the population after they acquire the disease.5 The individual risk of testicular cancer is thus increasing throughout life for susceptible individuals, whereas the age-incidence rate observed in the population is peaking due to selection effects. This fits well with biological evidence suggesting that testicular cancer may be caused by cellular damage during fetal life, which has been used as a basis for a so-called frailty analysis of incidence.5 The origin of testicular cancer is believed to be carcinoma in situ cells, the malignant transformation of which is initiated during early development from primordial germ cells, or gonocytes that fail either to end their proliferation or to undergo proper differentiation.67 Since the incidence rate of testicular cancer also has increased substantially during past decades, this damage appears to have become more prevalent over time. It should be noted that this kind of statistical explanation typically competes with a biological mechanistic one. It has also been suggested that the decline in the risk of testicular cancer with age could be due to a declining testosterone level. Although the surge in testosterone level during puberty is important for the transformation of dormant carcinoma in situ cells to invasive testicular cancer, there is no evidence that individual testosterone level is a risk factor for testicular cancer.68 Furthermore, the decline in testosterone is rather modest from the age of 30 years.

On the other hand, there are clearly cases where frailty is not the major cause of the decline in risk. One example is retinoblastoma, where there are almost no cases in individuals over 10 years of age. The likely explanation is that the retinoblasts are fully differentiated at the age of 10, and thus thereafter are not susceptible to malignant transformation.69 However, in his seminal study on retinoblastoma, Knudson actually took varying frailty into account.70 Long before the Rb1 gene was identified, he separated a very frail group (those with an inherited germ line mutation) from a less frail group (those who had the non-hereditary form), and used this to formulate his famous two-hit hypothesis. The case of retinoblastoma is thus an example of how the consideration of varying frailty combined with biological knowledge may provide valuable insights.

Competing frailty and biological mechanistic explanations are often suggested, and it may not be obvious which one is correct. Part of the difficulty is that when frailty is estimated from single event data (e.g. the single occurrence of a specific type of cancer for an individual), there will necessarily be uncertainty. A much more precise assessment of frailty can be done in a setting where there are repeated events (e.g. cancer in both breasts, kidneys or testicles), or when studying cancer incidence in families, e.g. testicular cancer among brothers.7,8

Interpretation of epidemiological measures

Taking heterogeneity, or varying frailty, between individuals into account can be of crucial importance for the understanding of epidemiological features in a population. There is a natural tendency to assume that hazard rates and incidence rates can be taken at face value. Although these concepts appear to be simple, their interpretation can still be very difficult. The statistical interest in frailty stems in part from the fact that it can lead to curious statistical artefacts.

Cross-over effects

Consider two groups of individuals with hazard rates α(t) and 2α(t), such that the hazard ratio is 2. In each of these groups there would necessarily be some unobserved heterogeneity between individuals. By introducing equally distributed frailty variables in the two groups, a decreasing hazard ratio over time may be obtained. Depending on the choice of frailty distribution, the hazard ratio may even cross over and become lower than 1, such that the high-risk group appears to become the low-risk group (Figure 6). The decrease (and possible cross-over) of the hazard ratio over time is a frailty effect. Individuals in the high-risk group will on average experience events earlier than those in the low-risk group. This causes the proportion of highly susceptible individuals in the high-risk group to decrease faster than in the low-risk group, leaving an increasing proportion of less susceptible individuals. Thus, the hazard ratio will decrease. If, for instance, the population contains a non-susceptible subgroup, then the susceptible individuals in the high-risk group would be exhausted earlier than in the low-risk group, causing the relative risk to cross over and become lower than 1, even if the hazard ratio stays constant on the individual level. This means that when frailty is not observed and cannot be accounted for, a wrong conclusion could be drawn regarding the true relationship between two groups. This is in fact a time-dependent version of Simpson’s paradox, which means that the observed relationship (concerning risk of disease, for example) between two groups is reversed at an aggregate level compared with what would be observed at a more detailed level if covariates could be conditioned on.

Figure 6.

Figure 6.

Assume that the hazard rates in two risk groups are α(t) and 2α(t) respectively. When frailty variables are introduced, the observed relative risk declines over time as shown in the figure. Three frailty distributions are used; one leads to a crossover of the hazard ratio. This case corresponds to a frailty distribution with a positive probability of zero frailty (i.e. a non-susceptible group). See Aalen et al.2, Chapter 6, for technical details.

Likely cross-over phenomena are observed in practice, for example in the work of Gulsvik et al. where it is shown that high serum cholesterol appears as a ‘protective’ factor with respect to general mortality at old age.71 This could be a frailty artefact, especially since statin treatments has been shown to reduce disease incidence of cardiovascular disease in the elderly.72 A strongly reduced risk with age was also seen for smokers compared with never smokers in the paper by Gulsvik et al. which could be, at least partially, a frailty phenomenon.71 Increasing reverse causation with age could also contribute to explaining such results.

Another interesting effect of frailty occurs when discontinuing treatment in a clinical trial. Let us assume that the treatment group has hazard rate α(t) and the control group has hazard rate 2α(t), presuming the treatment is effective. At the start of the study, the hazard ratio is 2. Because the treatment is effective, patients in the control group will on average have events earlier than in the treatment group, and the hazard ratio will decrease with time. At some point the difference between the hazard rates is so small that it is decided treatment is no longer effective, and it is stopped. A possible consequence of this decision is that the hazard ratio drops below 1, and it appears protective to be a member of the control group (Figure 7). In the control group the frailest individuals would already have had an event at this point and, at the time of discontinuing the treatment, there would be a higher proportion of less frail individuals in the control group. In the treatment group, frail individuals that would already have had an event if they had not been treated, have a very high risk immediately after the treatment is stopped. Not being aware of a possible frailty effect may lead to a wrong impression of the effect of discontinuing a treatment for an individual.

Figure 7.

Figure 7.

Effect of discontinuing treatment. A control group with hazard rate 2α(t) is compared with a treatment group with hazard rate α(t). Treatment is discontinued at time point 1.

False protectivity

In a competing risks framework, two (or more) events compete in determining the failure of an individual. The failure rate of each cause is expressed in terms of a cause-specific hazard rate. As in the example above, the hazard rates may be influenced by frailties. If these frailties are correlated, then one may observe a false protectivity.73 If a covariate has a detrimental effect on one of the two competing risks, it may, at the population level, appear to be protective in the other cause-specific hazard rate.

Frailty and models of carcinogenesis

The famous multi-stage model of Armitage and Doll set the stage for a mathematical approach to understanding cancer incidence,74 and it continues to play a fundamental role in our understanding of the carcinogenic process. This was exemplified by the re-publication of the original article in the International Journal of Epidemiology at its 50-year anniversary in 2004.75 In a commentary to this reprint, Doll points out his views on the random nature of cancer development.76 Since the Armitage-Doll model was first suggested, however, more sophisticated models have been published. Moolgavkar and Knudson proposed a two-hit model (combined with clonal expansion of initiated cells),77 andexplained peaking incidence rates of certain cancers by the varying (decreasing) number of stem cells susceptible to mutation. Their model has later been expanded to allow for a cell to undergo several transitions before going into the clonal expansion phase,78 as well as other further developments of the model.47,79,80 All these models were created to facilitate the understanding of cancer development on an individual level. Meza et al. studied the effect of gestational mutations on cancer risk, and stated that: ‘Even with identical gestational mutation rates in all individuals in a population, at birth individuals are at different risk because of random variation in the number of mutated cells at birth’.47 Heidenreich modelled risk functions (at certain ages) for liver cancer by treating the two-stage clonal expansion model as fully stochastic, and demonstrated that this leads to heterogeneity in a population even when considering genetically identical individuals.81,82 Taking varying frailty into account (i.e. heterogeneity in risk between individuals), a Weibull hazard rate, as suggested by the Armitage-Doll model, is a sensible approximation to the carcinogenic process within an individual.83 A mathematical formulation is that, on the individual level, the hazard rate of an event is given as a product of the Weibull hazard and an individual frailty factor. The frailty component may include both underlying heritable predispositions and an increased susceptibility due to purely random events. As opposed to the exhaustion of susceptible stem cells within the individual, the model considers the exhaustion of initially highly susceptible individuals as an explanation of a peaking age-incidence curve. This approach may also be modified in several ways, including taking into account an expanding host tissue during, for example, puberty.84 An important element is thus to combine models of carcinogenesis with a realistic understanding of individual differences,6,84,85 to better understand features of population age-incidence rates.

Interpretation of incidence rates

It turns out that changes in epidemiological incidence rates over calendar time also can be wrongly interpreted if one does not take into consideration the possible heterogeneity between individuals. Consider the simple Armitage-Doll multistage model of carcinogenesis, which states that a cell has to go through a certain number of transitions to reach malignancy. As an example, consider the simple version of a multistage model as shown in Figure 8. Assume that the transition rates are not the same for all individuals, but that there is a strong variation in susceptibility. Consider for instance a population where only a small subgroup is susceptible to the cancer in question, and the majority has a zero rate of cancer initiation (transition from a normal cell to an intermediate cell). If the initiation rate increases abruptly at a given point in (calendar) time, the incidence rate may increase to a peak, then drop and stabilize on a higher level. This is illustrated in Figure 9a, for the simple multistage model in Figure 8, with only 1% of the population being susceptible. The same point, with 90% being susceptible, is illustrated in Figure 9b. Although a simplification, the abrupt increase in the initiation rate could be the result of a risk factor that becomes more pronounced in the population at a given time.

Figure 8.

Figure 8.

Simple illustration of an Armitage-Doll multi-stage model of carcinogenesis. The states represent the stages of the carcinogenic process. State one is the healthy state, state two is an intermediate state, and in state three a malignant cell has developed. State four is a censored state. The αs and μs are transition rates.

Figure 9.

Figure 9.

Incidence rates for the model in Figure 8. Assume that 10,000 individuals enter state one per time unit. The transition rates are α1=0.01 for time <20, and α1=0.03 for time 20. Also, α2=0.02, μ1=0.01 and μ2=0.05 a) 1% of the population is susceptible, i.e. having α10. b) 90% of the population is susceptible, i.e. having α10.

The above example is simple, but illustrates that changes in the prevalence of risk factors may have an impact on observed incidence rates, even a long time after the change occurred. Whereas the real biological change here was an abrupt increase in the prevalence of a risk factor, the observed incidence rates gave the impression of a risk that first increased and then decreased. It is of course more likely that the presence of risk factors changes gradually over time, and this will have a similar effect on observed incidence rates as in the above example. The idea is that the increase in the prevalence of the risk factor over time could cause cancers that would not have emerged earlier to appear, even a long time after the external risk factor has ceased to change, and the observed incidence rate will thus continue to change after the prevalence of the risk factor has stabilized. If a cell requires more events to become malignant, changes in prevalence of different risk factors may affect the transition rates to various states. This could possibly also lead to multimodal shapes of hazard rates.

The point we are making is that changes in the incidence rate may not be a simple reflection of what is happening at a biological level. It is well known that underlying effects will be smoothed out in the observed incidence. But in addition to this, frailty may produce incidence rates with aspects that are unrepresentative of the underlying changes. Care should be taken before drawing conclusions on an individual level based on observations in a population.

Conclusion

We have pointed out a number of findings that indicate the presence of a considerable individual variation in the risk of cancer and other diseases that goes beyond what is due to measured risk factors. Varying frailty may create artefacts when studying incidence rates and other epidemiological measures, such as a decline in incidence due to the frailest individuals experiencing the event early.

Familial associations in disease that appear moderate may be the result of a large underlying variation in risk between individuals. This and other aspects of individual variation point towards caution in interpretation. The presence of individual heterogeneity cannot be ignored. It may be necessary to perform mathematical modelling to get a proper understanding of the nature and magnitude of the phenomenon of frailty in any given study population.

Funding

This work was partially supported by a grant from the Norwegian Research Council (191460/V50), and by the Norwegian Cancer Society, project/grant number 171851.

Conflict of interest: None declared.

References

  • 1.Hougaard P. Analysis of Multivariate Survival Data. New York: Springer, 2000. [Google Scholar]
  • 2.Aalen OO, Borgan Ø, Gjessing HK. Survival and Event History Analysis: A Process Point of View. New York: Springer, 2008. [Google Scholar]
  • 3.Wienke A. Frailty Models in Survival Analysis. Boca Raton, FL: Chapman & Hall/CRC, 2011. [Google Scholar]
  • 4.Svensson E, Rogvin M, Hultman CM, Reichborn-Kjennerud T, Sandin S, Moger TA. Schizophrenia susceptibility and age of diagnosis – A frailty approach. Schizophr Res 2013;147:140–46. [DOI] [PubMed] [Google Scholar]
  • 5.Aalen OO, Tretli S. Analyzing incidence of testis cancer by means of a frailty model. Cancer Causes Control 1999;10:285–92. [DOI] [PubMed] [Google Scholar]
  • 6.Moger TA, Aalen OO, Halvorsen TO, Storm HH, Tretli S. Frailty modelling of testicular cancer incidence using Scandinavian data. Biostatistics 2004;5:1–14. [DOI] [PubMed] [Google Scholar]
  • 7.Moger TA, Aalen OO, Heimdal K, Gjessing HK. Analysis of testicular cancer data using a frailty model with familial dependence. Stat Med 2004;23:617–32. [DOI] [PubMed] [Google Scholar]
  • 8.Valberg M, Grotmol T, Tretli S, Veierød MB, Moger TA, Aalen OO. A hierarchical frailty model for familial testicular germ-cell tumors. Am J Epidemiol 2014;179:499–506. [DOI] [PubMed] [Google Scholar]
  • 9.Moger TA, Haugen M, Yip BH, Gjessing HK, Borgan Ø. A hierarchical frailty model applied to two-generation melanoma data. Lifetime Data Anal 2011;17:445–60. [DOI] [PubMed] [Google Scholar]
  • 10.Ferlay J, Soerjomataram I, Ervik M, et al. .GLOBOCAN 2012 v1.0, Cancer Incidence and Mortality Worldwide: IARC CancerBase No. 11. Lyon, France: International Agency for Research on Cancer, 2013. [Google Scholar]
  • 11.Ferlay J, Steliarova-Foucher E, Lortet-Tieulent J, et al. Cancer incidence and mortality patterns in Europe: Estimates for 40 countries in 2012. Eur J Cancer 2013;49:1374–403. [DOI] [PubMed] [Google Scholar]
  • 12.Ollberding NJ, Nomura AM, Wilkens LR, Henderson BE, Kolonel LN. Racial/ethnic differences in colorectal cancer risk: the multiethnic cohort study. Int J Cancer 2011;129:1899–906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Svensson E, Grotmol T, Hoff G, Langmark F, Norstein J, Tretli S. Trends in colorectal cancer incidence in Norway by gender and anatomic site: an age-period-cohort analysis. Eur J Cancer Prev 2002;11:489–95. [DOI] [PubMed] [Google Scholar]
  • 14.Soto-Ortiz L, Brody JP. Similarities in the age-specific incidence of colon and testicular cancers. PLoS One 2013;8:e66694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Balmain A, Gray J, Ponder B. The genetics and genomics of cancer. Nat Genet 2003;33:238–44. [DOI] [PubMed] [Google Scholar]
  • 16.Peto J, Mack TM. High constant incidence in twins and other relatives of women with breast cancer. Nat Genet 2000;26:411–14. [DOI] [PubMed] [Google Scholar]
  • 17.Falk RS, Hofvind S, Skaane P, Haldorsen T. Second events following ductal carcinoma in situ of the breast: a register-based cohort study. Breast Cancer Res Treat 2011;129:929–38. [DOI] [PubMed] [Google Scholar]
  • 18.Win AK, Macinnis RJ, Hopper JL, Jenkins MA. Risk prediction models for colorectal cancer: a review. Cancer Epidemiol Biomarkers Prev 2012;21:398–410. [DOI] [PubMed] [Google Scholar]
  • 19.Dowty JG, Win AK, Buchanan DD, et al. Cancer risks for MLH1 and MSH2 mutation carriers. Hum Mutat 2013;34 490–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Landvik NE, Zienolddiny S, Skaug V, Haugen A. Gene variants and lung cancer risk. BMC Proceedings 2010;4(Suppl 2):O3. [Google Scholar]
  • 21.Forsdahl A. Are poor living conditions in childhood and adolescence an important risk factor for arteriosclerotic heart disease? Br J Prev Soc Med 1977;31:91–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ravelli GP, Stein ZA, Susser MW. Obesity in young men after famine exposure in utero and early infancy. N Engl J Med 1976;295:349–53. [DOI] [PubMed] [Google Scholar]
  • 23.Kermack WO, McKendrick AG, McKinlay PL. Death-rates in Great Britain and Sweden: Expression of specific mortality rates as products of two factors, and some consequences thereof. J Hyg (Lond) 1934;34:433–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Davey Smith G, Kuh D. Commentary: William Ogilvy Kermack and the childhood origins of adult health and disease. Int J Epidemiol 2001;30:696–703. [DOI] [PubMed] [Google Scholar]
  • 25.Barker DJ, Winter PD, Osmond C, Margetts B, Simmonds SJ. Weight in infancy and death from ischaemic heart disease. Lancet 1989;2:577–80. [DOI] [PubMed] [Google Scholar]
  • 26.Barker DJ, Osmond C. Infant mortality, childhood nutrition, and ischaemic heart disease in England and Wales. Lancet 1986;1:1077–81. [DOI] [PubMed] [Google Scholar]
  • 27.van der Pols JC, Bain C, Gunnell D, Davey Smith G, Frobisher C, Martin RM. Childhood dairy intake and adult cancer risk: 65-y follow-up of the Boyd Orr cohort. Am J Clin Nutr 2007;86:1722–29. [DOI] [PubMed] [Google Scholar]
  • 28.Eriksson JG, Kajantie E, Osmond C, Thornburg K, Barker DJ. Boys live dangerously in the womb. Am J Hum Biol 2010;22:330–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Davey Smith G. Epigenesis for epidemiologists: does evo-devo have implications for population health research and practice? Int J Epidemiol 2012;41:236–47. [DOI] [PubMed] [Google Scholar]
  • 30.Heard E, Martienssen RA. Transgenerational epigenetic inheritance: myths and mechanisms. Cell 2014;157:95–109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Thompson RF, Einstein FH. Epigenetic basis for fetal origins of age-related disease. J Womens Health (Larchmt) 2010;19:581–87. [DOI] [PubMed] [Google Scholar]
  • 32.Pearson H. Study of a lifetime. Nature 2011;471:20–24. [DOI] [PubMed] [Google Scholar]
  • 33.Painter RC, Osmond C, Gluckman P, Hanson M, Phillips DI, Roseboom TJ. Transgenerational effects of prenatal exposure to the Dutch famine on neonatal adiposity and health in later life. BJOG 2008;115:1243–49. [DOI] [PubMed] [Google Scholar]
  • 34.Verhoeven R, Houterman S, Kiemeney B, Koldewijn E, Coebergh JW. Testicular cancer: marked birth cohort effects on incidence and a decline in mortality in southern Netherlands since 1970. Int J Cancer 2008;122:639–42. [DOI] [PubMed] [Google Scholar]
  • 35.Bergström R, Adami HO, Mohner M, et al. Increase in testicular cancer incidence in six European countries: a birth cohort phenomenon. J Natl Cancer Inst 1996;88:727–33. [DOI] [PubMed] [Google Scholar]
  • 36.Visscher PM, Brown MA, McCarthy MI, Yang J. Five years of GWAS discovery. Am J Hum Genet 2012;90:7–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cirulli ET, Goldstein DB. Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet 2010;11:415–25. [DOI] [PubMed] [Google Scholar]
  • 38.Maher B. Hiding place for missing heritability uncovered. Nature 2010. doi: 10.1038/news.2010.33 [Google Scholar]
  • 39.Dickson SP, Wang K, Krantz I, Hakonarson H, Goldstein DB. Rare variants create synthetic genome-wide associations. PLoS Biol 2010;8:e1000294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Fletcher O, Houlston RS. Architecture of inherited susceptibility to common cancer. Nat Rev Cancer 2010;10:353–61. [DOI] [PubMed] [Google Scholar]
  • 41.Gibson G. Rare and common variants: twenty arguments. Nat Rev Genet 2012;13:135–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Yang J, Benyamin B, McEvoy BP, et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet 2010;42:565–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Strogatz SH. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering (Studies in Nonlinearity). Cambridge, MA: Perseus Books Group, 2001. [Google Scholar]
  • 44.Kirkwood TB, Finch CE. The old worm turns more slowly. Nature 2002;419:794–95. [DOI] [PubMed] [Google Scholar]
  • 45.Le TT, Cheng JX. Single-cell profiling reveals the origin of phenotypic variability in adipogenesis. PLoS One 2009;4:e5189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Frank SA, Nowak MA. Developmental predisposition to cancer. Nature 2003;422:494. [DOI] [PubMed] [Google Scholar]
  • 47.Meza R, Luebeck EG, Moolgavkar SH. Gestational mutations and carcinogenesis. Math Biosci 2005;197:188–210. [DOI] [PubMed] [Google Scholar]
  • 48.Frank SA. Evolution in health and medicine Sackler colloquium: Somatic evolutionary genomics: mutations during development cause highly variable genetic mosaicism with risk of cancer and neurodegeneration. Proc Natl Acad Sci U S A 2010;107(Suppl 1):1725–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Gärtner K. A third component causing random variability beside environment and genotype. A reason for the limited success of a 30 year long effort to standardize laboratory animals? Lab Animals 1990;24:71–77. [DOI] [PubMed] [Google Scholar]
  • 50.Gärtner K. A third component causing random variability beside environment and genotype. A reason for the limited success of a 30 year long effort to standardize laboratory animals? Int J Epidemiol 2012;41:335–41. [DOI] [PubMed] [Google Scholar]
  • 51.Gärtner K. Commentary: Random variability of quantitative characteristics, an intangible epigenomic product, supporting adaptation. Int J Epidemiol 2012;41:342–46. [DOI] [PubMed] [Google Scholar]
  • 52.Kan K-J, Boomsma DI, Dolan CV, van der Maas HL. Commentary: The presence of bifurcations as a ‘third component of individual differences’: implications for quantitative (behaviour) genetics. Int J Epidemiol 2012;41:346–51. [DOI] [PubMed] [Google Scholar]
  • 53.Kirkwood TB. Commentary: Ageing—what's all the noise about? Developments after Gärtner. Int J Epidemiol 2012;41:351–52. [DOI] [PubMed] [Google Scholar]
  • 54.Ruvinsky A. Commentary: Why are there difficulties in controlling genetic variability? Int J Epidemiol 2012;41:353–54. [DOI] [PubMed] [Google Scholar]
  • 55.Martin GM. Commentary: A gerontological perspective on Klaus Gärtner's discovery that phenotypic variability of mammals is driven by stochastic events. Int J Epidemiol 2012;41:354–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Oey H, Whitelaw E. Commentary: Gärtner's ‘third component’: still an open question. Int J Epidemiol 2012;41:356–58. [DOI] [PubMed] [Google Scholar]
  • 57.Davey Smith G. Epidemiology, epigenetics and the ‘Gloomy Prospect’: embracing randomness in population health research and practice. Int J Epidemiol 2011;40:537–62. [DOI] [PubMed] [Google Scholar]
  • 58.Walker CL, Ho SM. Developmental reprogramming of cancer susceptibility. Nat Rev Cancer 2012;12:479–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Petronis A. Epigenetics as a unifying principle in the aetiology of complex traits and diseases. Nature 2010;465:721–27. [DOI] [PubMed] [Google Scholar]
  • 60.Finch CE, Kirkwood TBL. Chance, Development, and Aging. New York: Oxford University Press, 2000. [Google Scholar]
  • 61.Johns LE, Houlston RS. A systematic review and meta-analysis of familial colorectal cancer risk. Am J Gastroenterol 2001;96:2992–3003. [DOI] [PubMed] [Google Scholar]
  • 62.Hemminki K, Tretli S, Sundquist J, Johannesen TB, Granström C. Familial risks in nervous-system tumours: a histology-specific analysis from Sweden and Norway. Lancet Oncol 2009;10:481–88. [DOI] [PubMed] [Google Scholar]
  • 63.Hopper JL. Disease-specific prospective family study cohorts enriched for familial risk. Epidemiol Perspect Innov 2011;8:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Hopper JL, Carlin JB. Familial aggregation of a disease consequent upon correlation between relatives in a risk factor measured on a continuous scale. Am J Epidemiol 1992;136:1138–47. [DOI] [PubMed] [Google Scholar]
  • 65.Aalen OO. Modelling the influence of risk-factors on familial aggregation of disease. Biometrics 1991;47:933–45. [PubMed] [Google Scholar]
  • 66.Stone J, Dite GS, Giles GG, Cawson J, English DR, Hopper JL. Inference about causation from examination of familial confounding: application to longitudinal twin data on mammographic density measures that predict breast cancer risk. Cancer Epidemiol Biomarkers Prev 2012;21:1149–55. [DOI] [PubMed] [Google Scholar]
  • 67.McGlynn KA, Cook MB. Etiologic factors in testicular germ-cell tumors. Future Oncol 2009;5:1389–402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Swerdlow AJ, Huttly SR, Smith PG. Testis cancer: post-natal hormonal factors, sexual behaviour and fertility. Int J Cancer 1989;43:549–53. [DOI] [PubMed] [Google Scholar]
  • 69.Chial H. Tumor suppressor (TS) genes and the two-hit hypothesis. Nature Education 2008;1:177. [Google Scholar]
  • 70.Knudson AG. Mutation and cancer: statistical study of retinoblastoma. Proc Natl Acad Sci U S A 1971;68:820–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Gulsvik AK, Thelle DS, Samuelsen SO, Myrstad M, Mowé M, Wyller TB. Ageing, physical activity and mortality—a 42-year follow-up study. Int J Epidemiol 2012;41:521–30. [DOI] [PubMed] [Google Scholar]
  • 72.Savarese G, Gotto AM, Jr, Paolillo S, et al. Benefits of statins in elderly subjects without established cardiovascular disease: a meta-analysis. J Am Coll Cardiol 2013;62:2090–99. [DOI] [PubMed] [Google Scholar]
  • 73.Di Serio C. The protective impact of a covariate on competing failures with an example from a bone marrow transplantation study. Lifetime Data Anal 1997;3:99–122. [DOI] [PubMed] [Google Scholar]
  • 74.Armitage P, Doll R. The age distribution of cancer and a multi-stage theory of carcinogenesis. Br J Cancer 1954;8:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Armitage P, Doll R. The age distribution of cancer and a multi-stage theory of carcinogenosis. Int J Epidemiol 2004;33:1174–79. [DOI] [PubMed] [Google Scholar]
  • 76.Doll R. Commentary: The age distribution of cancer and a multistage theory of carcinogenesis. Int J Epidemiol 2004;33:1183–84. [DOI] [PubMed] [Google Scholar]
  • 77.Moolgavkar SH, Knudson AG., Jr Mutation and cancer: a model for human carcinogenesis. J Natl Cancer Inst 1981;66:1037–52. [DOI] [PubMed] [Google Scholar]
  • 78.Luebeck EG, Moolgavkar SH. Multistage carcinogenesis and the incidence of colorectal cancer. Proc Natl Acad Sci U S A 2002;99:15095–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Jeon J, Meza R, Moolgavkar SH, Luebeck EG. Evaluation of screening strategies for pre-malignant lesions using a biomathematical approach. Math Biosci 2008;213:56–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Meza R, Jeon J, Moolgavkar SH, Luebeck EG. Age-specific incidence of cancer: Phases, transitions, and biological implications. Proc Natl Acad Sci U S A 2008;105:16284–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Heidenreich WF. Heterogeneity of cancer risk due to stochastic effects. Risk Anal 2005;25:1589–94. [DOI] [PubMed] [Google Scholar]
  • 82.Heidenreich W. Heterogeneity of cancer risk due to stochastic effects: emphasis on radiation-induced effects. Radiat Environ Biophys 2006;45:33–37. [DOI] [PubMed] [Google Scholar]
  • 83.Kopp-Schneider A. Carcinogenesis models for risk assessment. Stat Methods Med Res 1997;6:317–40. [DOI] [PubMed] [Google Scholar]
  • 84.Valberg M, Grotmol T, Tretli S, Veierød MB, Devesa SS, Aalen OO. Frailty modeling of age-incidence curves of osteosarcoma and Ewing sarcoma among individuals younger than 40 years. Stat Med 2012;31:3731–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Haugen M, Bray F, Grotmol T, Tretli S, Aalen OO, Moger TA. Frailty modeling of bimodal age-incidence curves of nasopharyngeal carcinoma in low-risk populations. Biostatistics 2009;10:501–14. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from International Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES