Abstract
As a part of the complement system, factor H regulates phagocytosis and helps differentiate between a body's own and foreign cells. Owing to mimicry efforts, some pathogenic microorganisms such as Candida albicans are able to bind factor H on their cell surfaces and, thus, become similar to host cells. This implies that the decision between self and foreign is not clear-cut, which leads to a classification problem for the immune system. Here, two different alleles determining the binding affinity of factor H are relevant. Those alleles differ in the SNP Y402H; they are known to be associated with susceptibility to certain diseases. Interestingly, the fraction of both alleles differs in ethnic groups. The game-theoretical model proposed in this article explains the coexistence of both alleles by a battle of the sexes game and investigates the trade-off between pathogen detection and protection of host cells. Further, we discuss the ethnicity-dependent frequencies of the alleles. Moreover, the model elucidates the mimicry efforts by pathogenic microorganisms.
Keywords: complement factor H, innate immune system, evolutionary game theory, battle of the sexes game, molecular mimicry
1. Introduction
Complement factor H (also known as CFH) plays an important role in the immune system, as it helps to distinguish between intact host cells and modified or damaged host cells and foreign cells or microbes. It belongs to the alternative pathway of the complement system of the innate immune response [1].
Factor H is structured as a chain of 20 units, called either short consensus repeat (SCR) or complement control protein module. Each of them consists of about 60 amino acids [2,3]. The alternative pathway is permanently activated by the cleavage of the complement protein C3 in C3a and C3b. C3a particles diffuse and act as chemotactic proteins, whereas C3b particles can attach to any cell surface and opsonize it to enhance the clearance by phagocytosis. This complement activation is controlled by multiple regulatory proteins. Depending on the cell surface, different regulators bind to C3b. On target cells, factor B binds to C3b and is cleaved into Ba and Bb, and forms the protease complex C3-convertase, which generates more C3b by cleaving C3. Thus opsonization is promoted. On self cells membrane-bound regulators inhibit complement activation. In addition, factor H binds to C3b where it accelerates the cleavage of the C3-convertase and thus limits the C3b production (figure 1). Therefore, the host cells are not opsonized and hence protected by factor H [1,3].
Many pathogenic microorganisms express surface proteins that recruit factor H and therefore escape the immune system and increase their pathogenicity [1]. This is an example of molecular mimicry or camouflage, a widespread mechanism in biological systems. The general mechanism of mimicry originates in a similarity between two organisms that has evolved because the apparent resemblance is selectively advantageous for one species relating to a common signal receiver.
Mimicry was first described by the English naturalist Henry Walter Bates. He was immediately aware of the extensive consequences of his discovery for evolutionary biology. He wrote: ‘The process by which a mimetic analogy is brought about in nature is a problem which involves that of the origin of all species and all adaptations’ [4, p. 511]. Subsequently, besides Batesian mimicry, several other forms of mimicry were described (see also [5]).
Besides mimicry at the macroscopic level, also molecular mimicry has been studied [6] and related to autoimmune diseases [7–9]. Our work theoretically analyses an example of the latter category.
Poltermann et al. [10] discovered that the yeast Candida albicans binds factor H with its surface protein CaCRASP-1 via the two binding sites SCR 6 − 7 and SCR 19 − 20. Factor H remains active and provides a protection against the complement system. Similar strategies have been found for other fungi. For example, Aspergillus fumigatus and Aspergillus terreus strongly bind C4bp [11]. C4bp also has the ability to bind C3b and accelerate the decay of C3-convertase [12].
Related behaviour has also been found in different bacterial species, e.g. Bordetella pertussis [13] or Streptococcus pyogenes [14,15]. Here, binding of C4bp is mediated by SCRs 1 − 2. Escherichia coli binds the SCR 3 of C4bp, conferring serum resistance [16]. By contrast Campylobacter jejuni and Helicobacter pylori use lipopolysaccharides for mimicry on the level of the adaptive immune response [17].
Defects in the complement regulation system can lead to diseases caused by both undesirable damage of host cells and accumulation of immunological debris. Examples for the first case are the atypical haemolytic uraemic syndrome (aHUS) [18], dense-deposit disease (DDD) [19] and age-related macular degeneration (AMD) [20] occurring in different organs including the kidney and eyes. On the other hand, mimicking pathogens may escape phagocytosis and cells may exhibit unrestricted growth [1].
Factor H occurs in two different variants (alleles), which differ in only one single amino acid [21,22]. This single nucleotide polymorphism (SNP) is defined rs1061170 and affects the nucleotide position 43097, which can be either thymine or cytosine. As the result, the amino acid at position 402 of factor H (lying in SCR 7) becomes translated either into tyrosine (Y) or histidine (H). Therefore, this SNP is also called Y402H. These two variants are named CFH-402H and CFH-402Y. The SNP is located in the binding site for C-reactive protein (CRP), which activates the complement system. CFH binds CRP and inhibits the immune response [23]. The CFH-402H variant shows a significantly reduced binding to CRP [24]. This leads to a higher proinflammatory state, in general [25], and to an increased severity in autoimmune diseases. The CFH-402H variant is significantly associated with AMD [26–30]. Also, patients with DDD and aHUS are found to have a high prevalence of the CFH-402H variant of factor H [31].
The frequency of the H allele differs between human ethnic groups. It is about for Caucasians, African-Americans and Somalis, about for Hispanics and about for Japanese (cf. table 1). The frequency of this allele may even vary between Caucasian subpopulations [34]. Subsequently, the prevalence of diseases caused by defects in complement regulation differs in populations of miscellaneous descent, too. For example early AMD in Japanese populations is much less frequent than in Caucasians of European descent [38].
Table 1.
cultural area | frequency H-allele | frequency Y-allele | YH pay-off (wYH) | references |
---|---|---|---|---|
South America (native) | 0 | 1 | 1 | [33] |
North America (native) | 0.03 | 0.97 | 1.03 | [33] |
South Asia | 0.05–0.09 | 0.91–0.95 | 1.06–1.11 | [32–35] |
Mesoamerica (*) | 0.15–0.17 | 0.73–0.85 | 1.21–1.59 | [33,36] |
Oceania | 0.22 | 0.78 | 1.39 | [33] |
Middle East | 0.27–0.3 | 0.7–0.73 | 1.59–1.75 | [33] |
Africa | 0.31–0.35 | 0.65–0.7 | 1.75–2.17 | [33–35] |
Europe | 0.3–0.36 | 0.64–0.7 | 1.75–2.29 | [33–35] |
Iceland (*) | 0.39 | 0.61 | 2.77 | [35] |
Finland (*) | 0.43 | 0.57 | 4.07 | [37] |
Mathematical models describing mimicry have been presented for higher organisms [39–41], e.g. for hoverflies imitating wasps [42]. By contrast, models describing molecular mimicry of pathogenic fungi or bacteria are rare and are desperately needed to understand infections and autoimmune diseases (e.g. AMD and aHUS). Our approach aims to fill this gap of understanding by extending the theoretical framework of molecular mimicry, especially for interactions between pathogenic microorganisms and the immune system.
Evolutionary game theory (EGT) is an established and profound method to describe biological interactions between species. It has been successfully applied to models of host–pathogen interactions [43–50]. Mimicry is a form of deception, which has been analysed by EGT earlier [51]. In this work, we present a game-theoretic model analysing whether and to what extent the human immune system can distinguish between self and foreign in spite of the mimicry of the pathogens. We consider the (stable) existence of both alleles as two alternative strategies and its ratio within an ethnic group as an evolutionarily stable equilibrium.
2. Methods and model
2.1. The basic game
In each individual, the two genes—maternal and paternal—of factor H (where the polymorphism is localized) are viewed as players and the concrete realizations of the SNP (Y or H) as the respective strategy. We are not interested in details in the underlying biological processes but rather in an abstract description of the phenomena. That is why we use the framework of evolutionary games. The pay-off is given by the (average) fitness for the concrete genotype (YY, YH or HH). Moreover, the two heterozygous cases (YH and HY) gain the same fitness. Consequently, the pay-off for both players (genes) is the same and the pay-off matrix becomes symmetric, so that the game has a high degree of symmetry. In game theory, situations where the pay-off matrix is symmetric are called partnership games [52,53].
Y | H | |
Y | wYY | wYH |
H | wYH | wHH |
Let wYY be the pay-off for the homozygous genotype resulting in YY, wYH be the pay-off for the heterozygous and symmetric cases (i.e. YH and HY ) and wHH be the pay-off for the homozygous genotype resulting in HH. We assume that wYY, wYH, wHH are real values and pairwise different. Thus individuals with different specific allele combinations differ in their fitness. From a game-theoretical point of view these three genotypes can be considered as the result of two different strategies realized by the two genes in a concrete individual.
We assume that the given deviating allele frequencies within ethnic groups represent different evolutionarily stable solutions for the trade-off between the false positive and false negative cell detection. Let y*∈(0, 1) denote the evolutionarily stable frequency of the allele realization Y in the population. In particular, populations with homozygous individuals, either pure HH or pure YY, can be invaded by the contrary strategy, respectively (cf. table 1). Thus, there must be a pay-off advantage to a heterozygous individual to have both strategies. In population genetics this case is called overdominance [54]. Considering this, we can derive directly the inequalities
2.1 |
and
2.2 |
The evolutionarily stable frequency y*∈(0, 1) of the allele realization Y in the population can be calculated by using the Bishop–Cannings theorem [55] (see also [56], appendix A):
2.3 |
Although the value of y* varies in different populations, its values are always higher than [34]; cf. table 1. Together with equation (2.3) we derive the condition
2.4 |
meaning that comparing the two homozygous cases shows that YY gets a higher pay-off than HH. Combining the inequalities (2.1), (2.2) and (2.4), we get
2.5 |
A symmetric game with this pay-off relation is a battle of the sexes game. This is one subtype of the leader game, a classical two-player two-strategy game with mixed Nash equilibria. Here, the follower does not have a disadvantage in pay-off compared to the leader. Thus, it is on the boundary between the battle of the sexes game and another subtype of the leader game (cf. [56]). In the interpretation (cover story) of the battle of the sexes game, that parameter combination means that both partners enjoy the togetherness equally (pay-off wYH), independent of the event they visit together [56].
The inequalities (2.5) simplify the pay-off matrix. As parameter values can be normalized without loss of generality [57], it can be assumed that
2.6 |
and
2.7 |
The value of y* can then be reduced with equation (2.3) to
2.8 |
or, expressed for the value of wYH:
2.9 |
For example, the frequency of the H allele (which equals 1 − y*) is about for Caucasians, about for Hispanics and about for Japanese [34]. The value of wYH then becomes 2, and , respectively. Therefore, for example, the pay-off matrix for Caucasians becomes
Y | H | |
Y | 1 | 2 |
H | 2 | 0 |
We summarized several studies addressing the factor H polymorphism and calculated the expected pay-offs of the H allele (see equation (2.9)). The examined ethnic groups are organized in cultural areas. The cultural areas coincide with seven of the eight cultural areas identified by Hunter & Whitten [32] (North Asia is missing due to lack of data). For a more detailed view three subareas, i.e. Mesoamerica, Finland and Iceland are included.
2.2. Relating the pay-offs to false positives and false negatives
In the previous section, we analysed the observation of the two alleles, where H occurs less often than Y . The conclusion was that, for an individual, being heterozygous is most advantageous, whereas being homozygous HH is least advantageous. The next step is to find an appropriate explanation for this order of the pay-off values. For this, we have a closer look on the task of factor H and the difference of the two alleles.
A significant role of factor H is to distinguish between foreign cells (or cells to be cleared) and intact self cells (cells to be protected from complement attack and phagocytosis). Cells that are labelled with factor H are protected against the damage function of the innate immune system. Thus, these cells can be defined as labelled to be negative. Cells without factor H are indirectly labelled as positive, independently of whether the cell of interest is host or foreign. This represents a classical classification problem.
If factor H binds to host cells, the labelling is true negative (TN). A detection of foreign cells, by not binding to their surfaces, is a true positive labelling (TP). In the distinction between self and foreign two inevitable errors occur: (I), foreign cells erroneously recognized as self cells, i.e. false negatives (FN) and (II) self cells erroneously recognized as foreign, i.e. false positives (FP). See table 2 for an overview.
Table 2.
foreign cell | host cell | |
---|---|---|
label a cell as foreign | TP | FP |
label a cell as host | FN | TN |
Let f1 be the number of foreign cells that bind factor H in relation to all foreign cells and f2 be the number of all host cells that do not bind factor H in relation to all host cells. Thus the probabilities for the two errors can be denoted by
2.10 |
and
2.11 |
How do the different alleles Y and H influence these errors? The value of f2, the probability of clearing a host cell, is influenced by the CFH-402Y allele as it binds well to CRP on the cells [24]. This leads to a lower opsonization (and thus a lower rate of phagocytosis) and thus a lower value of f2. On the other hand, CFH-402H is less able to bind [24], resulting in a higher value of f2:
2.12 |
If equation (2.12) described the only effect by the SNP, then minimizing the error with genotype YY would provide the optimum, i.e. the highest pay-off. With equation (2.5) and table 1, this has to be rejected and we hypothesize that there is also an advantage of CFH-402H.
Recalling the ability of pathogens to bind factor H, the low binding affinity of CFH-402H [24] might prevent pathogens from reducing the opsonization too much. This means that f1, the probability of not recognizing a foreign cell, is lower in the presence of CFH-402H than of CFH-402Y:
2.13 |
How is the classification problem of self and foreign cells (i.e. the labelling) triggered by the CFH polymorphism? And, how and to which extent does the heterozygous genotype influence the values of the f1 and f2 errors? Let all surface properties promoting factor H binding be summarized by a single hypothetical parameter collapsing down to one single dimension (figure 2). Low values of this parameter may reflect surface conditions where factor H readily binds to a cell surface. High values instead reflect surface properties with a decreased binding affinity of factor H. Further let the existence of self cells and pathogens satisfy two distinct (e.g. bell-shaped) distributions forming two probability density functions (PDF), for host and pathogenic cells, within this dimension. Note distributions are not necessarily bell-shaped.
A major task of the immune system is to distinguish between those two PDFs. Whereas their relative positions are not of major relevance, their degree of overlap is. Thus a considerable overlap exacerbates the classification problem. The reliability of distinction of both surface types by the immune system decreases. We expect the specific detection threshold of a host to be determined by its genotypic state. The integrated area under the pathogen's PDF on the right of the threshold (light grey area) represents the pathogen's probability of not being detected by the host. It represents the error f1 where molecular mimicry is valuable to the pathogen. Accordingly, the integrated area under the host's PDF on the left of the threshold (dark grey area) represents the probability of self cells not being protected and prone to clearance. This area reflects the error f2.
Every possible detection threshold with f1, f2 > 0 is a Pareto-efficient state. This means that every decrease of f1 increases f2 and vice versa, which represents the trade-off between pathogenic detection and the tenuous control of self cells. Hence, both errors at once are not minimal at any genotype. There are three possible thresholds given by the three possible genotypes. For each genotype the threshold is fixed, whereas the PDFs of host and pathogen cells depend on environmental differences.
For the homozygous genotype YY , we expect the degree of opsonization (and phagocytosis) being generally lower for host cells and also for pathogens binding factor H. This results in a low f2 error, on the one hand, as host cells are well protected, but also in a high f1 error, on the other hand, as pathogens are less well detected (figure 2a). For the homozygous genotype HH, we expect a high degree of opsonization leading to a low f1 error. Here the factor H binding abilities of pathogens are low, thus molecular mimicry is futile. Simultaneously the f2 error is high, as the host cells are less well protected (figure 2c). Our model predicts an intermediate degree of opsonization for the heterozygous genotype, i.e. YH (figure 2b). This implies that the extent of errors (i.e. fYH1 and fYH2), occurring in the heterozygous genotype, also exhibits an intermediate state:
2.14 |
and
2.15 |
Consider a fitness function F depending on the position of the threshold t: F = F(t). Different environmental and internal conditions (e.g. pathogen load and ethnicity, respectively) lead to differently shaped distributions for host and pathogenic cells, resulting in different parameters λ of F = Fλ(t). Independently of λ, according to the normalization (2.6), (2.7) the (normalized) fitness function F should fulfil two equations for the thresholds of the two homozygous genotypes tYY and tHH:
2.16 |
and
2.17 |
As we showed in the previous section that the fitness (pay-off) for the genotype YH is higher than for the homozygous genotypes (inequality (2.5)), F must increase beyond tYY, reach a maximum and decrease until tHH. Of course, Fλ(tYH) = wYH, but it need not be the maximum value of F. In EGT, the expected pay-off is maximized, i.e. the evolutionarily stable Y-allele frequency y* is not the result of maximizing the fitness of the heterozygous genotype, but of maximizing the fitness of the whole population.
3. Discussion
In this manuscript, we investigated a SNP within the human complement factor H, i.e CFH-H402Y. This SNP was identified to represent a typical classification problem prompting two Pareto-efficient errors during the detection between ‘self’ and ‘foreign’ by the innate immune system. We emphasize the dilemma between an efficient defence against pathogenic agents and an attentive handling of self cells. Allele frequencies are found to deviate significantly between ethnic groups (cf. table 1). Here we interpret allele variants as alternative evolutionary strategies and current allele frequencies as evolutionarily stable within the above-mentioned ethnic groups. Sticking to classical EGT, the identified game type is a special case of the battle of the sexes game.
Molecular mimicry describes the ability of pathogens to adapt the structure of their proteins and carbohydrates to the structure of those of the host to evade the immune recognition and attack by the host. This strategy has been observed for bacteria, viruses, fungi, parasites and protozoa. These pathogenic agents achieve mimicry through molecular surfaces that mimic host protein surfaces, which have been obtained by convergent evolution [6]. It has also been theorized that these similar protein folds have been obtained by horizontal gene transfer [58]. This further supports the theory that microbial organisms have evolved a mechanism of concealment similar to that of higher organisms such as the African praying mantis or chameleon that camouflage themselves so that they can mimic their background so as not to be recognized by others [58].
Investigating the host's consequences of molecular mimicry contributes to a deeper understanding of the arms race between host and pathogen. While the pathogen's objective is to be as similar as possible to the cells of its host, the host aims to improve its sensitivity to detect pathogens while not promoting autoimmune reactions. Analogous trends have been shown for parasites, which adapt to the locally common genotypes of its host [43]. Within the overlapping range of cell surface properties (figure 2) the distinction of self cells and foreign surfaces are incompatible aspects. We showed that the two errors during cell detection cannot be minimized simultaneously. For this type of bi-criteria problem methods of Pareto optimization are helpful [59]. This is reminiscent of classical test statistics, where the sensitivity and specificity trade-off is imminent. Typical optimization problems including contradicting aspects are often part of biological questions, i.e. classification problems, inverse problems, sequence and structure alignment, protein structure prediction, system optimization and experimental design [60]. Furthermore, scenarios of optimization problems with multiple (also conflicting) aspects can be found in daily life. To tackle those problems, several multi-objective optimization tools are developed [61].
An alternative approach to solving the problem under study could be based on population genetics, an academic discipline originating in the pioneering work of Ronald Fisher, John B. S. Haldane and Sewall Wright [62]. In population genetics, equations have been derived that are practically equivalent to the Bishop–Cannings Theorem (equation (2.3)) for the case of partnership games [53,54,63]. Relative fitness is usually written as 1 − si, where si denotes the selection coefficient for allele i [54]. Some authors write the equation for the equilibrium frequency of alleles in a form that is very close to the Bishop-Cannings Theorem [53,54, ch. 4.6]. For the relation between game theory and population genetics, see also [52, ch. 22].
Here, we have adopted a game-theoretical approach. Finding the evolutionarily stable strategies is analogous to calculating equilibrium allele frequencies in the presence of selection. As we study frequency-dependent interactions, it is more convenient to use a game theory approach. Although it is, to some extent, a matter of taste, we consider this to be an intuitive and comprehensible way of understanding complex genetic phenomena.
Our model predicts a high susceptibility to infectious diseases for the YY-homozygous genotype due to low detection capabilities of foreign cells (see figure 2). Accordingly, the HH-homozygous genotype should tend to suffer autoimmune diseases, e.g. AMD, with an increased probability. Our model predicts the highest expected pay-off for the YH-heterogeneous genotypes (see equation (2.5)), as its detection threshold tends to minimize both errors. Self-evidently, the location of the detection thresholds underlies a tremendous selection pressure, on the one hand for pathogenic agents and on the other hand, for the potential host. Whereas the purpose of the pathogen is to be undetected by imitating self cell surface properties, i.e. by molecular mimicry, self cells have to fit a well tuned trade-off between pathogenic detection (and clearance) and the tenuous control of self cells.
Exploring the predictive potential of the here presented model admits an outlook on the opposite effect. Whereas a high H-allele frequency should promote a right-shifted detection threshold with high probabilities to distinct surfaces of pathogenic agents simultaneously the risk for erroneous self cell detection is increased (f1 ≫ f2), an essential part of the pathology promoting autoimmune diseases. This is in good accordance with the hypothesis of [34] that populations with a higher prevalence of AMD also have a higher prevalence for the H-allele. This fact has also been claimed by several other authors (see references cited in table 1). Further, so far unidentified genetic parameters that are important in the pathogenesis of AMD are suggested [34]. Factor H like protein (FHL-1) is an alternative product of the factor H gene. It mainly consists of the first seven (of the 20) SCRs, and therefore also contains the first binding site for CRP. The SNP effect is stronger, as the second binding site (SCR 20) is not present [24]. As the function is basically the same as for factor H [1], for this argumentation, it is not necessary to make a distinction between factor H and FHL-1. Thus, the here presented model can also be applied to this situation.
It is believed that specific T cells can cross-react with self-epitopes, thus leading to tissue pathology (autoimmunity) [64]. Nevertheless, Xu et al. [65] state that the Y402H variant in factor H is not associated with exudative AMD in a northern Chinese population, and Kim et al. [66] did not find a statistically significant proof for this relation in Koreans. It is likely that the lack of statistical significance is based on the fact that the Y402H variant is very rare in the South Asian cultural area. Thus, to gain statistical significance for rare events larger sample groups are necessary.
As the frequency y* of the Y-allele significantly varies between ethnic groups (table 1), we conclude that the fitness costs of detection errors, i.e. f1 and f2, may also vary between these groups. Thereby ordering of pay-offs is constant and equation (2.5) always holds true. Thus a low frequency y* also implies a lower expected pay-off a and a left-shifted position of the detection threshold (f1 ≫ f2) with a suboptimal detection of pathogenic agents. Consequently the H-allele frequency could be interpreted as a proxy for severity of the local pathogen pressure on the corresponding ethnic group. Severity here represents the degree of local pathogens to mimic self cell surfaces. According to the allele frequencies given by Grassi et al. [34], Caucasians, Somalis (H-allele frequency 0.347 + −0.03) and African-Americans (0.357 + − 0.04) originate from environments with more aggressive pathogens. Africa in general has a greater disease prevalence than all other regions and a different spectrum of infectious pathogens [67], showing good accordance of the allele frequencies to the prevalence of pathogens. We conclude that the fitness consequences of the f1 and f2 trade-off may vary locally. Note that the pay-off of an individual is determined by its genotypic state and its environment.
The distribution of pathogens has been critical in shaping historical patterns of conquest and settlement, and the role of pathogens in shaping ethnic diversity cannot be overestimated [68]. With their emigration to America, Europeans have introduced many diseases, whereas no new severe diseases awaited Europeans in the New World [69]. By contrast, the Hispanics (0.177 + −0.03) mainly originate from Central and South America, regions with a great richness of pathogens but a very low prevalence [67]. The origin of the Japanese (0.077 + −0.02) has low pathogen richness and prevalence [67].
Focusing on just two alleles is an obvious simplification of the biological problem, as in the whole factor H gene region 30 SNPs have been detected [70]. Here the SNP 8 corresponds to Y402H and lies in the first of three haplotype blocks containing four haplotypes. De Córdoba & De Jorge [3] state that there are a dozen of SNPs in the promotor region or coding sequence of the factor H gene. An allocation of strategies to the haplotypes instead of alternative nucleotides, i.e. Y402H, would be also valid, and the expected results are identical. Several susceptible and also protective haplotypes have been identified ([26,71,72]). Nevertheless, Hageman et al. [26] found, among five haplotypes with a set of the eight most informative SNPs, that the haplotype bearing the major risk for AMD includes the Y402H SNP. Li et al. [72] investigated haplotypes on the basis of five SNPs and also found the Y402H SNP to be the most common risk haplotype. Also Francis et al. [71] found a haplotype with the Y402H SNP which is strongly associated with AMD.
The here presented model aims to contribute to understanding host–pathogen interactions, especially the host's perspective. The concept of molecular mimicry is a useful tool in understanding the aetiology, pathogenesis, treatment and prevention of autoimmune disorders. Mimicry can only be improved at some cost to the mimetic organism and on an evolutionary timescale [42,51]. Therein, evolutionary game theoretical models of mimicry were developed. Quality of mimicry, frequency of models and mimics, costs on the operator due to contact with mimics are considered to explain different mimicry scenarios. Molecular mimicry is, however, only one mechanism by which an autoimmune disease can occur in association with a pathogen [73]. Understanding the mechanism of molecular mimicry may allow future research to be directed towards uncovering the initiating infectious agent as well as recognizing the self-determinant. This way, future research may be able to design strategies for treatment and prevention of autoimmune disorders.
Acknowledgements
The authors thank Kerstin Voigt for stimulating discussions.
Data accessibility
The datasets supporting this article are included in the main paper.
Authors' contributions
S.H. and C.G. designed the study, set up the mathematical model and performed the analysis. S.H., S.G., P.F.Z., C.S. and S.S. interpreted the results, S.G. and S.S. coordinated the study, S.H., C.G. and C.H. wrote the manuscript, S.L. and C.H. aided in interpretation and S.L., S.G., P.F.Z., C.S. and S.S. contributed to writing the manuscript.
Competing interests
We have no competing interests.
Funding
The authors acknowledge support by the Deutsche Forschungsgemeinschaft (DFG) in the CRC/Transregio 124 FungiNet sub-projects B1 (S.S.), C4 (C.S.) and C6 (P.F.Z.) and in the Jena School for Microbial Communication (C.G. and S.L.).
References
- 1.Zipfel PF, Skerka C. 2009. Complement regulators and inhibitory proteins. Nat. Rev. Immunol. 9, 729–740. ( 10.1038/nri2620) [DOI] [PubMed] [Google Scholar]
- 2.Ripoche J, Day AJ, Harris TJR, Sim RB. 1988. The complete amino acid sequence of human complement factor H. Biochem. J. 249, 593–602. ( 10.1042/bj2490593) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.De Córdoba SR, De Jorge EG. 2008. Translational mini-review series on complement factor H: genetics and disease associations of human complement factor H. Clin. Exp. Immunol. 151, 1–13. ( 10.1111/j.1365-2249.2007.03552.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bates HW. 1862. Contributions to an insect fauna of the Amazon Valley. Lepidoptera: Heliconidae. Trans. Linnean Soc. Lond. 23, 495–556. ( 10.1111/j.1096-3642.1860.tb00146.x) [DOI] [Google Scholar]
- 5.Maynard Smith J, Harper D. 2004. Animal signals. Sunderland, MA: Oxford University Press. [Google Scholar]
- 6.Damian RT. 1964. Molecular mimicry: antigen sharing by parasite and host and its consequences. Am. Nat. 98, 129–149. ( 10.1086/282313) [DOI] [Google Scholar]
- 7.Oldstone MBA. 1998. Molecular mimicry and immune-mediated diseases. FASEB J. 12, 1255–1265. ( 10.1096/fj.1530-6860) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Blank M, Barzilai O, Shoenfeld Y. 2007. Molecular mimicry and auto-immunity. Clin. Rev. Allergy Immunol. 32, 111–118. ( 10.1007/BF02686087) [DOI] [PubMed] [Google Scholar]
- 9.Cusick MF, Libbey JE, Fujinami RS. 2012. Molecular mimicry as a mechanism of autoimmune disease. Clin. Rev. Allergy Immunol. 42, 102–111. ( 10.1007/s12016-011-8294-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Poltermann S, Kunert A, Zipfel P. 2007. Complement escape of Candida albicans: identification and characterization of the yeast surface protein CaCRASP-1 which binds factor H,FHL-1 and plasminogen. Mol. Immunol. 44, 229–230. ( 10.1016/j.molimm.2006.07.193) [DOI] [Google Scholar]
- 11.Vogl G. et al. 2008. Immune evasion by acquisition of complement inhibitors: the mould Aspergillus binds both factor H and C4b binding protein. Mol. Immunol. 45, 1485–1493. ( 10.1016/j.molimm.2007.08.011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lublin DM, Atkinson JP. 1989. Decay-accelerating factor: biochemistry, molecular biology, and function. Annu. Rev. Immunol. 7, 35–58. ( 10.1146/annurev.iy.07.040189.000343) [DOI] [PubMed] [Google Scholar]
- 13.Berggård K, Lindahl G, Dahlbäck B, Blom AM. 2001. Bordetella pertussis binds to human C4b-binding protein (C4BP) at a site similar to that used by the natural ligand C4b. Eur. J. Immunol. 31, 2771–2780. ( 10.1002/1521-4141(200109)31:9%3C2771::AID-IMMU2771%3E3.0.CO;2-0) [DOI] [PubMed] [Google Scholar]
- 14.Berggård K, Johnsson E, Morfeldt E, Persson J, Stålhammar-Carlemalm M, Lindahl G. 2001. Binding of human C4BP to the hypervariable region of M protein: a molecular mechanism of phagocytosis resistance in Streptococcus pyogenes. Mol. Microbiol. 42, 539–551. ( 10.1046/j.1365-2958.2001.02664.x) [DOI] [PubMed] [Google Scholar]
- 15.Jenkins HT, Mark L, Ball G, Persson J, Lindahl G, Uhrin D, Blom AM, Barlow PN. 2006. Human C4b-binding protein, structural basis for interaction with streptococcal M protein, a major bacterial virulence factor. J. Biol. Chem. 281, 3690–3697. ( 10.1074/jbc.M511563200) [DOI] [PubMed] [Google Scholar]
- 16.Prasadarao NV, Blom AM, Villoutreix BO, Linsangan LC. 2002. A novel interaction of outer membrane protein A with C4b binding protein mediates serum resistance of Escherichia coli K1. J. Immunol. 169, 6352–6360. ( 10.4049/jimmunol.169.11.6352) [DOI] [PubMed] [Google Scholar]
- 17.Moran AP, Prendergast MM. 2001. Molecular mimicry in Campylobacter jejuni and Helicobacter pylori lipopolysaccharides: contribution of gastrointestinal infections to autoimmunity. J. Autoimmun. 16, 241–256. ( 10.1006/jaut.2000.0490) [DOI] [PubMed] [Google Scholar]
- 18.Skerka C, Józsi M, Zipfel PF, Dragon-Durey M-A, Fremeaux-Bacchi V. 2009. Autoantibodies in haemolytic uraemic syndrome (HUS). Thromb. Haemostasis 101, 227–32. ( 10.1160/TH08-05-0322) [DOI] [PubMed] [Google Scholar]
- 19.Smith RJH. et al. 2007. New approaches to the treatment of dense deposit disease. J. Am. Soc. Nephrol. 18, 2447–2456. ( 10.1681/ASN.2007030356) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Klein RJ. et al. 2005. Complement factor H polymorphism in age-related macular degeneration. Science 308, 385–389. ( 10.1126/science.1109557) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.dbSNP. 2017. Database of single nucleotide polymorphisms, dbSNP accession: rs1061170. Bethesda (MD), National Center for Biotechnology Information, National Library of Medicine. (dbSNP Build ID: 144). See http://www.ncbi.nlm.nih.gov/SNP/.
- 22.Sherry ST, Ward M-H, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K. 2001. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29, 308–311. ( 10.1093/nar/29.1.308) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sjöberg AP, Trouw LA, Blom AM. 2009. Complement activation and inhibition: a delicate balance. Trends Immunol. 30, 83–90. ( 10.1016/j.it.2008.11.003) [DOI] [PubMed] [Google Scholar]
- 24.Laine M. et al. 2007. Y402H polymorphism of complement factor H affects binding affinity to C-reactive protein. J. Immunol. 178, 3831–3836. ( 10.4049/jimmunol.178.6.3831) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Machalińska A, Kawa MP, Marlicz W, Machaliński B. 2012. Complement system activation and endothelial dysfunction in patients with age-related macular degeneration (AMD): possible relationship between AMD and atherosclerosis. Acta Ophthalmol. 90, 695–703. ( 10.1111/j.1755-3768.2011.02295.x) [DOI] [PubMed] [Google Scholar]
- 26.Hageman GS. et al. 2005. A common haplotype in the complement regulatory gene factor H (HF1/CFH) predisposes individuals to age-related macular degeneration. Proc. Natl Acad. Sci. USA 102, 7227–7232. ( 10.1073/pnas.0501536102) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Fritsche LG. et al. 2010. An imbalance of human complement regulatory proteins CFHR1, CFHR3 and factor H influences risk for age-related macular degeneration (amd). Hum. Mol. Genet. 19, 4694–4704. ( 10.1093/hmg/ddq399) [DOI] [PubMed] [Google Scholar]
- 28.Ryu E, Fridley BL, Tosakulwong N, Bailey KR, Edwards AO. 2010. Genome-wide association analyses of genetic, phenotypic, and environmental risks in the age-related eye disease study. Mol. Vis. 16, 2811–2821. [PMC free article] [PubMed] [Google Scholar]
- 29.Kondo N, Bessho H, Honda S, Negi A. 2011. Complement factor H Y402H variant and risk of age-related macular degeneration in Asians: a systematic review and meta-analysis. Ophthalmology 118, 339–344. ( 10.1016/j.ophtha.2010.06.040) [DOI] [PubMed] [Google Scholar]
- 30.Dong L, Qu Y, Jiang H, Dai H, Zhou F, Xu X, Bi H, Pan X, Dang G. 2011. Correlation of complement factor H gene polymorphisms with exudative age-related macular degeneration in a Chinese cohort. Neurosci. Lett. 488, 283–287. ( 10.1016/j.neulet.2010.11.048) [DOI] [PubMed] [Google Scholar]
- 31.Józsi M, Licht C, Strobel S, Zipfel SLH, Richter H, Heinen S, Zipfel PF, Skerka C. 2008. Factor H autoantibodies in atypical hemolytic uremic syndrome correlate with CFHR1/CFHR3 deficiency. Blood 111, 1512–1514. ( 10.1182/blood-2007-09-109876) [DOI] [PubMed] [Google Scholar]
- 32.Hunter DE, Whitten P. 1976. Culture areas of the world. In Encyclopedia of anthropology, pp. 104–111. New York, NY: Harper and Row. [Google Scholar]
- 33.Hageman GS. et al. 2006. Extended haplotypes in the complement factor H (CFH) and CFH-related (CFHR) family of genes protect against age-related macular degeneration: characterization, ethnic distribution and evolutionary implications. Ann. Med. 38, 592–604. ( 10.1080/07853890601097030) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Grassi MA. et al. 2006. Ethnic variation in AMD-associated complement factor H polymorphism p.Tyr402His. Hum. Mutat. 27, 921–925. ( 10.1002/humu.20359) [DOI] [PubMed] [Google Scholar]
- 35.Magnusson KP. et al. 2006. CFH Y402H confers similar risk of soft drusen and both forms of advanced AMD. Plos Med. 3, 109–114. ( 10.1371/journal.pmed.0030005) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Contreras AV. et al. 2014. CFH haplotypes and ARMS2, C2, C3, and CFB alleles show association with susceptibility to age-related macular degeneration in mexicans. Mol. Vis. 20, 105–116. [PMC free article] [PubMed] [Google Scholar]
- 37.Jylhävä J, Eklund C, Pessi T, Raitakari OT, Juonala M, Kähönen M, Viikari JSA, Lehtimäki T, Hurme M. 2009. Genetics of C-reactive protein and complement factor H have an epistatic effect on carotid artery compliance: the cardiovascular risk in young finns study. Clin. Exp. Immunol. 155, 53–58. ( 10.1111/j.1365-2249.2008.03752.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Oshima Y, Ishibashi T, Murata T, Tahara Y, Kiyohara Y, Kubota T. 2001. Prevalence of age related maculopathy in a representative Japanese population: the Hisayama study. Brit. J. Oophthalmol. 85, 1153–1157. ( 10.1136/bjo.85.10.1153) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Huheey JE. 1964. Studies of warning coloration and mimicry. IV. A. mathematical model of model-mimic frequencies. Ecology 45, 185–188. ( 10.2307/1937125) [DOI] [Google Scholar]
- 40.Oaten A, Pearce CEM, Smyth MEB. 1975. Batesian mimicry and signal detection theory. B. Math. Biol. 37, 367–387. ( 10.1016/S0092-8240(75)80037-1) [DOI] [PubMed] [Google Scholar]
- 41.Huheey JE. 1988. Mathematical models of mimicry. Am. Nat. 131, S22–S41. ( 10.1086/284765) [DOI] [Google Scholar]
- 42.Holen H, Johnstone RA. 2004. The evolution of mimicry under constraints. Am. Nat. 164, 598–613. ( 10.1111/j.1365-2249.2008.03752.x) [DOI] [PubMed] [Google Scholar]
- 43.Lively CM, Dybdahl MF. 2000. Parasite adaptation to locally common host genotypes. Nature 405, 679–681. ( 10.1038/35015069) [DOI] [PubMed] [Google Scholar]
- 44.Gandon S, Michalakis Y. 2002. Local adaptation, evolutionary potential and host–parasite coevolution: interactions between migration, mutation, population size and generation time. J. Evol. Biol. 15, 451–462. ( 10.1046/j.1420-9101.2002.00402.x) [DOI] [Google Scholar]
- 45.Keesing F, Holt RD, Ostfeld RS. 2006. Effects of species diversity on disease risk. Ecol. Lett. 9, 485–498. ( 10.1111/j.1461-0248.2006.00885.x) [DOI] [PubMed] [Google Scholar]
- 46.Hummert S, Hummert C, Schröter A, Hube B, Schuster S. 2010. Game theoretical modelling of survival strategies of Candida albicans inside macrophages. J. Theor. Biol. 264, 312–318. ( 10.1016/j.jtbi.2010.01.022) [DOI] [PubMed] [Google Scholar]
- 47.Leach MD, Tyc KM, Brown AJP, Klipp E. 2012. Modelling the regulation of thermal adaptation in Candida albicans, a major fungal pathogen of humans. PLoS ONE 7, 1–14. ( 10.1371/journal.pone.0032467) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Tyc KM, Kühn C, Wilson D, Klipp E. 2014. Assessing the advantage of morphological changes in Candida albicans: a game theoretical study. Front. Microbiol. 5, 1–11. ( 10.3389/fmicb.2014.00041) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pollmächer J, Figge MT. 2014. Agent-based model of human alveoli predicts chemotactic signaling by epithelial cells during early Aspergillus fumigatus infection. PLoS ONE 9, e111630 ( 10.1371/journal.pone.0111630) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Dühring S, Ewald J, Germerodt S, Kaleta C, Dandekar T, Schuster S. 2017. Modelling the host–pathogen interactions of macrophages and Candida albicans using game theory and dynamic optimization. J. R. Soc. Interface 14, 20170095 ( 10.1098/rsif.2017.0095) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Szolnoki A, Perc M. 2014. Costly hide and seek pays: unexpected consequences of deceit in a social dilemma. New J. Phys. 16, 113003 ( 10.1088/1367-2630/16/11/11300) [DOI] [Google Scholar]
- 52.Hofbauer J, Sigmund K. 1998. Evolutionary games and population dynamics. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 53.Traulsen A, Reed FA. 2012. From genes to games: cooperation and cyclic dominance in meiotic drive. J. Theor. Biol. 299, 120–125. ( 10.1016/j.jtbi.2011.04.032) [DOI] [PubMed] [Google Scholar]
- 54.Hedrick PW. 2005. Genetics of populations. Sudbury, MA: Jones and Bartlett. [Google Scholar]
- 55.Bishop DT, Cannings C. 1978. A generalized war of attrition. J. Theor. Biol. 70, 85–124. ( 10.1016/0022-5193(78)90304-1) [DOI] [PubMed] [Google Scholar]
- 56.Hummert S, Bohl K, Basanta D, Deutsch A, Werner S, Theißen G, Schröter A, Schuster S. 2014. Evolutionary game theory: cells as players. Mol. BioSyst. 10, 3044–3065. ( 10.1039/C3MB70602H) [DOI] [PubMed] [Google Scholar]
- 57.Hauert C. 2002. Effects of space in 2 × 2 games. Int. J. Bifurcat. Chaos 12, 1531–1548. ( 10.1142/S0218127402005273) [DOI] [Google Scholar]
- 58.Stebbins CE, Galan JE. 2001. Structural mimicry in bacterial virulence. Nature 412, 701–705. ( 10.1038/35089000) [DOI] [PubMed] [Google Scholar]
- 59.Fudenberg D, Tirole J. 1991. Game theory. Cambridge, MA: Massechusetts Institute of Technology. [Google Scholar]
- 60.Handl J, Kell DB, Knowles J. 2007. Multiobjective optimization in bioinformatics and computational biology. IEEE/ACM Trans. Comput. Biol. Bioinformatics 4, 279–292. ( 10.1109/TCBB.2007.070203) [DOI] [PubMed] [Google Scholar]
- 61.Ehrgott M. 2005. Multicriteria optimization. Berlin, Germany: Springer; ISBN 3-540-21398-8. [Google Scholar]
- 62.Rose MR, Oakley TH. 2007. The new biology: beyond the modern synthesis. Biol. Direct 2, 30 ( 10.1186/1745-6150-2-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Hofbauer J, Sigmund K. 1988. The theory of evolution and dynamical systems: mathematical aspects of selection. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 64.Kohm APJ, Fuller KG, Miller SD. 2003. Mimicking the way to autoimmunity: an evolving theory of sequence and structural homology. TRENDS Microbiol. 11, 101–105. ( 10.1016/S0966-842X(03)00006-4) [DOI] [PubMed] [Google Scholar]
- 65.Xu Y. et al. 2008. Association of CFH, LOC387715, and HTRA1 polymorphisms with exudative age-related macular degeneration in a northern chinese population. Mol. Vis. 14, 1373–1381. [PMC free article] [PubMed] [Google Scholar]
- 66.Kim NR, Kang JH, Kwon OW, Lee SJ, Oh JH, Chin HS. 2008. Association between complement factor H gene polymorphisms and neovascular age-related macular degeneration in koreans. Invest. Ophth. Vis. Sci. 49, 2071–2076. ( 10.1167/iovs.07-1195) [DOI] [PubMed] [Google Scholar]
- 67.Dunn RR, Davies TJ, Harris NC, Gavin MC. 2010. Global drivers of human pathogen richness and prevalence. Proc. R. Soc. B. 277, 2587–2595. ( 10.1098/rspb.2010.0340) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Cashdan E. 2001. Ethnic diversity and its environmental determinants: effects of climate, pathogens, and habitat diversity. Am. Anthropol. 4, 968–991. (doi:10.1525) [Google Scholar]
- 69.Wolfe ND, Dunavan CP, Diamond J. 2007. Origins of major human infectious diseases. Nature 477, 279–283. ( 10.1038/nature05775) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Hughes AE, Orr N, Esfandiary H, Diaz-Torres M, Goodship T, Chakravarthy U. 2006. A common CFH haplotype, with deletion of CFHR1 and CFHR3, is associated with lower risk of age-related macular degeneration. Nat. Genet. 38, 1173–1177. ( 10.1038/ng1890) [DOI] [PubMed] [Google Scholar]
- 71.Francis PJ, Schultz DW, Hamon S, Ott J, Weleber RG, Klein ML. 2007. Haplotypes in the complement factor H (CFH) gene: associations with drusen and advanced age-related macular degeneration. PLoS ONE 2, e1197 ( 10.1371/journal.pone.0001197) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Li M. et al. 2006. CFH haplotypes without the Y402H coding variant show strong association with susceptibility to age-related macular degeneration. Nat. Genet. 38, 1049–1054. ( 10.1038/ng1871) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Ermann J, Fathman CG. 2001. Autoimmune diseases: genes, bugs and failed regulation. Nat. Immunol. 2, 759–761. ( 10.1038/ni0901-759) [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets supporting this article are included in the main paper.