Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Nov 12.
Published in final edited form as: Mol Cell. 2010 Nov 12;40(3):347–348. doi: 10.1016/j.molcel.2010.10.032

Achieving Fidelity in Homologous Recombination Despite Extreme Complexity: Informed Decisions by Molecular Profiling

Robert P Rambo 1, Gareth J Williams 1, John A Tainer 1,2,*
PMCID: PMC3003302  NIHMSID: NIHMS250111  PMID: 21070960

Homologous recombination (HR) is a biologically critical, conserved, genomic maintenance system found in all three domains of life that is essential for genome integrity. An essential step in homologous recombination (HR) is the direct search of the homologous sequence within an extremely high background of heterologous DNA sequences. Although the proteins comprising the HR machine vary among species, HR search effectors include the filament forming RecA/Rad51 protein superfamily (RecA in bacteria, RadA in archaea, Rad51 and DMC1 in eukarya). HR is a very complex process that occurs through formation of the RecA nucleoprotein filament (NPF) on a select single-stranded DNA (ssDNA). NPF formation determines the DNA sequence for recombination. Within the context of the human genome (46 chromosomes at ~6 billion base pairs), finding a homologous DNA sequence becomes a challenging search problem akin to “finding a needle in a haystack,” when the needle is just another piece of hay! Crystal structures of RecA bound to ssDNA revealed that the NPF stabilizes the ssDNA in a distorted conformation (Figure 1A and B), splaying out bound nucleotides as short tri-nucleotide repeats with B-DNA geometry (Chen et al., 2008). This architecture facilitates recognition of homologous regions through Watson and Crick base pairing; however, actual searching requires probing of double-stranded DNA (dsDNA). NPF short tri-nucleotide segments suggest that testing or profiling of homologous DNA segments is incremental (Sagi et al., 2006). Presumably, when sufficient bound segments base pair (Figure 1C) (Kowalczykowski, 2008), HR machinery must decide to switch from search mode to found thereby committing DNA to the next stage, strand exchange. Likewise, if the segment is not homologous, HR machinery must decide to continue searching. Failure to make the proper decision can lead to non-homologous recombination and loss of genetic information.

Figure 1. HR directed search for the homologous sequence, conformational proofreading, cost optimization, and signal detection theory.

Figure 1

(A) Structure of RecA (light blue) bound ssDNA complex. Alternating tri-nucleotide repeats (green or purple) illustrate B-DNA like geometry. (B) Arrows demarcate regions of extended deformation along the DNA backbone between tri-nucleotide units. (C) Base-pairing between homologous DNA strands with deformed backbone geometry. (D) Binary game decision tree. Possible mis-interpretation of coin toss can lead to false signal detection.

Savir and Tlusty (2010) re-conceptualized the homologous recombination framework using signal detection theory. The RecA mediated homologous recombination search problem is essentially the recognition of a signal (complementary DNA sequence) within a naturally high background (heterologous DNA sequences). However, the exceptionally low signal-to-noise ratio creates uncertainty in recognizing the signal, which implies decisions come at a cost. Consider a repeated coin toss game, heads we make a positive decision (stand inside a hot house) or tails we make a negative decision (stand outside in the cold) (Figure 1D). Failure to make the proper decision i.e., standing inside on tails, results in a lost article of clothing. Success would be easy, if we could see the coin, as heads would always imply the positive decision and tails would always imply the negative decision. But in the HR version of this game, we must feel the upside of the coin, such that the implications are weakly defined. With no tactile experience the first time the game is played, our reasoning is insufficient in guiding a choice (Jaynes and Bretthorst, 2003). Yet, with increasing trial decisions, experience helps define the cost of making a decision.

Under signal detection theory, Savir and Tlusty (2010) proposed that recognition of homologous DNA by HR machinery is essentially a cost optimization problem. The positive decision to commit the HR machinery to a recombination event is a cost balance of NPF correctly recognizing a homologous sequence that preserves genetic information versus the heterologous sequence that generates new genetic information. Through the experience of evolutionary selection, HR has been optimally tuned to emphasize homologous DNA while minimizing heterologous DNA. Savir and Tlusty (2010) suggest a mechanism for cost optimization by noting that the ssDNA in the NPF is extended, thus requiring a large-scale geometric rearrangement of the DNA found within the targeted dsDNA sequence. Consequently, the total free energy gain by pairing the NPF tri-nucleotide unit with a test sequence will be the sum of the binding free energy (pairing) and the extension free energy (deformation). This suggests HR machinery not only examines the target sequence at the Watson and Crick base-pairing level but also at the conformational level. Can the target sequence deform similarly to bound NPF sequence? Savir and Tlusty (2010) provide a thermodynamic argument that these events, which can be performed without ATP hydrolysis by utilizing natural dsDNA breathing, provide improved detection through a mechanism called conformational proofreading.

Like physics before the discovery of unifying principles, biology is discovering extreme complexity of interactions governing pathway selection, signaling, regulation, progression, and biological outcomes. In the face of challenges from extreme biological complexity, we must strive to seek unifying, mechanistic principles that yield a simplifying understanding of complex data from experimental and bioinformatical approaches. In this context, application of information, signal detection, or economic theory to biological problems creates novel opportunities for discoveries that unify our understanding of complex processes, such as HR. Savir and Tlusty (2010) have thus provided us with a unifying framework for understanding key aspects of the molecular recognition problem for HR.

How generally does this analysis apply to cell biology? In fact, many DNA pairing and repair events involve dsDNA geometric rearrangement, such that decisions are made by testing total free energy gain by partner pairing compared to deformation free energy. Conformational proofreading likely occurs from related DNA processes such as DNA synthesis (Johnson and Beese, 2004) and base repair (Qi et al., 2009); to comparable systems such as promoter recognition by transcription factors (Pabo and Sauer 1992) and DNA mimicry (Putnam and Tainer, 2005); to the reversible assembly of macromolecular machines, such as bacterial pili (Craig et al., 2006). Indeed, throughout the cell, molecular surfaces are ubiquitous, and proteins and RNA, or more specifically macromolecular machines, must evolve to recognize the correct signal from high background noise or be selected against by impacting cellular functions resulting in toxicity and mutagenesis. Looking ahead, we expect the need to discover unifying principles aiding the understanding and prediction of biological interactions in the face of extremely complex data and low signal to noise is a critical challenge for cell biology in the next decade and beyond.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Savir, Tlusty . This ref Molecular Cell. 2010. [DOI] [PubMed] [Google Scholar]
  2. Chen Z, Yang H, Pavletich NP. Nature. 2008;453:489–484. doi: 10.1038/nature06971. [DOI] [PubMed] [Google Scholar]
  3. Craig L, Volkmann N, Arvai A, Pique M, Yeager M, Egelman E, Tainer J. Molecular Cell. 2006;23:651–662. doi: 10.1016/j.molcel.2006.07.004. [DOI] [PubMed] [Google Scholar]
  4. Jaynes ET, Bretthorst GL. Probability theory: the logic of science. Cambridge, UK; New York, NY: Cambridge University Press; 2003. [Google Scholar]
  5. Johnson SJ, Beese LS. Cell. 2004;116:803–816. doi: 10.1016/s0092-8674(04)00252-1. [DOI] [PubMed] [Google Scholar]
  6. Kowalczykowski SC. Nature. 2008;453:463–466. doi: 10.1038/453463a. [DOI] [PubMed] [Google Scholar]
  7. Pabo CO, Sauer RT. Annu Rev Biochem. 1992;61:1053–1095. doi: 10.1146/annurev.bi.61.070192.005201. [DOI] [PubMed] [Google Scholar]
  8. Putnam CD, Tainer J. DNA Repair. 2005;4:1410–1420. doi: 10.1016/j.dnarep.2005.08.007. [DOI] [PubMed] [Google Scholar]
  9. Qi Y, Spong MC, Nam K, Banerjee A, Jiralerspong S, Karplus M, Verdine G. Nature. 2009;462:762–766. doi: 10.1038/nature08561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Sagi D, Tlusty T, Stavans J. Nucleic Acids Res. 2006;34:5021–5031. doi: 10.1093/nar/gkl586. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES