Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Jun 19.
Published in final edited form as: Bioessays. 2018 Mar 1;40(4):e1700161. doi: 10.1002/bies.201700161

Replicating and Cycling Stores of Information Perpetuate Life

Antony M Jose 1
PMCID: PMC7303024  NIHMSID: NIHMS965205  PMID: 29493806

Abstract

Life is perpetuated through a single-cell bottleneck between generations in many organisms. Here, I highlight that this cell holds information in two distinct stores: in the linear DNA sequence that is replicated during cell divisions, and in the three-dimensional arrangement of molecules that can change during development but is recreated at the start of each generation. These two interdependent stores of information – one replicating with each cell division and the other cycling with a period of one generation – coevolve while perpetuating an organism. Unlike the genome sequence, the arrangement of molecules, including DNA, RNAs, proteins, sugars, lipids, etc., is not well understood. Because this arrangement and the genome sequence are transmitted together from one generation to the next, analysis of both is necessary to understand evolution and origins of inherited diseases. Recent developments suggest that tools are in place to examine how all the information to build an organism is encoded within a single cell, and how this cell code is reproduced in every generation.

Keywords: development, epigenetics, evolution, homeostasis, synthetic biology, systems biology, transgenerational inheritance

1. Introduction

One of the amazing aspects of living things is that they transmit the information for building themselves from one generation to the next. While much of what an organism is made of depends on the linear sequence information in its DNA genome, this sequence is not the only store of information that is transmitted across generations. We can see evidence for the transmission of extra-genomic information in cases where changes that do not alter DNA sequence nevertheless persist for many generations (see Box 1 for a classic example). Such inheritance of extra-genomic changes invites a consideration of the scope and formulation of all the inherited information that specifies the developing organism’s ensemble of traits – not only the information encoded in the sequence of bases in the DNA, but also the non-genetic information encoded in molecular assemblies independent of DNA sequence.

Box 1. Cortical inheritance.

Classic experiments illustrating the inheritance of changes in non-genomic information were performed using the ciliate Paramecium aurelia.[104] These organisms have hair-like cilia in their cell cortex that help them swim and that arise from cortical units arranged in regularly spaced rows (Figure, Box 1). The first hints that changes in these cortical patterns could be inherited came from the analysis of a spontaneous swimming mutant called twisty. Unlike normal cells, twisty cells had a patch of four rows with reversed polarity of cortical units, flanked by altered spacing – a defect that could be observed even after ≈300 generations (Figure, Box 1). This defect was deduced to be the result of a patch of cortex acquired during sexual reproduction by one cell of a conjugant pair from the other cell oriented at 180°. Armed with an explanation for how twisty was generated, various grafts of a piece of one cell on a whole cell were generated by letting similar conjugating pairs separate spontaneously such that a part of one conjugant went with the other or by removing most of one conjugant with a micromanipulator, leaving a residual graft on the other. The resultant cells that had rows of cortical units with reversed polarity could be maintained indefinitely with periodic selection. Even those that reverted to normal cortical structure had transmitted the variant organization to a remarkably large number of progeny cells (109–1012). These observations illustrate the principle that pre-existing structure can inform the newly generated structure in a cell (see Ref. [5] for additional examples).

graphic file with name nihms965205f4.jpg

Cortical architecture of wild-type and twisty cells. The cortical units from which cilia arise can be seen using silver nitrate staining as regularly spaced units with left-right asymmetry (point vs. base of each triangle). Left) schematic of a patch of cortex from wild-type cells. Right) schematic of a patch of cortex from twisty cells showing four rows with reversed polarity flanked by rows with normal polarity (based on Ref. [104]).

The information needed to perpetuate an organism must be present even in the life stage that has the least number of cells. This stage serves as a bottleneck for the transmission of information from one generation to the next and is minimally a single cell. Therefore, all the molecules and their arrangement that is reproduced in every such bottleneck stage of an organism is the minimal information that specifies the perpetuation of that organism. This minimal information includes the genome sequence and a spatial arrangement of the genome along with other pre-existing molecules, including RNAs, proteins, sugars, lipids, etc. Organisms transmit these two stores of information from one generation to the next using different strategies. The linear DNA sequence is transmitted by replicating it at each cell division. This strategy allows for retention throughout development of largely the same sequence in every cell that forms a continuous lineage from one generation to the next. The three-dimensional arrangement, on the other hand, is transmitted by cycling it with a period of one generation. This strategy allows for potentially extensive transformation of the arrangement throughout development before returning to a similar configuration in the next generation. In addition to changes in the arrangement of molecules, components could be replenished or added to in each generation using the DNA template and raw materials from the environment over the course of a life cycle. Thus, life is perpetuated by two stores of information that are interdependent yet distinct.

The need for analyzing both forms of information together is evident if we consider a simple regulatory loop where a transcription factor binds its own promoter and sustains its production in every cell. At the start of each generation, in addition to general cellular machinery, this “developmental program” needs the DNA sequence encoding for the transcription factor, the promoter sequence that the transcription factor binds to, and the transcription factor itself arranged such that the transcription factor can bind to its binding site in the promoter. In other words, developmental programs are specified using DNA sequence and an arrangement of additional pre-existing molecules (see Refs. [16] for related views).

Here, I develop this perspective on inherited information further, discuss its implications, and suggest approaches for the discovery of the stores of information that are transmitted along with the genome sequence across generations to drive development and evolution (see Box 2 for key concepts). While this article has an emphasis on animal biology to give it focus, the concepts presented here are applicable to all cellular life: bacteria, archaea, and eukarya.

Box 2. Key Concepts.

Replicating stores of information

These are molecules with measurable features that are propagated through templated replication. Genomic DNA with its sequence information is the best characterized example of such a store. However, other non-genetic information can also be propagated through templated replication. For example, the inheritance of alternative folding states of proteins like prions illustrates the transmission of structural information through replication, independent of DNA sequence.[51]

Cycling stores of information

These are molecules with measurable features that cycle such that their quantity, quality, and/or localization can change over time but return periodically to a similar configuration. The concentration and overall three-dimensional arrangement of DNA, RNAs, proteins, ions, metabolites, chemical modifications, and potentially much more constitute cycling stores of information. Importantly, the information contained in the collective arrangement of molecules at any given time is a function of the DNA sequence and the collective arrangement at an earlier time. The extent of heritable variation depends on the fidelity with which cycling stores of information are recreated in every generation.

Transgenerational homeostasis

This is the process that opposes inheritance of non-genetic changes and preserves the ancestral configuration. Non-genetic changes that occur in one generation, especially within the germline, can sometimes persist in subsequent generations – a phenomenon referred to as transgenerational epigenetic inheritance.[55] However, these changes are often lost after a few generations,[5965,6777] ensuring preservation of the ancestral configuration (see Ref. [105] for a discussion of homeostasis and evolution).

2. DNA Proposes, Cell Disposes

While the genome is a repository of all sequences that can be transcribed and used for making other components, at any given moment, which RNA is transcribed from DNA depends on what else is within the cell and on the cell’s interactions with the external environment. The information contained in the genome of an organism is thus not sufficient to make that organism. To appreciate this insufficiency, consider a single cell in an organism: (a) the DNA within this cell does not encode all aspects of all molecules in the cell and (b) whether a molecule is made using the DNA depends on other contents in the cell.

2.1. The Linear DNA Sequence is Not Sufficient to Build the Three-Dimensional Cell

The RNA and protein components of a cell ultimately rely on DNA for production through the process of transcription and translation using smaller molecules, nucleotides, and amino acids, as raw materials. These proteins and RNA can go on to catalyze the formation of additional components such as lipids and sugars from other raw materials. However, the three-dimensional structure and spatial arrangement of all the molecules within a cell is not encoded in the linear DNA. For example, where a protein is localized in the cell can change over time based on interactions with other components within a cell.[7] While the DNA encodes the sequence of a protein, the structure it takes depends on its immediate environment.[8] Environmental changes can even induce cells to over-ride defects caused by mutations in their DNA. For example, mutation of the GTPase Rac1 can invert the polarity of epithelia and yet normal polarity can be restored by the addition of exogenous laminin protein.[9] Components in organelles such as mitochondria are not entirely encoded by the nuclear genome and can be considered as endosymbionts with partial autonomy within a eukaryotic cell.[10,11] Finally, the sizes, numbers, and shapes of organelles, and indeed the collective architecture of a cell all depend on dynamic interactions between components within the cell and with its external environment.[12]

2.2. Many Cell Types Use the Same Genome

The presence of different cell types that retain their identity over time and across cell divisions within an organism reveals that the contents of a cell can exert a controlling effect on the DNA. The mechanisms that make and maintain different cell types were initially referred to as epigenetic control systems[13] to signify that they are above (“epi”) genetic control. Pioneers studying such control mechanisms in bacteria imagined multiple modes of regulation for components within a cell in the context of biochemical reactions,[14] many examples of which have been characterized in the past half-century. These regulatory modes provide programmed constraints that explain “how come?” when events happen within a cell.[15] Furthermore, computational exploration suggests that even random networks of elements with high molecular specificity can result in the emergence of different cell states that remain stable over time.[16] Extensions of these ideas have led to the identification of gene regulatory networks in model organisms,[17] where collections of active genes and gene products maintain different cell types while being able to respond to signals during development.

3. Replicating and Cycling Stores of Information Together Drive Development

Since its beginnings as Entwickelungsmechanik more than a century ago,[18] the rich field of developmental biology has been addressing various aspects of how an organism is made. But, the minimal information that is necessary to specify the development of a particular organism in a given external environment is still unknown, and is in fact, relatively unexplored.

3.1. An Integration of Past Events Influences the Future

Consider an organism that progresses from one generation to the next through a single-cell stage, as is the case with humans and most known multicellular organisms. To discover the minimal information that is necessary for the development of this organism, the entire life cycle of the organism needs to be examined (Figure 1A). For chicken, this would mean examining from egg-to-egg and not from egg-to-hen (developmental biology) or from hen-to-egg (reproductive biology). The molecules that could store information within a cell are expected to have accrued over time based upon the action of pre-existing molecules on DNA in a changing extracellular environment (Figure 1B, top). This process of iterative accumulation of material and progressive change in the state of a cell implies that the DNA and everything else accrued in the past that is now within the cell together predicts the next state of a cell in a given environment. Therefore, the single cell that starts each generation must have a similar three-dimensional arrangement of molecules to ensure the development of similar organisms in successive generations (Figure 1B, bottom). This minimal information that encodes the making of an organism and is contained within a single cell can be considered as the cell code for the organism (see Box 3).

Figure 1.

Figure 1

Perpetuation of life. A) Understanding what perpetuates life requires examination of entire life cycles in addition to the study of how organisms are made (developmental biology) and how they reproduce (reproductive biology). B) Accrual of molecules in living systems. Top, Contents of a cell accrue through sequential production of different gene products and the molecules they modify (colored shapes matching color in DNA) while the environment of the cell (rounded bar) and previously made components continue to change over time. Bottom, Life is perpetuated through the recreation of a similar arrangement of cellular contents at the start of each generation.

Box 3. Cell code.

Many multicellular organisms cycle through a single-cell bottleneck – the fertilized zygote in sexually reproducing organisms – that carries all the information necessary to make the next generation.[19] In animals that lay eggs, all the information necessary for development is contained within the egg. In animals that give birth to live young, the fertilized zygote grows into a fetus that shares circulation with the mother. In this case, the growing fetus could potentially acquire additional information from the mother after the single-cell bottleneck. Organisms that undergo parthenogenesis[106] also cycle through a single cell, an egg that does not need fertilization by sperm. In contrast, parasites go through elaborate cycles within different cells and even different host organisms,[107] and regenerating organisms can begin each generation from many different collections of cells.[108] Nevertheless, in every scenario, the simplest life stage of an organism minimally consists of a single cell that encodes all the information required for making the organism in the next generation.

To clarify the relationship between the contents of a single cell and its genome sequence, let us define all molecules (including the DNA genome) and their spatial arrangement in a single cell as “C” and environmental inputs as “E” (Figure Box 3A). These external inputs include factors that influence the cell (e.g., temperature), interact with the cell (e.g., extracellular signals), or are imported into the cell (e.g., nutrients). At an arbitrary initial time, t = 0, C0 indicates the state of molecules present in the cell, E0 indicates the external inputs, and ● indicates their interaction. At a later time, t = 1, C0 could have changed (C0→1) and can now interact with, modify, and/or be modified by the new C1 made at t = 1 (C0→1●C1). Similar changes could also occur in the imported or interacting external material (E0→1●E1). Furthermore, both the accrued cellular components (C0→1●C1) and the changing external input (E0→1●E1) can interact and/or modify each other (C0→1●C1●E0→1●E1). Thus, the changing contents and their arrangement in a cell are not predictable from the sequence of its genome alone. This influence of the past on the current state of the cellular contents and their arrangement implies that a cell at time t = n includes information stored in its genome sequence (DNAseq) and in the arrangement of contents accrued since the origin of cellular life (Cpast→n●Epast→n). If such a single cell is a zygote that begins each generation, then everything in this cell (DNAseq + Cpast→n●Epast→n) that is nearly reproduced in the zygote of successive generations is the upper limit of the cell code for the organism (Figure Box 3B). This recreation of similar cell codes in successive generations perpetuates life and is a hallmark of organisms that constrains their evolution.

Transmission of the cell code from the zygote of one generation to the zygote of the next generation through a developing organism is akin to transmission of coded signals from a sender to a receiver through a communication channel (Figure Box 3C), which has been given mathematical form in information theory.[109] In this comparison, the pathways through which individual aspects of the cell code (DNA, RNAs, proteins, spatial arrangements, etc.) are transmitted from one generation to the next are analogous to communication channels, whereas, systematic and stochastic external influences are analogous to noise. However, application of information theory to study cell codes additionally requires accounting for the following characteristics:

  1. Interdependence of channels. The channels through which the cell code is transmitted from one generation to the next are related to each other through complex functions. For example, consider transmission of the information specifying the three-dimensional structure of a protein. The primary sequence of this protein depends on how the transcribed RNA is spliced into mRNA. The secondary and tertiary structures of the protein depend on the composition and abundance of molecules in its immediate vicinity (ions, sugars, etc.).

  2. Dynamics of information in transit. Cycling stores of information also include unstable molecules and potentially ephemeral interactions that need to be transmitted from one generation to the next. Therefore, the encoded information must exist either in multiple interconvertible configurations or be transferrable to other molecules during development.

Systematic and stochastic changes can occur during the transmission of a cell code from one generation to the next. The nematode C. elegans provides a well-characterized example of such “noise” during transmission (Figure Box 3D). The DNA sequence is subject to stochastic exchange between homologs during the two meioses that generate gametes (sperm and oocyte). The cytoplasmic contents undergo dramatic loss because of cannibalism by endodermal cells during embryonic development, and dramatic gain because of material from the distal germline and the intestine during oocyte maturation.

In summary, further development of information theory that incorporates the above considerations for application to biological systems coupled with experimental manipulation in well-characterized systems like C. elegans could enable rigorous analyses of cell codes (see Ref. [110] for additional perspectives on information in biology).

graphic file with name nihms965205f5.jpg

Cell code the information for making an organism that is stored in single cells and transmitted from one generation to the next by interdependent replicating and cycling molecules. A) A cell accrues components guided by the action of preexisting components on DNA sequence (C0, C1, C2, …). New components made at each moment (t) can interact with, modify, and/ or be modified by (●) preexisting components, which could have changed since they were made (e.g., C0→2 and C1→2 at t = 2). This process extends to external inputs the cell interacts with or acquires independent of its DNA (E0, E1, E2, …). At a given time (t = n), the cellular content is equal to the accrued cellular contents (Cpast→n) that have changed through interactions with the accrued external input (Epast→n). B) DNA sequence (DNAseq) and a specific arrangement of components accrued over time (Cpast→n●Epast→n) that are recreated in single cells that begin successive generations of an organism (Gen x, Gen x + 1) define the cell code of that organism. C) Transmission of the cell code may be fruitfully analyzable as information transfer through interdependent channels for the transmission of cellular components (DNA, proteins, RNAs, spatial arrangements, etc.). Changes that occur during transmission could be thought of as being akin to noise in a communication channel and could impact the transmission of DNA sequence information (e.g., mutations and meiosis) or other channels (e.g., interaction with somatic cells and the environment). D) The well-characterized C. elegans germline, along which its cell code is transmitted from one generation to the next, invites examination of systematic and stochastic changes that occur during such transmission (see text for details). Barring the DNA sequence, all other aspects of the cell code (including replicating and cycling stores of information) are transmitted via unknown mechanisms.

Early attempts at unifying biology focused on understanding the cell in development and inheritance.[19] More than a century ago, some experimental embryologists clearly appreciated the iterative progression (e.g., Ref. [20]) and influence of history (e.g., Ref. [21]) when describing the nature of organisms. The interdependence of genes and non-genetic factors within a cell was also well appreciated (e.g., Refs. [22,23]). In fact, the term “epigenetics” was initially defined as “… causal mechanisms by which the genes of the genotype bring about phenotypic effects,”[24] but later stated as “the causal interactions between genes and their products which bring the phenotype into being,”[25] making it clear that the genotype (DNA) alone does not bring the phenotype into being. These insights from embryology were also recognized by early proponents of the Modern synthesis that combined Mendelian genetics with Darwinian evolution,[26] and when ignored, can lead to the mistaken popular view that DNA is the blueprint of life.

In the context of an organism, the contents of a cell can change dramatically over time independent of changes driven by the DNA within the cell. For example, consider progression from one generation to the next in the worm Caenorhabditis elegans[27] (see Box 3). From the single-celled zygote, two primordial germ cells are established five cell divisions later. These two cells then go through an extended period of quiescence while the rest of the organism develops. During this period intestinal cells cannibalize a large portion of the cytoplasmic contents of both cells.[28] Then, the cells proliferate to generate germ cells that subsequently differentiate, first producing sperm, and then oocytes. Maturing oocytes acquire cytoplasmic contents from the rest of the germline[29] and yolk from the intestine,[30] potentially along with extracellular RNAs.[3133] These acquisitions make the oocyte larger than all the 558 cells of a hatching larva combined. Fertilization of this enlarged oocyte creates the zygote for the next generation. Both dramatic changes in this cycle – loss of cytoplasmic material from primordial germ cells and gain of cytoplasmic material by oocytes – occur during periods of relative transcriptional quiescence.

3.2. What is the Minimal Information Necessary to Make and Perpetuate an Organism?

The upper limit for the cell code is the entire contents and their arrangement in the zygote that begins each generation. However, the cell code need not equal everything in a zygote. If only a subset of the components and arrangement within a zygote are reproduced in every generation (Figure 2), then the cell code can be less than the contents of a zygote. Furthermore, because many molecular assemblies in life are capable of self-organization (e.g., mitotic spindle[34]) and templated processes (e.g., transcription[35]), it is possible that essentially the same cell code could be specified using a subset of the molecules and arrangement that are recreated in every zygote. This possibility is supported by experiments on single-cell regeneration in the ciliate Stentor polymorphus – 1/27th of its initial volume can regenerate all structural features.[36]

Figure 2.

Figure 2

Nature of the cell code. Part of the cell code from the perspective of a single gene (thick black line), its RNA (curved lines), and its protein (square) is depicted. Complexes of other proteins/RNAs/lipids, etc. (circles) potentially localized in specific places within the cell and interactions between them (gray lines) that can impact the gene and that are reproduced in every generation are part of the cell code. Some components (stars) in the zygote may not be inherited in successive generations and compensatory changes in the rest of the zygote may permit different configurations for this gene while preserving the overall cell code of the organism.

Determining the minimal representation of the cell code of an organism requires minimization of features in a zygote akin to current efforts to minimize the genomes of cells.[37,38] For example, imagine a gene that has two copies of partially transcribed RNA, 32 copies of the spliced RNA, 17 copies of translated protein, 12 copies of the protein with correct subcellular localization, etc., that are all reproduced in the zygote of every generation, i.e., that are all part of the cell code. Which of these details are necessary for the perpetuation of a similar arrangement? What other arrangements of these components are essentially equivalent? What changes to the rest of the zygote can impact this gene and the arrangement of its products while perpetuating life?

A problem that is separate from determining the cell code of an organism is determining how it is reproduced in every generation. Reproduction of the genetic component of the cell code, that is, DNA sequence, is relatively well-understood and is through the process of templated replication at each cell division. Reproduction of the non-genetic components of the cell code, that is, the spatial arrangement of molecules including DNA, is likely to be more complex. One possibility is that the essential arrangements are preserved through cell divisions despite differentiation and development of the organism such that a continuous line of cells from the zygote of one generation to the next carries the cell code for the organism. Alternatively, and more likely, these arrangements vary during differentiation and development but go through an elaborate cycle such that they return to a similar configuration in the zygote of each generation. In the case of unicellular organisms, the cell code needs to be reproduced at a certain time in each cell division cycle. In organisms such as plants where many cells are capable of generating the entire organism, every such cell needs to be able to reproduce the cell code when exposed to conditions that stimulate differentiation and development. Currently, we have a fragmentary understanding of changes in a few potential components of the cell code during development and are far from understanding the mechanism(s) by which the cell code is reproduced in any organism. Nevertheless, a range of components and arrangements are likely to be nearly equivalent cell codes for an organism. We already know that the genome sequences of individuals within a species can vary a little and yet result in the generation of similar organisms. Additionally, discovering alterations to non-genetic aspects of the cell code that are compatible with the perpetuation of life will reveal the full extent of novelty that can arise from one generation to the next in an organism.

4. Persistent Ancestral Information Modifies the Cell Code

A key concern in thinking about how organisms evolve is what changes in one generation can be passed on to subsequent generations. The nature of organisms presented here (Figure 1 and 2) makes it clear that the persistence of changes across many generations requires modification of the cell code and thus evolution occurs through “descent with modification”[39] of the cell code. Note that this does not include information (induction by signals, uptake of nucleic acids, etc.) accrued from additional sources such as the microbiome or maternal circulation during development after the single-cell bottleneck that could also vary across generations.

Modifications to the cell code can alter its genomic sequence (DNAseq, see Box 3) or three-dimensional arrangement of all components (Cpast→n●Epast→n, see Box 3) within the cell. Changes to the DNA sequence can persist for many generations and are indeed the best-studied changes to the cell code. These can include both random mutations and changes derived from experience (e.g., CRISPR-Cas system of antiviral immunity in bacteria[40]). But, changes to the cell code that do not alter DNA sequence also have the potential to persist for many generations. In fact, one of the earliest mutants ever described – in 1744 – was found to not alter the DNA sequence. This variant of the toadflax Linaria vulgaris called peloria (greek πελωρ, monster)[41,42] arises at ≈1%frequency in each generation[43] and is correlated with methylation but not sequence changes at the Lcyc locus.[44]Such inheritance of changes that do not alter DNA sequence and yet affect how the DNA sequence is used in any generation can be viewed as the influence of organismal history. The controlling influence of past cellular componentson the production of future components from DNA is powerfully illustrated in ciliates that develop using RNA from one generation to splice or unscramble germline DNA sequences in the next generation. In these ciliates, such as Tetrahymena and Oxytricha, changes in RNA introduced in one generation can alter how the DNA is spliced or unscrambled in many subsequent generations.[45,46] In other organisms, extensively studied phenomena like paramutation (see Ref. [47] and reviewed in Ref. [48]) and RNA interference[49] that cause persistent silencing of genes also illustrate the persistence of non-genetic information. While the precise mechanisms for how the gene silencing information is transmitted across generations are still being worked out, persistent chromatin modifications and amplified RNAs have emerged as possibilities (reviewed in Ref. [50]). Transmission for many generations is likely to require inheritance of molecules containing the information for silencing a gene for one or a few generations combined with periodic replication of molecules to reinforce this information. For example, an ancestral event such as exposure to double-stranded RNA during RNA interference could trigger production of chromatin modifications in each generation based on instructions held in RNA. All that is needed to make this information stable for many generations is transmission of chromatin and/or RNA, even for just one generation, followed by reproduction in every generation. Crucially, the resulting permanent changes to the cell code do not involve changes in the DNA sequence. Other mechanisms for changing the cell code without mutating DNA include the transmission of prions,[51] where alternative folding states of proteins that can template recreation of similar states are transmitted across generations.

When complex organisms, like humans, that can make tools and artifacts by interacting with both the environment and other organisms are considered, there is no end to the longevity of ancestral information and its influence on physiology. For example, the spread of ideas within a culture (i.e., a meme[52]) can have a profound influence on human behavior and can drive physiological changes as evidenced in dietary practices. Thus, the evolution of such organisms is shaped by much more than the molecules present in their cells.[53] In fact, contributions by many mechanisms that transmit information across generations can be incorporated into a consideration of how organisms evolve[54] and discerning the relative importance of these mechanisms requires carefully controlled experiments.

Even if only the transmission of biological material is considered, the precise way in which information is transmitted across generations is unclear. While DNA sequence (DNAseq) in a zygote is from the immediate parents, the provenance of the information that specifies the arrangement of all molecules (Cpast→n●Epast→n) in a zygote is less clear. If life originated once on earth as suggested by the commonalities among extant organisms, then every zygote can be traced back to the first cell by descent. This initial cell must have accrued complexities that were transmitted from one generation to the next. Therefore, although the molecules present in the fertilized zygote are inherited from the gametes of the immediate parents, it is conceivable that their amounts and/or arrangement changed in an ancestral generation, and this change has since been propagated without modification. Nevertheless, there have only been a few cases of clear evidence for such non-genetic changes that persist for many generations (see Ref. [55] for review of early work). This paucity could reflect limited experimentation, incompatibility of the changes with the perpetuation of life, or mechanisms that oppose such persistence.

5. Forces That Oppose Variation Preserve the Cell Code

The development of an organism in a given environment follows a path that reflects the programmed constraints imposed by epigenetic control systems within cells and cell-cell interactions in that organism. This constrained path was referred to as “chreod” (“necessary path” from greek roots χρη, it is necessary, and όδοζ, path[56]) and has been given modern form within the framework of dynamical systems theory.[57] Such developmental constraints have been recognized as one of the forces that oppose change in organisms during evolution.[58]

Some studies show correlation of certain molecular changes in an animal with environmental or dietary changes in parents or recent ancestors (e.g., Ref. [5963]) and molecular changes such as histone methylation that occur within germ cells can cause effects that persist for a few generations (e.g., Ref. [64,65]). These instances of heritable changes to non-genetic aspects of the cell code urge consideration of the possible ancestral origins of disease and the impact of medical intervention on descendants. However, loss or erasure of such chemical changes after a few generations reflects a homeostatic return to a prior cell code (Figure 3). For example, during early mammalian development parental DNA methylation is erased at most sites and new modifications are added.[66] Because this happens in every generation, the information for adding these new modifications must be passed from one generation to the next as an aspect (molecule and/or arrangement) of the cell code. This developmental reprogramming thus preserves the cell code and opposes transgenerational epigenetic changes. Nevertheless, cases of persistent non-genetic changes – some lasting for tens to hundreds of generations (e.g., Ref. [6778]) – provide us with opportunities to analyze how non-genetic aspects of the cell code can be reconfigured.

Figure 3.

Figure 3

The balance between transgenerational homeostasis and transgenerational epigenetic inheritance determines heritable non-genetic variation. A) Schematic illustrating impact on cellular components of high (gene on) versus low (gene off) expression of a gene encoding an enzyme. B) Non-genetic changes to a cell code (e.g., down regulation of gene expression) that occur during one generation (x, say) can persist through transgenerational epigenetic inheritance for many generations (until x + n, say) and then return to ancestral states because of transgenerational homeostasis. Cases of indefinite persistence of change (i.e., large n) could reflect establishment of a new set point for transgenerational homeostasis. Non-genetic changes that persist for a few or for hundreds of generations have been detected in some organisms,[6778] but the reasons for susceptibility to change, persistence of change, or homeostatic return to ancestral states after a period of change are unknown.

Together, these considerations inform attempts to identify past events that have shaped the evolution of organisms and future attempts to synthesize organisms.

6. To Make Life We Need to Know the Cell Code

Considering what we would need to know to make an organism from raw materials reveals the extent of our ignorance of the cell code. In recent years, we have begun to remove, replace, augment, or modify aspects of the cell code. In particular, techniques for manipulating DNA sequence have advanced to the point that practically any change in DNA sequence can be made using genome engineering (reviewed in Ref. [79]) and even the entire genome of a simple organism can be replaced with a synthetic genome.[80]We can effectively transfer genetic material to avoid mutant mitochondria,[81] augment the genetic code,[82] and even use gene drives to change wild populations.[83] All these efforts are akin to modifying a pre-existing machine by tinkering with or replacing parts without fully understanding how the machine is put together. As with complex machinery, such manipulations without deep understanding can be dangerous. Additionally, all manipulations of life warrant careful ethical considerations (e.g., Ref. [84]). To make a machine from raw materials, however, we need to know the entire design. Because organisms build themselves, assembling the cell code of an organism could be sufficient to make an organism. Although an assembly of chemicals that display characteristics of life has been imagined in theory (e.g., The Chemoton Theory[4]), the enormity of this challenge in practice is clear in our struggle to use raw materials to make a rudimentary cell that perpetuates itself.[85,86] Nevertheless, we have begun to coax pre-existing cells into making complex parts of organisms in vitro – for example, eye[87] and brain.[88] Eventually making an entire complex organism that reproduces will require first discovering the cell code of that organism and then determining the simplest versions of that code. The “endless forms most beautiful and most wonderful”[39] found in nature each have a different cell code. Such organisms sculpted by evolution, however, are unlikely to already have the minimal essential cell code because evolution tinkers with what exists[89] but anticipates poorly. Our current understanding suggests that evolution proceeds through measure and counter-measure, including nonadaptive processes (e.g., Ref. [90,91]), resulting in a cellular architecture with superfluous aspects.[92] Thus, organisms found in nature have layers of post hoc regulation that can obscure essential design principles. Understanding such principles can enable the design of organisms free of the historically contingent tinkering of evolution.

7. Approaches to Decipher the Cell Code

Discovering the cell code of an organism minimally requires comparison of features in zygotes of successive generations (Figure 2). Our increasing ability to perform molecular studies on single cells[93] makes this approach reasonable. Parental effect mutants can be used to identify potential components of the cell code that are maternally or paternally deposited into the zygote. Such mutants may also identify signaling from one generation to the next. However, the arrangement of these molecules in the zygote is more difficult to discover and would require systematic cell biological and biochemical analyses. After these components are identified and their arrangements are discovered, the zygotes of successive generations can be compared.

7.1. Perturbation Studies

Penetrating insights often require perturbation of the system and not mere observation. Past perturbation studies of development and reproduction have largely focused on the analysis of essential genes that impact viability or fertility, respectively. Consequently, the effects of a perturbation are typically only analyzed during a limited period of the life cycle. Furthermore, defects in an essential gene (required for development, say) would kill the organism precluding examination of the same stage in successive generations. Thus, when we interfere with an essential gene or process in one generation, we lose the ability to see its impact on the next generation because the intervening organism is affected. In other words, we cannot know if an essential gene or process is required to make the organism or to reproduce the cell code or both. Studies using viable and fertile mutants on the other hand permit examination of the zygote in successive generations. But, careful subsequent analysis will be needed to separate defects in the mere making of the organism from perturbations of the cell code. For example, a mutation in DNA that changes a residue in hemoglobin alters genome sequence in the cell code and affects the structure of the protein made in red blood cells, but likely does not affect nongenetic aspects of the cell code or its propagation via the germline. However, a mutation in DNA that changes a protein that is reproducibly present in the zygote or the germline changes not only the genome sequence in the cell code but potentially also non-genetic aspects of the cell code and/or its propagation. Finally, a non-genetic change (e.g., chromatin modification or addition of double-stranded RNA) that causes gene silencing that persists for many generations alters non-genetic aspects of the cell code and potentially its propagation without changing the genome sequence.

Nuclear transplantation experiments were essential for the realization that most cells in an organism retain the same genome.[94,95] A similar approach could be used to discover all the cells that retain the ability to use the DNA genome to generate the entire organism, that is, to discover all the cells that have the non-genetic aspects (molecules and arrangement) of the cell code. Every cell that can be induced to generate the entire organism is expected to either have, or be able to reconfigure its contents to create, the entire cell code. For sexually reproducing organisms, minimally every cell that is continuous within the female germline from one zygote to the next is expected to have all the non-genetic aspects of the cell code except those obtained from the male gamete upon fertilization. Conversely, every cell that is continuous within the male germline from one zygote to the next is expected to have all the non-genetic aspects of the cell code except those obtained from the female gamete upon fertilization. It will be interesting to discover the somatic cells that retain non-genetic aspects of the cell code despite differentiation during normal development and to discover how the cell code is reduced from being present in its entirety in the zygote to the portion in each gamete awaiting union upon fertilization.

The ability to make random changes in DNA and examine its consequences – forward genetics – and more recently to turn off any gene through RNA interference – reverse genetics – were crucial for correlating changes in the genome with changes in the organism. To correlate changes in non-genetic components of the cell code with changes in the organism, we additionally need forward epigenetics and reverse epigenetics. While we do not yet have a way to perform forward epigenetics, reverse epigenetics has begun.[96] For example, a guide RNA and the Cas9 enzyme fused to a histone modifier can be used to target chromatin modifications to histones located in a specific region of the genome (e.g., Ref. [97]). Such a manipulation could be performed in one generation and its impact analyzed in subsequent generations to determine whether the manipulation altered non-genetic aspects of the cell code.

Although the total information that is transmitted from one generation to the next is high-dimensional, to make rapid progress in understanding the principles of the cell code, we are likely to also benefit from focusing on one or a few genes. Analyzing a few genes, their protein and/or RNA products, and the factors that impact their recreation in successive generations will allow us to ask specific questions and generate a preliminary understanding of the cell code. This focused approach could be an effective complement to comprehensive approaches that compare many components of zygotes in successive generations in response to experimental perturbation. In support of this traditional approach, the basic principles of gene regulation were worked out through sustained effort on a few genes – for example, the lac operon.[98]

7.2. Tracer Studies

An alternative to perturbation studies that overcomes the impasse of lethality or sterility when the cell code is disturbed is the use of tracer studies. To illustrate this approach, imagine you wanted to discover the extent of the circulatory system of an organism. A perturbation study could involve making cuts throughout its body and examining if blood spurts out. A tracer study could involve injecting an inert molecule that permeates the entire circulatory system and then imaging that molecule. Therefore, taking the tracer approach to examine the cell code of an organism, we could insert benign sequences (e.g., encoding a fluorescent protein) into its genome, add or remove various regulatory features, and examine if these changes have consequences that persist across generations. In support of the power of this approach, the deciphering of the genetic code began with a “tracer” study where one RNA sequence (poly-U) was used to synthesize one peptide sequence (poly-Phe) in an in vitro translation system.[99]

7.3. Different Organisms Warrant Different Considerations

In organisms that have well-differentiated somatic and germ cells,[100] it will be interesting to discover the extent of somatic influence on the perpetuation of the cell code via the germline. A germ cell from one animal could be transplanted into another[101] to evaluate the extent of somatic influence on the cell code. Similarly, the circulatory system of two animals could be joined together (reviewed in Ref. [102]) to examine the effect of secreted material from the cells of one animal on the germline of another. Evidence for such influence of secreted material is provided by intriguing observations like the entry of double-stranded RNA from neurons into the germline in the worm C. elegans, resulting in gene silencing that persists for more than 25 generations.[73] Analyzing somatic influences in different environments could reveal mechanisms, if any, by which the environment could alter the cell code in different organisms.

Simpler organisms likely have simpler cell codes. Single-celled organisms like the marine green alga Ostreococcus tauri[103] or bacteria could be chosen as model systems to discover the simplest cell codes. Simpler still are organisms that result from efforts to generate cells that have a minimal genome.[37,38] While the analysis of these organisms would reveal the logic of cell codes for single-celled organisms, it is conceivable that the need for differentiation in complex animals and plants results in fundamentally different strategies for the assembly and reproduction of their cell codes.

8. Conclusion

The sequence of a genome can be used to identify an organism. However, the genome sequence is not sufficient information for making that organism. To make an organism we need to know its cell code, which is the evolving arrangement of molecules, including the genome, that is nearly reproduced in every generation. Building on more than a century of work in biology, we can now begin to decipher the cell code of an organism by analyzing single cells that start successive generations.

Acknowledgments

The ideas presented here were honed through long conversations with my friend Katerina Ragkousi, through lectures to students of a course on Ignorance run by José Feijo, and through a faculty chalk talk to colleagues at the University of Maryland. I thank Steve Wolniak for initially pointing me to relevant books on cell biology; Karen Carleton, David Mosser, Zhongchi Liu, Caren Chang, Norma Andrews, José Feijo, Wenxia Song, Carlos Machado, Tom Kocher, Charles Delwiche, William Snell, Steve Wolniak, Sougata Roy, Geraldine Seydoux, Kevin O’Connell, Andy Golden, John Hanover, Victor Ambros, Marie-Anne Felix, Pierre-Emmanuel Jabin, and anonymous reviewers for comments on the manuscript; all past and present members of the Jose lab for lively discussions; and my friend John Horner for early support when these ideas were in their infancy. I apologize to any colleagues whose work could not be highlighted in this short article. Research in the author’s lab is supported through funds from the University of Maryland and from the NIH (R01GM111457).

Footnotes

Conflict of Interest

The author has declared no conflict of interest.

References

  • 1.Commoner B. Nature. 1964;202:960. doi: 10.1038/202960a0. [DOI] [PubMed] [Google Scholar]
  • 2.Dyson FJ. J Mol Evol. 1982;18:344. doi: 10.1007/BF01733901. [DOI] [PubMed] [Google Scholar]
  • 3.Maynard Smith J. J Theor Biol. 1990;143:41. doi: 10.1016/s0022-5193(05)80287-5. [DOI] [PubMed] [Google Scholar]
  • 4.Ganti T. The principles of life. Oxford University Press; Oxford: 2003. p. 4. (originally published in 1971) [Google Scholar]
  • 5.Harold FM. Microbiol Mol Biol Rev. 2005;69:544. doi: 10.1128/MMBR.69.4.544-564.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ptashne M. Proc Natl Acad Sci USA. 2013;110:7101. doi: 10.1073/pnas.1305399110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Thanbichler M, Shapiro L. Nat Rev Microbiol. 2008;6:28. doi: 10.1038/nrmicro1795. [DOI] [PubMed] [Google Scholar]
  • 8.Englander SW, Mayne L. Proc Natl Acad Sci USA. 2014;111:15873. doi: 10.1073/pnas.1411798111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.O’Brien LE, Jou TS, Pollack AL, Zhang Q, Hansen SH, Yurchenco P, Mostov KE. Nat Cell Biol. 2001;3:831. doi: 10.1038/ncb0901-831. [DOI] [PubMed] [Google Scholar]
  • 10.Mereschkowsky C. Biol Centralbl. 1905;25:593. [Google Scholar]
  • 11.Sagan L. J Theoret Biol. 1967;14:255. doi: 10.1016/0022-5193(67)90079-3. [DOI] [PubMed] [Google Scholar]
  • 12.Rafelski SM, Marshall WF. Nat Rev Mol Cell Biol. 2008;9:593. doi: 10.1038/nrm2460. [DOI] [PubMed] [Google Scholar]
  • 13.Nanney DL. Proc Natl Acad Sci USA. 1958;44:712. doi: 10.1073/pnas.44.7.712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Monod J, Jacob F. Cold Spring Harbor Symp Quant Biol. 1961;26:389. doi: 10.1101/sqb.1961.026.01.048. [DOI] [PubMed] [Google Scholar]
  • 15.Mayr E. Science. 1961;134:1501. doi: 10.1126/science.134.3489.1501. [DOI] [PubMed] [Google Scholar]
  • 16.Kauffman S. Nature. 1969;224:177. doi: 10.1038/224177a0. [DOI] [PubMed] [Google Scholar]
  • 17.Levine M, Davidson EH. Proc Natl Acad Sci USA. 2005;102:4936. doi: 10.1073/pnas.0408031102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Sander K. Roux’s Arch Dev Biol. 1991;200:1. doi: 10.1007/BF02457634. [DOI] [PubMed] [Google Scholar]
  • 19.Wilson EB. The Cell in Development and Inheritance. The Macmillian Company; New York: 1896. [Google Scholar]
  • 20.Boveri T. Neu Folge. 1902;35:67. Translated by Gluecksohn-Waelsch S, (B. H. Willer and J. Oppenheimer, eds). Prentiss-Hall, Inc., Englewood Cliffs, NJ, 1964. [Google Scholar]
  • 21.Boveri T. Die Organismen als Historche Wesen (Organisms as historical beings) Kgl. Universitats Druckerei von H. Sturtz; Wurzburg: 1906. [Google Scholar]
  • 22.Conklin EG. Science. 1908;27:89. doi: 10.1126/science.27.681.89. [DOI] [PubMed] [Google Scholar]
  • 23.Waddington CH. New Patterns in Genetics and Development. Columbia University Press; New York: 1962. [Google Scholar]
  • 24.Waddington CH. Endeavour. 1942;1:18. [Google Scholar]
  • 25.Waddington CH. Towards a Theoretical Biology, Vol. 1, Prolegomena. International Union of Biological Sciences & Edinburgh University Press; Edinburgh: –1968.pp. 1 [Google Scholar]
  • 26.Huxley J. Evolution: The Modern Synthesis – the definitive edition. MIT Press; Cambridge: 2010. p. 555. (originally published in1942 London: Allen and Unwin) [Google Scholar]
  • 27.Hubbard EJA, Greenstein D WormBook. The C. elegans Research Community. WormBook; 2015. https://doi.org/10.1895/wormbook.1.18.1http://www.wormbook.org http://www.wormbook.org. [Google Scholar]
  • 28.Abdu Y, Maniscalco C, Heddleston JM, Chew TL, Nance J. Nat Cell Biol. 2016;18:1302. doi: 10.1038/ncb3439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wolke U, Jezuit EA, Priess JR. Development. 2007;134:2227. doi: 10.1242/dev.004952. [DOI] [PubMed] [Google Scholar]
  • 30.Grant B, Hirsh D. Mol Biol Cell. 1999;10:4311. doi: 10.1091/mbc.10.12.4311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Marré J, Traver EC, Jose AM. Proc Natl Acad Sci USA. 2016;113:12496. doi: 10.1073/pnas.1608959113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Marré J, Jose A. Mol Reprod Dev. 2017;84:283. [Google Scholar]
  • 33.Wang E, Hunter CP. Genetics. 2017;207:547. doi: 10.1534/genetics.117.300067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Salmon ED. J Cell Biol. 1975;65:603. doi: 10.1083/jcb.65.3.603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Weiss SB, Gladstone L. J Am Chem Soc. 1959;81:4118. [Google Scholar]
  • 36.Lillie FR. J Morphol. 1896;12:239. [Google Scholar]
  • 37.Hutchison CA, 3rd, Chuang RY, Noskov VN, Assad-Garcia N, Deerinck TJ, Ellisman MH, Gill J, Kannan K, Karas BJ, Ma L, Pelletier JF, Qi ZQ, Richter RA, Strychalski EA, Sun L, Suzuki Y, Tsvetanova B, Wise KS, Smith HO, Glass JI, Merryman C, Gibson DG, Venter JC. Science. 2016;351:aad6253. doi: 10.1126/science.aad6253. [DOI] [PubMed] [Google Scholar]
  • 38.Glass JI, Merryman C, Wise KS, Hutchison CA, 3rd, Smith HO. Cold Spring Harb Perspect Biol. 2017;9:a023861. doi: 10.1101/cshperspect.a023861. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Darwin C. On the Origin of Species. 1. John Murray; London: 1859. [Google Scholar]
  • 40.Barrangou R. Curr Opin Immunol. 2015;32:36. doi: 10.1016/j.coi.2014.12.008. [DOI] [PubMed] [Google Scholar]
  • 41.Linnaeus C, Rudberg D. De Peloria Diss. Ac. Amoenitates Academicae III; Uppsala: 1744. [Google Scholar]
  • 42.Gustafsson A. Theor Appl Genet. 1979;54:241. doi: 10.1007/BF00281206. [DOI] [PubMed] [Google Scholar]
  • 43.DeVries H. Species and Varieties: Their origin and mutation. 2. The Open Court Publishing Company; London: 1906. p. 475. [Google Scholar]
  • 44.Cubas P, Vincent C, Coen E. Nature. 1999;401:157. doi: 10.1038/43657. [DOI] [PubMed] [Google Scholar]
  • 45.Bracht JR, Fang W, Goldman AD, Dolzhenko E, Stein EM, Landweber LF. Cell. 2013;152:406. doi: 10.1016/j.cell.2013.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chalker DL, Meyer E, Mochizuki K. Cold Spring Harb Perspect Biol. 2013;5:a017764. doi: 10.1101/cshperspect.a017764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Brink RA. Genetics. 1956;41:872. doi: 10.1093/genetics/41.6.872. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Hollick JB. Nat Rev Genet. 2017;18:5. doi: 10.1038/nrg.2016.115. [DOI] [PubMed] [Google Scholar]
  • 49.Fire A, Xu S, Montgomery MK, Kostas SA, Driver SE, Mello CC. Nature. 1998;391:806. doi: 10.1038/35888. [DOI] [PubMed] [Google Scholar]
  • 50.Rankin CH. J Exp Biol. 2015;218:41. doi: 10.1242/jeb.108340. [DOI] [PubMed] [Google Scholar]
  • 51.Halfmann R, Lindquist S. Science. 2010;330:629. doi: 10.1126/science.1191081. [DOI] [PubMed] [Google Scholar]
  • 52.Dawkins R. The Selfish Gene. Oxford University Press; Oxford: 1989. p. 192. [Google Scholar]
  • 53.Jablonka E, Lamb ML. Evolution in Four Dimensions: Genetic, Epigenetic, Behavioral, and Symbolic Variation in the History of Life. The MIT Press; Cambridge: 2005. [Google Scholar]
  • 54.Rivoire O, Leibler S. Proc Natl Acad Sci USA. 2014;111:E1940. doi: 10.1073/pnas.1323901111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Jablonka E, Raz G. Quant Rev Biol. 2009;84:131. doi: 10.1086/598822. [DOI] [PubMed] [Google Scholar]
  • 56.Waddington CH. The strategy of the genes. George Allen & Unwin Ltd; New York: 1957. p. 32. [Google Scholar]
  • 57.Pisco AO, d’Herouel AF, Huang S. Bio Rxiv. 2016. p. 1. Online on May 12. [Google Scholar]
  • 58.Gould SJ, Eldredge N. Nature. 1993;366:223. doi: 10.1038/366223a0. [DOI] [PubMed] [Google Scholar]
  • 59.Rodgers AB, Morgan CP, Bronson SL, Revello S, Bale TL. J Neurosci. 2013;33:9003. doi: 10.1523/JNEUROSCI.0914-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Rechavi O, Houri-Ze’evi L, Anava S, Goh WS, Kerk SY, Hannon GJ, Hobert O. Cell. 2014;158:277. doi: 10.1016/j.cell.2014.06.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Sharma U, Conine CC, Shea JM, Boskovic A, Derr AG, Bing XY, Belleannee C, Kucukural A, Serra RW, Sun F, Song L, Carone BR, Ricci EP, Li XZ, Fauquier L, Moore MJ, Sullivan R, Mello CC, Garber M, Rando OJ. Science. 2016;351:391. doi: 10.1126/science.aad6780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Chen Q, Yan M, Cao Z, Li X, Zhang Y, Shi J, Feng GH, Peng H, Zhang X, Zhang Y, Qian J, Duan E, Zhai Q, Zhou Q. Science. 2016;351:397. doi: 10.1126/science.aad7977. [DOI] [PubMed] [Google Scholar]
  • 63.Klosin A, Casas E, Hidalgo-Carcedo C, Vavouri T, Lehner B. Science. 2017;356:320. doi: 10.1126/science.aah6412. [DOI] [PubMed] [Google Scholar]
  • 64.Greer EL, Maures TJ, Ucar D, Hauswirth AG, Mancini E, Lim JP, Benayoun BA, Shi Y, Brunet A. Nature. 2011;479:365. doi: 10.1038/nature10572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Siklenka K, Erkek S, Godmann M, Lambrot R, McGraw S, Lafleur C, Cohen T, Xia J, Suderman M, Hallett M, Trasler J, Peters AH, Kimmins S. Science. 2015;350:aab2006. doi: 10.1126/science.aab2006. [DOI] [PubMed] [Google Scholar]
  • 66.Feng S, Jacobsen SE, Reik W. Science. 2010;330:622. doi: 10.1126/science.1190614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Vastenhouw NL, Brunschwig K, Okihara KL, Müller F, Tijsterman M, Plasterk RH. Nature. 2006;442:882. doi: 10.1038/442882a. [DOI] [PubMed] [Google Scholar]
  • 68.Ashe A, Sapetschnig A, Weick EM, Mitchell J, Bagijn MP, Cording AC, Doebley AL, Goldstein LD, Lehrbach NJ, Le Pen J, Pintacuda G, Sakaguchi A, Sarkies P, Ahmed S, Miska EA. Cell. 2012;150:88. doi: 10.1016/j.cell.2012.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Buckley BA, Burkhart KB, Gu SG, Spracklin G, Kershner A, Fritz H, Kimble J, Fire A, Kennedy S. Nature. 2012;489:447. doi: 10.1038/nature11352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Luteijn MJ, van Bergeijk P, Kaaij LJ, Almeida MV, Roovers EF, Berezikov E, Ketting RF. EMBO J. 2012;31:3422. doi: 10.1038/emboj.2012.213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Shirayama M, Seth M, Lee HC, Gu W, Ishidate T, Conte D, Jr, Mello CC. Cell. 2012;150:65. doi: 10.1016/j.cell.2012.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.de Vanssay A, Bougé AL, Boivin A, Hermant C, Teysset L, Delmarre V, Antoniewski C, Ronsseray S. Nature. 2012;490:112. doi: 10.1038/nature11416. [DOI] [PubMed] [Google Scholar]
  • 73.Devanapally S, Ravikumar S, Jose AM. Proc Natl Acad Sci USA. 2015;112:2133. doi: 10.1073/pnas.1423333112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Leopold LE, Heestand BN, Seong S, Shtessel L, Ahmed S. Proc Natl Acad Sci USA. 2015;112:E2667. doi: 10.1073/pnas.1501979112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Minkina O, Hunter CP. Mol Cell. 2017;65:659. doi: 10.1016/j.molcel.2017.01.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Lev I, Seroussi U, Gingold H, Bril R, Anava S, Rechavi O. Curr Biol. 2017;27:1138. doi: 10.1016/j.cub.2017.03.008. [DOI] [PubMed] [Google Scholar]
  • 77.Ciabrelli F, Comoglio F, Fellous S, Bonev B, Ninova M, Szabo Q, Xuéreb A, Klopp C, Aravin A, Paro R, Bantignies F, Cavalli G. Nat Genet. 2017;49:876. doi: 10.1038/ng.3848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Devanapally S, Allgood S, Jose AM. Bio Rxiv. 2017:1. Online on Jun 11. [Google Scholar]
  • 79.Komor AC, Badran AH, Liu DR. Cell. 2017;168:20. doi: 10.1016/j.cell.2016.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang RY, Algire MA, Benders GA, Montague MG, Ma L, Moodie MM, Merryman C, Vashee S, Krishnakumar R, Assad-Garcia N, Andrews-Pfannkoch C, Denisova EA, Young L, Qi ZQ, Segall-Shapiro TH, Calvey CH, Parmar PP, Hutchison CA, 3rd, Smith HO, Venter JC. Science. 2010;329:52. doi: 10.1126/science.1190719. [DOI] [PubMed] [Google Scholar]
  • 81.Tachibana M, Sparman M, Sritanaudomchai H, Ma H, Clepper L, Woodward J, Li Y, Ramsey C, Kolotushkina O, Mitalipov S. Nature. 2009;461:367. doi: 10.1038/nature08368. Erratum in: Nature 2014, 516, 276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Ostrov N, Landon M, Guell M, Kuznetsov G, Teramoto J, Cervantes N, Zhou M, Singh K, Napolitano MG, Moosburner M, Shrock E, Pruitt BW, Conway N, Goodman DB, Gardner CL, Tyree G, Gonzales A, Wanner BL, Norville JE, Lajoie MJ, Church GM. Science. 2016;353:819. doi: 10.1126/science.aaf3639. [DOI] [PubMed] [Google Scholar]
  • 83.Gantz VM, Bier E. Science. 2015;348:442. doi: 10.1126/science.aaa5945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Baltimore D, Berg P, Botchan M, Carroll D, Charo RA, Church G, Corn JE, Daley GQ, Doudna JA, Fenner M, Greely HT, Jinek M, Martin GS, Penhoet E, Puck J, Sternberg SH, Weissman JS, Yamamoto KR. Science. 2015;348:36. doi: 10.1126/science.aab1028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Blain JC, Szostak JW. Annu Rev Biochem. 2014;83:615. doi: 10.1146/annurev-biochem-080411-124036. [DOI] [PubMed] [Google Scholar]
  • 86.Schwille P. J Cell Biol. 2015;210:687. doi: 10.1083/jcb.201506125. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Eiraku M, Takata N, Ishibashi H, Kawada M, Sakakura E, Okuda S, Sekiguchi K, Adachi T, Sasai Y. Nature. 2011;472:51. doi: 10.1038/nature09941. [DOI] [PubMed] [Google Scholar]
  • 88.Lancaster MA, Renner M, Martin CA, Wenzel D, Bicknell LS, Hurles ME, Homfray T, Penninger JM, Jackson AP, Knoblich JA. Nature. 2013;501:373. doi: 10.1038/nature12517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Jacob F. Science. 1977;196:1161. doi: 10.1126/science.860134. [DOI] [PubMed] [Google Scholar]
  • 90.Lynch M. Proc Natl Acad Sci USA. 2007;1:8597. [Google Scholar]
  • 91.Madhani HD. Cell. 2013;155:744. doi: 10.1016/j.cell.2013.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Gould SJ, Lewontin RC. Proc R Soc Lond B Biol Sci. 1979;205:581. doi: 10.1098/rspb.1979.0086. [DOI] [PubMed] [Google Scholar]
  • 93.Tsioris K, Torres AJ, Douce TB, Love JC. Annu Rev Chem Biomol Eng. 2014;5:455. doi: 10.1146/annurev-chembioeng-060713-035958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Briggs R, King TJ. Proc Natl Acad Sci USA. 1952;38:455. doi: 10.1073/pnas.38.5.455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Gurdon JB, Elsdale TR, Fischberg M. Nature. 1958;182:64. doi: 10.1038/182064a0. [DOI] [PubMed] [Google Scholar]
  • 96.Park M, Keung AJ, Khalil AS. Genome Biol. 2016;17:183. doi: 10.1186/s13059-016-1046-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hilton IB, D’Ippolito AM, Vockley CM, Thakore PI, Crawford GE, Reddy TE, Gersbach CA. Nat Biotechnol. 2015;33:510. doi: 10.1038/nbt.3199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Jacob F, Monod J. J Mol Biol. 1961;3:318. doi: 10.1016/s0022-2836(61)80072-7. [DOI] [PubMed] [Google Scholar]
  • 99.Nirenberg MW, Matthaei JH. Proc Natl Acad Sci USA. 1961;47:1588. doi: 10.1073/pnas.47.10.1588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Extavour CG, Akam M. Development. 2003;130:5869. doi: 10.1242/dev.00804. [DOI] [PubMed] [Google Scholar]
  • 101.Brinster RL, Zimmermann JW. Proc Natl Acad Sci USA. 1994;91:11298. doi: 10.1073/pnas.91.24.11298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Eggel A, Wyss-Coray T. Swiss Med Wkly. 2014;144:w13914. doi: 10.4414/smw.2014.13914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Derelle E, Ferraz C, Rombauts S, Rouzé P, Worden AZ, Robbens S, Partensky F, Degroeve S, Echeynié S, Cooke R, Saeys Y, Wuyts J, Jabbari K, Bowler C, Panaud O, Piégu B, Ball SG, Ral JP, Bouget FY, Piganeau G, De Baets B, Picard A, Delseny M, Demaille J, Van de Peer Y, Moreau H. Proc Natl Acad Sci USA. 2006;103:11647. doi: 10.1073/pnas.0604795103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Beisson J, Sonneborn TM. Proc Natl Acad Sci USA. 1965;53:275. doi: 10.1073/pnas.53.2.275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Torday JS. Biology (Basel) 2015;4:573. doi: 10.3390/biology4030573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Phillips EF. Proc Amer Phil Soc. 1903;42:275. [Google Scholar]
  • 107.Olsen OW. Animal Parasites: Their Life Cycles and Ecology. Dover Publications; Mineola, New York: 1986. [Google Scholar]
  • 108.Birnbaum KD, Sanchez-Alvarado A. Cell. 2008;132:697. doi: 10.1016/j.cell.2008.01.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Shannon C. Bell Syst Tech J. 1948;27:379. [Google Scholar]
  • 110.Cartwright JHE, Giannerini S, Gonzalez DL. Philos Trans Royal Soc A. 2016:374. doi: 10.1098/rsta.2015.0071. [DOI] [PMC free article] [PubMed] [Google Scholar]; a) Barbieri M. Philos Trans Royal Soc A. 2016;374:20150060. [Google Scholar]; b) Koonin EV. Philos Trans Royal Soc A. 2016;374:20150065. [Google Scholar]; c) Wills PR. Philos Trans Royal Soc A. 2016;374:20150066. [Google Scholar]; d) Ball P. Philos Trans Royal Soc A. 2016;374:20150072. [Google Scholar]; e) Barbieri M. Philos Trans Royal Soc A. 2016;374:20150148. [Google Scholar]; f) Adami C. Philos Trans Royal Soc A. 2016;374:20150230. [Google Scholar]

RESOURCES