Skip to main content
Philosophical transactions. Series A, Mathematical, physical, and engineering sciences logoLink to Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
. 2017 Nov 13;375(2109):20160358. doi: 10.1098/rsta.2016.0358

How causal analysis can reveal autonomy in models of biological systems

William Marshall 1, Hyunju Kim 2,3, Sara I Walker 2,3, Giulio Tononi 1, Larissa Albantakis 1,
PMCID: PMC5686412  PMID: 29133455

Abstract

Standard techniques for studying biological systems largely focus on their dynamical or, more recently, their informational properties, usually taking either a reductionist or holistic perspective. Yet, studying only individual system elements or the dynamics of the system as a whole disregards the organizational structure of the system—whether there are subsets of elements with joint causes or effects, and whether the system is strongly integrated or composed of several loosely interacting components. Integrated information theory offers a theoretical framework to (1) investigate the compositional cause–effect structure of a system and to (2) identify causal borders of highly integrated elements comprising local maxima of intrinsic cause–effect power. Here we apply this comprehensive causal analysis to a Boolean network model of the fission yeast (Schizosaccharomyces pombe) cell cycle. We demonstrate that this biological model features a non-trivial causal architecture, whose discovery may provide insights about the real cell cycle that could not be gained from holistic or reductionist approaches. We also show how some specific properties of this underlying causal architecture relate to the biological notion of autonomy. Ultimately, we suggest that analysing the causal organization of a system, including key features like intrinsic control and stable causal borders, should prove relevant for distinguishing life from non-life, and thus could also illuminate the origin of life problem.

This article is part of the themed issue ‘Reconceptualizing the origins of life’.

Keywords: autonomy, origin of life, causality, integrated information

1. Introduction

The emergence of life from non-living matter is widely regarded as one of the most challenging open problems in science, standing alongside other stubborn problems such as understanding agency, or characterizing the nature of consciousness [1]. There are at present as many open questions as there are unique approaches to solving the origins of life. Most approaches tend to fall within three major categories: historical, dealing with how life first emerged on Earth; synthetic, addressing how to construct life from scratch; and universal, attempting to extract features such as ‘aliveness’ that might apply to life anywhere in the universe [2]. Progress in understanding the chemical basis of life, heralded by the molecular revolution of the twentieth century, has set the focus of origins research on historical and synthetic approaches. Much less progress has been made with universal approaches. In part, this is due to a lack of alternative examples to contrast with the one chemical example of life we have (i.e. life on Earth): without even a second example it is difficult to separate potential universal features from those that are contingent features of life on Earth.

Despite intense debate on the matter, there have been numerous attempts to delineate a universal definition for life. Recent efforts have specifically focused on properties of life that might lay the ground for a theory of biology [36]. Among the properties of life most amenable to this approach is autonomy [4,5]. Roughly speaking, autonomous systems can be characterized as forming a unitary ‘whole’ from their own intrinsic perspective, and being able to maintain themselves in the face of changing internal and external states. That is, autonomous systems are integrated wholes with self-defined and self-maintained borders. This implies some notion of intrinsic causal control. It has been previously proposed that the origin of life might be identified as a transition in causal structure [6,7]. If autonomy is universal to life, causation should be expected to play a prominent role in developing a more universal understanding of living processes: to be a living entity requires the power to regulate one's own internal state. Here we aim to bring quantitative rigour to the discussion by characterizing the intrinsic causal mechanisms of a model of a biological system as a case study.

As a demonstrative example, we use a well-studied biological model: the Boolean network model of the fission yeast cell cycle [8]. Previous work has focused on the dynamics of this network, including the shape of the attractor landscape associated with cell-cycle function [8], network robustness and function [9], controllability for regulating function [10] and informational structure [11,12]. While being a very simple model of biological function, the Boolean network model for the fission yeast cell cycle displays informational structure that is distinct from random networks with similar topological structure [11,12], which suggests that it might be capturing some characteristic features of living systems. However, from the dynamics of the system alone we cannot understand why a particular state happens (i.e. what caused it), or how the system's components constrain each other (i.e. what caused what). A causal approach is necessary to know which elements or sets of elements form mechanisms with cause–effect power within the system [13,14], and, thereby, to characterize architectures capable of intrinsic control.

Moreover, autonomy requires a system to construct its own ‘umwelt’ [15], causally separating itself from its environment. Nevertheless, living systems are open systems that dynamically and materially interact with their environment. For this reason, a purely dynamical approach cannot distinguish between a living entity and its environment. While classical information theory can be used to characterize an established individual within the framework of dynamical approaches [16], it does not inform how individuals can emerge in the first place. Only by employing counterfactual [17], interventionist [18] notions of causality can we properly decouple the intrinsic control a system has over itself from that of its environment and thereby hope to recover how such systems emerge.

To fully characterize the causal mechanisms involved in the fission yeast cell-cycle model we employ the formalism of integrated information theory (IIT) [19]. IIT provides a rigorous definition for intrinsic cause–effect power as integrated information (see Methods). By IIT's composition principle, any subset of elements within the system can be a mechanism of the system if its intrinsic cause–effect power is irreducible. Emergent mechanisms composed of more than one element are termed ‘high-order mechanisms’. Irreducibility is assessed by the integrated information φ (small phi) of the subset of elements. If φ > 0, the subset in its current state constrains the past and future states of the system in an irreducible way (so that information is lost under any partition).

The set of all mechanisms and their constraints within the system comprises a system's cause–effect structure. By IIT's integration principle, a system has integrated information Φ > 0 (big phi) if its cause–effect structure is irreducible (so that mechanisms and constraints are lost under any system partition). A system with Φ > 0 thus forms a unitary whole, as the elements within it constrain, and are being constrained by, the other elements of the system in an irreducible manner, above a background of external influences. While in principle many sets of elements within a larger system can have Φ > 0, it is the local maxima of Φ that define where intrinsic causal borders emerge [20]. In this way, IIT provides a formal and quantitative framework to capture intrinsic properties of a system which may prove essential in distinguishing an entity from its environment [7,20].

In the following, we demonstrate how the IIT analysis confirms established results regarding controllability and robustness of the Boolean network model of the fission yeast cell cycle, while providing a causal explanation for these properties. In addition, the analysis reveals previously overlooked attributes of the cell-cycle model, intrinsic control and causal borders, which are key features of biological autonomy [4,5]. We propose that the IIT analysis has the capacity to provide a quantitative framework for establishing autonomy in biological systems, and outline the future work necessary to validate this proposal. Finally, we briefly discuss implications for the origins of life, and in particular, how the notions of integration and causal control might inform a universal definition of life that could be useful in classifying systems on the edge between life and non-life.

(a). Boolean network model of the fission yeast cell cycle

In modern cells, cellular division is tightly regulated and produces two daughter cells from a single parent cell through a cell cycle consisting of a sequence of four phases, G1–S–G2–M. During the G1 phase, the cell grows, and if conditions are favourable, division begins. DNA is then replicated in the S stage. The G2 stage is a ‘gap’ between DNA replication (S phase) and mitosis (M phase) where the cell continues to grow. During the M stage, the cell undergoes mitosis, and two daughter cells are produced. The daughter cells then enter G1 again, thereby completing the full cycle.

The Boolean network model for the fission yeast cell-cycle process reproduces the protein expression states through each of these phases for nine proteins known to be key regulators governing the cell-cycle process [8]. Proteins are represented as nodes, and the links between two nodes are the inhibiting or activating biochemical interactions between pairs of proteins (mediated through regulation of gene expression). For each node, a binary state value ‘0’ or ‘1’ is assigned, which indicates the absence or presence of the particular protein, respectively. The successive states Si of a node i are updated in discrete time steps by the following rule, which can be written as a function of the connectivity pattern, the states of nodes connected to i and the threshold θi:

(a). 1.1

where aij denotes the weight for a directed edge (j → i). For each edge, a weight is assigned according to the type of the interaction: aij = −1 for inhibition and aij = +1 for activation, and aij = 0 for no biochemical interaction. The threshold for all nodes in the network is 0 with the exception of θCdc2/13 = −0.5 and θCdc2/13* = 0.5. If not stated otherwise, by ‘(cell-cycle) network’ we always refer to the Boolean network model rather than the real biological system in the following.

The dynamical process of the network obtained by iterating the update rule in equation (1.1) reproduces the time sequence of protein expression states (shown in figure 1b) corresponding to the four phases of the cellular division process. Here we define this particular progression of states as the biological sequence for the fission yeast cell-cycle model. Reproducing this sequence of states can be considered as the main function of the cell-cycle model. Previously, a subgraph of the network, called the ‘backbone’ motif, was discovered as the minimal set of connections necessary to exactly reproduce this biological sequence [9]. Other connections in the network, not included in the backbone, add robustness (associated with the shape of the attractor landscape, see below). Thus, for the fission yeast cell-cycle model, function is separable from robustness.

Figure 1.

Figure 1.

Fission yeast cell-cycle network. (a) Network structure for a Boolean model of the fission yeast cell cycle [6]. Control nodes of the system as identified in [8] are highlighted by green dashed lines. (b) Sequence of nine states corresponding to the biological function of the network. The final state t9 is the fixed point of the primary attractor in the system. (Online version in colour.)

The majority of studies on the fission yeast cell-cycle model are focused on its global dynamics in the state space of 29 = 512 possible states for the nine-node network. The time evolution of the network initialized with each of the 512 possible states yields a flow diagram of network states that details all possible dynamical trajectories in the state space [8,10]. In the flow diagram, each initial state converges into one of 13 possible attractors (12 fixed-point attractors and one limit-cycle attractor). The number of initial states that converges to each attractor is defined as its basin size and the attractor with the biggest basin size is called a primary attractor. For the network this corresponds to the final state of the biological sequence (herein called the biological attractor). The basin size of the biological attractor is associated with the robustness of the network, where networks with larger basins are more robust. Although this biological attractor is a fixed point in the state space of the model, in reality this state represents the completion of replication, at which point there are now two daughter cells each of which has returned to its initial state and the cycle can start again (here modelled as an ‘external’ influence, by switching the SK node to ON).

The fission yeast cell-cycle model, moreover, features a subset of nodes {Ste9, Rum1, Wee1, Cdc25}—called the control kernel—that can regulate the network's robustness by external intervention [10]. The basin of the biological attractor naturally contains 378 of the 512 network states. To study the influence of individual elements and sets of elements on basin size, a ‘pinning’ operation is employed. External intervention is used to continuously pin the states of specific elements to their biological attractor state during network evolution to observe how the pinning influences the size of the attractor basin. The control kernel is the minimal set of nodes such that when pinned, the basin of the biological attractor is the entire state space of the network [10]. The presence of the control kernel also underlies the network's distinctive informational properties distinguishing it from random networks [11,12]. Here, we further quantify the influence of individual nodes on the basin of the biological attractor by performing the pinning operation separately on each node in the network. We measured the change in the basin size of the biological attractor as a result of the time evolution with each individual node continuously pinned in its biological attractor state. This pinning operation performed on nodes Cdc2/13, Ste9 or Rum1 produces a larger basin size for the biological attractor, SK does not change the basin size, and all other nodes decrease the basin size as shown in the figure 2c.

Figure 2.

Figure 2.

Composition of the cell-cycle network's cause–effect structure in the biological attractor state. (a) Example high-order mechanism in the eight-node cell-cycle system. The two nodes Cdc 2/13 and PP together have joint, irreducible constraints on the past states of nodes Ste9, Rum1 and Slp1 and the future states of nodes Rum1 and Ste9, as indicated by their cause and effect repertoires. Since it is impossible to partition the mechanism and its purviews without losing part of the constraints, {Cdc2/13, PP} form a second-order mechanism in the system. Red dashed lines indicate the MIP corresponding to the partitioned cause and effect repertoires, as well as integrated information φ = 0.15. (b) Role of control nodes in the cause–effect structure. Despite their supposed functional role in the system, control nodes do not contribute to more system mechanisms than non-control nodes. Instead, control nodes appear in more future purviews, meaning that their future states are being constrained by more system mechanisms than those of the non-control nodes. (c) Contribution of individual nodes to the basin of the biological attractor as measured by the pinning operation (see text). (d) The third-order mechanism composed of the most influential elements from (c), {Cdc2/13, Ste9, Rum1} constrains its future purview, consisting of the four control nodes, to be in state ‘1110’ with highest probability, which corresponds to the biological attractor. (Online version in colour.)

In what follows, we apply the IIT formalism to perform a comprehensive causal analysis of the fission yeast cell-cycle model. We note that elementary interactions in the model (i.e. the elements' input–output functions and which elements can affect which other elements) have already been established through extensive prior experimental manipulation and observation [8]. The IIT analysis exposes the compositional causal structure hidden within the network of elementary interactions, by making its intrinsic, irreducible causal constraints explicit. Of course, the IIT analysis can only infer the causal structure of the specific model under consideration. The Boolean network model represents only a small fraction of the interactions that occur within a real fission yeast cell during the division process, and an extended model that includes these interactions would certainly reveal additional causal structure.

Applying the IIT analysis to the cell-cycle model reveals that the network has many high-order mechanisms and forms an integrated whole that is maintained through the phases of the cell cycle. The model's cause–effect structure elucidates how the system's high-order mechanisms drive regulation of the control kernel nodes and thus provides deeper understanding of how the network internally regulates its own function. We also demonstrate that the backbone motif of the cell-cycle network [9] is not similarly integrated through all phases. The function of the network can thus be detached from its robustness and integration.

2. Results

The analysis of the cause–effect structure of the fission yeast cell-cycle model is broken down into three parts. First, we focus specifically on the biological attractor (fixed point t9 in figure 1). We identify all mechanisms in the cause–effect structure and the specific way they constrain the system. Particular attention is paid to the role of extrinsically identified control nodes from the intrinsic perspective of the system. Next we identify local maxima of intrinsic cause–effect power Φ out of all the possible subsystems. Finally, we assess the robustness of these results across all nine states of the biological sequence. The element SK is an input: it signals to the rest of the network but never receives any feedback, and thus is not part of any integrated system. In all stages of the analysis, we focus on the remaining eight nodes of the network.

(a). Cause–effect structure of the fission yeast cell-cycle model

The IIT causal analysis reveals that the cell-cycle network in its biological fixed point has 49 irreducible mechanisms (φ > 0), including all eight possible first-order mechanisms and also 41 high-order mechanisms which irreducibly constrain the system's past and future states. A full list of mechanisms is given in electronic supplementary material, table S1. An example of a second-order mechanism composed of Cdc2/13 and PP is shown in figure 2a. The mechanism irreducibly constrains the potential past states of Ste9, Rum1, and Slp1 (called its past purview) and the potential future states of Ste9 and Rum1 (called its future purview). The specific way that the mechanism constrains its past and future purviews is described by its cause and effect repertoires. The irreducibility of the mechanism is assessed by finding its minimum information partition (MIP; shown as a red dashed line), and measuring the difference it makes to the repertoires. For this mechanism, we find that the integrated information is φ = 0.15. Overall, the first-order mechanisms have an average φ value of 0.21, while the high-order mechanisms have an average φ of 0.05.

Since the control kernel has been identified as a handle for extrinsic control [10], we further analyse the cause–effect structure of the cell-cycle network by investigating the contribution of control nodes to the mechanisms and their purviews (figure 2b). Of the 49 mechanisms, 42 include at least a single control node, and 45 contain at least a single non-control node. The average φ for mechanisms with control nodes is 0.067, and with non-control nodes is 0.069. There is no significant difference between control nodes and non-control nodes, with respect to the composition of mechanisms or their φ. Furthermore, the set of all four control nodes is reducible (φ = 0) and thus does not form a high-order mechanism within the system. This means that the control nodes together do not have cause–effect power within the network. On the other hand, the control nodes are significantly over-represented in the future purviews of the mechanisms (p = 0.029 using nonparametric permutation test). Of the 49 mechanisms, 42 constrain the future state of at least one control node, while only 30 constrain the future state of at least one non-control node. The average φ for a mechanism that constrains a control node is 0.071, while the average φ for a mechanism that constrains a non-control node is 0.044.

In contrast to the control kernel, the three elements identified by the pinning procedure (figure 2c) do form an irreducible third-order mechanism within the system, meaning they have an irreducible effect within the system (φ = 0.15). Moreover, the specific way they constrain the potential future states of the system is to coerce the control nodes into the biological attractor (figure 2d). Given that setting the control kernel into a specific state ensures that the system enters its biological attractor, the result that intrinsic mechanisms constrain the control nodes into the biological attractor can be seen as a self-regulating property of the network.

(b). Local maxima of cause–effect power

Next we evaluate Φ of all possible subsystems of elements defined from the cell-cycle network in its biological fixed point. We are particularly interested in the local maxima of Φ, as these points correspond to subsystems (groupings of elements) where unique causal properties emerge. Evaluating all possible subsystems of the cell-cycle network, we identify five subsystems with Φ > 0, three of which are local maxima of Φ (figure 3). The global maximum value of Φ = 0.431 occurs for the whole system of eight elements; this not only demonstrates the fission yeast cell-cycle network is an integrated whole, but that there is irreducible cause–effect power that can only be seen when the system is viewed as such. There are also local maxima for three nodes {Cdc2/13, Ste9, Rum1} with Φ = 0.090, and for six nodes {Cdc2/13, Ste9, Rum1, Cdc2/13*, Wee1, Cdc25} with Φ = 0.001, suggesting that these sets of elements form causally relevant subsystems within the cell-cycle network. Indeed, the system with Φ = 0.090 corresponds exactly to the biologically relevant elements identified by the pinning procedure (figure 2c), suggesting that this subsystem plays an important role in the self-regulation of the cell-cycle network. Finally, the six-node system with Φ = 0.001 demonstrates the network is still integrated without PP and Slp1, albeit with greatly diminished cause–effect power.

Figure 3.

Figure 3.

Local maxima of intrinsic cause–effect power in the biological attractor. (a) The most irreducible cause–effect structure (global maximum) is specified by the largest strongly connected set of eight elements, excluding only the input node SK. (b) The three nodes that individually expand the biological attractor basin the most when set into their attractor state (figure 2c) also form an integrated system with many mutual constraints. (c) Taking SK, PP and Slp1 as background conditions yields another system with an integrated cause–effect structure comprised of six nodes. (Online version in colour.)

(c). Dynamics of causal properties

Cause–effect power quantifies how a system, by being in a particular state, constrains its potential past and future states through its intrinsic mechanisms. Thus, the causal properties of a system are state dependent: in different states, different mechanisms within the network may be irreducible, and they may constrain the past and future states of the system in different ways. To investigate the dynamical properties of cause–effect power, we analyse the cause–effect structure of the cell-cycle network in all the states comprised by its biological sequence. To contrast these results with standard functionalist approaches, we also analyse the backbone motif of the cell-cycle network [9], the minimal set of edges required to maintain biological function. Here, function is defined as reproducing the biological sequence of states associated with the real cell-cycle trajectory. There are nine states in the biological sequence (figure 1b); however, the second state cannot be reached from within the eight-node system if SK is treated as a background condition. This means that this state has no causes from within the system (it is caused by the external element SK) and thus has undefined intrinsic cause–effect power. We thus restrict our analysis to the eight well-defined states.

The dynamic analysis reveals the cell-cycle network to be robustly integrated. Over the course of the biological sequence, the full cell-cycle network of eight elements is integrated and, in fact, a local maximum of Φ in all eight states (figure 4a). In each state, the cause–effect structure contains eight first-order mechanisms, one for each element in the system. In addition, there are always high-order mechanisms in the cause–effect structure, as few as 10 for t1 and as many as 111 at t4 (figure 4b). Note, however, that the full network is not necessarily an integrated whole for states outside the biological sequence. For example, in the state where only SK is active, the system has only a single local maximum consisting of six elements {Cdc2/13, Ste9, Slp1, Cdc2/13*, Cdc25, PP}. The identified robust integration during the biological sequence is thus not simply the result of the model's network structure, but rather the specific state-dependent causal mechanisms that provide intrinsic control.

Figure 4.

Figure 4.

Stability of local maxima of Φ across biological cell-cycle network states. (a) A total of four sets of elements form local maxima of Φ during the biological sequence. In all but one undefined state (t2, see text) the eight-node system forms an integrated whole from the intrinsic perspective. (b) Comparison of Φ and number of mechanisms in the eight-node system and the largest strongly connected component of the cell-cycle network's backbone motif, which is sufficient to reproduce the function of the cell cycle, but has a severely diminished cause–effect structure. (c) Backbone of the cell-cycle network [7]. (Online version in colour.)

Moreover, in contrast to the full network, the backbone network (figure 4c) is not an integrated whole (Φ = 0). The largest integrated subsystem of the backbone network contains six nodes (Rum1 and Wee1 are no longer integrated nodes in the reduced backbone network, as they lack outputs). This six-node subsystem is only a local maximum of Φ in three of the eight states. Furthermore, this system of six elements contains only six first-order mechanisms, and no high-order mechanisms in any of the biological states. This indicates that recapitulating the function of the biological cell cycle does not necessitate integration as observed in the full cell-cycle model. While both the full network and the reduced backbone motif network display the same function, only the full network is integrated, with consistent causal borders, for all states in the biological sequence. The identified stable borders in the biological sequence of the cell-cycle network thus emerge due to the intrinsic control that the system's mechanisms exert on each other, exhibiting self-maintenance, and not as a consequence of modelling the cell cycle's biological function.

3. Discussion

In this work, we apply the causal framework of IIT to analyse the full cause–effect structure of the Boolean network model of the fission yeast cell cycle. We demonstrate how the framework's composition and integration principles identify emergent high-order mechanisms and emergent causal borders, defined as compositions of system elements that have irreducible cause–effect power, and as subsets of elements that define local maxima of intrinsic, irreducible cause–effect power, respectively. The results of the causal analysis reveal several properties of the cell-cycle network that cannot be uncovered by a purely dynamical approach, namely that the Boolean network model of the fission yeast cell cycle is a robustly integrated whole and is self-regulating. In summary, we have demonstrated how IIT's causal analysis can be applied to reveal defining features of biological autonomy—the ability of a physical system to self-define, and self-maintain its borders [4,5].

Previous analysis of the cell-cycle network identified a control kernel that, when externally intervened upon, dictates the dynamics of the network [10]. However, studying the high-order mechanisms within the system reveals an intrinsic, causal mechanism which self-regulates not only the network's function, but also its causal borders, without the need for external manipulation. Implicit in this analysis is a rejection of the reductionist assumption according to which only individual micro elements have true cause–effect power. Reductionist approaches, as well as holist approaches that lack composition cannot account for how subsets of elements work together, constraining the system jointly and irreducibly to achieve a specific state transition [13,19]. The study of high-order mechanisms may also help to inform genetic experiments on Schizosaccharomyces pombe cells, which so far have been primarily holist (sequencing the entire genome [21]), or reductionist (using ‘gene knockout’ to identify the functional role of individual genes [22]). Here, we find that the third-order mechanism composed of {Cdc2/13, Spe9, Rum1} plays an important role for the self-regulation of the cell-cycle model (see figure 2d), and also that these three elements form a local maximum of cause–effect power (figure 3b). Each of these three genes have already been individually identified as important for cell-cycle function [23,24], yet we predict that a multiple knockout of these three genes would have some effect on the function of the real cell cycle that is ‘greater than the sum of its parts’.

Furthermore, a causal approach allows us to disentangle the intrinsic mechanisms within a system from the contributions of its external environment and thus establish self-defined borders necessary for autonomy. A functionalist approach, which only cares about what happens and not why it happens, has no way to distinguish the two contributions, and thus cannot firmly identify the borders between an entity and its environment. This distinction becomes important when comparing different networks with similar function. Comparing the causal structure of the cell-cycle network to that of its backbone motif reveals that only the former is robustly integrated. For the cell-cycle network, the whole network is a local maximum of Φ across all biological states. In contrast, the backbone network is not integrated (Φ = 0), and the causal borders identified are not robust, varying across states. Thus, while the robustly integrated cell-cycle network can be viewed as a causally autonomous system, with self-defined and self-maintained borders, the backbone network lacks borders and the ability to maintain itself in the face of changing internal and external conditions. This establishes a distinction between function and integration, which may be important to the origins of life. A prebiotic system, such as an autocatalytic network or a protocell [25], even if functional, may not be integrated and thus would not be an autonomous system.

In general, the IIT analysis is computationally intense and requires extensive perturbational data, or an already established model of the basic system elements and their interactions. Suitable models of biological systems relevant for studying autonomy are still sparse. It is important to note that the cell-cycle network studied here is only a coarse model of a functional subpart of a S. pombe cell. While the cell-cycle model displays key features of biological autonomy, it lacks crucial components that are typically associated with the autonomy of the cell per se, for example, the cytoplasmic membrane. To confirm whether the set of elements comprising the cell-cycle network could still form an autonomous system once these additional components are taken into account, a properly extended model (supported by appropriate experiments) is required. One interesting question here is whether an extended model of an entire cell including the cytoplasmic membrane would reveal, as typically assumed, that the cytoplasmic membrane is within the self-defined borders of a cell. It might also turn out that the cytoplasmic membrane is not an intrinsic, causal part of the system, but rather provides advantageous background conditions to facilitate life.

The utility of the approach presented here is that it requires no a priori assumptions about what constitutes the borders of an entity: the model could be expanded to include any components within a supposed entity, as well as putative environmental variables, or even multiple entities. Once computationally feasible, and once a sufficiently detailed model of the cell is available, a demonstration that the IIT analysis indeed reveals appropriate causal borders in such an extensive model (i.e. which elements are in, and which are out) would provide strong evidence for the proposed approach. This quality, moreover, is a necessary prerequisite for any theoretical attempt to identify autonomy of a system on the edge between life and non-life. Causal control internal to a system's dynamics is another requirement for autonomy which has previously been proposed as a critical step in the emergence of living networks [26,27]. Indeed, it has been shown that control elements can emerge dynamically in catalytic networks [28]. Our analysis reveals that both internal control and causal borders can be identified using IIT. Since autonomy is likely a universal property of life, the kind of causal analysis presented here could place new constraints on models for the origin of life applicable to diverse chemical systems and help to identify at what point a living system has emerged.

4. Methods

A system is a set of elements S in a state s, where each element has two or more states, inputs that influence its state and outputs which influence the state of system elements. A system has a corresponding transition probability function p which describes the probabilities with which the system transitions from one state to another for all possible system states. The transition probability function provides the basis for our causal analysis. For a given system, the transition probability function has to be determined by experiments involving intervention and manipulation of variables [18], which either identify the input–output relationship of the system's elements, from which the transition probability function can be constructed, or directly measure the state-to-state transition probabilities of the system. A properly determined transition probability function should satisfy the Markov property. In addition, we require that the current states of elements are independent, conditional on the past state of the system (prohibiting instantaneous, non-causal interactions). For any two subsets of S, called the mechanism Y and the purview Z, we can define the cause and effect repertoires of Y over Z, that is, how Y in its current state yt constrains the potential past or future states of Z:

4.

where Kcause and Keffect are normalizing constants [18,19,29]. The superscript ‘c’ labels the complement of a set of elements, e.g. Zc = SZ.

The integrated cause–effect information of Y is then defined as the distance between the cause–effect repertoires of the mechanism, and the cause–effect repertoires of their MIP over the purview that is maximally irreducible,

4.

where cut is a partition of the mechanism into two parts, and pcut the repertoire under the partition,

4.

The integrated information of the mechanism is the minimum of its corresponding integrated cause and effect information,

4.

All sets of elements with φ > 0 are irreducible mechanisms M within the system and thus contribute to the system's cause–effect structure C = {{pcause, peffect, φ}M}, the set of all irreducible cause–effect repertoires with their corresponding φ values. The integrated information of the entire system is then defined as the distance between the cause–effect structure of the system, and cause–effect structure defined by its MIP, eliminating constraints from one part of the system to the rest:

4.

For both the integrated information of a mechanism (φ) and the integrated information of a system (Φ), the distance metric used is the earth mover's distance [19,30]. Finally, if S is a subset of elements within a larger system, all elements outside of S are considered as part of the environment and are conditioned on their current state throughout the causal analysis. All computations for this study were performed by the PyPhi software package [31], using the ‘CUT_ONE_APPROXIMATION’ to Φ.

Supplementary Material

Table of Mechanisms
rsta20160358supp1.doc (66KB, doc)

Data accessibility

This article has no additional data.

Authors' contributions

W.M., G.T. and L.A. conceived of the experiment. W.M. and H.K. performed the analyses. W.M., S.I.W. and L.A. interpreted the results. All authors contributed to drafting and critically editing the manuscript.

Competing interests

S.I.W. is the guest editor for the ‘Origins’ theme issue. The manuscript was given to an alternative editor for assessment.

Funding

W.M., L.A. and G.T. are supported by the Templeton World Charities Foundation (grant no. TWCF 0067/AB41).

References

  • 1.Walker SI, Davies PCW. 2017. The ‘Hard Problem’ of life. In From matter to life: information and causality (eds Walker SI, Davies PCW, Ellis GFR), pp. 19–37. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 2.Scharf C, et al. 2015. A strategy for origins of life research. Astrobiology 15, 1031–1042. ( 10.1089/ast.2015.1113) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cleland CE. 2012. Life without definitions. Synthese 185, 125–144. ( 10.1007/s11229-011-9879-7) [DOI] [Google Scholar]
  • 4.Maturana HR, Varela FJ. 1991. Autopoiesis and cognition: the realization of the living. Boston Studies in the Philosophy of Science, vol. 42 Dordrecht, The Netherlands: Reidel. [Google Scholar]
  • 5.Ruiz-Mirazo K, Peretó J, Moreno A. 2004. A universal definition of life: autonomy and open-ended evolution. Orig. Life Evol. Biosph. 34, 323–346. ( 10.1023/B:ORIG.0000016440.53346.dc) [DOI] [PubMed] [Google Scholar]
  • 6.Walker SI, Davies PCW. 2013. The algorithmic origins of life. J. R. Soc. Interface 10, 20120869 ( 10.1098/rsif.2012.0869) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Walker SI. 2014. Top-down causation and the rise of information in the emergence of life. Information 5, 424–439. ( 10.3390/info5030424) [DOI] [Google Scholar]
  • 8.Davidich MI, Bornholdt S, Stolovitzky G. 2008. Boolean network model predicts cell cycle sequence of fission yeast. PLoS ONE 3, e1672 ( 10.1371/journal.pone.0001672) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wang G, Du C, Chen H, Simha R, Rong Y, Xiao Y, Zeng C. 2010. Process-based network decomposition reveals backbone motif structure. Proc. Natl Acad. Sci. USA 107, 10 478–10 483. ( 10.1073/pnas.0914180107) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kim J, Park S-M, Cho K-H. 2013. Discovery of a kernel for controlling biomolecular regulatory networks. Sci. Rep. 3, 2223 ( 10.1038/srep02223) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kim H, Davies P, Walker SI. 2015. New scaling relation for information transfer in biological networks. J. R. Soc. Interface 12, 20150944 ( 10.1098/rsif.2015.0944) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Walker SI, Kim H, Davies PCW. 2016. The informational architecture of the cell. Phil. Trans. R. Soc. A 374, 20150057 ( 10.1098/rsta.2015.0057) [DOI] [PubMed] [Google Scholar]
  • 13.James RG, Crutchfield JP. 2016. Multivariate dependence beyond Shannon information. (http://arxiv.org/abs/1609.01233)
  • 14.Albantakis L, Tononi G. 2015. The intrinsic cause-effect power of discrete dynamical systems—from elementary cellular automata to adapting animats. Entropy 17, 5472–5502. ( 10.3390/e17085472) [DOI] [Google Scholar]
  • 15.Emmeche C. 1998. Defining life as a semiotic phenomenon. Cybernetics Human Knowing 5, 3–17. [Google Scholar]
  • 16.Krakauer D, et al. 2014. The information theory of individuality. (http://arxiv.org/abs/1412.2447. )
  • 17.Lewis D. 1974. Causation. J. Philos. 70, 556–567. ( 10.2307/2025310) [DOI] [Google Scholar]
  • 18.Pearl J. 2009. Causality. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 19.Oizumi M, Albantakis L, Tononi G, Sporns O. 2014. From the phenomenology to the mechanisms of consciousness: integrated information theory 3.0. PLoS Comput. Biol. 10, e1003588 ( 10.1371/journal.pcbi.1003588) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Marshall W, Albantakis L, Tononi G.2016. Black-boxing and cause-effect power. (http://arxiv.org/abs/1608.03461. )
  • 21.Wood V, et al. 2002. The genome sequence of Schizosaccharomyces pombe. Nature 415, 871–880. ( 10.1038/nature724) [DOI] [PubMed] [Google Scholar]
  • 22.Kim D-U, et al. 2010. Analysis of a genome-wide set of gene deletions in the fission yeast Schizosaccharomyces pombe. Nat. Biotechnol. 28, 617–623. ( 10.1038/nbt.1628) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Nurse P, Bissett Y. 1981. Gene required in G1 for commitment to cell cycle and in G2 for control of mitosis in fission yeast. Nature 292, 558–560. ( 10.1038/292558a0) [DOI] [PubMed] [Google Scholar]
  • 24.Kominami K-I, Seth-Smith H, Toda T. 1998. Apc10 and Ste9/Srw1, two regulators of the APC–cyclosome, as well as the CDK inhibitor Rum1 are required for G1 cell-cycle arrest in fission yeast. EMBO J. 17, 5388–5399. ( 10.1093/emboj/17.18.5388) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Adamala K, Szostak JW. 2013. Nonenzymatic template-directed RNA synthesis inside model protocells. Science 342, 1098–1100. ( 10.1126/science.1241888) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Nghe P, Hordijk W, Kauffman SA, Walker SI, Schmidt FJ, Kemble H, Yeates JAM, Lehman N. 2015. Prebiotic network evolution: six key parameters. Mol. Biosyst. 11, 3206–3217. ( 10.1039/C5MB00593K) [DOI] [PubMed] [Google Scholar]
  • 27.Smith E, Morowitz HJ. 2016. The origin and nature of life on earth: the emergence of the fourth geosphere. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 28.Kamimura A, Kaneko K. 2010. Reproduction of a protocell by replication of a minority molecule in a catalytic reaction network. Phys. Rev. Lett. 105, 268103 ( 10.1103/PhysRevLett.105.268103) [DOI] [PubMed] [Google Scholar]
  • 29.Tononi G. 2015. Integrated information theory. Scholarpedia 10, 4164 ( 10.4249/scholarpedia.4164) [DOI] [Google Scholar]
  • 30.Rubner Y, Tomasi C, Guibas LJ. 2000. The earth mover's distance as a metric for image retrieval. Int. J. Comput. Vision 4, 99–121. ( 10.1023/A:1026543900054) [DOI] [Google Scholar]
  • 31.Mayner W, Marshall W, Marchman B. 2016. PyPhi: 0.8.1.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table of Mechanisms
rsta20160358supp1.doc (66KB, doc)

Data Availability Statement

This article has no additional data.


Articles from Philosophical transactions. Series A, Mathematical, physical, and engineering sciences are provided here courtesy of The Royal Society

RESOURCES