Skip to main content
Interface Focus logoLink to Interface Focus
. 2011 Sep 29;2(1):126–140. doi: 10.1098/rsfs.2011.0062

Top-down causation and emergence: some comments on mechanisms

George F R Ellis 1,*
PMCID: PMC3262299  PMID: 23386967

Abstract

Both bottom-up and top-down causation occur in the hierarchy of structure and causation. A key feature is multiple realizability of higher level functions, and consequent existence of equivalence classes of lower level variables that correspond to the same higher level state. Five essentially different classes of top-down influence can be identified, and their existence demonstrated by many real-world examples. They are: algorithmic top-down causation; top-down causation via non-adaptive information control, top-down causation via adaptive selection, top-down causation via adaptive information control and intelligent top-down causation (the effect of the human mind on the physical world). Through the mind, abstract entities such as mathematical structures have causal power. The causal slack enabling top-down action to take place lies in the structuring of the system so as to attain higher level functions; in the way the nature of lower level elements is changed by context, and in micro-indeterminism combined with adaptive selection. Understanding top-down causation can have important effects on society. Two cases will be mentioned: medical/healthcare issues, and education—in particular, teaching reading and writing. In both cases, an ongoing battle between bottom-up and top-down approaches has important consequences for society.

Keywords: top down causation, emergence of complexity, adaptive selection

1. Complexity and emergence

Physics underlies all complexity, including our own existence. How is this possible? How can our own lives emerge from interactions of electrons, protons and neutrons?

The basis of complexity is modular hierarchical structures, leading to emergent levels of structure and function based on lower level networks. Each of these aspects (‘modular’, ‘hierarchical’ and ‘structure’) is crucial in the emergence of complexity out of interactions between simpler units [1,2]. The basic principle is that when you have a complex task to perform, you break it up into subtasks that are each simpler than the overall project, requiring less data and less computing power, and assign these tasks to specific modules. Each module is again split up into submodules until you reach a base level where the necessary tasks are simple operations that can be carried out by simple mechanisms. This is the level where the real work is done, each of these components feeding its results into the next higher level components until the desired result emerges at the appropriate higher level. The modules at each level will interact with each other in some way: maybe just statistically, if they all carry out the same task, or maybe in the form of a complex interaction network, when each of them will generally carry out different tasks. The result is a highly structured hierarchy of interacting entities.

A simplified version of the basic hierarchy of complexity and causality for natural systems (left) and for human beings (right) is given in table 1. There is a similar hierarchy for artificial systems [3]—language, mathematics, computers, aircraft, cities, organizations, societal roles, for example—that is not given here. Each level of the hierarchy is made up of interacting modules that are relatively strongly bound internally, with higher frequency and higher energy internal dynamics, interacting with other modules through weaker bonds and lower frequency interaction dynamics. The internal dynamical variables are hidden from outside view; external entities interact with the module through interface variables linked to the internal variables.

Table 1.

The hierarchy of structure and causation. This figure gives a simplified representation of this hierarchy of levels of reality (as characterized by corresponding academic subjects) for natural systems (left) and human beings (right). Each lower level underlies what happens at each higher level, in terms of causation. There is no correlation between the left- and the right-hand columns above the level of chemistry, as emergence and causation are quite different in the two cases; but the first four levels are identical (life emerges out of physics!) For a more detailed description, see http://www.mth.uct.ac.za/∼ellis/cos0.html.

level 8 cosmology sociology/economics/politics
level 7 astronomy psychology
level 6 space, solar system science physiology
level 5 geology, earth science cell biology
level 4 materials science biochemistry
level 3 physical chemistry organic chemistry
level 2 atomic physics atomic physics
level 1 particle physics particle physics

Examples are nuclei in atoms, cells in a human body, individuals in society and subroutines in a computer program. The way they interact with each other at a specific level can be characterized by an interaction network showing which modules interact with which other modules through various possible interaction modes (inter-module forces, or matter, energy and information exchange); this is the structure of the system. Emergence of a higher level system from the lower level modules takes place when reliable higher level behaviour arises out of the lower level actions taking place in the context of this structure, with lower level units grouped together to form higher level modules that one can identify as meaningful entities that persist over time and have identifiable laws of behaviour. Identifying modules in really interaction networks may not be obvious; various algorithms have been devised for this purpose. As implied earlier, a crucial feature is the possibility of recursion: each module may itself be made up of an interacting network of lower level modules (e.g. the description of the hierarchical structure of life in Campbell & Reece [4]).

Each of the different levels of the hierarchy function according to laws of behaviour appropriate to that level, and are describable only in terms of language suited to that level (the concepts that are basic to molecular biology, such as genes and proteins, cannot be described in the language of a particle physicist, such as quarks and gluons). Ideas applicable to lower level causation do not by themselves succeed in explaining the higher level behaviours, for the concepts employed are simply not appropriate to the higher level kinds of causation. Higher level entities, such as plans and intentions, have causal power in their own right, which partially determine what happens at lower levels in the hierarchy (billions of atoms and molecules move in accord with our intentions when we raise our arm). Here, we characterize as level as ‘higher’ when it can be shown to influence another level (‘a lower’ level) by setting a context that guides the way the lower level actions take place.

Multi-fold causation takes place in such systems. A network of causal influences and constraints interact to produce an outcome. In order to understand such systems, we often take for granted most of these influences and concentrate on one of them, which we then label as ‘the cause’, meaning the dominant cause. But a web of influences and multiple causations is in action all the time. Nevertheless, in order to understand what is going on, it is useful to single out particular links in this causal pattern, taking all the rest for granted; indeed we have to do so, for it is not possible to explicitly take into account all causal factors at all levels of the hierarchy. Note that as is discussed later, ‘causal power’ is not restricted to ‘efficient causality’.

There are limits to what can be achieved by bottom-up emergence alone. Self-assembly and self-structuring based on bottom-up action alone can lead to emergence of structures such as crystals and simple biological molecules, or of dynamical systems with attractors leading to entities such as stars and galaxies, and to more complex phenomena such as Bénard cells, patterns associated with the reaction–diffusion equation and sand piles, the dynamics of the Game of Life, properties of slime mould, existence of ant colonies and the behaviour of flocks of birds. These, however, do not extend to truly complex systems such as a single living cell. It seems that developing very complex systems such as those occurring in biology requires top-down causation, needed in order to build up the necessary biological information [5,6]. This information cannot be derived in a bottom-up way, because it implicitly embodies information about environmental niches. It would be different in a different environment. Hence, higher level conditions influence what happens at the lower levels, even if the lower levels do the work. This is what I characterize as top-down causation.

This viewpoint of interacting bottom-up and top-down influences is very helpful in understanding complex systems, and has significant implications for our models of society and consequent social policy. This paper aims to outline the importance of top-down causation in the functioning of complex systems, and particularly in humans. This significance is illustrated in other papers in this special issue. I will in particular make the case that symbolic systems are causally effective variables at the higher levels of causation, when the hierarchy is suitably extended to include such abstract entities (as is necessary in order to satisfactorily represent causal influences emanating from the social level). Two examples will be given to illustrate possible social implications of the viewpoint presented here.

2. Bottom-up and top-down effects

Both bottom-up and top-down causation occur in the hierarchy of structure and causation. Bottom-up causation is the basic way physicists think: lower level action underlies higher level behaviour, for example physics underlies chemistry, biochemistry underlies cell biology and so on. As the lower level dynamics proceeds, for example diffusion of molecules through a gas, the corresponding coarse-grained higher level variables will change as a consequence of the lower level change, for example a non-uniform temperature will change to a uniform temperature. However, while lower levels generally fulfil necessary conditions for what occurs on higher levels, they only sometimes (very rarely in complex systems) provide sufficient conditions. It is the combination of bottom-up and top-down causation that enables same-level behaviour to emerge at higher levels, because the entities at the higher level set the context for the lower level actions in such a way that consistent same-level behaviour emerges at the higher level.

A key concept here is coarse-graining of lower level variables to give higher level variables [7,8], with consequent loss of detailed information, thus enabling higher level behaviour to emerge from lower level properties. One averages lower level properties and thereby determines higher level properties, for example the pressure and density of a gas emerge from the underlying molecular spatial and velocity distribution. The opposite is fine-graining—we look at the situation on finer and finer scales. However, there is not enough information in the coarse-grained view to determine what fine-grained state we will discover—this an essential consequence of the information hiding that occurs when we adopt a higher level view. There are many fine-grained states that can realize any specific coarse-grained state (this is ‘multiple realizability’ of the higher level state, see the next paragraph). Some physicists and philosophers claim that from a fundamental viewpoint, higher levels are ‘nothing more than’ an aggregation of lower level phenomena, i.e. they will all emerge by coarse-graining. However, some higher level causally effective variables cannot be obtained in this way, as they are demonstrably not coarse grainings of lower level variables. To accommodate the effect of these holistic higher level variables, the hierarchy must be understood not simply as one of increasing complexity or scale, but as one of lower to higher levels of causation, whether associated with physical entities or not.

Top-down causation takes place owing to the crucial role of context in determining the outcomes of lower level causation. Higher levels of organization constrain and channel lower level interactions, paradoxically thereby increasing higher level possibilities. A key feature here is multiple realizability of higher level functions, and consequent existence of equivalence classes of lower level variables as far as higher level actions are concerned. An equivalence class identifies all lower level states that correspond to the same higher level state. For example, billions of different micro-states will correspond to the same higher level state of a gas, as characterized by its temperature, pressure and density; the same higher state of a computer, as characterized by gate states and currents flowing in its components can be realized by many different specific electronic states; numerous different molecular configurations give the same functional state of a neuron in a human brain. The possibility of coherent higher level action emerging from the lower level dynamics is based on the principle of equivalence classes: the same higher level state leads to the same higher level outcome, independent of which lower level states instantiates the higher level state. This is the crucial feature characterizing effects as being owing to top-down causation [9]. If different outcomes result from different lower level realizations of the same higher level state, we do not have reliable same level action resulting from top-down influences of the higher levels (the lower level actions do not mesh together to cause reliable higher level behaviour, as is the case in chaotic systems1). One can also consider the possibility of higher level causation by choice among, or ongoing constraint on the future of, lower level phenomena which may not be multiply realizable. However, in practice, there will not be many cases where the higher level state is not multiply realizable: this is a consequence of the huge number of lower level components that make up complex systems, because of the atomic nature of matter and the cellular nature of living beings.

Note that ‘top-down’ causation is an interlevel concept: it applies between any two adjacent levels. Claiming such causation happens is agnostic as to whether there is or is not a topmost or lowest level.

2.1. Characterizing top-down causation

To characterize some specific causal effect as a top-down effect, we must demonstrate that a change of higher level conditions alters the sequence of processes at lower levels; we do this by changing higher level conditions and seeing what happens at the lower levels (for example, we decrease the volume of a gas and see that it makes molecules move faster). We should if possible demonstrate the existence of equivalence classes of lower level effects that give the same higher level outcome (for example, we show how to determine gas density, pressure and temperature from integrals over molecular variables); this is solid evidence that top-down causation is happening, and shows how to coarse-grain lower level variables to obtain higher level effective variables. Emergent higher level behaviour occurs when such higher level variables determine outcomes without any recourse to lower level variables (for example, the gas laws relating pressure, density and temperature are stated purely in terms of higher level variables); the number of lower level states realizing a single higher level state determines the entropy of the system [7,8]. However in some cases, there exist causally effective higher level conditions that cannot be obtained by coarse graining of any lower level variables (for example, the rules of chess control the way chess pieces are allowed to move on a chess board, and thus they are causally effective; but they are not determined by any lower level variables).

2.1.1. Five kinds of top-down causation

Five essentially different classes of top-down causation can be identified, and their existence demonstrated by many real-world examples [2]. They will be discussed in turn.

2.2. Top-down causation 1: algorithmic top-down causation

Algorithmic top-down causation occurs when high-level variables have causal power over lower level dynamics through system structuring, so that the outcome depends uniquely on the higher level structural, boundary conditions and initial conditions. The lower level variables determine the outcome in an algorithmic way from the initial and boundary conditions (for example, the software loaded in a computer) as a consequence of the structural relations (for example, the wiring in a computer or interconnections of neurons); changing these conditions leads to different lower level events and dynamical outcomes. Provided the lower level interactions mesh together in a coherent way, the constrained operation of lower level forces, operating in a law-like/algorithmic way, leads to reliable higher level behaviour whose outcome depends on the nature of the constraints and initial conditions. These are often in the form of networks of interactions [11], usually including recurring network motifs [12]. These are higher level features because they cannot be described in terms of lower level concepts (for example, the specific connections between transistors in a computer cannot be described in terms of properties of electrons) and the system ceases to function if the higher level relationships are disrupted, even though the individual lower level elements are unchanged.

2.2.1. Examples

An excellent example—indeed the present day canonical one—is digital computers: the low-level gates and transistors act in accord with the data and program loaded (word processor, music program, image processing program, etc.), which is a high-level concept whose structure and function cannot be explained in lower level terms. The hardware and software are each hierarchically structured in a symbiotic way so as to allow this higher level functionality [13]. A second example is the way the outcomes of many physical systems are determined by partial differential equations, where the outcome depends on the boundary conditions and initial conditions (see the articles by Denis Noble [14] and Robert Bishop [15]).

The mathematics and the theory underlying these algorithmic effects are varied: it includes dynamical systems theory [16], partial differential equations theory [17], numerical methods such as finite elements [18], statistical physics [19], the analysis of computer algorithms [20], electronic circuit design [21] and the analysis of network motifs [12].

2.3. Top-down causation 2: top-down causation via non-adaptive information control

In non-adaptive information control, higher level entities influence lower level entities, so as to attain specific, fixed goals through the existence of feedback control loops, whereby information on the difference between the system's actual state and desired state is used to lessen this discrepancy. Unlike the previous case, the outcome is not determined by the boundary or initial conditions; rather it is determined by the goals, indeed the whole purpose of such systems is to make initial conditions irrelevant. A different outcome will occur if the goal is changed. Thus, the nature of causality is quite different than the previous case, when feedback control systems are guided by goals, which are higher level entities. This contrasts dramatically with how physics is usually considered to operate, but is fully in accord with engineering and biological precepts. The goals are established through the process of natural selection and genetically embodied, in the case of biological systems, or are embodied via the engineering design and subsequent user choice, in the case of manufactured systems.

2.3.1. Examples

An excellent example is a thermostat controlling the temperature in a room; the goal is set by setting a desired temperature on an input panel. All of biology embodies numerous genetically determined homeostatic systems, based on the principle of feedback control; in particular, this is true of the human body: homeostasis is the key to physiological research [22]. Thus, for example, we have inbuilt bodily systems that interact to maintain body temperature at 98.4 F to high accuracy.

The mathematics involved is linear and nonlinear control systems theory [23], including its applications to the biological context [24].

2.4. Top-down causation 3: top-down causation via adaptive selection

Adaptive processes [25] take place when many entities interact, for example the cells in a body or the individuals in a population, and variation takes place in the properties of these entities, followed by selection of preferred entities that are better suited to their environment or context. Higher level environments provide niches that are either favourable or unfavourable to particular kinds of lower level entities; those variations that are better suited to the niche are preserved and the others decay away. Criteria of suitability in terms of fitting the niche can be thought of as fitness criteria guiding adaptive selection. On this basis, a selection agent or selector (the active element of the system) accepts some of the variations and rejects the rest; these selected entities then form the current system state that is the starting basis for the next round of selection, ultimately leading to the emergence and nature of biological form. A different lower level structure will result if the higher level context is changed.

Thus, this is top-down causation from the context to the system. An equivalence class of lower level variables will be favoured by a particular niche structure in association with specific fitness criteria; if the top-level conditions change, the outcome will change. Unlike feedback control, this process does not attain pre-selected internal goals by a specific set of mechanisms or systems; rather it creates systems that favour the meta-goals embodied in the fitness criteria. This is an adaptive process rather than a control process. It is the way new information is generated that was not present before [5], and enables emergence of complexity with an increase of embodied information, for the process searches the solution space in a way that is not pre-ordained and adapts to the context. The outcome is usually not predictable either from the initial conditions or from the meta-goals, because of the random element involved, although both clearly influence the outcome. This underlies all life, including cells as well as plants and animals, and is the basis for building up biological information—the foundational difference between physics and biology [6].

2.4.1. Example: Darwinian evolution

Darwinian evolution is a specific example: the standard story on the evolution of life is that increasingly complex structures have evolutionary advantages precisely because they constrain the lower level interactions to which they are subjected [4]. Cell walls are the most striking case, but the principle applies from bio-polymers to societies. It results in DNA structuring via adaptive selection over geological timescales, with the meta-goal—the higher level ‘purpose’ that guides the dynamics—being survival of populations of the organism (both higher level concepts). The development of DNA codings (the particular sequence of base pairs in the DNA) occurs through an evolutionary process that results in adaptation of an organism to its ecological niche; the selector is death, and the implicit fitness criterion is survival. A different niche structure results in a different set of genes. As a specific example: a polar bear Ursus maritimus has genes for white fur in order to adapt to the polar environment, whereas a black bear Ursus americanus has genes for black fur in order to be adapted to the North American forest. The detailed DNA coding differs in the two cases because of the different environments in which the respective animals live.

This is a classic case of top-down causation from the large-scale context to detailed biological microstructure—through the process of evolutionary adaptation, the environment (along with other causal factors) fixes the specific DNA coding. There is no way you could predict or explain this coding on the basis of biochemistry or microphysics alone. The survival of the organism is the fitness criterion, leading to existence of all those detailed conditions that must be fulfilled for survival to be assured. This meta-goal is the same for every organism because it is what leads to the existence of populations that fit environmental niches better than competitors. Note that the claim is not that the environment is the only relevant factor; rather that it is one of the causally effective factors. There will always be multiple causal factors, some bottom-up and some top-down; the final result comes from the confluence of these effects. How do you demonstrate this is top-down causation? Change the niche structure (e.g. by changing the global climate), and a different population will adapt to it. The fact that lower level equivalence classes for the same higher level purposes are selected by higher level conditions and resulting niches is demonstrated by many examples of convergent evolution [26] and studies in microbiology [27].

A key feature of the biological world is that similar processes of adaptive selection take place not only on evolutionary time scales, but also on developmental and functional timescales [28]; for example, brain plasticity is based on processes of neuronal group selection that underlie learning on a minute-by-minute basis [29,30]. One should note that the concept is of much wider applicability; however, it can occur in a once-off selection event in physics cases such as state-vector preparation [31]. Repetition increases its effectiveness, but is not necessary to the concept.

The mathematics in general cases is the mathematics of adaptive selection [25], but in specific cases it results in the standard equations of population genetics [32] and molecular evolution [33].

Computational models are given by theories of artificial neural nets [34] and genetic algorithms [35].

2.5. Top-down causation 4: top-down causation via adaptive information control

Adaptive information control takes place when there is adaptive selection of goals in a feedback control system, thus combining both feedback control and adaptive selection. The goals of the feedback control system are irreducible higher level variables determining the outcome, but are not fixed as in the case of non-adaptive feedback control; they can be adaptively changed in response to experience and information received. The overall process is guided by fitness criteria for selection of goals. This allows great flexibility of response to different environments; indeed in conjunction with memory, it enables learning and anticipation and underlies effective purposeful action [36], as it enables the organism to adapt its behaviour in response to the environment in the light of past experience, and hence to build up complex levels of behaviour.

2.5.1. Example

The classical example is associative learning in animals, such as Pavlovian conditioning: an animal responds to a stimulus such as a sound, which is taken as a sign of something else and causes physical reactions implemented by motor neurons [36]. The training is causally effective by top-down action from the brain to cells in muscles. The fitness criterion is avoidance of negative stimuli; change of the associated goals (through a change in the environment) results in change of behaviour. More generally, the mind works by adaptive prediction of what is likely to happen, updated on an ongoing basis [37]. This underlies most of our mental ability. For example, the process of perception is a predictive adaptive process using Bayesian statistics to update the current perception on the basis of prediction errors [38]. This includes prediction of the intention of others, which is the basis of theories of other minds [36].

It is claimed by some that the mathematics of evolutionary game theory [39] will act as an adequate basis for understanding these processes. Personally, I have doubts about how far this can succeed, because of the simplistic nature of the reductive models of human behaviour implied.

2.6. Top-down causation 5: intelligent top-down causation (i.e. the effect of the human mind on the physical world)

Intelligent top-down causation is the special case of feedback control with adaptive choice of goals where the selection of goals involves the use of symbolic representation to investigate the outcome of goal choices. Here, a symbolic system is a set of structured patterns realized in time or space, that is arbitrarily chosen by an individual or group to represent objects, states and relationships [40]. It will generally involve hierarchical structuring and recursion, as is required if it is to be useful in understanding complex situations, and has the potential to enable quantitative as well as qualitative investigation of outcomes.

2.6.1. Example: aircraft design

Plans for a jumbo jet aircraft result in billions of atoms being deployed to create the aircraft in accordance with those plans. This is a non-trivial example: it costs a great deal of money to employ experts in aerodynamics, structures, materials, fuels, lubrication, controls, etc., to design and then to manufacture the aircraft in accordance with those plans. The plan itself is not equivalent to any single person's brain state: it is an abstract hierarchically structured equivalence class of representations (spoken, drawn, in computers, in brains, etc.) that together comprise the design. It is clearly causally effective (the aircraft would not exist without it).

Symbolic representation and choice of goals underlie the causal efficacy of abstract entities such as action plans and the value of money, represented symbolically. Thus, the key feature of this higher level of causation, distinguishing it from the general case of adaptive control systems, is its use of language (spoken or written) and abstract symbolism, extending to the quantitative and geometrical representations of mathematical models. These are all irreducible higher level variables of an abstract nature: they form equivalence classes of representations, inter alia because they can be represented in different languages, and in spoken or written form or in computers. They enable information to be stored and retrieved, classified and selected as relevant or discarded, processed in the light of other information and used to make qualitative and quantitative projections of outcomes and plan future actions in a rational way, altering goals according to an intelligent understanding of past experiences and future expectations. Intentional action then enables one to implement the resulting plans, and so change the physical world. The outcome is thus the result of human agency.

A key feature is the causal power of images and formal and informal causal models of the natural and social worlds, ranging from mental images of what might happen to elaborate quantitative models of physical entities and societies [41]. These abstract entities (which are shared among many minds) play a large part in formulating our understandings and consequent actions, and hence are causally effective in the real world as they help us attain our goals. This is based on subjective personal experience. Language, science, mathematics and myriad other artefacts and customs are social constructions that strongly influence the behaviour of individuals and cultures; these are examples of higher level causal variables that are not coarse-grained lower level variables. Of course, we do not fully understand how the mind is able to plan and make choices resulting in top-down action as discussed here; the fact that we do not know how it works does not affect the fact that we are certain it does happen.

2.6.2. Example: the value of money

Physically, fiat money is just coins or pieces of paper with patterned marks on them. This does not explain its causal significance. The effectiveness of money, which can cause physical change in the world such as the construction of buildings, roads, bridges and so on by top-down action of the mind to material objects, is based on social agreements that lead to the value of money and exchange rates. These are abstract entities arising from social interaction over an extended period of time, and are neither the same as individual brain states, nor equivalent to an aggregate of current values of any lower level variables (although they are causally effective through such states and variables).

2.6.3. Example: roles, expectations and values

Roles are socially determined abstract entities that are causally effective in structuring society. They are a key aspect of the causal power of social structures [42]. Roles are developed by an adaptive process that is a combination of bottom-up and top-down interactions between society and the individuals who make up the society. They are then inculcated into the individual by top-down social processes, whereafter they become a core feature of individual psychology in relation to society, together with expectations guiding the choice of goals and actions and hence being causally effective in a top-down way from the mind to the body [36]. Thus, our understandings of meaning and purpose are abstract entities that form a high level in the hierarchy of causation in the mind. The imperative to search for meaning is a key aspect of human nature [43], without which the entire edifice of science would not exist.

Roles embody social values, which, together with individual values relating to life purpose, guide the individual and communal choice of goals and the methods used to attain these goals. Thus, the highest level adaptive goals are values related to ethics, aesthetics and meaning, which are all causally effective in a top-down way by determining the nature of desirable and/or acceptable lower level goals. They are a set of abstract principles that are causally effective in the real physical world; indeed they crucially determine what happens. For example, wars will be waged or not depending on ethical stances; large-scale physical devastation of the Earth will result if thermonuclear war takes place, and so the nature of our values has crucial effects on the way human activity impacts on society and the world. Values are irreducible higher level entities: there is no way they can result from coarse-graining of lower level variables.

Are there useful mathematical models of human behaviour? Many have tried to make such models. In addition to the claims of games theorists and mathematical models of evolutionary processes [44], the main thrust has been that of mathematical economics and financial mathematics, involving, for example, the Brownian motion model of financial markets, rational pricing assumptions and the Black–Scholes model [45,46]. Any such models can, however, only be partly successful, giving correct predictions some of the time; their unthinking use to guide policy can lead to disaster, as happened in the global financial crisis of 2008–2009 [47]. All such models should be treated with great caution.

2.7. Complex adaptive systems

The last three classes of top-down causation are all examples of complex adaptive systems [25,48]. These systems are the only way that biological information can be generated and incorporated into the living systems. The importance of adaptive selection is because it can let a system adapt to ongoing changes in the environment, indeed it is the only way of doing so [49]. It is also the key to the way life can apparently violate the second law of thermodynamics. Adaptive selection can accumulate structure and information by selecting a subset of entities from a set of many variants, selecting only those lower level states that correspond to a higher level selection principle, thus embodying a form of top-down action.

This is an analogue of Maxwell's demon: a micro-entity that chooses molecules with high energy from an ensemble of molecules in a container, and lets them pass a trapdoor into a compartment, thus heating up the gas in the compartment and so locally violating the second law of thermodynamics, as negligible energy is used in the selection [50]. The second law remains globally valid because of the entropy increase in the environment. Darwinian selection in effect envisages a macro demon who acts down to the molecular level to select a specific sequence encoding desirable genetic information from an ensemble of nucleic acids. This again enables a local violation of the second law. Note that this process can take place once-off: in biology, it is repeated many thousands of times, but in physical systems it may occur only once—for example, in the process of state vector preparation in quantum physics [31].

2.8. Bottom-up and top-down causation

It is the combination of these different kinds of top-down action with bottom-up causation that enables true complexity to emerge, particularly because feedback loops are set up between lower and higher level variables. For example, social constructions such as roles influence individual behaviour [51], but they are not fixed: they are also subject to evolution, through bottom-up action of individuals in society on longer timescales.

3. Aristotle's forms of causation

Reductionist analysis ‘explains’ the properties of the machine by analysing its behaviour in terms of the functioning of its component parts (the lower levels of structure). Systems thinking tries to understand the properties of the interconnected complex whole [52,53], and ‘explains’ the behaviour or properties of an entity by determining its role or function within the higher levels of structure. For example, the question: ‘Why is an aircraft flying?’ can be answered,

  • — in Bottom-up terms: it flies because air molecules impinge against the wing with slower moving molecules below creating a higher pressure as against that owing to faster moving molecules above, leading to a pressure difference described by Bernoulli's Law, and this counteracts gravity;

  • — in terms of same-level explanation: it flies because the pilot is flying it, after a major process of training and testing that developed the necessary skills, and she is doing so because the airline's timetable dictates that there will be a flight today at 16.35 h from London to Berlin, as worked out by the airline executives on the basis of need and carrying capacity at this time of year;

  • — in terms of top-down explanation: it flies because it is designed to fly! This was done by a team of engineers working in a historical context of the development of metallurgy, combustion, lubrication, aeronautics, machine tools, computer-aided design, etc. all needed to make this possible, and in an economic context of a society with a transportation need and complex industrial organizations able to mobilize all the necessary resources for design and manufacture. A brick does not fly because it was not designed to fly; and

  • — why was it designed to fly? Because it will potentially make a profit for the manufacturers and the airline company! Without the prospect of that profit, it would not exist. This is the topmost cause for its existence.

These are all simultaneously true non-trivial explanations; the plane would not be flying if they were not all true at the same time. The higher level explanations involving goal choices rely on the existence of the lower level explanations involving physical mechanisms in order that they can succeed, but are clearly of a quite different nature than the lower level ones, and are certainly not reducible to them nor dependent on their specific nature. The higher level goals can be realized in multiple ways. The bottom-up kind of explanation would not apply to a specific context if the higher level explanations, the result of human intentions, had not created a situation that made it relevant.

This situation was captured by Aristotle in his Metaphysics [54] through his proposal of four different kinds of causation. According to Falcon [55], they are:

  • — the material cause: ‘that out of which’, e.g. the bronze of a statue;

  • — the formal cause: ‘the form’, ‘the account of what-it-is-to-be’, e.g. the shape of a statue;

  • — the efficient cause: ‘the primary source of the change or rest’, e.g. the artisan, the art of bronze-casting the statue, the man who gives advice; and

  • — the final cause: ‘the end, that for the sake of which a thing is done’, e.g. health is the end of walking, losing weight, purging, drugs and surgical tools.

The last is a teleological explanation—an explanation that makes a reference to telos or purpose. Additionally, circular causation is possible: things can be causes of one another—a relation of reciprocal influence.

I suggest a modern adaptation of these four kinds of causes is to consider causation in the hierarchical context considered here, identifying as especially significant to the immediate lower level (Physical) cause, the same-level (Immediate) cause, the immediate higher (Contextual) cause and the topmost level of purpose or Telos, which activates the rest. We cannot identify an ultimate lower level cause because no one knows what the bottom level is (we have no fully successful ultimate theory of particle physics).

3.1. Example: physics experiments

Successful completion of a physics experiment, such as observing particle production in the Large Hadron Collider, involves all these forms of causation. The physical cause is the particle interactions that lead to the production of new particles. The immediate cause is that the experimenters turn the accelerator and measuring equipment on at a particular time. The contextual cause is that the collider was designed and manufactured so that the collisions would take place and outcomes could be observed. The purpose might simply be that the experimenters want to understand the collision in the context of a theory of anti-de Sitter/conformal field theory duality, or it might be because they aspire to attaining a Nobel Prize.

Thus, one can have top-down system explanations as well as bottom-up and same-level explanations, all being simultaneously applicable. Indeed, there will be numerous causal factors in any specific case forming a network of causes, including those identified here, but also the overall historical and physical and social environment, without which the identified events would not take place (for example, the laws of physics are as they are, the Earth exists, scientists are able to do experiments, measuring apparatus can be devised reliably and so on). An explanation usually takes most of this for granted and focuses just on one or two items that are the subject of attention, perhaps because they can be manipulated to alter the result.

The key point about causality in real-world contexts, then, is that simultaneous multiple causality (inter-level, as well as within each level) is always in operation in complex systems. Claiming that any specific single cause is the only causation in action is fundamentally misleading, as it ignores the complex nature of the real causal web [56].

4. The causal efficacy of human symbol systems

Some aspects of complex systems are emergent from their own internal nature and logic, but others rather arise from the way the nature of the external environment shapes human symbolic systems. A crucial point then is that the resulting higher level causal entities are not of a physical nature and are not reducible to or emergent from any physical entities. They do however have causal power. They are derived from the behaviour of the world around us and mirror the way that world behaves; hence they arise from the reality of the external world, which exists independent of the mind. They are thus not coarse-grained lower level variables. They are discovered and comprehended by the mind; so the top-down influence of operations at the level of symbolic systems is via intelligent top-down causation (TDC5).

4.1. Example: mathematics comprehension and utilization

This is a case of top-down causation from a world of mathematical abstractions to the human mind, being realized in details of neuronal connections, and then into the real world where it is causally effective both in terms of creating patterns on paper, and by underlying engineering and planning. Major parts of mathematics are discovered rather than invented (rational numbers, zero, irrational numbers and the Mandelbrot set being classic examples). They are not determined by physical experiment, but are rather arrived at by mathematical investigation. They have an abstract rather than embodied character; the same abstract quantity can be represented and embodied in many symbolic and physical ways, and these representations form an equivalence class. The underlying mathematical truths are independent of the existence and culture of human beings [57,58];2 it is plausible that the same features will be discovered by intelligent beings in the Andromeda galaxy as here, once their mathematical understanding is advanced enough (which is why these features are advocated as the basis for interstellar communication).

These features are discovered by humans, and represented by our mathematical theories; that representation is a cultural construct, but the underlying mathematical features they represent are not—indeed like physical laws, they are often unwillingly discovered, for example, the irrationality of √2 and the number π. These mathematical verities are causally efficacious through the actions of the human mind: one can, for example, print graphic versions of the Mandelbrot set in a book, resulting in a physical embodiment in the ink printed on the page.

4.2. Example: physics theories

Maxwell's theory of electromagnetism (an abstract entity, described by Maxwell's equations, see [60]) led to the development of radio, and then to existence of cell phones, TV and so on, based on manipulation of physical materials composed of atoms and electrons. Maxwell's theory is not a physical entity, nor is it the same as any single person's brain state. It can be represented in many ways (on blackboards, in print, on computer screens, in spoken words) and in many formalisms (via three-dimensional vectors or four-dimensional tensors, for example). These various representations together form an equivalence class, as they all lead to the same predicted outcomes. How do you demonstrate top-down causation by this theory? Design an artefact such as a cell-phone through the use of Maxwell's theory, and then construct it and operate it. The abstract theory will have altered physical configurations in the real world, and hence is causally effective. The theory is an irreducible higher level entity (it cannot be derived by coarse-graining any lower level variables) representing the nature of physical reality, in that is a representation of physical laws of behaviour that are eternal and omnipresent (physics is the same everywhere in the universe).

It is the accuracy of this representation of the way the world works that gives the theory its causal powers: it is demonstrably a good representation of the underlying physical reality (namely, the consistent regularities in the behaviour of matter that underlies what happens in the physical universe). Hence in this way, the causal regularities in the physical world can be represented as a set of abstract patterns, resulting in a mental theory that is causally efficacious. It is the underlying regularities in the behaviour of physical matter, independent of any human comprehension or mind, that is the ultimate source of this causal efficacy, and hence in this way (enabled by TDC5) has causal powers in the physical world, for example, by underlying engineering practice (as well by governing the physical behaviour of matter).

5. Room at the bottom

Where does the causal slack lie enabling top-down action to take place? If the underlying physics is deterministic and determines all physics outcomes at the lower level, how is there freedom for higher level causation to be efficacious? Three key features are relevant.

Firstly, in considering specific physical and biological systems, it lies partly in the structuring of the system so as to attain higher level functions—for example, the specific connections in a computer (which could have been different) act as constraints on lower level dynamics, thus channelling how they function; and partly in the boundary conditions together with openness of the system: new information can enter across the boundary and affect local outcomes. Together these features set the environment in which the lower level components operate, and so determine their outcomes (e.g. Denis Noble's article [14]).

Secondly, top-down causation can change the nature of the lower elements. There is not just a situation of invariant lower level elements obeying fixed physical laws; rather we have the nature of lower level elements being changed by context. Often this ensures that the lower level elements function so as to fulfil higher level purposes: this is an aspect of adaptive selection. Thus, the nature of micro-causation is changed by top-down processes, profoundly altering the mechanistic view of how things work.

5.1. Example: cell differentiation

Through the processes of developmental biology, cells get differentiated to perform specific functions; this changes their nature relative to other cells in an adaptive way [28]. Cells differentiate into neurons that get adapted to their location in the brain, into muscle cells adapted to their role in the heart and so on. They each develop so as to fit into their allotted role in the body, creating the body and its biological form as they do so, and are then fine-tuned for their function. A particular case is the development of sensory neurons [36] out of pluripotent cells.

5.2. Example: humans in society

Individual minds develop in the context of their interactions with other minds, and brain development cannot be understood outside this context [61]. Individuals are shaped by society so that they fit into that society—for example, learning a specific language and a variety of societal roles and expectations [51]. This is top-down causation from the society to the individual, and indeed to their synaptic connections: their brain is adapted to fit into the society in which it lives [62].

Thirdly, the required freedom lies in micro-indeterminism (random outcomes of microphysical effects), combined with adaptive selection: random outcomes at the micro-level allow variation at the macro-level, which then leads to selection at the micro-level, but based on macro-level properties and meaning. Statistical variation and quantum indeterminacy provide a repertoire of variant systems that are then subjected to processes of adaptive selection, based on higher level qualities of the overall system.

For this to work, one needs amplifying mechanisms in order to attain macroscopic variation from microscopic fluctuations. Some physical systems (such as photomultipliers and the human eye) amplify quantum effects to a macroscopic scale; classically, chaotic systems can amplify micro-fluctuations in initial data; some of the effects captured in Thom's catastrophe theory allow large amplification of microscopic changes and some molecular biology processes (for example, involving replication of mutated molecules) act as such amplifiers, even allowing quantum effects to change evolutionary outcomes [63]. Because of quantum uncertainty, at a profound level, the universe is indeterministic, allowing the needed causal slack for higher levels to be free of the tyranny of absolute control by lower level dynamics; and this affects biological processes [64,65]. By itself that does not lead to emergence of higher level order, but it does allow this to occur through the process of adaptive selection. That adaptive selection process will act on equivalence classes of lower level variables (see Jaeger [27]), guided by higher level selection principles.

Because of the existence of random processes at the bottom, there is sufficient causal slack to allow all these kinds of causation to occur without violation of physical causation, for example, developmental biology amplifies molecular-level variation to system-level changes. That these random processes do indeed occur at the lower levels is indicated by many kinds of evidence [6668]. This mechanism can only work because of the large gap between macro- and microphysics, together with the huge number of micro-components involved (atoms in a cell, cells in a human body, etc.): hence, emergence of genuine complexity requires the vast numbers of entities entailed in physical reality.

5.3. Criticism and response

In response to the earlier-mentioned about the causes leading to an aircraft flying, Tim O'Connor (2011, personal communication) has commented as follows: note that the higher level explanations appeal to (intentional) states long prior to the plane's flying. That is, the explanation works by setting the event to be explained in a larger spatio-temporal context. The reductionist might retort: if those prior intentional states are themselves wholly fixed by more fundamental physical facts that compose them, we could have in principle a completely physical (bottom-up) explanation spanning each step of the larger context to which you point. This would be an explanation wholly independent of high-level intentional explanations—an appeal to facts that are themselves collectively responsible for their being a coexisting intentional level of explanation. Taking the widest scope possible (the universe as a whole), the fundamental physical facts and the laws that directly govern them asymmetrically determine the existence of higher level systems and the forms of explanation they make possible. Or, at any rate, it is not clear that anything to which this paper appeals conflicts with this assertion. And if that is correct, then why is there not a perfectly good sense in which all action takes place down below?

As I understand it, this proposes that the larger spatio-temporal context of cosmology sets initial data that completely determines the present-day situation and so explains all current lower and higher levels. My answer is twofold.

Firstly, because of quantum uncertainty, such a proposal to explain present-day details in terms of cosmological initial data cannot work even in principle: For example, quantum fluctuations can change the genetic inheritance of animals [63] and so influence the course of evolutionary history on Earth. Indeed that is what occurred when cosmic rays—whose emission processes are subject to quantum uncertainty—caused genetic damage in the distant past [69]. Consequently, the specific evolutionary outcomes on life on Earth (the existence of dinosaurs, giraffes and humans) cannot even in principle be uniquely determined by causal evolution from conditions in the early universe, or from detailed data at the start of life on Earth. Quantum uncertainty prevents this, because it significantly affected the occurrence of radiation-induced mutations in this evolutionary history. The specific outcome that actually occurred was determined as it happened, when quantum emission of the relevant photons took place: the prior uncertainty in their trajectories was resolved by the historical occurrence of the emission event, resulting in a specific photon emission time and trajectory that was not determined beforehand, with consequent damage to a specific gene in a particular cell at a particular time and place that cannot be predicted even in principle. If our own existence cannot uniquely follow from that initial data, neither can any specific thoughts or intentions.

Secondly, if we disregard this impossibility, we are in effect faced with the proposal that the future occurrence of the battle of Trafalgar, the painting of the Mona Lisa and the discovery of general relativity theory by Albert Einstein are specifically written into the fluctuations on the last scattering surface in the early universe that we now observe through the Wilkinson microwave anisotropy probe satellite. I believe this is patently absurd. The only way these outcomes could have happened is for genuine higher level causal powers to have come into being with their own inherent logic, these then lead to these extraordinary outcomes (inter alia causing electrons and protons to move in brain in ways essentially determined by higher level causal factors). There is no way that they could be implied by physics per se.

Finally, O'Connor (2011, personal communication) suggests that although it has to be conceded that biological top-down causation in cell differentiation does show that some non-fundamental levels of causes and explanations are not independent of those above them, the reductionist will claim that whatever the fundamental physical facts and laws turn out to be will be independent of higher level entities that they make possible. However, according to the best current ‘theory of everything’, namely M-theory, the particles and forces that exist are not uniquely determined by fundamental laws, but rather result from the specific string vacuum state that occurs [70]. In that case, a purely bottom-up explanation cannot work: the very nature of the effective laws of physics is environmentally dependent. It has been claimed by Susskind [71] that the existence of intelligent life in the context of a multi-verse thereby gives an explanation of why the laws of physics we experience are as they are—it is a selection effect resulting from our existence as observers in a multi-verse. This is indeed a radical revision of the old idea of immutable physical laws underlying all physical causation.

A paper by Purves et al. [72], and comments by Simon Saunders (S. Saunders 2011, personal communication), express the view that in the end, one must acknowledge that it is still a fact that the essential work is happening at the microlevel, whatever contextual effects may occur. Yes indeed; but by itself that action does not determine the outcomes, even when put in the full historical context, as just explained. I believe the above response to O'Connor, together with the other arguments in this paper, suffices to respond to these comments: for they show that while the lower levels do the work, the higher levels decide what is to be done. And it is just as well we do not have to understand the lowest level in order to make higher level predictions, because we do not even know what the lowest level is.

6. Implications

Understanding top-down action can have important effects on society. Two cases will be mentioned here: medical/healthcare issues and education—in particular as regards language issues. In both cases, an ongoing battle between bottom-up and top-down approaches has important consequences for welfare in society.

6.1. Health: the welfare of the body

There are in essence competing bottom-up (reductionist) and top-down (holistic) views of how to look after illness and disease, resulting in different treatment modalities. Bottom-up methods emphasize the role of microsystems in determining health outcomes, whereas the top-down methods place the emphasis on the ways that the operations of the mind can affect bodily health.

The bottom-up emphasis leads to treatment regimes that place their faith in medicines, drugs and surgery; they emphasize the view that the way to heal the whole is to treat the parts. Top-down methods acknowledge that state of mind plays a significant role in physical health, and so takes that into account too. There are many ways that such top-down influences take place, an important one being the fact that many immune molecules are also neuro-modulators, allowing the mind to influence physical health through interaction of the brain and the immune system [73]. This is part of the way the mind influences health [74]; indeed, overall mental state crucially affects physical health. Some examples include: small infants depend on a caring adult for survival; they may suffer developmental problems, an array of emotional problems and perhaps failure to thrive if deprived of such care [75]. In extreme cases, they may lose weight and eventually die if they are not given individual attention and caring, even if all their bodily needs are met. Treatment paradigms should take this into account.3 Children's survival rates after heart surgery are affected by whether or not they have animal companions [76]. In a family context, survival rates depend on family communication patterns [77].

As regards specifically mental health, there are competing paradigms of how to treat mental problems, represented by neurology, psychiatry, clinical psychology, psychotherapy and cognitive behaviour therapy, the first being essentially bottom-up and the latter three top-down, with psychiatry in the middle. There may be different cases where one or other of these approaches is more appropriate. The challenge is to determine what combination of bottom-up and top-down interventions will give the best results in specific cases [78].

6.2. Education: learning to read and write

Closely related issues arise in education, with literacy teaching and learning providing an apt example. A bottom-up (part to whole) approach tackles detailed technical aspects of language such as phonics and handwriting first, and worries about the functional, communicative roles later, with the assumption that the parts will then come together in some kind of ‘building block’ way, to form a meaningful whole. In this case, if pursued to an extreme, children are first given decontextualized exercises that ask them to recognize and sound out numerous letter—I sound combinations such as ma, me, mi ma, mu, followed by phonetically regular words and even nonsense words such as tok, zat and fot. Only once children have accomplished learning this and other restricted texts, are they given opportunities to move on to engage with meaningful texts. The mechanics of language is thus taught separately from and prior to the essential purpose of language, which is to make and convey meaning. Crucially, testing of children is done that involves recognition of meaningless words and hence omits the core functioning of language. Dehaene makes this explicit in his book ([79], p. 200):

the child's brain, at this stage, is attempting to match the general shape of the words directly onto meaning, without paying attention to individual letters and their pronunciation—a sham form of reading.

He defines reading the wrong way round, thereby misinterpreting the learning process! He wants the parts to work rather than the whole, and characterizes as sham reading, what is in fact both the aim of fluent readers, and the way that young children learn language. What he deplores is precisely what we want children to learn: to read from the whole, not the parts. The testing involved in associated reading programmes is of the same nature as this quote indicates: it penalizes attempts at conventional reading practices.

By contrast, one can proceed top-down, starting with the aim of conveying the idea that symbols convey meaning, showing children how this works in practice by immersing them in a reading culture, while at the same time encouraging them into a process of experimentation and successive approximation, as they come to adjust initial formally conventional symbol usage to more conventional symbolic use over time. In this way, ability to read and write emerges in a way similar to how babies learn oral language, by a trial-and-error process with feedback, but always concentrating primarily on the way that language is essentially about conveying meaning [8082]. A beautiful illustration of how this occurs for young children's writing is given in Bloch [83]. In this case, holistic understanding and meaning are the primary educational themes, and detailed issues such as grammar and syntax (and the other component parts) are tackled within the context of the whole. This understanding of reading as a psycholinguistic process leads to the whole language approach to teaching reading and writing.

This is a crucial ongoing debate with major practical implications for education. The bottom-up approach centred on grasping phonemes first is strongly supported inter alia by Adams [84] and Dehaene [79]. The holistic view [85] strongly supports the top-down approach: children are inherently motivated to make sense of the complexities of their world, learning in a seemingly messy way that draws on all their knowledge and strengths to integrate bottom-up and top-down understandings predictively in order to do justice to this complexity. This is what happens in apprenticeships. It is also what happens for many children who grow up in well-resourced, literate homes. Because the conditions of learning described by Cambourne [86] are appropriate, and they are immersed in story-reading and play with written language, such children learn, as if by osmosis, many of the essential ‘concepts of print’ that are neglected in the first years of primary school. Children from home backgrounds which do not provide such experiences, such as in many African settings [87], find themselves flailing when their introduction to print concentrates on the technicalities alone. Many tend to lose interest in what they see as meaningless activities and this results in a lesser ability to read in a successful way.

This ties into a much larger picture of how the brain functions in a top-down way. The brain is exquisitely constructed to search for meaning [61] and to predict what is likely to happen [37]. This happens particularly in vision: it is not true that vision can be understood simply as data coming in from our eyes and being interpreted by the brain. Rather the brain is continually predicting what ought to be there, and filling in what it expects to see on the basis of only some of the data that it actually analyses at any one time. This can conclusively be shown to be the case by analysing visual illusions [38,88]. Hence, a top-down process of interpretation, based on our expectations and facilitated by specific neuronal connections, modulates and shapes what we actually see. Similar processes happen in listening to music: expectation is a key feature of how we experience music [89,90].

This is a crucial aspect of the way we read a text, which does not take place by reading each phoneme, assembling them into words, assembling those into phrases and so on. Rather the eye skips over words, reading whole phrases at a time and filling in the bits that are not actually read. This can be demonstrated by miscue analysis and eye movement research [9193]. You can experience it for yourself by carrying out the following simple exercise: read the following statement once or twice, and as you do so count the number of times the letter ‘F’ appears:

FINAL FOLIOS SEEM TO RESULT FROM YEARS OF DUTIFUL STUDY OF TEXTS ALONG WITH YEARS OF SCIENTIFIC EXPERIENCE

How many times does F occur in this statement—5? 6? 7? The answer is given at the end of the conclusion.

Our ability to read ambiguous texts derives from the fact that context sets the meaning and even the pronunciation of words: language is driven by word associations rather than individual words [94]. Consequently, context drives the process of reading: it is not bottom-up, it is a psycholinguistic guessing game [95]. This top-down-driven process is a fundamental aspect of how the brain works, and is at the core of what reading is about. Because brain imaging studies have not been conducted on this holistic reading process, they do not yet have the capacity to show whether or not the phonics approach to teaching reading is superior [93].4

6.3. Other applications

Obviously, these contested themes have not been dealt with in adequate depth here. I have included the two examples in this section in order to indicate that there is evidence demonstrating that the themes of this article might have significant practical consequences. This suggests that this possibility must be taken seriously also in regards to other aspects of education, sometimes determining educational success and other aspects of welfare, which is not determined in a simple bottom-up way.

7. Conclusion

7.1. Interaction of top-down and bottom-up causation as an integrating theme

The basic theme of this paper is that it is the interaction of bottom-up and top-down effects that enable the emergence of true complexity. This realization gives an integrative view of the relation between different levels of the hierarchy of complexity, both within each subject (particle physics to nuclear physics, nuclear physics to atomic physics and so on; also between each of the levels of biology) and between subjects (physics to chemistry, chemistry to biology, psychology to sociology and so on).

It is clear that top-down causation as envisaged here happens at the levels of physiology [96] and in the case of vision [38,97]. Indeed, there is a great deal of data supporting this view as regards the way the mind works in general, including the rapidly growing literature on cultural neuroscience [62,98,99,100]. This therefore supports the case made by Elder-Vass that it happens in society [42]. It can be demonstrated to happen in other cases—for example, in microbiology [27] and fluid flows [15]. It also plays a significant role in quantum physics, see [31].

In brief: the conclusion is that there are other forms of causation than those encompassed by bottom-up causation due to physics and physical chemistry processes considered on their own. The higher up the ladder of complexity one goes, the clearer this becomes; in particular, it is undeniable as regards the way the human mind operates, and in physiology. It is particularly clear in those cases where the higher level variables cannot be obtained by coarse-graining of any lower level variables, as is the case both in the causal power of money, and the causal effectiveness of Maxwell's theory of electromagnetism in the world of engineering.

A full scientific view of the world must recognize this fact, or else it will ignore important aspects of causation in the real world, and so will give a causally incomplete view of things. These forms of causation are based on the interaction of bottom-up and top-down effects: if we neglect either, we will be unable to understand genuinely complex systems.

7.2. Answer

The correct answer to the question in the last section (how many times does the letter F occur in the statement there?) is 8. If you got this stunningly simple algorithmic task wrong (as most people do), it is because we do not read a sentence word by word, phoneme by phoneme: the mind skips words and interpolates when reading! You do not even see the words ‘of’ in this text, because the mind takes them for granted and does not read them.5 This is an experimental proof of the fact that we read in a top-down way.

Acknowledgements

I thank Tim O'Connor and three referees for helpful comments that have considerably improved this paper, and Carole Bloch for crucial input as to how the reading process actually takes place. I thank the National Research Foundation (NRF), South Africa, and the University of Cape Town for support.

Footnotes

1

The relevant diagrams showing how this happens are given in Butterfield's [10] paper in this special issue.

2

Penrose and Connes propose that this makes best sense if one considers mathematics as existing in a Platonic sense [57,58]. I concur [59], but this is of course a philosophically contentious position.

3

In the 1970s, Dr Trudi Thomas ran a scheme called Sunshine Wards as St Matthew's Hospital, Keiskammahoek, to provide emotional contact to malnourished orphans in the Ciskei; this resulted in improved survival rates. This is described in her book Their Doctor Speaks (privately published: 1973). A current version of therapy for newborn children based on the same principles is called Kangaroo care; see http://en.wikipedia.org/wiki/Kangaroo_care for articles giving evidence of its medical benefits.

4

Some books (for example, [79]) claim the opposite on the basis of neuroscience evidence, including brain imaging data. However, these data do not in fact prove what they claim. The experiments that are claimed to lead to this conclusion do not test what is going on in the brain when meaningful reading is taking place; they relate only to reading meaningless texts or to some components of the reading process. This is not an adequate data from which one can make pronouncements about the genuine reading process as a whole, where meaningful texts are read and understood.

5

I am indebted to Kevin Dutton's intriguing book Flipnosis [101] for this example.

References

  • 1.Booch G. 1994. Object oriented analysis and design with applications. New York, NY: Addison Wesley [Google Scholar]
  • 2.Ellis G. F. R. 2008. On the nature of causation in complex systems. Trans. R. Soc. South Africa 63, 69–84 See http://www.mth.uct.ac.za/~ellis/Top-down%20Ellis.pdf [Google Scholar]
  • 3.Simon H. A. 1992. The sciences of the artificial. Cambridge, MA: MIT Press [Google Scholar]
  • 4.Campbell N. A., Reece J. B. 2005. Biology. San Francisco, CA: Benjamin Cummings [Google Scholar]
  • 5.Kuppers B.-O. 1994. Information and the origin of life. Cambridge, MA: MIT Press [Google Scholar]
  • 6.Roederer 2005. Information and its role in nature. Berlin, Germany: Springer [Google Scholar]
  • 7.Penrose R. 1989. The emperor's new mind: concerning computers, minds and the laws of physics. New York, NY: Oxford University Press [Google Scholar]
  • 8.Penrose R. 2005. The road to reality: a complete guide to the laws of the universe. New York, NY: Knopf [Google Scholar]
  • 9.Auletta G., Ellis G. F. R., Jaeger L. 2008. Top-down causation by information control: from a philosophical problem to a scientific research program. J. R. Soc. Interface 5, 1159–1172 10.1098/rsif.2008.0018 (doi:10.1098/rsif.2008.0018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Butterfield J. 2012. Laws, causation and dynamics at different levels. Interface Focus 2, 101–114 10.1098/rsfs.2011.0052 (doi:10.1098/rsfs.2011.0052). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Barabási A.-L., Oltvai Z. N. 2004. Network biology: understanding the cell's functional organization. Nat. Rev. Genet. 5, 101–114 10.1038/nrg1272 (doi:10.1038/nrg1272) [DOI] [PubMed] [Google Scholar]
  • 12.Alon U. 2007. An introduction to systems biology: design principles of biological circuits. London, UK: Chapman and Hall/CRC [Google Scholar]
  • 13.Tanenbaum A. S. 1990. Structured computer organisation. Englewood Cliffs, NJ: Prentice Hall [Google Scholar]
  • 14.Noble D. 2012. A theory of biological relativity: no privileged level of causation. Interface Focus 2, 55–64 10.1098/rsfs.2011.0067 (doi:10.1098/rsfs.2011.0067) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bishop R. C. 2012 Fluid convection, constraint and causation. Interface Focus 2, 4–12 10.1098/rsfs.2011.0065 (doi:10.1098/rsfs.2011.0065) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Busemeyer J. R. ‘Dynamic systems’ To appear in encyclopedia of cognitive science. New York, NY: Macmillan; 2011. See http://www.cogs.indiana.edu/Publications/techreps2000/241/241.html . [Google Scholar]
  • 17.Courant R., Hilbert D. 1962. Methods of mathematical physics, II. New York, NY: Wiley-Interscience [Google Scholar]
  • 18.Brenner S. C., Scott L. R. 2007. The mathematical theory of finite element methods. Heidelberg, Germany: Springer [Google Scholar]
  • 19.Huang K. 1990. Statistical mechanics. New York, NY: Wiley, John & Sons [Google Scholar]
  • 20.Knuth D. E. 2010. Selected papers on design of algorithms. Stanford, CA: Center for the Study of Language and Information [Google Scholar]
  • 21.Jaeger R. 1997. Microelectronic circuit design. New York, NY: McGraw-Hil [Google Scholar]
  • 22.Rhoades R., Pflanzer R. 1989. Human physiology. Fort Worth: Saunders College Publishing [Google Scholar]
  • 23.DiStefano J. J., Stuberub A. R., Williams I. J. 1995. Feedback and control systems. McGraw Hill: Schaum Outlines [Google Scholar]
  • 24.Iglesias P. A., Ingalis B. P. 2010. Control theory and systems biology. Cambridge, MA: MIT Press [Google Scholar]
  • 25.Holland J. H. 1992. Adaptation in natural and artificial systems. Cambridge, MA: MIT Press [Google Scholar]
  • 26.Conway Morris S. 2005. Life's solution: inevitable humans in a lonely universe. Cambridge, UK: Cambridge University Press [Google Scholar]
  • 27.Jaeger L., Calkins E. R. 2012. Downward causation by information control in micro-organisms. Interface Focus 2, 26–41 10.1098/rsfs.2011.0045 (doi:10.1098/rsfs.2011.0045) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gilbert S. F. 2006. Developmental biology. Sunderland, MA: Sinauer Associates [Google Scholar]
  • 29.Edelman G. M. 1989. Neural Darwinism: the theory of group neuronal selection. Oxford, UK: Oxford University Press [Google Scholar]
  • 30.Ellis G. F. R., Toronchuk J. A. 2005. Neural development: affective and immune system influences. In Consciousness and emotion: agency, conscious choice, and selective perception (eds Ellis R. D., Newton N.), pp. 81–119 Philadelphia, PA: John Benjamins. [Google Scholar]
  • 31.Ellis G. F. R. 2011. On the limits of quantum theory: contextuality and the quantum-classical cut. See http://arxiv.org/PS_cache/arxiv/pdf/1108/1108.5261v2.pdf
  • 32.Gillespie J. H. 2004. Population genetics: a concise guide. Baltimore, MD: Johns Hopkins University Press [Google Scholar]
  • 33.Kuppers B.-O. 1994. Information and the origin of life. Cambridge, MA: MIT Press [Google Scholar]
  • 34.Bishop C. M. 1999. Neural networks for pattern recognition. Oxford, UK: Oxford University Press [Google Scholar]
  • 35.Mitchell M. 1998. An introduction to genetic algorithms (complex adaptive systems). Cambridge, MA: MIT Press [Google Scholar]
  • 36.Gray P. 2011. Psychology. New York, NY: Worth [Google Scholar]
  • 37.Hawkins J. 2004. On intelligence. New York, NY: Holt Paperbacks [Google Scholar]
  • 38.Purves D. 2010. Brains: how they seem to work. Upper Saddle River, NJ: FT Press Science [Google Scholar]
  • 39.Alexander J. M. 2009. Evolutionary game theory. In The Stanford encyclopedia of philosophy (autumn 2009 edn) (ed. Zalta E. N.). See http://plato.stanford.edu/archives/fall2009/entries/game-evolutionary/ [Google Scholar]
  • 40.Deacon T. 1997. The symbolic species: the co–evolution of language and the human brain. London, UK: Penguin [Google Scholar]
  • 41.Boulding K. E. 1969. The image: knowledge in life and society. Ann Arbor, MI: University of Michigan Press [Google Scholar]
  • 42.Elder-Vass D. 2010. The causal power of social structures: emergence, structure and agency. Cambridge, UK: Cambridge University Press [Google Scholar]
  • 43.Frankl V. 2006. Man's search for meaning. Boston, MA: Beacon Press [Google Scholar]
  • 44.Nowak M. A. 2006. Evolutionary dynamics: exploring the equations of life. Cambridge, MA: Harvard University Press [Google Scholar]
  • 45.Black F., Scholes M. 1973. The pricing of options and corporate liabilities. J. Polit. Econ. 81, 637–654 10.1086/260062 (doi:10.1086/260062) [DOI] [Google Scholar]
  • 46.Merton R. C. 1973. Theory of rational option pricing. Bell J. Econ. Manage. Sci. 4, 141–183 10.2307/3003143 (doi:10.2307/3003143) [DOI] [Google Scholar]
  • 47.Patterson S. 2010. The quants: how a new breed of Math Whizzes conquered Wall Street and nearly destroyed it. New York, NY: Crown Business [Google Scholar]
  • 48.Gellman M. The quark and the jaguar. London, UK: Abacus; 2002. [Google Scholar]
  • 49.Harford T. 2011. Adapt: why success always starts with failure. London, UK: Little Brown [Google Scholar]
  • 50.Von Baeyer H. C. 1998. Maxwell's demon: why warmth disperses and time passes. Uttar pradesh, India: Random House [Google Scholar]
  • 51.Berger P., Luckmann T. 1967. The social construction of reality: a treatise in the sociology of knowledge. New York, NY: Anchor [Google Scholar]
  • 52.Churchman C. W. 1968. The systems approach. New York. NY: Delacorte Press [Google Scholar]
  • 53.Flood R. L., Carson E. R. 1990. Dealing with complexity: an introduction to the theory and application of systems science. London, UK: Plenum Press [Google Scholar]
  • 54.Ross W. D. 1953. Aristotle's metaphysics, vol. 2 Oxford, UK: Clarendon Press [Google Scholar]
  • 55.Falcon A. 2006. Aristotle on causality. Stanford Encyclopedia of Philosophy. See http://plato.stanford.edu/entries/aristotle-causality/
  • 56.Ellis G. F. R. 2010. Fundamentalism in science, theology, and the academy. In Human identity at the intersection of science, technology, and religion (eds Murphy N., Knight C. C.), pp. 57–76 Farnham, UK: Ashgate [Google Scholar]
  • 57.Changeux J.-P., Connes A. 1998. Conversations on mind, matter, and mathematics. Princeton, NJ: Princeton University Press [Google Scholar]
  • 58.Penrose R. 1997. The large, the small and the human mind. Cambridge, UK: Cambridge University Press [Google Scholar]
  • 59.Ellis G. F. R. 2004. True complexity and its associated ontology. In Science and ultimate reality: quantum theory, cosmology and complexity (eds Barrow J. D., Davies P. C. W., Harper C. L.), pp. 607–636 Cambridge, UK: Cambridge University Press [Google Scholar]
  • 60.Fleisch D. 2008. A student's guide to Maxwell's equations. Cambridge, UK: Cambridge University Press [Google Scholar]
  • 61.Donald M. 2001. A mind so rare: the evolution of human consciousness. New York, NY: W W Norton [Google Scholar]
  • 62.Ambady N. 2011. The Mind in the world: culture and the brain. Assoc. Psychol. Sci. 24, 5–6, 49 [Google Scholar]
  • 63.Percival I. 1991. Schrödinger's quantum cat. Nature 351, 357. 10.1038/351357a0 (doi:10.1038/351357a0)2034285 [DOI] [Google Scholar]
  • 64.Ball P. 2011. The dawn of quantum biology. Nature 474, 272–274 10.1038/474272a (doi:10.1038/474272a) [DOI] [PubMed] [Google Scholar]
  • 65.Lloyd S. 2011. A bit of quantum hanky-panky. Phys. World 24, 26–29 [Google Scholar]
  • 66.Chouard T. 2011. Breaking the Protein rules: if dogma dictates that proteins need a structure to function, then why do so many of them live in a state of disorder? Nature 471, 151. 10.1038/471151a (doi:10.1038/471151a) [DOI] [PubMed] [Google Scholar]
  • 67.Eldar A., Elowitz M. B. 2010. Functional role for noise in genetic circuits. Nature 467, 167–173 10.1038/nature09326 (doi:10.1038/nature09326) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Glimcher P. W. 2005. Indeterminacy in brain and behaviour. Annu. Rev. Psychol. 56, 25. 10.1146/annurev.psych.55.090902.141429 (doi:10.1146/annurev.psych.55.090902.141429) [DOI] [PubMed] [Google Scholar]
  • 69.Scalo J., Craig Wheeler J., Williams P. 2001. Intermittent jolts of galactic UV radiation: Mutagenetic effects. In Proc. XIIth Rencontres de Blois, Frontiers of Life, Chateau de Blois, France, 25 June to 1 July, 2000 (eds Celnikier L. M., Tran Thanh Van J.). Hanoi, Vietnam: The Gioi Publishers [Google Scholar]
  • 70.Douglas M. 2003. The statistics of string/M theory vacua. JHEP 0305, 46 See http://arxiv.org/abs/hep-th/0303194 [Google Scholar]
  • 71.Susskind L. 2005. The cosmic landscape: string theory and the illusion of intelligent design. New York, NY: Little Brown and Company [Google Scholar]
  • 72.Purves D., Wojtach W. T., Lotto R. B. 2011. Understanding vision in wholly empirical terms. Proc. Natl Acad. Sci. USA 7 10.1073/pnas.1012178108 (doi:10.1073/pnas.1012178108). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Sternberg E. M. 2000. The balance within. New York, NY: W H Freeman [Google Scholar]
  • 74.Moyers W. 1995. Healing and the mind. Doubleday: Main Street Books [Google Scholar]
  • 75.Berry B. 2001. Bonding and attachment in maltreated children consequences of emotional neglect in childhood the childtrauma academy. See http://centerforchildwelfare.fmhi.usf.edu/kb/ChronicNeglect/ConseqOfChNeglect.pdf
  • 76.Friedmann E., Katcher A. H., Lynch J. L., Thomas S. A. 1980. Animal companions and one-year survival of patients after discharge from a coronary care unit. Public Health Rep. 95, 307–312 [PMC free article] [PubMed] [Google Scholar]
  • 77.Gerhke S., Kirschenbaum M. 1967. Survival patterns in family conjoint therapy. Fam. Proc. 6, 67–80 10.1111/j.1545-5300.1967.00067.x (doi:10.1111/j.1545-5300.1967.00067.x) [DOI] [Google Scholar]
  • 78.Holmes J. 2001. All you need is cognitive therapy? Br. Med. J. 324, 288. 10.1136/bmj.324.7332.288 (doi:10.1136/bmj.324.7332.288) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Dehaene S. 2010. Reading in the brain: the new science of how we read. London, UK: Penguin [Google Scholar]
  • 80.Hall N. 1989. Writing with reason. London, UK: Hodder and Stoughton [Google Scholar]
  • 81.Holdaway D. 1979. The foundations of literacy. Sydney: Ashton Scholastic [Google Scholar]
  • 82.Krashen S. D., Terrell T. D. 1983. The natural approach: language acquisition in the classroom. San Francisco, CA: Alemany Press [Google Scholar]
  • 83.Bloch C. 1997. Chloe's story. Cape Town: Juta; See http://www.mth.uct.ac.za/~ellis/Chloe_story.pdf [Google Scholar]
  • 84.Adams M. 1990. Beginning to read: thinking and learning about print. Cambridge, MA: MIT Press [Google Scholar]
  • 85.Goodman K. S., Smith E. B., Meredith R., Goodman Y. M. 1986. Language and thinking in school: a whole-language curriculum. New York, NY: Richard C Owen Publishers [Google Scholar]
  • 86.Cambourne B. 1995. Toward an educationally relevant theory of literacy learning: twenty years of inquiry. Read. Teach. 43, 182–190 10.1598/RT.49.3.1 (doi:10.1598/RT.49.3.1) [DOI] [Google Scholar]
  • 87.Bloch C. 2005. Enabling effective literacy learning in multilingual South African early childhood classrooms. Cape Town: PRAESA University of Cape Town [Google Scholar]
  • 88.Frith C. 2007. Making up the mind: how the brain creates our mental world. Malden: Blackwell [Google Scholar]
  • 89.Huron D. 2007. Sweet anticipation: music and the psychology of expectation. Cambridge, MA: MIT Press [Google Scholar]
  • 90.Levitin D. J. 2007. This is your brain on music: the science of a human obsession. London, UK: Plume [Google Scholar]
  • 91.Ebe A. 2008. What eye movement and miscue analysis reveals about the reading process of young bilinguals. In Scientific realism in studies of reading (eds Flurkey A. D., Paulson E. J., Goodman K. S.), pp. 131–152 London, UK: Taylor and Francis [Google Scholar]
  • 92.Flurkey A. D., Paulsen E. J., Goodman K. S. 2008. Scientific realism in studies of reading. New York, NY: Laurence Erlbaum [Google Scholar]
  • 93.Strauss S. L., Goodman K. S., Paulson E. J. 2009. Brain research and reading: how emerging concepts in neuroscience support a meaning constructionist view of the reading process. Educ. Res. Rev. 4, 21–33 [Google Scholar]
  • 94.Hoey M. 2005. Lexical priming: a new theory of words and language. London, UK: Routledge [Google Scholar]
  • 95.Goodman K. S. 1967. Reading: a psycholinguistic guessing game. In Language and literacy: the selected writiings of kenneth goodman, vol. 1 (ed. Gollasch F. V.), pp. 33–44 London, UK: Routledge and Kegan Paul [Google Scholar]
  • 96.Noble D. 2008. The Music of life: biology beyond genes. Oxford, UK: Oxford University Press [Google Scholar]
  • 97.Bressler S. L., Tang W., Sylvester C. M., Shulman G. L., Corbett M. 2008. Top-down control of human visual cortex by frontal and parietal cortex in anticipatory visual spatial attention. J. Neurosci. 28, 10 056–10 061 10.1523/JNEUROSCI.1776-08.2008 (doi:10.1523/JNEUROSCI.1776-08.2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Hansen J. L., Chandra A., Wolfe B. L., Pollak S. D. 2011. Association between income and the hippocampus. PLoS ONE 6, e18712. 10.1371/journal.pone.0018712 (doi:10.1371/journal.pone.0018712) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Harmon-Jones E., Winkielmen P. 2007. Social neuroscience: integrating biological and psychological explanations of human behaviour. New York, NY: Guilford Press [Google Scholar]
  • 100.Deuschle M., Meyer-Lindenberg A. 2011. City living and urban upbringing affect neural social stress processing in humans. Nature 474, 498–501 10.1038/nature10190 (doi:10.1038/nature10190) [DOI] [PubMed] [Google Scholar]
  • 101.Dutton K. 2010. Flipnosis: the art of split-second persuasion. London, UK: Random House [Google Scholar]

Articles from Interface Focus are provided here courtesy of The Royal Society

RESOURCES